The present invention generally relates to methods and system for automatic coronary angiography interpretation using machine learning techniques.
Coronary heart disease (CHD) is the leading cause of adult death in the United States and worldwide. Coronary angiography is a minimally-invasive catheter-based procedure that provides the gold-standard diagnostic assessment of CHD is performed more than 1 million times a year in the United States alone. The decision to provide procedural treatment for CHD, either through stent placement or bypass surgery, relies largely upon the determination of whether narrowing of the coronary artery at any location is greater or less than 70% in severity. The most common approach, and present standard-of-care, for determining coronary stenosis severity remains ad-hoc visual assessment, even though this method suffers from high inter-observer variability, operator bias and poor reproducibility. The variability is further exacerbated by the wide range of procedural experience amongst cardiologists: 39.2% of operators in the U.S. perform less than 50 procedures a year, which is considered low-volume. Visual assessment of coronary stenosis severity has therefore been shown to have high variance and inter-observer variability ranging from 15 to 45% and this diagnostic standard has not changed in over 70 years. Variability in stenosis assessment has significant clinical implications, and likely contributes to inappropriate use of coronary artery bypass surgery in 17% of patients and of stents in 10% patients. A standardized and reproducible approach to coronary angiogram interpretation and coronary stenosis assessment would address a clinically impactful unmet need underpinning CHD diagnosis and the critical decision of procedural CHD treatment.
While methodologies to assist with quantifying coronary stenosis severity exist, such as quantitative coronary angiography (QCA), they require significant operator input to function, namely selection of an optimal frame within the angiogram video, manual identification of a reference object (usually the guide catheter), and manual tracing of the vessel wall. The requirement for manual input at multiple steps is time-consuming and has relegated QCA to infrequent clinical use, reserved primarily for research applications.
Further, left ventriculography, imaging of the left ventricle with an injection of significant quantities of iodine dye, may be performed often performed at the time of the coronary angiography to determine the left ventricular ejection fraction (LVEF), which has important diagnostic and therapeutic implications. It has been linked to increase radiation and increase exposure to dye, leading to 2.3 the odds of acute kidney injury post procedure, contributing to increased morbidity, mortality, and hospitalization costs. Despite these known complications and lack of clinical guidelines recommending the procedure, it's use has decreased over time, in favor of alternate modalities such as transthoracic echocardiograms (TTEs), but is still performed in over 50% of the coronary angiogram procedures.
Various embodiments relate to a method for estimating left ventricular ejection fraction, the method including: producing one or more angiogram images of a patient and an estimate of left ventricular ejection fraction of the patient to produce training data; training a machine learning model with the training data; providing one or more angiogram images of another patient; and estimating the left ventricular ejection fraction of the one or more angiogram images of the other patient using the trained machine learning model.
Various other embodiments relate to a method for estimating arterial stenoses severity, the method including: classifying a primary anatomic structure of one or more angiogram images of a first patient; classifying a projection angle of the one or more angiogram images of the first patient; labeling stenoses within the one or more angiogram images of the first patient classified as including a left or right coronary artery; filtering out certain labels in the one or more angiogram images based on certain classified projection angles; producing one or more angiogram images of a second patient with corresponding estimated stenoses of the second patient to produce training data; training a machine learning model with the training data; and estimating the arterial stenoses severity of the first patient by running the machine learning model on the filtered and labeled one or more angiogram images of the first patient, wherein the machine learning model is only run on angiogram images previously labeled as including stenoses.
Various other embodiments relate to a method of analyzing coronary angiograms, the method including: producing one or more coronary angiogram images with a corresponding estimated feature of the one or more coronary angiogram images to produce training data; training a machine learning model with the training data; and running the machine learning model on another one or more coronary angiogram images to estimate features of the other one or more angiogram images.
The description and claims will be more fully understood with reference to the following figures and data graphs, which are presented as exemplary embodiments of the invention and should not be construed as a complete recitation of the scope of the invention.
Full automation of coronary angiography interpretation previously includes numerous and complex sequences of component tasks which currently require expertise from highly specialized physicians to accomplish. Deep neural networks have recently been applied to various areas of cardiology to automate tasks such as echocardiogram interpretation, and electrocardiogram analysis and coronary angiography vessel segmentation. The potential obstacles to achieving automated angiographic analysis may include use of multiple non-standard projections in most studies due to anatomic variation, multiple objects of interest that change location throughout the video, variable contrast opacification of the artery, coronary artery overlap and “foreshortening,” which is caused by 2D visualization of 3D structures, and integration of stenosis estimates across multiple frames of a single video and across projections of the same vessel from multiple videos to determine a final stenosis percentage.
Systems and methods in accordance with many embodiments of the invention are capable of overcoming the limitations of visual assessment of coronary stenosis. In a number of embodiments, a pipeline is utilized that includes multiple deep neural networks which sequentially accomplish a series of tasks which may perform automated assessment of coronary stenosis severity. In several embodiments the pipeline performs a sequence of tasks including (but not limited to): classification of angiographic projection angle, anatomic angiographic structure identification (including identification of the left and right coronary arteries), localization of coronary artery objects including coronary artery segments and stenosis, and determination of coronary stenosis severity. The algorithmic pipeline may provide a broad foundation to accomplish most tasks related to automated coronary angiogram interpretation including assessing coronary artery stenosis severity.
In some embodiments, artificial intelligence using deep learning may be applied to allow sophisticated recognition of subtle pattern in digital data in numerous areas of cardiology including interpretation of electrocardiograms, left ventricular ejection fraction (LVEF) prediction using transthoracic echocardiograms (TTEs) or electrocardiograms and diabetes detection using smart devices such as smartphones. Advantageously, subtle morphological derangements associated with reduced LVEF may be differentiated from a normally functional heart with normal LVEF in routine coronary vessel angiograms using deep learning which may alleviate the need to perform the left ventriculography. In embodiments of the invention, a deep neural network may be trained, validated, and then tested on a large real-world dataset, and then externally validated in a separate dataset.
The computer 100 may further include an input 114. The input 114 may be used to input unprocessed data or training data into the projection angle classifier 106, the primary anatomic structure classifier 108, the object labeler 110, and/or the severity estimator 112. The input 114 may be a wired or wireless connection. Input 114 may also be provided through removeable storage, or other types of data transfer mechanisms as may be appropriate. The computer 100 may also include an output 116 which may be used to various processed data such as a patient's estimated coronary artery stenosis severity. The output 116 may be a wired or wireless connection. Output may also be provided through removeable storage, or other types of data transfer mechanisms as may be appropriate. The processor 102 may also be configured to control a display having a graphical user interface 118 to display the estimated coronary artery stenosis. The user interface 118 or another display may allow a user to interact with the computer 100.
The method further includes labeling (206) objects within the angiogram image which may include the steps described in Algorithm 3 below. Labeling 206 may be performed by the object labeler 110 described in connection with
The method further includes estimating (208) coronary stenosis severity using the angiogram image which may include the steps described in Algorithm 4 below. Estimating 208 may be performed by the severity estimator 112 described in connection with
For each of the projection angle classifier 106, the primary anatomic structure labeler 108, the object labeler 110, and the severity estimator 112, the training data 302 and the manually processed training data 304 may be different. For example, for the severity estimator 112, the training data 302 may be the resultant data from the object labeler 110 and the manually processed training data 304 may be a coronary stenosis severity produced by a cardiologist. The training data 302 may include multiple sets of images from multiple patients. After training the machine learning model 306 with the manually processed training data 304, automatically processed data 310 can be generated by feeding unprocessed anatomical data 308 into the trained machine learning model 306. For example, for the projection angle classifier 106, an angiogram image may be fed into the trained machine learning model 306 which may produce automatically processed data 310 which may include the angiographic projection angle of the given angiogram image. The unprocessed anatomical data 308 may include data similar to the training data 302. Although a specific data flow is described above with respect to
While specific steps are described in connection with
In some embodiments, the automated angiographic interpretation may include a sequence of 4 neural network algorithms organized in a pipeline each trained to accomplish a discrete step, with angiographic images “flowing” from one algorithm to the next. The primary steps may include the following:
During a standard clinical coronary angiogram procedure, catheters may be inserted into and maneuvered through the aorta to canalize the coronary arteries. Fluoroscopic X-ray videos may visualize the coronary artery lumen during injection of iodine contrast from the catheter into a coronary artery. Multiple individual angiogram videos may be obtained by a cardiologist to optimally visualize arteries and structures in different angiographic projections. Since any single projection angle may capture a two-dimensional representation, multiple different angiogram videos may capture different projection angles to achieve optimal three-dimensional visualization of coronary arteries. Coronary stenosis may be visualized as a narrowing of the contrast-opacified coronary artery and may be reported as a percentage, where 0% represents absence of stenosis and 100% represents a completely occluded coronary artery. The most severe stenosis visualized from any projection angle for that artery segment is then typically reported by the performing cardiologist in the clinical procedural report.
In some embodiments, the algorithmic pipeline may include a sequence of neural network algorithms, each aiming to accomplish a discrete task illustrated in
In some embodiments, Algorithm 1 may take individual images as its input and identify the angiographic projection used. The projection may refer to the fluoroscopic angulation used to obtain the image, commonly described as LAO cranial, RAO caudal, etc. images which may be extracted during the pre-processing step and labeled using the primary and secondary angles extracted from each video's metadata, into 12 classes of angiographic projections (described in the Table 1 below). Angles may be extracted as two continuous variables ranging between −180 and 180 degrees for the primary angle and −50 and 50 degrees for the secondary angle. The Full Dataset may include 1,418,297 images from 11,972 patients and 195,195 videos for identifying angiographic projection divided into Training/Development/Test sets (e.g. 990,082 images in Training, 128,590 images in Development and 299,625 images in Test).
In some embodiments, the algorithm architecture may be XceptionNet, which is a convolution neural network that has achieved state-of-the-art performance at image recognition tasks. The convolution neural network may be initialized with ‘ImageNet’ weights, a previously described dataset of 1.3 million labelled images, which is often used in computer vision to initialize weights for faster algorithm convergence when the goal of the algorithm is to perform image classification, such as in this case. Images may be augmented by random zoom (e.g. range: 0.2) and shear rotate (e.g. range: 0.2). In some embodiments, all the layers of XceptionNet may be trained with the dataset. The Training dataset may be used to update the algorithm weights, the Development dataset may be used to measure the different algorithm performance and fine tune the hyperparameters using grid search, and the Test dataset may be used for the algorithm performance. In some embodiments, other architectures may be used such as VGG-16, ResNet50 and InceptionNet. In some embodiments, the learning rate may be 10e-2; 10e-3; or 10e-4. In some embodiments, the early stopping criteria may be 4, 8, or 16. In some embodiments, the optimizer may be Adam or RAdam.
In some embodiments, Algorithm 1 may be trained to identify the angiographic projection angle used in a given image, and may be based on the XceptionNet architecture. The left-right and cranio-caudal projection angles recorded in metadata for each video may be grouped into 12 distinct categories providing training data for Algorithm 1. These distinct categories are illustrated in Table 1:
Extracted Full Dataset images may be divided into training (990,082), development (128,590), and test datasets (299,625). In Algorithm 1's hold-out test dataset, overall frequency-weighted precision, sensitivity and F1 score may be 0.90 for each. Performance may be worse in the less commonly used antero-posterior and Right Anterior Oblique lateral projections. In some examples, Algorithm 1 performed poorly on the heterogeneous “other” class, which consisted of any image that may not be a member of other listed classes. Once trained, Algorithm 1 may be applied to all 1,418,297 images extracted from Full Dataset videos which may then be flowed into Algorithm 2. In some embodiments, Algorithm 1 and Algorithm 2 may be performed separately (e.g. Algorithm 2 may be performed first and then Algorithm 1 or Algorithm 1 and Algorithm 2 may be performed simultaneously). The predicted angiographic projection for a video may be the most common prediction across all of its extracted frames. Ties may be addressed by selecting the projection with the highest average probability across all frames.
In some embodiments, Algorithm 2 may identify the main ‘anatomical structure’ present in an image, among 11 possible classes. 14,366 randomly selected images may be extracted from videos in a pre-processing step, then a cardiologist may label each image in one of 11 classes. The possible classes are identified in Table 2:
The dataset may be split into Training sets, Development sets, and Testing sets (e.g. 70%-9,887 training images/10%-1,504 development images/20%-2,975 testing images). Algorithm 2 may be trained by initializing the weights using the XceptionNet architecture and/or weights from the trained Algorithm 1. Images may be augmented by random zoom (range was 0.2) and shear rotate (range was 0.2). The algorithm may be tuned using the same hyperparameters as for Algorithm 1. Other architectures may be used such as VGG-16, ResNet50 and InceptionNet. In some embodiments, the learning rate may be 10e-2; 10e-3; or 10e-4. In some embodiments, the early stopping criteria may be 4, 8, or 16. In some embodiments, the optimizer may be Adam or RAdam.
In some embodiments, to obtain video-level labelling of the cardiac structure present, the main anatomical structure may be predicted on individual frames of videos from the Full Dataset. Then, the probability may be averaged for the anatomical structure across each of the 7 frames containing the coronary, extracted from each video. In some embodiments, the frame in the first position which does not contain the artery may be excluded. In some embodiments, the anatomical structure may be extracted from the output of the softmax layer. Each video may be labelled according to the mode of the anatomical structure present in the 7 frames. Then, only videos where a right or left coronary artery was identified may be kept for subsequent analyses (e.g. Algorithm 3). For training both Algorithm 1 and Algorithm 2, grid-search may be used to tune hyperparameters, searching for the best optimizer, architecture, learning rate, batch size and the early stopping criteria in the development dataset.
In some embodiments, Algorithm 2 identifies the primary anatomic structure present in an angiographic video, enabling the coronary angiography interpretation pipeline to focus subsequent analysis on videos containing coronary arteries. Videos containing non-cardiac anatomic structures such as the aorta or the femoral artery may be captured during a coronary angiography procedure. Algorithm 2 may be based on the XceptionNet architecture and/or its weights may be initialized from Algorithm 1. Training data for Algorithm 2 may be generated by manually classifying 14,366 angiographic images randomly selected from the extracted Full Dataset images into 11 classes describing the primary anatomic structure in the image. In some embodiments, the number of classes may be adapted based on the situation. In some embodiments, the Full Dataset images may be divided into 9,887 training datasets, 1,504 development datasets, and 2,975 test datasets. For each input image, Algorithm 2 may output a score predicting the primary anatomic structure contained, and scores from all images from the same video may be averaged to predict the primary anatomic structure in the video. In some examples, Algorithm 2's weighted average precision, sensitivity, and F1 score may be 0.89 for each. F1 score performance may vary by anatomic class, but in general, classes with lesser frames may have lower performance. In some embodiments, improved performance may be obtained with more available labeled data. Exceptions to this may be ventriculography or aortography classes, which may perform well since they may be highly visually distinct from other classes. Algorithm 2 may be particularly useful in identifying the left and right coronary arteries. Sensitivity of 0.94 and 0.93 may be achieved for left and right coronary arteries, respectively. Once trained, Algorithm 2 may be deployed on all contrast-containing extracted Full Dataset images to identify videos primarily containing the left and right coronary artery to flow into Algorithm 3.
In various embodiments in accordance with the invention, Algorithm 3 may use frames from the left and right coronary artery videos as its input. The left and right coronary artery videos may be extracted from the output of Algorithm 2. Algorithm 3 may perform at least one of (i) identify anatomic coronary artery segments (e.g. proximal left anterior descending artery), (ii) identify stenosis (if present) and/or (iii) localize additional angiographically relevant objects such as interventional guidewires or sternal wires. In some embodiments, Algorithm 3 may be trained or validated by labeling 2,338 images of left and right coronary arteries that were healthy or diseased. In some embodiments, Two versions of Algorithm 3 may be trained, Algorithm 3a and 3b. Algorithm 3a may focus on left and right coronary arteries and Algorithm 3b may focus on the right coronary artery in LAO projection. In some embodiments, the labelled images may be split for this task into two separate datasets: one containing left/right coronary arteries (e.g. 2,338 images) and one containing right coronary arteries in the straight LAO projection (e.g. 450 images). Each dataset may be subsequently split into 90% training images (e.g. 2104 and 405 images respectively) and 10% test images (e.g. 234 and 45 images respectively). In some embodiments, Algorithm 3 may only localize stenoses in the main epicardial vessel and not side branches (such as diagonals or marginals). Algorithm 3a may be trained by manually labeling 2,338 images with 12,685 different classes and Algorithm 3a may be trained for 50 epochs. Algorithm 3b may be trained by manually labeling 450 images with 2,447 different classes and Algorithm 3b may be trained for 50 epochs.
The Algorithm 3a or 3b may use the RetinaNet architecture and may be trained using hyperparameters. RetinaNet may achieve state-of-the-art performance for object localization such as the pedestrian detection for self-driving cars and in medicine, may be used to localize and classify pulmonary nodules in lung CT-scans. Algorithms 3a and 3b output stenoses and coronary artery segments along with their coordinates on an image. The predicted coordinates may be compared with the annotated coordinates using the ratio of the area of overlap over the area of union (called Intersect-over-union [IoU]). An IoU≥0.5 between the predicted and annotated coordinates may be considered a true positive. Next, the mean average precision (mAP) may be measured, which represents the ratio of true positives over true and false positives at different thresholds of loU, for each class. A mAP of 50% may compare with state-of-the-art results for this type of task.
In some embodiments, Algorithm 3 localized relevant objects within angiogram images containing left and right coronary arteries, including coronary artery sub-segments and stenoses. Algorithm 3 may be based upon the RetinaNet architecture which localizes target objects by predicting surrounding bounding boxes. To train Algorithm 3, pea contrast frames from a random selection of Full Dataset videos (e.g. 1126 frames of the left coronary artery and 462 frames of the right coronary artery) may be manually labeled by placing bounding boxes around the 11 dominant coronary artery segments (per SYNTAX28), coronary stenoses, and/or other objects. The classes of labels are described in Table 3a:
The abbreviations for each of these are as follows: RCA: Right Coronary Artery; LAD: Left Anterior Descending Artery; LCX: Left Circumflex. In some embodiments, two versions of Algorithm 3 may be used: Algorithm 3a may accept both left and right coronary artery images as input, whereas Algorithm 3b may only take right coronary artery images in the LAO projection as input. Because this projection contained the most annotated images, Algorithm 3b may examine possible performance gains achievable by focusing the algorithm on a specific angiographic projection. In some embodiments, input variability may be decreased into Algorithm 3a since all Right Coronary Artery LAO images may be processed by Algorithm 3b, which resulted in performance improvements for both Algorithm 3a and 3b. Algorithms 3a and/or 3b may be trained using the original described RetinaNet hyperparameters. In some embodiments, a post-hoc heuristic may exclude Algorithm 3a and 3b predicted artery segments for certain angiographic projections which are known a-priori to be not visible or foreshortened. These angiographic projections may yield false results and thus it may be advantageous to exclude results with certain angiographic projections. The angiographic projections may be the angiographic projections classified by Algorithm 1. In some embodiments, there are certain objects that should not be seen at certain projection angles and thus labels for these objects may be filtered out.
Example of the excluded predicted artery segments are included in the below Table 3b:
In some embodiments, the performance of Algorithm 3a/3b may be assessed by measuring the area of intersection over the area of union (IoU) between predicted bounding-box coordinates and the expert-annotated bounding-box coordinates of objects in each class in the test dataset. An IoU≥0.5 signifies at least 50% overlap between the predicted and true bounding-boxes, which may be considered a true positive. Second, the mean average precision (mAP) metric may be measured which may represent the ratio of true positives over true and false positives at different thresholds of loU, for every class. A value of 50% compares with state-of-the-art results for this type of task. In the hold-out test dataset at the image-level, Algorithm 3a may exhibit a 48.1% weighted average mAP. The mAP may be 37.0% for left coronary segments, 42.8% for right coronary artery segments, and 13.7% for stenosis. Algorithm 3b may exhibit a weighted average mAP of 58.1%; average mAP of 54.5% for right coronary artery segments and average mAP of 26.0% for stenosis. Once trained, Algorithms 3a/3b may be deployed on all images from videos primarily containing Left or Right Coronary arteries, as determined by Algorithm 2. In some embodiments, the location of any identified coronary stenosis may be assigned to the coronary artery sub-segment whose bounding box exhibited maximal overlap (by intersection over union) with a coronary stenosis bounding box.
In some embodiments, the automated angiographic interpretation pipeline may conform with standard cardiologist practice and AHA/ACC guideline recommendations. The automated angiographic interpretation may assess coronary stenosis severity at any artery location as seen in the “worst view” from all angiographic videos that visualize that stenosis. Therefore, Algorithm 3 may identify stenoses by aggregating predictions from all images that visualized an artery segment across multiple videos (artery-level), compared against stenoses described in a procedural report. In some embodiments, Algorithm 3a and 3b may identify 68.2% of stenoses (e.g. 6,667 of 9,782) that may be described in procedural reports, among those angiographic studies that had matching procedural reports. These 6,667 stenoses may be identified across 105,014 frames. There may be better localization of right versus left coronary artery stenoses (e.g. 70.6% vs 65.8% respectively; p<0.005).
In various embodiments in accordance with methods and system of the invention, Algorithm 4 may predict the percentage of coronary artery stenosis. In some embodiments, each video may be matched with a clinical angiographic report associated with that study, constituting the “Report Dataset”. Then, Algorithm 3a and Algorithm 3b was run across this dataset to identify coronary artery segments and localize stenoses. As described above, Algorithm 3a may be run on all images not meeting the criteria for input into Algorithm 3b. Algorithm 3b may be run on all images labelled as right coronary artery in LAO projection which may be determined by Algorithm 1 and Algorithm 2. Each frame containing a stenosis bounding box with an intersection-over-union ≥0.20 with the underlying artery segment bounding box may be recorded. The overlap intersection between a stenosis and an artery segment, as identified by Algorithm 3a or 3b, may be used to assign a stenosis to an artery segment (e.g. if a stenosis overlapped the mid-RCA as measured by the loU, then that stenosis was assigned to the mid-RCA). As discussed above, certain coronary segments may be hidden or foreshortened in certain angiographic projections and thus may be excluded from the different views. Afterwards, stenoses found by Algorithm 3a and/or 3b may be cross-matched with the stenosis percentage found in the procedural report. If a matching stenosis percentage is found in the artery segment, as extracted from the procedural report, that percentage may be assigned to the image of the stenosis identified by Algorithm 3a and/or 3b and this may be used to train Algorithm 4. Non-matched stenoses may be removed from the dataset. In some embodiments, videos where an intracoronary guidewire is present in more than 4 frames may be excluded, since these could represent a stenting procedure which may lead to a modification in the stenosis percentage due to the angioplasty and subsequent labelling errors. In some embodiments, videos of these procedures prior to the insertion of an intracoronary guidewire may be separately kept.
In some embodiments, once a stenosis is identified, the bounding box coordinates may be expanded by 12 px. The images may be cropped and resized to multiple predetermined sizes. For example, three predetermined sizes may be used: 256*256 pixels (aspect ratio no.1), 256*128 pixels (aspect ratio no.2), and 128*256 pixels (aspect ratio no.3). Predetermined sizes may maximize signal-to-noise (vessel-to-background) ratio, due to the different vessel orientations and sizes of the stenosis. The “Report Dataset” used for Algorithm 4 may consist of 105,014 images (6,667 lesions coming from 2,736 patients and 5,134 healthy vessel segments from 1,160 patients). Since healthy vessel segments can be longer than focal stenosis which could bias the training, all healthy segments may be cropped randomly to a height and width that followed the distribution of the sizes of the stenoses in that coronary segment. This may create uniform vessel sizes between the stenotic and healthy counterparts for each vessel segment. This, may allow Algorithm 4 to learn features of healthy vessels as well as diseased ones. Images in the dataset may be split into three groups: Training, Development, and Testing. In some embodiments, the makeup of each group may be 70% Training, 10% Development and 20% in Testing.
In some embodiments, Algorithm 4 may be based on a modified XceptionNet architecture where the last layer (e.g. Softmax layer, used for classification) may be removed and replaced with an ‘average pool’ layer. A dense layer with a linear activation function may be included to enable prediction of stenosis severity as a continuous percentage value. Furthermore, image metadata may include the coronary artery segment label and cropped aspect ratio which may be added as inputs to the final layer of Algorithm 4. Algorithm 4 may output a percentage stenosis value between 0 and 100 for every segmented stenosis input and learn from stenoses localized in different coronary artery segments. In some embodiments, model weights may be initialized using those from the trained Algorithm 1. Images may be augmented by random flip (both horizontal and vertical), random contrast, gamma and brightness variations, random application of CLAHE (To improve contrast in images). In some embodiments, a one-hot encoded vector input containing information about the coronary segment prior and the aspect ratio category may be added to the dense layer, so that Algorithm 4 may learn characteristics specific to each vessel segment and each aspect ratio. The algorithms may be trained to minimize the squared loss between the automatically estimated stenosis and the manually estimated stenosis using RADAM Lookahead as an optimizer with an initial learning rate of 0.001, momentum of 0.9, and batch size of 12, for 50 epochs. In some embodiments, training may be halted once the loss function stopped improving for 8 consecutive epochs in the test dataset. Image metadata including the coronary artery segment and the cropped aspect ratio may be added as additional inputs into Algorithm 4.
In some embodiments, training may be performed using different stenosis datasets. For example, the training data may be pre-processed differently, such as non-segmented stenoses or zero padded stenosis without resizing, include varying the input image size, include stenoses resized to different sizes, and/or include using up the index frame and adjacent frames as input. However, increased complexity training data may increase complexity in the computational tasks without gains in estimation accuracy. Examples of different hyperparameters are illustrated in Table 4:
As described previously, in some embodiments, Algorithm 4 may use the three aspect ratios of cropped images of coronary arteries with and without stenosis as its input, to predict the degree of stenosis in the image. In some embodiments, for Algorithm 4, each full epoch may be trained on one aspect ratio, then switch to the other aspect ratio size for the next full epoch. In some embodiments, each subsequent epoch may copy all weights from the previous epoch. In some embodiments, the aspect ratio size may be iterated until convergence. Algorithm 4 performance may be measured on the whole test dataset, including the three aspect ratios. The convergence of the multi-size input training may be similar to other algorithms that use a fixed aspect ratio size for training. In addition, this type of training may be performed in the past in other deep learning networks using multi-size inputs.
For all algorithms, the data may be split into Training data, Development data, and Test data. The split may be as follows: Training (70%), Development (10%) and Test (20%) datasets, each containing non-overlapping patients. The development dataset may be used for algorithm tuning. For Algorithm 3, the dataset splits may be Training (80%) and Test (20%); since hyperparameters may be used as described and additional Algorithm 3 tuning may not be performed.
In some embodiments, once coronary artery segments and stenoses were identified by Algorithm 3a and/or 3b, the severity of identified coronary stenoses may be estimated. Procedure reports may be used as training data which may contain the cardiologist-interpretation of angiographic studies from Jan. 1, 2013 to Dec. 31, 2019. These reports may be matched with their corresponding angiographic studies from the Full Dataset to derive the Report Dataset. Example results from Algorithm 3a and 3b may identify 4,328 Report Dataset angiograms with stenoses from 3,721 patients, totaling 46,168 videos. The procedure report text from these studies may be parsed to identify any description of coronary stenosis, the maximal stenosis percentage, and the corresponding location in one of 12 coronary artery segments. By doing this, 9,122 coronary artery segments may be identified within the reported images including stenoses and 10,088 non-stenosed artery segments (derived from 2,538 non-stenosed full coronary arteries). The reported images including stenoses may include a stenosis percentage and the corresponding artery images which may be used to train Algorithm 4.
In some embodiments, the training data may be using 1257 exams coming from 916 patients where each coronary stenosis was annotated by two experts in a core lab, using quantitative coronary angiography (QCA). As discussed previously, QCA is a highly accurate method for assessment of coronary stenoses using coronary angiograms. The QCA may include a cutoff. When using a QCA cutoff of 50% to determine a severe from a non-severe stenosis, as is commonly done in this setting, the method achieved a result of AUC-ROC of 0.73 discriminating between severe and non-severe stenoses. While this performance was lower than in the two external datasets where manual visual assessment of stenoses was used, the algorithm may be able to generalize to datasets where the stenosis was obtained using QCA as opposed to the clinical standard of visual assessment, without further re-training of the algorithm.
In some embodiments, Algorithm 4 may be trained to predict the maximum stenosis severity contained in input images cropped around artery segments from the Report Dataset, and may be based on a modified XceptionNet architecture. Bounding boxes from Algorithm 3 may be used to crop images around stenosed artery segments and non-stenosed arteries, and used to train Algorithm 4. Algorithm 4′s output score from 0-1 may be converted to an automatically estimated stenosis percentage from 0-100%. The threshold for binary prediction may be 70% stenosis and may be chosen to optimize the F1 score. Since the bounding boxes used to crop images varied in size, they may be resized to the closest of three defined aspect ratios before being input into Algorithm 4.
In some examples, in the Test dataset, Algorithm 4's AUC may be 0.862 (95% Cl: 0.843-0.880) to predict “obstructive” coronary artery stenosis, defined as ≥70% stenosis, at the artery-level. AUC may be 0.814 (95% Cl: 0.797-0.831) at the video-level and 0.757 (95% Cl: 0.749-0.765) at the image-level. In some examples, of those that had <70% estimated stenosis, Algorithm 4 may identify 78.1% correctly (using the F1 score-optimized binary threshold of 0.54; 95% Cl:76.1-80.1%; 1082/1385). Of those ≥70% stenosed by the estimated stenosis, Algorithm 4 may identify 74.5% correctly (95% Cl: 70.0-78.4%; 260/349). When Algorithm 4's sensitivity to detect obstructive coronary stenosis may be fixed at 80.0%, its specificity to detect obstructive stenosis may be 74.1%. When Algorithm 4's specificity is fixed at 80.0%, its sensitivity to detect obstructive stenosis may be 71.6%. In some examples, the mean absolute percentage difference between the automatically estimated stenosis and manually estimated stenosis may be 17.9±15.5% at the artery-level, 18.8±15.8% at the video-level, and 19.2±15.1% at the frame level. In some examples, there may be a significantly lower artery-level mean absolute percentage difference for the right coronary versus the left coronary artery (16.4±15.0 vs 19.0±15.8; p<0.001), at similar training dataset sizes-likely reflecting the right coronary artery exhibiting less anatomic variation than the left. At the artery level, there may be medium-to-strong Pearson and intra-class correlations between the automatically estimated stenosis and manually estimated stenoses values. In some embodiments, Algorithm 4 may overestimate milder stenoses and underestimate more severe stenoses. In some embodiments, there may only be minor differences in performance between anatomic coronary artery segments, though mid vessels, which may have lower mean squared error and absolute difference compared to proximal or distal vessels.
In some embodiments, patients may be determined to have obstructive stenoses (</≥70%) based upon automatically estimated stenoses that were either concordant (1,336) or discordant (398) based on manually estimated stenoses. In some embodiments, automatically estimated stenosis may be more likely to be discordant with manually estimated stenosis in older patients (e.g. 62.7±13.2 vs 65.1±12.3, <0.001), in the left coronary artery, the proximal RCA, distal RCA, the right posterolateral, and the distal LAD.
In addition to Algorithm 4, an alternative approach to estimating stenosis severity may be performed such as a sensitivity analysis. This sensitivity analysis may serve to corroborate the ability of Algorithm 4 to predict stenosis using cropped angiogram images, while also providing an alternative approach that may perform better in some settings. In some embodiments, this approach includes Algorithm 5 to segment the boundaries of the coronary artery within a cropped input image (e.g. the output of Algorithm 3) and exclude all background information by setting non-artery pixel values to 0 (called the “Segmented image”); Algorithm 6 then predicts the percentage of stenosis from Algorithm 5's segmented images (similar to Algorithm 4).
In some embodiments, Algorithm 5 may use the cropped images of coronary artery stenosis (from Algorithm 3) and may perform segmentation of the coronary artery in these images, which are then fed into Algorithm 6. This serves as a parallel, alternative approach to predicting the degree of coronary artery stenosis. The segmentation Algorithm 5 may classify each individual ‘pixels’ within a coronary artery-containing image into ‘vessel’ or ‘non-vessel’ pixels (also may be called “pixel-wise segmentation”). In some embodiments, the vessel from the background may be isolated, to minimize background noise in the estimation of stenosis. Thus, the non-vessel pixels may be omitted. To do so, all stenosis and healthy artery segments may be extracted as described above, respecting the three aspect ratios. Then, to generate the dataset used for Algorithm 5 training, a cardiologist may trace the vessel contour of 160 images of stenoses and 40 images of healthy coronary segments to generate ‘vessel masks’ used for training. Annotated Algorithm 5 data may be then divided into 90% training and 10% test datasets.
In some embodiments, to perform this segmentation task, a Generative Adversarial Network may be used. The Generative Adversarial Network may perform automatic retinal vessel segmentation using small datasets (less than 40 images for training). As discussed above, it may be advantageous to use a finite number of aspect ratios. For example, three separate algorithms may be trained (Algorithms 5a/5b/5c), one for each of the predetermined sizes of the image (Aspect Ratio 1:120 images, Aspect Ratio 2:80 images, Aspect Ratio 3:80 images), using the default parameters. Each image may be normalized to the Z-score of each channel and augmented by left-right flip and rotation. The datasets may be split into 80% training and 20% test. The discriminator and the generator may be trained for successive epochs, alternatively, for up to 50,000 iterations. In some embodiments, Learning rate may be 2e-4, the optimizer may be ‘ADAM’, the GAN2SEGMENTATION loss may be 10:1 and the discriminator may be dataset to the ‘image-level’. The performance of Algorithms 5a/5b/5c on the test dataset may be measured using the sum of the Area Under the Curve for the Receiver Operating Characteristic (ROC-AUC) and the Area Under the Curve for Precision and Recall Curve (PR-AUC). A value of 2.00 may represent perfect segmentation meaning that the mask generated by Algorithm 4 may be perfectly overlaps the human generated mask. The Dice coefficient may represent the area of overlap divided by the total pixels between the predicted vessel mask and the traced vessel mask. For the dice coefficient, the probability map may be thresholded with the Otsu threshold which may be used to separate foreground pixels from background pixels.
In some embodiments, Algorithm 6 may be a modified XceptionNet. Algorithm 6 may be trained similarly to, but separately from, Algorithm 4. Algorithm 6 took as input the same images as Algorithm 4 masked by the Algorithm 5 predicted vessel masks (discussed above). Due to the black-box nature of DNN algorithms, the Algorithm 5 and Algorithm 6 sensitivity analysis may also help determine whether background elements in the image spuriously contributed to Algorithm 4's automatic prediction.
In some examples, Algorithm 5 may demonstrate excellent segmentation performance on the test dataset. For example, the average DICE coefficient may be 0.79, AUC may be 0.88, AUC-PR may be 0.82, and an AUC-PR sum may be 1.71. Algorithm 5 may predict coronary artery boundaries from cropped input images, and may be trained by manually segmented “ground-truth” boundaries. Algorithm 5's predicted boundaries may be then used to mask coronary artery images, setting all non-vessel pixels to 0 (e.g. black). These resulting images may then be input into Algorithm 6 to predict stenosis percentage, trained using the same manually estimated stenosis as in Algorithm 4. In some examples, Algorithm 6 predicted stenosis may be strongly correlated with those from Algorithm 4 at the artery-level (e.g. r=0.70) in the test dataset. The average difference in predicted stenosis severity between Algorithm 4 and Algorithm 6 may be 12.4±10.9%, with 12.9±11.2% for right coronary arteries and 12.0±10.6% for left coronary arteries. Thus, Algorithm 4 performance may not substantially rely on image features outside of the coronary artery boundaries.
In some embodiments, besides cropping the stenoses, a dataset was developed using multiple aspect ratios (e.g. 256×256px, 128×256 px, and 256×128 px) to better account for the different variations in the vessel orientation. The multiple aspect ratios may be constant aspect ratios. The multiple aspect ratios may be multiple aspect ratios, e.g. 256×256px, 128×256 px, and 256×128 px. In some embodiments, the aspect ratio may be one of multiple aspect ratios depending on the different variation in a vessel orientation of the artery. the AI model may be trained to include multiple consecutive video frames of the cropped bounding box to give more data during training. In some embodiments, The multiple consecutive video frames may be three consecutive video frames. Training a convolutional neural network with three aspect ratios and/or three consecutive video frames may provide increased performance.
Once a stenosis is identified, bounding box coordinates may be expanded by 12 pixels in all dimensions, then cropped and resized to the nearest of three predetermined sizes, e.g. 256*256 pixels (aspect ratio no.1), 256*128 pixels (aspect ratio no.2), and 128*256 pixels (aspect ratio no.3). Due to varying vessel orientations and stenosis sizes, utilizing multiple aspect ratios may maximize signal-to-noise ratio (e.g. vessel-to-background ratio).
Examples of Statistical Analysis
In some embodiments, final algorithm performance may be reported in the Test Dataset. In some embodiments, Algorithm 1 and Algorithm 2 results may be presented on the frame level. For each of these algorithms, class performance may be calculated using precision (e.g. positive predictive value) and recall (sensitivity) and the performance may be plotted using confusion matrices. An F1 score may be derived for each class, which may be the harmonic mean between the precision and recall. The F1 score may range between 0 and 1 and may be highest in algorithms that maximize both precision and recall of that class simultaneously.
To measure the performance of Algorithm 3a and 3b, the intersection-over-union (loU) may be measured between the predicted coordinates and the actual coordinates on the test dataset. The loU may be the ratio between the area of overlap over the area of union between the predicted and annotated sets of coordinates. The performance of Algorithm 3a and 3b may be reported as the mean average precision (mAP), which represents the ratio of true positives over true and false positives at different thresholds of loU, starting from loU of 0.5, with steps of 0.05, to a maximum loU of 0.95, for each class. The mean average precision for Algorithm 3a and Algorithm 3b may be obtained by calculating the proportion of correct class prediction with an IoU≥0.5 with the ground-truth labelling across all the classes in the test dataset. For Algorithms 5a/5b/5c, the sum of the PR-AUC and the ROC-AUC may be obtained.
Algorithm 4 may be used to derive the average absolute error between the manually estimated stenosis and the automatically estimated stenosis, at the artery segment level. In some embodiments, the stenosis may be automatically estimated by estimating the stenosis in multiple orthogonal projections and reporting the final value in the projection demonstrating the narrowest level of luminal narrowing. To compute the artery-level automatically estimated stenosis, an automatically estimated stenosis may be first obtained for all frames where that stenosis may be localized (e.g. frame-level automatically estimated stenosis), then those values may be averaged across a video to obtain video-level automatically estimated stenosis. Then, the maximal video-level automatically estimated stenosis percentage may be kept to obtain an overall estimate of the artery-level stenosis percentage. Pearson correlation and Bland-Altman plots may be used to describe agreement between the manually estimated stenosis and automatically estimated stenosis at the video-level and artery-level. Intra-class correlation (ICC2,2) may be used to determine interobserver reliability. The interobserver reliability may be between the manually estimated stenosis and automatically estimated stenosis). The reliability level may be further classified as slight (0.0-0.20), fair (0.21-0.40), moderate (0.41-0.60), substantial (0.61-0.80), or excellent (0.81-1.0). Finally, the mean squared error may be presented between manually estimated stenosis and automatically estimated stenosis at the video-level.
As a sensitivity analysis, the automatically estimated stenosis and manually estimated stenosis may be divided into two groups (≥70% and <70%). The Algorithm 4 percentage outputs may be recalibrated, by obtaining the automatically estimated stenosis threshold maximizing the F1 score in the Test Dataset, ensuring optimal sensitivity and specificity for manually estimated stenosis of ≥70% and <70%. Then, the ROC-AUC may be calculated in the Test dataset and may be used to describe the performance of Algorithm 4 using the sensitivity, specificity and diagnostic odds-ratio, at the frame level, video level, and artery level, based on the cutoff. Confidence intervals for these performance metrics may be derived by bootstrapping 80% of the test data over 1000 iterations to obtain 5th and 95th percentile values. The performance of the algorithm stratified by the left and right coronary arteries may be presented by coronary segment and by age group. Automatically estimated stenosis and manually estimated stenosis may be categorized in concordant and discordant lesion groups based on the visual ≥70% cut off. For discordant lesions, the prevalence may be presented, stratified by coronary vessel segment. For lesion/vessel level data, a mixed effects logistic regression model may be used to account for within-subject correlation and for repeated angiograms.
The ICC may illustrate the influence of the background elements in the automatically estimated stenosis, Pearson, and the mean stenosis difference between the Algorithm 4 (e.g. taking vessels with the background as input) and Algorithm 6 (taking the segmented vessels, without their background)
For Algorithm 5, a separate version of the algorithm may be trained that instead of being used for regression, may be used for classification into ≥70% and <70% manually estimated stenosis to derive Saliency maps. To illustrate this, 5 images may be randomly selected of severe stenoses ≥70% from the test dataset and plotted their saliency maps.
In various embodiments of systems and methods in accordance with the invention, an automatic assessment of left ventricular ejection fraction (LVEF) using a general coronary angiogram may be performed. LVEF may not typically be estimated through a general coronary angiogram. A coronary angiogram where large amounts of dye are injected directly into the heart may be used to estimate LVEF. However, injecting dye directly into the heart such may be harmful to the patient and should be avoided. Further, LVEF may be estimated using a TTE however this would require another procedure to estimate LVEF. Also, a TTE may not be able to estimate LVEF values with a high degree of accuracy. Continuous LVEF values may be values of LVEF ranging between 5% and 70% whereas dichotomous LVEF values may be values of LVEF less than or equal to 40% or greater than 40%. The dichotomous LVEF values may only measure whether the LVEF percentage is beyond a certain LVEF threshold. A 40% LVEF threshold may be used to determine the presence of clinically significant cardiomyopathy. Other LVEF thresholds may be used. In some embodiments, the automatic assessment of LVEF using the general coronary angiogram may be able to estimate both continuous LVEF values and dichotomous LVEF with a high degree of accuracy.
The automatic assessment may include use of a Full Dataset. The Full Dataset may include retrospective, de-identified coronary angiographic studies from all patients 18 years or greater from the University of California, San Francisco (UCSF), between Dec. 12, 2012 and Dec. 31, 2019 that also had a TTE performed either 3 months before or up to 1 month after the Coronary angiogram. Coronary angiograms may be acquired with Philips (Koninklijke Philips N.V., Amsterdam, Netherlands) and Siemens (Siemens Healthineers, Forchheim, Germany) systems. TTEs may be acquired by skilled sonographers using ultrasound machines and the processed images may be stored in a Philips Xcelera picture archiving system. In some embodiments, an estimated LVEF percentage and corresponding general angiogram images may be used to train a machine learning algorithm. In some embodiments, the estimated LVEF percentage may be obtained from TTE or left ventricular angiography.
Advantageously, the one or more angiogram images of the patient and the other patient may be normal angiogram images without dye injected directly into the patient's aorta or ventricle. In some embodiments, the method 700 may further include classifying the projection angle of the angiogram image. Only angiogram images of certain projection angles may be used to produce the training data and may be used to estimate the LVEF. In some embodiments, the method 700 may further include classifying the primary anatomic structure of the one or more angiogram images of the patient and the other patient. Only angiogram images classified as a left coronary artery may be used to produce the training data and may be used to estimate the LVEF. In some embodiments, only angiogram image of a certain projection angle and classified as left coronary artery may be used to produce the training data and may be used to estimate the LVEF. Filtering the angiogram images utilized may provide a more accurate estimate of LVEF.
For every coronary angiography study, the Digital Imaging and Communication in Medicine (DICOM) files (the native file format of the radiologic exam) may be identified where the left coronary artery may be present using an algorithm. Then each DICOM file may be converted to a 512×512 pixels .MP4 Video file where all identifying information may be removed. For every TTE, the LVEF may be measured from an echocardiogram report. In some embodiments, the LVEF may be measured using the Simpson formula. If multiple TTEs are performed around the coronary angiography, only the TTE closest to the date of the angiogram may be used to determine the measured LVEF.
Patient data may be randomized and their respective videos in the Full Dataset divided into Training datasets, Validation datasets, and Testing datasets. The division of the Full Dataset may be Training dataset (70%), Validation dataset (10%) and Testing dataset (20%). In some embodiments, no patient may be in more than one group.
In some embodiments, the automatic assessment may classify a coronary angiogram video of the left coronary artery to low LVEF (defined as ≤40% on the TTE). The automatic assessment may be based on a X3D architecture. The X3D architecture may be a video neural network that expands a 2D Image classification architecture, along multiple network area, in space, time, width, and depth. The automatic assessment may preserve the temporal input resolution for all features throughout the network hierarchy, preserving all temporal frequencies in all features, which may be crucial for LVEF determination. The automatic assessment may be light weight and may be implemented on a mobile device or on current hardware powering different coronary angiogram hardware suites.
In some embodiments, the automatic assessment may begin by performing Algorithm 1 and Algorithm 2 to the dataset. Algorithm 1 (discussed above) may be used to classify the angiographic projection angle of the angiogram images. Algorithm 2 (discussed above) may be used to classify the primary anatomic structure within the angiogram images. After obtaining the primary anatomic structure of each angiogram image, the angiogram images may be sorted for images including a left coronary artery. These angiogram images may be used for automatic assessment of LVEF using an LVEF Algorithm. In some embodiments, the classified angiographic projection angle from Algorithm 1 may be used to select the commonly obtained views. For example, the classified angiographic projection angle may be used to select the three commonly-obtained views.
In some embodiments, model weights may be initialized. Angiogram images may be augmented by random flip (both horizontal and vertical), random contrast, gamma and brightness variations, random application of CLAHE (e.g. to improve contrast in images). The LVEF algorithm may be trained to minimize the binary cross entropy between the predicted LVEF category (low vs normal) and the actual LVEF category. In some embodiments, ADAM may be used as an optimizer with an initial learning rate of 0.001, momentum of 0.9, and batch size of 8, for 500 epochs. Training may be halted once the loss function stopped improving for a certain amount of consecutive epochs in the test dataset. Then, a grid-search may be performed. In addition, different model architectures and temporal convolutions may be used such as R3D and R2+1D, as well as a TimesFormer model.
In some embodiments, the LVEF extracted from the TTE report may be divided into two groups (≥50% and <50%). The 50% efficiency cutoff obtained during the TTE may be used to define a significant left ventricular dysfunction and carries therapeutic and prognosis implications. The LVEF Algorithm percentage outputs may be calibrated, by obtaining the threshold of the Softmax probability of low LVEF threshold maximizing the F1 score in the Validation Dataset, ensuring optimal sensitivity and specificity. Then, the ROC-AUC may be calculated in the Test dataset and the performance of the LVEF Algorithm may be described using the sensitivity, specificity and diagnostic odds-ratio, at the video level and exam-level. Confidence intervals for these performance metrics may be derived by bootstrapping 80% of the test data over 1000 iterations to obtain 5th and 95th percentile values. The performance of the algorithm may be presented stratified by the projection and by the age group.
In some examples, a total of 3679 patients and 4042 coronary angiogram exams with paired TTE may be identified in the study cohort for the presented analysis. After excluding very short angiogram videos or those with invalid metadata, a Full Dataset may be obtained including 3445 patients, 4042 coronary angiograms, and 36,566 videos of the left coronary artery. The videos may be split as follows: 17,982 in the training dataset, 2691 in the validation dataset, and 5414 in the hold-out test dataset. Patients in the Full Dataset may have an average age of 51.2±4.2 years. In some examples, in the LVEF group (595 patients), the ejection fraction may be 28.3±7.6% whereas in the normal LVEF group (2850 patients), the ejection fraction may be 61.0±9.3% (p<0.001).
In some examples, the model may finish training after 29 epochs and the train dataset AUC-ROC may be 0.962 whereas the loss may be 0.186. In the validation dataset, the AUC-ROC may be 0.817 (95% Cl:0.795-0.839) at the video-level. The cut-off separating low-LVEF from normal-LVEF that maximized the F1-score may be 0.90. In the Test dataset, an AUC-ROC of 0.851 (95% Cl: 0.839-0.863) may be observed at the video-level which increased to an AUC-ROC of 0.891 (95% Cl: 0.860-0.923) when averaging out predictions around left coronary artery videos performed during the same exam. In some embodiments, the sensitivity may be 0.83 whereas the specificity may be 0.77 at the exam-level. When looking at the coronary projections that performed best, the left anterior oblique (LAO) cranial, the right anterior oblique (RAO) cranial, the anteroposterior (AP) cranial and the LAO caudal views achieved the highest AUC at the video-level for discriminating between low-LVEF and normal-LVEF.
Although many embodiments of the invention have been described in detail, it should be appreciated that the invention may be implemented in many other forms without departing from the spirit or scope of the invention. For example, embodiments such as enumerated below are contemplated:
Item 1: A method for estimating left ventricular ejection fraction, the method comprising:
Item 2: The method of Item 1, wherein the estimate of the left ventricular ejection fraction of the patient is produced by transthoracic echocardiogram (TTE) or left ventricular angiography.
Item 3: The method of Item 1, wherein the one or more angiogram images of the patient and the other patient are normal angiogram images without dye injected directly into the patient's aorta or ventricle.
Item 4: The method of Item 1, further comprising classifying a projection angle of the angiogram image, wherein only angiogram images of certain projection angles are used to produce the training data and are used to estimate the left ventricular ejection fraction.
Item 5: The method of claim 1, further comprising classifying a primary anatomic structure of the one or more angiogram images of the patient and the other patient, wherein only angiogram images classified as a left coronary artery are used to produce the training data and are used to estimate the left ventricular ejection fraction.
Item 6: A method for estimating arterial stenoses severity, the method comprising:
Item 7: The method of Item 6, wherein classifying the primary anatomic structure, classifying the projection angle, and labeling one or more relevant objects is performed using a machine learning technique.
Item 8: The method of Item 6, further comprising segmenting the coronary artery by classifying each individual pixel of the one or more angiogram images of the first patient as vessel containing pixels and non-vessel containing pixels and omitting non-vessel containing pixels before estimating the arterial stenoses severity of the first patient.
Item 9: The method of Item 6, further comprising cropping one or more angiogram images labeled to include stenoses to focus on the stenoses prior to estimating the arterial stenoses severity.
Item 10: The method of Item 9, wherein cropping one or more angiogram images comprises expanding a bounding box including the stenoses and resizing an aspect ratio of the bounding box to one of multiple aspect ratios depending on the different variation in a vessel orientation of the artery.
Item 11: The method of Item 10, wherein the multiple aspect ratios comprises three constant aspect ratios.
Item 12: The method of Item 6, wherein estimating the arterial stenoses severity of the first patient is performed on multiple angiogram images previously labeled as including stenoses.
Item 13: The method of Item 12, wherein the multiple angiogram images are consecutive frames of an angiogram.
Item 14: The method of Item 6, wherein primary anatomic structure of one or more angiogram includes a left coronary artery, a right coronary artery, bypass graft, catheter, pigtail catheter, left ventricle, aorta, radial artery, femoral artery, and/or pacemaker.
Item 15: The method of Item 6, further comprising labeling anatomic coronary artery segments and/or additional angiographically relevant objects within the one or more angiogram images.
Item 16: The method of Item 15, wherein the anatomic coronary artery segments includes a proximal right coronary artery (RCA), middle RCA, distal RCA, posterior descending artery, left main artery, proximal left anterior descending artery (LAD), middle LAD, distal LAD, proximal left circumflex (LCX), and/or distal LCX.
Item 17: The method of Item 15, wherein the additional angiographically relevant objects includes guidewires and/or sternal wires.
Item 18: A method of analyzing coronary angiograms, the method comprising: producing one or more coronary angiogram images with a corresponding estimated feature of the one or more coronary angiogram images to produce training data;
Item 19: The method of Item 18, wherein the estimated feature comprises coronary stenoses.
Item 20: The method of Item 18, wherein the estimated feature comprises anatomic coronary artery segments and/or additional angiographically relevant objects.
Item 21: The method of Item 20, wherein the additional angiographically relevant objects includes guidewires and/or sternal wires.
While the above description contains many specific embodiments of the invention, these should not be construed as limitations on the scope of the invention, but rather as an example of one embodiment thereof. It is therefore to be understood that the present invention may be practiced in ways other than specifically described, without departing from the scope and spirit of the present invention. Thus, embodiments of the present invention should be considered in all respects as illustrative and not restrictive. Accordingly, the scope of the invention should be determined not by the embodiments illustrated, but by the appended claims and their equivalents.
This application claims priority to U.S. Provisional Application 63/208,406 filed on Jun. 8, 2021, the disclosure of which is incorporated by reference in its entirety.
This invention was made with government support under grant no. K23 HL135274 awarded by The National Institutes of Health. The government has certain rights in the invention.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US22/72817 | 6/8/2022 | WO |
Number | Date | Country | |
---|---|---|---|
63208406 | Jun 2021 | US |