The field of the disclosure relates generally to systems and methods of medical image processing, and more particularly, to systems and methods of correcting motion in images used for radiation planning.
Ventricular tachycardia is a life threatening illness, which may be treated with radioablation. The effectiveness of radioablation is affected by target precision of radiation because radiation is nonselective and may also harm healthy tissue from an unsatisfactory precision. CT (computed tomography) images are used in radiation planning. Motion, such as respiratory and heartbeat motion, in these images renders them unsuitable in precision radiation planning.
This background section is intended to introduce the reader to various aspects of art that may be related to various aspects of the present disclosure, which are described and/or claimed below. This discussion is believed to be helpful in providing the reader with background information to facilitate a better understanding of the various aspects of the present disclosure. Accordingly, it should be understood that these statements are to be read in this light, and not as admissions of prior art.
In one aspect, a computer-implemented method of correcting motion in respiratory four-dimensional computed tomography (4DCT) images of a subject in radiation planning is provided. The method includes receiving respiratory 4DCT images, wherein the respiratory 4DCT images were acquired while a subject was free breathing. The method also includes receiving cardiac four dimensional (4D) images of the subject, wherein the cardiac 4D images were acquired while the subject was in breath hold. The method further includes deriving a cardiac motion model based on the cardiac 4D images, wherein the cardiac motion model includes frames of images of the subject, each frame corresponding to a cardiac phase in a cardiac cycle, each frame of images including motion fields at the cardiac phase. The method further includes correcting motion in the respiratory 4DCT images using the cardiac motion model, and outputting the corrected respiratory 4DCT images.
In another aspect, a radiation planning system is provided. The system includes a computing device, the computing device including at least one processor in communication with at least one memory device. The at least one processor is programmed to receive respiratory 4DCT images, wherein the respiratory 4DCT images were acquired while a subject was free breathing. The at least one processor is also programmed to receive cardiac 4D images of the subject, wherein the cardiac 4D images were acquired while the subject was in breath hold. The at least one processor is further programmed to derive a cardiac motion model based on the cardiac 4D images, wherein the cardiac motion model includes frames of images of the subject, each frame corresponding to a cardiac phase in a cardiac cycle, each frame of images including motion fields at the cardiac phase. In addition, the at least one processor is programmed to correct motion in the respiratory 4DCT images using the cardiac motion model and output the corrected respiratory 4DCT images.
The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the Office upon request and payment of the necessary fee.
These and other features, aspects, and advantages of the present disclosure will become better understood when the following detailed description is read with reference to the accompanying drawings in which like characters represent like parts throughout the drawings, wherein:
The disclosure includes systems and methods of correcting motion of respiratory images used in radiation planning of a subject. As used herein, a subject is a human, an animal, or a phantom, or part of the human, the animal, or the phantom. A phantom may be a physical phantom or a computational phantom. The method aspects will be in part apparent and in part explicitly discussed in the following description.
In the example embodiment, the method 100 further includes receiving 103 cardiac 4D images of the subject. As used herein, cardiac images are a series of images of the subject acquired while the subject was in breath hold. The cardiac images may be in 4D as a series of images of a volume in the subject with the fourth dimension being time. Cardiac images may be acquired by CT or other modalities such as magnetic resonance imaging (MRI). Cardiac images may be acquired at the same imaging session with the respiratory images. In some embodiments, cardiac images are acquired at different time from the respiratory images or using a different CT scanner or different modalities such as MRI. 4D cardiac images acquired by CT may be referred to as cardiac 4DCT images. Because cardiac images are acquired while the subject is in breath hold, the effects of respiratory motion on the images are reduced. As such, the image quality of cardiac images are less affected by respiratory motion and is better than that of respiratory images. Further, cardiac images may be acquired to have higher temporal and/or spatial resolution than respiratory images. The cardiac images and the respiratory images are images of the same anatomy, or at least having overlapping anatomies such that respiratory images may be registered to the cardiac images to correct motion in respiratory images. For example, both respiratory images and cardiac images are images of a heart and/or surrounding organs of the subject. Therefore, cardiac images are used to improve the accuracy of dosage calculation for radiation planning by registering respiratory images with cardiac images.
In the example embodiment, the method 100 further includes deriving 104 a cardiac motion model based on the cardiac 4D images. As used herein, a cardiac motion model includes frames of images, each frame of images corresponding to a cardiac phase in a cardiac cycle and having a motion field associated with each voxel in the images that indicate the motion of that voxel at that cardiac phase. A motion model may be used to estimate motion fields between frames or phases by interpreting the motion fields at the intermediate time point between adjacent phases using motion fields at the adjacent phases. A motion field or a motion vector field is a vector field with each vector in the field indicating motion at that voxel, where the magnitude of the vector indicates the magnitude of the motion and the direction of the vector indicates the direction of the motion. A frame is an image of a plane or images of a volume at a point of time in a cardiac or respiratory cycle, such as a cardiac phase in a cardiac cycle or a respiratory phase in a respiratory cycle. The images may be in 2D or 3D. 4DCT images or 4D images acquired by other modalities include a series of frames of 3D images of a volume. The fourth dimension of the 4D images may be time, a cardiac phase in a cardiac cycle, or a respiratory phase in a respiratory cycle. An example number of frames is 10. The cardiac 4D images may be rebinned or rearranged according to the cardiac phases of the images to generate frames of cardiac images, with each frame corresponding to a cardiac phase.
In the example embodiment, the method 100 also include correcting 106 the motion of the respiratory 4DCT images. The respiratory images maybe rebinned or rearranged according to the respiratory phases of the images to generate frames of respiratory images, with each frame corresponding to a respiratory phase. As used herein, frames of respiratory images are series of respiratory images of the entire slice coverage or portions of the slice coverage (referred to as stacks of respiratory images as described below) at corresponding respiratory phases. In order to correct the respiratory images using the cardiac motion model, a corresponding cardiac phase in the cardiac motion model for each frame of respiratory images is identified. If the respiratory images are acquired simultaneously with ECG signals, the corresponding cardiac phase of each frame of respiratory images is identified using the ECG signals by locating the cardiac phase of the frame of respiratory images in the cardiac cycle based on the ECG signals. If ECG signals are not acquired during the acquisition of respiratory images, the corresponding phase is determined by the image similarity of the frame of respiratory images with each phase in the cardiac motion model. Image similarity may be measured using indexes such as structural similarity index measure indexes. The phase in the cardiac motion model that is most similar to the respiratory images is selected as the cardiac phase for that frame of respiratory images. The selection of the corresponding cardiac phase is repeated for each frame of respiratory images.
In the example embodiments, the respiratory images are stacks of respiratory images, each stack covering a portion of a slice coverage (see, e.g., the stacks 701 marked in
In the example embodiment, once the corresponding cardiac phase of the respiratory images is determined, the cardiac motion model is used to correct 106 the motion of the respiratory images, by using the motion fields in the cardiac motion model at the determined cardiac phase as parameters of transformation to correct motion. Stacks of corrected respiratory images may be combined to form corrected respiratory images by stacking the stacks together, and generate frames of corrected respiratory images, where each frame covers the entire slice coverage.
In the example embodiment, artifacts in the corrected respiratory images may be reduced by generating a weight map corresponding to outliers across frames of the cardiac images at the same voxels. The artifacts may be reduced by downweighing or dividing the corrected respiratory images at each voxel with the weight in the weight map at the voxel. Image quality may be further increased by groupwise registering all frames of cardiac images, respiratory images, or stacks of respiratory images with a common reference frame. Groupwise registration may be combined with down weighting of artifacts.
If the cardiac images are acquired during the same session as the respiratory images, the imaged anatomy is the same, and rigid registration between cardiac images and respiratory images may be used to correct motion in respiratory images. As used herein, rigid registration refers to image registration using linear transformation models such as rotation, translation, and/or scaling. If the cardiac images are acquired at different time, the imaged anatomy may have changed. Similarly, if the cardiac images are acquired with a different modality, such as MRI, from CT used for acquiring respiratory images, the anatomy appeared in the respiratory images are also different from that in the cardiac images, for example, due to different modality having different imaging contrast mechanisms. As such, rigid registration between cardiac images and respiratory images to correct the motion of respiratory images may fail.
To solve this problem, the cardiac images and the respiratory images are segmented in the same way, such as AHA (American Heart Association) 17 segments used in clinic. The segments may be derived using a neural network model, using a method not based on a neural network model such as principal components analysis, using a manual method such as manually annotating by a clinician, or any combination thereof. For example, anatomical features such as left and right ventricles, atria, vales, and/or left anterior descending coronary artery may be auto-segmented using a neural network model. Anatomical keypoint features such as interventricular grooves and the long axis of the left ventricle are derived based on the anatomical features, using methods such as principal components analysis. Anatomical keypoint features are used to map a standardized segment model, such as the standardized myocardial segmentation model of the AHA 17 segment model having standardized myocardial segments, onto the left ventricle of each image in the cardiac images, respiratory images, or stacks of respiratory images. Although the anatomy of the subject may appear different in different imaging sessions and/or different modalities, the anatomical features identified by the AHA 17 segment model remain relatively unchanged. The mapped segment models are used to guide the registration process, where the segment model identifies the corresponding segments in each frame and the identified corresponding segments are registered in derive cardiac motion model and improve image quality of respiratory images. The segment-based registration described herein is advantageous in improving the image quality because the segments used in the registration are anatomical segments of a subject, which are more reliable features for registration than typical segments of an image which may change in different imaging sessions or modalities.
In the example embodiment, the method 100 further includes outputting 108 the corrected respiratory images. The corrected respiratory images may be used in determining the dosage for radiation planning.
In the example embodiment, the input layer 502 may receive different input data. For example, the input layer 502 includes a first input a1 representing training images, a second input a2 representing patterns identified in the training images, a third input a3 representing edges of the training images, and so on. The input layer 502 may include thousands or more inputs. In some embodiments, the number of elements used by the neural network model 204 changes during the training process, and some neurons are bypassed or ignored if, for example, during execution of the neural network, they are determined to be of less relevance.
In the example embodiment, each neuron in hidden layer(s) 504-1 through 504-n processes one or more inputs from the input layer 502, and/or one or more outputs from neurons in one of the previous hidden layers, to generate a decision or output. The output layer 506 includes one or more outputs each indicating a label, confidence factor, weight describing the inputs, and/or an output image. In some embodiments, however, outputs of the neural network model 204 are obtained from a hidden layer 504-1 through 504-n in addition to, or in place of, output(s) from the output layer(s) 506.
In some embodiments, each layer has a discrete, recognizable function with respect to input data. For example, if n is equal to 3, a first layer analyzes the first dimension of the inputs, a second layer the second dimension, and the final layer the third dimension of the inputs. Dimensions may correspond to aspects considered strongly determinative, then those considered of intermediate importance, and finally those of less relevance.
In other embodiments, the layers are not clearly delineated in terms of the functionality they perform. For example, two or more of hidden layers 504-1 through 504-n may share decisions relating to labeling, with no single layer making an independent decision as to labeling.
In some embodiments, some inputs lack an explicit weight, or have a weight below a threshold. The weights are applied to a function a (labeled by a reference numeral 510), which may be a summation and may produce a value z1 which is input to a function 520, labeled as ƒ1,1(z1). The function 520 is any suitable linear or non-linear function. As depicted in
It should be appreciated that the structure and function of the neural network model 204 and the neuron 550 depicted are for illustration purposes only, and that other suitable configurations exist. For example, the output of any given neuron may depend not only on values determined by past neurons, but also on future neurons.
The neural network model 204 may include a convolutional neural network (CNN), a deep learning neural network, a reinforced or reinforcement learning module or program, or a combined learning module or program that learns in two or more fields or areas of interest. Supervised and unsupervised machine learning techniques may be used. In supervised machine learning, a processing element may be provided with example inputs and their associated outputs, and may seek to discover a general rule that maps inputs to outputs, so that when subsequent novel inputs are provided the processing element may, based upon the discovered rule, accurately predict the correct output. The neural network model 204 may be trained using unsupervised machine learning programs. In unsupervised machine learning, the processing element may be required to find its own structure in unlabeled example inputs. Machine learning may involve identifying and recognizing patterns in existing data in order to facilitate making predictions for subsequent data. Models may be created based upon example inputs in order to make valid and reliable predictions for novel inputs.
Additionally or alternatively, the machine learning programs may be trained by inputting sample data sets or certain data into the programs, such as images, object statistics, and information. The machine learning programs may use deep learning algorithms that may be primarily focused on pattern recognition, and may be trained after processing multiple examples. The machine learning programs may include Bayesian Program Learning (BPL), voice recognition and synthesis, image or object recognition, optical character recognition, and/or natural language processing-either individually or in combination. The machine learning programs may also include natural language processing, semantic analysis, automatic reasoning, and/or machine learning.
Based upon these analyses, the neural network model 204 may learn how to identify characteristics and patterns that may then be applied to analyzing image data, model data, and/or other data. For example, the model 204 may learn to identify features in a series of data points.
Systems and methods described herein may be implemented on any suitable computing device 800 and software implemented therein.
Moreover, in the example embodiment, computing device 800 includes a display interface 817 that presents information, such as input events and/or validation results, to the user. The display interface 817 may also include a display adapter 808 that is coupled to at least one display device 810. More specifically, in the example embodiment, the display device 810 may be a visual display device, such as a cathode ray tube (CRT), a liquid crystal display (LCD), a light-emitting diode (LED) display, and/or an “electronic ink” display. Alternatively, the display interface 817 may include an audio output device (e.g., an audio adapter and/or a speaker) and/or a printer.
The computing device 800 also includes a processor 814 and a memory device 818. The processor 814 is coupled to the user interface 804, the display interface 817, and the memory device 818 via a system bus 820. In the example embodiment, the processor 814 communicates with the user, such as by prompting the user via the display interface 817 and/or by receiving user inputs via the user interface 804. The term “processor” refers generally to any programmable system including systems and microcontrollers, reduced instruction set computers (RISC), complex instruction set computers (CISC), application specific integrated circuits (ASIC), programmable logic circuits (PLC), and any other circuit or processor capable of executing the functions described herein. The above examples are example only, and thus are not intended to limit in any way the definition and/or meaning of the term “processor.”
In the example embodiment, the memory device 818 includes one or more devices that enable information, such as executable instructions and/or other data, to be stored and retrieved. Moreover, the memory device 818 includes one or more computer readable media, such as, without limitation, dynamic random access memory (DRAM), static random access memory (SRAM), a solid state disk, and/or a hard disk. In the example embodiment, the memory device 818 stores, without limitation, application source code, application object code, configuration data, additional input events, application states, assertion statements, validation results, and/or any other type of data. The computing device 800, in the example embodiment, may also include a communication interface 830 that is coupled to the processor 814 via the system bus 820. Moreover, the communication interface 830 is communicatively coupled to data acquisition devices.
In the example embodiment, the processor 814 may be programmed by encoding an operation using one or more executable instructions and providing the executable instructions in the memory device 818. In the example embodiment, the processor 814 is programmed to select a plurality of measurements that are received from data acquisition devices.
In operation, a computer executes computer-executable instructions embodied in one or more computer-executable components stored on one or more computer-readable media to implement aspects of the invention described and/or illustrated herein. The order of execution or performance of the operations in embodiments of the invention illustrated and described herein is not essential, unless otherwise specified. That is, the operations may be performed in any order, unless otherwise specified, and embodiments of the invention may include additional or fewer operations than those disclosed herein. For example, it is contemplated that executing or performing a particular operation before, contemporaneously with, or after another operation is within the scope of aspects of the invention.
Sudden cardiac arrest is the single largest cause of death in the developed world, and causes over 325,000 deaths per year in the US; more than lung cancer, breast cancer and AIDS combined. Ventricular tachycardia (VT) is a life-threatening heart rhythm and the most common cause of sudden cardiac arrest. The underlying cause of VT is development of abnormal electrical circuits within scar tissue inside myocardium, either from a previous myocardial infarction or a non-ischemic cardiomyopathy. Current treatment for VT includes medications, implantable cardiac defibrillator (ICD) placement, and catheter ablation. Medications used for treatment of VT, such as amiodarone and mexilitine, have considerable acute and late toxicities, are poorly tolerated by patients, and have poor durable arrhythmia control. Placement of an ICD allows for acute disruption of VT through anti-tachycardia pacing or shocks, but at the expense of reduced quality of life or further decline in cardiac function from continuous shocks. Often, these interventions are insufficient to effectively control VT on their own. Worldwide, catheter ablation is increasingly performed to treat VT. While the procedure is especially effective in the absence of ventricular scar, recurrence rates as high as 50% at 6 months are seen in the setting of cardiomyopathy VT. Recurrences are more common because of a larger scar size, which leads to: 1) multiple coexisting VT circuits; 2) circuits that extend deep within the myocardium, outside the reach of standard catheter ablation energy; 3) formation of new circuits after the ablation procedure. Recurrent VT after catheter ablation is associated with 5 to 10-fold higher odds of death, often due to incomplete ablation. Further, many patients will not be candidates for catheter ablation due to age, comorbidities, or technical inability to safely ablate the region of interest. Thus, many patients will ultimately develop irreversible debilitating refractory VT.
Given the limitations of currently available treatment modalities for patients with treatment refractory ventricular arrhythmia, there is a compelling need to a develop safer, more effective therapy. In principle, this would be best achieved by 1) improving the quality of the ablation (gap-free, full myocardial thickness) while 2) mitigating the high risks of invasive procedures. An attractive alternative to invasive catheter ablation is cardiac radioablation, first developed through a small number of clinical case studies and preclinical studies. Cardiac radioablation includes the combination of multi-modality imaging and electroanatomical information to guide the delivery of stereotactic ablative body radiotherapy, a highly precise form of radiotherapy used to treat small, localized targets such as brain, lung, and liver tumors and metastases.
A pilot program was initiated to explore treatment of patients with high risk, refractory VT with a combination of noninvasive cardiac mapping and noninvasive radioablation. In cardiac radioablation, patients underwent pre-treatment targeting by using available cardiac scar imaging and electroanatomical mapping. The data was converted into a target on the radioablation planning CT and a single dose of 25 Gy was then delivered using standard stereotactic ablative body radiotherapy (SABR) techniques on a linear accelerator with on-board image guidance. Five patients with high risk, treatment-refractory VT underwent treatment. Mean noninvasive VT ablation time was 14 minutes (range 11-18 minutes), performed awake. In total, there was a >99% reduction in total VT burden (6577 ICD therapies in the 3 months before treatment to 4 ICD therapies in the 12 months after treatment). Reduction in VT burden was achieved in all patients (mean 1315 per-patient ICD therapies (range 5-4312) being reduced to 1 (range 0-2)) despite stopping antiarrhythmic medication. Cardiac function (left ventricular ejection fraction) did not adversely change over time. Mild adjacent lung inflammatory changes were observed at 3 months, which resolved by one year.
A phase I/II trial of cardiac radioablation (ENCORE-VT) was conducted. Nineteen subjects with high-risk refractory VT were enrolled who previously underwent a median of 1 VT catheter ablation procedures (range 0-4) and with 59% on >1 anti-arrhythmic drug.
As in stereotactic ablative body radiotherapy, for cardiac radioablation the cognitive treatment design work, or treatment planning, occurs up front and includes virtual construction and assessment of the planned radiation delivery. Despite the promising early clinical results described above, it has become clear that current cardiac radioablation imaging and treatment planning techniques will not be suitable for widespread deployment to the community and treatment of the wider population of patients. The philosophy in early trials and case studies has been to treat conservatively, irradiating a target with a substantial safety margin to ensure, despite uncertainty in imaging and planning, that the target is not missed. However, this results in larger target volumes, which as shown in
In contrast, high spatial and temporal resolution multi-modality motion models for cardiac radioablation treatment planning will enable high precision (<3 mm overall error) planning and delivery. In the long term, this will lead to better efficacy and toxicity prediction, reduced cardiotoxicity, and better availability of cardiac radioablation to more patients. The following three key problems will be solved, enabling effective translation of cardiac radioablation into the community:
Current cardiac and respiratory-correlated 4DCT techniques are not designed to measure motion of structures, such as the heart, undergoing cardiac motion during free breathing. Cardiac 4D imaging is widely available on modern CT scanners, with electrocardiogram (ECG)-correlated scans routinely being acquired with the patient in breath hold. Cardiac 4D imaging may be ECG triggered, where the acquisition of images is triggered by ECG signals. Cardiac 4D imaging may not ECG triggered, where ECG signals are used retrospectively to rearrange the images according to the cardiac phases of the images. Similarly, respiratory-correlated 4DCT emerged in the early 2000s for radiotherapy, using a respiratory signal to sort projection or image data into bins. However, as heartbeat and respiration are generally uncorrelated, no current imaging technique may simultaneously capture high frequency 3D frames of both cardiac and free-breathing respiratory motion of the heart. Cardia-correlated 4DCT may be referred to as cardiac 4DCT. Respiratory-correlated 4DCT may be referred to as respiratory 4DCT. Respiratory 4DCT images are typically acquired while the subject is free breathing.
Currently, a respiratory 4DCT captures the total motion of the heart to form a motion envelope or ‘internal target volume’ to encompass the entirety of motion. The issue with this approach is that currently available respiratory 4DCT is not designed to accurately measure heartbeat, which results in motion artifacts.
There are several options to bring joint cardiac/respiratory 4D imaging into the clinic and reduce the errors in respiratory 4DCT. Modern cardiac CT scanners have sufficient longitudinal coverage (>160 mm) to cover the entire heart and motion range in a single table position. Multiple fast ‘frames’ may be acquired to construct a motion pattern representative of both breathing and heartbeat, although this would result in very high patient imaging dose. However, such cardiac scanners are not capable of being used as CT simulators, as the scanners do not have sufficient bore size, capture a smaller field of view (300 mm) of images which truncates the patient and thus cannot be used for dose calculations, and do not have CT simulation software required for radiation therapy. Furthermore, a larger scan range (˜200 mm) is typically desirable to capture position and motion of surrounding organs at risk such as lung and stomach. Another option is cine (volumetric) MRI, but cine MRI does not currently achieve a 3D frame rate high enough to accurately measure heartbeat during free breathing, and there are challenges in calculating dose on MRI due to lack of electron density information, low resolution, and spatial distortion. Further, nearly ubiquitous presence of metal objects (i.e., implantable cardioverter defibrillator (ICD)) limits MRI use as the sole source of treatment planning information.
Development and evaluation of a new algorithm SCARAB-4D, Simultaneous Cardiac And Breathing motion correction from 4DCT (
SCARAB-4D will generate a cardiac motion model from a high quality cardiac 4DCT scan, and then fit the model to each acquired phase and ‘stack’ of slice positions on the respiratory 4DCT to generate a simultaneous cardiac and respiratory motion estimate.
Major challenges integrating imaging information into cardiac radioablation planning include the availability of specific types of imaging and time between imaging studies. Many clinics do not have cardiac imaging available in the radiotherapy department where radiotherapy-specific imaging is acquired. It is therefore the SCARAB-4D technique is broadened to be applicable to clinics without radiotherapy-specific cardiac imaging capabilities. To do so, cardiac 4D imaging acquired on a different day or time is used to create the motion model in SCARAB-4D. One possible scenario in the community is the acquisition of a cardiac 4DCT or cardiac MRI scan in the cardiology or radiology department, followed by a planning CT (with respiratory 4DCT) on a radiation oncology CT simulator, typically on a different day. In this scenario, there may be days or weeks between acquisition of the diagnostic cardiac imaging and respiratory 4DCT for radiotherapy simulation. Furthermore, some clinics may not have cardiac 4DCT available, and have cardiac MRI instead. Thus, the algorithm developed is capable of using a multi-modality cardiac motion model.
To broaden applicability of the SCARAB-4D technique, the cardiac 4D imaging acquired on a different day may be used as the underlying heart motion model. However, anatomical deformations and volume changes (e.g., due to change in volume status) along with imaging modality differences are managed. To do so, a technique combining deep learning-based autosegmentation of anatomical heart features with a feature-based deformable image registration algorithm is used to accurately register cardiac and respiratory 4DCT images acquired on different days, and to bring in multi-modality imaging such as MRI, where images acquired by a different modality is registered with 4DCT images. The data demonstrate that deep learning is feasible on contrast and non-contrast CT for segmenting the heart chambers (see
Segmentation of the left and right ventricles may then be used to extract anatomical keypoint information and fit a 17 segment model onto the epicardial surface. Along with endocardial keypoint anatomy, this information may be used to guide an anatomical keypoint-based deformable registration algorithm. The segment model-based registration approach will fit the cardiac model onto the anatomy of the day of the respiratory 4DCT scan, which will then be used to reconstruct the heart/respiratory motion of the heart using SCARAB-4D. This approach may be used to fuse any type of multi-modality imaging data (e.g., CT, MR, PET, SPECT) with the planning CT.
Currently, radiation dose calculations during planning assume the anatomy is static. Moving anatomy such as the heart perturbs the dose distribution, resulting in a discrepancy between the planned and delivered dose. This results in uncertainty and error in the estimation of target radiation dose and also in estimation of dose to normal organs. For example, dosimetric errors up to 20% have been observed in the thorax for static compared to dynamic dose. There are several implications of this error. First, a safety margin known as an internal target volume (ITV) is used to address the effect of such motion but results in a larger target and therefore more normal tissue irradiated. Second, since the true dose delivered is not known, the estimated planned dose is typically used in radiation response models as an independent predictor of toxicity. For cancer treatments, better estimation of true dose results in better outcomes and toxicity prediction models. However, this is untested in cardiac radioablation patients.
In general, 4D dose calculation methods are available in research systems. However, there are no methods available to calculate dose to the heart and adjacent organs due to both breathing and heartbeat. 4D dose calculation methods are adapted to work with the SCARAB-4D motion models described above in order to estimate the true delivered dose to the target, heart substructures (such as chambers, valves, and coronary arteries), and other risk structures such as stomach and lungs. Because CT images have electron density information, 4D CT images are used to calculate radiation dosage, based on the relationship between the image intensity and electron density. The 4D dose calculation provides the dose at discrete phases of motion, due to heartbeat and/or respiration, which may be summed or averaged across phases to provide a more accurate representation of the delivered dose in the presence of motion. Further, because corrected respiratory images using the systems and methods described herein have increased image quality and reduced artifacts, 4D dose calculation based on the corrected respiratory images has increased precision. Delivered dose will then be used to establish appropriate safety margins (ITV and analogous planning at risk volumes for organs at risk). The delivered, cumulative dose is used to improve cardiac radioablation outcome and toxicity models.
In addition to solving problems in treatment planning for cardiac radioablation, the results of this project may have wider ranging impact. In cancer radiotherapy, the systems and methods described herein of improved respiratory 4DCT may enhance imaging for planning in the thorax (such as lung, esophagus, breast), where heart-induced artifacts affect target and organ at risk definition and dosimetry. That is, systems and methods described herein may be used to enhance planning for therapy of thoracic anatomies other than the heart. The cardiac models described herein may also be useful in improving on-treatment imaging such as cone beam CT and/or onboard MRI, where motion model-based image reconstruction may result in faster and better quality images to help guide patient setup and plan adaptation.
Cardiac radioablation is an emerging modality for treatment of VT. At least one technical effect of the systems and methods described herein include:
Development of strategies for dealing with combined respiratory and cardiac motion. To date, there are no methods for accurate imaging of the heart under free breathing for radiotherapy planning, particularly with CT. Methods described above are developed for this purpose, which may be deployed on existing CT infrastructure already present in radiotherapy departments.
Development of multi-modality cardiac image registration strategies. Using ventricular segment models, a hybrid model-, intensity-, and contour-driven deformable registration algorithm is used to align multi-modality cardiac imaging to radiotherapy simulation CT.
Estimation of cumulative dose in the cardiac radioablation setting. All clinical cardiac radioablation studies to date use static dose estimates. Here, cardiac and respiratory motion is incorporated into dose calculation and provides an estimate of cumulative dose in consideration of motion.
Establishment of cardiotoxicity outcomes models for cardiac radioablation. There are no outcomes models for cardiac radioablation. The tools developed here will be a first step in establishing such models.
SCARAB-4D CT correction algorithm. The goal is to develop an approach to correct respiratory 4DCT scans to better measure the shape and motion of the heart and adjacent anatomy during the scan acquisition, so that better estimates of target volume and motion may be used in target definition. SCARAB-4D will need a respiratory 4DCT that will be corrected, and an ECG-gated cardiac 4DCT, where each may include 8-10 frames of temporal data. Only the reconstructed scans are needed, which are frames of images at different cardiac phases. That is, SCARAB-4D does not need the raw projection data, or data or images output after a CT scan before being rebinned to different cardiac phases. This is an advantage because SCARAB-4D may be deployed without modifications to the scanner or reconstruction algorithms and is therefore applicable with existing scanners and retrospective data.
In the example embodiment, first, a cardiac-only motion model is generated from the cardiac 4DCT with the patient in breath hold to mitigate respiratory motion. Briefly, a groupwise deformable image registration algorithm is used to align each frame of the cardiac 4DCT to a selected reference frame (shown in
After the cardiac phase of each stack is selected, each stack for each respiratory phase is then registered individually to the selected cardiac phase image from the motion model, which may be referred to as “model to stack registration”. Because the cardiac 4DCT was acquired at the same time as the respiratory 4DCT, minimal change occurs in heart shape and volume between these two scans. For this reason, a rigid registration should be suitable for matching the cardiac model to the stack. Finally, the registration result for each stack is used to map the appropriate cardiac phase image onto the respiratory 4DCT scan, for each stack and each respiratory phase. The final result is a motion model-reconstructed estimate of heart shape and position during the respiratory 4DCT scan.
SCARAB-4D (
Available Datasets. Two datasets are available for this work, one from 20 patients enrolled on the ENCORE-VT trial and another from 18 patients treated off-label afterwards. A total of over 100 4DCT and dynamic MRI images are available between the two datasets. Each patient has respiratory 4DCT and cardiac 4DCT. In the first dataset, cardiac 4DCT was acquired at a separate time on a separate, cardiology CT scanner. For the second dataset, of 18 patients, we had acquired a CT simulator (Siemens Confidence RT Pro) in the Radiation Oncology department with both respiratory and cardiac gating capabilities. This second group of 18 patients have cardiac 4DCT and respiratory 4DCT acquired during the same session on the same day on this scanner, without the patient moving between scans. Approximately half of the patients in each dataset have dynamic cardiac MRI available as well, and all patients have electrocardiogramaging (ECGI) available if needed.
To address these challenges, a groupwise artifact-reduction algorithm is used. Briefly, groupwise registration inherently is robust to artifact as all frames of the cardiac 4DCT are simultaneously considered in the registration, and therefore preferentially registers tissue present in most frames while ignoring artifact which is unique to each individual frame (i.e., metal artifacts appear different in each frame whereas the tissue boundaries are present in all). Contribution in Study 1.2 is to add an outlier detection when the groupwise registration is used to map all frames to a common reference frame, which may be referred as artifact-weighted groupwise registration. Briefly, outlier detection is performed by evaluating the consistency of a region across temporal frames in the registered image (
This artifact-weighted groupwise approach will be evaluated to determine if it may be used to reduce the impact of artifact on the motion model creation and model to stack registration. For model to stack registration, instead of registering the stack to a single frame from the cardiac model, the artifact-weighted groupwise registration is used to map all frames from the cardiac motion model to the selected cardiac frame, and then perform a groupwise registration of the stack to all of these frames. This process is analogous to the artifact-weighted groupwise registration but will robustly register the stack to the aligned cardiac frames, minimizing the impact of artifact on the model to stack registration.
Evaluation methodology. To evaluate the quality and accuracy of SCARAB-4D reconstructions, both phantom and clinical images will be used. Using the previously described XCAT phantom, a series of ground truth motion patterns will be constructed with various heartbeat and breathing frequencies, motion amplitudes, and heart and patient size, shape, and position variations. A software package has been developed for simulating a 4DCT acquisition (see
In addition, clinical images will be used to estimate performance of SCARAB-4D. The same epicardial contours and landmarks as in Study 1.1 will be used for validation here. These data will be used to establish the ground truth in the clinical images. Landmarks may be used directly to assess spatial accuracy using existing techniques. The contours of each stack will be used to estimate registration accuracy based on distance to agreement and overlap between the model-reconstructed contours and physician-delineated contours.
Power analysis. From preliminary data, the spatial error between the original respiratory 4DCT and ground truth cardiac and respiratory motion is estimated as 16.1 mm+/−4.3 mm while our target goal is 3 mm. To achieve a power of 80% and a level of significance of 5% (two sided), for detecting a mean of the differences of 13.1+/−4.3 mm (paired study) 5 patient datasets are needed, each with a same day respiratory and cardiac 4DCT. These data are already available from our patient database which contains 18 patients with same day scans.
Patient respiration may vary breath to breath, and voluntary breath hold at end exhalation/end inhalation may not correspond directly with the same motion range during free breathing. To ensure that the respiratory motion acquired during respiratory 4DCT matches that during patient end inhale/end exhale breath hold during cardiac 4DCT acquisition, a visual feedback system will be used to assist the patient in performing reproducible free breathing and breath hold maneuvers.
A successful outcome will be a final SCARAB-4D algorithm capable of retrospectively reconstructing accurate respiration and cardiac motion from a standard respiratory 4DCT and cardiac 4DCT. The model to stack registration approach may be found not performing with sufficient accuracy, as the model to stack registration will be a challenging registration problem due to the small image size of each stack, the potentially large motion range, and the ubiquitous presence of metal objects. If this main approach fails, methods will be evaluated to further improve the registration accuracy, including 1) selection of a more constrained (e.g., smoother) registration algorithm, 2) try to add more data to the stack by extrapolation or combining stacks from the same cardiac phase, 3) investigate a supervised registration approach such as using phantom and/or clinical data with contours to train a registration algorithm using deep learning.
The basic concept of this section is to develop a SCARAB-4D algorithm in cases where a cardiac 4DCT is not available during radiotherapy simulation, as assumed in Section A.1. Here, instead, available cardiac 4DCT acquired at a different time, or multimodality cardiac motion data such as dynamic cardiac MRI, will be used. Anatomical variability between days, and differences in image intensity and content between different imaging modalities, will additionally complicate the already challenging model to stack registration process. For this reason, a technique will be used that combines deep learning-based autosegmentation of anatomical heart features with a segment model-based deformable image registration algorithm to accurately register cardiac images to respiratory 4DCT images to perform reconstruction of the respiratory 4DCT scan as in Section A.1. The model to stack registration process will be extended to incorporate the American Heart Association 17 segment left ventricle model to guide the deformation of the cardiac model onto each stack. The performance of the algorithm will be evaluated on existing multimodality imaging data.
The basic structure of the Section A.2 approach is presented in
The 17 segment model is mapped onto the left ventricle in each image. Because the segments in this model are based on anatomy (the regions perfused by each coronary artery along with the interventricular grooves), registration of the heart anatomy may be accomplished by mapping the segment model onto each image and then using this model to identify corresponding keypoint anatomy (e.g. interventricular grooves) and interpolate correspondence at other locations. A prototype algorithm has been developed for mapping the segment model onto a cardiac image (
First, the segment model will be applied to each frame from the dynamic cardiac CT or MRI scan to build a motion model. In each frame, the left and right ventricles will be autosegmented and the model applied as described above. Then, the segment model in each frame will be used to identify the epicardial surface correspondence between frames. A point-based registration using iterative closest points will be used to register the corresponding segments.
Next, the segment model will also be applied to the respiratory 4DCT. This will result in a segment model with discontinuities at the stack boundaries, but with a complete segment model on the surface. A hybrid registration approach will be used, which jointly registers the cardiac model (epicardial segments, endocardial surface contour, and image intensity) to each stack in the respiratory 4DCT (epicardial segments, endocardial surface, and image intensity). The endocardial surface typically has more features than the epicardial surface, which tends to be smooth. Hence, the endocardial surface contour is directly used in the registration whilst the epicardial surface is represented by the segment model.
Evaluation methods and power analysis. The reconstructed respiratory 4DCT will be evaluated using similar methods as in Study 1.2, by comparison to physician-drawn contours and to annotated landmarks. The dual end inhale/end exhale cardiac 4D-generated ground truth from Study 1.3 will be used to assess performance in this section. In addition, the results of reconstruction using the multimodality cardiac model will be compared to that from Section A.1, where the cardiac model was acquired on the same day. The cohort size is based on a paired non-inferiority comparison between the accuracy of the model-based reconstruction using the methods in this section (Study 2.3 methods) to those using the Section A.1 methods on the day-of cardiac 4DCT based model (Study 1.2 methods). The expected mean error in model reconstruction of left ventricle is 3.7 mm+/−2.9 mm, and the inferiority margin is 2 mm, resulting in a cohort size of 18 patients. These data are available already.
The major outcome is a functioning SCARAB-4D algorithm to generate accurate measures of respiratory and cardiac of heart and/or other organs using multi-modality or different-day cardiac imaging. Preliminary results with deep learning demonstrate strong performance in segmenting the ventricles in non-contrast CT. This model should be adaptable to cardiac MRI segmentation. If the basic strategy of transfer learning does not result in sufficient accuracy, style transfer (MR to CT intensity conversion) will be used as in the alternative solution in Section A.1. Other alternatives include retraining the model on MRI data, which may be time intensive. Finally, manual contours may be used directly, although the clinical process would be labor intensive. If the registration approach is not sufficiently accurate, conversion of MR to CT intensity will be evaluated. Then, an approach similar to the Section A.1 approach may be used to register a synthetically-generated cardiac 4D model to the respiratory 4DCT.
The systems and methods use the developed cardiac motion models along with the reconstructed respiratory 4DCT scans to better estimate the cumulative per-voxel dose to the heart and organs at risk during radiation delivery. The reconstructed 4DCT scans will be used to estimate motion during delivery and measure the cumulative dose. Here the ‘4D dose calculation’ method will be extended from breathing motion to incorporate heartbeat-induced motion.
The corrected respiratory 4DCT scan for each patient measures the shape and position of the heart, target, and organs at risk during scan acquisition. The transformation linking the cardiac motion model to each ‘frame’ in the SCARAB-4D reconstructed respiratory 4DCT is derived, as it is inherently measured by the model to stack registration. Dose calculated on any frame of the respiratory 4DCT scan may be linked to a reference frame, and the transformation used to map per-frame dose to the reference. Once individual doses from each frame are mapped to the common reference frame, the doses may be summed to measure the cumulative delivered dose from the treatment.
For the dose calculation, a state of the art dose calculation algorithm will be used to ensure highly accurate and high throughput dose calculation. There may be several hundred frames of temporal data, each frame having a 3D image of the heart and surrounding anatomy, on which a treatment plan dose calculation is performed. It will be challenging to do this work in a clinical treatment planning system, which is designed for dose calculation on a single 3D image. Instead, a commercial-grade research dose calculation engine, SciMoCa, will be used. SciMoCa is a standalone Monte Carlo dose calculation service available in a client-server architecture. Each 3D frame may be submitted along with the radiation plan to a dose calculation server, which computes and returns the radiation 3D dose. SciMoCa has been shown to have accuracy indistinguishable from a state of the art commercial dose calculation algorithm. Once returned from the server, the individual 3D dose images at each frame are deformed using the SCARAB-4D reconstruction and summed to generate the cumulative dose.
There will be three sets of doses and dose-volume metrics (e.g., the volume of an organ receiving at least a certain dose, which are widely used as surrogates for target control and toxicity in radiotherapy) computed and compared. As described above, the cumulative dose at the landmark positions will serve as ground truth. The cumulative dose as estimated from the reconstructed respiratory 4DCT, extracted at the corresponding landmark locations in the reference frame, will be tabulated. Finally, a static dose, which is the calculated radiation dose on the reference frame only (the clinically-used approach currently), at the landmark locations, will be tabulated to serve as a comparison of the clinical standard of care. The cumulative dose error will be evaluated, which is the difference in cumulative dose at the landmark locations between the SCARAB-4D reconstructed 4DCT approach and the actual cumulative dose at the landmarks. The static dose error will also be evaluated, which is the difference in cumulative dose, static dose calculation compared to the landmark-calculated dose. These dosimetric errors will be performed voxel to voxel, and also for a variety of clinically meaningful dose-volume statistics to target, heart substructures, and organs at risk such as stomach, esophagus, and spinal cord.
Power analysis. The primary endpoint of the study is error, model-reconstructed dose vs. landmark-calculated dose. Because dose is directly related to registration error, the power calculation for Study 1.2 may be applied here, resulting in a minimum of 5 patients. However, the results in all 28 patients will be evaluated with respiratory and cardiac 4DCT (18 patients from existing data and 10 patients from the Study 1.3 prospective imaging study).
Currently, the most commonly used method for managing motion is to design a ‘motion envelope’ or internal target volume from the conventional respiratory 4DCT to compensate for heartbeat and respiratory motion during free-breathing. Using the motion models, a variety of motion management strategies may be simulated, including using this motion envelope (but with improved accuracy using the motion model rather than the uncorrected respiratory 4DCT), delivery during breath hold, and respiratory and/or cardiac gating. Here, each strategy will be simulated, the cumulative dose to the target and organs at risk will be measured, and the appropriate safety margins needed to expand each structure will be determined to ensure that target coverage and organ at risk constraints are not violated.
Systems and methods described herein are used to determine the cumulative dose to the target and organs at risk, and a set of safety margins for various respiration management strategies. These safety margins may then be used by the community as they deploy more sophisticated motion management strategies. One potential risk of the methods is the reliance on the resulting cardiac motion models. Preliminary data demonstrates (
At least one technical effect of the systems and methods described herein includes (a) correcting motion in respiratory images for radiation planning using a cardiac motion model; (b) correcting motion in respiratory images using cardiac images acquired at different time or with a different modality; (c) reducing artifacts in respiratory images using a weight map; (d) estimating dose based on corrected respiratory images.
Example embodiments of systems and methods of motion correction are described above in detail. The systems and methods are not limited to the specific embodiments described herein but, rather, components of the systems and/or operations of the methods may be utilized independently and separately from other components and/or operations described herein. Further, the described components and/or operations may also be defined in, or used in combination with, other systems, methods, and/or devices, and are not limited to practice with only the systems described herein.
Although specific features of various embodiments of the invention may be shown in some drawings and not in others, this is for convenience only. In accordance with the principles of the invention, any feature of a drawing may be referenced and/or claimed in combination with any feature of any other drawing.
This written description uses examples to disclose the invention, including the best mode, and also to enable any person skilled in the art to practice the invention, including making and using any devices or systems and performing any incorporated methods. The patentable scope of the invention is defined by the claims, and may include other examples that occur to those skilled in the art. Such other examples are intended to be within the scope of the claims if they have structural elements that do not differ from the literal language of the claims, or if they include equivalent structural elements with insubstantial differences from the literal language of the claims.
This patent application claims the benefit of U.S. Provisional Patent Application No. 63/314,798, filed on Feb. 28, 2022, titled “SYSTEMS AND METHODS OF CORRECTING MOTION IN IMAGES FOR RADIATION PLANNING,” the entire contents and disclosures of which are hereby incorporated herein by reference in their entirety.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2023/062634 | 2/15/2023 | WO |
Number | Date | Country | |
---|---|---|---|
63314798 | Feb 2022 | US |