Fluence map generation methods for radiotherapy

Information

  • Patent Grant
  • 10918884
  • Patent Number
    10,918,884
  • Date Filed
    Wednesday, September 5, 2018
    7 years ago
  • Date Issued
    Tuesday, February 16, 2021
    5 years ago
Abstract
Described herein are methods for fluence map generation or fluence map optimization (FMO) for radiation therapy. One variation of a method for generating a fluence map comprises smoothing out nondifferentiable penalty functions and using an accelerated proximal gradient method (e.g., FISTA) to compute a fluence map that may be used by a radiotherapy system to apply a selected dose of radiation to one or more regions of interest (ROI) or volumes of interest (VOI).
Description
BACKGROUND

Fluence map optimization (FMO) problems in radiation treatment planning are most often solved by interior point methods or by gradient-based methods such as the projected gradient method or quasi-Newton methods. In the first approach, the optimization problem is typically reformulated as either a linear program or a quadratic program, which is then solved using an interior point method. While interior point methods perform very well for small and medium-size problems, they have the disadvantage that they require solving a large linear system of equations at each iteration. For large scale problems, including large fluence map optimization problems, this can become prohibitively computationally intensive. Gradient-based methods do not suffer from this limitation; however, they are unable to handle nondifferentiable objective functions and complex constraints. This puts significant restrictions on how the fluence map optimization problem can be formulated, and limits the quality of the resulting treatment plans.


BRIEF SUMMARY

Described herein are methods for generating fluence maps and/or fluence map optimization (FMO) for radiation treatment planning. One variation of a method for FMO comprises a proximal gradient method (e.g., an accelerated proximal gradient method such as FISTA) with a smoothed-out nondifferentiable penalty function to compute a fluence map that may be used by a radiotherapy system to apply a specified dose of radiation to one or more regions of interest (ROI) or volumes of interest (VOI). The fluence map may comprise a set of radiation beamlet data (e.g., beamlet intensity data) derived from a prescribed radiation dose plan (e.g., a treatment plan). The fluence map may be used to position a radiation source at one or more selected angles with respect to the ROI and to adjust the beam intensity of the radiation source such that a desired radiation dose is applied to the ROI, while reducing radiation exposure of organs-at-risk (OAR). The methods described herein may compute a fluence map such that the radiation exposure of OARs is below a preselected threshold, while still delivering a selected dose of radiation to a ROI. Some variations may use one or more L1-type penalty or cost functions, while other variations may use one or more L2-type penalty or cost functions.


Calculating or generating a fluence map for radiation therapy may comprise selecting a volume of interest, selecting a plurality of voxels within the volume of interest, and selecting a set of candidate beamlets b={bi}. A beamlet may be a portion of a full radiation beam that is defined by a multi-leaf collimator leaf opening (e.g., as depicted in FIG. 1B). Each of the plurality of voxels may have an acceptable dose range (e.g., a maximum radiation dose level and a minimum radiation dose level), which may be defined by a treatment plan and/or a clinician. The set of candidate beamlets may have initial beamlet intensity weights x0={xi0}. The method may comprise calculating a dose matrix A for the volume of interest based on the set of candidate beamlets b. The dose matrix A represents per-voxel dose delivered to each of the plurality of voxels by the set of candidate beamlets b. One example of a dose calculation matrix A for n candidate beamlets {bi} and for a VOI with k pre-selected voxels is a (k×n) matrix. An i-th column of the dose calculation matrix A (which has k elements) represents a dose contribution from a unity-weighted beamlet bi to each of the k voxels. Dose matrix A is may be calculated column-by-column, for example, by ray-tracing each beamlet's aperture along the path through the patient's volume and calculating the contribution of a unity-weighted beamlet to each of the k voxels. Several well-known algorithms exist for this dose calculation process, differing in their accuracy and speed. Examples of dose calculation algorithms that may be used in any of the methods described herein may include Monte-Carlo simulation, collapsed-cone convolution superposition, pencil-beam convolution, and others.


A fluence map generation method may also comprise calculating a fluence map having a final set of beamlet intensity weights xk by adjusting the initial beamlet intensity weights according to a proximal gradient method (xk−1→xk) with a penalty function containing one or more linear penalties. The proximal gradient method may be an accelerated proximal gradient method such as fast iterative shrinkage-thresholding algorithm (FISTA). The proximal gradient method may iterate on the initial beamlet intensity weights until the adjusted beamlet intensity weights converge on a final set of beamlet intensity weights such that changes between iterations of the beamlet intensity weights are less than a predetermined residual criterion. More generally, the method may use any proximal methods. Solving an optimization problem involves finding an input value that minimizes or maximizes a real-valued function. When minimization is used, the function is often called a “cost function” or “penalty function”. Convex optimization restricts the types of functions to so-called convex functions. Algorithms for convex optimizations guarantee convergence to a global minima, and may have other useful properties. A proximal algorithm or method is an algorithm for solving a convex optimization problem, and may be, in fluence map generation for example, an algorithm for minimizing a convex penalty function. Proximal algorithms use proximal operators of the components of the penalty functions. Evaluating a proximal operator of a function involves solving a small convex optimization problem. For these small sub-problems, a closed form solution usually exists, making the overall algorithm efficient. A proximal gradient algorithm is one example of a proximal algorithm, and it assumes that a cost or penalty function can be split as f(x)+g(x), where f(x) is differentiable and g(x) has a simple closed form of the proximal operator. For radiotherapy fluence map optimization and/or fluence map generation, the optimization problem involves multiple beamlets and multiple voxels, but the penalty function must be a scalar real-valued function. A penalty function (or cost function) that is typically used may include a sum of multiple components, where each component directs the iterative process to a solution to satisfy a specific problem goal. In the case of radiation therapy, the problem goal to be satisfied is the prescription target dose for a VOI (or a plurality of VOIs) in a patient. Each component in turn may also a summation across multiple beamlets of multiple voxels. A common choice for a penalty function component is a L2 penalty, also known as a quadratic penalty. A quadratic penalty or cost function is a sum of squares, for instance sumi (di2) is a penalty that would tend to minimize overall dose. Some components of a penalty or cost function may be a L1 penalty, also known as a linear penalty, which is a simple sum, for example sumi (di). A penalty or cost function may comprise one or more L1 penalties and/or one or more L2 penalties. An accelerated proximal gradient method may include addition terms (such as a momentum term) to help direct and/or speed up (i.e., increase the rate of, decrease the number of iterations) convergence to a solution set.


Generating or calculating a fluence map may comprise smoothing the penalty function such that it is differentiable. For example, the penalty function may be smoothed using Moreau-Yosida regularization. The initial set of beamlet intensity weights {x0} is an all-zero vector. The beamlets in the set of candidate beamlets b are divided between a smaller set of firing positions or angles {fi}. A firing position is a location where a radiation source may be positioned (e.g., relative to a patient region) to fire a beamlet. In a radiation therapy system where the radiation source is mounted on a circular rotatable gantry, a firing position may be a firing angle and identified by the locations around the circular gantry (from 0 to 360 degrees) around a patient treatment area. For example, the set of firing angles {fi} may comprise a plurality of angles around a patient area of a radiation treatment device. The plurality of firing angles may be evenly distributed 360 degrees around the patient area.


Any of the proximal gradient method penalty functions described herein may comprise one or more quadratic or L2 penalties. Penalty functions may penalize voxel dose excursions outside of the acceptable dose range. That is, as part of a proximal gradient method iteration, the magnitude of the dose deviation outside of the acceptable dose range is incorporated into the calculation of a set of beamlet weights. Such penalty functions may discourage beamlets that result in a dose delivery to a voxel or VOI that is not within the acceptable dose range. Some methods may have a penalty function that comprises a single-value penalty function that aggregates the voxel dose excursions outside of the acceptable dose range of all of the voxels in the volume of interest. The acceptable dose range of each of the plurality of voxels may be determined at least in part by a treatment plan.


Fluence map generation or calculation may comprise selecting a second volume of interest, selecting a second plurality of voxels within the second volume of interest where each of the voxels has an acceptable dose range, and calculating the dose matrix A to include the second volume of interest and second plurality of voxels. That is, the dose matrix A may represent per-voxel dose delivered to each of the first and second plurality of voxels by the set of candidate beamlets b. Optionally, some methods may comprise segmenting a fluence map into a set of multi-leaf collimator and radiation source positioning instructions.


Also described herein is a system for calculating or generating a fluence map for radiation therapy. A system may comprise a processor that configured to select a volume of interest, select a plurality of voxels within the volume of interest, where each of the voxels has an acceptable dose range, select a set of candidate beamlets b={bi} having initial beamlet intensity weights x0={xi0}, calculate a dose matrix A for the volume of interest based on the set of candidate beamlets b, where the dose matrix A represents per-voxel dose delivered to each of the plurality of voxels by the set of candidate beamlets b, and calculate a fluence map comprising a final set of beamlet intensity weights xk by adjusting the initial beamlet intensity weights according to an proximal gradient method update (xk−1→xk) with a penalty function containing one or more linear penalties that iterates on the initial beamlet intensity weights until the adjusted beamlet intensity weights converge on a final set of beamlet intensity weights such that changes between iterations of the beamlet intensity weights are less than a predetermined residual criterion. The processor may also be configured to store the fluence map in a processor memory. A beamlet may be a portion of a full radiation beam that is defined by a multi-leaf collimator leaf opening (e.g., as depicted in FIG. 1B). Each of the plurality of voxels may have an acceptable dose range (e.g., a maximum radiation dose level and a minimum radiation dose level), which may be defined by a treatment plan and/or a clinician. The set of candidate beamlets may have initial beamlet intensity weights x0={xi0}. The dose matrix A represents per-voxel dose delivered to each of the plurality of voxels by the set of candidate beamlets b. One example of a dose calculation matrix A for n candidate beamlets {bi} and for a VOI with k pre-selected voxels is a (k×n) matrix. An i-th column of the dose calculation matrix A (which has k elements) represents a dose contribution from a unity-weighted beamlet bi to each of the k voxels. Dose matrix A is may be calculated column-by-column, for example, by ray-tracing each beamlet's aperture along the path through the patient's volume and calculating the contribution of a unity-weighted beamlet to each of the k voxels. Several well-known algorithms exist for this dose calculation process, differing in their accuracy and speed. Examples of dose calculation algorithms that may be used in any of the methods described herein may include Monte-Carlo simulation, collapsed-cone convolution superposition, pencil-beam convolution, and others.


A system processor may be configured to iterate on a proximal gradient method that may be an accelerated proximal gradient method such as fast iterative shrinkage-thresholding algorithm (FISTA). The proximal gradient method may iterate on the initial beamlet intensity weights until the adjusted beamlet intensity weights converge on a final set of beamlet intensity weights such that changes between iterations of the beamlet intensity weights are less than a predetermined residual criterion. More generally, the method may use any proximal methods. Solving an optimization problem involves finding an input value that minimizes or maximizes a real-valued function. When minimization is used, the function is often called a “cost function” or “penalty function”. Convex optimization restricts the types of functions to so-called convex functions. Algorithms for convex optimizations guarantee convergence to a global minima, and may have other useful properties. A proximal algorithm or method is an algorithm for solving a convex optimization problem, and may be, in fluence map generation for example, an algorithm for minimizing a convex penalty function. Proximal algorithms use proximal operators of the components of the penalty functions. Evaluating a proximal operator of a function involves solving a small convex optimization problem. For these small sub-problems, a closed form solution usually exists, making the overall algorithm efficient. A proximal gradient algorithm is one example of a proximal algorithm, and it assumes that a cost or penalty function can be split as f(x)+g(x), where f(x) is differentiable and g(x) has a simple closed form of the proximal operator. For radiotherapy fluence map optimization and/or fluence map generation, the optimization problem involves multiple beamlets and multiple voxels, but the penalty function must be a scalar real-valued function. A penalty function (or cost function) that is typically used may include a sum of multiple components, where each component directs the iterative process to a solution to satisfy a specific problem goal. In the case of radiation therapy, the problem goal to be satisfied is the prescription target dose for a VOI (or a plurality of VOIs) in a patient. Each component in turn may also a summation across multiple beamlets of multiple voxels. A common choice for a penalty function component is a L2 penalty, also known as a quadratic penalty. A quadratic penalty or cost function is a sum of squares, for instance sumi (di2) is a penalty that would tend to minimize overall dose. Some components of a penalty or cost function may be a L1 penalty, also known as a linear penalty, which is a simple sum, for example sumi (di). A penalty or cost function may comprise one or more L1 penalties and/or one or more L2 penalties. An accelerated proximal gradient method may include addition terms (such as a momentum term) to help direct and/or speed up (i.e., increase the rate of, decrease the number of iterations) convergence to a solution set.


Some processors configured for generating a fluence map may be configured to smooth out the penalty function such that it is differentiable. For example, the penalty function may be smoothed using Moreau-Yosida regularization. The initial set of beamlet intensity weights {x0} is an all-zero vector. The beamlets in the set of candidate beamlets b are divided between a smaller set of firing positions or angles {fi}. A firing position is a location where a radiation source may be positioned (e.g., relative to a patient region) to fire a beamlet. In a radiation therapy system where the radiation source is mounted on a circular rotatable gantry, a firing position may be a firing angle and identified by the locations around the circular gantry (from 0 to 360 degrees) around a patient treatment area. For example, the set of firing angles {fi} may comprise a plurality of angles around a patient area of a radiation treatment device. The plurality of firing angles may be evenly distributed 360 degrees around the patient area.


A system processor configured to generate a fluence map may use any of the proximal gradient method penalty functions described herein with one or more quadratic or L2 penalties. Penalty functions may penalize voxel dose excursions outside of the acceptable dose range. That is, as part of a proximal gradient method iteration, the magnitude of the dose deviation outside of the acceptable dose range is incorporated into the calculation of a set of beamlet weights. Such penalty functions may discourage beamlets that result in a dose delivery to a voxel or VOI that is not within the acceptable dose range. Some methods may have a penalty function that comprises a single-value penalty function that aggregates the voxel dose excursions outside of the acceptable dose range of all of the voxels in the volume of interest. The acceptable dose range of each of the plurality of voxels may be determined at least in part by a treatment plan.


A system for generating a fluence map may comprise a processor configured to select a second volume of interest, select a second plurality of voxels within the second volume of interest where each of the voxels has an acceptable dose range, and calculate the dose matrix A to include the second volume of interest and second plurality of voxels. That is, the dose matrix A may represent per-voxel dose delivered to each of the first and second plurality of voxels by the set of candidate beamlets b. A radiation system may further comprise a multi-leaf collimator disposed in a beam path of the therapeutic radiation source, and the processor may be configured to segment the fluence map into a set of multi-leaf collimator instructions and to transmit the instructions to the radiation therapy system. The radiation therapy system may comprise one or more PET detectors. The therapeutic radiation source of a radiation therapy system may be movable about the patient area at a speed of at least about 40 RPM.


Fluence map generation methods comprising a proximal gradient method with a penalty function (also known as a cost function) having one or more smoothed linear penalties (e.g., regularized L1 penalties) may have advantages over fluence map generation methods that use weighted quadratic penalties. Fluence map generation methods typically use weighted quadratic penalties (e.g., L2 penalties) derived from user-specified dose constraints and weghts as components in an overall penalty function. The use of quadratic penalties to model minimum and maximum dose constraints on voxels, often results in solutions with lots of small magnitude violations of the desired dose constraint (e.g. min-dose, or max-dose or other), and may require users to increase the min-dose or decrease the max-dose parameters in the penalty function beyond what is clinically desired, so that a problem converges to a solution with no violations of the original clinically desired constraints. Alternative strategy that is employed by users is manually tuning the penalty function component weights. Quadratic penalties may also require a greater number of iterations before converging on a set of beamlet weights. In some cases, a fluence map generation method that includes a quadratic penalty or cost function may generate a set of beamlet weights that results in a greater number of violations of user-imposed constraints. In contrast, the fluence map generation methods described herein which comprise a proximal gradient method (such as an accelerated proximal gradient method, for example, FISTA) having linear (e.g., L1 penalty) components in the penalty function may converge on a set of beamlet weights faster (i.e., in fewer iterations), may result in a solution that has fewer violations of clinical dose and other constraints with fewer required interventions (such as weight tuning) by the user, may be simpler and less-computationally intensive to implement in a processor, and/or may be more amenable to parallelization on multi-core CPUs and/or GPUs. A fluence map generation method comprising a proximal gradient method with a penalty or cost function having one or more L1 penalties may promote better plan conformance to user-specified planning goals, i.e., minimum target ROI or VOI dose, maximum OAR dose as compared to methods with a penalty or cost function having one or more L2 penalties.





BRIEF DESCRIPTION OF THE DRAWINGS


FIGS. 1A and 1B are schematic depictions of the fluence map optimization problem. FIG. 1C depicts one variation of a method for generating a fluence map. FIG. 1D depicts one example of a fluence map and FIG. 1E depicts an axial slice of simulated dose delivered to a patient based on the fluence map of FIG. 1D.



FIG. 2 depicts Table 1: Examples of penalty functions.



FIG. 3 depicts Table 2: Notation and Definitions.



FIG. 4 depicts Table 3: Prox-operator calculus rules.



FIG. 5 depicts a variation of a proximal gradient method with fixed step size (Algorithm 1).



FIG. 6 depicts a variation of a proximal gradient method with line search (Algorithm 2).



FIG. 7 depicts a variation of a FISTA method with fixed step size (Algorithm 3).



FIG. 8 depicts a variation of a FISTA method with line search (Algorithm 4).



FIG. 9 depicts a variation of a Chambolle-Pock method with overrelaxation (Algorithm 5).



FIG. 10 depicts a dose distribution that results from a fluence map generation method comprising the Chambolle-Pock algorithm.



FIG. 11 is a dose-volume histogram of the dose distribution of FIG. 10.



FIG. 12 depicts the Moreau-Yosida regularizations of the ramp penalty function with parameters t=1 and t=0.5.





DETAILED DESCRIPTION

Disclosed herein are methods for generating a fluence map having a set of beamlet intensities and angles that may be used by a radiation therapy system to position a radiation source and to control the intensity of the generated radiation beam such that a selected/prescribed dose of radiation is applied to the ROI (e.g., target volume, irradiation target volume such as tumor regions) while limiting the amount of radiation applied to one or more organs-at-risk or OARs (e.g., irradiation-avoidance volumes). That is, given an image of the patient (e.g., a 3-D digital image) that includes information about the location(s) and shape(s) of the target(s)-to-be-irradiated, prescription dose(s) to the target(s), locations of OARs and dose limits on OARs (as well as other dose constraints), a FMO or fluence map generation method computes a set of beamlet intensities and angles that deliver the prescription dose to the target(s) while meeting OAR dose limits and other constraints.


In some variations, a system configured to generate a fluence map (e.g., a treatment planning system) may be in communication with a radiation therapy system. One variation of a radiation therapy system may comprise a gantry that is movable (e.g., rotatable) about a patient treatment area, a radiation source mounted on the gantry, and a controller that is in communication with the gantry and the radiation source. Optionally, a radiation therapy system may comprise a detector located opposite the radiation source that is also in communication with the controller. The controller may provide signals to a gantry motion system to position the radiation source at a particular location with respect to the radiation treatment area and may provide a sequence of radiation beamlet data (e.g., pulse intensity, width, duration, etc.) to the radiation source based on a fluence map (e.g., a fluence map generated by any of the fluence map generation methods described herein). The radiation source may comprise a multi-leaf collimator to shape the radiation beam. A system controller may be configured to convert a fluence map into a set of gantry motion and/or multi-leaf collimator instructions (using segmentation methods, for example). A schematic depiction of a patient 102 located within the treatment area of a radiotherapy system 100 is provided in FIGS. 1A and 1B.


In order to solve the dose optimization problem, the radiation beam that may be delivered to a patient may be divided into beamlets. As depicted in FIG. 1B, a beamlet 107 may be a portion of a full radiation beam 105 that is defined by a multi-leaf collimator leaf 101 opening at a particular firing position (e.g., a firing position 106a with respect to a patient area 111). Given a discrete set of all possible beam angles (e.g., firing locations) around a gantry 104 (or, more generally, firing locations about a patient area), a set of all possible beamlets (a subset of which is represented by 106a, 106b, 106c, 106d) may be selected. For example, a radiation therapy system 100 having m firing positions (e.g., firing angles around a rotatable/circular gantry) may comprise a multi-leaf collimator 103 that may be positioned at each of them firing positions. The MLC may have n leaves, and as such, there may be a total of m×n possible beamlets. In some variations, a radiation therapy system comprising a binary multi-leaf collimator having 64 leaves that is located on a circular or rotatable gantry having 100 firing positions may have a total of 6400 possible beamlets. Optionally, the total number of possible beamlets may also take into account patient platform movement through the therapy system, such that a system with p patient platform positions may have a total of m×n×p possible beamlets. For example, the radiation therapy system described above may have from about 10 to about 100 patient platform positions, which may result in a number of possible beamlets from about 64,000 (64×100×10) to about 640,000 (64×100×100). Accordingly, an individual beamlet may be uniquely identified by its firing position or angle, collimator leaf index, and optionally, patient platform position. In some variations, a set of candidate beamlets for fluence map generation may be a subset of the total number of possible beamlets. For example, a set of candidate beamlets may be derived by removing the beamlets that do not intersect a volume of interest (e.g., an irradiation target volume, etc.) from the total number of possible beamlets. In some variations, the beamlets where all beamlet weights (i.e., beamlet intensities) are 0 can be omitted. A mathematical optimization problem may be solved based on the set of candidate beamlets to calculate the beamlet weights (e.g., beamlet intensities) that apply a prescribed dose of radiation to a target region/ROI 108. Fluence map optimization (FMO) is the method by which a set of “optimal” (i.e., satisfying the imposed constraints) beamlet weights are found. In some variations, a FMO or fluence map generation method may comprise computing a set of beamlet weights that deliver the prescribed dose to the target while limiting radiation dose to OARs 110a, 110b, 110c. The shading of the beamlets 106a-106d may represent the weight of that beamlet (e.g., intensity), where a darker shade represents a higher beamlet weight (i.e., greater beamlet intensity). The fluence map generated by a FMO method may result in the application of radiation according to the profile depicted in FIG. 1A, where the target 108 may receive the prescribed dose of radiation while the radiation exposure of the OARs 110a, 110b, 110c is reduced (e.g., below a selected threshold).


In some variations, volumes of interest (including irradiation target regions and irradiation-avoidance regions) may be divided into a plurality of voxels. Based on the data provided in a treatment plan, which outlines the dose distribution and profile that is prescribed for each VOI in a patient, each voxel may have an acceptable dose range. For example, a voxel in an irradiation target region may have a minimum dose threshold for the treatment session to meet treatment goals and a maximum dose threshold above which a patient may be subject to undesired radiation risk. A voxel in an irradiation-avoidance region may have a maximum dose threshold above which undesired tissue damage may be expected to occur. In some variations, this maximum dose threshold may be lower than the maximum dose threshold for the irradiation target region(s), since tissue in the irradiation-avoidance region may be particularly sensitive or prone to radiation damage. The acceptable dose range of a voxel may be calculated based on the prescribed dose for that volume of interest as specified by a treatment plan. The dose constraints on the voxels in the volumes of interest (VOIs) may be used in the fluence map generation methods described herein to derive a set of beamlet weights that meet these voxel dose constraints. In some variations, fluence map generation methods may utilize the acceptable dose range per voxel as a constraint for evaluating whether a set of beamlet weights meets clinical goals. Alternatively or additionally, fluence map generation methods may aggregate the acceptable dose ranges of all of the voxels of a volume of interest as a single-value constraint (e.g., a single-value penalty or cost function) for evaluating whether a set of beamlet weights meets clinical goals. In one variation, a fluence map generation method may comprise an iterative method including assigning a set of beamlet weights to a set of initial values (e.g., zero or a baseline value), calculating the dose per voxel based on the current value of the set of beamlet weights, comparing the calculated dose per voxel with the acceptable dose range per voxel to determine whether the current set of beamlet weights meets clinical goals and/or whether the current set of beamlet weights meets one or more stopping criteria, and if not, updating the set of beamlet weights to a new set of values. Examples of stopping criteria may include, but are not limited to, the set of beamlet weights converging to across iterations (e.g., the difference between the set of beamlet weights of the current iteration and the set of beamlet weights of a previous iteration is less than a predetermined threshold; residual r is less than less than a threshold ε), and/or attaining an upper bound or number of iterations. Updating the set of beamlet weights from a previous iteration (xk−1) to a new set of beamlet weights for the current iteration (xk) may be based on an accelerated proximal gradient method (such as FISTA), or any proximal algorithms (such as Chambolle-Pock methods), with one or more linear penalty functions. The fluence map generation methods described herein may be used to calculate beamlet weights for delivering a dose to one or more VOIs within the acceptable dose range of each VOI. Examples of VOIs may include irradiation target regions, irradiation-avoidance regions (e.g., organs-at-risk, areas of particular radiation sensitivity), and/or any combination of such regions. Fluence mapping methods that are described in the context of generating a set of beamlet weights based on acceptable dose ranges for a single VOI may be expanded to generate a set of beamlet weights based on acceptable dose ranges for multiple VOIs.


The dose range limits of a VOI and/or a plurality of voxels in the generation of a fluence map may be represented by a penalty function. A penalty function may comprise a plurality of penalties that represent VOI or voxel criteria or conditions that a fluence map generation method seeks to fulfill. In some variations, a penalty function that may be included with a proximal gradient method (e.g., an accelerated proximal gradient method such as FISTA) for fluence map generation may be based on clinically-derived constraints or conditions and/or mathematical constraints or conditions. In some variations, a penalty function may comprise one or more linear or nonlinear (e.g., quadratic) penalties that represent constraints based on acceptable dose ranges per voxel and/or VOI (e.g., as may be extracted from a treatment plan), as well as one or more linear or nonlinear (e.g., quadratic) penalties that represent constraints based on the smoothness of a set of beamlet weights. A linear penalty may be one in which deviations from a desired set of constraints are linearly weighted when evaluating whether a solution satisfies a set of requirements, while a nonlinear penalty (e.g., a quadratic penalty) may be one in which deviations from a desired set of constraints are nonlinearly weighted by a higher-order multiplicative factor. For example, a quadratic (or L2) type, penalty may amplify, or weight heavily, large deviations from a desired set of constraints (e.g., large deviations from acceptable dose ranges per voxel or VOI) when evaluating whether a solution (e.g., a set of beamlet weights) satisfies a set of requirements (e.g., dose as specified in a treatment plan). Linear (or L1 type) penalties that may be included in a fluence map generation method may help converge on a set of beamlet weights that reduce the number of voxels and/or VOIs where the delivered dose exceeds the acceptable dose range. This may be described as promoting sparsity in the number of dose violations on a voxel-by-voxel or VOI-by-VOI basis. In some variations, the penalty function may be a linear penalty function (e.g., having only linear penalties) while in other variations, the penalty function may be a nonlinear penalty function (e.g., comprising one or more nonlinear penalties). Some variations of a method for generating a fluence map may comprise an accelerated proximal gradient method having a single-value penalty function, which may be derived by aggregating the dose constraints of each voxel in a volume of interest. For example, the constraints imposed by a VOI may be represented by a single-value penalty function, and an accelerated proximal gradient method may iterate on a set of beamlet weights based on the single-value penalty function of each of the VOIs in a patient. Other types of penalties that may be included in a penalty function are described in greater detail below and/or tabulated in Table 1 of FIG. 2. Any of the fluence map generation methods described herein may optionally include a step of generating one or more penalty functions (e.g., multiple single-value penalty functions that correspond to multiple VOIs, multiple penalty functions for multiple sets of voxels) that represents the clinical and/or mathematical constraints that may be relevant to a particular patient and/or set of VOIs outlined in the treatment plan. In some variations where the one or more penalty functions do not meet the smoothness criteria for fluence map generation using an accelerated proximal gradient method, a smoothing function (e.g., convex regularization, Moreau-Yosida regularization) may be applied to the one or more penalty functions.


In some variations, convex optimization techniques may be used to address fluence map generation problems that may arise during radiation planning. For example, proximal algorithms may be used for solving very large scale, constrained convex optimization problems with nondifferentiable objective functions. Some variations may comprise the use of proximal algorithms to address fluence map generation issues during treatment planning. In some variations, a method for fluence map generation may comprise the use of the Alternating Direction Method of Multipliers (ADMM), while in other variations, a method for fluence map generation may comprise the use of a proximal algorithm such as the Chambolle-Pock method. The Chambolle-Pock method may be able to handle nonquadratic dose-penalty terms, including nondifferentiable L1-based penalties in the objective, and also hard constraints on the amount of radiation delivered to the planning target volume (PTV) and OARs. Another class of algorithms, accelerated proximal gradient methods (including FISTA, for example) may also be used in a fluence map generation method. These accelerated methods have a convergence rate which has been shown to be in some sense optimal for first-order methods.


Disclosed herein are methods for generating a fluence map comprising a set of beamlet intensities that delivers radiation dose to every VOI within the acceptable dose range for that VOI. For example, these methods may generate a fluence map such that the prescribed radiation dose to irradiation target regions (e.g., planning target volumes PTV, tumor regions, etc.) while not exceeding a maximum dose to irradiation-avoidance regions (e.g., organs at risk OARs). The acceptable radiation dose range for a VOI (and/or for the individual voxels within the VOI) may be determined at least in part by a treatment plan. In some variations, a treatment plan may be generated based on images of the patient that have been acquired before a treatment session (e.g., during a diagnostic imaging session) and/or during a previous treatment session or fraction.


Some variations for generating a fluence map may comprise generating a set of beamlet weights or intensities using an accelerated proximal gradient method, such as FISTA or a proximal method, such as the Chambolle-Pock method. FISTA has been used for solving inverse problems in signal or image processing and in particular, for compression, denoising, image restoration, sparse approximation of signals, compressed sensing and the like. Accelerated proximal gradient methods may include a penalty function having a linear or L1 penalty term to promote signal sparsity so that signals can be compressed. Since signal sparsity, and/or data compression, and/or image reconstruction are not primary goals of fluence map optimization, accelerated proximal gradient methods such as FISTA and proximal methods such as Chambolle-Pock have not been considered for generating a set of beamlet weights or intensities for radiation treatment plans and systems. However, as described herein, accelerated proximal gradient methods such as FISTA may be able to provide a computationally efficient method to generate a fluence map. The fluence map generation methods described herein may comprise using an accelerated proximal gradient method such as FISTA with one or more linear or L1 penalty terms. In some variations, the L1 penalty terms may be smoothed by a regularization method to help reduce discontinuities in the FISTA method. FISTA with linear penalty functions having one or more L1 penalty terms may be readily implemented on multi-core processors (CPUs and/or GPUs), facilitate faster convergence to a final set of beamlet weights, and/or may promote better plan conformance to user or clinician-specified planning goals or treatment plans. For example, FISTA with L1 penalty terms may result in a set of beamlet weights that deliver dose to VOIs that conform better to treatment plan specifications as compared to accelerated proximal gradient methods have L2 penalty terms. Fluence map generation methods that have L2 penalty terms may result in a set of beamlet weights that deliver fluence such that a higher proportion of VOIs or voxels receive dose levels that are outside of their acceptable dose ranges (as compared to methods with L1 penalty terms). Fluence map generation methods with L2 penalties may require users or clinicians to over-constrain the problem or iteratively tuning the objective weights, etc., resulting in higher computational load on the processor and a fluence map that does not conform as closely to treatment plan specifications.



FIG. 1C depicts one variation of a method for generating a fluence map. The method 150 may comprise selecting 152 one or more volumes of interest (VOIs). A VOI may include one or more irradiation target (e.g., PTV, tumor region, etc.) and/or may include one or more irradiation-avoidance regions). The method 150 may comprise selecting 154 voxels in the one or more VOIs. In some variations, the selected voxels may, cumulatively, approximate the overall size, shape and location of each of the VOIs. The method 150 may comprise selecting 156 a set of candidate beamlets b={bi} having initial beamlet intensity weights x0={xi0}. The method 150 may comprise calculating 158 a dose matrix A for each volume of interest based on the set of candidate beamlets b. The dose matrix A may represent per-voxel dose delivered to each of the plurality of voxels by the set of candidate beamlets b. In some variations, the method may comprise initializing 160 beamlet intensity weights to zero (e.g., the initial set of beamlet weights x0 may be an all-zero vector). Next, the method may comprise calculating 162 a set of beamlet intensity weights xk by adjusting the initial beamlet intensity weights according to a proximal gradient method update (xk−1→xk) with a penalty function containing one or more linear penalties, such as an accelerated proximal gradient method. This may be an iterative method where a set of beamlet intensity weights is adjusted based on the penalty function until one or more stopping criteria are met (step 164), and may be any of the methods described below. For example, the accelerated proximal gradient method may be a FISTA method having one or more L1 cost or penalty functions (e.g., as explained further below and represented in FIGS. 7-8). In some variations, calculating 162 a set of beamlet intensity weights may comprise adjusting beamlet intensity weights according to a proximal gradient method or more generally, a proximal method. In some variations, the stopping criteria may include convergence of the set of beamlet intensity weights to a set of intensity values. Convergence from one iteration to the next may be determined, for example, by comparing the set of beamlet intensity weights of iteration x to the set from iteration x−1 and taking the difference (e.g., a residual) in values between the two sets. If the difference is smaller than a predetermined threshold the stopping criterion may be met and the iteration may stop. Other stopping criteria are described below. Alternatively or additionally, iteration may stop if the number of iterations attains an upper bound. After meeting the stopping criteria, the method 150 may comprise calculating a fluence map comprising a final set of beamlet intensity weights xk.


Additional details regarding these steps, including illustrative examples, are provided below.



FIG. 1D depicts one example of a fluence map (i.e., set of beamlet intensities) that has been generated using the FISTA method having a smoothed L1 penalty function. This fluence map was generated for a radiation therapy system having 100 firing positions (e.g., 100 firing angles around a circular gantry) and a multi-leaf collimator having 60 leaves. In this example, the multi-leaf collimator is a binary multi-leaf collimator. The fluence map in FIG. 1D may represent the beamlet intensity for a single patient platform position, e.g., a single tomographic slice. The intensity of a pixel in the plot may be proportional to the beamlet intensity. As the gantry moves the radiation source from firing position 1 to firing position 100, the multi-leaf collimator opens certain leaves at each of those firing locations. The intensity of the radiation beam at a particular firing position for a particular leaf is represented by the intensity of the pixel in the fluence map (where a black pixel represents a beamlet intensity of zero and a white pixel represents a beamlet having the maximum allowable intensity). FIG. 1E depicts an axial slice of a simulated patient body where radiation beamlets were delivered based on the fluence map of FIG. 1D. The fluence map of FIG. 1D resulted in a dose delivery that concentrates on a ROI 130, with little or no irradiation of other patient regions.


One of the computational challenges of fluence map optimization is working with the dose calculation matrix, a huge matrix which (despite its sparsity) has a very large number of nonzero entries. When applied to fluence map optimization, the Alternating Direction Method of Multipliers requires solving a linear system of equations involving this dose calculation matrix at each iteration. This is a significant and potentially prohibitive computational expense. While decomposition methods are available to reduce this computational burden, such methods may lead to a more complicated algorithm that is more difficult to implement, and takes more iterations to converge, as compared with a straightforward implementation of ADMM (which might be unworkable due to the size of the dose calculation matrix). The Chambolle-Pock algorithm and accelerated proximal gradient methods (e.g., FISTA), on the other hand, may be advantageous in that they perform matrix-vector multiplications with the dose calculation matrix at each iteration. These methods may not include solving a linear system at each iteration, and hence may not include decomposition methods to make such a calculation manageable. The fluence map generation methods described herein may be effective algorithms that parallelize naturally and are particularly easy to implement in a controller of a radiotherapy system.


The general fluence map optimization problem contemplated in this document is:












minimize
x







Γ


(


A
0


x

)



+




i
=
0

N




Φ
i



(


A
i


x

)



+

Ψ


(
Dx
)


+

Θ


(
x
)



,




(
1
)








where the matrices Ai are the dose-calculation matrices for one or more VOIs (e.g., planning target volume PTV, irradiation target regions or volumes, OARs, irradiation avoidance regions or volumes, the tumor, etc.), N is the number of OARs, the matrix D represents a discrete derivative or gradient operator, and the functions Γ, Φi, Ψ and Θ, are convex penalty functions. The term Γ(A0x) encourages or defines a minimum level of radiation to be delivered to the PTV, while the terms (Aix) encourage or require the radiation delivered to the PTV and OARs not to exceed a maximum dosage. The regularization terms Ψ(Dx) and Θ(x) encourage smooth or piecewise-smooth nonnegative fluence maps. Problem (1) may contain most standard fluence map optimization formulations as special cases, with simple and convenient choices of the convex penalty functions.


The optimization algorithms described herein may be able to handle nonquadratic and nondifferentiable penalty terms Γ, Φi and Ψ in fluence map optimization. By taking the penalty functions to be indicator functions (defined in equation (4)), hard constraints may be enforced on the amount of radiation delivered to the PTV and OARs. The advantages of the L1-norm may apply in fluence map optimization, as will be described further below.


Disclosed herein are fluence map generation methods that comprise proximal algorithms with one or more penalties, such as L1 penalties. In one variation, the Chambolle-Pock algorithm may be used to solve problem (1) in the case where the penalty functions Γ, Φi, Ψ and Θ have proximal operators that may be evaluated efficiently. This may include most fluence map optimization problems encountered in practice, including those with hard constraints and nondifferentiable objective functions. In some variations, a fluence map generation method may comprise a smoothing technique from convex analysis, such as the Moreau-Yosida regularization, to smooth out the penalty functions Γ, Φi and Ψ, and an accelerated proximal gradient method (such as FISTA) to solve the smoothed out problem (which may involve nonquadratic penalties). The smoothed problem may contain, as a special case, all fluence map generation problems considered in the unified approach to inversion problems in intensity modulated radiation therapy (IMRT).


Fluence map generation methods may include the selection of penalty functions Γ, Φi, Ψ and Θ. The problem defined in equation (1) may contain most standard FMO models as special cases, with simple and convenient choices of the penalty functions. Examples of penalty functions that may be used in the fluence map generation methods described herein are summarized in table 1, depicted in FIG. 2.


Often Γ is taken to be a one-sided quadratic penalty

Γ(ν)=∥(ν−l)−∥22,

where l is a vector that lists the prescribed doses that are to be delivered to each voxel in the tumor, but an arguably superior option is to take Γ to be a one-sided L1-norm-based penalty:

Γ(ν)=∥(ν−l)−∥1.


The usual benefits of L1-norm, such as robustness against outliers, apply in this context as well. By taking Γ to be a one-sided L1-based penalty, a small number of voxels in the tumor are allowed to be significantly underdosed. This extra flexibility can allow for a reduction in the amount of radiation delivered to OARs, leading to an overall superior treatment plan. An L2-based penalty does not allow this flexibility—any significant underdosing is penalized severely. Additionally, an L2-based penalty tends to allow a large number of voxels to be slightly underdosed, which is undesirable, and on practical FMO problems, consistently leads to slight underdosing of the target, and slight overdosing of the OAR. In comparison, an L1-based penalty is discourages the presence of small residuals, and encourages most residuals to be 0. A third important option is to take Γ to be an indicator function:










Γ


(
v
)


=


I







(
v
)


=

{



0




if





v











otherwise









(
2
)







With this choice of Γ, the hard constraint that (A0x)>l may be applied or enforced to the fluence map generation method.


Similar considerations apply to the penalty functions Φi, which are chosen to encourage or enforce upper bounds on the dose delivered to the tumor and to the OARs. In some variations, Φi may include a one-sided L1-based penalty Φi(yi)=αi∥yi−ui1 a one-sided L2-based penalty Φi(yi)=(αi/2)∥yi−ui22, or an indicator function penalty








Φ
i



(

y
i

)


=



I



u
i





(

y
i

)


=

{



0




if






y
i




u
i









otherwise









This indicator function penalty may be able to enforce the hard constraint that Aix≤ui. In some variations, the method may include the penalty function











Φ
i



(

y
i

)


=



α
i







(


y
i

-

u
i


)

+



1


+



β
i

2







y
i



2
2

.







(
3
)







By selecting Φi as indicated above,








Φ
i



(


A
i


x

)


=



α
i







(



A
i


x

-

u
i


)

+



1


+



β
i

2








A
i


x



2
2

.







The term βi/2∥Aix∥22. may be thought of as providing additional guidance on how to select x, in cases where Aix<ui is easily satisfied. (We take β0=0, because there is no need for an additional penalty for radiation delivered to the tumor.)


Some variations of fluence map optimization methods and/or fluence map generation methods may include a quadratic regularization term







Ψ


(
Dx
)


=



1
2





Dx


2
2


=




m
=
1

M




1
2







D
x



x
m




2
2









where M is the number of beamlets in the IMRT system, x is the vector of beamlet intensities, xm is the mth block of x (consisting of beamlet intensities for the mth beamlet), and each matrix Dm represents a discrete derivative or gradient operator. This regularization term encourages adjacent beamlets to have similar intensities, thereby leading to less chaotic fluence maps. The regularization term may facilitate (i.e., speed up) convergence of optimization algorithms so that a solution (e.g., a set of beamlet weights or intensities) may be attained in fewer iterations. However, due to the use of a quadratic penalty function, large components of Dx may be penalized severely (because they get squared), and as a result this regularization term tends not to allow any sharp jumps in intensity between adjacent beamlets. This may detract from the creation of treatment plans that are highly conformable to the tumor. A similar problem may be encountered in image restoration and reconstruction problems, where the use of a quadratic regularization term does not allow sharp edges in the image to be preserved. One example of a regularization term may be the total variation regularization term







Ψ


(
Dx
)


=




Dx


1

=




m
=
1

M








D
m



x
m




1

.







Alternatively, Ψ may be the indicator function for an L-norm ball. A hard upper bound on the change in intensity between adjacent beamlets may be enforced. These choices of Ψ may facilitate computations that are able to handle nonquadratic and nondifferentiable penalty terms.


Typically Θ is chosen to be the indicator function for the nonnegative orthant, denoted by I≥0. In this case, the term Θ(x) in the objective simply enforces the constraint that x≥0. Θ may also be chosen to enforce upper bounds on the beamlet intensities; for example, Θ may be the indicator function of the set S={x|a≤x≤b}, for given vectors a and b. Other choices of Θ are possible, such as Θ(x)=I≥o(x)+(ϵ/2)∥x∥22 or Θ(x)=I≥o(x)+ϵ∥x∥1. These choices of Θ control the size of x while also enforcing x≥0. Penalizing the L1-norm of x could be useful, for example, to limit the number of beamlets that fire during treatment delivery (in other words, to promote sparsity in the fluence map). Penalizing the L2-norm of x may help to limit the total energy delivered during treatment.


In some variations, problem (1) may be solved using the Chambolle-Pock algorithm, under the assumption that the penalty functions Γ, Φi, Ψ and Θ have proximal operators that can be evaluated efficiently. This may include one or more (e.g., all) of the penalty functions described herein. Also disclosed are numerical results for the FMO problem








minimize
x






μ






(



A
0


x

-


)

-



1


+




i
=
0

N




α
i







(



A
i


x

-

u
i


)

i



1



+




i
=
1

N





β
i

2







A
i


x



2
2



+

η




Dx


1














subject





to





x


0.





This is the special case of problem (1) where Φi is given by equation (3) and Γ(ν)=u∥(ν−l)1, Ψ(YN+1)=η∥YN+11, and Θ=I≥0. Also disclosed are numerical results for the FMO problem








minimize
x











i
=
1

N





β
i

2







A
i


x



2
2




+

η




Dx


1









subject





to






A
0


x









x

0.




This is the special case of problem (1) where Γ is taken to be the indicator function given by equation (2), Φi(yi)=(βi/2)∥yi22. for 1, . . . , N, Φ0=0, Ψ(yN+1)=η∥yN+11, and Θ=I≥0


Table 2 summarizes the notation and definitions used herein.


In convex analysis, a function f: custom characterncustom character∪{∞} is called “closed” when it is lower semicontinuous. This is a mild condition satisfied by most convex functions that one encounters in practice. A function f: custom characterncustom character∪{∞} is called “proper” when ƒ(x)<∞ for at least one point x∈custom charactern.


Indicator and Projection Functions.


Let C⊂custom charactern be a closed convex set. The indicator function of C is the convex function IC: custom characterncustom character∪{∞} defined by











I
C



(
x
)


=

{



0





if





x


C

,









otherwise
.









(
4
)







Indicator functions are useful in convex optimization (including any of the methods described herein) for enforcing hard constraints on the optimization variable x. The problem of minimizing ƒ(x) subject to the constraint that x∈C, is equivalent to the problem of minimizing ƒ(x)+IC(x). Indicator functions are highly nondifferentiable, but this poses no problem for proximal algorithms, which are able to handle indicator functions naturally.


When C=[a, b]m=[a, b]× . . . ×[a, b]={x∈custom charactern|a≤xj≤b for j=1, . . . , m}, the indicator function of C is denoted by I[a,b] rather than by I[a,b]m. The notation I≥l, where l∈custom charactern, denotes the indicator function of the set S={x∈custom charactern|x≥l}, where the inequality x≥l is interpreted to mean that xj≥lj for j=1, . . . , m.


The function that projects onto C is denoted by PC:








P
C



(
x
)


=


closest





point





to





x





in





C

=



arg





min


u

C








u
-
x



2

.







When C=[a, b]m, the function that projects onto C will be denoted by P[a,b] rather than by P[a,b]n. The notation P≥l to denotes the function that projects onto the set S={x∈custom charactern|x≥l}. Note that P≥l(x)=max(x, l) (where the maximum is taken componentwise).


Conjugate.


The convex conjugate of a function ƒ: custom characterncustom character∪{∞} is the function ƒ*: custom characterncustom character∪{∞} defined by








f
*



(
z
)


=



sup

x


S
n







z
,
x




-


f


(
x
)


.






The convex conjugate ƒ* is a “dual version” of f which tends to appear when formulating dual problems in convex optimization. When ƒ is closed and convex, ƒ can be recovered from ƒ* via the formula ƒ=ƒ**.


The following “separable sum” rule for conjugates will be useful later. Suppose ƒ is a block-separable sum:










f


(


x
1

,





,

x
κ


)


=




k
=
1

K





f
k



(

x
k

)


.






(
5
)







Then ƒ* (z1, . . . , zK)=Σk=1Kƒ*k(zk). The input vectors x1, . . . , xK can be viewed as blocks of a larger vector x, and the vectors z1, . . . , zK can be viewed as blocks of a larger vector z.


Proximal Operator.


Let ƒ: custom characterncustom character∪{∞} be a proper closed convex function. The proximal operator (also known as “prox-operator”) of ƒ, with parameter t>0, is defined by











prox
tf



(
x
)


=




arg





min

u



f


(
u
)



+


1

2

t








u
-
x



2
2

.







(
6
)







Proximal algorithms are iterative algorithms that require the evaluation of various prox-operators at each iteration. For many important convex penalty functions, the prox-operator has a simple closed-form expression and can be evaluated very efficiently, at a computational cost that is linear in m. The term “proximable” may describe a function whose prox-operator can be evaluated efficiently.


Several useful rules for evaluating prox-operators are listed in table 3, depicted in FIG. 4. In this table, the functions ƒ and ƒk are assumed to be proper closed convex. Formula 3.3 is known as the Moreau decomposition and is very useful for proximal algorithms because it expresses the prox-operator of ƒ* in terms of the prox-operator of ƒ. If the prox-operator off can be evaluated efficiently, then the prox-operator of ƒ* can be evaluated equally efficiently. (The term “decomposition” suggests that x has been decomposed as a sum of prox-operator terms.) In formula 3.1, the function g is a separable sum of the functions ƒk. The input vectors x1, . . . , xK can be viewed as blocks of a larger vector x. According to this rule, evaluating the prox-operator of the separable sum g reduces to independently evaluating the prox-operators of the functions ƒk. Rule 3.4 can be derived by using the Moreau decomposition to express the prox-operator of ƒ(x)=∥x+1 in terms of the prox-operator of its conjugate ƒ*(z)=I[0,1](z). In the last two rows of the table, pj denotes the jth component of the vector proxtg(x), and xj denotes the jth component of x.


One of the most fundamental proximal algorithms, the proximal gradient method solves optimization problems of the form

minimize ƒ(x)+g(x)  (7)

where ƒ and g are closed convex functions and ƒ is differentiable with a Lipschitz continuous gradient. The proximal gradient iteration for a fixed step size t>0 is recorded in algorithm 1 (FIG. 5). If t≤2/L, where L is the Lipschitz constant for ∇ƒ, then the proximal gradient iteration is guaranteed to converge to a minimizer of (7), assuming that a minimizer exists.


In the special case where g(x)=IC(x), where C is a closed convex set and IC is the indicator function of C, problem (7) is equivalent to

minimize ƒ(x)
subject to x∈C.  (8)


The proximal gradient iteration reduces to

xk=PC(xk−1−t∇ƒ(xk−1)),

where PC is the function that projects onto C. In this case, the proximal gradient method is known as the projected gradient method.


While a proximal gradient method may be used with a fixed step size, a significant advantage of the proximal gradient method is that there is a simple and effective line search procedure that may be used to select the step size adaptively at each iteration. One variation of a proximal gradient method with line search is represented in algorithm 2 (FIG. 6).


One variation of a fluence map generation method may comprise accelerated versions of the projected gradient method and the proximal gradient method for radiation treatment planning. For example, FISTA (short for “fast iterative shrinkage-thresholding algorithm”). FISTA is an accelerated version of the proximal gradient method for solving problem (7), where (as before) ƒ and g are closed convex functions, and ƒ is differentiable with a Lipschitz continuous gradient (with Lipschitz constant L>0). One variation of a FISTA iteration for a fixed step size t>0 is represented in algorithm 3 (FIG. 7). When t=1/L, this iteration converges at a rate of 1/k2, whereas the proximal gradient iteration only converges at a rate of 1/k. FISTA's convergence rate of 1/k2 is may be desirable or optimal for a first-order method.


The Chambolle-Pock algorithm solves optimization problems of the canonical form

minimize ƒ(x)+g(Ax)  (9)

where ƒ: custom characterncustom character and g: custom characterncustom character are proper closed convex functions and A∈custom characterm×n is a matrix. This canonical problem form may be useful because many important problems in areas such as signal and image processing can be expressed in the form (9) with particularly simple choices of ƒ, g, and A. By “simple”, we mean specifically that the prox-operators of ƒ and g can be evaluated inexpensively, and that multiplications by A and by AT can be performed efficiently. The fluence map optimization problems considered herein may be conveniently expressed in this form. (The problem form (9) may be addressed by the Fenchel-Rockafellar approach to duality, which takes this problem form as a starting point.)


The Chambolle-Pock algorithm is a primal-dual algorithm, meaning that it simultaneously solves the primal problem (9) and the dual problem, which is








minimize
z




f
*



(


-

A
T



z

)



+



g
*



(
z
)


.





One variation of a Chambolle-Pock iteration with step sizes s and t, and overrelaxation parameter ρ∈(0, 2), is represented in algorithm 5 (FIG. 9). At each iteration we perform matrix-vector multiplications by A and AT are performed, but are not required to solve linear systems involving A. This is an advantage of the Chambolle-Pock algorithm over Douglas-Rachford-based methods such as ADMM. Algorithm 5 (FIG. 9) is an overrelaxed version of the Chambolle-Pock algorithm. The step sizes s and t are required to satisfy st∥A∥2≤1, where ∥⋅∥ is the matrix norm induced by the L2-norm. When this step size restriction is satisfied, the algorithm 5 may be expected to converge to a minimizer for the problem (9) (assuming that a minimizer exists). While one choice of s and t is s=t=1/∥A∥, convergence may be improved substantially by tuning the values of s and t.


While the Chambolle-Pock algorithm may facilitate the generation of a solution for fluence map optimization problems, one might hope to achieve faster convergence by using accelerated methods such as FISTA which have an optimal (among first-order methods) convergence rate of O(1/k2) (where k is the iteration number). When accelerated methods can be applied, this O(1/k2) convergence rate can be significantly faster than the O(1/k) convergence rate of the Chambolle-Pock algorithm. Another advantage of FISTA is that simple and effective line search procedures are available, meaning that there is no difficulty with step size selection. Thus, the use of accelerated proximal gradient methods (specifically, FISTA) may be preferred for fluence map optimization.


The difficulty in using the proximal gradient method for fluence map optimization is that one must first express the optimization problem in the form (7), with a differentiable function ƒ and a simple (i.e., proximable) function g, and this is not always possible. This challenge may be addressed by smoothing out the nondifferentiable penalty functions appearing in problem (1). Convex analysis provides an elegant way to smooth out a nondifferentiable convex function—the Moreau-Yosida regularization.


Moreau-Yosida Regularization.


Let ϕ: custom characterncustom character∪{∞} be closed and convex. The Moreau-Yosida regularization of ϕ with parameter γ>0 is defined by











ϕ

(
γ
)




(
x
)


=



inf

u


R
n





ϕ


(
u
)



+


1

2

t








u
-
x



2
2

.







(
11
)







It can be shown that ϕ(γ): custom characterncustom character is a convex, differentiable approximation to ϕ, and that the gradient of ϕγ is given by the formula













ϕ

(
γ
)




(
x
)



=


1
t




(

x
-


prox
γϕ



(
x
)



)

.






(
12
)







The gradient of ϕ(γ) is Lipschitz continuous with Lipschitz constant L=1/t. The parameter γ controls the amount of smoothing—for small values of γ, there is little smoothing and ϕ(γ) is a close approximation to ϕ. Correspondingly, when γ is small, the Lipschitz constant L is large. FIG. 12 shows the Moreau-Yosida regularizations of the ramp penalty function with parameters t=1 and t=0.5. Note that the Moreau-Yosida regularizations are differentiable, whereas the ramp penalty function is not.


Note that








ϕ

(
γ
)




(
x
)


=


ϕ


(


prox
γϕ



(
x
)


)


+


1

2

γ










prox

t





ϕ




(
x
)


-
x



2
2

.







When ϕ=IC, the indicator function of a closed convex set C, we have











ϕ

(
γ
)




(
x
)


=


1

2

γ










P
C



(
x
)


-
x



2
2

.






(
13
)







This is the proximity function that is used as the basis for the unified approach to inversion problems in IMRT presented in. The gradient of the proximity function follows from equation (12): ∇ϕ(γ)(x)=(1/γ)(x−PC(x)).


Smoothed FMO Problem.


Below, the problem (1) is replaced with the smoothed problem











minimize
x








Γ

(
γ
)




(


A
0


x

)



+




i
=
0

N








Φ

(
i
)


(
γ
)




(


A
i


x

)



+


Ψ

(
γ
)




(
Dx
)


+


Θ


(
x
)


.





(
14
)







The functions Γ(γ), Φi(γ), and Ψ(γ) are the Moreau-Yosida regularizations (with parameter γ>0 of the functions Γ, Φi, and Ψ from problem (1). The function Θ may not be smoothed. In some variations, the same smoothing parameter γ may be used for each penalty function, but other smoothing parameters may also be used. It is assumed that the (possibly nondifferentiable) functions Γ, Φi, Ψ and Θ have prox-operators that can be evaluated efficiently.


As a special case, when Γ, Φi, and Ψ are taken to be indicator functions, their Moreau-Yosida regularizations are proximity functions (defined in equation (13)) and may be used with the fluence map optimization problem considered in the unified approach to inversion problems in IMRT. The smoothed problem may be solved using the projected gradient method, rather than an accelerated projected gradient method.


The smoothed problem as indicated in equation (14) may be solved using accelerated proximal gradient methods. Problem (14) has the form (7), where










f


(
x
)


=



Γ

(
γ
)




(


A
0


x

)


+




i
=
0

N








Φ
i

(
γ
)




(


A
i


x

)



+


Ψ

(
γ
)




(
Dx
)







(
15
)








and g=Θ. When using accelerated proximal gradient methods, each iteration may comprise computing the gradient of ƒ and the prox-operator of g. The prox-operator of g=Θ may be computed efficiently by assumption. Hence, it remains only to show how the gradient of ƒ can be computed efficiently.


By the chain rule, the gradient of ƒ is given by









f


(
x
)



=



A
0
T






Γ

(
γ
)




(


A
0


x

)




+




i
=
0

N








A
i
T






Φ
i

(
γ
)




(


A
i


x

)





+


D
T







Ψ

(
γ
)




(
Dx
)



.







The gradients on the right can be computed using equation (12):











Γ

(
γ
)




(
v
)



=


1
γ



(

v
-


prox
γΓ



(
v
)



)



,









Φ
i

(
γ
)




(

y
i

)



=


1
γ



(


y
i

-


prox

γΦ
i




(

y
i

)



)



,









Ψ

(
γ
)




(

y

N
+
1


)



=


1
γ




(


y

N
+
1


-


prox
γΦ



(

y

N
+
1


)



)

.







Each of these gradients can be computed efficiently because we are assuming that the prox-operators of Γ, Φi, and Ψ are inexpensive.


When using the proximal gradient method or FISTA with a fixed step size to solve problem (7), the step size is typically chosen to be t=1/L, where L is the Lipschitz constant for ∇ƒ. In practice, the Lipschitz constant for ∇ƒ is usually not known exactly, so L is taken to be the best available upper bound for the Lipschitz constant for ∇ƒ. When ƒ is given by equation (15), an upper bound for the Lipschitz constant for of is






L
=


1
γ




(





A
0



2

+




i
=
0

N










A
i



2


+



D


2


)

.






This upper bound on the Lipschitz constant might be overly pessimistic, however, a line search procedure may be used when solving problem (14) by the proximal gradient method or by FISTA.


Below, one example of how to use the Chambolle-Pock algorithm to solve the general fluence map optimization problem (1) is described and depicted, under the assumption that the convex penalty functions Γ, Φi, Ψ, and Θ have proximal operators that can be evaluated efficiently. This may include one or more (e.g., all) of penalty functions described previously.


Algorithm Derivation


Problem (1) can be expressed in the canonical form (9) by taking ƒ=Θ and







A
=

[




A
0






A
0






A
1











A
N





D



]


,






g


(

v
,

y
0

,

y
1

,

,

y

N
+
1



)


=


Γ


(
v
)


+




i
=
0

N








Φ
i



(

y
i

)



+


Ψ


(

y

N
+
1


)


.







(Note that A0 appears twice in A.) Having expressed the fluence map optimization problem in the form (9), which is suitable for the Chambolle-Pock algorithm, it remains only to show how to compute the prox-operators of ƒ and g*.


The prox-operator of ƒ is just the prox-operator of Θ, and in some variations, may be evaluated efficiently. To see how to evaluate the prox-operator of g*, first note that g is a separable sum. By the separable sum rule for conjugates








g
*



(

w
,

z
0

,





,

z

N
+
1



)


=



Γ
*



(
w
)


+




i
=
0

N








Φ
i
*



(

z
i

)



+



Ψ
*



(

z

N
+
1


)


.






Let t>0. The separable sum rule for prox-operators








prox

tg
*




(

w
,

z
0

,





,

z

N
+
1



)


=


[





prox

t






Γ
*





(
w
)







prox

t







Φ
0
*



(

z
0

)















prox

t






Φ
N
*





(

z
N

)








prox

t






Ψ
*





(

z

N
+
1


)





]

.





The Moreau decomposition theorem expresses the prox-operators of the functions Γ*, Φ*i, and Ψ* in terms of the prox-operators of the functions Γ, Φi, and Ψ. These prox-operators are assumed to be inexpensive. Thus, it may be that the prox-operator of g* may be evaluated efficiently.


Table 1 depicted in FIG. 2 gives formulas for the prox-operators of the most typical penalty functions Γ, Φi, Ψ, and Θ. These formulas follow from the results listed in section 3, including the scaling and shifting rule (3.2).


Numerical Results


The Chambolle-Pock algorithm may be used to solve the fluence map optimization problem












minimize
x






μ






(



A
0


x

-


)

-



1


+




i
=
0

N








α
i







(



A
i


x

-

u
i


)

+



1



+




i
=
1

N









β
i

2







A
i


x



2
2



+

η




Dx


1














subject





to





x


0.





(
10
)







Referring now to FIG. 10, the PTV is delineated by contour 1000. The region that receives at least 50 gray delineated by contour 1002 coincides almost exactly with the PTV contour 1000.


Stopping Criteria for FISTA


As explained above, the iterative portion of the fluence map generation method (e.g., accelerated proximal gradient methods such as FISTA or proximal algorithms such Chambolle-Pock) may exit once stopping criteria are attained (step 164 of the method 150). Fluence map generation or optimization methods comprising FISTA may have stopping criteria as described below.


A proximal gradient method (such as an accelerated proximal gradient method, including FISTA, as described herein) seeks to solve the primal problem:











minimize
x







g


(
x
)



+

h


(
x
)






(
11
)








via the iteration

x+=proxth(x−t∇g(x)).  (12)


(Here h: custom characterNcustom character∪{∞} is a proper closed convex function, g: custom characterNcustom character is convex, differentiable, and has a Lipschitz continuous gradient, and t>0). The primal optimality condition for (11) is

0∈∇g(x)+∂h(x).  (13)


A proximal gradient method may iterate to find r such that

r∈∇g(x)+∂h(x)

and r is nearly equal to 0, then this means that (13) is nearly satisfied and x is close to optimal.









x
=




prox
th



(

x
^

)






x
^

-
x

t







h


(
x
)



.






(
14
)







Using (14), equation (12) may be equivalent to











x
-

t




g


(
x
)




-

x
+


t






h


(

x
+

)






x
-

x
+


t







g


(
x
)



+




h


(

x
+

)



.






(
15
)







Condition (15) almost has the desired form, which may be rewritten equivalently as the following:












x
-

x
+


t

+



g


(

x
+

)



-



g


(
x
)









g


(

x
+

)



+




h


(

x
+

)



.






(
16
)







Condition (16) shows that x+ is a nearly optimal, provided that the residual






r
=



x
-

x
+


t

+



g


(

x
+

)



-



g


(
x
)









is small. Hence, a possible stopping criterion may be ∥r∥<ϵ. In some variations, a stopping criterion may include a “relative” residual, in which case the stopping criterion may be approximately:









r





x
+




<
ϵ




FISTA solves (11) via the iteration










y
=


x

k
-
1


+


(


k
-
1


k
+
2


)



(


x

k
-
1


-

x

k
-
2



)











x
k

=



prox
th



(

y
-

t




g


(
y
)





)


.






(
17
)







Equation (17) may be equivalent to:








y
-

t




g


(
y
)




-

x
k


t






h


(

x
k

)







y
-

x
k


t

+



g


(

x
k

)



-



g


(
y
)









g
(


x
k

+




h


(

x
k

)



.








Hence the residual may be written as:






r
=



y
-

x
k


t

+



g


(

x
k

)



-



g


(
y
)









to get a stopping criterion for FISTA.


Alternatively, if a Lipschitz constant L for ∇g were known, a less computationally-intensive stopping criterion (i.e., which may not require evaluating the gradient of g at xk) may include:









r








y
-

x
k


t



+






g


(

x
k

)



-



g


(
y
)











1
t





y
-

x
k





+

L





x
k

-
y






=


(


1
t

+
L

)





y
-

x
k










Stopping Criteria for Chambolle-Pock


Fluence map generation or optimization methods comprising Chambolle-Pock may have stopping criteria as described below. Chambolle-Pock solves the primal problem:











minimize
x



F


(
Kx
)



+

G


(
x
)






(
18
)









    • and simultaneously the dual problem











minimize
z




G
*



(


-

K
T



z

)



+


F
*



(
z
)








    • via the iteration

      zn+1=proxσF*(zn+σKxn)  (19)
      xn+1=proxrG(xn−τKTzn+1)  (20)
      xn+1=xn+1+θ(xn+1−xn).  (21)





In one variation, where a dual feasible variable may be available, a stopping criterion may be based on the duality gap. In the fluence map estimation problem, G is the indicator function for the nonnegative orthant, and it can be shown that G* is the indicator function for the nonpositive orthant. If the condition −KT z≤0 is met, z may be dual feasible. In some variations, this may not be satisfied by zn+1. Moreover, in the fluence map estimation problem, F*(z) may be finite only when z=(z1, z2, z3) satisfies ∥z2≤λ and ∥z3≤λ. These are additional constraints that z must satisfy to be dual feasible.


Alternatively, a stopping criterion may be based on residuals in KKT conditions. The Karush-Kuhn-Tucker (KKT) conditions are the set of necessary conditions for a solution of an optimization problem to be optimal, which can be expressed as a system of equations and inequalities involving the cost function and constraints on inputs. This may be used in certain variations where a stopping criterion based on the duality gap may be computationally intensive and/or where the conditions for a stopping criterion based on the duality gap are not met. The KKT conditions for the problem (18) can be written as

0∈KTz+∂G(x)  (22)
0∈−Kx+∂F*(z).  (23)


The key idea for this stopping criterion is to find r1 and r2 such that

r1∈KTz+∂G(x),
r2∈−Kx+∂F*(z)

and r1 and r2 are nearly equal to 0. If these conditions are met, then this means that (22) and (23) are nearly satisfied and x and z may be close to optimal.


Notably, x may be written as:









x
=




prox
tf



(

x
^

)






x
^

-
x

t











f


(
x
)



.






(
24
)







Using (24), equation (19) may be rewritten as follows:












z
n

+

σ





K



x
_

n


-

z

n
+
1



σ











F
*



(

z

n
+
1


)







z
n

-

z

n
+
1



σ






-
K




x
_

n


+





F
*



(

z

n
+
1


)



.






(
25
)








and (20) is equivalent to












x
n

+

τ






K
T



z

n
+
1



-

x

n
+
1



τ










G


(

x

n
+
1


)







x
n

-

x

n
+
1



τ






K
T



z

n
+
1



+




G


(

x

n
+
1


)



.






(
26
)







As such, equation (25) may be expressed equivalently as:













z
n

-

z

n
+
1



σ

+

K


(



x
_

n

-

x

n
+
1



)






-

Kx

n
+
1



+





F
*



(

z

n
+
1


)



.






(
27
)







Conditions (26) and (27) taken together now show that (xn+1, zn+1) is a nearly optimal pair of primal and dual variables, provided that the residuals







r
1

=





x
n

-

x

n
+
1



τ






and






r
1


=




z
n

-

z

n
+
1



σ

+

K


(



x
_

n

-

x

n
+
1



)









are small. As such, one variation of a possible stopping criterion for Chambolle-Pock may be ∥r1∥<ϵ and ∥r∥<ϵ. In some variations, a stopping criterion may include a “relative” residual, in which case the stopping criterion may be approximately:











r
1






x

n
+
1





<
ϵ

,





r
2






z

n
+
1





<
ϵ





This stopping criterion may include an extra multiplication by K each iteration (e.g., K(xn−xn+1)). To help reduce the computational load of additional multiplication by K and KT, one variation of a method for generating a fluence map comprising a Chambolle-Pock method may be to check the stopping criteria or residuals at some iterations but not others. For example, the residuals may be calculated once every 20 or so iterations instead of every iteration. In other variations, the residuals may be calculated once every 2 iterations, once every 5 iterations, once every 10 iterations, once every 12 iterations, once every 25 iterations, once every 30 iterations, once every 50 iterations, etc. Alternatively, the residuals may be calculated at every iteration.


The FMO or fluence map generation methods described herein may be implemented by a computer or controller, for example, in a computer program or software stored in a machine-readable storage medium. The computer or controller may be part of a radiation therapy system such that a fluence map generated by the FMO or fluence map generation methods described herein may be used, for example, to position a radiation source at a particular angle with respect to the ROI or patient (by rotating a gantry on which the radiation source is mounted) and to adjust the beam intensity generated by the radiation source in accordance with the fluence map. Alternatively or additionally, the FMO or fluence map generation methods described herein may be implemented by a computer or controller that is separate from a radiation therapy system. The resultant fluence map may then be transferred to the controller of a radiation therapy system using a machine-readable transmission medium, such as (but not limited to) electrical, optical, acoustical, or other type of medium suitable for transmitting electronic information.


Controller


A system (e.g., a treatment planning system) that may be configured to generate fluence maps based on treatment plan parameters may comprise a controller in communication with a radiation therapy system and/or a clinician and/or operator. The controller may comprise one or more processors and one or more machine-readable memories in communication with the one or more processors. The controller may be connected to a radiation therapy system and/or other systems by wired or wireless communication channels. In some variations, the controller of a treatment planning system may be located in the same or different room as the patient. For example, the controller may be coupled to a patient platform or disposed on a trolley or medical cart adjacent to the patient and/or operator.


The controller may be implemented consistent with numerous general purpose or special purpose computing systems or configurations. Various exemplary computing systems, environments, and/or configurations that may be suitable for use with the systems and devices disclosed herein may include, but are not limited to software or other components within or embodied on personal computing devices, network appliances, servers or server computing devices such as routing/connectivity components, portable (e.g., hand-held) or laptop devices, multiprocessor systems, microprocessor-based systems, and distributed computing networks.


Examples of portable computing devices include smartphones, personal digital assistants (PDAs), cell phones, tablet PCs, phablets (personal computing devices that are larger than a smartphone, but smaller than a tablet), wearable computers taking the form of smartwatches, portable music devices, and the like.


Processor


In some embodiments, a processor may be any suitable processing device configured to run and/or execute a set of instructions or code and may include one or more data processors, image processors, graphics processing units, physics processing units, digital signal processors, and/or central processing units. The processor may be, for example, a general purpose processor, Field Programmable Gate Array (FPGA), an Application Specific Integrated Circuit (ASIC), or the like. The processor may be configured to run and/or execute application processes and/or other modules, processes and/or functions associated with the system and/or a network associated therewith (not shown). The underlying device technologies may be provided in a variety of component types, e.g., metal-oxide semiconductor field-effect transistor (MOSFET) technologies like complementary metal-oxide semiconductor (CMOS), bipolar technologies like emitter-coupled logic (ECL), polymer technologies (e.g., silicon-conjugated polymer and metal-conjugated polymer-metal structures), mixed analog and digital, or the like.


Memory


In some embodiments, memory may include a database (not shown) and may be, for example, a random access memory (RAM), a memory buffer, a hard drive, an erasable programmable read-only memory (EPROM), an electrically erasable read-only memory (EEPROM), a read-only memory (ROM), Flash memory, etc. The memory may store instructions to cause the processor to execute modules, processes and/or functions associated with the system, such as the calculation of fluence maps based on treatment plan and/or clinical goals, segmentation of fluence maps into radiation therapy system instructions (e.g., that may direct the operation of the gantry, therapeutic radiation source, multi-leaf collimator, and/or any other components of a radiation therapy system and/or diagnostic or treatment planning system), and image and/or data processing associated with treatment planning and/or delivery.


Some embodiments described herein relate to a computer storage product with a non-transitory computer-readable medium (also may be referred to as a non-transitory processor-readable medium) having instructions or computer code thereon for performing various computer-implemented operations. The computer-readable medium (or processor-readable medium) is non-transitory in the sense that it does not include transitory propagating signals per se (e.g., a propagating electromagnetic wave carrying information on a transmission medium such as space or a cable). The media and computer code (also may be referred to as code or algorithm) may be those designed and constructed for the specific purpose or purposes. Examples of non-transitory computer-readable media include, but are not limited to, magnetic storage media such as hard disks, floppy disks, and magnetic tape; optical storage media such as Compact Disc/Digital Video Discs (CD/DVDs); Compact Disc-Read Only Memories (CD-ROMs), and holographic devices; magneto-optical storage media such as optical disks; solid state storage devices such as a solid state drive (SSD) and a solid state hybrid drive (SSHD); carrier wave signal processing modules; and hardware devices that are specially configured to store and execute program code, such as Application-Specific Integrated Circuits (ASICs), Programmable Logic Devices (PLDs), Read-Only Memory (ROM), and Random-Access Memory (RAM) devices. Other embodiments described herein relate to a computer program product, which may include, for example, the instructions and/or computer code disclosed herein.


A user interface may serve as a communication interface between an operator or clinician and the treatment planning system. The user interface may comprise an input device and output device (e.g., touch screen and display) and be configured to receive input data and output data from one or more of the support arm, external magnet, sensor, delivery device, input device, output device, network, database, and server. Sensor data from one or more sensors may be received by user interface and output visually, audibly, and/or through haptic feedback by one or more output devices. As another example, operator control of an input device (e.g., joystick, keyboard, touch screen) may be received by user and then processed by processor and memory for user interface to output a control signal to one or more support arms, external magnets, intracavity devices, and delivery devices.


Some variations of a treatment planning system for generating fluence maps may comprise a display device that may allow an operator to view graphical and/or textual representations of fluence maps, and/or dose distributions, and/or regions of interest, and/or volumes of interest, and/or patient anatomical images, and/or patient data (e.g., physiological and/or biological), and the like. In some variations, an output device may comprise a display device including at least one of a light emitting diode (LED), liquid crystal display (LCD), electroluminescent display (ELD), plasma display panel (PDP), thin film transistor (TFT), organic light emitting diodes (OLED), electronic paper/e-ink display, laser display, and/or holographic display.


Communication


In some embodiments, a treatment planning system may be in communication with other computing devices (not shown) via, for example, one or more networks, each of which may be any type of network (e.g., wired network, wireless network). A wireless network may refer to any type of digital network that is not connected by cables of any kind. Examples of wireless communication in a wireless network include, but are not limited to cellular, radio, satellite, and microwave communication. However, a wireless network may connect to a wired network in order to interface with the Internet, other carrier voice and data networks, business networks, and personal networks. A wired network is typically carried over copper twisted pair, coaxial cable and/or fiber optic cables. There are many different types of wired networks including wide area networks (WAN), metropolitan area networks (MAN), local area networks (LAN), Internet area networks (IAN), campus area networks (CAN), global area networks (GAN), like the Internet, and virtual private networks (VPN). Hereinafter, network refers to any combination of wireless, wired, public and private data networks that are typically interconnected through the Internet, to provide a unified networking and information access system.


Cellular communication may encompass technologies such as GSM, PCS, CDMA or GPRS, W-CDMA, EDGE or CDMA2000, LTE, WiMAX, and 5G networking standards. Some wireless network deployments combine networks from multiple cellular networks or use a mix of cellular, Wi-Fi, and satellite communication. In some embodiments, the systems, apparatuses, and methods described herein may include a radiofrequency receiver, transmitter, and/or optical (e.g., infrared) receiver and transmitter to communicate with one or more devices and/or networks.

Claims
  • 1. A method for calculating a fluence map for radiation therapy, the method comprising: selecting a volume of interest;selecting a plurality of voxels within the volume of interest, wherein each of the voxels has an acceptable dose range;selecting a set of candidate beamlets b={bi} having initial beamlet intensity weights x0={xi0};calculating a dose matrix A for the volume of interest based on the set of candidate beamlets b, wherein the dose matrix A contains per-voxel dose values delivered to each of the plurality of voxels by each of the candidate beamlets b; andcalculating a fluence map having a final set of beamlet intensity weights xk by adjusting the initial beamlet intensity weights according to a proximal gradient method (xk−1→xk) with a penalty function containing one or more linear penalties that iterates on the initial beamlet intensity weights until the adjusted beamlet intensity weights converge on a final set of beamlet intensity weights such that changes between iterations of the beamlet intensity weights are less than a predetermined residual criterion.
  • 2. The method of claim 1, wherein the proximal gradient method is an accelerated proximal gradient method.
  • 3. The method of claim 2, wherein the proximal gradient method is a fast iterative shrinkage-thresholding algorithm (FISTA).
  • 4. The method of claim 1, wherein the one or more linear penalties comprises one or more L1 penalties.
  • 5. The method of claim 1, wherein the penalty function has been smoothed such that the penalty function is differentiable.
  • 6. The method of claim 5, wherein the penalty function has been smoothed using Moreau-Yosida regularization.
  • 7. The method of claim 1, wherein the initial set of beamlet intensity weights {x0} is an all-zero vector.
  • 8. The method of claim 1, wherein the beamlets in the set of candidate beamlets b are divided between a smaller set of firing angles {fi}.
  • 9. The method of claim 8, wherein the set of firing angles {fi} comprises a plurality of angles around a patient area of a radiation treatment device.
  • 10. The method of claim 9, wherein the plurality of firing angles are evenly distributed 360 degrees around the patient area.
  • 11. The method of claim 1, wherein the proximal gradient method penalty function comprises one or more quadratic penalties.
  • 12. The method of claim 11, wherein the one or more quadratic penalties comprises one or more L2 penalties.
  • 13. The method of claim 1, wherein the penalty function penalizes voxel dose excursions outside of the acceptable dose range.
  • 14. The method of claim 13, wherein the penalty function is a single-value penalty function that aggregates the voxel dose excursions outside of the acceptable dose range of all of the voxels in the volume of interest.
  • 15. The method of claim 1, wherein the acceptable dose range of each of the plurality of voxels is determined at least in part by a treatment plan.
  • 16. The method of claim 1, wherein the volume of interest is a first volume of interest and the method further comprises: selecting a second volume of interest;selecting a second plurality of voxels within the second volume of interest, wherein each of the voxels has an acceptable dose range; andwherein calculating the dose matrix A further comprises calculating the dose matrix A for the first and second volumes of interest based on the set of candidate beamlets b, wherein the dose matrix A contains per-voxel dose values delivered to each of the first and second plurality of voxels by each of the candidate beamlets b.
  • 17. The method of claim 1, further comprising segmenting the fluence map into a set of multi-leaf collimator and radiation source positioning instructions.
  • 18. The method of claim 1, wherein the proximal gradient method penalty function comprises one or more indicator functions, the one or more indicator functions comprising an indicator function for an L∞-norm ball.
  • 19. A system for calculating a fluence map for radiation therapy, the system comprising: a processor configured to: select a volume of interest;select a plurality of voxels within the volume of interest, wherein each of the voxels has an acceptable dose range;select a set of candidate beamlets b={bi} having initial beamlet intensity weights x0={xi0};calculate a dose matrix A for the volume of interest based on the set of candidate beamlets b, wherein the dose matrix A contains per voxel dose values delivered to each of the plurality of voxels by each of the candidate beamlets b;calculate a fluence map comprising a final set of beamlet intensity weights xk by adjusting the initial beamlet intensity weights according to a proximal gradient method update (xk−1→xk) with a penalty function containing one or more linear penalties that iterates on the initial beamlet intensity weights until the adjusted beamlet intensity weights converge on a final set of beamlet intensity weights such that changes between iterations of the beamlet intensity weights are less than a predetermined residual criterion; andstore the fluence map in a processor memory.
  • 20. The system of claim 19, wherein the proximal gradient method is an accelerated proximal gradient method.
  • 21. The system of claim 20, wherein the accelerated proximal gradient method is a fast iterative shrinkage-thresholding algorithm (FISTA).
  • 22. The system of claim 19, wherein the one or more linear penalties comprises one or more L1 penalties.
  • 23. The system of claim 19, wherein the penalty function has been smoothed such that the penalty function is differentiable.
  • 24. The system of claim 23, wherein the penalty function has been smoothed using Moreau-Yosida regularization.
  • 25. The system of claim 19, wherein the penalty function penalizes voxel dose excursions outside of the acceptable dose range.
  • 26. The system of claim 25, wherein the penalty function is a single-value penalty function that aggregates the voxel dose excursions outside of the acceptable dose range of all of the voxels in the volume of interest.
  • 27. The system of claim 19, wherein the acceptable dose range of each of the plurality of voxels is determined at least in part by a treatment plan.
  • 28. The system of claim 19, wherein the processor is further configured to: select a second volume of interest;select a second plurality of voxels within the second volume of interest, wherein each of the voxels has an acceptable dose range; andwherein calculating the dose matrix A further comprises calculating the dose matrix A for the first and second volumes of interest based on the set of candidate beamlets b, wherein the dose matrix A contains per-voxel dose values delivered to each of the first and second plurality of voxels by each of the candidate beamlets b.
  • 29. The system of claim 19, further comprising a radiation therapy system comprising a therapeutic radiation source movable about a patient area and configured to apply radiation beamlets to the patient area according to the fluence map.
  • 30. The system of claim 29, wherein a set of firing angles{fi} comprises a plurality of angles around the patient area of a radiation treatment device.
  • 31. The system of claim 30, wherein the plurality of firing angles are evenly distributed 360 degrees around the patient area.
  • 32. The system of claim 29, wherein the radiation therapy system further comprises a multi-leaf collimator disposed in a beam path of the therapeutic radiation source, and wherein the processor is configured to segment the fluence map into a set of multi-leaf collimator instructions and to transmit the instructions to the radiation therapy system.
  • 33. The system of claim 29, wherein the radiation therapy system further comprises one or more PET detectors.
  • 34. The system of claim 29, wherein the therapeutic radiation source is movable about the patient area at a speed of at least about 40 RPM.
  • 35. The system of claim 19, wherein the proximal gradient method penalty function comprises one or more indicator functions, the one or more indicator functions comprising an indicator function for an L∞-norm ball.
CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation of International Patent Application Number PCT/US2017/021647, filed on Mar. 9, 2017, which claims priority to U.S. Provisional Patent Application No. 62/305,974, filed on Mar. 9, 2016, each of which is hereby incorporated by reference in its entirety.

US Referenced Citations (256)
Number Name Date Kind
3794840 Scott Feb 1974 A
3906233 Vogel Sep 1975 A
3987281 Hodes Oct 1976 A
4389569 Hattori et al. Jun 1983 A
4503331 Kovacs, Jr. et al. Mar 1985 A
4529882 Lee Jul 1985 A
4563582 Mullani Jan 1986 A
4575868 Ueda et al. Mar 1986 A
4642464 Mullani Feb 1987 A
4647779 Wong Mar 1987 A
4677299 Wong Jun 1987 A
4868844 Nunan Sep 1989 A
5015851 Singh et al. May 1991 A
5075554 Yunker et al. Dec 1991 A
5206512 Iwao Apr 1993 A
5207223 Adler May 1993 A
5317616 Swerdloff et al. May 1994 A
5329567 Ikebe Jul 1994 A
5351280 Swerdloff et al. Sep 1994 A
5390225 Hawman Feb 1995 A
5394452 Swerdloff et al. Feb 1995 A
5396534 Thomas Mar 1995 A
5418827 Deasy et al. May 1995 A
5442675 Swerdloff et al. Aug 1995 A
5647663 Holmes Jul 1997 A
5751781 Brown et al. May 1998 A
5813985 Carroll Sep 1998 A
5818902 Yu Oct 1998 A
5851182 Sahadevan Dec 1998 A
5937028 Tybinkowski et al. Aug 1999 A
6038283 Carol et al. Mar 2000 A
6180943 Lange Jan 2001 B1
6184530 Hines et al. Feb 2001 B1
6188748 Pastyr et al. Feb 2001 B1
6239438 Schubert May 2001 B1
6260005 Yang et al. Jul 2001 B1
6281505 Hines et al. Aug 2001 B1
6385288 Kanematsu May 2002 B1
6393096 Carol et al. May 2002 B1
6438202 Olivera et al. Aug 2002 B1
6449331 Nutt et al. Sep 2002 B1
6449340 Tybinkowski et al. Sep 2002 B1
6455856 Gagnon Sep 2002 B1
6459762 Wong et al. Oct 2002 B1
6473634 Barni Oct 2002 B1
6504899 Pugachev et al. Jan 2003 B2
6552693 Leisten Apr 2003 B1
6560311 Shepard et al. May 2003 B1
6618467 Ruchala et al. Sep 2003 B1
6661866 Limkeman et al. Dec 2003 B1
6696694 Pastyr et al. Feb 2004 B2
6700949 Susami et al. Mar 2004 B2
6714620 Caflisch et al. Mar 2004 B2
6730924 Pastyr et al. May 2004 B1
6735277 McNutt et al. May 2004 B2
6794653 Wainer et al. Sep 2004 B2
6810103 Tybinkowski et al. Oct 2004 B1
6831961 Tybinkowski et al. Dec 2004 B1
6865254 Näfstadius Mar 2005 B2
6888919 Graf May 2005 B2
6914959 Bailey et al. Jul 2005 B2
6934363 Seufert Aug 2005 B2
6965661 Kojima et al. Nov 2005 B2
6976784 Kojima et al. Dec 2005 B2
6990175 Nakashima et al. Jan 2006 B2
7015490 Wang et al. Mar 2006 B2
7020233 Tybinkowski et al. Mar 2006 B1
7026622 Kojima et al. Apr 2006 B2
7110808 Adair Sep 2006 B2
7154096 Amano Dec 2006 B2
7167542 Juschka et al. Jan 2007 B2
7199382 Rigney et al. Apr 2007 B2
7227925 Mansfield et al. Jun 2007 B1
7242750 Tsujita Jul 2007 B2
7263165 Ghelmansarai Aug 2007 B2
7265356 Pelizzari et al. Sep 2007 B2
7266175 Romesberg Sep 2007 B1
7280633 Cheng et al. Oct 2007 B2
7291840 Fritzler et al. Nov 2007 B2
7297958 Kojima et al. Nov 2007 B2
7298821 Ein-Gal Nov 2007 B2
7310410 Sohal et al. Dec 2007 B2
7386099 Kasper et al. Jun 2008 B1
7397901 Johnsen Jul 2008 B1
7397902 Seeber et al. Jul 2008 B2
7453983 Schildkraut et al. Nov 2008 B2
7453984 Chen et al. Nov 2008 B2
7469035 Keall et al. Dec 2008 B2
7508967 Harari et al. Mar 2009 B2
7555103 Johnsen Jun 2009 B2
7558378 Juschka et al. Jul 2009 B2
7560698 Rietzel Jul 2009 B2
7564951 Hasegawa et al. Jul 2009 B2
7596209 Perkins Sep 2009 B2
7611452 Allison et al. Nov 2009 B2
7627082 Kojima et al. Dec 2009 B2
7639853 Olivera et al. Dec 2009 B2
7656999 Hui et al. Feb 2010 B2
7715606 Jeung et al. May 2010 B2
7742575 Bourne Jun 2010 B2
7755055 Schilling Jul 2010 B2
7755057 Kim Jul 2010 B2
7778691 Zhang et al. Aug 2010 B2
7792252 Bohn Sep 2010 B2
7795590 Takahashi et al. Sep 2010 B2
7801270 Nord et al. Sep 2010 B2
7835494 Nord et al. Nov 2010 B2
7839972 Ruchala et al. Nov 2010 B2
7949095 Ning et al. May 2011 B2
7957507 Cadman Jun 2011 B2
7965819 Nagata Jun 2011 B2
7983380 Guertin et al. Jul 2011 B2
8059782 Brown Nov 2011 B2
8063376 Maniawski et al. Nov 2011 B2
8073103 Otto et al. Dec 2011 B2
8077936 Wang et al. Dec 2011 B2
8085899 Nord et al. Dec 2011 B2
8116427 Kojima et al. Feb 2012 B2
8139713 Janbakhsh Mar 2012 B2
8139714 Sahadevan Mar 2012 B1
8144962 Busch et al. Mar 2012 B2
8148695 Takahashi et al. Apr 2012 B2
8149991 More Apr 2012 B2
8180020 Kilby et al. May 2012 B2
8232535 Olivera et al. Jul 2012 B2
8280002 Bani-Hashemi et al. Oct 2012 B2
8295906 Saunders et al. Oct 2012 B2
8304738 Gagnon et al. Nov 2012 B2
8335296 Dehler et al. Dec 2012 B2
8357903 Wang et al. Jan 2013 B2
8384049 Broad Feb 2013 B1
8406844 Ruchala et al. Mar 2013 B2
8442287 Fordyce, II et al. May 2013 B2
8461539 Yamaya et al. Jun 2013 B2
8467497 Lu et al. Jun 2013 B2
8483803 Pertain et al. Jul 2013 B2
8509383 Lu et al. Aug 2013 B2
8536547 Maurer, Jr. et al. Sep 2013 B2
8537373 Humphrey Sep 2013 B2
8581196 Yamaya et al. Nov 2013 B2
8588367 Busch et al. Nov 2013 B2
8617422 Koschan et al. Dec 2013 B2
8664610 Chuang Mar 2014 B2
8664618 Yao Mar 2014 B2
8712012 O'Connor Apr 2014 B2
8716669 Myaoka et al. May 2014 B2
8767917 Ruchala et al. Jul 2014 B2
9283403 Mazin et al. Mar 2016 B2
9421397 Purdie et al. Aug 2016 B2
9649509 Mazin et al. May 2017 B2
9694208 Mazin et al. Jul 2017 B2
9731148 Olivera et al. Aug 2017 B2
9764161 Mazin et al. Sep 2017 B2
9782607 Wiersma et al. Oct 2017 B2
10143857 Mazin et al. Dec 2018 B2
10159852 Mazin et al. Dec 2018 B2
10617890 Mazin et al. Apr 2020 B2
10688320 Voronenko et al. Jun 2020 B2
10695583 Mazin et al. Jun 2020 B2
20020051513 Pugachev et al. May 2002 A1
20020163994 Jones Nov 2002 A1
20030036700 Weinberg Feb 2003 A1
20030128801 Eisenberg et al. Jul 2003 A1
20040024300 Graf Feb 2004 A1
20040096033 Seppi et al. May 2004 A1
20040158416 Slates Aug 2004 A1
20050068488 Tajima Mar 2005 A1
20050111621 Riker et al. May 2005 A1
20050207531 Dempsey et al. Sep 2005 A1
20050216266 Gong et al. Sep 2005 A1
20060002511 Miller et al. Jan 2006 A1
20060113482 Pelizzari et al. Jun 2006 A1
20060159220 Heuscher Jul 2006 A1
20060173294 Ein-Gal Aug 2006 A1
20060256915 Otto et al. Nov 2006 A1
20060293583 Saracen et al. Dec 2006 A1
20070043289 Adair Feb 2007 A1
20070053491 Schildkraut et al. Mar 2007 A1
20070055144 Neustadter et al. Mar 2007 A1
20070075273 Birgy et al. Apr 2007 A1
20070085012 Thompson Apr 2007 A1
20070211857 Urano et al. Sep 2007 A1
20070221869 Song Sep 2007 A1
20070242801 Mackie et al. Oct 2007 A1
20070265528 Xu et al. Nov 2007 A1
20080002811 Allison Jan 2008 A1
20080008291 Alakuijaia et al. Jan 2008 A1
20080031404 Khamene et al. Feb 2008 A1
20080071131 Rietzel Mar 2008 A1
20080128631 Suhami Jun 2008 A1
20080156993 Weinberg et al. Jul 2008 A1
20080230705 Rousso et al. Sep 2008 A1
20090003655 Wollenweber Jan 2009 A1
20090116616 Lu et al. May 2009 A1
20090169082 Mizuta et al. Jul 2009 A1
20090256078 Mazin Oct 2009 A1
20100040197 Maniawski et al. Feb 2010 A1
20100051824 Nord et al. Mar 2010 A1
20100054411 Nord et al. Mar 2010 A1
20100054412 Brinks et al. Mar 2010 A1
20100054413 Sobering et al. Mar 2010 A1
20100067660 Maurer, Jr. et al. Mar 2010 A1
20100069742 Partain et al. Mar 2010 A1
20100074408 Bert et al. Mar 2010 A1
20100150309 Nord et al. Jun 2010 A1
20100166274 Busch et al. Jul 2010 A1
20100177871 Nord Jul 2010 A1
20100232572 Nord et al. Sep 2010 A1
20110006212 Shchory et al. Jan 2011 A1
20110044429 Takahashi et al. Feb 2011 A1
20110073763 Subbarao Mar 2011 A1
20110092814 Yamaya et al. Apr 2011 A1
20110122997 Lu et al. May 2011 A1
20110272600 Bert et al. Nov 2011 A1
20110282181 Wang et al. Nov 2011 A1
20110291015 Mazin Dec 2011 A1
20110301449 Maurer, Jr. Dec 2011 A1
20110309255 Bert et al. Dec 2011 A1
20110313231 Guertin et al. Dec 2011 A1
20120165652 Dempsey Jun 2012 A1
20120230464 Ling et al. Sep 2012 A1
20130083004 Nord et al. Apr 2013 A1
20130101156 Holt Apr 2013 A1
20130102830 Otto Apr 2013 A1
20130177135 Boettger et al. Jul 2013 A1
20130181711 Chaari et al. Jul 2013 A1
20130188856 Adler, Jr. et al. Jul 2013 A1
20130197878 Fiege et al. Aug 2013 A1
20130266116 Abenaim et al. Oct 2013 A1
20130289332 Purdie et al. Oct 2013 A1
20130324784 Fredriksson Dec 2013 A1
20130328919 Holt Dec 2013 A1
20140072109 Van Heteren et al. Mar 2014 A1
20140107390 Brown et al. Apr 2014 A1
20140239204 Orton et al. Aug 2014 A1
20140241485 Di Fonzo et al. Aug 2014 A1
20140275696 Dempsey et al. Sep 2014 A1
20140294147 Chen et al. Oct 2014 A1
20150139526 Jeong et al. May 2015 A1
20150206504 Pajak et al. Jul 2015 A1
20150302467 Marenko Oct 2015 A1
20150367143 Muraki et al. Dec 2015 A1
20160008630 Ranganatham et al. Jan 2016 A1
20160023019 Filiberti et al. Jan 2016 A1
20160038767 Wiersma et al. Feb 2016 A1
20160361566 Larkin et al. Dec 2016 A1
20170368372 Mazin et al. Dec 2017 A1
20180001109 Mazin et al. Jan 2018 A1
20180133518 Harper et al. May 2018 A1
20180345042 Voronenko et al. Dec 2018 A1
20190001152 O'Connor et al. Jan 2019 A1
20190083815 Mazin et al. Mar 2019 A1
20190091487 Pal et al. Mar 2019 A1
20190118001 Mazin et al. Apr 2019 A1
20190255362 Voronenko et al. Aug 2019 A1
20200222724 Mazin et al. Jul 2020 A1
Foreign Referenced Citations (41)
Number Date Country
101739610 Jun 2010 CN
201716569 Jan 2011 CN
101970043 Feb 2011 CN
103505819 Jan 2014 CN
204143239 Feb 2015 CN
104640205 May 2015 CN
104866928 Aug 2015 CN
104954772 Sep 2015 CN
104994909 Oct 2015 CN
10 2008 053321 May 2010 DE
0 212 135 Sep 1991 EP
2 188 815 May 2010 EP
2 687 259 Jan 2014 EP
09-33658 Feb 1997 JP
9-189769 Jul 1997 JP
2000-105279 Apr 2000 JP
2001-340474 Dec 2001 JP
2003-534823 Nov 2003 JP
2004-513735 May 2004 JP
2006-145281 Jun 2006 JP
2011-514213 May 2011 JP
WO-2000015299 Mar 2000 WO
WO-2005031629 Apr 2005 WO
WO-2008011725 Jan 2008 WO
WO-2009111580 Sep 2009 WO
WO-2009111580 Sep 2009 WO
WO-2009114117 Sep 2009 WO
WO-2009114117 Sep 2009 WO
WO-2010015358 Feb 2010 WO
WO-2012135771 Oct 2012 WO
WO-2014167461 Oct 2014 WO
WO-2014181204 Nov 2014 WO
WO-2014181204 Nov 2014 WO
WO-2014191204 Dec 2014 WO
WO-2015150575 Oct 2015 WO
WO-2015168431 Nov 2015 WO
WO-2015184549 Dec 2015 WO
WO-2016001046 Jan 2016 WO
WO-2016015163 Feb 2016 WO
WO-2016023786 Feb 2016 WO
WO-2018183748 Oct 2018 WO
Non-Patent Literature Citations (85)
Entry
Chen, X, et al. (2012). “Smoothing proximal gradient method for general structured sparse regression,” The Annals of Applied Statistics 6:719-752. (Year: 2012).
Bangert, M. et al. (2016). “Accelerated iterative beam angle selection in IMRT,” Medical Physics 43.3:1073-1082.
Beck, A. et al. (2009). “Fast gradient-based algorithms for constrained total variation image denoising and deblurring problems,” Image Processing, IEEE Transactions on 18:2419-2434.
Beck, A. et al. (2009). “A fast iterative shrinkage-thresholding algorithm for linear inverse problems. SIAM journal on imaging sciences,” 2:183-202.
Beck, A. et al. (2009). “Gradient-based algorithms with applications to signal recovery,” Convex Optimization in Signal Processing and Communications, pp. 42-88.
Becker, S. et al. (2011). “Nesta: a fast and accurate first-order method for sparse recovery. SIAM Journal on Imaging Sciences,” 4:1-39.
Boyd, S. et al. (2004). Convex optimization. Cambridge university press, 2004.
Censor, Y. et al. (2006). “A unified approach for inversion problems in intensity-modulated radiation therapy,” Physics in Medicine and Biology 51:2353.
Chambolle, A. et al. (2011). “A first-order primal-dual algorithm for convex problems with applications to imaging,” Journal of Mathematical Imaging and Vision 40:120-145.
Chen, X. et al. (2012). “Smoothing proximal gradient method for general structured sparse regression,” The Annals of Applied Statistics 6:719-752.
Combettes, P.L. et al. (2011). “Proximal splitting methods in signal processing,” In Fixed-point algorithms for inverse problems in science and engineering, pp. 185-212. Springer.
Combettes, P.L. et al. (2005). “Signal recovery by proximal forward-backward splitting,” Multiscale Modeling & Simulation, 4:1168-1200.
Condat, L. (2013). “A primal-dual splitting method for convex optimization involving lipschitzian, proximable and linear composite terms,” Journal of Optimization Theory and Applications 158:460-479.
Dieterich, S. et al. (2003). “Skin respiratory motion tracking for stereotactic radiosurgery using the CyberKnife,” Elsevier Int'l Congress Series 1256:130-136.
Extended European Search Report dated Oct. 7, 2015, for European Application No. 12 763 280.0, filed on Mar. 30, 2012, 11 pages.
Extended European Search Report dated Nov. 21, 2016, for European Application No. 18 168 947.2, filed on Mar. 30, 2012, 8 pages.
Fan, Q. et al. (2013). “Towards a Planning Scheme for Emission Guided Radiation Therapy (EGRT): FDG Based Tumor Tracking in a Metastatic Breast Cancer Patient,” Med. Phys. 40(8):081708, 12 pages.
Final Office Action dated Aug. 2, 2016, for U.S. Appl. No. 15/684,710, filed Aug. 23, 2017, 8 pages.
Gao, H. (2015). “Robust fluence map optimization via alternating direction method of multipliers with automatic parameter optimization,” AAPM 2015 talk.
Gibbons, J.P. (2004). “Dose calculation and verification for tomotherapy,” 2004 ACMP Meeting, Scottsdale, AZ., 71 total pages.
Goldstein, T. et al. (2009). “The split Bregman method for Ll-regularized problems,” SIAM Journal on Imaging Sciences 2:323-343.
Handsfield, L.L. et al. (2014). “Phantomless patient-specific TomoTherapy QA via delivery performance monitoring and a secondary Monte Carlo dose calculation,” Med. Phys. 41:101703-1-101703-9.
Hindi, H. (2013). “A tutorial on optimization methods for cancer radiation treatment planning,” In American Control Conference (ACC), pp. 6804-6816. IEEE.
International Search Report dated Jul. 20, 2012, for PCT Patent Application No. PCT/US2012/31704, filed on Mar. 30, 2012, 2 pages.
International Search Report dated Jun. 27, 2017, for PCT Patent Application No. PCT/US2017/021647, filed on Mar. 9, 2017, 3 pages.
International Search Report dated Jan. 17, 2018, for PCT Application No. PCT/US2017/061728, filed on Nov. 15, 2017, 2 pages.
International Search Report dated Oct. 3, 2018, for PCT Application No. PCT/US2018/035188, filed on May 30, 2018, 4 pages.
Japanese Office Action dated Dec. 25, 2015, for Japanese Patent Application No. 2014-502881 filed Mar. 20, 2012, 14 pages (with English translation).
Kapatoes, J.M. et al. (2001). “A feasible method for clinical delivery verification and dose reconstruction in tomotherapy,” Med. Phys. 28:528-542.
Kapatoes, J. M. (2001). “On the accuracy and effectiveness of dose reconstruction for tomotherapy,” Physics in Med. Biol. 46:943-966.
Keall, P .J. et al. (2001). “Motion adaptive x-ray therapy: a feasibility study,” Physics in Med. Biol. 46:1-10.
Lu, W. (2008). “Real-time motion-adaptive delivery (MAD) using binary MLC: I. Static beam (topotherapy) delivery,” Phys. Med. Biol. 53:6491-6511.
Lu, W. (2009). “Real-time motion-adaptive-optimization (MAO) in tomotherapy,” Phys. Med. Biol. 54:4373-4398.
McMahon, R. et al. (2008). “A real-time dynamic-MLC control algorithm for delivering IMRT to targets undergoing 2D rigid motion in the beam's eye view,” Med. Phys. 35:3875-3888.
Nesterov, Y. (1983). “A method of solving a convex programming problem with convergence rate O(1/k2),” In Soviet Mathematics Doklady, vol. 27, pp. 372-376.
Nesterov, Y. (2004). “Introductory lectures on convex optimization,” vol. 87. Springer Science & Business Media.
Nesterov, Y. (2005). “Smooth minimization of non-smooth functions,” Mathematical programming 103:127-152.
Nesterov, Y. (2007). Gradient methods for minimizing composite objective function.
Non-Final Office Action dated Feb. 24, 2017, for U.S. Appl. No. 15/069,390, filed Mar. 14, 2016, 6 pages.
Non-Final Office Action dated Feb. 21, 2018, for U.S. Appl. No. 15/684,710, filed Aug. 23, 2017, 10 pages.
Non-Final Office Action dated Mar. 27, 2018, for U.S. Appl. No. 15/684,693, filed Aug. 23, 2017, 7 pages.
Notice of Allowance dated May 18, 2017, for U.S. Appl. No. 15/069,390, filed Mar. 14, 2016, 5 pages.
Notice of Allowance dated Jul. 19, 2017, for U.S. Appl. No. 15/499,671, filed Apr. 27, 2017, 8 pages.
Notice of Allowance dated Oct. 3, 2018, for U.S. Appl. No. 15/684,693, filed Aug. 23, 2017, 5 pages.
Notice of Allowance dated Oct. 25, 2018, for U.S. Appl. No. 15/684,710, filed Aug. 23, 2017, 7 pages.
Nguyen, D. et al. (2015). “Dose domain regularization of MLC leaf patterns for highly complex IMRT plans,” Medical Physics 42:1858-1870.
Nguyen, D. et al. (2015). “A new intensity modulation radiation therapy (IMRT) optimizer solution with robust fluence maps for MLC segmentation,” Medical physics 42:3740.
Nguyen, D. et al. (2015). “A novel Haar wavelet based approach to deliver non-coplanar intensity modulated radiotherapy using sparse orthogonal collimators,” Medical physics 42:3532-3532.
Oiivera, G.H. et al. (2000). “Modifying a plan delivery without re-optimization to account for patient offset in tomotherapy,” Proceedings of the 22nd Annual EMBS International Conference, Jul. 23-28, 2000, Chicago, IL, pp. 441-444.
Papanikolaou, N. et al. (2010). “MU-Tomo: Independent dose validation software for helical tomo therapy,” J. Cancer Sci. Ther: 2:145-152.
Parikh, N. et al. (2013). “Proximal algorithms,” Foundations and Trends in Optimization 1:123-231.
Parikh, N. et al. (2014). “Block splitting for distributed optimization,” Mathematical Programming Computation 6:77-102.
Rockafellar, R.T. (1967). “Duality and stability in extremum problems involving convex functions,” Pacific Journal of Mathematics 21:167-187.
Romeijn, H. et al. (2005). “A column generation approach to radiation therapy treatment planning using aperture modulation,” SIAM Journal on Optimization 15.3:838-862.
Rudin, L. et al. (1992). “Nonlinear total variation based noise removal algorithms,” Physica D: Nonlinear Phenomena, 60(1):259-268.
Salari, E. (2013). “Mathematical optimization in radiotherapy treatment planning,” Department of Radiation Oncology Massachusetts General Hospital and Harvard Medical School, Presentation, 80 total pages.
Scheinberg, K. (2014). “Fast first-order methods for composite convex optimization with backtracking,” Foundations of Computational Mathematics 14:389-417.
The Partial Supplementary European Search Report, dated Jun. 25, 2015 for European Application No. 12763280.0, filed on Mar. 30, 2012, 6 pages.
Tseng, P. (2008). “On accelerated proximal gradient methods for convex-concave optimization,” SIAM J, Optim., pp. 1-20.
Voet, P. et al. (2013). “Toward fully automated multicriterial plan generation: A prospective clinical study,” International Journal of Radiation Oncology* Biology* Physics 85:866-872.
Written Opinion dated Jul. 20, 2012, for PCT Patent Application No. PCT/US2012/31704, filed on Mar. 30, 2012, 10 pages.
Written Opinion of the International Searching Authority dated Jun. 27, 2017, for PCT Patent Application No. PCT/US2017/021647, filed on Mar. 9, 2017, 5 pages.
Written Opinion of the International Searching Authority dated Jan. 17, 2018, for PCT Application No. PCT/US2017/061728, filed on Nov. 15, 2017, 7 pages.
Written Opinion of the International Searching Authority dated Oct. 3, 2018, for PCT Application No. PCT/US2018/035188, filed on May 30, 2018, 28 pages.
Yan, D. et al. (1997). “Adaptive radiation therapy,” Physics Med. Biol. 42:123-132.
Zarepisheh, M. et al. (2015). “An integrated alternating direction method of multipliers for treatment planning optimization,” AAPM 2015 talk.
Chang, J.Y. et al. (2008). “Image-Guided Radiation Therapy for Non-Small Cell Lung Cancer,” J. Thorac. Oncol. FEB 3(2):177-186.
Erdi, Y.E. (Feb. 2007). “The Use of PET for Radiotherapy,” Current Medical Imaging Reviews 3(1):3-16.
Extended European Search Report dated Oct. 15, 2019, for European Application No. 17 764 132.1, filed on Mar. 9, 2017, 4 pages.
Fan, Q. (Nov. 2012). “Emission Guided Radiation Therapy for Lung and Prostrate Cancers: A Feasibility Study on a Digital Patient,” Med. Phys. 39(11):7140-7152.
Hunt, M.A. et al. (2003). “Treatment Planning Considerations using IMRT,” pp. 103-121.
International Search Report dated Jun. 14, 2019, for PCT Application No. PCT/US2019/017855, filed on Feb. 13, 2019, 4 pages.
International Search Report dated Apr. 23, 2020, for PCT Application No. PCT/US2020/013927, filed on Jan. 16, 2020, 3 pages.
Lee, S. et al. (2015). “Treatment plan comparison of Linac step and shoot, tomotherapy, RapidArc, and proton therapy for prostate cancer using dosimetrical and biological index,” J. Korean Physical Society 67:7-16 (with tables 1-5), 28 total pages.
Mackie, T.R. et al. (Nov.-Dec. 1993). “Tomotherapy: A New Concept for the Delivery of Dynamic Conformal Radiotherapy,” Med. Phys. 20(6):1709-1719.
Mazin, S.R. et al. (Dec. 2010). “Emission-Guided Radiation Therapy: Biologic Targeting and Adaptive Treatment,” Journal of American College of Radiology 7(12):989-990.
Non-Final Office Action dated Aug. 30, 2019, for U.S. Appl. No. 16/193,725, filed Nov. 16, 2018, 5 pages.
Non-Final Office Action dated Sep. 19, 2019, for U.S. Appl. No. 16/217,417, filed Dec. 12, 2018, 7 pages.
Non-Final Office Action dated Dec. 6, 2019, for U.S. Appl. No. 15/993,325, filed May 30, 2018, 8 pages.
Notice of Allowance dated Jan. 21, 2020, for U.S. Appl. No. 16/193,725, filed Nov. 16, 2018, 11 pages.
Notice of Allowance dated Mar. 13, 2020, for U.S. Appl. No. 16/217,417, filed Dec. 12, 2018, 6 pages.
Notice of Allowance dated Apr. 20, 2020, for U.S. Appl. No. 15/993,325, filed May 30, 2018, 7 pages.
Written Opinion of the International Searching Authority dated Jun. 14, 2019, for PCT Application No. PCT/US2019/017855, filed on Feb. 13, 2019, 10 pages.
Written Opinion of the International Searching Authority dated Apr. 23, 2020, for PCT Application No. PCT/US2020/013927, filed on Jan. 16, 2020, 4 pages.
Yamaya, T. et al. (Jan. 14, 2008). “A Proposal of an Open PET Geometry,” Physics in Medicine and Biology 53:757-773.
Related Publications (1)
Number Date Country
20190001152 A1 Jan 2019 US
Provisional Applications (1)
Number Date Country
62305974 Mar 2016 US
Continuations (1)
Number Date Country
Parent PCT/US2017/021647 Mar 2017 US
Child 16122735 US