The present application is based on provisional application Ser. No. 61/392,076, filed Oct. 12, 2010, the entire contents of which are herein incorporated by reference.
1. Technical Field
The present disclosure relates to time series images and, more specifically, to an interaction method for regions-of-interest in time series images.
2. Discussion of Related Art
Medical images may be either two-dimensional or three-dimensional and may be comprised of a series of image frames that together show a progression over time. After acquisition, medical images may be segmented such that the bounds of various anatomical structures are correctly identified. Segmentation may be fully manual, fully automatic, or partially automatic.
In manual segmentation, a human user is responsible for delineating the bounds of the anatomical structure. In automatic segmentation, a computer is used to delineate the bounds of the anatomical structure without human intervention. However, most commonly, the user provides some basic identification of the anatomical structure and computer vision techniques are used to work out many of the details involved with discovering the full bounds of the anatomical structure.
Many approaches to semi-automatic segmentation provide a user an opportunity to modify a computer-derived segmentation so that errors may be corrected or ask the user for input that initializes the automatic segmentation procedure. However, where the medical images being segmented are comprised of a series of image frames acquired over a span of time, it may be time consuming and tedious for the user to manually modify each image frame. Some approaches therefore use user modifications as a basis for re-executing the segmentation algorithm. However, such approaches may be computationally expensive and, where multiple image frames have been manually modified, re-executing the segmentation algorithm based on one manually modified frame may overwrite manual modifications made to other image frames.
These concerns may be exacerbated for medical images that are both three-dimensional and include a series over time. Moreover, it is possible that the medical images have a higher dimensionality than three-dimensions, for example, where functional data, flow data, or other characteristics are superimposed over the two or three spatial dimensions of the medical image.
A method for identifying a region of interest within a time sequence of images includes acquiring a time sequence of images comprising a plurality of image frames. Image segmentation is performed to segment a region of interest (ROI) from within each of the plurality of image frames of the time sequence of images. Manual edits are received for the ROI within one or more of the plurality of image frames. The manual edits are propagated to other image frames of the plurality of images. An extent to which each of the manual edits are propagated to other image frames is dependent upon a transformation function or deformation field used to propagate the manual edits and a weighing factor that is influenced by a distance in time between the other image frames and the frames that have been manually edited.
A method for identifying a region of interest within a time sequence of images includes acquiring a time sequence of images comprising a plurality of image frames. Image segmentation is performed to segment a region of interest (ROI) from within each of the plurality of image frames of the time sequence of images. Manual edits are received to the ROI within one or more of the plurality of image frames. The manual edits are propagated to other image frames of the plurality of images. An extent to which each of the manual edits are propagated to other image frames is dependent upon a distance in time between the other image frames and the frames that have been manually edited.
The extent to which each of the manual edits are propagated to other image frames may be dependent upon a transformation function or deformation field used to propagate the manual edits and a weighing factor that is influenced by the distance in time between the other image frames and the frames that have been manually edited.
Propagating manual edits to the other image frames may include, for each of the other image frames, determining a first transformation based on an effect of manual edits to a nearest left manually edited image frame, determining a second transformation based on an effect of manual edits to a nearest right manually edited image frame, and calculating a new contour for each of the other image frames by combining the first transformation and the second transformation in accordance with a weighing that is proportional to the distance between the other image frame and the nearest left manually edited image frame and the nearest right manually edited image frame.
The time sequence of images may be considered circular such that the first image of the time sequence of images is understood to follow the last image of the time sequence of images and manual edits are propagated past the last image to the first image.
The region of interest may be a contour, one or more landmarks, or a group of pixels. Multiple regions of interest may be identified within the time sequence of images. The time sequence of images may include a sequence of x-rays, magnetic resonance (MR) images, or computed tomography (CT) images. The ROI may delineate a particular anatomical structure, region of disease or injury, or foreign body that is observable within the time sequence of images. Each image frame of the time sequence of images may be a two-dimensional image. Each image frame of the time sequence of images may be a three-dimensional image. Image segmentation may be automatically performed using a computer algorithm. The manual edits may be provided by a human user and include manual adjustments to the ROI to more accurately represent a particular structure. Propagation of the manual edits to the other image frames may include modifying the ROI of one or more of the image frames that have not been manually edited in a manner similar to the manner of the manual edits. The extent to which each of the manual edits are propagated to other image frames may be inversely proportional to the distance in time between the other image frames and the frames that have been manually edited.
A method for identifying a region of interest within a time sequence of images includes acquiring a time sequence of images comprising a plurality of image frames. Image segmentation is performed to segment a region of interest (ROI) from within each of the plurality of image frames of the time sequence of images. Manual edits are received for the ROI within a first edited image frame of the plurality of image frames. Manual edits are received for the ROI within a second edited image frame of the plurality of image frames. The manual edits to the ROI within the first edited image frame are propagated to image frames of the plurality of images preceding the first edited image frame and to image frames between the first edited image frame and the second edited image frame, to an extent that is dependent upon a first transformation function used to propagate the manual edits to the ROI within the first edited image and a first weighing factor that is dependent upon a distance between the image frames and the first edited image frame. The manual edits to the ROI within the second edited image frame are propagated to image frames of the plurality of images subsequent to the second edited image frame and to image frames between the first edited image frame and the second edited image frame, to an extent that is dependent upon a second transformation function used to propagate the manual edits to the ROI within the second edited image and a second weighing factor that is dependent upon a distance between the image frames and the second edited image frame.
The first edited image may be a first keyframe and the second edited image may be a second keyframe. The region of interest may be a contour, one or more landmarks, or a group of pixels. Multiple regions of interest may be identified within the time sequence of images. The time sequence of images may include a sequence of x-rays, magnetic resonance (MR) images, or computed tomography (CT) images. The ROI may delineate a particular anatomical structure, region of disease or injury, or foreign body that is observable within the time sequence of images. Each image frame of the time sequence of images may be a two-dimensional image. Each image frame of the time sequence of images may be a three-dimensional image. Image segmentation may be automatically performed using a computer algorithm. The manual edits may be provided by a human user and include manual adjustments to the ROI to more accurately represent a particular structure. Propagation of the manual edits to the other image frames may include modifying the ROI of one or more of the image frames that have not been manually edited in a manner similar to the manner of the manual edits.
A computer system includes a processor and a non-transitory, tangible, program storage medium, readable by the computer system, embodying a program of instructions executable by the processor to perforin method steps for identifying a region of interest within a time sequence of images. The method includes acquiring a time sequence of images comprising a plurality of image frames. Image segmentation is performed to segment a region of interest (ROI) from within each of the plurality of image frames of the time sequence of images. Manual edits to the ROI within one or more of the plurality of image frames are received. The manual edits are propagated to other image frames of the plurality of images using a transformation function or deformation field. An extent to which each of the manual edits are propagated to other image frames is dependent upon the transformation function or deformation field and a weighing factor that is influenced by a distance in time between the other image frames and the frames that have been manually edited. The manual edits from one of the plurality of images are not propagated to other image frames that have been manually edited.
A more complete appreciation of the present disclosure and many of the attendant aspects thereof will be readily obtained as the same becomes better understood by reference to the following detailed description when considered in connection with the accompanying drawings, wherein:
In describing exemplary embodiments of the present disclosure illustrated in the drawings, specific terminology is employed for sake of clarity. However, the present disclosure is not intended to be limited to the specific terminology so selected, and it is to be understood that each specific element includes all technical equivalents which operate in a similar manner.
Exemplary embodiments of the present invention provide systems and methods for propagating user edits of computer-derived segmentation and/or computer-identified landmarks throughout a time series of images. In particular, the time series of images may be two or three dimensional medical images with multiple image frames that span a length of time. For example, the images may be X-ray images, fluoroscopy images, computed tomography (CT) images, magnetic resonance (MR) images, ultrasound images, positron emission tomography (PET) images, or the like.
The computer-derived segmentation may include the identification of a region-of-interest (ROI) which may include the delineation of a particular anatomical structure, a region or tumor of diseased tissue, a foreign body, or any other aspect of the imagery that is of particular concern.
The user edits may include changes made to the delineation of the ROI. Changes may be made, for example, by displaying the acquired imagery on a display device and providing the user with an interface within which the delineation of the ROI may be adjusted, for example, using a mouse and cursor, keyboard commands, and/or a touch-screen interface. The user may have the ability to select which frames of the time series that the user desires to edit. Alternatively, the user may be presented with one or more image frames of particular significance and asked to provide edits to those particular frames. For example, where the image sequence is of the heart, the user may be presented with an end-diastolic image frame and an end-systolic image frame and be asked to perform edits on these frames, such as to correct a computerized delineation of the left ventricle.
As stated above, the image may be two-dimensional; however, where the medical image is three-dimensional, the user may also have the ability to adjust the point of view of the rendering, for example, by rotating the imagery both left-to-right and top-to-bottom. Where the medical image is a time sequence, the user may have the ability to play or step through the sequence either forward or backward with respect to time.
First, image data may be acquired (Step S11). As discussed above, image data may be medical image data. The image data may be of N-dimensions, where N is a positive integer, and may include a time series of images. The data may therefore be referred to herein as N+1 dimensional image data as it may include N dimensions (1, 2, or 3 of which may be spatial dimensions) and the “+1” dimension representing time.
Segmentation may then be performed on the acquired image data (Step S12). The performance of segmentation may include identification of one or more regions-of-interest (“ROIs”) within the image data. The segmentation may be performed across multiple frames of the acquired image, for example, all frames may be segmented. The segmentation may be fully automatic or may involve initial user input, for example, the selection of seed points within at least one image frame. Segmentation may be performed using a known segmentation algorithm or using one or more trained classifiers.
The user may then perform one or more edits to adjust the computer-calculated segmentation (Step S13). Editing of the segmentation results may be performed either at predetermined frames or frames selected by the user. The user may use a cursor or touchscreen to manually adjust the segmentation results to better fit the anatomical structure.
Each image frame that the user edits may become a “keyframe.” As the user may edit multiple image frames, there may be multiple keyframes within the sequence of images. Each ROI may be edited one or more times in each keyframe. Where the segmentation includes multiple errors, each error may be corrected by the user with a distinct edit. There may also be multiple ROIs within each image frame, and accordingly, each keyframe may include edits to more than one ROI. Alternatively, some keyframes may include edits to one ROI while other keyframes may include edits to a distinct ROI. In fact, according to one exemplary embodiment of the present invention, a particular image frame may be a keyframe with respect to one ROI but not be a keyframe with respect to another ROI.
Exemplary embodiments of the present invention may then use the edits made to the one or more keyframes to automatically adjust one or more other image frames (Step S14). However, exemplary embodiments of the present invention may refrain from making any automatic adjustments to the keyframes. In this respect, the keyframes may be considered fixed as they may represent the user's knowledge about the ROI.
The extent to which other frames are adjusted based on the edits made to the keyframes may be determined by the distance of those other frames to the keyframes. Thus, the closer in time a given frame is to a keyframe, the greater the influence of the edits on the keyframe are on that given frame. Similarly, the farther in time a given frame is to a keyframe, the lesser the influence of the edits on the keyframe are on that given frame. In this sense, the edits made to the keyframe are propagated to neighboring frames with an influence that is inversely proportional to the distance between each frame and the keyframe; the greater the distance, the lesser the influence.
Accordingly, the shape of a current ROI may be most strongly influenced by the keyframe that is in closest proximity to it. This may guarantee a smooth transition between user edits and geometric relationships of the ROIs resulting in a consistent representation of the ROI in the N+1 dimensions.
Propagation of the edits may occur at a per-pixel level. For example, if a given pixel in a keyframe is moved up and to the right, a corresponding pixel in a proximate image frame may also be moved up and to the right, albeit, not necessarily to the same extent as the corresponding pixel of the keyframe was moved during editing.
Alternatively, propagation of the edits may occur with respect to anatomical structures or other landmarks so that if a given landmark is moved up and to the right, a corresponding landmark in a proximate image frame may also be moved up and to the right, regardless of where that given landmark is found within the image frame. This approach may be particularly useful where the ROI experiences significant motion or otherwise changes shape from frame-to-frame.
Propagation of the edits may occur both forwards and backwards in time. Accordingly, edits made to a keyframe may influence other frames that occur before and after the keyframe. Where multiple keyframes are present in a single image sequence, many image frames may be positioned between two keyframes, one occurring prior to a given image frame and another occurring subsequent to the given image frame. In such an event, the edits of two keyframes may each influence the ROI of the given image frame in accordance with the relative distance between the given image frame and each of the neighboring keyframes.
Accordingly, this interaction concept may define an ROI of a current frame that is located between two keyframes as a weighted interpolation between the corresponding ROI in the keyframe to the left and the keyframe to the right that are geometrically propagated to the current frame.
As stated above, the influence of an edit of a keyframe on a given image frame may diminish as the distance between that keyframe and the given image frame, in time, grows larger. This influence may end at or prior to the next keyframe. Thus, according to exemplary embodiments of the present invention, edits performed with respect to the ROI of one keyframe may not at all influence another keyframe, and while the frames occurring between these two keyframes may be influenced by the edit of both keyframes, to varying degrees, because the influence of one keyframe ends as the next keyframe, image frames beyond that next keyframe may not be influenced at all by the edits of the one key frame.
A function Φ may be generated to transform edits made in a keyframe to another frame of the image sequence and thereby generate the mapping between the frames to propagate individual locations, for example, pixels, contour points, etc., from frame to frame. This function may therefore represent the geometric (e.g. spatial) relationship between an ROI in a keyframe and the same ROI in another frame. The function Φ may be generated by a registration of all frames representing the geometrical mapping between the frames. However, exemplary embodiments of the present invention are not limited to this exemplary approach. Generation of the function Φ may be performed in other ways. For example, the function Φ may be generated by functional mapping of single locations generated by a Fast-Fourier Transform. In this way, the function Φ may be a function of landmarks.
The function Φ may represent a deformation field that acts upon the segmentation/landmarks of the unedited frame(s) to propagate the user edits. The deformation fields may signify how the deformation field may be influenced by a weighing factor w such that deformation is more pronounced at image frames closer to the user-edited image frame.
In
While the manner of propagation illustrated in
The edits of the keyframe shown in image (a) are propagated throughout the image sequence. The edits made to image (a) are to a lesser extent applied forward and backward through time (although no image frames prior to image (a) are shown). The influence of the edits of image (a) fall off as the time approaches T=5, and becomes zero at time T=5, which is the point of the next keyframe. However, while the degree of influence the edits have on the surrounding image frames may be related to the distance between the user-edited frames and the surrounding frames, the degree of influence need not be linearly dependent upon this distance. Rather, the degree of influence may be determined by the particular deformation fields and the weighing factor w that is proportional to the linear distance between the user-edited frames and the surrounding frames.
Similarly, the edits of the keyframe shown in image (e) are propagated throughout the image sequence. Influence of the keyframe is maximum at time T=5 and then dissipates in both directions as frames get farther away. The effect of the keyframe (e) goes to zero at time T=1, which is the point of the other keyframe (a). Moreover, the time series may be defined as circular, in which case the first image frame at time T=1 is understood to follow the last image frame, here, at time T=5. Thus, if there is only one keyframe, that single keyframe may be considered both the left and right keyframe with respect to the non-keyframe image frames.
Graph 31 of
Exemplary embodiments of the present invention may be broadly applied to various purposes. For example, the ROIs may represent contours of the heart in 2D, areas of flow in 2D, surfaces of various organs in 3D, and so on, that may be created by an automated process. The ROIs may further represent anatomical points of interest such as landmarks or contours in either 2D or 3D.
Moreover, exemplary embodiments of the present invention may provide an approach by which only a subset of the image frames, for example, two or three frames, are viewed by the user. For example, the user may, upon examining the full set of image frames, choose to more closely analyze a particular subset of images that the user may select. Alternatively, the user may be presented with an automatically selected or predetermined subset of images that may be interpreted as illustrating a key feature of the full set of image frames. In either event, the user may be provided with the opportunity to edit one or more frames of the subset.
After editing one or more keyframes of the subset, the edits may either be propagated throughout the entire subset or it may be assumed that in light of the user's opportunity to individually inspect and edit each frame of the subset, all frames of the subset, whether edited or not, may be treated as keyframes.
The user may then choose to revert the subset of images back into the full set of image frames. Upon reversion, exemplary embodiments of the present invention may propagate, forward and backward with respect to time, the edits made to the keyframes of the subset, for example, as described above, throughout those image frames of the full set of image frames that were not included in the subset.
Exemplary embodiments of the present invention may provide for a diffeomorphic relationship between the user-edited ROIs and the ROIs of related image frames in which the user edits have been propagated to. In this way, the morphology of the ROIs, for example, contours, may remain unchanged. Accordingly, propagation of the user edits would not introduce any folding or overlapping of contours onto themselves and each other.
Frames 42-44 are not keyframes and accordingly, the edits of the keyframes 41 and 45 may be propagated to each of the non-keyframes 42-44 representing phases 1-3. In so doing, the segmentation result 22 for each of the non-keyframes 42-44 may be modified based on the influence of the nearest left keyframe and the nearest right keyframe. Because the non-keyframes 42-44 occur between keyframes 41 and 45, the edits to keyframe 41 contour (show in 41′) may be considered the nearest left keyframe contour (Cleft) and the edits to keyframe 45 contour (shown in 45′) may be considered the nearest right keyframe contour (Cright).
For the purposes of providing a simple and concise explanation, the manner of propagating the user edits of the keyframes to the non-keyframes will be described herein with respect to frame 43 at time T=2, however, it is to be understood that the effects of the user edits on the other non-keyframes may be similarly calculated. In calculating the effects of the user edits on frame 43 (to arrive at frame 43′), two component effects may first be calculated. The first component effect is the effect on the frame by the nearest left keyframe (Cleft). This component effect may be referred to herein as Φ02 (Cleft) as it represents the transformation from the edits to the frame at time T=0 on the frame at time T=2. The second component effect is the effect on the frame by the nearest right keyframe (fright). This component effect may be referred to herein as Φ42 (Cright) as it represents the transformation from the edits to the frame at time T=4 on the frame at time T=2. The component effect of Cright may also be represented as Φ−124, as Φ42 is equivalent to the inverse of Φ24, which is Φ−124.
The new contour for the frame at time T=2 (illustrated in 43′) Cnew, may be an interpolation between Φ02 (Cleft) and Φ42 (Cright). A weighing factor w may be used to weigh the effects of Cleft and Cright in accordance with a linear distance between the image frame being calculated (here, at time T=2) and the nearest left and right keyframes. For example, Cnew may be calculated as follows:
Cnew=wΦ02(Cleft)+(1−w)Φ42(Cright)
However, in calculating Cnew, rather than simply adding the weighted effects of the Cleft transformation with the weighted effects of the Cright transformation, as shown, exemplary embodiments of the present invention may use a more sophisticated interpolation method. For example, the contour interpolation method may return the centerline that lies right in between both left and right contours (Cleft and Cright) where the weight w then “moves” this centerline towards one of the two contours depending on its value. Where the contours are landmarks, the given formula may apply since an “interpolation” between two landmarks can be defined by a straight line. However, where the contours are more generally defined, other interpolation methods may be used.
The computer system referred to generally as system 1000 may include, for example, a central processing unit (CPU) 1001, random access memory (RAM) 1004, a printer interface 1010, a display unit 1011, a local area network (LAN) data transmission controller 1005, a LAN interface 1006, a network controller 1003, an internal bus 1002, and one or more input devices 1009, for example, a keyboard, mouse etc. As shown, the system 1000 may be connected to a data storage device, for example, a hard disk, 1008 via a link 1007.
Exemplary embodiments described herein are illustrative, and many variations can be introduced without departing from the spirit of the disclosure or from the scope of the appended claims. For example, elements and/or features of different exemplary embodiments may be combined with each other and/or substituted for each other within the scope of this disclosure and appended claims.
Number | Name | Date | Kind |
---|---|---|---|
20080100612 | Dastmalchi et al. | May 2008 | A1 |
20110280461 | Bystrov et al. | Nov 2011 | A1 |
Entry |
---|
Hautvast et al. “Automatic Contour Propagation in Cine Cardiac Magnetic Resonance Images.” IEEE, 2006, pp. 1472-1482. |
Feng et al. “A dual propagation contours technique for semi-automated assessment of systolic and diastolic cardiac function by CMR”, J Cardiovasc Magn Reson. Aug. 13, 2009, pp. 1-13. |
Number | Date | Country | |
---|---|---|---|
20120087561 A1 | Apr 2012 | US |
Number | Date | Country | |
---|---|---|---|
61392076 | Oct 2010 | US |