1. Field of the Invention
The embodiments described herein relate generally to methods and systems for collecting and processing images in optical coherence tomography.
2. Description of Related Art
Current trends in ophthalmology make extensive use of 3D imaging and image processing techniques. Such images may be utilized for diagnosing diseases such as glaucoma, and other medical conditions affecting the human eye. One of the challenges posed by the current technological advances in imaging techniques is the efficient and meaningful processing of the massive amounts of data collected at ever increasing imaging rates. Some approaches have been to convert a 3D data set into a manageable two-dimensional (2D) image that then can be analyzed. Traditionally, a technique that has been used for data reduction from a 3D data set into a 2D image is that of 2D ‘En Face’ image processing. (See for example, Bajraszewski et al., [Proc. SPIE 5316, 226-232 (2004)], Wojtkowski et al., [Proc. SPIE 5314, 126-131 (2004)], Hitzenberger et al., [Opt Express. October 20; 11 (21):2753-61 (2003)], U.S. Pat. No. 7,301,644, or U.S. Pat. No. 7,505,142). This technique includes the summing of the intensity signals in the 3D data set along one direction, preferentially the Z-direction hereby identified with the axial direction of an Optical Coherence Tomography (OCT) scan, between two retinal tissue layers. The summation takes place among voxels having the same XY position. Typically, voxels located outside of the layers of interest are ignored during processing.
One common problem with this type of ‘En Face’ image processing technique and other volume rendering techniques is the appearance of artifacts created by the involuntary motion of the subject's eye while a data set is being collected. The movement introduces relative displacements of the collected images, so that physical features end up appearing discontinuous in the resulting 3D data set, rendering the entire set unreliable.
Another challenge that commonly occurs in such image processing is that of correlating a sequence of 3D data sets from a given sample, the different data sets having been collected during different imaging sessions spanning a long period of time. The sample can be a subject's eye, a selected vascular structure, or other selected region of interest. In this case, the lack of an efficient and reliable method for correlating each 3D data set to the same physical feature in the sample prevents an accurate assessment of the evolution of the images over time. In ophthalmology, for instance, such a change can be utilized to evaluate certain diseases or conditions in the subject.
What is needed is a better image processing technique that is capable of producing accurate and meaningful information.
An imaging device to obtain images, comprising: an optical source; an x-y scanner receiving light from the optical source and directing it onto a sample; a detector receiving reflected light from the scanner; and a computer receiving a signal from the detector and providing a 3D data set containing voxels with a given voxel resolution of a sample, the computer further executing instructions for finding values other than a single axes sum to represent a set of voxels; and composing a 2D image using the values.
A method for enhancing ophthalmology images according to some embodiments of the present invention includes obtaining a 3D image data set containing voxels with a given voxel resolution of a sample; finding values other than a single axes sum to represent a set of voxels; and composing a 2D image using the values.
a shows an embodiment of the present invention wherein a 2-D image is formed from median values of voxels along the Z-axis.
b is an exemplary flowchart according to some embodiments described in
a shows an embodiment of the present invention wherein a 2-D image is formed by collecting the median from a 3D neighborhood for each voxel and summing the 3-D medians along the Z-axis.
b is an exemplary flowchart according to some embodiments described in
a shows an embodiment of the present invention wherein a 2-D image is formed by a ‘zig zag’ summation in the Z direction.
b is an exemplary flowchart according to some embodiments described in
a shows 2D images of a segment of the RPE obtained from a 3D data set.
b shows 2D images of a segment of the RPE obtained from a 3D data set.
c shows 2D images of a segment of the ILM obtained from a 3D data set.
a shows a volume rendering process is used to create a 2D image using a perspective projection method according to some embodiments of the present invention.
b shows an embodiment of the present invention wherein a volume rendering process is used to create exemplary 2D representations of 3D frame cubes.
Existing imaging technology in ophthalmology makes extensive use of information-rich content in three-dimensional (3D) data sets obtained by imaging techniques such as optical coherence tomography (OCT). Particularly useful for the diagnostics and treatment of common diseases in ophthalmology are scanning techniques that provide images of the fundus of the eye, which is the interior surface of the eye, including the retina, optic disc, macula and fovea, and blood vessels spreading through the tissue. The retina is a light sensitive, layered tissue including an inner limiting membrane (ILM), a nerve fiber layer, a photo-receptor cell layer (also known as Inner Segment/Outer Segment (IS/OS)), including Rod and Cone cells, and a retinal pigment epithelium (RPE). A thorough clinical analysis of the retina includes a detailed and precise description of each of these layers. A problem that normally arises for imaging techniques is that the retina is a surface extending in two dimensions, with a thickness, and is curved and covers approximately 72% of a sphere of about 22 mm in diameter. This complex geometry calls for the use of methods and techniques for data processing that can provide accurate information about a subject's condition, and can be correlated through multiple images taken at different times. Another problem associated with imaging in ophthalmology is the involuntary motion of the subject's eye while a scan is being collected. This movement not only introduces blurring of the images, but may significantly destroy valuable information contained in a given data set if care is not taken to properly correlate physical features to their accurate position relative to other features in the collected data set.
A single element in a 3D data set obtained by a given scanning technique usually corresponds to a value for the signal strength in a detector, also referred to as ‘intensity’. This value is associated with a coordinate in 3D space, which is the location in the tissue sample from where the signal is being collected. In the case of OCT, the signal strength corresponds to the intensity of the light backscattered from the tissue sample. In some embodiments, the OCT signal is obtained through a first 2D scanning of illumination source 1201 on sample 1209, which may include the fundus of the eye, displacing the beam from one point to an adjacent point along a first line with XY scan 1204, and repeating this for a selected pattern of lines on a 2D surface. As shown in
In some embodiments of an OCT technique, the ‘axial’ profile is obtained by correlating the ‘depth’ coordinate Z with the spectral interference pattern produced by OCT interferometer. Some embodiments may further make use of a Fourier domain technique, as described in U.S. Pat. No. 7,480,058, which is hereby incorporated by reference in its entirety. The combination of a first scan in 2D and the axial profile results in a 3D grid of data values, where each value and position location can form a ‘voxel’. Therefore, a voxel is associated with the coordinate point in the sample that corresponds to a given X-Y position of the 2D scan of the beam, and a ‘depth’ coordinate Z, associated with the axial backscattering light intensity profile.
The sampling resolution of the imaging technique is given by the dimensions of a single voxel in the 3D data set collected as described above. This can normally be divided in a 2D sampling resolution corresponding to the first scan of a laser beam in the XY plane, and the axial sampling resolution corresponding to the Z-axis, normally associated with the number of voxels acquired per OCT axial profile.
The summing of voxels 102 can take place between two retinal tissue layers, and voxels 102 outside of the two layers can be ignored. All the voxels 102 considered in the summation have the same XY position. The sum of these voxel values 102 are then divided by the thickness between the two retinal layers.
a shows an embodiment of the present invention wherein the median value of the voxels 100 along the Z-axis is used to form each XY pixel 312 in the resulting 2D image 310. To obtain an image according to some embodiments as depicted in
In detail, the technique depicted in
The voxels of interest are determined in step 320. In step 330, a median value of the pixels of interest is determined that corresponds with each of pixels 312. As shown in
In step 340, 2D image 310 is composed by assigning the median value at each X-Y position to the corresponding pixel 312. In step 350, the image can be displayed either in gray scale or in color utilizing a look-up table to provide brightness and contrast adjustments.
a shows a method for image enhancement according to some embodiments of the present invention. In
In some embodiments of the present invention, further processing includes the summation of voxels 402 between two different layers of retinal tissue according to the XY coordinate of the voxels 400, as illustrated in
The 3D median filter technique in some embodiments as depicted in
Referring to some embodiments of the invention depicted in
As shown in
In step 440, the pixel value 412 is utilized to compose 2D image 410. 2D image 410 can then be displayed in step 450. 2D image 410 can be displayed with a gray scale or color lookup table to provide brightness and contrast adjustment.
a shows a ‘zig zag’ summation method to produce a 2D image 510 from the 3D data set 100, according to some embodiments of the present invention. Instead of summing voxels directly along a line in the Z axis from the selected tissue layer, alternating voxels from adjacent Z columns 502 and 504 (for example, adjacent axial scans) are selected and then further processed, for example as summation, median, average, or other processing function. The path of summation (or other processing) can follow a zig zag pattern 506. The summation uses values from two or more axial scans 502 and 504 as opposed to other methods (as in
As noted in the previous embodiments, an image 510 with the same X and Y dimensions as the original 3D volume 100 is formed. As shown in
In step 530, pixel values 512 are determined. As shown in
Furthermore, in some embodiments, other combinations may be chosen for process voxels 102 having an even/odd Z coordinate. As indicated above, processing may include summing, averaging, or taking a median. In some embodiments of the present invention, the voxel value may be squared before summation. In other embodiments, values having an even Z-coordinate may be chosen from the X, Y+1 column, and pixels having an odd Z-coordinate may be chosen from the X, Y column. Some embodiments of the present invention may use a ‘helicoidal’ combination of voxels along the axial direction, Z, which resembles a ‘staircase’ sum, or a ‘screw’ sum, whereby voxels having a Z-coordinate with a value of 0 modulo 4 (that is, the Z coordinate of the voxel is divisible by 4 with an integer residue of 0) are chosen from column X,Y; voxels having a Z mod 4 coordinate equal to 1 are chosen from column X+1, Y; voxels having a Z mod 4 coordinate equal to 2 are chosen from column X+1, Y+1; and voxels having a Z mod 4 coordinate equal to 3 are chosen from column X, Y+1. The above summation method uses values in different X and Y column in either a ‘zig zag’ or ‘helicoidal’ manner with a step size of one voxel. A different step size can be selected to cover the same depth with less computation time or selected based on the desired effect of different voxel sampling.
In some embodiments, using the ‘zig zag’ approach as described above is to reduce scatter noise inherent in OCT data by combining voxels from adjacent axial scans. As a result, anatomy and pathology of the selected tissue layers can be more clearly shown and the image details obtained are improved compared to techniques that average out over a larger set of voxel neighbors.
a illustrates the different results obtained when processing a 3D data set to produce a 2D image according to some embodiments of the present invention, as described above. The 3D data set corresponds to the retinal pigmented epithelium (RPE) from a single subject and was collected using an OCT raster scan of the region of interest, where light source 1201 is a white light illumination source. Axial information can be obtained by spectral decomposition of the OCT spectral interference signal from OCT system 1200, as described above. The images in
Some of the conspicuous features in
a illustrates an En Face image 610, a “zig-zag” image 620, and a median image 630. En Face image 610 is produced as discussed with
b shows 2D image renderings analogous to those shown in
c shows 2D image renderings analogous to that of
Additional 3D imaging processing techniques can help the visualization and understanding of the 3D data set to produce more accurate and meaningful information. Examples of such techniques are perspective image processing and volume rendering methods that can reveal information available in a 3D data set. These methods are described in details below.
a shows a perspective projection from a specific view point 720, and the summation method in a volume rendering technique according to some embodiments of the present invention. Volume rendering can of course be done from any other view point. However, even from the specific view point 720 shown in
b illustrates the difference in a 2D image obtained between a summation along the Z-axis, as in
It should also be noted that this 3D image 800 can include a depth buffer (in the direction of the Z-axis). If other geometric constructs are added to the 3D scene, this buffer is used during rendering to determine how those constructs intersect with the data set. For example, a vertical plane with original OCT data could be placed in the 3D scene to show where the layer is, vertically, by overlapping and correlating the 3D data set and the vertical plane including the OCT data. By comparison, images obtained through summation along the Z-axis, as in
In some embodiments of the present invention, the 3D rendering shown in
The details of how to determine the geometric location of tissue surface is complex. In general, the detection of layer surfaces can be performed by 1D intensity or texture profile analysis. Using the 1D intensity profile as an example, a 1D intensity profile at a location (X,Y) can be constructed by mapping all the intensity values along the Z-axis (axial scan or A-scan direction). To make the 1D intensity profile less sensitive to scattering noise, the 1D intensity profile at the location (X,Y) may be given by the average or median value around (X,Y) for each depth location Z. A 1D transition (edge) detection method can then be applied to find the rising and falling edges that define the layer of interest. Depending on the layer of interest, there are a few parameters involved in the detection such as search range, operating window size, and minimum edge strength threshold. Finally, a 2D surface smoothing method may be applied to smooth the layer surfaces at all (X,Y)s, to conform the anatomical information of the layer of interest.
The resulting 2D surface describes the geometric location of a tissue layer in 3D (like a curved sheet of paper). This can be used to create a 3D alpha mask for use during volume rendering. The mask will hide tissue above and below the tissues of interest. With proper transparency settings, this results in a 3D En Face image 800. To compute a mask, a top and a bottom surface are used. These two surface can come from:
2 different tissue surfaces (these surfaces could also be translated by Z offsets)
the same surface description translated by 2 different Z offsets
2 intermediate tissue surfaces computed by interpolating between known surfaces
The mask is created by allocating a volume the same size as the original 3D data set and filling it with alpha (transparency) values. Voxels below the bottom surface and above the top surface are given fully transparent (e.g. zero) values. Voxels between the two surfaces are given opaque values (e.g. 255 for an 8 bit mask). Image quality can be improved by calculating intermediate transparency values along both surfaces to produce a smoother anti-aliased mask. The original 3D data set and the mask are then combined to form a single 2 component volume. The original 3D data set is the luminance channel, while the mask is the alpha channel. During rendering, the mask values are combined with a global transparency function, so that the overall transparency of the final 3D En Face image 800 can be adjusted.
Finally, the 3D En Face image can be rendered. Rendering of such image is performed by standard volume rendering software, for example, VTK from www.vtk.org can be used.
There are a number of benefits to this method. The method:
When a 3D data set is constructed, the X, Y, and Z voxel dimensions determine the maximum degree of accuracy expected for an alignment of two consecutive 3D data sets from OCT scans. Unfortunately, unavoidable eye movement of a subject during image acquisition will generate misalignment between consecutive OCT scans. Furthermore, this effect will be increased due to the relative displacement of consecutive OCT data sets generally not in exact multiples of the voxel dimension in X, Y, and Z direction. Therefore, alignment with sub-voxel accuracy is desirable, especially when motion defects appear in the 2D ‘En Face’ image. Sub-voxel alignment of a 3D data set can be implemented in the following way.
First, the data size along each dimension is up-scaled by a factor of ‘k’. Assuming that an original 3D data set contains a total of N voxels, this has the effect of creating a number of (k3−1)×N new voxels with the same X, Y, and Z voxel dimensions as the original voxels, spaced in between the original voxels, where the new voxels are assigned specific intensity values. These intensity values can be determined by an interpolation technique, such as linear or cubic-spline interpolation. The 3D up-scaled data set is then aligned by an iterative process, until an optimal 3D alignment offset vector (i.e., ΔX, ΔY, and ΔZ) is achieved such that the resulting 2D ‘En Face’ image conforms well to anatomical physiologies. For example, blood vessels should be continuous in all dimensions. In some embodiments of the present invention, a multi-scale coarse-to-fine technique is used to accelerate the iterative alignment process. Finally, sub-voxel alignment in the original dimension can be achieved by dividing the optimal alignment offset vector by k. In some interpolation technique, the ‘En Face’ image f(X,Y) at the location (X,Y) may not only be contributed from the voxels of the same (X,Y) or a small local neighborhood, but also from those voxels of distant location; depending on the severity of the misalignment produced by the unavoidable eye movement of the subject's eye during a scan.
By introducing a sub-voxel positioning resolution as described above, defects in an image due to eye motion can be reduced below the ordinary voxel resolution of the system. This more robust alignment technique helps generate a 3D baseline data set of the subject's eye for further data registration and eye image correlation in subsequent imaging sessions, either within a single visit or across future visits for eye exams. Furthermore, having a more accurate 3D baseline data set improves subsequent image processing techniques, such as segmentation and enhancement, to be even more reliable; such may lead to better diagnosis.
Due to computation efficiency and to facilitate alignment by region-of-interest, another 3D alignment technique is disclosed; the results of not performing this alternative alignment technique and those performing such techniques are shown in
The search ranges of the alignment offset vectors can be pre-determined empirically based on anatomical physiologies or clinical needs. For each offset vector (ΔX, ΔY, and ΔZ), a 2D simulated scanning laser ophthalmoscope (SSLO) image, such as 1000 and 1100, is generated similar to a fundus image or an ‘En Face’ image, composed by incorporating all the tissue signals of the scanned OCT images and considering all voxels of interest according to some embodiments as described in
The determination of whether an optimal alignment offset vector set has been achieved is by evaluating some measure of alignment quality, such as landmark positions, feature curve accuracy, and feature curve smoothness, against a set of pre-determined threshold values. These threshold values can be pre-determined empirically based on anatomical physiologies and clinical needs. An optimal alignment is achieved by finding the offset vector (ΔXi, ΔYi, and ΔZi) for the ith 2D OCT image such that a total score calculated by comparing some measures of alignment quality from the sequence of all OCT images yields the highest score among all possible offset vectors within the respective search ranges. Image processing techniques such as autocorrelation and Fourier transform can be utilized to calculate the score. High-performance computing techniques, such as the hardware-related Streaming SIMD (Single Instruction Multiple Data) Extension (SSE), can also be used to increase the performance speed.
The previous descriptions and figures are exemplary only and do not intend to be limiting. One of ordinary skills in the art may identify other embodiments of the present invention that are obvious variations of the embodiments described above. Those alternative embodiments are also intended to be within the scope of this disclosure. As such, the invention is limited only according to the following claims.
This application claims priority to provisional application 61/256,277, filed on Oct. 29, 2009, which is herein incorporated by reference in its entirety.
Number | Date | Country | |
---|---|---|---|
61256277 | Oct 2009 | US |