1. Field of the Invention
The embodiments described herein relate generally to methods and systems for processing and representing images in ophthalmology for diagnosis and treatment of diseases or any other physiological conditions.
2. Description of Related Art
Optical Coherence Tomography (OCT) is an optical signal and processing technique that captures three-dimensional (3D) data sets with micrometer resolution. This OCT imaging modality has been commonly used for non-invasive imaging of object of interest, such as retina of the human eye, over the past 15 years. A cross sectional retinal image as a result of an OCT scan allows users and clinicians to evaluate various kinds of ocular pathologies in the field of ophthalmology. However, due to limitation of scan speed in imaging device based on time-domain technology (TD-OCT), only a very limited number of cross-sectional images can be obtained for evaluation and examination of the entire retina.
A new generation of OCT technology, Fourier-Domain or Spectral Domain Optical Coherence Tomography (FD/SD-OCT), is significantly improved from TD-OCT, reducing many of the limitations of OCT such as data scan speed and resolution. 3D data set with dense raster scan or repeated cross-sectional scans can now be achieved by FD-OCT with a typical scan rate of approximately 17,000 to 40,000 A-scans per second. Newer generations of FD-OCT technology will likely further increase scan speed to 70,000 to 100,000 A-scans per second.
These technological advances in data collection systems are capable of generating massive amounts of data at an ever increasing rate. As a result of these developments, myriad scan patterns were employed to capture different areas of interest with different directions and orientations. A system and data presentation design is disclosed to more systematically present a 3D data set and to set a standard and consistent expectation of data representation for different clinical needs.
Current trends in ophthalmology make extensive use of 3D imaging and image processing techniques to generate high resolution images. Such images may be utilized for diagnosing diseases such as glaucoma, and other medical conditions affecting the human eye. One of the challenges posed by the current technological advances in imaging techniques is the efficient and meaningful processing and presentation of the massive amounts of data collected at ever increasing imaging rates. Some approaches have converted 3D data sets into manageable two-dimensional (2D) images to be analyzed. An example of such technique used for data reduction from a 3D data set to a 2D image is 2D “en-face” image processing. (See for example, Bajraszewski et al., [Proc. SPIE 5316, 226-232 (2004)], Wojtkowski et al., [Proc. SPIE 5314, 126-131 (2004)], Hitzenberger et al., [Opt Express. October 20; 11(20:2753-61 (2003)]). This technique includes the summing of the intensity signals in the 3D data set along one direction, for instance, along the axial direction of an Optical Coherence Tomography (OCT) scan, between two retinal tissue layers.
One common problem with this type of en-face image processing technique and other volume rendering techniques is the appearance of artifacts created by the involuntary motion of the subject's eye while a data set is being collected. The motion introduces relative displacements of the collected images so that salient physical features appear discontinuous in the resulting 3D data set, rendering the entire data set unreliable.
Another challenge that commonly occurs in the processing of OCT images is the central focus on reliable and reproducible layer segmentation in the B-scan (X-Z) images. Reliable layer segmentation can often be obtained when the retina is normal or with relatively small topographical changes. However, it becomes very unreliable, and in some cases impossible, to segment various layers accurately where there are significant layer profile alternations.
Therefore, there is a need for better processing and presentation of OCT image data.
In accordance with some embodiments of the present invention, a method of computer-aided diagnosis for ophthalmology includes acquiring an OCT dataset; obtaining an RPE fit from the OCT dataset; and generating a set of frontal en-face images based on the RPE fit, wherein the frontal en-face images are suitable for qualitative and quantitative assessment of a retina.
An OCT imaging system according to some embodiments includes an OCT imager that acquires OCT data; a computer coupled to the OCT imager, the computer executing instructions for: obtaining an RPE fit from the OCT dataset; and generating a set of frontal en-face images based on the RPE fit, wherein the frontal en-face images are suitable for qualitative and quantitative assessment of a retina.
These and other embodiments are further discussed below with reference to the following figures.
Optical Coherence Tomography (OCT) technology has been commonly used in the medical industry to obtain information-rich content in three-dimensional (3D) data sets. OCT can be used to provide imaging for catheter probes during surgery. In the dental- industry, OCT has been used to guide dental procedures. In the field of ophthalmology, OCT is capable of generating precise and high resolution 3D data sets that can be used to detect and monitor different eye diseases in the cornea and the retina. A new data presentation scheme and design, tailored to retrieve the most commonly used and expected information from these massive 3D data sets, can further expand the application of OCT technology for different clinical application and further enhance the quality and information-richness of 3D data set obtained by OCT technologies.
In some embodiments of the present invention, noise suppression can be used in the processing of OCT images in step 1120. One common approach is to apply linear or nonlinear spatial filters (e.g. window-averaging and median-filtering) to the images. One problem with this approach is that the parameters used in the spatial filters often need to be adjusted for images containing various levels of details (a balance between feature resolution and scale). It is not a trivial task to automatically adjust these parameters in general. Another simple but powerful approach to noise suppression is by temporal filtering such as frame averaging. This approach can substantially reduce the amount of noise by scanning multiple frames of the same region of interest (ROI) and then summing or averaging the repeated data. In many cases, however, eye movement may prevent application of this approach to obtain reasonable results. To alleviate this problem, image alignment methods based on the correlation among the acquired data can be used. An eye-tracking method and system can also be used to improve frame averaging. Moreover, using newer generations of FD-OCT technology with the increased scan speed of 70,000 to 100,000 A-scans per second may further assist in more accurate time averaging of multiple frames.
Contrast enhancement is another step in the processing of OCT images in some embodiments, and may be performed in step 1130. Contrast enhancement can accentuate features of interest and facilitate diagnosis of data in a desired intensity range. Contrast enhancement can be performed globally and locally. Global contrast enhancement uses transformation function such as a look up table (LUT). One of the simplest examples is contrast stretching; where a transformation function stretches a portion of the image histogram for amplitudes that contain desired information are placed across the whole amplitude range.
In many cases, local contrast enhancement methods are more suitable in the analysis of OCT images and frontal en-face images. The image contents of these images inherently have a wide dynamic range of intensities. A classical solution to this problem is to use a local histogram equalization technique. Another commonly used local technique is spatial enhancement (sharpening) of high-frequency details in the ROI. An overview of similar techniques can be found in an article by D. H. Rao and P. P. Panduranga, “A survey on image enhancement techniques: classical spatial filter, neural network, cellular neural network, and fuzzy filter,” IEEE International Conference on Industrial Technology, pp. 2821-2826, December 2006.
A Frontal En-face view is an observation direction along the axial direction of an OCT imager as in
A more useful and clinically meaningful C-scan, as shown in
As discussed above, in step 1150 En Face Images are generated based on the RPE fit.
To observe vitreo retinal interface abnormality, such as vitreous membrane detachment using image 710, an offset from the inner limiting membrane (ILM) can be applied, where the ILM is the boundary between the retina and the vitreous body. The ILM offset 712 can be set to −20 to 20 μm 714, with a slice thickness of 5 to 50 μm 716. In some embodiments, the ILM offset 714 is set to 0 μm and slice thickness 716 is set to 12 μm. To assess edema in the subject eye using image 720, the RPE reference offset 722 can be set to −300 to −20 μm 724, to −150 μm in some embodiments (i.e., 150 μm above RPE reference), with a slice thickness of 5 to 50 μm 726, to 12 μm in some embodiments, if the retinal full thickness is equal or less than 300 μm; in the alternative, the ILM reference offset can be set to 20 to 300 μm, to 160 μm in some embodiments (i.e., 160 μm below ILM), with a slice thickness of 5 to 50 μm, to 12 μm in some embodiments, if the retinal full thickness is more than 300 μm. To observe drusen, GA, PED and other retinal degeneration using image 730, the RPE reference offset 732 can be set to 10 to 100 μm 734, to 40 μm in some embodiments (i.e., 40 μm below RPE reference) with a slice thickness of 5 to 50 μm 736, to 12 μm in some embodiments. To observe characteristics of the choroid using image 740, the RPE reference offset 742 can be set to 50 to 350 μm 744 with a slice thickness of 5 to 50 μm 746; to 40 μm in some embodiments (i.e., 40 μm below RPE reference) with a slice thickness of 12 μm for thin atrophic choroid or to 100 μm (i.e., 100 μm below RPE reference) with a slice thickness of 30 μm for normal choroid. As discussed above, other segmented layer of interest, such as the ILM and the RPE, can be used for these assessments.
The discussed offsets and slice thicknesses are used to display these four key areas of interests; alternatively, a range of clinically meaningful values obvious to a person of ordinary skills in the art can be used in place. Additionally, the number of image displays can also be customized by the users based on their preferences so that different number of en face images of different number of key areas of interests can be displayed based on the specific workflow and evaluation of the user. The user interface can take in different customized inputs to allow different number of area of interests and to display a range of clinically meaningful values.
This presentation scheme can further highlight the morphological and structural characteristics of retinal edema such as Cystoid Macular Edema (CME) and choroidal vessels located at different depth, such as Sattler and Haller of the choroid.
Images 710-740 in
Another examples of the use of qualitative assessment can be appreciated in
Qualitative assessment can provide useful information for clinical specialists for diagnosis and treatment, quantitative assessments can be further employed to provide objective, reproducible and accurate measurements to assist diagnosis and treatment.
In step 1180, the first step to obtain quantitative measure is to identify the region of interest to be assessed.
After the region of interest is determined, quantitative measures of the characteristics discussed above can be parameterized, namely, intensity measures, texture measures, structure measures, and morphological measures.
The maximum, minimum, average, and standard deviation (homogeneity) of the intensity inside S are calculated and represented by Imax, Imin, Iavg, and Istd, respectively.
The texture measure is defined by the ratio of edge (grainy) pixels inside S to the total number of pixels in S. It can be explicitly represented by
m
tx=(Area[edge pixels inside S])/(Area[S]),
where Area[S] denotes the pixel number of S. The edge pixels can be detected by using the Canny edge operator for an example.
The smoothness, connectedness, and thickness uniformity of the blob border curve as are computed by
m
sm=1.0/(average of the curvature change along ∂S),
m
cn=1.0/(standard deviation of the edge strength along ∂S),
m
tu=1.0/(standard deviation of the edge thickness along ∂S),
respectively. If as is smooth, the curvature change along as becomes small in average, and hence the smoothness measure, msm, would be large. The edge strength of an edge pixel is computed by its edge slope along ∂S. If ∂S is well-connected, the edge strength along ∂S would have small variations, and hence the connectedness measure, mcn, would become large. Similarly, if ∂S has uniform thickness, the standard deviation of the edge thickness would be small, and hence the thickness uniformity measure, mtu, would become large.
Pattern spectrum, a shape-size descriptor, can be used to quantitatively evaluate the shape and size of S. Large impulses in the pattern spectrum at a certain scale indicate the existence of major (protruding or intruding) substructures of S at that scale. The bandwidth of the pattern spectrum, mbw, can then be used to characterize the size of S. An entropy-like shape-size complexity measure based on the pattern spectrum, mir, can be used to characterize the shape and irregularity of S. Mathematically, the pattern spectrum of S relative to a binary structuring element B (disk shape) of size (scale) r, is denoted by PSS(r, B). The measures mbw and mir are defined by
m
bw
=r
max
−r
min, and
m
ir
=−Σp(r)log [p(r)],
respectively. The scale parameters rmax and rmin denote the maximum and minimum size in PSS(r, B), respectively. Here p(r)=PSS(r, B)/Area(S) is the probability function by treating PSS(r, B) from a probabilistic viewpoint. The maximum value of mir is attained whenever the pattern spectrum is flat, indicating that S is very irregular or complex by containing B (disk) patterns of various sizes. Its minimum value (0) is attained whenever the pattern spectrum contains just an impulse at, say, r=k; then S is simply a pattern B (disk) of size k and therefore considered to be the most regular (or the least irregular).
It should be appreciated that alternative and modifications apparent to one of ordinary skills in the art can be applied within the scope of the present inventions. For example, the offset value, slice thickness in the 4-up en-face representation, and the quantitative measures can be varied from the specific embodiments disclosed herein within the scope and spirit of the subject invention.
This application claims priority to U.S. Provisional Application No. 61/437,449, filed on Jan. 28, 2011, which is herein incorporated by reference in its entirety.
Number | Date | Country | |
---|---|---|---|
61437449 | Jan 2011 | US |