The present disclosure relates to a method and apparatus for aligning a two-dimensional image with a predefined axis, such as, but is not limited to, aligning a two-dimensional optical coherence tomography (OCT) image of an eye of a subject with a predefined axis.
Glaucoma is a group of heterogeneous optic neuropathies characterized by progressive loss of axons in the optic nerve. Data from WHO show that glaucoma accounts for 5.1 million of blindness in the world and is the second leading cause of blindness worldwide (behind cataract) as well as the foremost cause of irreversible blindness [1]. As the number of elderly in the world rapidly increases, glaucoma morbidity will rise, causing increased health care costs and economic burden. Since cataract is easily treated, glaucoma will become the most common cause of blindness in the world later this century with almost 70 million cases of glaucoma worldwide. More importantly, between 50-90% of people with glaucoma worldwide are unaware that they have the disease [2, 3].
Glaucoma is classified according to the configuration of the angle (the part of the eye between the cornea and iris mainly responsible for drainage of aqueous humor, as shown in
Angle closure results from obstruction of the trabecular meshwork by the iris, impeding the drainage of aqueous humour in the angle of the eye, causing an increase in intraocular pressure (IOP).
Previously reported anatomical risk factors for angle closure include a shallow central anterior chamber depth (ACD), a thick and anterior lens position and short axial length (AL) [14-17]. Amongst these, a shallow ACD is regarded as a sine qua non (cardinal risk factor) for the disease. However, population based data suggests that only a small proportion of subjects with shallow ACD ultimately develop PACG [18, 19]. Therefore, it is likely that other ocular factors are related to PACG development. The crystalline lens is thought to play a crucial role in the pathogenesis of angle closure disease due to either an increase in its thickness with age or a more anterior position which causes a decrease in ACD [15, 20-23]. This results in is angle crowding and a greater predisposition to pupillary block due to iridolenticular apposition in eyes with small anterior segments. Previous studies by the Singapore Eye Research Institute (SERI) have also shown that demographic factors identify high-risk groups. For example, women and Chinese have greater risk for angle closure [24, 25]. Recent studies by SERI investigated determinants of angle closure and found that while the most important independent determinants of angle closure were female gender, Chinese race, shorter AL and shallower ACD.
The inventors' previous work proposed several automated anterior chamber angle (ACA) assessment/classification methods. In particular, the methods involve feeding suggested clinical features, such as AOD500 [27] and Schwalbe's Line Bounded Area (SLBA) [28], or image features such as HOG [29, 30], BIF [30] and HEP [30] into pre-learned classifiers (e.g. linear SVM) for angle classification.
For example, an anterior chamber angle measurement method was described in [27]. In this method, the input ultrasound biomicroscopy (UBM) image is a converted binary image with a fix threshold value 128, and then a naïve method was used to detect the edge points of the anterior chamber angle by counting continuous 0 or 1 in each vertical line. Lastly, a line fitting is performed to obtain the two edges of the anterior chamber angle.
Another similar method was proposed in reference [28], but it works on high definition (HD) anterior segment optical coherence tomography (AS-OCT) images [28]. Firstly, the input gray-scale image was converted to a binary image with an adapt threshold. The cornea and the iris regions were then segmented by a connected component labeling segmentation method. This is followed by the detection of the Schwalbe's line by using linear regression and the SLBA is obtained.
In a further method, image features such as HOG [29, 30], BIF [30] and HEP [30] are fed into pre-learned linear SVM classifiers to perform angle classification. This image feature based machine learning approach has been shown to outperform traditional clinical feature based approaches. However, having tested on larger dataset, ACA localization based on complicated heuristics methods is not robust due to various kinds of imaging artifacts. The performance of ACA classification is also affected.
Therefore, it is desirable to have an improved method and apparatus which addresses one or more of the problems of the existing methods.
In general terms, the present invention proposes a method for aligning a two-dimensional image of an eye with a pre-defined axis by identifying values of a rotation angle and a de-noised image which minimize a cost function. The method may be used for aligning a plurality of images of an image sequence, such as multiple frames acquired during a scan video. This may allow misalignments caused by, for example, patient movements during the image acquisition to be compensated for. This allows for a “stabilized” 3-D rendered image (or 2D image) to be obtained for better visualization, localization and/or identification of anatomic structures, for example, an anterior chamber angle of an eye of a subject.
According to a first expression, there is provided a method of aligning a first two-dimensional image of an eye of a subject with a predefined axis. The method comprises identifying values of a rotation angle θ and of a de-noised image I0. These values minimise a cost function which comprises (i) a complexity measure of the de-noised image I0, and (ii) a measure of the magnitude of a noise image. The noise image is obtained by rotating the first image by the rotation angle and subtracting the denoised image. The first image is aligned with the predefined axis by rotating it by the identified rotation angle.
The method may be performed on a two-dimensional OCT image of an eye of a subject. The complexity measure of the de-noised image is a norm of the de-noised image. The measure of the magnitude of the noise image is the sum of the absolute values of the elements of the noise image.
According to a second expression, there is provided a method of aligning a plurality of images which are each a two-dimensional OCT image of an eye of a subject. The method comprises aligning the plurality of images with a predefined axis by a method described above. The method further includes aligning the images in a first direction transverse to the predefined axis. This is performed by determining a centre of symmetry axis in each image parallel to the predefined axis, and aligning the centres of symmetry axes. The method further includes aligning the images in a second direction parallel to the predefined axis by locating a landmark in each image, and aligning the positions of the landmarks in the second direction. The landmark may be, for example, the corneal ceiling of the eye image.
According to a third expression, there is provided a method of forming an average of a set of two-dimensional OCT images of an eye of a subject. The method comprises aligning the images by a method described above and forming an average of the aligned images.
Specifically, the above expression or a combination of them may achieve effective and efficient alignment for cross-sectional AS-OCT images. The various embodiments may be integrated into an OCT instrument. The embodiments may improve overall performance of ACA classification at a system level via enhanced mage stabilization, speckle reduction, accurate ACA localization and/or alignment.
There is also proposed a method for classifying a type of angle closure of the anterior chamber angle of an eye of a subject, for example, using the stabilized eye image obtained by any of the method described above. In preferred embodiments, the method classifies the subject as exhibiting primary angle closure glaucoma (PACG) or primary open angle glaucoma (POAG) using a similarity-weighted linear reconstruction (SWLR) method. Experimentally, the present inventors have shown that the SWLR method attains a higher ACA classification accuracy and overall efficiency than the existing methods.
According to another expression, there is provided a method of determining whether an eye of a subject exhibits angle closure. The method employs a database of reference images. Each reference image is a two-dimensional image of a corresponding eye and includes an intersection of an iris of the corresponding eye and a cornea of the eye. The reference images comprises first reference images for which the corresponding eyes are known to exhibit angle closure, and second reference images for which the corresponding eyes are known not to exhibit angle closure. The method includes receiving a two-dimensional image of a portion of the eye of the subject. The image is transverse to the lens of the eye and includes an intersection of an iris of the eye and a cornea of the eye. The method further includes determining respective weights for each of the reference images. The weights are chosen to minimise a cost function comprising the difference between the received image and a sum of the reference images weighted by the respective weights. The method further comprises identifying at least one of the first reference images having least difference with the received image, and at least one of the second references images having least difference with the received image. It is then determined whether the eye exhibits angle closure by determining whether the received image is closer to the identified first reference images weighted by the respective weight, or to the identified second reference images weighted by the respective weight.
The received image may be an image generated by any method described above. Since the received image has been “stabilized” by rotational adjustments (and optionally with vertical and horizontal adjustments too). The received images in combination with the above proposed classification method has been shown experimentally to produce robust results in ACA classification. In other words, the embodiment may provide a fully automated PACG detection system with high accuracy and robustness.
Preferably, an averaged image is obtained from the aligned images and a binarized image of the averaged image is obtained such that a location of an ACA vertex is identified from the binarized image. The received image is an image generated using the location of the ACA vertex.
According to a further expression, there is provided a computer system comprising a processor and a memory device configured to store program instructions operative, when performed by the processor, to cause the processor to perform any one of the methods described above.
The present invention may also be expressed as an apparatus configured to perform any one of the above methods. The apparatus is typically a computer system having appropriate hardware modules including a rotation adjustment module, an alignment adjustment module, a noise reduction module, and/or an angle-closure classification module configured to perform various relevant operations as described above with reference to the respective methods. It will be apparent to a skilled person in the art that the various hardware modules may be implemented by one or more computer processors that are in communication with memory devices storing program instructions. The memory devices may be, but is not limited to, random-access memory (RAM) and read-only memory (ROM). The one or more processors are configured to execute the program instructions so as to implement the functionality of the hardware modules. It is understood that by programming and/or loading executable instructions onto the computer system, at least one of the processors, the RAM, and the ROM are changed, transforming the computer system in part into a specific purpose machine or apparatus having the novel functionality taught by the present disclosure. It is fundamental to the electrical engineering and software engineering arts that functionality that can be implemented by loading executable software into a computer can be converted to a hardware implementation by well-known design rules.
In a preferred embodiment, the apparatus has the rotation adjustment module, the alignment adjustment module and the noise reduction module configured to perform some of the methods described above. This may be integrated into the AS-OCT machine to provide higher quality imaging output with speckle noise reduction and better 3D rendering of the imaging system.
In a preferred embodiment, the apparatus has the angle-closure classification module configured to perform some of the methods described above. In other words, a PACG detection system may be provided which is robust to noise, rotation and artifacts thereby achieving a high accuracy for anterior chamber angle classification. The PACG detection system can be used as an assistant diagnostic tool to be used in conjunction with UBM and/or OCT scanning machines.
According to yet a further expression, there is proposed a computer program product, such as a tangible recording medium. The computer program product stores program instructions operative, when performed by a processor, to cause the processor to perform any one of the methods described above.
It will be apparent to a skilled person in the art that various expressions of the present disclosure may be combined or performed independently of one and another.
Embodiments of the present invention will now be described for the sake of example only with reference to the following drawings, in which:
The embodiment will be illustrated with reference to two-dimensional OCT images of an eye, and specifically AS-OCT images, but it will be understood that the method is not limited to be performed with OCT images.
Referring to
At step 100, preprocessing is performed to remove certain artifacts of the images. An OCT image often contains high-intensity vertical lines or segments around the central meridian, such as the one shown in the left image of
In the embodiment illustrated in
The adjustment of a misaligned OCT image I to a recovered image I0 (also referred to as a de-noised image) with the correct position and orientation can be expressed as an affine transformation τ composed of a rotation θ, horizontal shift Δx and vertical shift Δy:
where (x′, y′) represents the coordinate of the de-noised image I0, and (x, y) represents the coordinate of the misaligned OCT image I (i.e. the actual coordinate of the image I). To recover τ for each image, each of the three parameters (θ, Δx, Δy) are determined sequentially.
Because of the symmetric structure of the anterior chamber, an image of it captured at the ideal, perfectly horizontal position should have a lower rank than any rotated version of the image.
Let us denote an OCT image as Iεm×n, and a rotation of I by angle −θ as Iθ. Also, let Iθ=I0+E, where I0εm×n is a low-rank approximation of Iθ, and Eεm×n models sparse noise and other artifacts.
To recover the rotation angle θ, the rotation adjustment is configured to solve for I0, E and θ as
where α>0 is a regularization parameter. In other words, the optimal values of the rotation angle θ and of a de-noised image I0, which a cost function is to be solved.
Since this optimization problem is NP-hard in general, its complex surrogate may be optimized instead, which has been shown to give a good approximation under fairly broad conditions [31]. This involves relaxing the rank(•) function to the matrix nuclear norm |•|* (i.e., sum of all singular values), and the l0-norm to the l1-norm (i.e., the sum of absolute values of all entries), which yields the following objective function:
This problem can be efficiently solved with the TILT toolbox [32], which includes a pyramid acceleration strategy. An example of the resulting rotation adjustment is shown in
Once the rotationally-corrected image Iθ is recovered, its horizontal shift can be adjusted utilizing a symmetric property of the image. In particular, a translation adjustment module is configured to determine a vertical symmetry axis of each image and align the images by aligning their respective vertical symmetry axis. The vertical axis of symmetry can be found, for example, through the following search procedure:
where wε
is a predefined search range, and hε
is the chosen observation window width:
It will be apparent to a skilled person that other algorithms may be used to determine an axis of symmetry of the image. At the end of step 220, horizontal shifts may be corrected by aligning the axis of symmetry among the plurality of OCT images.
(iii) Sub-Step 230: Vertical Adjustment
The alignment adjust module is configured to perform vertical alignment of the images by locating a landmark in each image and aligning the positions of the landmarks in the images. Ophthalmologists often use the scleral spur or trabecular network as a landmark for locating the ACA (
In the exemplary embodiment, the ACA localization is performed using an averaged image of the plurality of the aligned images from step 200. This helps reduce the contribution of noises or artifacts which may affect the accuracy of ACA localization. The present inventors have found that it is advantageous to use an averaged image of the image sequence (e.g. a full circular OCT scan), since the averaged image should be relatively unaffected by artifacts (e.g. speckle noises) which are presumably randomly present in the individual OCT images. As a result, the averaged image achieves speckle noise reduction. A noise reduction module is configured to compute the averaged image using the aligned OCT images and the ACA is localized by identifying an ACA vertex as illustrated in the example below.
The aligned images may be binarized by thresholding and an averaged image is obtained as illustrated by
Note that the step 300 may be adapted for use with a single OCT image (i.e. without multiple frame information). It will be understood that for a single OCT image, the ACA vertex may be similarly identified from a binarized image of the image itself.
At step 400, an angle-closure classification module is configured to determine whether an eye of a subject exhibits angle closure using a classification algorithm. According to a particular embodiment, a similarity-weighted linear reconstruction (SWLR) is used to determine whether the image of the subject more likely exhibits Primary angle closure glaucoma (PACG) or primary open angle glaucoma (POAG).
Step 400 employs a database of reference images. The reference images are divided into two sets, which comprise images which are known to exhibit angle closure and those are known not to exhibit angle closure, respectively. Each of the reference images is a two-dimensional image of a corresponding eye and includes an intersection of an iris of the corresponding eye and a cornea of the eye.
Step 400 comprises sub-steps 410-430 which are performed for each of the plurality of ACA patches (i.e. a portion or sub-portion of the eye image which includes at least an ACA vertex or edge). A cost function is employed which comprises the difference between the ACA patch and a weighted sum of the reference images (such as a reconstruction error described below). At sub-step 410, the weights for each of the reference images are determined such that the cost function is minimized. The reference images which have the least difference with the ACA patch is then selected from each of the two sets of reference images at sub-step 420. At sub-step 430, the classification module is configured to determine whether the ACA patch is closer to those identified reference images selected from the first set or those selected from the second set. Based on the comparison, the classification module determines whether the eye exhibits angle closure. Each of the sub-steps 410-430 are explained in more detail in the example below.
In this example, each ACA region is represented by an image patch of size 20×20. The ACA patches were downsized from 154×154 to 20×20 to reduce the effects of noise and slight misalignments between test and reference images (i.e. the “dictionary”). The image may be a gray scale or binary image.
In forming the reference image sets, each ACA image with a known classification (i.e. determined by an ophthalmologist) is used to produce a plurality of, for example, 9 reference images, in order to further accommodate misaligmnent among test and reference images. This is referred to as “dictionary expansion”. Specifically, in this example, each of the ACA patches of size 154×154 is first resized to 22×22, and from which 9 images of size 20×20 are obtained. The 9 images, which correspond to respective sub-regions of the 22×22 patch, are included as the reference images.
Suppose the dictionary consists of k reference images, denoted by D={d1, d2, . . . , dk}εf×k where each column di is a reference image expressed as a vector. For a given test image (e.g. a test ACA patch) gεf×1, at sub-step 410, the classification module is configured to compute optimal linear reconstruction coefficients wεk×1, |w|=1, that minimize the reconstruction error ∥g−Dw∥2. The objective function may further include a similarity cost term that penalizes the use of references that are less similar to the test image. Let us denote the costs for the reference images in D as the vector c={c1, c2, . . . , ck}Tεk×1, where ci is the cost of using di for reconstruction. The similarity cost term can then be expressed as ∥c⊙w∥2 where ⊙ denotes the Hadamard product. Combining this cost term with the reconstruction error gives the following objective function:
where λ>0 is a regularization parameter. This objective function can be minimized in closed form using the Lagrange multiplier method, without the need for iterations:
where C=diag(c) and denotes the Kronecker product. For simplicity, we define in our implementation the cost ci as the Gaussian distance between the test image g and the i-th reference image di, i.e.,
where σ is a parameter that accounts for imaging noise. It will be appreciated by a skilled person that σ may be determined by cross-validation. For example, the optimal value of a is determined empirically by analyzing is a subset of the data (i.e. including some reference and test images) and validating using the rest of the data; typically, multiple rounds of cross-validation are performed using different subsets.
In this embodiment, the classification module classifies images to two classes D+ and D− respectively representing POCG and POAG groups. For a given test image g, at sub-step 420, the k most similar reference images in each class (D+ and D−) are selected to reconstruct g. The most similar reference images may be identified based on the similarity cost c described above. For example, only references images each associated with a similarity cost c which is below a cut-off value are chosen. Typically, the reconstruction error is also minimized at the same time so as to achieve the minimum objective value. After identifying the optimal reconstruction coefficients w+ and w− for D+ and D−, respectively, the classification module is configured to compute the difference in reconstruction errors for the two classes,
ψ(g)=∥g−D−w−∥2−∥g−D+w+∥2, (6)
as the decision value for classification, i.e., g is classified as angle-closure when ψ(g)≧0, and is otherwise classified as open-angle.
The present inventors have found that the ACA classification performance is mainly influenced by (1) ACA localization accuracy and (2) ACA labeling/assessment accuracy of the reference data set (i.e. the ground truth labeling by the ophthalmologist, which is subject to inter-subject variability). While both tasks are challenging, the first step has a directly impact on the accuracy of the second step. This is due to the relative large rotation of the eyeball during the imaging process and also due to various artifacts, such as speckle noise, corneal reflex, corneal breaks (e.g. those caused by eyelash shadows), iris holes (e.g. from laser surgery), motion blur, and eye lid intrusion, as illustrated by
Different from the previous work [29, 30] which extracts visual features from an ACA patch for classification, the embodiments of the present invention take a reconstruction-based approach, which has been shown to yield higher accuracy.
The performance of embodiments of the invention are assessed and compared with other existing methods. The results have demonstrated that the embodiments of the present invention are able to achieve significant improvements over the existing methods. Table 1 below provides a summary of limitations of existing methods.
Images were collected using AS-OCT machine CASIA SS-1000 from Tomey Ltd. (Japan), which provided a 360 degree circular scan of up to 128 cross-sectional images of the anterior chamber in 0.3-2.4 s. Compared to existing methods which uses a single image (which was also collected using the above machine), the present method utilizes the relationship of a sequence of cross-sectional images which provide more information to achieve high accuracy.
The present method was implemented in Matlab and tested on a four-core 3.2 GHz PC with 24 GB RAM. A total of 3840 OCT images (i.e. with 7680 ACAs) are used for experimentation. The images are obtained from circular scan videos, each with 128 frames, of 30 patients' eyes, 16 of which with primary angle-closure glaucoma (PACG) and the other 14 with primary open-angle glaucoma (POAG). The tests are performed by classifying each individual ACA, with the ground truth label provided by a senior ophthalmologist. The ground truth label represents the correct classification.
In terms of localization accuracy, the method of [30] fails in 16 videos, which is equivalent to 5.72% failure in total. The method of [29] fails in 10 videos, i.e. in total 3.49% ACAs are not localized accurately. For the embodiments of the present invention, all ACAs are localized accurately.
In terms of speed, the method of [30] takes 0.4 s per image (2 angles), while the present method method takes 1.2 s per image. This is mainly due to the alignment steps (i.e. rotation and translational adjustments). This is three times slower but provides more robust results in both ACA localization as well as ACA classification as will be described later.
In the classification comparison, all ACAs are localized with the embodiment of the present invention, for fair comparison. To evaluate classification accuracy, we perform five rounds of tests. In each round, images from six of the patients are used for testing while the others are used for training. To balance between PAOG and PACG, one of the rounds has 4 PACG and 2 POAG for testing, and the others have 3 PACG and 3 POAG each. We assess performance using the same metrics as in [29, 30], namely balanced accuracy (
Since our previous work [29] and [30] show great improvement of image feature based classification over clinical feature based approaches, in this work, we compare with the method of [29], [30] and several reconstruction-based methods, excluding the clinical feature based methods.
Our ACA classification comparison includes five methods, namely SWLR, LLE, k-NN, HOG [29] and HEP [30]. SWLR (similarity-weighted linear reconstruction) refers to the proposed reconstruction method with a similarity cost (i.e. λ≠0 in Eq.(4)). LLE (locally linear embedding) refers to the proposed reconstruction method without the cost constraint (i.e. λ=0 in Eq.(4)). k-NN (k-nearest neighbors) uses only the similarity cost computed as the sum of the k smallest distances to the test image from each class. HOG and HEP refer to linear SVM classification with HOG and HEP features, respectively. For HOG and HEP feature extraction, the optimal parameters reported in [29] and [30] are used. We also use the optimal parameters found for SWLR (k=250, σ=1, λ=100), LLE (k=250), and k-NN (k=50, σ=1). The results, as shown in
Overall, the improvement of the PACG detection using the embodiment is more significant, comparing with using the ACA localization algorithms described in the previous work [29, 30].
The parameter k affects the accuracy and speed of all of the reconstruction-based methods. As shown in
As shown in
(iii) Validation of Dictionary Expansion
In
Whilst the foregoing description has described exemplary embodiments, it will be understood by those skilled in the art that many variations of the embodiment can be made within the scope and spirit of the present invention. For example, the proposed method is not only suitable to process image sequences but also works well on a single image. For another example, the present invention may be used for diagnosis of other eye diseases.
Number | Date | Country | Kind |
---|---|---|---|
10201408823W | Dec 2014 | SG | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/SG2015/050505 | 12/23/2015 | WO | 00 |