Automated anterior segment OCT image analysis for Angle Closure Glaucoma mechanisms classification

c o m p u t e r m e t h o d s a n d p r o g r a m s i n b i o m e d i c i n e 1 3 0 ( 2 0 1 6 ) 65–75 journal homepage: www.intl.elsevierhealth.com/j...

Download PDF

2MB Sizes 0 Downloads 44 Views

Report

PDF Reader
Full Text

c o m p u t e r m e t h o d s a n d p r o g r a m s i n b i o m e d i c i n e 1 3 0 ( 2 0 1 6 ) 65–75

journal homepage: www.intl.elsevierhealth.com/journals/cmpb

Automated anterior segment OCT image analysis for Angle Closure Glaucoma mechanisms classiﬁcation Swamidoss Issac Niwas a,∗ , Weisi Lin a , Xiaolong Bai b,c , Chee Keong Kwoh a , C.-C. Jay Kuo d , Chelvin C. Sng f , Maria Cecilia Aquino e , Paul T.K. Chew f a

School of Computer Engineering, Nanyang Technological University (NTU), 639798 Singapore, Singapore The State Key Laboratory of Fluid Power Transmission and Control, Zhejiang University, Hangzhou 310027, People’s Republic of China c School of Electrical and Electronics Engineering, Nanyang Technological University (NTU), 639798 Singapore, Singapore d Ming Hsieh Department of Electrical Engineering, Signal and Image Processing Institute, University of Southern California, Los Angeles, CA 90089, USA e Eye Surgery Centre, National University Health System (NUHS), 119228 Singapore, Singapore f Department of Ophthalmology, Yong Loo Lin School of Medicine, National University of Singapore (NUS), 119228 Singapore, Singapore b

a r t i c l e

i n f o

a b s t r a c t

Article history:

Background and objectives: Angle closure glaucoma (ACG) is an eye disease prevalent through-

Received 25 November 2015

out the world. ACG is caused by four major mechanisms: exaggerated lens vault, pupil block,

Received in revised form

thick peripheral iris roll, and plateau iris. Identifying the speciﬁc mechanism in a given

10 February 2016

patient is important because each mechanism requires a speciﬁc medication and treatment

Accepted 17 March 2016

regimen. Traditional methods of classifying these four mechanisms are based on clinically important parameters measured from anterior segment optical coherence tomography (AS-

Keywords:

OCT) images, which rely on accurate segmentation of the AS-OCT image and identiﬁcation

Angle closure glaucoma

of the scleral spur in the segmented AS-OCT images by clinicians.

Compound image transforms

Methods: In this work, a fully automated method of classifying different ACG mechanisms

Feature selection

based on AS-OCT images is proposed. Since the manual diagnosis mainly based on the

Segmentation-free method

morphology of each mechanism, in this study, a complete set of morphological features is

Machine learning classiﬁer

extracted directly from raw AS-OCT images using compound image transforms, from which a small set of informative features with minimum redundancy are selected and fed into a Naïve Bayes Classiﬁer (NBC). Results: We achieved an overall accuracy of 89.2% and 85.12% with a leave-one-out crossvalidation and 10-fold cross-validation method, respectively. This study proposes a fully automated way for the classiﬁcation of different ACG mechanisms, which is without intervention of doctors and less subjective when compared to the existing methods.

∗

Corresponding author. Tel.: +65 98677034. E-mail addresses: [email protected] (S.I. Niwas), [email protected] (W. Lin), [email protected] (X. Bai), [email protected] (C.K. Kwoh), [email protected] (C.-C. Jay Kuo), [email protected] (C.C. Sng), [email protected] (M.C. Aquino), [email protected] (P.T.K. Chew). http://dx.doi.org/10.1016/j.cmpb.2016.03.018 0169-2607/© 2016 Elsevier Ireland Ltd. All rights reserved.

66

c o m p u t e r m e t h o d s a n d p r o g r a m s i n b i o m e d i c i n e 1 3 0 ( 2 0 1 6 ) 65–75

Conclusions: We directly extracted the compound image transformed features from the raw AS-OCT images without any segmentation and parameter measurement. Our method provides a completely automated and efﬁcient way for the classiﬁcation of different ACG mechanisms. © 2016 Elsevier Ireland Ltd. All rights reserved.

1.

Introduction

Glaucoma is a kind of optic neuropathy disorder which makes a gradual loss of eye vision due to the damages of retinal cells and retinal nerve ﬁber layer in the eye [1]. It is mainly associated with increased intraocular pressure (IOP) of ﬂuid inside the eye [2]. If it is left untreated, the damaged optic nerves will not be recovered, leading to eventual blindness; however, reducing IOP may slow disease progression. It is the second major cause of visual impairment and blindness worldwide. Globally, the number of glaucoma patients is estimated as 60.5 millions in 2010, which may increase to almost 80 millions by 2020 [3]. The timing of interference is a key element in glaucoma treatment and it is usually asymptomatic in its early stage; so early diagnosis is required to slow down the disease progression toward complete vision loss. There are mainly three types of glaucoma, namely, angle closure glaucoma (ACG), open angle glaucoma and developmental glaucoma. ACG is a speciﬁc type of glaucoma in which a sudden rise in IOP is experienced, usually as a result of poor drainage due to the eye, producing more aqueous humor than it can remove, hence causing a build-up of ﬂuids [4]. ACG is more prevalent than the other two types. An eye that is susceptible to ACG usually has noticeable features which can be identiﬁed upon visual examination [5]. Anterior chamber angle (ACA) assessment is mainly used for the detection of ACG, and it can be visualized and quantiﬁed by anterior segment optical coherence tomography (AS-OCT) imaging technique [6,7]. It has been observed that ACG could be the result of one or more mechanisms in the anterior segment of the eye, such as exaggerated lens vault (L), pupil block (PB), thick peripheral iris roll (PIR), and plateau iris (PL) [6]. Laser peripheral iridotomy (LPI) is performed at an early and appropriate time for the eyes with anatomically narrow angles. Ophthalmologists and clinicians have found that the LPI is not always suitable for treating ACG patients with different mechanisms, and many patients continue to have appositional angle closure after treatment using LPI [7]. The optimal treatments for ACG patients with different mechanisms should be different. Therefore, it is important to classify these four mechanisms of ACG effectively, in order to not only provide different treatments for patients with different mechanisms, but also help the doctors tailor the best treatment for each mechanism accordingly. It has been demonstrated that these four mechanisms of ACG have different patterns of angle conﬁgurations with different anterior chamber (AC) parameters [7]. AS-OCT provides excellent repeatability and reproducibility for imaging the anterior segment of eyes. AS-OCT was used to measure

the AC parameters in the above mentioned four different mechanisms of ACG [6]. Wirawan et al. used some key parameters which were provided by customized software measured from the segmented AS-OCT images such as scleral spurto-scleral spur distance, anterior chamber depth, anterior chamber angle and area [8]. Some reliable selected features were fed into Adaboost classiﬁer and it was found to be clinically important to distinguish the four mechanisms of ACG [8]. In another study [9], the selected features are cross examined with four types of ACG mechanisms. In [10], various supervised and unsupervised features selection methods are used to ﬁnd the reliable features and achieve the classiﬁcation of different ACG mechanisms. However, the measurement of AC parameters relies on the accurate identiﬁcation of the scleral spur in the segmented AS-OCT images by the clinicians, which is subjective and may not be reliable, introducing additional noise to the features. In [11], a new ensemble learning method based on error-correcting output code (ECOC) is proposed with application to classiﬁcation of four ACG mechanisms and it is shown as an effective approach for multiclass classiﬁcation. The existing methods for the classiﬁcation of different ACG mechanisms are still not fully automated, requiring the help of the clinicians in feature extraction. In this paper, our study is motivated by providing a fully automated expert system for the classiﬁcation of different ACG mechanisms. We attempt to directly extract the features from the raw AS-OCT images without segmentation and parameter measurement. It is less subjective without the intervention of doctors. Speciﬁcally, in the proposed method, compound transform [12] is applied to the raw AS-OCT images to extract around three thousand different morphological features, from which a small set of discriminative features is selected for classiﬁcation. In the rest of this paper, Section 2 reviews related literature on general glaucoma diagnosis methods. The proposed method is presented in Section 3, followed by experimental results in Section 4. The conclusion is made with future works highlighted in the last section.

2.

Related methods

During the past one decade, several studies have investigated the usefulness of the computer-based expert support systems for the early detection of glaucoma using different imaging modalities as listed in the literature [13–18]. From the fundus images, using image processing techniques, the optic disk and blood vessels were extracted which could provide useful information to diagnose glaucoma. Higher order spectra features and textural features of fundus images combined with a random-forest classiﬁer resulted with an

67

c o m p u t e r m e t h o d s a n d p r o g r a m s i n b i o m e d i c i n e 1 3 0 ( 2 0 1 6 ) 65–75

accuracy of more than 91% for correctly identifying the glaucoma images [13]. An algorithm based on neural network classiﬁer using the internal morphological features from the fundus images [14] used for glaucoma diagnosis. Another study [15] explored the ability of the combination of the imaging methods for distinguishing the normal eyes from the glaucoma defect using optic nerve head stereo photographs, scanning laser polarimetry, confocal scanning laser ophthalmoscopy and optical coherence tomography (OCT) images. The optical nerve head (ONH) topography images were explored by orthogonal decomposition method for the detection of glaucoma in [16]. The super pixel information from three-dimensional OCT images was used in the boosting algorithm based machine classiﬁer to improve early detection of glaucoma [17]. An expert support system for glaucoma using three-dimensional ONH features, stereometric ONH parameters, disc and cup margin features were measured and used in [18]. Aforementioned works have attempted to distinguish the glaucoma and non-glaucoma cases using various image sources, rather than classifying any subtypes of glaucoma. AS-OCT images are most convenient in diagnosis for ACG, since they provide useful information for clinicians to observe a cross sectional view of the anterior chamber and the inside angle structures of the eye. Classifying various mechanisms of ACG using reliable features from the measured parameters of AS-OCT images was studied through machine learning classiﬁers with improved accuracy in [8–11]. There were a number of tools available for glaucoma with different diagnostic approaches. However, the sensitivity and speciﬁcity of any single diagnostic test are not adequate to be considered as the golden standard. Also, to the best of our knowledge, no works have been done previously for ACG mechanism classiﬁcation using textural features without prior segmentation. Hence, in this research work, we make an attempt to classify the different ACG mechanisms just based on AS-OCT images in a fully automated process. Features based on the compound hierarchy of algorithms representing morphology (CHARM) algorithm are extracted, followed by a multivariate classiﬁer for classiﬁcation.

3.

Proposed method

The proposed method for the classiﬁcation of different ACG mechanisms is shown in Fig. 1, including the acquisition of raw AS-OCT images followed by the extraction of a large number of morphological features using compound image transforms and image statistics [19]. A small set of discriminative and inter-dependent features selected by using maximum relevance-minimum redundancy (mRMR) algorithm [20] is fed into a Naïve Bayes Classiﬁer for classiﬁcation.

Table 1 – The number of samples with different ACG mechanisms in the experimental dataset. Class number 1 2 3 4

Mechanism

Number of samples

Exaggerated lens vault (LV) Pupil block (PB) Thick peripheral iris roll (PIR) Plateau iris conﬁguration (PL)

26 28 10 10

Total

3.1.

74

AS-OCT images

The data used in this work consist of AS-OCT images of 74 ACG patients provided by the Department of Ophthalmology in the National University Hospital, Singapore (NUHS). Ethics approval was obtained from the review board of NUHS and the written consent was obtained from all subjects prior to AS-OCT imaging. The numbers of patients with different mechanisms are listed in Table 1. A skilled technician obtained the clinical images through a horizontal scan, including sections of the nasal and temporal quadrants, of all subjects using AS-OCT (Visante, software version 2.01.88; Carl Zeiss Meditec, Dublin, CA) in a dark room (0 lux), with the images centered on the pupil. 1300 nm infrared light was used to acquire cross-sectional view of the anterior segment image with high resolution. The standard AS single-scan protocol, producing 256 scans in 0.125 s, was used to obtain the scans. The image saturation and noise were adjusted, and the polarization for each scan was optimized by the examiner to obtain the best quality images. Because several AS-OCT images were acquired for each patient, the image with the smallest amount of artifacts was selected for analysis. AS-OCT images with artifacts resulting from movement and the eyelids, and poor quality images secondary to tear ﬁlm abnormalities and corneal scars were excluded. Each eye image was captured several times with undilated state of the pupil and only images with clearly visible scleral spurs were analyzed qualitatively by three glaucoma specialists (P.T.K. Chew, M.C. Aquino and C.C. Sng). They were categorized into four groups of images based on ACG mechanism. Each mechanism has its own distinctive morphological characteristics [5] as shown in Fig. 2 which is illustrated in the dotted lines. The LV mechanism (Fig. 2a) can be spotted by the lens pushing the iris upward, which caused to reduce the angle between the cornea and the iris. The PB mechanism (Fig. 2b) can be seen by a convex forward iris proﬁle, causing a shallow peripheral anterior chamber. The PIR mechanism (Fig. 2c) can be detected by a thick iris, which narrows the angle between the iris and cornea due to the circumferential folds along the periphery of the iris. The PL mechanism (Fig. 2d) can be identiﬁed by a sharp rise of the iris at the periphery, closer to the angle wall,

Fig. 1 – Block diagram of the proposed method for ACG mechanism classiﬁcation.

68

c o m p u t e r m e t h o d s a n d p r o g r a m s i n b i o m e d i c i n e 1 3 0 ( 2 0 1 6 ) 65–75

Fig. 2 – AS-OCT image of an eye with different mechanisms: (a) exaggerated lens vault – LV; (b) pupil block – PB; (c) thick peripheral iris roll – PIR; (d) plateau iris conﬁguration – PL.

before sharply turning away from the angle wall toward the visual axis.

3.2.

Extraction of CHARM features

The feature set used in the analysis is based on the compound image transforms and image statistics. The principle is to extract a very large number of image generic image descriptors (up to ∼3000 features) from the raw AS-OCT image. Compound transform has been validated to be useful in classiﬁcation of cell images [12,19]. The features are listed into three major groups (A, B and C) as shown in Fig. 3 and the graphical representation of the extracted features is illustrated in Fig. 4. The types of features described in each group as follows; Group A: high contrast features includes edge statistics, object statistics and Gabor statistics features; Group B: polynomial decompositions features includes Chebyshev statistics, Chebyshev-Fourier statistics, Zernike polynomials features;

Group C: pixel statistics and textures features includes Haralick and Tamura features, multi-scale histograms, ﬁrst four moments, pixel intensity statistics features, radon features, fractal features and Gini coefﬁcient. These features are calculated from the raw pixels, and image transforms as well as multi-order transforms, including the Fourier transform, wavelet transform (Symlet 5, Level 1), Chebyshev transform and tandem combinations of these transforms. Altogether, the feature vector consists of 2919 values, each of which provides the various information of an image (Fig. 4). Features generated by using compound transform have been widely used for analysis of complex visual arts and paintings [21], and it is very effective in classiﬁcation of galaxy images [22–24], knee radiography images [25,26]. As for face recognition, both coarse and ﬁne features generated by compound transform are required [27]. The abundant features generated by compound transform can fall into the following groups [12].

Fig. 3 – Basic features used for the construction of compound transform features.

c o m p u t e r m e t h o d s a n d p r o g r a m s i n b i o m e d i c i n e 1 3 0 ( 2 0 1 6 ) 65–75

Fig. 4 – Graphical representation for the construction of compound hierarchy of algorithms representing morphology features.

69

70

3.2.1.

c o m p u t e r m e t h o d s a n d p r o g r a m s i n b i o m e d i c i n e 1 3 0 ( 2 0 1 6 ) 65–75

Group A: high contrast features

High contrast features include edges and objects, statistics about object number, spatial distribution, size and shape information. • • Edge statistics features were calculated using Prewitt gradient [28], which comprises the mean, median, variance, and 8-bin histogram of both the magnitude and the direction components. Some of the additional edge features consist of the total number of edge pixels, the direction homogeneity [29], and the difference between the direction histogram bins are sampled into a four-bin histogram. • Object statistics such as Otsu and Inverse Otsu were calculated for all 8-connected objects found in the Otsu and Inverse Otsu binary mask of the object [30] respectively. The Euler Number, and the minimum, maximum, mean, median, variance, and a 10-bin histogram of both the objects areas and distances from the objects to the image centroid are also included [31]. • Gabor statistics features were calculated through the convolution with a kernel in the form of a Gaussian harmonic function [32,33] since the Gabor ﬁlters is suitable for ﬁltering in the spatial and frequency domain. Seven distinct frequencies were used for providing seven image descriptor values.

3.2.2.

•

•

•

•

Group B: polynomial decompositions features

In polynomial decomposition, a polynomial is produced which can approximate the image to some extent of ﬁdelity. The generated polynomial coefﬁcients are used as descriptors of the image. • Chebyshev statistics [34] were generated using a 32-bin histogram of a 1 × 400 vector calculated by Chebyshev transformation of the image with an order of N = 20. • Chebyshev-Fourier features [35] included a 32-bin histogram of the polynomial coefﬁcients of a ChebyshevFourier transformation with a polynomial order of up to N = 23. • Zernike features [36] are the absolute values of Zernike polynomial approximation coefﬁcients of the image [29] which generates 72 image descriptors. These features include rotation invariance, expression efﬁciency, robustness to noise and multilevel representation for describing the shapes of the patterns.

3.2.3.

•

described in [37], and contribute 28 image descriptors. These were obtained by a tabulation of how often many combinations of pixel brightness values occur in the image. Multi-scale histograms were calculated using various numbers of bins (3, 5, 7, and 9), as proposed by [38], providing 24 image descriptors. Tamura texture features [39] of contrast, directionality, and coarseness and its 3-bin histogram were utilized to provide 6 image descriptors. Radon transform features [40] were calculated for four different directions (0◦ , 45◦ , 90◦ , and 135◦ ), as well as each of the resulting series were convolved into a 3-bin histogram, which provide a total of 12 image features. Fractal features were computed as proposed by [41,42], providing a total of 20 image features. Since, the concept of fractal dimension is an indicator of the surface roughness, fractal-based texture analysis has a correlation between texture coarseness. Pixel intensity features such as mean, median, standard deviation, minimum and maximum pixel intensity values were calculated using standard formulas. Gini coefﬁcient is used to compute the quantitative measure of the inequality with which an object’s brightness is distributed amongst its constituent pixels. One Gini coefﬁcient is calculated using the method described in [43].

Group C: pixel statistics and texture features

Pixel statistics are mainly based on the distribution of pixel intensities of the image, including histograms and moments. Texture features are the inter-pixel variations in intensity for several directions and resolutions. • First mean, standard deviation, skewness, and kurtosis were computed on each image stripes in four different directions (0◦ , 45◦ , 90◦ and 135◦ ). Each set of stripes is ﬁnally sampled into a 3-bin histogram, which provides 48 image descriptors. • Haralick features were computed on the image’s cooccurrence matrix based on standard equations as

Since, the larger set of image features can be more informative; nevertheless, all the 2919 image features are not equally informative since some of the features are expected to be noisy and more redundant. In the proposed method, mRMR feature selection algorithm is used, which is shown to be superior for reliable and useful feature selection in previous works on ACG mechanisms classiﬁcation [8–10]. The mRMR feature selection algorithm is used as described in the following subsection, for selecting the most informative image features as well as rejecting the redundant and noisy features for classiﬁcation.

3.3.

Feature selection using mRMR

Minimum Redundancy Maximum Relevance (mRMR) is a feature selection algorithm that aims to ﬁnd features that are most relevant to the target classes, while reducing the redundancy between the selected features simultaneously [20]. To ﬁnd relevant features, the mutual information between a feature and the target class should be maximized. The mutual information between two variables m and n is deﬁned as,

I(m, n) =

p(m, n) · log

p(m, n) dm dn p(m) · p(n)

(1)

The mutual information between the selected features set F and class n can be deﬁned as, D(F, n) =

1 I(mi , n) |F| mi ∈ F

(2)

71

c o m p u t e r m e t h o d s a n d p r o g r a m s i n b i o m e d i c i n e 1 3 0 ( 2 0 1 6 ) 65–75

with F containing x features {m1 , m2 , . . ., mx } while that between each two features mi and mj (i, j = 1, 2, . . ., x) in F is deﬁned as, 1 R(F) = |F|2

Accuracy = I(mi , mj )

(4)

Concerning the statistical signiﬁcance, the F-measure is measured to calculate the test’s accuracy using precision and recall rates;

Naive Bayes Classiﬁer

Naive Bayes Classiﬁer (NBC) demonstrates its effectiveness in binary and multiclass classiﬁcation when the features are independent of each other [44]. It generates a single probabilistic model for each class, assuming conditional independence of the features given the class label. NBC can often outpace more sophisticated classiﬁcation methods for the medical diagnosis model [45]. Since the mutual information among the features selected by using mRMR is minimized, these features are predominantly independent of each other and suitable for NBC. Besides, classiﬁcation based on Bayes theorem is useful for medical model diagnosis since it combines training samples with a priori knowledge to get the posterior probability of a hypothesis. It considers the contribution of all the features independently to the probability of the target class and hence, it provides more useful information for decision support other than a class label or class probability [46].

3.5.

TP + TN TP + TN + FP + FN

(3)

mi ,mj ∈ F

The objective of feature selection is to select a compact feature set with high discrimination and no redundancy by maximizing D(F,n) and minimizing R(F)simultaneously. Hence the ﬁnal feature set is determined by maximizing the objective function ˚(D, R) which is deﬁned as D(F, n) − R(F) and D(F, n)/R(F) by using mutual information difference (MID) and mutual information quotient (MIQ) criteria, respectively. In practice, the best feature set is found by using incremental search methods such as sequential forward selection method by maximizing ˚(D, R) in each searching step.

3.4.

true negatives (TN), false positives (FP) and false negatives (FN) using standard formulas [9].

Performance analysis

The performance of the classiﬁer can be computed in terms of accuracy, F-measure, sensitivity and speciﬁcity and which in turn can be derived from the rates of true positives (TP),

F-measure = 2 ·

Precision ∗ Recall

(5)

Precision + Recall

where Precision =

4.

TP (TP + FP)

and Recall =

TP (TP + FN)

(6)

Experimental results and discussion

Our experiment was conducted using Matlab Toolbox from the original authors for feature selections with mRMR method [19]. Each feature in the original feature set was normalized to have zero mean and a standard deviation of unity and discretized in order to compute the mutual information for feature selection. Prior to classiﬁcation using NBC, ranked features are selected amongst extracted 2919 features using mRMR algorithm with mutual information difference. Leaveone-out cross-validation (LOOCV) is used due to the small sample size. Fig. 5 shows the graph of the accuracy when training the classiﬁer using NBC based on the mRMR ranked features and it is found that the accuracy of the mRMR feature selection algorithm grow quickly to a peak of 89.20%, at its top ﬁfteen features. Beyond this peak, the accuracy of the algorithm was found to dip slightly and eventually stabilized. The accuracy is gradually reached to 84.90% when using all 2919 extracted features as shown in Fig. 5. The mRMR feature selection algorithm with NBC was able to perform well on a small feature set that is 15 out of 2919 of the entire feature set, due to the selection of features that had high relevance to the target class while reducing features that may have been correlated to the features already selected. The ﬁfteen selected features are shown in Table 2, which would hence be signiﬁcant in the detection of the ACG mechanism.

Fig. 5 – Accuracy graph using mRMR ranked features using NBC classiﬁer.

72

c o m p u t e r m e t h o d s a n d p r o g r a m s i n b i o m e d i c i n e 1 3 0 ( 2 0 1 6 ) 65–75

Table 2 – Top ﬁfteen ranked features by mRMR method and its accuracy using NBC. Feature rank

Feature number

Feature name

1 2

917 1843

3 4

2326 2740

5 6

1215 2392

7 8

731 1701

9

1836

10 11

218 1565

12 13 14

516 2187 1509

15

1143

Radon Coefﬁcients (Wavelet (*)) Haralick Textures (Fourier (Chebyshev (*))) Radon Coefﬁcients (Edge (*)) Zernike Coefﬁcients (Wavelet (Edge (*))) Fractal Features (Chebyshev (*)) Chebyshev Coefﬁcients (Fourier (Edge (*))) Zernike Coefﬁcients (Wavelet (*)) Haralick Textures (Fourier (Wavelet (*))) Comb Moments (Fourier (Chebyshev (*))) Zernike Coefﬁcients (*) Chebyshev Coefﬁcients (Fourier (Wavelet (*))) Zernike Coefﬁcients (Fourier (*)) Zernike Coefﬁcients (Edge (*)) Pixel Intensity Statistics (Wavelet (Fourier (*))) Haralick Textures (Chebyshev (*))

(*) – raw input AS-OCT image.

All these ﬁfteen features are revealing the morphology and shape information, high-contrast features, textural statistical distribution of the pixel values as explained in the Section 3.2, hence it is useful for the ACG mechanism classiﬁcation using AS-OCT images.

Table 5 – Confusion matrix of 4-way classiﬁer using NBC with LOOCV method. ACG mechanism

Actual class

LV PB PIR PL

Predicted class LV

PB

PIR

PL

24 3 0 0

2 25 2 1

0 0 8 0

0 0 0 9

The accuracy with different numbers of features (from 1 to 2919) is compared in Fig. 5 when the classiﬁer is trained using NBC based on the mRMR ranked features. It is found that the accuracy of the mRMR feature selection algorithm improves quickly to a peak of 89.20%, at its top ﬁfteen features. Beyond this peak, the accuracy of the algorithm dips slightly, and eventually stabilizes. The accuracy reaches 84.90% when using all 2919 ranked features as shown in Fig. 5. This shows robustness of feature selection in the proposed method. We also compared the accuracy with different combinations of the top 15 features. The variation of accuracy with different combinations of top 15 features is given in Table 3. It is observed that the accuracy is 58.35% when only the top feature is used and it progressively improves once more features are added. The accuracy, F-measure, speciﬁcity, sensitivity and receiver operating characteristic (ROC) area of each class using Naïve Bayes Classiﬁer based on the selected 15 features are calculated and is shown in Table 4. Our method achieves 89.20% of accuracy, 89.30% of F-measure with a sensitivity of 88.90% and a speciﬁcity of 89.19%. Table 5 shows the confusion matrix of 4-way classiﬁer using NBC through LOOCV method. From the experimental results as shown in Tables 4 and 5, we can

Table 3 – Variation of accuracy using different combinations of the top 15 features. S.No. 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15

Feature combination

Accuracy in %

917 917, 1843 917, 1843, 2326 917, 1843, 2326, 2740 917, 1843, 2326, 2740, 1215 917, 1843, 2326, 2740, 1215, 2392 917, 1843, 2326, 2740, 1215, 2392, 731 917, 1843, 2326, 2740, 1215, 2392, 731, 1701 917, 1843, 2326, 2740, 1215, 2392, 731, 1701, 1836 917, 1843, 2326, 2740, 1215, 2392, 731, 1701, 1836, 218 917, 1843, 2326, 2740, 1215, 2392, 731, 1701, 1836, 218, 1565 917, 1843, 2326, 2740, 1215, 2392, 731, 1701, 1836, 218, 1565, 516 917, 1843, 2326, 2740, 1215, 2392, 731, 1701, 1836, 218, 1565, 516, 2187 917, 1843, 2326, 2740, 1215, 2392, 731, 1701, 1836, 218, 1565, 516, 2187, 1509 917, 1843, 2326, 2740, 1215, 2392, 731, 1701, 1836, 218, 1565, 516, 2187, 1509, 1143

58.35 59.46 64.86 68.02 68.92 67.56 71.63 74.32 71.62 77.02 77.20 77.50 83.79 85.13 89.20

Table 4 – Accuracies, sensitivities and speciﬁcities of each class using NB Classiﬁer with leave-one-out cross-validation method. Class No.

Accuracy (%)

Sensitivity (%)

Speciﬁcity (%)

F-measure (%)

ROC area

1 2 3 4

92.96 89.19 97.06 98.51

92.31 89.29 80.00 90.00

93.33 89.13 100.00 100.00

90.57 86.21 88.89 94.74

0.96 0.92 0.95 0.99

Weighted

89.20

88.90

89.19

89.30

0.95

c o m p u t e r m e t h o d s a n d p r o g r a m s i n b i o m e d i c i n e 1 3 0 ( 2 0 1 6 ) 65–75

Table 6 – Comparison of accuracy using various classiﬁers for ACG detection. S.No 1 2 3 4 5

Classiﬁer type

LOOCV (%)

10-Fold CV (%)

NB classiﬁer ANN classiﬁer k-NN classiﬁer Adaboost classiﬁer RF classiﬁer

89.20 75.67 74.32 63.51 58.10

85.12 79.72 74.32 78.40 59.45

see that the classiﬁcation accuracy of PIR and PL mechanisms is relatively higher than that of L and PB. There are two reasons for this: (1) from a medical point of view, both PB and L have very evident visual characteristics, such as convex forward iris proﬁle in PB and greatly reduced AC volume in L, which are highly discriminative; (2) the sample size of the experimental dataset is imbalanced, with more samples in PB and L and less samples in the PIR and PL. Since the number of samples collected for PIR and PL is small when compared to other two mechanisms which will reﬂect the imbalanced dataset for machine learning. When we provide more samples with equal number, the overall accuracy and F-measure can be improved. In machine learning application, feature selection (e.g., mRMR) is an important task to obtain high classiﬁcation performance. However, most feature selection methods are based on statistics of the data. More data will make the selected features more conﬁdent. That is to say, more samples will help to improve the performance of feature selection, and also the classiﬁcation accuracy, and F-measure. The performance of the NBC classiﬁer of the proposed method has been compared (see Table 6) with the other most used contemporary classiﬁers for medical diagnosis such as ANN viz. multilayer perceptron model using a back-propagation algorithm, k-Nearest Neighborhood (k-NN), Adaboost classiﬁer and random forest (RF) classiﬁer. For each classiﬁer, we used leave-one-out cross-validation (LOOCV) and k-fold cross-validation (here k = 10) for a fair comparison. In 10fold cross validation, the training set is divided into 10 sets of equal number of samples. Each time, 9 of these sets are used for training (k-1) and the remaining one for testing. This procedure is repeated 9 times (folds) using a different part for testing in each case. Finally the average value of all ten results is estimated to analyze the classiﬁer performance. LOOCV method will work in the same above mentioned concept when k = 1 in k-fold cross validation method. Due to small number of samples, the statistics of the data is hard to be fully modeled, and there is some difference between LOOCV and 10-fold results for this present work. Since the selected features using mRMR are independent with each other; the combination of mRMR with NBC provides better result than the other multivariate classiﬁers. To the best of our knowledge, there is no such an automated method for classifying different ACG mechanisms. The other methods are mainly based on the manual observation of clinicians, which is not an automated way for classiﬁcation of different ACG mechanisms. Future work includes using a large dataset to learn better discriminative features and improve the classiﬁcation accuracy. Furthermore, some samples may have multiple mechanisms simultaneously, which should be

73

considered by proposing new machine learning methods. This study proposes a fully automated way for the classiﬁcation of different ACG mechanisms, which is without intervention of doctors and less subjective when compared to the existing methods.

5.

Conclusions

Angle-closure glaucoma, which is a speciﬁc type of glaucoma, has been observed to be the result of one or more mechanisms in the anterior segment of the eye. Traditional methods for classiﬁcation of the different mechanisms are based on calculation of the key parameters, such as the angle opening distance (AOD), trabecular-iris space area (TISA) and angle recess area (ARA) from the segmented AS-OCT images with the help of doctors, which are not fully automated. Since the manual diagnosis mainly based on the morphology of each mechanism, in this study, we directly extract multiple morphological features using compound transform and image statistics, from which a small set of features is selected by using mRMR, and then used for classiﬁcation. Our method provides a completely automated and efﬁcient way for the classiﬁcation of different ACG mechanisms. Segmentation of anterior chamber area and its measurement is not required in this method. Experimental results show that our method achieves 89.20% classiﬁcation accuracy. The limitation of this method is that the all features extracted from the raw AS-OCT images are based on compound image transformed features and its supervised learned feature selection method. Practically, manual labeling of various mechanism in huge ACG dataset are time consuming tasks for medical practitioners. In future work, unsupervised learning method and transfer learning method will be explored to conquer this problem. The possibility that a mechanism occurs in conjunction with other mechanisms could also be studied e.g. an iris roll mechanism occurring with a lens mechanism. Classifying all samples under only one class label, when some of these samples may have the features of two or more mechanisms, may adversely affect the performance of feature selection and classiﬁcation; this can be investigated as well.

Conﬂict of interest statement None declared.

Acknowledgement This work was supported by Ministry of Education (MoE) AcRF Tire 1 Funding, Singapore, under Grant M4010981.020 RG36/11.

references

[1] Z.B. Eliash, G. Wollstein, T. Chu, J.D. Ramsey, C. Glymour, R.J. Noecker, H. Ishikawa, J.S. Schuman, Optical coherence tomography machine learning classiﬁers for glaucoma detection: a preliminary study, Invest. Ophthalmol. Vis. Sci. 46 (11) (2005) 4147–4152.

74

c o m p u t e r m e t h o d s a n d p r o g r a m s i n b i o m e d i c i n e 1 3 0 ( 2 0 1 6 ) 65–75

[2] J. Cheng, J. Liu, B.H. Lee, D.W.K. Wong, F. Yin, T. Aung, M. Baskaran, P. Shamira, T.Y. Wong, Closed angle glaucoma detection in RetCam images, in: Proc. Annual Int. Conf. IEEE EMBC, 2010, pp. 4096–4099. [3] H.A. Quigley, A.T. Broman, The number of people with glaucoma worldwide in 2010 and 2020, Br. J. Ophthalmol. 90 (3) (2006) 262–267. [4] Glaucoma Research Foundation, 2013. Available at: http://www.glaucoma.org/glaucoma/types-of-glaucoma.php. [5] C.C. Sng, M.C. Aquino, J. Liao, M. Ang, C. Zheng, S.C. Loon, P.T. Chew, Pretreatment anterior segment imaging during acute primary angle closure: insights into angle closure mechanisms in the acute phase, Ophthalmology 121 (1) (2014) 119–125. [6] N. Shabana, M.C. Aquino, J. See, A.M. Tan, W.P. Nolan, R. Hitchings, S.M. Young, S.C. Loon, C.C. Sng, W. Wong, P.T.K. Chew, Quantitative evaluation of anterior chamber parameters using anterior segment optical coherence tomography in primary angle closure mechanisms, Clin. Exp. Ophthalmol. 40 (2012) 792–801. [7] C.K. Leung, W.M. Chan, C.Y. Ko, S.I. Chui, J. Woo, M.K. Tsang, R.K. Tse, Visualization of anterior chamber angle dynamics using optical coherence tomography, Ophthalmology 112 (2005) 980–984. [8] A. Wirawan, C.K. Kwoh, P.T.K. Chew, M.C.D. Aquino, C.L. Seng, J. See, C. Zheng, W. Lin, Feature selection for computer-aided angle closure glaucoma mechanism detection, J. Med. Imaging Health Inform. 2 (4) (2012) 438–444. [9] S. Issac Niwas, W. Lin, C.K. Kwoh, C.-C. Jay Kuo, C.C. Sng, M.C. Aquino, P.T.K. Chew, Cross-examination for angle-closure glaucoma feature detection, IEEE J. Biomed. Health Inform. 20 (1) (2016) 343–354. [10] S. Issac Niwas, X. Bai, W. Lin, C.K. Kwoh, C.C. Sng, M.C. Aquino, P.T.K. Chew, Reliable feature selection technique for automated angle closure glaucoma mechanism detection, J. Med. Syst. 39 (3) (2015) 1–10. [11] X.S. Bai, S. Issac Niwas, W. Lin, B.-F. Ju, C.K. Kwoh, L. Wang, C.C. Sng, M.C. Aquino, P.T.K. Chew, Learning ECOC code matrix for multiclass classiﬁcation with application to glaucoma diagnosis, J. Med. Syst. 40 (4) (2016) 1–10. [12] N. Orlov, L. Shamir, T. Macura, J. Johnston, D.M. Eckley, I.G. Goldberg, WND-CHARM: multipurpose image classiﬁcation using compound image transforms, Pattern Recognit. Lett. 29 (11) (2008) 1684–1693. [13] U.R. Acharya, S. Dua, X. Du, V.S. Sree, C.K. Chua, Automated diagnosis of glaucoma using texture and higher order spectra features, IEEE Trans. Inf. Technol. Biomed. 15 (3) (2011) 449–455. [14] J. Nayak, U.R. Acharya, P.S. Bhat, A. Shetty, T.C. Lim, Automated diagnosis of glaucoma using digital fundus images, J. Med. Syst. 33 (5) (2009) 337–346. [15] M.J. Greany, D.C. Hoffman, D.F.G. Heath, M. Nakla, A.L. Coleman, J. Caprioli, Comparisons of optic nerve imaging methods to distinguish normal eyes from those with glaucoma, Invest. Ophthalmol. Vis. Sci. 43 (1) (2002) 140–145. [16] M. Balasubramanian, S. Zabic, C. Bowd, H.W. Thompson, P. Wolenski, S.S. Iyengar, B.B. Karki, L.M. Zangwill, A framework for detecting glaucomatous progression in the optic nerve head of an eye using proper orthogonal decomposition, IEEE Trans. Inf. Technol. Biomed. 13 (5) (2009) 781–793. [17] J. Xu, H. Ishikawa, G. Wollstein, J.S. Schuman, 3D optical coherence tomography super pixel with machine classiﬁer analysis for glaucoma detection, in: Proc. IEEE Eng. Med. Biol. Soc., 2011, pp. 3395–3398. [18] J. Xu, H. Ishikawa, G. Wollstein, R.A. Bilonick, K.R. Sung, L. Kagemann, K.A. Townsend, J.S. Schuman, Automated

[19]

[20]

[21]

[22] [23] [24] [25]

[26]

[27]

[28] [29]

[30] [31] [32] [33]

[34] [35]

[36] [37]

[38]

[39]

[40] [41]

assessment of the optic nerve head on stereo disc photographs, Invest. Ophthalmol. Vis. Sci. 49 (6) (2008) 2512–2517. L. Shamir, N. Orlov, D.M. Eckley, T. Macura, J. Johnston, I.G. Goldberg, Wndchrm – an open source utility for biological image analysis, Source Code Biol. Med. 3 (2008) 1–13. H. Peng, F. Long, C. Ding, Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy, IEEE Trans. Pattern Anal. Mach. Intell. 27 (8) (2005) 1226–1238. L. Shamir, T. Macura, N. Orlov, D.M. Eckley, I.G. Goldberg, Impressionism, expressionism, surrealism: automated recognition of painters and schools of art, ACM Trans. Appl. Percept. 7 (2) (2010) 8. L. Shamir, Automatic detection of peculiar galaxies in large datasets of galaxy images, J. Comput. Sci. 3 (2012) 181–189. L. Shamir, G-analyzer: a tool for automatic galaxy image analysis, Astrophys. J. 736 (2) (2011) 141. L. Shamir, Automatic morphological classiﬁcation of galaxy images, Mon. Notices R. Astron. Soc. 399 (2009) 1367–1372. L. Shamir, S.M. Ling, W.W. Scott, A. Bos, N. Orlov, T. Macura, D.M. Eckley, I.G. Goldberg, Knee X-ray image analysis method for automated detection of osteoarthritis, IEEE Trans. Biomed. Eng. 56 (2) (2009) 407–415. L. Shamir, S.M. Ling, W. Scott, M. Hochberg, L. Ferrucci, I.G. Goldberg, Early detection of radiographic knee osteoarthritis using computer-aided analysis, Osteoarthr. Cartil. 17 (10) (2009) 1307–1312. L. Shamir, Evaluation of face datasets as tools for assessing the performance of face recognition methods, Int. J. Comp. Vis. 79 (2008) 225–229. R.C. Gonzalez, R.E. Woods, Digital Image Processing, Pearson Education Inc, 2008. R.F. Murphy, M. Velliste, J. Yao, G. Porreca, Searching online journals for ﬂuorescence microscopy images depicting protein subcellular location patterns, in: Proc. 2nd IEEE International Symposium on Bioinformatics and Biomedical Engineering, 2001, pp. 119–128. N. Otsu, A threshold selection method from gray level histograms, IEEE Trans. Syst. Man Cybern. 9 (1979) 62–66. S.B. Gray, Local properties of binary images in two dimensions, IEEE Trans. Comput. 20 (1971) 551–561. D. Gabor, Theory of communication, J. IEEE 93 (1946) 429–457. C. Gregorescu, N. Petkov, P. Kruizinga, Comparison of texture features based on Gabor ﬁlters, IEEE Trans. Image Process. 11 (2002) 1160–1167. I. Gradshtein, I. Ryzhik, Table of Integrals, Series and Products, 5th ed., Academic Press, 1994. N. Orlov, J. Johnston, T. Macura, L. Shamir, I. Goldberg, Computer vision for microscopy applications, in: Vision Systems – Segmentation and Pattern Recognition, ARS Press, 2007, pp. 221–242. M. Teague, Image analysis via the general theory of moments, J. Opt. Soc. Am. 70 (1980) 920–930. R.M. Haralick, K. Shanmugam, I. Dinstein, Textural features for image classiﬁcation, IEEE Trans. Syst. Man Cybern. 6 (1973) 269–285. E. Hadjidementriou, M. Grossberg, S. Nayar, Spatial information in multiresolution histograms, IEEE Conf. Comput. Vis. Pattern Recognit. 1 (2001) 702. H. Tamura, S. Mori, T. Yamavaki, Textural features corresponding to visual perception, IEEE Trans. Syst. Man Cybern. 8 (1978) 460–472. J.S. Lim, Two-dimensional Signal and Image Processing, Prentice Hall, New Haven, 1990. C.M. Wu, Y.C. Chen, K.S. Hsieh, Texture features for classiﬁcation of ultrasonic liver images, IEEE Trans. Med. Imaging 11 (2) (1992) 141–152.

c o m p u t e r m e t h o d s a n d p r o g r a m s i n b i o m e d i c i n e 1 3 0 ( 2 0 1 6 ) 65–75

[42] C.C. Chen, J.S. Daponte, M.D. Fox, Fractal feature analysis and classiﬁcation in medical imaging, IEEE Trans. Med. Imaging 8 (2) (1989) 133–142. [43] R. Abraham, S. van den Bergh, P. Nair, A new approach to galaxy morphology: I. Analysis of the Sloan digital sky survey early data release, Astrophys. J. 588 (2003) 218–229. [44] I. Rish, An empirical study of the Naïve Bayes classiﬁer, in: Proc. of IJCAI Workshop on Empirical Methods in AI, 2001.

[45] A. Mahdi, A. Razali, A. Alwakil, Comparison of fuzzy diagnosis with K-nearest neighbor and Naïve Bayes classiﬁers in disease diagnosis, Broad Res. Artif. Intell. Neurosci. 2 (2) (2011) 58–66. [46] A.Y. Ng, M.I. Jordan, On discriminative vs. generative classiﬁers: a comparison of logistic regression and Naïve Bayes, Neural Inf. Process. Syst. 14 (2001) 605–610.

75

Automated anterior segment OCT image analysis for Angle Closure Glaucoma mechanisms classification

Automated anterior segment OCT image analysis for Angle Closure Glaucoma mechanisms classification

Recommend Documents