Computers in Biology and Medicine 83 (2017) 69–83
An extensive analysis of various texture feature extractors to detect Diabetes Mellitus using facial specific regions
Ting Shu, Bob Zhang⁎, Yuan Yan Tang
Department of Computer and Information Science, University of Macau, Avenida da Universidade, Taipa, Macau, China
⁎ Corresponding author. E-mail addresses: [email protected] (T. Shu), [email protected] (B. Zhang), [email protected] (Y. Yan Tang).
http://dx.doi.org/10.1016/j.compbiomed.2017.02.005
Received 16 January 2017; Received in revised form 16 February 2017; Accepted 16 February 2017
0010-4825/ © 2017 Elsevier Ltd. All rights reserved.
Keywords: Texture feature analysis; Facial key block analysis; Diabetes Mellitus detection; Image gray-scale histogram; Medical biometrics

Abstract
Introduction: Researchers have recently discovered that Diabetes Mellitus can be detected through a noninvasive computerized method. However, the focus has been on facial block color features. In this paper, we extensively study the effects of texture features extracted from facial specific regions at detecting Diabetes Mellitus, using eight texture extractors.
Materials and methods: The eight methods come from four texture feature families: (1) the statistical texture feature family: Image Gray-scale Histogram, Gray-level Co-occurrence Matrix, and Local Binary Pattern; (2) the structural texture feature family: Voronoi Tessellation; (3) the signal processing based texture feature family: Gaussian, Steerable, and Gabor filters; and (4) the model based texture feature family: Markov Random Field. In order to determine the most appropriate extractor with optimal parameter(s), various parameter values of each extractor are evaluated. For each extractor, the same dataset (284 Diabetes Mellitus and 231 Healthy samples), classifiers (k-Nearest Neighbors and Support Vector Machines), and validation method (10-fold cross validation) are used.
Results: According to the experiments, the first and third families achieved a better outcome at detecting Diabetes Mellitus than the other two.
Conclusions: The best texture feature extractor for Diabetes Mellitus detection is the Image Gray-scale Histogram with bin number = 256, obtaining an accuracy of 99.02%, a sensitivity of 99.64%, and a specificity of 98.26% using SVM.
1. Introduction

Diabetes Mellitus (DM), more commonly known as diabetes, is a disease that occurs when the body cannot effectively use the insulin it produces [1]. In addition, it can occur when the pancreas does not produce enough insulin [1]. The World Health Organization (WHO) estimated that in 2014 there were 422 million people worldwide suffering from DM [2], and this number is projected to increase to 642 million by 2040 [3]. This means 1 in 10 adults will have DM, with the Western Pacific (including East/Southeast Asia and all of Oceania) accounting for 214.8 million DM patients [3]. There are four traditional DM diagnostic techniques [4]: the A1C test, the Fasting Plasma Glucose (FPG) test, the 2-h Plasma Glucose (2-h PG) test, and the Oral Glucose Tolerance Test (OGTT). All four diagnostic methods draw blood from the patient, which can cause pain and discomfort as well as being invasive. For example, in order to administer the OGTT [5,6], the patient must drink a liquid containing glucose after at least 8 h of fasting. Then, a small blood sample is usually taken every hour for 2–3 h.
Recently, researchers discovered that DM can be detected through facial block color analysis [7]. Compared with the traditional DM diagnostic methods [5,6], this method is non-invasive and causes no discomfort to the patient. There are two kinds of features in facial block analysis: color [7] and texture. However, there is little to no literature on facial block analysis using texture features, and more specifically on applying these texture features to detect DM. There are many texture feature extractors other than Gabor filters in texture analysis, such as co-occurrence matrices [8], eigenfilters [9], fractal models [10], the Local Binary Pattern [11], etc. In [12], texture features are categorized into four families: statistical, structural, signal processing based, and model based texture features. Statistical texture features are computed from the statistical distribution of image intensities at specified relative pixel positions, which measures the spatial distribution of pixel values. In this family, there are many features with first to higher order statistics, which are determined by the number of pixels in each observation [12]. In
order to explore the influence of this type of feature on DM detection, different orders of features are used in this paper. The Image Gray-scale Histogram (IGH) [13–15] is a typical first order statistical texture feature, the Gray-level Co-occurrence Matrix (GLCM) [16–18] belongs to the second order, and the Local Binary Pattern (LBP) [11,19–22] is employed as a higher order feature. Structural texture features are characterized by texture primitives, which can be as simple as individual pixels, a region with uniform gray-levels, or line segments [12]. In our paper, Voronoi Tessellation (VT) [23–25] is used to form the texture primitives of the facial specific regions. Most signal processing based texture features are computed from the energy of the responses produced by applying filter banks to the image. Three filters (Gaussian [26–28], Steerable [29–31], and Gabor [32–34]) are explored in this paper. Model based texture features are generally represented through stochastic and generative models, with the estimated model parameters serving as the texture features for texture analysis [12]. Many types of models are used for this kind of feature, such as random field models. Here, the Markov Random Field (MRF) [35–37] is applied as the model for the model based texture feature.

Based on the categorization mentioned above, we extensively analyze each texture extractor family in order to determine the optimal extractor for DM detection. The facial images are first captured through a specially designed device, and four facial key blocks are extracted automatically from the facial specific regions. Next, we use each of the eight texture feature extractors to extract texture features from each facial key block. Two traditional classifiers (k-Nearest Neighbors (k-NN) [38–40] and Support Vector Machines (SVM) [41–43]) are applied to detect DM based on the extracted texture feature values with 10-fold cross validation [44–46]. The optimal texture feature extractor is selected according to this classification result. In addition, different parameter values of the extractors are tested to determine the best method and its most fitting parameter(s).

The rest of this paper is organized as follows. Section 2 introduces the facial specific regions and the eight texture feature extraction methods. The experimental results of the eight methods for DM detection are described in Section 3, and Section 4 concludes this paper.
Fig. 1. Different facial regions according to TCM.
Fig. 2. An example of a facial image with four located key blocks.
2. Materials and methods

2.1. Facial specific regions

As mentioned above, each facial image is captured through a specially designed image capture device. The device is a black box consisting of a 3-CCD camera in the center and two lamps, one on each side. When capturing the facial image, the individual places his/her head on the chin rest in front of the camera. This ensures all facial images captured under different environments are not affected by lighting. In Traditional Chinese Medicine (TCM) it is believed that the status of the internal organs can be determined from different regions of the face [47–49]. Fig. 1 shows a human face partitioned into various regions according to TCM [50]. Applying this idea to our proposed method, four facial key blocks are automatically extracted from each facial image, representing the main regions. No facial block is used to represent region C in Fig. 1 due to the possible presence of facial hair. Fig. 2 is an example of a facial image with its four key blocks labeled. The Forehead Block (FHB) is on the forehead, the Left Cheek Block (LCB) and Right Cheek Block (RCB) are symmetric and positioned below the left and right eyes respectively, and the last block, the Nose Bridge Block (NBB), is located on the nose at the mid-point between LCB and RCB.

We extract the four facial key blocks automatically. In the automatic key block extraction procedure, the pupils are first detected and marked. The positions of the two pupils are denoted as $LP: (w_1, h_1)$ (left) and $RP: (w_2, h_2)$ (right). Based on LP and RP, the four facial key blocks are located through Eqs. (1)–(4):
$$L_{FHB} = \left(\frac{w_1 + w_2}{2},\; \frac{h_1 + h_2}{2} + \frac{1}{3}H\right) \quad (1)$$

$$L_{LCB} = \left(w_1,\; h_1 - \frac{1}{4}H\right) \quad (2)$$

$$L_{NBB} = \left(\frac{w_1 + w_2}{2},\; \frac{h_1 + h_2}{2} - \frac{2}{9}H\right) \quad (3)$$

$$L_{RCB} = \left(w_2,\; h_2 - \frac{1}{4}H\right) \quad (4)$$
where $L_{name}$ denotes the position of the corresponding key block (e.g., $L_{FHB}$ is the position of FHB), and $W$ and $H$ are the width and height of the facial image, respectively. Fig. 2 depicts the locations of the four facial key blocks based on the left and right pupil positions. The size of each facial key block is first set to 64×64. In order to select the optimal block size, various facial key block sizes are tested in Section 3.6. More details about the facial image capture device and facial key blocks can be found in [7].
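To make Eqs. (1)–(4) concrete, the following is a minimal sketch (not the authors' code) of the key block localization step. The coordinate convention simply follows the equations as written, and the pupil coordinates in the usage example are hypothetical.

```python
# Illustrative sketch: locating the four facial key blocks from the
# detected pupil positions, following Eqs. (1)-(4). H is the height of
# the facial image; block names follow the paper.

def locate_key_blocks(lp, rp, H):
    """lp, rp: (w, h) coordinates of the left and right pupils."""
    w1, h1 = lp
    w2, h2 = rp
    cw, ch = (w1 + w2) / 2, (h1 + h2) / 2   # mid-point between the pupils
    return {
        "FHB": (cw, ch + H / 3),            # Eq. (1): forehead block
        "LCB": (w1, h1 - H / 4),            # Eq. (2): left cheek block
        "NBB": (cw, ch - 2 * H / 9),        # Eq. (3): nose bridge block
        "RCB": (w2, h2 - H / 4),            # Eq. (4): right cheek block
    }

# Hypothetical example: pupils at (210, 300) and (310, 300), image height 480.
print(locate_key_blocks((210, 300), (310, 300), H=480))
```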
2.2. Texture feature extraction

The eight texture extractors are described in this section. First, the three methods (IGH, GLCM, and LBP) from the statistical texture feature family are given, along with a description of VT from the structural texture feature family. Afterwards, the three filters (Gaussian, Steerable, and Gabor) from the third family (signal processing based) are described. Finally, MRF, coming from the last family (model based), is introduced. Table 1 lists the eight methods from the four families.

Table 1
List of the texture feature extractors.

Family                       Method name
1. Statistical               Image Gray-scale Histogram (IGH); Gray-level Co-occurrence Matrix (GLCM); Local Binary Pattern (LBP)
2. Structural                Voronoi Tessellation (VT)
3. Signal processing based   Gaussian; Steerable; Gabor
4. Model based               Markov Random Field (MRF)
2.3. Statistical texture feature family
The statistical texture features measure the spatial distribution of pixel values by computing local features based on the statistical distribution of the image intensities at specified relative pixel positions [12,51,52]. Based on the number of pixels that defines the local feature, statistical texture feature extraction algorithms can be categorized into first order (one pixel), second order (two pixels), and higher order (three or more pixels) statistics [52]. The first order statistics estimate properties of individual pixel values, whereas second and higher order statistics evaluate properties of the spatial interaction between two or more image pixels [51,53]. In order to explore the various orders of statistics for DM detection, we select one typical method from each order. Hence, IGH, GLCM, and LBP are the first, second, and higher order statistical texture feature extractors used, respectively. They are introduced one by one in this subsection.

2.3.1. Image Gray-scale Histogram (IGH)
IGH calculates the image gray-scale value intensities and uses the calculated histogram to represent the image [13–15]. IGH is easy and fast to compute, and analyzes the spatial distribution of gray-scale values of each facial key block. Let $H = [h(0), h(1), \ldots, h(C)]$ denote the IGH of a block, where $h(i)$ corresponds to the frequency of the gray-scale value $i$ in this block and $1 \le C \le 255$ is the maximum gray level. It is defined as:
$$h(i) = \sum_{x,y} f_i(x, y), \quad i = 0, 1, \ldots, C \quad (5)$$

where $f_i(x, y)$ is given as:

$$f_i(x, y) = \begin{cases} 1, & \text{if } im(x, y) = i, \\ 0, & \text{otherwise} \end{cases} \quad (6)$$

and $im(x, y)$ is the gray-scale value of pixel $(x, y)$ in this block. Therefore, each facial image has an IGH feature vector of dimensionality $4C + 4$ ($C + 1$ feature values per block × 4 facial key blocks). An example of IGH applied to FHB for both DM and Healthy is shown in Fig. 3a and b, respectively. Here the number of bins is 256.

Fig. 3. An example of IGH applied to FHB for DM (left) and Healthy (right).
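As an illustration of Eqs. (5)–(6), a minimal IGH sketch (assuming NumPy and 8-bit gray-scale blocks; this is not the authors' implementation) could look as follows.

```python
# A minimal sketch of IGH feature extraction (Eqs. (5)-(6)). With
# bins = C + 1, the four per-block histograms are concatenated into a
# 4(C+1)-dimensional feature vector, as described above.
import numpy as np

def igh_feature(blocks, bins=256):
    """blocks: list of four 2-D uint8 arrays (FHB, LCB, NBB, RCB)."""
    feats = []
    for block in blocks:
        # h(i): frequency of gray-scale value i within the block
        hist, _ = np.histogram(block, bins=bins, range=(0, 256))
        feats.append(hist)
    return np.concatenate(feats)  # shape: (4 * bins,)

blocks = [np.random.randint(0, 256, (64, 64), dtype=np.uint8) for _ in range(4)]
print(igh_feature(blocks).shape)  # (1024,) for 256 bins
```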
2.3.2. Gray-level Co-occurrence Matrix (GLCM)
GLCM represents the texture of an image by calculating how often pairs of pixels with specific values occur in a specified spatial relationship in the image [16–18]. For GLCM, a matrix is first generated by counting the frequency of occurrence of each pair of pixel values in the image. Fig. 4 is an example of GLCM generation [55]. After the matrix is generated, various properties can be extracted to represent the texture of the image. In this paper, four properties are extracted: (1) contrast: evaluating the local variations in the matrix; (2) correlation: measuring the joint probability of occurrence of the pairs; (3) energy: the sum of squared elements in the matrix; and (4) homogeneity: estimating the closeness of the distribution of elements in the matrix. An example of the four properties of FHB from a DM sample and a Healthy sample is shown in Table 2.
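For reference, the four properties can be computed with scikit-image, as in the sketch below; the distance/angle pair of the co-occurrence matrix is an assumption, since the paper does not state it (in older scikit-image versions the functions are named greycomatrix/greycoprops).

```python
# A sketch of the four GLCM properties for one key block.
import numpy as np
from skimage.feature import graycomatrix, graycoprops

def glcm_feature(block, props=("contrast", "correlation", "energy", "homogeneity")):
    """block: 2-D uint8 array; returns one value per requested property."""
    # Co-occurrence counts for pixel pairs at distance 1, angle 0 (assumed).
    glcm = graycomatrix(block, distances=[1], angles=[0],
                        levels=256, symmetric=True, normed=True)
    return np.array([graycoprops(glcm, p)[0, 0] for p in props])

block = np.random.randint(0, 256, (64, 64), dtype=np.uint8)
print(glcm_feature(block))  # [contrast, correlation, energy, homogeneity]
```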
2.3.3. Local Binary Pattern (LBP)
LBP is a particular case of the Texture Spectrum model and was first described in [56]. LBP has been proven to be a powerful feature and is commonly used for texture classification [11]. Fig. 5 is an example of how to extract LBP from an image. The image is first divided into non-overlapping cells of size $N$ ($3 \le N \le ImageSize$). For each cell, a histogram of size $B$ is extracted to represent the cell. Finally, the histograms of all the cells are concatenated into the LBP feature of the image. The size of this LBP feature is $1 \times L$, where $L = B \times$ the number of cells. An example of calculating LBP is shown in Fig. 6. Each pixel of a cell is compared with its 8 neighbors: the bit at a position is '0' where the pixel is greater than its neighbor, and '1' otherwise. Following the pixels from left to right and top to bottom, an 8-bit binary number is generated and converted to decimal for convenience. Over the cell, a histogram is computed based on the frequency of each number occurring. Figs. 7a and b show an example of LBP applied to FHB of a DM and a Healthy sample, respectively, where the cell size is 32.
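A minimal cell-wise LBP sketch, assuming 8 neighbors at radius 1 and B = 256 bins per cell (both assumptions, since the paper leaves them open):

```python
# A sketch of cell-wise LBP: per-cell code histograms are concatenated
# into the block's LBP feature, as described above.
import numpy as np
from skimage.feature import local_binary_pattern

def lbp_feature(block, cell_size=32, bins=256):
    """block: 2-D gray-scale array; cell_size: side length of each cell."""
    codes = local_binary_pattern(block, P=8, R=1)  # 8-bit codes in [0, 255]
    feats = []
    for r in range(0, block.shape[0], cell_size):
        for c in range(0, block.shape[1], cell_size):
            cell = codes[r:r + cell_size, c:c + cell_size]
            hist, _ = np.histogram(cell, bins=bins, range=(0, 256))
            feats.append(hist)
    return np.concatenate(feats)  # length B x (number of cells)

block = np.random.randint(0, 256, (64, 64), dtype=np.uint8)
print(lbp_feature(block).shape)  # (1024,) for a 64x64 block with 32x32 cells
```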
2.4. Structural texture feature family

Structural texture features are characterized by texture primitives and the spatial placement rules of these primitives [12]. For this feature type, texture primitives are first extracted from the images and the spatial placement rules are then generalized. The texture primitives can be pixels or line segments, while the placement rules can be geometric relationships or statistical properties among these primitives [53]. In [57], Tuceryan et al. proposed an algorithm based on VT to segment texture and showed that VT is effective in texture analysis. Beyond texture segmentation, VT has been employed in many other applications (such as cluster finding, distribution studies, and so on) and proven to be effective and easy to use in texture processing [58,59]. Therefore, in this paper, VT and its distances are used as the texture primitive and the spatial placement rule for the structural texture feature, respectively.

2.4.1. Voronoi Tessellation (VT)
For VT, some tokens [60], which are meaningful points in the image, are first specified. A region corresponding to each token, called a Voronoi cell, is then created; the region consists of all points closer to that token than to any other. Hence, an image is partitioned into regions. An example of a VT diagram is shown in Fig. 8. Using VT to extract texture features from images was first proposed in [61]. A VT feature can be represented based on the properties of the partitioned regions, such as their area. Table 3 shows an example of the VT feature vector for both a DM and a Healthy FHB, where the token number is 4.
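A small sketch of this VT feature under simplifying assumptions (tokens given as coordinates rather than detected local minima of the block):

```python
# Illustrative sketch: each token induces a Voronoi cell (all pixels
# nearer to it than to any other token), and each cell is summarized by
# the mean distance between its pixels and its token.
import numpy as np

def vt_feature(block_shape, tokens):
    """tokens: (k, 2) array of token (row, col) positions in the block."""
    rows, cols = np.indices(block_shape)
    pixels = np.stack([rows.ravel(), cols.ravel()], axis=1).astype(float)
    # Distance of every pixel to every token; nearest token = Voronoi cell.
    dists = np.linalg.norm(pixels[:, None, :] - tokens[None, :, :], axis=2)
    labels = dists.argmin(axis=1)
    # Mean pixel-to-token distance within each Voronoi cell.
    return np.array([dists[labels == k, k].mean() for k in range(len(tokens))])

tokens = np.array([[10, 10], [10, 50], [50, 10], [50, 50]], dtype=float)
print(vt_feature((64, 64), tokens))  # one mean distance per Voronoi cell
```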
Fig. 4. An example of GLCM generation [55].
Table 2
An example of FHB's four properties for both a DM and a Healthy sample using GLCM.

Class     Contrast   Correlation   Energy   Homogeneity
DM        0.2406     0.4991        0.3384   0.8797
Healthy   2.3059     0.8352        0.6696   0.9588

2.5. Signal processing based texture feature family

Signal processing based texture features are commonly extracted by applying filter banks to extract edges, lines, isolated dots, etc. from the image. The Gaussian filter is a common filter in this third family [26–28], the Steerable filter is a derivative of the Gaussian filter and is very useful for texture analysis [29–31], and the Gabor filter behaves similarly to the human optical system [32,33]. Each filter is described below.

2.5.1. Gaussian filter
In signal processing, a Gaussian filter is a filter which convolves with a signal (or image) to produce a Gaussian function (or its approximation) as its response. The Gaussian filter is frequently used in preprocessing to remove noise for edge detection (such as in the Canny edge detector) in image processing [62]. In two dimensions (2-D), the Gaussian function is defined as:

$$\mathrm{Gaussian}(x, y) = \frac{1}{2\pi\sigma^2} \cdot e^{-\frac{x^2 + y^2}{2\sigma^2}} \quad (7)$$

where $x$ indicates the distance between the point and the origin on the horizontal axis, $y$ is the distance from the origin on the vertical axis, and $\sigma$ is the standard deviation of the Gaussian distribution. In this paper, different values of the two parameters, filter size and $\sigma$, are set to produce the optimal result. Initially, we apply the 2-D Gaussian function to generate a Gaussian filter bank. Next, each Gaussian filter is convolved with each facial key block to produce a response $R(x, y)$:

$$R(x, y) = G(x, y) * im(x, y) \quad (8)$$

where $im(x, y)$ is a facial key block, $G(x, y)$ denotes a filter in the Gaussian filter bank, and $*$ represents convolution. The texture value of each response is the mean of all its pixels. Therefore, the size of each sample feature vector extracted by the Gaussian filter is 4 (1 feature value per block × 4 blocks). An example of FHB's Gaussian feature value for both DM and Healthy samples is presented in Table 4, where the filter size is 13 and $\sigma = 3$.
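A minimal sketch of Eqs. (7)–(8) for one block; the unnormalized kernel and the 'same' convolution mode are assumptions.

```python
# Sketch of the Gaussian texture value: build a 2-D Gaussian kernel
# (Eq. (7)), convolve it with a key block (Eq. (8)), and take the mean
# of the response as the per-block texture value.
import numpy as np
from scipy.signal import convolve2d

def gaussian_kernel(size, sigma):
    half = size // 2
    x, y = np.meshgrid(np.arange(-half, half + 1), np.arange(-half, half + 1))
    return np.exp(-(x**2 + y**2) / (2 * sigma**2)) / (2 * np.pi * sigma**2)

def gaussian_texture_value(block, size=13, sigma=3):
    response = convolve2d(block, gaussian_kernel(size, sigma), mode="same")
    return response.mean()  # one texture value per block

block = np.random.randint(0, 256, (64, 64)).astype(float)
print(gaussian_texture_value(block))
```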
Fig. 5. An example of LBP extraction from an image.
Fig. 6. An example of calculating a local binary pattern.
2.5.2. Steerable filter
The Steerable filter is a derivative of the Gaussian filter and was first proposed by Freeman and Adelson in 1991 [30]. Steerable filters are a bank of filters with arbitrary orientations. Each filter in this bank is generated through a linear combination of a set of basis functions. In this paper, Gaussian derivative filters are applied to generate the basis functions. A first derivative filter for any direction $\theta$ can be defined as:

$$D_\theta = G_x \cos\theta + G_y \sin\theta \quad (9)$$

where $G_x$ and $G_y$ are the first $x$ derivative and the first $y$ derivative of a Gaussian function, while $\cos\theta$ and $\sin\theta$ are known as the interpolation functions of the basis functions $G_x$ and $G_y$. Varying $\sigma$ and $\theta$ values are set to locate the optimal parameters of the Steerable filter for DM detection. Similar to the Gaussian filter, the Steerable filter is convolved with each block through Eq. (8), where $G(x, y)$ now represents the Steerable filter. Table 5 shows an example of the Steerable feature value for a DM FHB and a Healthy FHB, respectively, where $\sigma = 2$ and $\theta = 210°$.
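A sketch of Eq. (9), steering the first Gaussian derivative to an arbitrary orientation; the filter size is an assumption, as the paper does not specify one.

```python
# Sketch of a first-order steerable filter: the x and y first derivatives
# of a Gaussian serve as basis filters, and a filter oriented at theta is
# their linear combination with cos/sin interpolation weights (Eq. (9)).
import numpy as np
from scipy.signal import convolve2d

def steerable_kernel(size, sigma, theta_deg):
    half = size // 2
    x, y = np.meshgrid(np.arange(-half, half + 1), np.arange(-half, half + 1))
    g = np.exp(-(x**2 + y**2) / (2 * sigma**2))
    gx = -x / sigma**2 * g                  # first x derivative of the Gaussian
    gy = -y / sigma**2 * g                  # first y derivative of the Gaussian
    t = np.deg2rad(theta_deg)
    return gx * np.cos(t) + gy * np.sin(t)  # Eq. (9)

def steerable_texture_value(block, size=13, sigma=2, theta_deg=210):
    response = convolve2d(block, steerable_kernel(size, sigma, theta_deg), mode="same")
    return response.mean()

block = np.random.randint(0, 256, (64, 64)).astype(float)
print(steerable_texture_value(block))
```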
2.5.3. Gabor filter
In the spatial domain, a 2-D Gabor filter is a Gaussian kernel function modulated by a sinusoidal plane wave [32]. We applied a Gabor filter bank consisting of 40 filters (five σ values × eight θ values), which was convolved with each facial key block; the feature value of each block is the mean of the 40 filter responses. Previously, the size of each Gabor filter was fixed at 39×39, with no explanation given for this size. In this paper, we set various filter sizes, σ, and θ values to extract the texture features from each block and to locate the best parameters of the Gabor filter for DM detection. To compute the texture value of each block, a 2-D Gabor filter is applied, defined as:

$$G(x, y) = e^{-\frac{x'^2 + \gamma^2 y'^2}{2\sigma^2}} \cos\left(2\pi \frac{x'}{\lambda}\right) \quad (10)$$

where $x' = x\cos\theta + y\sin\theta$, $y' = -x\sin\theta + y\cos\theta$, $\sigma$ is the variance, $\lambda$ is the wavelength, $\gamma$ is the aspect ratio of the sinusoidal function, and $\theta$ is the orientation. The texture feature value is calculated in the same way as for the Gaussian filter using Eq. (8), where $G(x, y)$ is now a Gabor filter. An example of FHB's Gabor feature vector for DM and Healthy is shown in Table 6, where the filter size is 16, $\sigma = 1$, and $\theta = 315°$.

Fig. 8. An example of a VT diagram.

Table 3
An example of VT's feature vector for a DM FHB and a Healthy FHB.

Class     Feature vector
DM        [872.58, 139.40, 519.47, 2.23, 110.51, 112.58, 155.09, 954.61, 112.96, 165.62, 200.96, 1.47, 235.52, 294.22, 220.68, 115.88]
Healthy   [502.43, 445.60, 435.40, 465.90, 294.54, 115.36, 127.49, 189.17, 172.75, 90.00, 146.86, 217.78, 1.33, 1.04, 1.05, 242.92]

Table 4
An example of FHB's Gaussian feature value for both DM and Healthy samples.

Class     Feature value
DM        189.7529
Healthy   216.1719
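A sketch of Eq. (10) for one block; the wavelength λ and aspect ratio γ values here are assumptions, since the paper does not report them.

```python
# Sketch of the Gabor texture value: build a Gabor kernel per Eq. (10),
# convolve it with a key block (Eq. (8)), and take the response mean.
import numpy as np
from scipy.signal import convolve2d

def gabor_kernel(size, sigma, theta_deg, lam=10.0, gamma=0.5):
    half = (size - 1) / 2
    coords = np.arange(size) - half
    x, y = np.meshgrid(coords, coords)
    t = np.deg2rad(theta_deg)
    xp = x * np.cos(t) + y * np.sin(t)       # x'
    yp = -x * np.sin(t) + y * np.cos(t)      # y'
    return np.exp(-(xp**2 + gamma**2 * yp**2) / (2 * sigma**2)) \
        * np.cos(2 * np.pi * xp / lam)       # Eq. (10)

def gabor_texture_value(block, size=16, sigma=1, theta_deg=315):
    response = convolve2d(block, gabor_kernel(size, sigma, theta_deg), mode="same")
    return response.mean()

block = np.random.randint(0, 256, (64, 64)).astype(float)
print(gabor_texture_value(block))
```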
Fig. 7. An example of LBP applied to the FHB of a DM (left) and Healthy (right) sample.
Table 5
An example of the Steerable feature value from a DM FHB and a Healthy FHB.

Class     Feature value
DM        0.0095
Healthy   0.0034
Table 6
An example of FHB's Gabor feature vector for a DM and a Healthy FHB.

Class     Feature vector
DM        [2.52, 0.60, 0.60, 0.51, 2.54, 0.53, 0.47, 0.56]
Healthy   [2.89, 0.53, 0.45, 0.44, 2.84, 0.46, 0.61, 0.70]
2.6. Model based texture feature family

The model based texture feature family considers a texture to be a stochastic, possibly periodic, two-dimensional image field. It therefore generally uses stochastic and generative models to represent images, and its texture features are characterized by the estimated model parameters [12,37]. The models in this family include fractal models, random field models, and so on. Here, MRF, a typical random field model, is used to extract texture features from each facial key block.
2.6.1. Markov Random Field (MRF)
MRF assumes that local information is sufficient to achieve a good global image representation. One major challenge is therefore to efficiently estimate the model parameters. There are four main steps in MRF extraction for computer vision [63]: (i) a set of nodes that may correspond to pixels or agglomerations of pixels is generated from the image; (ii) hidden variables corresponding to the nodes are introduced into a model; (iii) a joint probabilistic model is created over the pixel values and the hidden variables; and (iv) the groups are expressed based on the hidden variables and are often depicted as edges in the MRF graph. For each facial key block, the MRF feature vector has a dimensionality of 50, so it is difficult to show an example for a DM and a Healthy sample. The MRF feature value of each facial key block is represented as the mean MRF energy from each iteration; the 50-dimensional feature vector therefore consists of the mean MRF energy values from 50 iterations.
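Since the paper does not specify the exact MRF formulation, the following is a deliberately simplified Ising-style stand-in (not the authors' model) that only illustrates the idea of recording one mean-energy value per iteration as a feature dimension.

```python
# Simplified Ising-style MRF sketch: binarize the block, run ICM-style
# sweeps with a pairwise potential, and record the mean energy after
# each iteration as one feature dimension (50 iterations -> 50 dims).
import numpy as np

def mrf_feature(block, potential=0.1, iterations=50):
    labels = np.where(block > block.mean(), 1, -1)  # states in {-1, +1}
    energies = []
    for _ in range(iterations):
        # Sum of the four neighboring states (wrap-around boundary).
        nb = (np.roll(labels, 1, 0) + np.roll(labels, -1, 0)
              + np.roll(labels, 1, 1) + np.roll(labels, -1, 1))
        labels = np.where(nb >= 0, 1, -1)  # ICM step: take the majority state
        nb = (np.roll(labels, 1, 0) + np.roll(labels, -1, 0)
              + np.roll(labels, 1, 1) + np.roll(labels, -1, 1))
        energies.append(-potential * (labels * nb).mean())  # mean energy
    return np.array(energies)

block = np.random.randint(0, 256, (64, 64)).astype(float)
print(mrf_feature(block).shape)  # (50,)
```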
3. Results and discussion

For the experiments, the 284 DM samples and a Healthy dataset consisting of 231 samples are classified via k-NN and SVM with 10-fold cross validation, based on the facial key block texture features extracted by the eight extractors. In all the experiments, k equals 1 for k-NN, while the linear kernel function is used in SVM. Other values of k and different kernel functions were tested for k-NN and SVM respectively, but did not improve upon the accuracy. The focus of this paper is to extensively analyze different texture features for our application; therefore, the traditional classifiers k-NN and SVM were chosen for their high performance and robustness.

The 284 DM samples were collected from The Hong Kong Foundation for Research and Development in Diabetes, Prince of Wales Hospital, Hong Kong SAR in mid-2012. The 231 Healthy samples were captured in The Guangdong Provincial TCM Hospital, Guangdong, where 142 Healthy samples were captured in late 2011 and the other 89 were taken in 2015. In 10-fold cross validation, the facial image dataset is split randomly into 10 parts: the first 9 parts are equal in size (28 DM and 23 Healthy each) and the size of the 10th part is 56 (32 DM and 24 Healthy). In each step, one of the 10 parts is the testing set and the rest of the data is the training set. This is repeated until each part has been the testing set. The final accuracy is the mean of the ten accuracies from the ten steps, where accuracy is the proportion of testing data that is classified correctly. In this paper, three performance measurements, accuracy, sensitivity, and specificity, are used to evaluate the experimental results. They are defined as:

$$\mathrm{Accuracy} = \frac{\mathrm{True\ Pos.} + \mathrm{True\ Neg.}}{\mathrm{All\ Data\ Number}} \quad (11)$$

$$\mathrm{Sensitivity} = \frac{\mathrm{True\ Pos.}}{\mathrm{True\ Pos.} + \mathrm{False\ Neg.}} \quad (12)$$

$$\mathrm{Specificity} = \frac{\mathrm{True\ Neg.}}{\mathrm{True\ Neg.} + \mathrm{False\ Pos.}} \quad (13)$$

The experimental results are presented in the order of the methods in Table 1. In Figs. 9–16, the black line represents accuracy, the light purple line sensitivity, and the blue line specificity.

3.1. Statistical texture feature family results

In accordance with Subsection 2.2, the 3 methods (IGH, GLCM, and LBP) from the first family were used to extract texture features from each facial key block. The DM detection results using these 3 methods are discussed in this subsection. Figs. 9–11 show the results using the 3 methods with k-NN and SVM.

3.1.1. IGH results
The results based on IGH with its histogram bin number varying, using k-NN and SVM, are illustrated in Figs. 9a and b, respectively. According to Subsection 2.3.1, the histogram bin number equals $C + 1$ with $1 \le C \le 255$, so the range of the histogram bin number is [2, 256]. In order to focus on the optimal parameter value (the histogram bin number) of IGH, different bin numbers ($[2^1, 2^2, \ldots, 2^8] = [2, 4, \ldots, 256]$) were evaluated.
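A sketch of this evaluation protocol with scikit-learn; StratifiedKFold approximates (but does not exactly reproduce) the per-fold class counts described above, and the feature matrix here is random placeholder data.

```python
# Sketch: 10-fold cross validation with k-NN (k = 1) and a linear SVM,
# reporting accuracy, sensitivity, and specificity per Eqs. (11)-(13).
# X would hold the per-sample texture feature vectors.
import numpy as np
from sklearn.model_selection import StratifiedKFold
from sklearn.neighbors import KNeighborsClassifier
from sklearn.svm import SVC
from sklearn.metrics import confusion_matrix

def evaluate(clf, X, y, folds=10):
    accs, sens, specs = [], [], []
    for train, test in StratifiedKFold(n_splits=folds, shuffle=True).split(X, y):
        clf.fit(X[train], y[train])
        tn, fp, fn, tp = confusion_matrix(y[test], clf.predict(X[test])).ravel()
        accs.append((tp + tn) / (tp + tn + fp + fn))   # Eq. (11)
        sens.append(tp / (tp + fn))                    # Eq. (12)
        specs.append(tn / (tn + fp))                   # Eq. (13)
    return np.mean(accs), np.mean(sens), np.mean(specs)

X = np.random.rand(515, 1024)            # placeholder: 284 DM + 231 Healthy
y = np.array([1] * 284 + [0] * 231)      # 1 = DM, 0 = Healthy
for clf in (KNeighborsClassifier(n_neighbors=1), SVC(kernel="linear")):
    print(evaluate(clf, X, y))
```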
Fig. 9. IGH k-NN and SVM results with bin numbers changing.
Fig. 10. GLCM k-NN and SVM results with different property combinations.
As shown in Fig. 9a, the accuracy and sensitivity increased with the number of bins. The best accuracy of 93.62%, with a sensitivity of 99.64% and a specificity of 86.18%, was achieved when the histogram bin number was 256. Fig. 9b presents the results of IGH using SVM. The highest accuracy (99.02%) was also reached with 256 bins, while its sensitivity and specificity were 99.64% and 98.26%, respectively. The specificity was better than the sensitivity when the histogram bin number was smaller than 64; beyond that, the opposite was true.
3.1.2. GLCM results
From Subsection 2.3.2, there are four properties (Contrast, Correlation, Energy, and Homogeneity) in the GLCM matrix. Hence, all 15 combinations of the four properties were used to represent each facial key block. The matrix property combinations are shown in Table 7. The results based on GLCM with the various property combinations, using k-NN and SVM, are depicted in Figs. 10a and b, respectively. As shown in Fig. 10a, using k-NN the best accuracy of 70.99%, with a sensitivity of 81.38% and a specificity of 58.12%, was obtained using Contrast + Correlation. The highest accuracy using SVM (71.97%) was 0.98% higher than that of k-NN, with a sensitivity of 89.51% and a specificity of 50.33%, using only Correlation.
Fig. 11. LBP k-NN and SVM results with varying cell sizes.
Fig. 12. VT k-NN and SVM results with different token numbers.
3.1.3. LBP results
LBP, a higher order statistical texture feature extraction method, was used as the third extractor for facial key block feature extraction. As the size of each facial key block is 64×64, the range of the cell size N is from 3 to 64 (refer to Subsection 2.3.3). The results based on LBP with N varying, using k-NN and SVM, are presented in Fig. 11. As shown in Fig. 11a, the accuracy and sensitivity using k-NN increased significantly from N = 3 to 30, and changed little after that. The best accuracy of 91.89%, with a sensitivity of 91.56% and a specificity of 92.25%, was reached when N = 32. The sensitivity based on LBP using SVM was always higher than the specificity for all N values, while the accuracy lay between them (refer to Fig. 11b). With a sensitivity of 93.94% and a specificity of 90.53%, the highest accuracy using SVM was 92.44%, where N was 21.

3.2. Structural texture feature family results

In this family, only one method (VT) was used to extract texture features from each facial key block. For this paper, the minimum of each region in the block was used as the token of VT, and the mean distance between the minimum and its corresponding cell points was used to represent each facial key block. The token number ranges over [2, 4, 8, 16], as the maximum token number is ImageSize/4 and the size of each block is 64. The results based on VT with its various token numbers, using k-NN and SVM, are depicted in Fig. 12. In both Figs. 12a and b, the best accuracy was obtained when the token number was 4. However, the best accuracy of 54.15% using SVM was better than that of 51.60% using k-NN. The corresponding sensitivity and specificity with k-NN were 54.46% and 48.04%, and those of SVM were in turn 76.43% and 26.52%.

3.3. Signal processing based texture feature family results

Three filters, the Gaussian, Steerable, and Gabor filters, were applied to the facial key blocks to extract texture features. Figs. 13–15 show the results based on the three filters with different parameters, using k-NN and SVM.

3.3.1. Gaussian filter results
The Gaussian filter, as the most commonly used filter in pattern recognition, was applied first. From Subsection 2.5.1, there are two parameters for the Gaussian filter: filter size and σ. The classification results by k-NN and SVM based on the Gaussian filter with its size and σ increasing are illustrated in Fig. 13. The ranges of the filter size and σ are 1:1:64 and 1:1:8, respectively. The highest accuracy using k-NN based on the Gaussian filter, with size = 13 and σ = 3, was 87.98%, with a sensitivity of 90.58% and a specificity of 84.84%. The accuracy of odd filter sizes was always better than that of even sizes (refer to Fig. 13a). From filter size = 12 to 64, the highest accuracy of even-sized filters was 87.20%, and that of odd-sized filters was 87.98%. As shown in Fig. 13c (filter size = 13 and σ changing), the best accuracy with the same sensitivity and specificity remained unchanged. The results using the Gaussian filter with its filter size and σ increasing, classified by SVM, are presented in Figs. 13b and d, respectively. In both figures, it is obvious that the results remained the same as the size and σ changed in the Gaussian filter. The best accuracy was 82.58%, with a better sensitivity of 98.93% and a worse specificity of 62.17% compared to k-NN.

3.3.2. Steerable filter results
The Steerable filter, a derivative of the Gaussian filter, was the second one used in this family. According to Subsection 2.5.2, the Steerable filter has two parameters: θ and σ. The ranges of these parameters are 0°:15°:360° and 1:1:8, respectively. The k-NN and SVM classification results using the Steerable filter with varying θ and σ are depicted in Fig. 14. The highest accuracy of 73.53%, with a sensitivity of 75.94% and a specificity of 70.54%, using k-NN was achieved where θ = 210° and σ = 2. As shown in Fig. 14a (θ changing and σ = 2), compared with the accuracy, the sensitivity and specificity fluctuated considerably. The fluctuation range of the accuracy was from 50% to 74%. The results based on the Steerable filter with θ = 210° and σ changing, using k-NN, are illustrated in Fig. 14b. From σ = 2, the specificity fell while the sensitivity rose with increasing σ. The accuracy had two peaks, at σ = 2 and 4. The highest accuracy of 78.02%, with a sensitivity of 82.37% and a specificity of 72.70%, based on the Steerable filter using SVM was obtained with θ = 315° and σ = 4. According to Fig. 14c (θ changing and σ = 4), as in Fig. 14a the sensitivity and specificity oscillated, while the accuracy had a smaller range ([55%, 78.1%]). The results with constant θ and changing σ are presented in Fig. 14d. The specificity rose from 50.65% to 82.68% with increasing σ, while from σ = 2 the sensitivity fell from 87.68% to 53.30%. The accuracy had a peak of 78.02% at σ = 4.

3.3.3. Gabor filter results
Similar to the human optical system, the Gabor filter is an efficient texture feature extractor. In this paper, the Gabor filter has three parameters: filter size, σ, and θ (refer to Subsection 2.5.3). The ranges of these parameters are 1:1:64, 1:1:8, and 0°:45°:315°, respectively. Fig. 15 shows the results based on the Gabor filter with size, σ, and θ increasing, classified by k-NN and SVM. The highest accuracy obtained through k-NN was 96.18%, with a sensitivity of 98.66% and a specificity of 93.12%, where filter size = 16, σ = 5, and θ = 315°. As shown in Fig. 15a, from size = 1 to 18, the results of all three performance metrics increased significantly, while afterwards accuracy, sensitivity, and specificity fluctuated only a little. The results based on the Gabor filter with changing σ and constant filter size and θ, using k-NN, are presented in Fig. 15c. In this figure, the accuracy decreased as σ went up. On the other hand, the accuracy rose with increasing θ (refer to Fig. 15e). The best accuracy of 83.72%, with a sensitivity of 87.90% and a specificity of 78.44%, using SVM was reached with filter size = 54, σ = 5, and θ = 315°. From Fig. 15b (size changing, σ = 5, and θ = 315°) there was only a minor change in accuracy with increasing filter size, while in both Figs. 15d and f the accuracy increased along with σ and θ, respectively.
Fig. 13. Gaussian filter k-NN and SVM results.
3.4. Model based texture feature family results

MRF, from the fourth texture feature family, was the last extractor used in this paper. MRF has two parameters, potential and iteration, ranging over 0.1:0.1:0.5 and 10:10:50, respectively (refer to Subsection 2.6.1). Fig. 16 presents the detection results based on MRF with the two parameters changing, using k-NN and SVM. The highest accuracy of 69.78%, with a sensitivity of 76.29% and a specificity of 61.83%, using k-NN was obtained where the potential was 0.1 and the iteration number was 50. The results classified by k-NN based on MRF with potential and iteration varying are illustrated in Figs. 16a and b, respectively. The accuracy fell with an increase in potential, while it rose with increasing iteration. According to Figs. 16c and d, the highest accuracy using SVM was 71.61%, with a sensitivity of 84.87% and a specificity of 55.14%. The accuracy rose with an increase in potential in Fig. 16c, but changed little in Fig. 16d.
Fig. 14. Steerable filter k-NN and SVM results.
3.5. Comparison

Fig. 17 and Table 8 summarize the best results from the eight facial key block texture feature extractors. In Fig. 17, the accuracies of k-NN are represented by the blue bars and the red bars depict the results using SVM. As shown in Fig. 17, the 6 algorithms from the first and third texture feature families are more effective than the 2 methods of the second and fourth families, as all of their 12 accuracies (6 methods × 2 classifiers) were higher than the other 4 accuracies (2 methods × 2 classifiers). According to Section 2.2, the first and third texture feature families focus on information such as the local area of the image, while the other two families consider the whole image information. Hence, the first and third families of methods are more suitable for extracting texture features from the four facial key blocks than the other two, which is consistent with the experimental results. In Table 8, there are 5 results higher than 90%: IGH (k-NN and SVM), LBP (k-NN and SVM), and Gabor (k-NN), where the highest accuracy of 99.02% was obtained through IGH with bin number = 256 using SVM. The four facial key blocks contain only skin; therefore, the texture feature extractors that focus on the structure of the block are not suitable for this application. IGH, however, only calculates the distribution of the block pixel values and is more appropriate. The result shows that IGH can represent the facial key blocks well for DM detection, while the other seven extractors may ignore some useful information in the block.

Another way to compare and contrast the different extractors is to calculate their running time (in milliseconds, shown in Table 9). In this table, the first column is the name of the facial key block texture feature extraction method, the following column gives its corresponding parameter(s), the third column shows the texture extraction time (for all four facial key blocks), the next column gives the classifier used, the one after that shows its classification time, and the last column is the total time. The parameters of each texture extractor are the same as in Table 8, which shows the best result of each feature extractor and classifier. From Table 9, it is obvious that IGH took the least amount of time (6.9184 ms) to extract its texture feature using 256 bins. For all texture extractors, k-NN was faster than SVM. In terms of the total running time, IGH + k-NN performed the fastest at 7.2893 ms, while the longest time of 1587.2427 ms (1.587 s) was obtained by Gabor + SVM. This is reasonable, as the Gabor filter needs to set up a 2-D filter bank first, which requires time, and further time is required to produce a response value for each pixel. The first family of texture feature extractors, on the other hand, simply calculates the pixel value distribution, which is fast and easy. This is why, on average, its extraction time is quicker than that of the other three families.
3.6. Facial key block size analysis

In order to explore the influence that various facial key block sizes have on DM detection, experiments were conducted using IGH with bin number = 256 to extract texture features from blocks with sizes ranging over 1:1:64, classified through SVM with the linear kernel function. The result is shown in Fig. 18 and analyzed in the following. In this figure, the blue stars indicate the classification accuracy for the corresponding block size. Looking at Fig. 18, it is obvious that the accuracy increased with increasing facial key block size, where the best result, with a block size of 59, is indicated with a red star. For block sizes ranging from 19 to 64, the accuracy fluctuated only a little, with a mean accuracy of 98.65% and a standard deviation of 0.00197. We can extrapolate from this result that a block size greater than 64 would not deviate much from this mean. Therefore, a larger block size ( > 64) would not only add to the computational cost, but would also have only a minor effect on the accuracy.
Fig. 15. Gabor filter k-NN and SVM results.
Fig. 16. MRF k-NN and SVM results.
Table 7
Matrix property combination list.

1. Contrast                               2. Correlation
3. Energy                                 4. Homogeneity
5. Contrast + Correlation                 6. Contrast + Energy
7. Contrast + Homogeneity                 8. Correlation + Energy
9. Correlation + Homogeneity              10. Energy + Homogeneity
11. Contrast + Correlation + Energy       12. Contrast + Correlation + Homogeneity
13. Contrast + Energy + Homogeneity       14. Correlation + Energy + Homogeneity
15. All

4. Conclusion

In recent years, more and more researchers have become interested in developing non-invasive methods to detect DM, particularly through facial analysis. There are two types of features that can be extracted from facial images: color and texture. Given the lack of research analyzing facial texture features, especially for DM detection, in this paper we explored the influence that different texture feature extractors have at detecting this disease. In texture classification, there exist many texture feature extraction methods. They can be categorized into four families: (1) statistical, (2) structural, (3) signal processing based, and (4) model based. For each facial key block representing a facial specific region, eight methods (with different parameters) from the four families were applied to extract texture features. Of the eight methods, 3 (IGH, GLCM, and LBP) are from the first family, 1 (VT) is from the second family, another 3 methods (Gaussian, Steerable, and Gabor) belong to the third family, and 1 method (MRF) is from the last family. The experiments applied each of the eight extractors to the facial key blocks, classified the resulting texture feature vectors with two classifiers (k-NN and SVM), and used 10-fold cross validation. Based on these results, it can be concluded that IGH from the first family outperforms the other methods at DM detection, with an accuracy of 99.02%, a sensitivity of 99.64%, and a specificity of 98.26% using 256 bins with a block size of 59 via SVM. Furthermore, IGH took the least amount of time (6.9184 ms) to extract the texture features. These results make sense, since IGH performs calculations on the distribution of the block pixel values rather than focusing on the block structure. As part of our future work, we will apply more classifiers and enlarge the facial image dataset to include more diseases.
Fig. 17. Best accuracies of the eight texture feature extractors using k-NN and SVM.
Table 8 Summary of the best results.
Conflict of interest The authors declare that there is no conflict of interest regarding the publication of this paper.
Acknowledgements

This work was supported by the Research Grants of the University of Macau [MYRG2015-00049-FST, MYRG2015-00050-FST]; the Science and Technology Development Fund (FDCT) of Macau [128/2013/A, 124/2014/A3]; and the Macau-China Joint Project [008-2014-AMJ]. This research project was also supported by the National Natural Science Foundation of China [61273244 and 61602540].
Table 9
Running time (in milliseconds) of each sample based on the eight extractors.

Method      Parameter(s)                    Extraction time (ms)   Classifier   Classification time (ms)   Total time (ms)
IGH         Bin number=256                  6.9184                 k-NN         0.3709                     7.2893
IGH         Bin number=256                  6.9184                 SVM          60.8505                    67.7689
GLCM        Contrast + Correlation          9.0252                 k-NN         0.1495                     9.1748
GLCM        Correlation                     9.0233                 SVM          0.2155                     9.2388
LBP         Cell size=32                    11.4951                k-NN         0.5476                     12.0427
LBP         Cell size=21                    9.1126                 SVM          1.0796                     10.1922
VT          Token number=4                  9.9903                 k-NN         0.1592                     10.1495
VT          Token number=4                  9.9903                 SVM          131.3184                   141.3087
Gaussian    Size=13, σ=3                    8.7534                 k-NN         0.1553                     8.9087
Gaussian    Size=1, σ=1                     7.4796                 SVM          66.6913                    74.1709
Steerable   σ=2, θ=210°                     11.8854                k-NN         0.1476                     12.0330
Steerable   σ=4, θ=315°                     14.1437                SVM          0.1845                     14.3282
Gabor       Size=16, σ=1, θ=315°            34.9534                k-NN         0.1689                     35.1223
Gabor       Size=54, σ=5, θ=315°            1586.2718              SVM          0.9709                     1587.2427
MRF         Potential=0.1, Iteration=50     374.6563               k-NN         0.1961                     374.8524
MRF         Potential=0.5, Iteration=20     174.5961               SVM          7.3146                     181.9107
Fig. 18. Classification results of different block sizes using IGH and SVM.

References

[1] Diabetes, 〈http://www.who.int/topics/diabetes_mellitus/en/〉 (accessed 09.01.17).
[2] W.H. Organization, Global report on diabetes.
[3] I.D. Federation, IDF diabetes atlas, 7th edn, Brussels, Belgium: International Diabetes Federation.
[4] A.D. Association, et al., Classification and diagnosis of diabetes, Diabetes Care 38 (Supplement 1) (2015) S8–S16.
[5] P. Drouin, J. Blickle, B. Charbonnel, E. Eschwege, P. Guillausseau, P. Plouin, J. Daninos, N. Balarac, J. Sauvanet, Diagnosis and classification of diabetes mellitus: the new criteria, Diabetes Metab. 25 (1) (1999) 72–83.
[6] J.B. O'Sullivan, C. Mahan, Glucose tolerance test, in: Variability in Pregnant and Non-pregnant Women, 1966, pp. 345–351.
[7] B. Zhang, B. Kumar, D. Zhang, Noninvasive diabetes mellitus detection using facial block color with a sparse representation classifier, Biomed. Eng. IEEE Trans. 61 (4) (2014) 1027–1033.
[8] L.-K. Soh, C. Tsatsoulis, Texture analysis of SAR sea ice imagery using gray level co-occurrence matrices, Geosci. Remote Sens. IEEE Trans. 37 (2) (1999) 780–795.
[9] P. Vaidyanathan, T.Q. Nguyen, Eigenfilters: a new approach to least-squares FIR filter design and applications including Nyquist filters, Circuits Syst. IEEE Trans. 34 (1) (1987) 11–23.
[10] G. New, M. Yates, J. Woerdman, G. McDonald, Diffractive origin of fractal resonator modes, Opt. Commun. 193 (1) (2001) 261–266.
[11] T. Ahonen, A. Hadid, M. Pietikainen, Face description with local binary patterns: application to face recognition, Pattern Anal. Mach. Intell. IEEE Trans. 28 (12) (2006) 2037–2041.
[12] X. Xie, M. Mirmehdi, A galaxy of texture features, Handb. Texture Anal. (2008) 375–406.
[13] K. Tsukiyama, J.A. Acorda, H. Yamada, Evaluation of superficial digital flexor tendinitis in racing horses through gray scale histogram analysis of tendon ultrasonograms, Vet. Radiol. Ultrasound 37 (1) (1996) 46–50.
[14] J.-H. Han, S. Yang, B.-U. Lee, A novel 3-D color histogram equalization method with uniform 1-D gray scale histogram, Image Process. IEEE Trans. 20 (2) (2011) 506–512.
[15] V.M. Mohan, R.K. Durga, S. Devathi, K.S. Raju, Image processing representation using binary image; grayscale, color image, and histogram, in: Proceedings of the Second International Conference on Computer and Communication Technologies, Springer, 2016, pp. 353–361.
[16] J. Virmani, V. Kumar, N. Kalra, N. Khandelwal, Prediction of cirrhosis based on singular value decomposition of gray level co-occurrence matrix and a neural network classifier, in: Developments in E-systems Engineering (DeSE), 2011, IEEE, 2011, pp. 146–151.
[17] J.-h. Feng, Y.-j. Yang, Study of texture images extraction based on gray level co-occurrence matrix, Beijing Surv. Mapp. 3 (2007) 19–22.
[18] B. Chanda, B. Chaudhuri, D.D. Majumder, On image enhancement and threshold selection using the gray-level co-occurrence matrix, Pattern Recognit. Lett. 3 (4) (1985) 243–251.
[19] Z. Guo, L. Zhang, D. Zhang, A completed modeling of local binary pattern operator for texture classification, Image Process. IEEE Trans. 19 (6) (2010) 1657–1663.
[20] G. Zhang, X. Huang, S.Z. Li, Y. Wang, X. Wu, Boosting local binary pattern (LBP)-based face recognition, in: Advances in Biometric Person Authentication, Springer, 2004, pp. 179–186.
[21] L. Liu, S. Lao, P.W. Fieguth, Y. Guo, X. Wang, M. Pietikäinen, Median robust extended local binary pattern for texture classification, IEEE Trans. Image Process. 25 (3) (2016) 1368–1381.
[22] J. Kim, S. Yu, D. Kim, K.-A. Toh, S. Lee, An adaptive local binary pattern for 3D hand tracking, Pattern Recognit. 61 (2017) 139–152.
[23] Q. Du, M. Emelianenko, L. Ju, Convergence of the Lloyd algorithm for computing centroidal Voronoi tessellations, SIAM J. Numer. Anal. 44 (1) (2006) 102–119.
[24] Q. Du, M. Gunzburger, Grid generation and optimization based on centroidal Voronoi tessellations, Appl. Math. Comput. 133 (2) (2002) 591–607.
[25] M. Tanemura, T. Ogawa, N. Ogita, A new algorithm for three-dimensional Voronoi tessellation, J. Comput. Phys. 51 (2) (1983) 191–207.
[26] I.T. Young, L.J. Van Vliet, Recursive implementation of the Gaussian filter, Signal Process. 44 (2) (1995) 139–151.
[27] G. Deng, L. Cahill, An adaptive Gaussian filter for noise reduction and edge detection, in: Nuclear Science Symposium and Medical Imaging Conference, 1993, IEEE Conference Record, IEEE, 1993, pp. 1615–1619.
[28] K. Ito, K. Xiong, Gaussian filters for nonlinear filtering problems, Autom. Control IEEE Trans. 45 (5) (2000) 910–927.
[29] E.P. Simoncelli, H. Farid, Steerable wedge filters for local orientation analysis, IEEE Trans. Image Process. 5 (9) (1996) 1377–1382.
[30] W.T. Freeman, E.H. Adelson, The design and use of steerable filters, IEEE Trans. Pattern Anal. Mach. Intell. 9 (1991) 891–906.
[31] M. Nieto, L. Salgado, Real-time vanishing point estimation in road sequences using adaptive steerable filter banks, in: Advanced Concepts for Intelligent Vision Systems, Springer, 2007, pp. 840–848.
[32] J.-K. Kamarainen, V. Kyrki, H. Kälviäinen, Invariance properties of Gabor filter-based features—overview and applications, Image Process. IEEE Trans. 15 (5) (2006) 1088–1099.
[33] L.-L. Huang, A. Shimizu, H. Kobatake, Robust face detection using Gabor filter features, Pattern Recognit. Lett. 26 (11) (2005) 1641–1649.
[34] R. Mehrotra, K.R. Namuduri, N. Ranganathan, Gabor filter-based edge detection, Pattern Recognit. 25 (12) (1992) 1479–1494.
[35] S.Z. Li, Markov Random Field Modeling in Image Analysis, Springer Science & Business Media, 2009.
[36] R. Paget, D. Longstaff, Texture synthesis via a non-parametric Markov random field, Proc. DICTA-95 Digit. Image Comput.: Tech. Appl. 1 (1995) 547–552.
[37] G.R. Cross, A.K. Jain, Markov random field texture models, Pattern Anal. Mach. Intell. IEEE Trans. 1 (1983) 25–39.
[38] L.E. Peterson, K-nearest neighbor, Scholarpedia 4 (2) (2009) 1883.
[39] J.M. Keller, M.R. Gray, J.A. Givens, A fuzzy k-nearest neighbor algorithm, Syst., Man Cybern. IEEE Trans. 4 (1985) 580–585.
[40] K. Fukunaga, P.M. Narendra, A branch and bound algorithm for computing k-nearest neighbors, Comput. IEEE Trans. 100 (7) (1975) 750–753.
[41] I. Steinwart, A. Christmann, Support Vector Machines, Springer Science & Business Media, 2008.
[42] B. Scholkopf, A.J. Smola, Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond, MIT Press, 2001.
[43] C.J. Burges, A tutorial on support vector machines for pattern recognition, Data Min. Knowl. Discov. 2 (2) (1998) 121–167.
[44] R. Kohavi, et al., A study of cross-validation and bootstrap for accuracy estimation and model selection, in: IJCAI, 14, 1995, pp. 1137–1145.
[45] M.W. Browne, R. Cudeck, Single sample cross-validation indices for covariance structures, Multivar. Behav. Res. 24 (4) (1989) 445–455.
[46] G.H. Golub, M. Heath, G. Wahba, Generalized cross-validation as a method for choosing a good ridge parameter, Technometrics 21 (2) (1979) 215–223.
[47] H. Wang, Huangdi Neijing, New World Publish, 1999.
[48] Z. Bing, W. Hongcai, Basic Theories of Traditional Chinese Medicine, Singing Dragon, 2010.
[49] Z. Bing, W. Hongcai, Diagnostics of Traditional Chinese Medicine, Singing Dragon, 2010.
[50] S.-W. Youn, E.-S. Park, D.-H. Lee, C.-H. Huh, K.-C. Park, Does facial sebum excretion really affect the development of acne?, Br. J. Dermatol. 153 (5) (2005) 919–924.
[51] G. Srinivasan, G. Shobha, Statistical texture analysis, in: Proceedings of World Academy of Science, Engineering and Technology, 36, 2008, pp. 1264–1269.
[52] T. Ojala, M. Pietikäinen, Texture classification, Machine Vision and Media Processing Unit, University of Oulu, Finland, 2004.
[53] R.M. Haralick, Statistical and structural approaches to texture, Proc. IEEE 67 (5) (1979) 786–804.
[54] M. del Pozo-Baños, C.M. Travieso-González, J.R. Ticay-Rivas, J.B. Alonso, J. Arroyo, J. Cabrera-Falcón, L. Sánchez-Chavez, M. Ramírez-Bogantes, S.T. Pérez, Image processing for pollen classification, INTECH Open Access Publisher, 2012.
[55] Using a gray-level co-occurrence matrix (GLCM), 〈http://matlab.izmiran.ru/help/toolbox/images/enhanc15.html〉 (accessed 12.06.16).
[56] T. Ojala, M. Pietikainen, D. Harwood, Performance evaluation of texture measures with classification based on Kullback discrimination of distributions, in: Proceedings of the 12th IAPR International Conference on Pattern Recognition, Vol. 1, IEEE, 1994, pp. 582–585.
[57] M. Tuceryan, A.K. Jain, Texture segmentation using Voronoi polygons, IEEE Trans. Pattern Anal. Mach. Intell. 12 (2) (1990) 211–216.
[58] M. Ramella, W. Boschin, D. Fadda, M. Nonino, Finding galaxy clusters using Voronoi tessellations, Astron. Astrophys. 368 (3) (2001) 776–786.
[59] C. Duyckaerts, G. Godefroy, Voronoi tessellation to study the numerical density and the spatial distribution of neurones, J. Chem. Neuroanat. 20 (1) (2000) 83–92.
[60] M. Boldt, R. Weiss, E. Riseman, Token-based extraction of straight lines, Syst. Man Cybern. IEEE Trans. 19 (6) (1989) 1581–1594.
[61] M. Tüceryan, A.K. Jain, Texture segmentation using Voronoi polygons, Pattern Anal. Mach. Intell. IEEE Trans. 12 (2) (1990) 211–216.
[62] J. Canny, A computational approach to edge detection, Pattern Anal. Mach. Intell. IEEE Trans. 6 (1986) 679–698.
[63] A. Blake, P. Kohli, C. Rother, Markov Random Fields for Vision and Image Processing, MIT Press, 2011.
Ting Shu is a PhD student in the Department of Computer and Information Science at the University of Macau. Her research interests include pattern recognition, medical biometrics, texture classification, and medical image analysis.

Bob Zhang is an Assistant Professor in the Department of Computer and Information Science at the University of Macau. His research interests include biometrics, pattern recognition, image processing, and medical image analysis.

Yuan Yan Tang is a Chair Professor in the Department of Computer and Information Science at the University of Macau. His research interests include pattern analysis and machine intelligence, knowledge and data engineering, wavelet and multiresolution analysis, information security, as well as Chinese computing.