Classification of the trabecular bone structure of osteoporotic patients using machine vision

Classification of the trabecular bone structure of osteoporotic patients using machine vision

Accepted Manuscript Classification of the trabecular bone structure of osteoporotic patients using machine vision Anushikha Singh, Malay Kishore Dutta...

1MB Sizes 0 Downloads 18 Views

Accepted Manuscript Classification of the trabecular bone structure of osteoporotic patients using machine vision Anushikha Singh, Malay Kishore Dutta, Rachid Jennane, Eric Lespessailles PII:

S0010-4825(17)30337-2

DOI:

10.1016/j.compbiomed.2017.10.011

Reference:

CBM 2804

To appear in:

Computers in Biology and Medicine

Received Date: 17 January 2017 Revised Date:

22 September 2017

Accepted Date: 11 October 2017

Please cite this article as: A. Singh, M.K. Dutta, R. Jennane, E. Lespessailles, Classification of the trabecular bone structure of osteoporotic patients using machine vision, Computers in Biology and Medicine (2017), doi: 10.1016/j.compbiomed.2017.10.011. This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

ACCEPTED MANUSCRIPT

RI PT

Classification of the Trabecular Bone Structure of Osteoporotic Patients using Machine Vision Anushikha Singh1, Malay Kishore Dutta1, Rachid Jennane2, Eric Lespessailles3 1

M AN U

SC

Department of Electronics & Communication Engineering Amity University, Noida, Uttar Pradesh, India 2 Univ. Orléans, I3MTO Laboratory, EA 4708, 45067 Orléans, France 3 Univ. Orléans, I3MTO Laboratory, EA 4708 and Hospital of Orleans, 45067 Orléans, France Emails: [email protected], [email protected] [email protected], [email protected]

AC C

EP

TE D

Abstract: Osteoporosis is a common bone disease which often leads to fractures. Clinically, the major challenge for the automatic diagnosis of osteoporosis is the complex architecture of bones. The clinical diagnosis of osteoporosis is conventionally done using Dual-energy X-ray Absorptiometry (DXA). This method has specific limitations, however, such as the large size of the instrument, a relatively high cost and limited availability. The method proposed here is based on the automatic processing of X-ray images. The bone X-ray image was statistically processed and strategically reformed to extract discriminatory statistical features of different orders. These features were used for machine learning for the classification of two populations composed of osteoporotic and healthy subjects. Four classifiers - support vector machine (SVM), k-nearest neighbors, Naïve Bayes and artificial neural network - were used to test the performance of the proposed method. Tests were performed on X-ray images of the calcaneus bone collected from the hospital of Orleans. The results are significant in terms of accuracy and time complexity. Experimental results indicate a classification rate of 98% using an SVM classifier which is encouraging for automatic osteoporosis diagnosis using bone X-ray images. The low time complexity of the proposed method makes it suitable for real time applications. Keywords: Osteoporosis, Bone X-ray Images, Image analysis, Feature Extraction, Supervised Classification.

1 Introduction Osteoporosis is a Greek term meaning “porous bone”. It is the most common disease of bones, and causes loss of bone density or insufficient bone formation, making the bones more susceptible to fractures [1]. Osteoporosis may be considered as a silent disease as no symptoms occur in the early stages of the disease. This silent disease

1

ACCEPTED MANUSCRIPT

TE D

M AN U

SC

RI PT

affects a large part of the population worldwide from a certain age. According to the estimation of the World Health Organization (WHO), osteoporosis affects more than 75 million people in the USA, Europe and Japan [2] and is the reason for more than 2.3 million fractures yearly in Europe and the USA alone. Currently, it is estimated that more than 200 million people all over the world are affected from this disease of bone [3]. It is a very common disease in India as well because of increasing longevity and a larger proportion of the population over the age of 50 years. It is estimated that more than 50 million people in India suffer from osteoporosis [3]. Bone in humans is living, growing tissue which is mainly composed of protein, calcium and collagen. The strength and flexibility of bone depend on the combination of calcium and collagen. Osteoporosis is a consequence of a steady loss of collagen and calcium, which leads to a gradual decrease in bone density and deterioration of bone microarchitecture [4]. Bones become weak, and fractures thus occur frequently, mainly in the wrist, hip, spine or femur. Fig. 1 shows the bone micro-architecture for healthy bone and osteoporotic bone.

Healthy Bone

Osteoporotic Bone

Fig. 1: Bone micro-architecture

AC C

EP

Osteoporosis is more common in women than in men, because generally women have smaller and thinner bones than men and also due to the hormonal changes that occur during menopause. The conventional technique used to measure bone mineral density (BMD) is Dual-Energy X-ray Absorptiometry (DXA), which emits an extremely low dose of ionizing radiation compared to radiographs. The results of texture analysis of the bone structure on radiographs have been compared to BMD in several studies [13]. Conventional bone radiographs are considered as a cheaper alternative to DXA. Moreover, it has been shown that texture analysis of X-ray radiographs is well suited to assess changes in trabecular bone architecture [41, 42].

The aim of the approach proposed here is to complete the diagnosis achieved in clinical routine based on the BMD measured by DXA, by characterizing and quantifying the quality of the bone micro-architecture. The main issue is how to improve the early diagnosis of osteoporosis and assess bone quality parameters on Xray radiographs. The calcaneus was used because of the limited amount of soft tissues surrounding this bone. Soft tissues could increase the variability of the method. Moreover, the calcaneus consists of 90% trabecular bone and is a good predictive site

2

ACCEPTED MANUSCRIPT

AC C

EP

TE D

M AN U

SC

RI PT

of fracture in terms of bone mineral density [14]. Currently, X-ray is a widely used modality in the hospital environment. Quantitative Ultrasound (QUS) are also used to predict fragility fractures in postmenopausal women. Although conventional QUS approaches confer numerous advantages such as absence of ionizing radiation, portability, lower costs, availability in primary care, they do not provides added value compared to DXA for osteoporosis diagnosis and cannot be used to monitor the skeletal effects of treatments for osteoporosis [40]. Updated official positions of the International Society for Clinical Densitometry [43] validated only the heel as a skeletal site for the clinical use of QUS in osteoporosis management. However recent advanced quantitative echo sound methodology at the proximal femur might be promising [44]. In this section, relevant work on the image processing based diagnosis of osteoporosis is summarized. Geraets et al. [30] developed a method for the automated recognition of the radiographic trabecular pattern for osteoporosis diagnosis. Genant et al. [5] proposed an imaging assessment of bone quality in osteoporosis by computed tomography (CT) images and micro-computed tomography (µ-CT) images. E. Legrand et al. [6] showed the analysis of trabecular bone micro-architecture for measurement of bone mineral density and vertebral fractures for diagnosis of osteoporosis. This study was done for diagnosis of osteoporosis in male subjects and reported results were interesting and motivating. J. S. Gregory et al. [7] analyzed the trabecular bone structure using Fourier transforms and artificial neural networks for diagnosis of osteoporosis. M. G. Roberts et al. studied the image texture in dental panoramic radiographs for diagnosis of osteoporosis [8]. M.S. Kavitha et al. [9] developed computer aided system for diagnosis of osteoporosis from dental panoramic radiograph images. Support Vector Machine (SVM) classifier was used to separate osteoporotic patients from healthy subjects. M. Ascenzi et al. [10] studied the variation of trabecular architecture in proximal femur of postmenopausal women for diagnosis of osteoporosis. B. R. Gomberg et al. [11] explained topological analysis of trabecular bone MR images for diagnosis osteoporosis. V. Sapthagirivasan et al. [12] presented a computer vision based diagnosis system for osteoporotic risk detection using an SVM classifier for X-ray digital images. Haidekker et al. [31] proposed an image processing based method for the diagnosis of osteoporosis and estimation of the individual bone fracture risk. R. Dendere et al. [32] developed a model to measure phalangeal bone mineral mass on a slot-scanning digital radiography system. Bayarriet et al. [33] presented a mechanical characterization of trabecular bone applied to MRI examinations for osteoporosis diagnosis. Cheng et al. [34] proposed learning based landmark detection for osteoporosis analysis. E. Hassouni et al. classified trabecular bone X-ray images using fractional Brownian motion and the Probability Density Function. T. Whitmarsh et al. [35] developed a method for reconstruction of 3 dimensional shape and distribution of bone mineral density of proximal Femur from Dual-Energy X-Ray absorptiometry. V. Sapthagirivasan et al. [36] analyzed texture pattern of femur X-ray images for diagnosis of osteoporosis. Texture analysis was also done by Gaidel et al. [37] for automated diagnosis of osteoporosis by plain hip radiography. J. H. Potgieter et al. [38] developed a method to measure bone mineral density on a slot-scanning digital radiography system using dual energy X-ray absorptiometry. H. H. Lin et al. [39] developed a novel two

3

ACCEPTED MANUSCRIPT

component model to calculate bone mineral density and volume fractions from computed tomography images.

RI PT

Although all these studies are interesting and the experimental results reported are encouraging, there is still a need to design accurate, computer vision based systems for the diagnosis of osteoporosis from bone images.

M AN U

SC

The main contribution of this paper is a novel and efficient imaging method to improve the diagnosis of osteoporosis based on strategically extracted discriminatory features from bone X-ray images. Using these discriminatory features, supervised classification techniques are used to distinguish osteoporotic patients from healthy subjects automatically. The only manual intervention required is the specification of a measurement region within the image. The intensity and contrast of the bone X-ray images are adjusted prior to feature extraction, which ensures that the extracted features efficiently discriminate between the two populations. These features are subjected to different classifiers, resulting in an accurate classification in comparison to features extracted without pre-processing. Another highlight of the proposed work is the low time complexity required for the proposed method. Discriminatory features extracted from bone X-ray images are standardized using the z-score normalization technique and compressed using principal component analysis which ensures low time complexity.

2. Materials

TE D

The remaining part of the work is organized as follows. Section 2 presents the materials used in this work. Section 3 describes the proposed image processing based method including pre-processing, feature analysis and classification. Section 4 reports and discusses the experimental results obtained by the proposed algorithm. Section 5 concludes the paper.

AC C

EP

This section describes the protocol for selecting the clinical population and the radiographic process for data acquisition. 2.1 Populations This is a retrospective multi-centric study that was conducted at three French research hospitals (Amiens, Orleans and Cochin in Paris). Patients were recruited from a program originated by the hospital of Orleans between Nov. 2004 and Feb. 2006. The study involved 174 women aged 40 to 92 years attending the bone densitometry unit and hospitalized in the rheumatology, orthopedic, and geriatric units, and comprised 87 controls (CC) and 87 patients with osteoporotic fractures (OP). Among the OP subjects, there were 21 patients with hip fracture (HF), 22 with vertebral fracture (VF), 23 with wrist fracture (WF) and 21 with other fractures (OF). All OP patients (aged 71.34 ± 10.55 SD) and control subjects (aged 68.93 ± 9.78 SD) filled out an osteoporosis risk questionnaire (age, family history of fracture, menopausal status, use of tobacco or alcohol, use of hormonal replacement therapy (HRT), treatment by oral corticosteroids, rheumatoid arthritis). Patients treated with corticosteroids, fluoride, bisphosphonates, HRT, tibolone, calcitonin, SERM, and PTH for more than 6 months in the previous year were excluded from the study.

4

ACCEPTED MANUSCRIPT

AC C

EP

TE D

M AN U

SC

RI PT

Patients who were immobilized for more than 2 months and with known diseases that could interfere with bone metabolism were also excluded from the study. All fracture cases were reviewed by experienced investigators who considered the diagnosis of fragility fracture if it occurred after the age of 40 years. The cases were described as either spontaneous fractures, fractures resulting from strenuous activity, fractures after falls from standing height or less (low trauma energy) and following radiologic data. The presence of vertebral fractures was diagnosed using lateral spine radiographs according to Genant’s semi-quantitative classification with grade II or more. BMD was performed by DXA on all patients and controls. Grade I vertebral fractures were not included in this study since in pivotal OP treatment trials only grades II or III were considered. However, OP characterized by wrist fractures (23 cases in the present study) was included, as they can be considered to represent an early stage of osteoporosis. Since these fractures occur in patients younger than those with hip or vertebral fractures. 2.2 Acquisition Images were obtained on the calcaneus (bone of the heel) with a direct digital Xray device (BMA™, D3A Medical Systems) [13].The calcaneus is a bone surrounded by limited soft tissues and it was chosen to limit the effect of soft tissues that could increase the variability of the method. Radiographic images were captured following a highly standardized protocol. The same parameters were used for all patients: focal distance (1.15 m), X-ray parameters (55 kV and 20 mAs). Using two anatomical landmarks as previously described in [14], a similar Region of Interest (ROI) was defined for each subject. An operator localized these land marks on each image (Fig. 2(a)) enabling the software device to position the ROI (1.6×1.6 cm2) (Fig. 2(b)). Images were coded into 16 bits (400×400 pixels size). Fig. 2(b) and Fig. 2(c) show respectively a representative image of a ROI of a control subject and of an osteoporotic patient.

(a)

(b)

(c)

Fig.2: X-ray of a calcaneus bone with the ROI (a), segmented ROI of a control subject (b), extracted ROI of an osteoporotic patient (c).

5

ACCEPTED MANUSCRIPT

3. The proposed Imaging Method

SC

RI PT

The main objective of this study was to design and develop an image processing based method to distinguish osteoporotic patients from healthy subjects. The proposed method is based on discriminatory features extracted from pre-processed bone X-ray images that are subjected to artificially intelligent classifiers. To extract efficient and discriminatory features, images were pre-processed to improve their quality before feature extraction. The extracted features were normalized and compressed to reduce the time complexity and to improve classifier performance. The proposed method is broadly divided into four steps: pre-processing, feature extraction and analysis, prominent feature selection using supervised classification. Fig. 3 explains the flowchart of the proposed method. Bone X-ray Image Acquisition

Machine learning/Computer Vision

X-ray Machine Set up

M AN U

Quality Enhancement/ Preprocessing of X-ray Image

X-ray of a calcaneus bone with the ROI

Testing Samples

Supervised Classification

Database

Training Data

EP

Healthy & Osteoporotic patient X- ray Image

AC C

p- value Calculation to select most Discriminatory Features

Most Discriminatory Feature Selection (p-value ≤ 10-6 )

TE D

Identification/Grading of Healthy & Osteoporotic patient X- ray Image

Statistical/Texture Feature Extraction and Analysis

Validation of Results with Doctor’s grading

Testing Samples

Classification Healthy/ Osteoporotic patient X- ray Image

Performance Measure

Accuracy

Time Complexity

Classification of two populations Healthy/osteoporotic

Fig. 3: The flowchart of the method using X-ray images of calcaneus bone.

6

ACCEPTED MANUSCRIPT

Step 1: Pre-processing

TE D

M AN U

SC

RI PT

X-ray images of the bone micro architecture from osteoporotic patients and healthy subjects show high similarity, making the diagnosis of osteoporosis from X-ray images a challenging task. To address this issue, each bone X-ray image was preprocessed to enhance its quality before feature extraction by adjusting the intensity and contrast of the input image. Intensity adjustment was done by transforming the intensity values to new values in such a way that 1% of data was saturated at lower and higher intensity values in an input image [15]. Contrast enhancement was accomplished using adaptive histogram equalization [15]. Fig. 4 (a) shows two samples of bone X-ray images from healthy subjects. Fig. 4(b) and 4(c) present the images after intensity adjustment and contrast enhancement, respectively. Similarly, Fig. 5(a) shows two samples of X-ray images from osteoporotic patients. Fig. 5(b) and Fig. 5(c) present the images after intensity and contrast adjustment, respectively. It can be seen that the quality of the images has been changed, and that trabeculae are more visible. The pre-processed images were further used for feature extraction.

AC C

EP

(a) (b) (c) Fig. 4: Two representative samples of bone X-ray images of healthy subjects. original input images (a), images after intensity adjustment (b), images after contrast adjustment(c).

(a) (b) (c) Fig. 5: Two representative samples of bone X-ray images of osteoporotic patients (with fracture). original input images (a), images after intensity adjustment (b), images after contrast adjustment (c).

7

ACCEPTED MANUSCRIPT

Step 2: Feature Extraction and Analysis

RI PT

The density of bone and its micro-architecture depend on the composition of calcium and collagen. In osteoporotic patients, the steady loss of collagen and calcium leads to a decrease in bone density and deterioration of bone micro-architecture which may disturb the pattern of pixel intensities in bone X-ray images. First-order Statistical Features: First-order statistical features characterize the distribution of the image intensities across an entire image. The mean and standard deviation of all pixel intensities within the ROI in each image were used.

M AN U

SC

Second-order Statistical Features: In osteoporotic patients, insufficient bone formation may cause changes in the texture of bone X-ray images. These changes will be reflected in the relationships between pixel intensities and their distribution over the image. In contrast to first‐order statistical features, second‐order features of an input image in the spatial domain quantify relationships between the intensities of neighboring pixels. Second-order statistical features can be calculated using a gray-level co-occurrence matrix (GLCM). A GLCM is a second-order statistical textural analysis, which shows the relationship between intensity values of neighborhood pixels in a gray image [16]. For an image, I of size R×C, the GLCM matrix, G, can be calculated [16] using the following equation: Gd (i,j) = |{(, ), ( + ,  + ) ∶ (, ) = , ( + ,  + ) = }| (1)

TE D

where (r,c), (r+ ,  + ) ∈R×  i = (, )and j = ( + ,  + ) d = ( , ) and |·| represents the cardinality of a set.

For a pixel intensity value i in the image, the probability of a pixel intensity value j at the ( , ) distance is:  (,)

Pd (i ,j) = ∑ 

 (,)

(2)

EP

Using equations (1) and (2), four texture features: energy, contrast, homogeneity and correlation can be calculated using the mathematical formulas given below: Energy = ∑ ∑[ (, )]!

(3)

AC C

Contrast= ∑ ∑ [( − )! { (, )}] (4)

Homogeneity = ∑ ∑

Correlation =∑ ∑

# (,)

(5)

$%(&)'

(&µ ) *+&µ , -# (,)

(6)

.) .,

A total of six statistical features were extracted and studied from the bone X-ray images before and after pre-processing to discriminate between healthy and osteoporotic patients. To check the discriminating behavior of the extracted features, a p-value test was calculated using the rank sum Wilcoxon test, also called the MannWhitney U test [18, 19].

8

ACCEPTED MANUSCRIPT

RI PT

The rank sum Wilcoxon test is a non-parametric statistical hypothesis test which uses a z‐statistic to calculate the p-value. This calculated p-value is used to determine if two sets of data are significantly different from each other. A lower p-value indicates higher discrimination and a larger p-value shows that there is no discrimination between two populations for that feature. On the basis of the calculated p-value, only the most prominent features were selected to distinguish osteoporotic patients from healthy subjects.

Step 3: Prominent Feature Selection

M AN U

SC

Intelligent classifiers based on machine learning were used to separate osteoporotic patients from healthy subjects. As the performance of the classifiers improves if the range of features subjected to the classifier is standardized, z-score normalisation [20] was applied to each feature. It is well known that the accuracy of the classifiers depends on the discrimination of features used for the classification. If the features subjected to the classifier are more prominent, then the classification accuracy is better. On the other hand, the complexity of the classifier will be reduced if fewer features are subjected to the classifier. In this study, a dimension reduction technique was used to reduce the feature vector by considering only discriminatory features. Dimension reduction techniques compress the high dimensional feature vector by assigning higher weight to discriminatory features. On the basis of experiments, it was found that the dimension reduction algorithm Principal Component Analysis (PCA) [21, 22] provided the most prominent features for the set of features used here.

EP

TE D

PCA transforms the observation data of possibly correlated variables into a set of values of linearly uncorrelated variables on the basis of orthogonal transformation. These transformed linearly uncorrelated variables are known as principal components (PCs). PCA maps the observation data in such a way that the first PC has a higher discrimination, the second has a lower discrimination than the first PC, and so on. Here, only discriminatory features were subjected to the classifiers. The p-value was calculated for all principal components after PCA reduction to select prominent features for classification as shown in the results below. TABLE 1 Statistical features and corresponding p– values (before pre-processing)

AC C

Features

Healthy subjects (mean ± std. deviation)

Osteoporotic patients (mean ± std. deviation)

p- values

(1.94E+003) ± 66.22 (272.73) ± 20.49

0.0626 0.1449

(2.70E+004) ± 256.5899 -0.02 ± 0.01 (6.2936e-006) ± (2.16E-008) 0.03 ± (1.72E-004)

0.5160 0.2309 0.4675 0.4563

First -order statistical feature: Mean Standard Deviation

(1.99E+003) ± 104.69 (289.39) ± 39.42

Second-order statistical features: Contrast Correlation Energy Homogeneity

(2.69E+004) ± 249.9562 -0.01 ± 0.01 (6.29E-006) ± (1.62E-008) 0.02 ± (1.81E-004)

9

ACCEPTED MANUSCRIPT

First -order statistical feature: Mean Standard Deviation

(3.27E+004) ± 50.75 (1.69E+004) ± 122.81

(3.27E+004) ± 90.22 (1.67E+004) ± 94.18

Second-order statistical features: (2.73E+004) ± 796.11 -0.05 ± 0.03 (7.92E-006) ± (1.39E-007) 0.02 ± (6.13E-004)

(2.76E+004) ± 689.29 -0.06 ± 0.03 (7.8953e-006) ± (1.63E-007) 0.02 ± (5.68E-004)

0.0020 1.21E-011

0.4735 0.2748 0.6000 0.7839

SC

Contrast Correlation Energy Homogeneity

p- values

RI PT

TABLE 2 Statistical features and corresponding p – values (after pre-processing) Healthy subjects Osteoporotic patients Features (mean ± std. deviation) (mean ± std. deviation)

TABLE 3 Features (Principal components) and their corresponding p-values (after reduction using PCA)

PC 1 PC 2 PC 3 PC 4 PC 5 PC 6

Healthy subjects (mean ± std. deviation) (-4.02E-006) ± (6.47E-006) (-1.80E-006) ± (4.06E-006) (-2.62E-008) ± (1.10E-007) (-1.63E-012) ± (8.32E-011) (-4.79E-012) ± (1.64E-011) (7.84E-014) ± (2.78E-012)

Osteoporotic patients (mean ± std. deviation) (4.01E-006) ± (6.85E-006) (1.80E-006) ± (2.83E-006) (2.61E-008) ± (1.96E-007) (1.63E-012) ± (9.52E-011) (4.79E-012) ± (2.40E-011) (-7.82E-014) ± (2.63E-012)

M AN U

Features

p- values

1.69E-006 2.24E-005 0.0013 0.8663 0.6834 0.8436

AC C

EP

TE D

Table 1 presents the results of feature analysis before pre-processing of the bone Xray images, showing the mean ± standard deviation and p-value for healthy subjects and osteoporotic patients. It can be seen that the p-value obtained is less significant for classification compared to the p-value obtained after pre-processing. Table 2 presents the results expressed as mean ± standard deviation and p-value for feature extraction after preprocessing. The lower p-value shows that the features extracted after pre-processing are more discriminative in comparison to those extracted before pre-processing. Table 3 presents the results of mean ± standard deviation and p-values for all six principal components after PCA reduction. It can be clearly seen in Table 3 that the first two principal components have much lower p-values, indicating a higher discrimination between osteoporotic patients and healthy subjects. Tables 1, 2 and 3 show the information content of the various features considered for classification. These data indicate that limited information is present in the secondorder features, and most of the information for classification is present in the mean and standard deviation of the intensities. These results also demonstrate that the amount of information present significantly increases by the pre-processing, and this has by far the greatest effect on the standard deviation, making it the most discriminatory feature for classification. Table 4 presents the linear combinations of the original variables that generate the principal components PC1 and PC2 which were used for classification in the proposed work. It can be clearly seen in Table 4 that texture feature: Contrast dominates first principal component PC1 and statistical feature: Std. deviation dominates second principal component PC2.

10

ACCEPTED MANUSCRIPT

RI PT

TABLE 4 Linear combinations of the original features that generate the principal components PC1 and PC2 Principal Components (PCs) PC1 PC2 -0.5160 -0.4609 -0.3014 0.8533 0.0268 0.8017 0.0052 -0.1397 0.0052 -0.1397 0.0052 -0.1397

Features Statistical Features

SC

Texture Features

Mean Std. deviation Contrast Correlation Energy Homogeneity

M AN U

Fig. 6 shows the box-and-whisker plots (median and the 25% and 75% quartiles) of the mean, standard deviation and GLCM features after pre-processing from healthy subjects and osteoporotic patients. As can be seen, these features have a limited capability to discriminate between healthy and osteoporotic subjects. Fig. 7 shows the box plots (mean ± standard deviation) of the first two principal components after PCA reduction. It is clearly visible in Figure 7 these two principal components can be subjected to classifiers. 4

4

x 10

x 10

1.71

3.32

Standard Deviation

1.7

3.3 3.29

3.27 3.26

0.026

0.025 0.0245

AC C

Homogeneity

0.0255

1.68 1.67 1.66 1.65 1.64

Osteoporotic Patients

EP

Healthy Subjects

0.024

Osteoporotic Patients

Healthy Subjects

Osteoporotic Patients

0 -0.02 -0.04 -0.06 -0.08

0.0235

-0.1

0.023

-0.12

Healthy Subjects

Healthy Subjects

0.02

Correlation

3.28

1.69

TE D

Mean

3.31

Osteoporotic Patients

Fig. 6: Box plots of the extracted features from healthy subjects and osteoporotic patients. (before applying PCA)

11

ACCEPTED MANUSCRIPT

-5

-6

x 10

x 10 10

1 0 -1

0

-5

Osteoporotic Patients

Healthy Subjects

Osteoporotic Patients

SC

Healthy Subjects

5

RI PT

2

Principal Com ponent 2

P rinc ipal Com ponent 1

3

Fig.7: Box plot of the two PCs subjected to the classifiers (after reduction using PCA).

M AN U

Step -4: Supervised classification of the two populations

The discriminatory feature vectors (first and second PCs) were used to separate healthy X-ray images from osteoporotic ones using different classifiers. The four most popular classification algorithms, namely Support Vector Machine (SVM) [23, 24], Naive Bayes classifier [25], Artificial Neural Network (ANN) [26] and k-Nearest Neighbors (k-NN) classifier [27], were applied to the first two principal components and the classifier that returned the best performance was finally selected for classification. Table 5 shows the best configuration settings used for the classification in terms of the results obtained.

TE D

TABLE 5 Design parameters used for classification.

Classifiers

Design parameters used for classification

Support Vector Machine (SVM) Naïve Bayes

RBF Kernel with Sigma =3 Laplace Correction – true Hidden sizes= 4 Training function: trainlm Number of nearest neighbors = 5 Distance = Cosine, Rule = Nearest

Artificial neural network (ANN)

EP

k- Nearest Neighbors (k-NN)

AC C

The SVM classifier [23, 24] was trained using a training set that included bone X-ray images from healthy subjects and osteoporotic patients. The performance of the classifier was checked for different kernels and the best parameter was finally selected to classify the test samples of bone X-ray images. Fig. 8 shows the classification process using the SVM.

12

ACCEPTED MANUSCRIPT

Test Sample Bone X-ray Images Kernels K (S1,St)

Machine Learning

S2

K (S2,St)

+

Sn

K (Sn,St)

(Healthy/Osteoporotic)

M AN U

Support Vectors from Training database of bone X-ray images

Test Sample Bone X-ray image is classified as Healthy/Osteoporotic

SC

S1

RI PT

St

Fig.8: Support Vector Machine (SVM) classification.

Training Set Bone X-ray images (Healthy/ Osteoporotic Patients)

Bayes Rule (Normal Distribution)

Find the probability of test sample to belong in possible classes (Healthy/Osteoporo tic Patients)

Best Prediction for test Sample (Healthy/Osteoporotic Patients)

EP

Test samples of X-ray images

TE D

The Naïve Bayes classifier [25] works on the Bayes rule and probability measurement. The classifier was trained using the training dataset of bone X-ray images from healthy subjects/osteoporotic patients and the best parameter setting was selected. Fig. 9 presents the flow diagram of the Naïve Bayes classification algorithm used for machine learning.

AC C

Fig.9: Bayes rule based classification.

The main parameter settings for artificial neural network classification [26] are hidden sizes and training function. The performance of the artificial neural network was checked for different parameters on the basis of the training database and the best parameter was finally selected for classification of the test images. Fig. 10 shows the process of the artificial neural network used for classification of healthy and osteoporotic patients.

13

ACCEPTED MANUSCRIPT

Hidden Layer

RI PT

Hidden Layer X1 X2

Xn Input Samples Healthy/Osteoporotic Patients from Bone X-ray Image

SC

Output Classification of Healthy/Osteoporotic Patients from Bone X-ray Image

M AN U

Neurons

Fig.10: Artificial Neural Network (ANN) classification.

In the nearest neighbors based classification algorithm [27], the number of nearest neighbors and distance metrics were selected according to the training dataset. The performance of the k-NN classification algorithm is highly dependent upon the number of nearest neighbors used so the best parameter setting was selected for classification of the test samples. Fig. 11 presents the flow of the k-NN classification algorithm used to separate healthy and osteoporotic patients.

TE D

Parameter Selection

Find the k nearest neighbors to the Test Sample of X-ray image

No. of Nearest Neighbors, Type of Distance

EP

Training Data Base Bone X-ray Image (Healthy/Osteoporotic)

Set Maximum Label Class of K to Test Sample

Classified Test Sample

AC C

Test Sample Bone X-ray Image

Fig. 11: k‐Nearest Neighbor (k-NN) classification.

4. Experimental Results and Discussion In this study, a database of 174 bone X-ray images was used for experiments. This section reports and discusses the results. 80 images out of 174 were used to train the classifiers and the remaining 94 images were used to check the performance of each classification method. There was no overlap between the images used for training and testing. Table 6 shows the distribution of healthy and osteoporotic images used for training and testing. Experiments were performed with four different classifiers to

14

ACCEPTED MANUSCRIPT

SC

RI PT

classify osteoporotic patients and healthy subjects and the classifier performances were evaluated using sensitivity, specificity and accuracy [29].The performance of all four classifiers was measured on the original features and results are reported in Table 7. It can be clearly seen in Table 7 that the original features were not significant enough for the efficient classification of the two populations. Table 8 shows the performance of the different classifiers on the discriminatory features selected after PCA. Table 8 compares the performance of the classifiers with/without pre-processing before feature extraction. As can be seen in Table 6 and Table 8, the performances of all four classifiers on the original features are not significant whereas their performance with the use of PCA is much better. All the classifiers provide an accuracy of more than 95%, which can be considered as encouraging results.

Bone X-ray image

M AN U

TABLE 6 Bone X-ray image database considered in this study. Total No. of samples used to train the classifiers 40 40 80

Healthy Subjects Osteoporotic Patients Total

Total No. of samples used to test the performance of classifiers 47 47 94

Total samples 87 87 174

TABLE 7 Performance of the different classifiers using the original features.

SVM Naive Bayes k-NN ANN

True Positive 45 45 42 40

False Positive 24 44 16 26

True Negative 23 3 31 21

TE D

Classifiers

False Negative 02 02 05 07

Sensitivity (%) 95% 95% 89% 85%

Specificity (%) 49% 7% 66% 44%

Accuracy (%) 72% 51% 77% 64%

EP

TABLE 8 Performance of the different classification methods and comparative performance with/without preprocessing before feature extraction True Positive

AC C

Classifiers

SVM Naive Bayes k-NN ANN

SVM Naive Bayes k-NN ANN

29 25 27 33

False Positive

True Negative

False Negative

Sensitivity (%)

Specificity (%)

Accuracy (%)

Feature extraction without pre-processing 23 23 19 17

24 24 28 30

18 22 20 14

61.70% 53.19 % 57.44 % 70.02 %

51.06 % 51.06 % 59.57 % 63.82 %

56.38 % 52.12 % 58.51% 67.02 %

95.74% 93.61 % 97.87 % 95.74 %

97.87% 95.74% 96.80 % 96.80 %

Feature extraction after pre-processing 47 46 45 46

02 03 01 02

45 44 46 45

15

00 01 02 01

100% 97.87 % 95.74 % 97.87 %

ACCEPTED MANUSCRIPT

x 10

RI PT

Healthy (training) Healthy (classified) Osteoporotic (training) Osteoporotic (classified) Support Vectors

1

0.5

SC

0

-0.5

-1 -3

-2

M AN U

Second Principal Component after PCA Reduction

-5

1.5

-1

0

1

2

First Principal Component after PCA Reduction

3 -5

x 10

Fig. 12: Classification results using SVM based classification

TE D

The SVM classifier shows the best performances, with 97.87% accuracy, 100% sensitivity and 95.74% specificity. Fig. 12 presents the plot of the two main discriminatory PCs for the training and testing set of SVM based classification. It is clearly visible in Fig. 12 that the two main PCs which were selected for classification are sufficiently discriminatory for accurate classification.

EP

Fig. 13 presents the ROC curves for the four classifiers (SVM, k-NN, Naïve Bayes and ANN). The area under the ROC curve (AUC) and Youden’s index were also estimated from the ROC. Values are reported in Table 9. The highest AUC value was 0.9954 for the k-NN classifier and the highest Youden’s index was 0.9574 for the SVM classifier

AC C

Classifiers SVM Naive Bayes k-NN ANN

TABLE 9 ROC curve analysis Area under curve (AUC) 0.9824 0.9791 0.9954 0.9681

Youden’s index 0.9574 0.9361 0.9149 0.9361

The algorithms were implemented in MATLAB R2011b (Math Works) software using CPU@ 2.3 GHz, 4GB RAM, 32-bit operating system. The computational time needed for feature extraction was approximately 9 seconds per image and for feature normalization, reduction and classification approximately11 sec per image, which can be considered as very low. The low time complexity of the proposed method makes it suitable for classification of the trabecular bone structure of osteoporotic Patients in real time

16

ACCEPTED MANUSCRIPT

ROC

ANN Classifier

0.95

SVM Classifier k-nn Classifier

RI PT

1

0.9

0.85

SC

True Positive Rate

Naive Bayes Classifier

0.75

0

0.05

M AN U

0.8

0.1

0.15

0.2

0.25

False Positive Rate

Fig. 13: ROC curve for SVM, k-NN, Naïve Bayes and ANN classifier

Leave One Out Cross Validation (LOOCV)

TE D

The leave one out cross validation (LOOCV) technique [28] was used to distinguish osteoporotic patients from control subjects. In this method, only one sample was used for testing purposes and the remaining samples were used to train the classifier. This procedure was repeated for the complete dataset. The LOOCV technique was used for SVM as it provided the best classification performance. Table 10 presents the crossvalidation results. The LOOCV gives 96.55% accuracy with 95.40% specificity and 97.70% sensitivity with the SVM classifier, indicating that the proposed approach is efficient and may be helpful for the diagnosis of osteoporosis in real time.

EP

TABLE 10 Results obtained using the SVM classifier with the leave one out cross validation. Support Vector Machine Kernel: RBF with σ = 3 174 173 1 87 87 85 02 83 04 97.70 % 95.40 % 96.55 %

AC C

Classifier used Classifier setting parameters Total number of samples used Samples used for training Samples used for testing Total number of osteoporotic subjects Total number of control Subjects True positive (TP) False negative (FN) True negative (TN) False positive (FP) Sensitivity Specificity Accuracy

17

ACCEPTED MANUSCRIPT

TABLE 11: Comparison between published studies and the proposed approach (data taken from the literature)

M.S. Kavit ha et al. [9]

V. Sapthagiriv asan et al. [12]

A. Gaidel et al. [37]

Imaging method for the diagnosis of osteoporosis

Imaging analysis for osteoporosis diagnosis

Dental panoramic radiographs (DPRs) from 663 female patients

Extraction of texture feature from DPR images and classification

Dental panoramic radiographs from 100 female subjects

SVM classifier for continuous measurement of the cortical width of the mandible on dental panoramic radiographs.

Performance measurements

Time complexity

RI PT

CT images, QCT images µ-CT image

Not reported

Area under curve: 0.872

Sensitivity: 90.9% specificity: 83.8%

Not reported

Not reported

Average classification time: 9 sec per image

hip radiographs images of 50 subjects

trabecular features extraction and SVM classification

Mean accuracy: 90%

50 sec for feature extraction and 8 sec per image for classification

Plain hip radiograms of 42 patients

Texture analysis of plain hip radiograms

Diagnostic error probability 0.2

Not reported

Preprocessing, statistical feature Analysis, feature normalization & reduction, discriminatory feature Selection, classifiers for osteoporosis diagnosis

Accuracy: 97% AUC (area under curve) = 98.24%

20 Sec

Bone X-ray image 87 healthy subjects and 87 patients with osteoporotic fractures

EP

Proposed study

Imaging assessment of bone quality in osteoporosis Diagnosis of osteoporosis using image texture in Dental Panoramic Radiographs Diagnosis of osteoporosis from dental panoramic radiographs Diagnosis of osteoporosis by extraction of trabecular features from hip radiographs Texture analysis for diagnosis of osteoporosis by plain hip radiography

Methodology used

SC

M. G. Roberts et al. [8]

Database

M AN U

Genant & Jiang [5]

Objective

TE D

Reference

AC C

Table 11 compares different existing studies for the diagnosis of osteoporosis. We are aware that the datasets used in these various studies are not the same and that confrontation of the different methods should be done on the same database. Since we have no access to these data, this task remains difficult. The proposed method achieved 97% accuracy, which is encouraging. However, this study has some limitations. The proposed method has only been validated on a small database. The bone of the calcaneus that was used is not commonly used in the clinical routine. It is necessary to be aware that to complete the DXA diagnosis employed in the clinical routine, our proposed method will increase the cost of the diagnosis and augment the radiation dose as well as the duration of the whole examination.

18

ACCEPTED MANUSCRIPT

5. Conclusion

M AN U

SC

RI PT

This work has proposed a novel and efficient imaging method for osteoporosis diagnosis using bone X-ray images. Statistical and texture features strategically extracted from pre-processed X-ray images were used to discriminate between healthy subject and osteoporotic patient images. Extracted features normalized using the z-score method and reduced using PCA were subjected to machine learning and classification. Four different classifiers (SVM, Naïve Bayes, k-NN and ANN) were used to discriminate between the two populations. Results demonstrate that all the classifiers achieved accuracy higher than 95%. The SVM classifier provided the best performances with an accuracy of 97.87%, sensitivity of 100% and specificity of 95.74%, which is encouraging. The proposed diagnostic system shows a higher accuracy with lower time complexity in comparison with existing methods for the bone tissue characterization. Future research work in this area might be to explore other discriminatory features like orientation-dependent features and correlation of first-order features with BMD and to use different classifiers to obtain even greater accuracy with less time complexity.

References:

D. Marshall, O. Johnell, H. Wedel, Meta-analysis of how well measures of bone mineral density predict occurrence of osteoporotic fractures, BMJ. 312 (1996) 1254-1259. [2] J. A Kanis, P. Delmas, P. Burckhardt, C. Cooper, D. Torgerson, Guidelines for diagnosis and management of osteoporosis: The European Foundation for Osteoporosis and Bone Disease, Osteoporosis International. 7 (1997) 390–406. [3] Mithal A, Kaur P. Osteoporosis in Asia: a call to action. Curr Osteoporos Rep. 2012 (4), 245– 247. [PubMed]. [4] S. A. Goldstein, The mechanical properties of trabecular bone: dependence on anatomic location and function, J. Biomech. 20 (1987) 1055-1061. [5] H. K. Genant, J. Y. Jiang, Imaging assessment of bone Quality in osteoporosis, Clinical Reviews in Bone and Mineral Metabolism, Springer. 4(2006) 213–224. [6]E. Legrand, D. Chappard, C. Pascaretti, M. Duquenne, S. Krebs, V. Rohmer, M. F. Basle, M. Audran, Trabecular bone micro-architecture bone mineral density and vertebral fractures in male osteoporosis, J. Bone Mineral Res., 15 (2000), 13-19. [7] J. S. Gregory, R. M. Junold, P. E. Undrill, R. M. Aspen, Analysis of trabecular bone structure using Fourier transforms and neural networks, IEEE Transactions on Information Technology in Biomedicine, 3 (1999), 289 – 294. [8] M. G. Roberts, J. Graham, H. Devlin, Image Texture in Dental Panoramic Radiographs as a Potential Biomarker of Osteoporosis, IEEE Transactions on Biomedical Engineering, 60 (2013), 2384 – 2392. [9] M.S. Kavitha, A. Asano, A. Taguchi, Diagnosis of osteoporosis from dental panoramic radiographs using the support vector machine method in a computer-aided system, BMC Med. Imaging, 12 (2012), 1-11. [10] M. Ascenzi, N. Hetzer, A. Lomovtsev, Variation of trabecular architecture in proximal femur of postmenopausal women, J. Biomech., 44 (2011), 248-256. [11] B. R. Gomberg, P. K. Saha, H. K. Song, S. N. Hwang, F. W. Wehrli, Topological analysis of trabecular bone MR images, IEEE Transactions on Medical Imaging, 19 (2000), 166-174. [12] V. Sapthagirivasan, M. Anburajan, Diagnosis of osteoporosis by extraction of trabecular features from hip radiographs using support vector machine: an investigation panorama with DXA, Computers in Biology and Medicine, Elsevier. 43 (2013) 1910–1919.

AC C

EP

TE D

[1]

19

ACCEPTED MANUSCRIPT

AC C

EP

TE D

M AN U

SC

RI PT

[13] E. Lespessailles, C. Gadois, G. Lemineur, J. P. Do-Huu, C. L. Benhamou, Bone texture analysis on direct digital radiographic images: precision study and relationship with bone mineral density at the osclacis. Calcified Tissue International, 80 (2007) 97–102. [14] C.L. Benhamou, E. Lespessailles, G. Jacquet, R. Harba, R. Jennane, T. Loussot, D. Tourliere, W. Ohley, Fractal organization of trabecular bone images on calcaneus radiographs, J. Bone Miner Res. 9 (1994) 1909–1918. [15] R.C. Gonzalez, R. E. Woods, Digital image processing, Prentice Hall, NJ, 2001. [16] M. M. Galloway, Texture classification using gray level run length, Comput. Graph. Image Process. 4(1975) 172–179. [17] J. H. Tan, E. Y. K. Ng, U. R. Acharya, Study of normal ocular thermogram using textural parameters, Infrared Phys. Technol, 53 (2010) 120–126. [18] J. D. Gibbons, Nonparametric, Statistical Inference. New York: Marcel Dekker, 1985. [19] M. Hollander, D. A. Wolfe, Nonparametric statistical methods. Hoboken, NJ: John Wiley & Sons, Inc. 1999. [20] M. H. Dunham, Data mining introductory and advanced topics. NJ: Prentice Hall, 2002. [21] I. T. Jolliffe, Principal component analysis series in statistics, Springer Verlag, 2002. [22] H. Abdi, L. J. Williams, Principal component analysis, Wiley Interdisciplinary Reviews: Computational Statistics, 2010. [23] C.C. Chang, C.J. Lin, LIBSVM: A library for support vector machines. 2001. http://www.csie.ntu.edu.tw/∼cjlin/libsvm. [24] S. S. Keerthi, S. K. Shevade, C. Bhattacharyya, K. R. K. Murthy, Improvements to platt’s SMO algorithm for SVM classifier design, Neural Comput.13 (2001) 637–649. [25] G. H. John, P. Langley, Estimating continuous distributions in bayesian classifiers, Proc. 11th Conf. Uncertainty Artif. Intell., San Mateo. (1995) 338–345. [26] S. Haykin, Neural networks a comprehensive foundation. 2nd edn. Pearson Education, 1999. [27] D. Coomans, D.L. Massart, Alternative k-nearest neighbour rules in supervised pattern recognition: Part 1. k-nearest neighbour classification by using alternative voting rules, 1982. Analytica ChimicaActa. 136(1982) 15–27. [28] B. Efron, R. Tibshirani, Improvements on cross-validation: The bootstrap method, Journal of the American Statistical Association, 92 (1997) 548-560. [29] E. J. Boyko, Ruling out or ruling in disease with the most sensitive or specific diagnostic test: short cut or wrong turn? medical decision making: an International Journal of the Society for Medical Decision Making. 14 (1999) 175–179. [30] W. G. M. Geraets, P. F. V. Stelt, C. J. Netelenbos, P. J. M. Elders, A new method for automatic recognition of the radiographic trabecular pattern,J. Bone Miner Res. 5 (1990) 227–233. [31] M. A. Haidekker, G. Dougherty, Medical imaging in the diagnosis of osteoporosis and estimation of the individual bone fracture risk, Biological and Medical Physics, Biomedical Engineering, Springer, 2011, 193-225. [32] R. Dendere, S. P. Whiley, T. S. Douglas, Computed digital absorptiometry for measurement of phalangeal bone mineral mass on a slot-scanning digital radiography system, Osteoporosis Int., 25 (2014), 2625-2630 [33] Bayarri, W. L. Roque, K. Arcaro, M. A. Perez, Mechanical characterization of trabecular bone applied to MRI examination: In vivo results from 100 patients, Computational vision and medical image processing, Taylor and Francis, 2014, 383-385. [34] E. Cheng, L. Zhu, J. Yang, Azhari, S. Sitam , X. Liang, V. Megalooikonomou, H. Ling, Learning based landmark detection for osteoporosis analysis, Medical Imaging, Image Processing, SPIE 9784, 2016. [35] T. Whitmarsh, L. Humbert, M. D. Craene, L. M. D. R. Barquero, A. F. Frangi, Reconstructing the 3D Shape and Bone Mineral Density Distribution of the Proximal Femur From Dual-Energy X-Ray Absorptiometry, IEEE Transactions on Medical Imaging, 30 (2011), 2101 – 2114.

20

ACCEPTED MANUSCRIPT

AC C

EP

TE D

M AN U

SC

RI PT

[36] V. Sapthagirivasan, M. Anburajan, Analysis of texture patterns in diagnosing osteoporosis using proximal femur x-ray images, Advances in Digital Image Processing and Information Technology, Springer, 205 (2011). pp. 383-392. [37] A. Gaidel, A. Khramov, Application of texture analysis for automated osteoporosis diagnostics by plain hip radiography, pattern recognition and image analysis, springer, 25 (2015), 301-305. [38] J. H. Potgieter, S. Steiner, S. P. Whiley, T. S. Douglas, R. Dendere, Dual-Energy X-Ray Absorptiometry for Measurement of Phalangeal Bone Mineral Density on a Slot-Scanning Digital Radiography System, IEEE Transactions on Biomedical Engineering, 62 (2015), 28502859. [39] H. H. Lin, S. L. Peng, J. Wu, T. Y. Shih, K. S. Chuang, C. T. Shih, A Novel Two-Compartment Model for Calculating Bone Volume Fractions and Bone Mineral Densities From Computed Tomography Images, IEEE Transactions on Medical Imaging, 36 (2017), 1094-1105. [40] E. S. Siris, Y. Chen, T. Abbott, E. Barett-Connor, P. Miller, L. Wehren, M. Berger, Bone Mineral Density Thresholds for Pharmacological Intervention to Prevent Fractures, Arch. Intern. Med., 164 (2004), 1108-12. [41] J. F. Veenland, J. L. Grashuis, H. Weinans, M. Ding, Vrooman, Suitability of texture features to assess changes in trabecular bone architecture, Pattern Recognition Letters.23 (2007), 395-403. [42] S.Majumdar, J. Lin, T. Link, J. Millard, P. Augat, X. Ouyang, D. Newitt, R. Gould, M. Kothari, H. Genant, Fractal analysis of radiographs: assessment of trabecular bone structure and prediction of elastic modulus and strength, Med Phys.26 (1999), 1333-1340. [43] B. C. Silva, S. B. Broy, S. Boutroy, J. T. Schousboe, J. A. Shepherd, W. D. Leslie, Fracture Risk Prediction by Non-BMD DXA Measures: the 2015 ISCD Official Positions Part 2: Trabecular Bone Score, Journal of Clinical Densitometry, Elsevier. 18 (2015), 309-330. [44] S. Casciaro, M. Peccaris, P. Pisani, R. Franchini, A. Greco, T. D. Marco, A. Grimaldi, L. Quarta, E. Quarta, M. Muratore, F. Conversano, An Advanced Quantitative Echosound Methodology for Femoral Neck Densitometry, Ultrasound in Medicine & Biology, Elsevier. 42 (2016), 13371356.

21