Classification of lung sounds using higher-order statistics: A divide-and-conquer approach

Raphael Naves*, Bruno H.G. Barbosa, Danton D. Ferreira

Engineering Department, Federal University of Lavras, MG, Brazil

Computer Methods and Programs in Biomedicine 129 (2016) 12–20

* Corresponding author at: Engineering Department, Federal University of Lavras, Campus Universitario, Caixa Postal 3037, CEP: 37200-000, Brazil. Tel.: +55 35 9194 4943.

Article info

Article history: Received 6 June 2015; received in revised form 18 February 2016; accepted 19 February 2016.

Keywords: Lung sounds; Pattern recognition; Higher-order statistics; Genetic Algorithm

Abstract

Background and objective: Lung sound auscultation is one of the most commonly used methods to evaluate respiratory diseases. However, the effectiveness of this method depends on the physician's training: without proper training, the physician will be unable to distinguish between normal and abnormal sounds generated by the human body. Thus, the aim of this study was to implement a pattern recognition system to classify lung sounds.

Methods: We used a data set composed of five types of lung sounds: normal, coarse crackle, fine crackle, monophonic and polyphonic wheezes. We used higher-order statistics (HOS) to extract features (second-, third- and fourth-order cumulants), Genetic Algorithms (GA) and Fisher's Discriminant Ratio (FDR) to reduce dimensionality, and k-Nearest Neighbors and Naive Bayes classifiers to recognize the lung sound events in a tree-based system. We used a cross-validation procedure to analyze the classifiers' performance and Tukey's Honestly Significant Difference criterion to compare the results.

Results: Our results showed that the Genetic Algorithms outperformed the Fisher's Discriminant Ratio for feature selection. Moreover, each lung sound class had a different signature pattern according to its cumulants, showing that HOS is a promising feature extraction tool for lung sounds. In addition, the proposed divide-and-conquer approach can accurately classify different types of lung sounds. The best tree-based classifier obtained a classification accuracy of 98.1% on training data and 94.6% on validation data.

Conclusions: The proposed approach achieved good results even using only one feature extraction tool (higher-order statistics). Additionally, the implementation of the proposed classifier in an embedded system is feasible.

© 2016 Elsevier Ireland Ltd. All rights reserved.

1. Introduction

Since antiquity, physicians have auscultated sounds inside the chest to identify signs of disease [1]. Lung sound auscultation by stethoscope is a simple, quick, and non-invasive method

to provide diagnostic information about a patient’s lung [1]. However, proper use of this method depends on the ability of the physician to recognize normal and abnormal sounds generated by the human body. In addition, lung sounds are non-stationary signals, which makes them both difficult to analyze and hard to distinguish


when using traditional auscultation methods. Thus, the use of an electronic stethoscope together with a pattern recognition system helps to overcome the limitations of traditional auscultation, providing an efficient method for clinical diagnosis [2].

Lung sounds are classified as either normal (healthy individuals) or adventitious (abnormal). Adventitious sounds are divided into two categories: continuous sounds (wheezes and rhonchi) and discontinuous sounds (crackles). According to the American Thoracic Society (ATS) [3], wheezes are defined as high-pitched continuous sounds and rhonchi as low-pitched continuous sounds. The ATS [3] specifies that a wheeze contains a dominant frequency above 400 Hz, while rhonchi are characterized by a dominant frequency of about 200 Hz or less. A wheeze can be monophonic if it contains a single frequency, or polyphonic when several frequencies are perceived simultaneously [4]. The diseases associated with wheezing sounds are asthma, pneumonia, and bronchitis.

Crackles are discontinuous adventitious sounds caused by the sudden opening of small airways that had collapsed [2]. Crackles are usually defined by their time-domain features, such as initial deflection width (IDW) and two-cycle duration (2CD). According to the ATS [3], the average durations of the IDW and 2CD of fine crackles are 0.7 and 5 milliseconds (ms), respectively, and those of coarse crackles are 1.5 and 10 ms, respectively [5]. Fine crackles are heard in patients with pneumonia, pulmonary fibrosis, and congestive heart failure (CHF) [6].

Other categories of lung sounds include pleural rub, stridor, and squawks. Pleural rub is the characteristic sound produced when inflamed pleural surfaces rub together during respiration. Stridor is a very loud wheeze, and squawks are short inspiratory wheezes [1].

Several authors have used a variety of techniques for filtering, feature extraction, separation, and classification of lung sounds. For instance, to increase the accuracy of the forced oscillation technique (FOT) in categorizing the level of airway obstruction in patients with chronic obstructive pulmonary disease (COPD), [7] developed a pattern recognition system using k-Nearest Neighbors (k-NN), random forest (RF), and support vector machine (SVM) classifiers. They used the Receiver Operating Characteristic (ROC) curve for performance evaluation and achieved results higher than 0.9. The SVM classifier was used in [2] and [8] to classify lung sounds. In [2], normal, crackle, and rhonchus sounds were classified and the authors used the frequency ratio of Power Spectral Density (PSD) values and the Hilbert-Huang Transform (HHT) method for feature extraction. The classifier achieved an accuracy above 90%. In [8], the authors proposed a time-frequency and time-scale analysis of pulmonary signals. They also used k-NN and Multilayer Perceptron classifiers to distinguish crackling from non-crackling sounds and concluded that the SVM classifier achieved the best result, with a classification accuracy of 97.5%. Instantaneous kurtosis, a discriminating function, and entropy were used in [9] to classify respiratory sounds ranging from normal to continuous adventitious, including wheezing, stridor, and rhonchi. The authors achieved a high classification accuracy (between 97.7% and 98.8%). In [10], the authors


also classified normal and continuous adventitious signals, but only wheeze signals were used as continuous adventitious signals. Using the Mel-Frequency Cepstral Coefficients (MFCC) for feature extraction and the Gaussian Mixture Model (GMM) to classify the signals, the authors achieved a classification accuracy of 94.2%. Finally, a technique to obtain the time-frequency representation (TFR) of thoracic sounds was proposed by [11]. Using time-frequency patterns, the authors assessed the performance of ten TFRs for heart, adventitious, and normal lung sounds. After simulations, they concluded that the best TFR performance was found using the Hilbert–Huang spectrum (HHS).

The purpose of our paper is to develop a pattern recognition technique to classify lung sounds, including normal, fine crackle, coarse crackle, monophonic wheezes, and polyphonic wheezes. We used higher-order statistics (HOS) [13] to extract features, Genetic Algorithms (GAs) [14] to reduce dimensionality, and k-Nearest Neighbors [15] and Naive Bayes [16] classifiers to recognize lung sound events.

Our application of HOS to lung sound classification is based on the study reported in [12], which used the parametric approach of bispectrum estimation (the double Fourier transform of the third-order moment sequence) to reveal information about lung sounds, such as deviations from normality. After performing several experiments, the authors concluded that bispectrum-based methods are more suitable and robust than second-order statistic-based methods for lung sound analysis and characterization due to their superior performance in noisy environments. However, unlike [12], we use a reduced number of second-, third-, and fourth-order cumulants extracted from the sound time series as features. In addition to their noise immunity, cumulant-based features have the following advantages: (i) they are not dependent on moments of lower order, in contrast to moment-based features; and (ii) from a computational perspective they are simpler than polyspectrum-based features. As such, cumulants provide a compact and representative feature vector signature for each sound class.

The analysis of adventitious sounds is helpful in diagnosing pulmonary dysfunction. In contrast to other studies in this area, this paper aims to classify the five most important classes of lung sounds, including two types of crackles and wheezes that are linked to several lung diseases. The importance of this paper lies in providing a lung sound classification system, based only on HOS and a divide-and-conquer approach, that can be easily implemented in embedded systems. The system developed herein is capable of efficiently distinguishing several lung sounds.

The materials and methods proposed in this paper are described in Section 2 and results are shown in Section 3. Finally, Section 4 presents the conclusions and future work.

2. Materials and methods

Pattern recognition is a science that deals with the classification and description of objects in particular classes by observing their features [17].


Fig. 1 – Proposed pattern recognition system to classify lung sounds.

The pattern recognition system developed in this study follows the sequence shown in Fig. 1, and each step is described in detail below.

2.1. Input data

The real data set (respiratory sounds) used in this study was obtained from [1], where vesicular sounds, fine crackle, coarse crackle, monophonic and polyphonic wheezing sounds are available. The data set is composed of 36 samples: 8 vesicular respiratory cycles, 7 respiratory cycles with fine crackle, 13 with coarse crackle, 4 with monophonic wheezes, and 4 with polyphonic wheezes. The signals are sampled at 8 kHz.

The pre-processing stage and the construction of the data set are summarized in Fig. 2.

Fig. 2 – Lung sounds data set construction, a crackle example.

The available lung sounds (signals) were divided (splitting and windowing) into non-overlapping 320-sample segments, which represent about 40 ms, as shown in Fig. 2. This procedure was applied to the entire available data set. Since crackles are discontinuous sounds, and according to [18] and [8] the duration of a crackle is less than 20 ms, some of the 320-sample segments do not contain this type of adventitious sound, as noted in Fig. 2 (step 2). Thus, only the segments which contain crackles were selected for the next step, signal normalization. The other respiratory sounds, normal and wheezes, are continuous, which means segment selection was not required.

It is worth mentioning that the selection of segments containing a crackle is employed only during the design stage of the proposed pattern recognition system and is performed manually by a lung-sound expert. The objective of this step is to guarantee that the classifiers are trained using proper lung sound patterns. At the operation stage, only steps 1 and 3 are performed and the selection step is not necessary. Thus, non-overlapping segments of the lung sound under evaluation are extracted and normalized. If at least one extracted event (split segment) is a crackle, the lung sound will be properly classified as a coarse or fine crackle sound.

The next step (Fig. 2, step 3) involves signal normalization. The selected events were normalized to zero mean and unit variance. Thus, we obtained 102 vesicular sound signals (events), 57 fine crackle signals, 57 coarse crackle signals, 51 monophonic wheezing, and 51 polyphonic wheezing signals. Fig. 3 illustrates one event of each type of lung sound randomly selected from the input data set.
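To make the splitting and normalization steps concrete, the following is a minimal sketch assuming NumPy; the function names and the synthetic signal are illustrative only (the original work used MATLAB):

```python
import numpy as np

def split_into_segments(signal, segment_length=320):
    """Split a lung sound signal into non-overlapping segments.

    With an 8 kHz sampling rate, 320 samples correspond to 40 ms.
    Trailing samples that do not fill a whole segment are discarded.
    """
    n_segments = len(signal) // segment_length
    return signal[:n_segments * segment_length].reshape(n_segments, segment_length)

def normalize(segment):
    """Normalize a segment to zero mean and unit variance."""
    return (segment - np.mean(segment)) / np.std(segment)

# Example with a synthetic respiratory cycle (placeholder for a real recording).
fs = 8000                                # sampling frequency in Hz
cycle = np.random.randn(3 * fs)          # 3 s of dummy signal
events = [normalize(seg) for seg in split_into_segments(cycle)]
print(len(events), "events of", len(events[0]), "samples each")
```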

2.2. Feature extraction

Our study uses HOS as its feature extraction tool. Methods based on HOS are more suitable to deal with non-Gaussian

processes and nonlinear systems [13]. The major advantage of using HOS as a feature extraction method for pattern recognition is its noise immunity.

Let x be a random process composed of real random variables with zero mean. The second-, third-, and fourth-order cumulants are given, respectively, by [13]:

$$\mathrm{cum}(x_1, x_2) = E(x_1 x_2),$$
$$\mathrm{cum}(x_1, x_2, x_3) = E(x_1 x_2 x_3), \quad (1)$$
$$\mathrm{cum}(x_1, x_2, x_3, x_4) = E(x_1 x_2 x_3 x_4) - E(x_1 x_2)E(x_3 x_4) - E(x_1 x_3)E(x_2 x_4) - E(x_1 x_4)E(x_2 x_3),$$

where E is the expected value operator.

Suppose that x(t) is a stationary random process with zero mean and that its cumulants of order k, denoted by $C_{k,x}(\tau_1, \tau_2, \ldots, \tau_{k-1})$, where $\tau_1, \ldots, \tau_{k-1}$ are lags in time, are defined in terms of the signals $x(t), x(t + \tau_1), \ldots, x(t + \tau_{k-1})$. Setting $\tau_1 = \tau_2 = \tau_3 = \tau$, from (1), the second-, third-, and fourth-order cumulants can be rewritten as:

$$C_{2,x}(\tau) = E\{x(t)\,x(t + \tau)\},$$
$$C_{3,x}(\tau) = E\{x(t)\,x^2(t + \tau)\}, \quad (2)$$
$$C_{4,x}(\tau) = E\{x(t)\,x^3(t + \tau)\} - 3\,C_{2,x}(\tau)\,C_{2,x}(0).$$

For a discrete signal x[n], (2) can be approximated using the circular method proposed by [19]:

$$C_{2,x}(\tau) = \frac{1}{N}\sum_{n=0}^{N-1} x[n]\,x[\mathrm{mod}(n + \tau, N)],$$
$$C_{3,x}(\tau) = \frac{1}{N}\sum_{n=0}^{N-1} x[n]\,x^2[\mathrm{mod}(n + \tau, N)], \quad (3)$$
$$C_{4,x}(\tau) = \frac{1}{N}\sum_{n=0}^{N-1} x[n]\,x^3[\mathrm{mod}(n + \tau, N)] - \frac{3}{N^2}\left(\sum_{n=0}^{N-1} x[n]\,x[\mathrm{mod}(n + \tau, N)]\right)\left(\sum_{n=0}^{N-1} x^2[n]\right),$$

where $x \in \mathbb{R}^N$, $\tau = 0, 1, \ldots, N - 1$, and mod is the modulo operator (that is, the remainder after integer division).

Fig. 4 illustrates the mean values (±2 standard deviations) of the second-, third-, and fourth-order cumulants extracted from the available vesicular and adventitious events. It can be inferred that each class has a different signature pattern according to its cumulants, showing that HOS is a promising feature extraction tool for lung sounds. It is clear from Fig. 4 that although the extracted features of the lung sounds have different means, their large standard deviations can make


classification difficult. The classification task is even harder when the objective is to distinguish between coarse and fine crackle or between monophonic and polyphonic wheeze.
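As a concrete illustration of the circular estimates in Eq. (3), a minimal NumPy sketch is given below; the function and variable names are ours, not taken from the original MATLAB implementation:

```python
import numpy as np

def cumulants(x):
    """Estimate C2, C3 and C4 of Eq. (3) for all lags tau = 0, ..., N-1.

    x is assumed to be a zero-mean, unit-variance event (e.g. 320 samples).
    The circular (modulo-N) method is used, so np.roll implements
    x[mod(n + tau, N)].
    """
    x = np.asarray(x, dtype=float)
    N = len(x)
    c2, c3, c4 = np.empty(N), np.empty(N), np.empty(N)
    energy = np.sum(x ** 2)                      # sum of x[n]^2, reused by C4
    for tau in range(N):
        xs = np.roll(x, -tau)                    # x[mod(n + tau, N)]
        c2[tau] = np.mean(x * xs)
        c3[tau] = np.mean(x * xs ** 2)
        c4[tau] = np.mean(x * xs ** 3) - 3.0 * np.sum(x * xs) * energy / N ** 2
    return c2, c3, c4

# Build the 800-dimensional feature vector of one normalized event
# (160 lags of C2 plus 320 lags each of C3 and C4, as in Section 2.3).
event = np.random.randn(320)
event = (event - event.mean()) / event.std()
c2, c3, c4 = cumulants(event)
features = np.concatenate([c2[:160], c3, c4])
```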

2.3. Dimensionality reduction

After applying (3) to each lung sound event, 160 second-order cumulants, 320 third-order cumulants, and 320 fourth-order cumulants were obtained. Since this is a high dimensional feature space (800 features), it is prudent to reduce its dimensionality. The core purpose of this step is to build a compact and representative cumulant-based signature vector for each sound class. Thus, in order to reduce the computational cost of the classification system, we implemented Genetic Algorithms (GA) to reduce the feature space dimensionality and to maximize classification performance. Genetic Algorithms, a branch of evolutionary algorithms, are based on the biological process of natural evolution. While

the first study involving GA was presented by [14], this type of research became widely used in the 1980s. The basic operating principle of Genetic Algorithms can be summarized by the steps presented in Fig. 5. The first step of the algorithm is to randomly create an initial population of individuals. Second, the fitness of each population member is calculated based on its performance on an objective function; the higher the fitness value, the better its performance, thus increasing its chance of survival over generations. The best individuals are then selected using a selection procedure. After selection, standard crossover and mutation genetic operators are applied to the population. Subsequently, the best individual from the current population is included in the new one (elitism strategy). These procedures are repeated until the user-defined maximum number of generations is reached.

Fig. 3 – Lung sound events: (a) vesicular, (b) fine crackle, (c) coarse crackle, (d) monophonic wheeze, and (e) polyphonic wheeze.

Fig. 4 – Mean values (solid lines) and two standard deviations (dashed lines) of the second-, third-, and fourth-order cumulants of the classes: (a) vesicular, (b) fine crackle, (c) coarse crackle, (d) monophonic wheeze, and (e) polyphonic wheeze.

Fig. 5 – Steps of the Genetic Algorithm.

Fig. 6 – Individual evaluation.

In our study, the GA is designed to minimize a cost function that combines the number of features (cumulants) used by the classifiers and their classification performance. Thus, the evaluation of each individual is given as:

$$\mathrm{Evaluation} = \mathrm{ClassificationError} + 0.001 \times \mathrm{NumberCumulants}, \quad (4)$$

where the variable ClassificationError is the classification error of the evaluated individual in percentage and the variable NumberCumulants is the number of features (cumulants) used by the individual to perform its task. The latter variable was added to reduce the number of cumulants selected by the classification method. When the same classification error is obtained by distinct individuals, the one that selects fewer cumulants is given a better evaluation. Fig. 6 illustrates the individual evaluation process. Each individual of the GA population is a binary sequence with 800 positions, where each position indicates whether the respective cumulant is used. After this feature selection procedure, the classification is completed by a previously defined classification method (in this study we used k-NN and Naive Bayes classifiers) and, subsequently, fitness evaluation is performed (4). It is important to note that GA is performed only during the design stage. At the operation stage, only the selected features are extracted from the lung sound signal and presented to the classifier.
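A minimal sketch of this individual evaluation (Eq. (4)) is shown below; the function name, the use of a scikit-learn-style classifier with fit/predict methods, and the train/validation split are our assumptions, not part of the original implementation:

```python
import numpy as np

def evaluate_individual(individual, classifier, X_train, y_train, X_val, y_val):
    """Fitness of one GA individual following Eq. (4); lower is better.

    `individual` is a binary vector of length 800 in which bit i = 1 means
    that the i-th cumulant is fed to the classifier.
    """
    selected = np.flatnonzero(individual)
    if selected.size == 0:                 # degenerate chromosome: no features
        return np.inf
    classifier.fit(X_train[:, selected], y_train)
    predictions = classifier.predict(X_val[:, selected])
    error_percent = 100.0 * np.mean(predictions != y_val)
    return error_percent + 0.001 * selected.size

# Example (hypothetical), using a 5-NN classifier as in Section 2.4:
# from sklearn.neighbors import KNeighborsClassifier
# fitness = evaluate_individual(chromosome, KNeighborsClassifier(n_neighbors=5),
#                               X_train, y_train, X_val, y_val)
```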

2.4. Classification

The signals were classified using two different classification methods: k-Nearest Neighbors and Naive Bayes.

1) k-Nearest Neighbors: the k-Nearest Neighbors (k-NN) classifier was introduced in 1951 by Fix and Hodges [15]. This simple classification method finds, according to a distance function, the k nearest neighbors of the unknown object among the data set and uses the majority vote approach to predict the object label [20]. In our study, the Euclidean distance was used and k was chosen to be 5. The Euclidean distance $d(x, y)$ between two instances $x$ and $y$ is defined as follows:

$$d(x, y) = \sqrt{\sum_{i=1}^{n} (x_i - y_i)^2}. \quad (5)$$

According to [20], the basic steps of the k-NN algorithm are:

• calculate the distances between the unknown object and all previously labeled objects;
• select the k closest objects to the unknown object;
• use the majority vote approach to predict the unknown object class.

2) Naive Bayes classifier: the Naive Bayes classifier [16] learns probability distributions from data and classifies the unknown instance by choosing the class with the highest posterior probability. The class is chosen to satisfy:

$$\mathcal{H}_{MAP} = \arg\max_{\mathcal{H}_i \in \mathcal{H}} p(x|\mathcal{H}_i)\,p(\mathcal{H}_i), \quad (6)$$

where $p(\mathcal{H}_i)$ is the a priori probability and $p(x|\mathcal{H}_i)$ is the conditional probability density function of class $\mathcal{H}_i$, in which $x$ is the attribute value and $i$ corresponds to the class number. The conditional probability density function is based on the Gaussian distribution expressed by:

$$p(x|\mathcal{H}_i) = \frac{1}{(2\pi\sigma_i^2)^{1/2}} \exp\left(-\frac{(x - \mu_i)^2}{2\sigma_i^2}\right), \quad (7)$$

with $\mu_i$ being the mean value of class $\mathcal{H}_i$ and $\sigma_i^2$ being its variance.


Assuming uniform a priori probabilities, (6) is simplified such that

$$\mathcal{H}_{ML} = \arg\max_{\mathcal{H}_i \in \mathcal{H}} p(x|\mathcal{H}_i), \quad (8)$$

where $\mathcal{H}_{ML}$ is the Maximum Likelihood hypothesis.

After selecting the cumulants through the GA individual binary sequence, the k-Nearest Neighbors and Naive Bayes classifiers are used to classify the lung sounds into vesicular, fine crackle, coarse crackle, monophonic and polyphonic wheezes.
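For concreteness, minimal sketches of both classifiers are given below; these are generic textbook implementations under uniform priors, not the authors' MATLAB code, and the small variance offset is our addition for numerical stability:

```python
import numpy as np
from collections import Counter

def knn_predict(X_train, y_train, x, k=5):
    """k-NN with the Euclidean distance of Eq. (5) and majority voting."""
    distances = np.sqrt(np.sum((X_train - x) ** 2, axis=1))
    nearest = np.argsort(distances)[:k]
    return Counter(y_train[nearest]).most_common(1)[0][0]

class GaussianNaiveBayes:
    """Naive Bayes with per-class, per-feature Gaussian likelihoods (Eq. (7))."""

    def fit(self, X, y):
        self.classes = np.unique(y)
        self.mu = np.array([X[y == c].mean(axis=0) for c in self.classes])
        self.var = np.array([X[y == c].var(axis=0) + 1e-9 for c in self.classes])
        return self

    def predict(self, x):
        # With uniform priors the MAP rule of Eq. (6) reduces to the Maximum
        # Likelihood rule of Eq. (8); log-likelihoods are summed over the
        # (naively independent) features.
        log_like = -0.5 * np.sum(np.log(2 * np.pi * self.var)
                                 + (x - self.mu) ** 2 / self.var, axis=1)
        return self.classes[np.argmax(log_like)]
```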

3. Results and discussion

The input data set was divided into two subsets: a training set that included 70% of samples and a validation set with 30% of samples. The training set is composed of 71 vesicular samples, 39 fine crackle samples, 39 coarse crackle samples, 35 monophonic and 35 polyphonic wheeze samples. The validation set is composed of 31 vesicular, 18 fine crackle, 18 coarse crackle, 16 monophonic and 16 polyphonic wheeze samples. Using this data set, 40 different training and validation subsets were randomly created (holdout cross-validation procedure). They were generated to evaluate and compare the performance of the classifiers. We used MATLAB® software to run the tests.

With the aim of finding the best results, some trial-and-error tests were performed in order to configure the GA. The selection operator, crossover operator, mutation operator, and their probabilities were chosen based on GA performance analysis over 50 executions of the algorithm using a population of 100 individuals. Thus, the selected GA configuration was:

• Selection function: stochastic uniform.
• Mutation function: uniform.
• Crossover function: two points.
• Crossover probability: 0.8.
• Mutation probability: 0.25.
• Number of generations: 100.
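A rough sketch of the two variation operators named above is shown below; interpreting the uniform mutation as an independent per-gene bit flip is our assumption (the exact semantics of the MATLAB operators may differ), and the chromosomes are the 800-bit feature masks of Section 2.3:

```python
import numpy as np

rng = np.random.default_rng(0)

def two_point_crossover(parent_a, parent_b):
    """Two-point crossover: swap the segment between two random cut points."""
    i, j = sorted(rng.choice(len(parent_a), size=2, replace=False))
    child_a, child_b = parent_a.copy(), parent_b.copy()
    child_a[i:j], child_b[i:j] = parent_b[i:j], parent_a[i:j]
    return child_a, child_b

def uniform_mutation(chromosome, rate=0.25):
    """Uniform mutation: flip each bit independently with probability `rate`."""
    flip = rng.random(len(chromosome)) < rate
    return np.where(flip, 1 - chromosome, chromosome)

# One reproduction step for a pair of 800-bit chromosomes (feature masks)
a, b = rng.integers(0, 2, size=800), rng.integers(0, 2, size=800)
if rng.random() < 0.8:                         # crossover probability 0.8
    a, b = two_point_crossover(a, b)
a, b = uniform_mutation(a), uniform_mutation(b)
```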

To define which classifier would be used to classify the five types of lung sounds studied herein, the first attempt was to use the k-NN or the Naive Bayes classifier to classify all possible classes at the same time, using just one classifier with five possible outputs. After training the classifiers, the mean training and validation classification accuracies (and standard deviations) over the 40 different subsets were 92.3 ± 0.9% and 87.2 ± 1.4% for the k-NN classifier and 94.4 ± 0.9% and 88.7 ± 1.3% for the Bayes classifier. The mean number of cumulants was 316 ± 10 for the former and 223 ± 9 for the latter. In these cases the number of cumulants varied because the training set is different for each run of the algorithm (we used 40 different subsets), and the GA selects the features (cumulants) according to the classification accuracy and the number of cumulants used (see Eq. (4)). Therefore, the feature vector size is not fixed and is unlikely to be the same in all executions of the algorithm.

To verify the efficiency of the GA, we implemented the Fisher's Discriminant Ratio (FDR) [17] feature selection tool, considering its multiclass case and using the same data sets. The Bayes classifier was employed and we compared the results applying Tukey's Honestly Significant Difference (HSD) criterion, based on the Studentized range distribution (95% confidence level), on validation data. From Fig. 7, we can conclude that the GA achieved better results than FDR. Moreover, the GA is able to automatically define an appropriate number of features.

Fig. 7 – Comparison between GA and FDR as feature selection tools using Tukey's HSD criterion (95% confidence level) on validation data.

In order to improve the classification accuracy, a divide-and-conquer approach was implemented. The objective was to divide the problem into smaller sub-problems, with the hope that the solutions of the sub-problems would be easier to find. After some experimentation, we defined a classification tree (Fig. 8) in which a different classifier method is implemented in each node. We tested many combinations of classes and classification methods (k-NN or Bayes) to arrive at the final classification tree, which presented the best performance.

In the first step (node 1) the lung sound is classified into one of the three following classes: vesicular sound, wheeze (monophonic or polyphonic), or crackle (fine or coarse). The classification result is yielded by a k-NN classifier. This classifier used on average 223 ± 11 cumulants and its training and validation accuracies were 96.2 ± 1.8% and 92.1 ± 1.9%, respectively. Although the number of cumulants used by the classifier is still high, it is important to note that the feature space was considerably reduced (from 800 parameters to approximately 223) and the validation performance suggests the classifier achieved a good capability for generalization.

After the first step, if the lung sound was classified as a wheeze, a separate classifier (node 2) determined the sound classification as either a monophonic or a polyphonic wheeze.

Fig. 8 – Classification tree.


Table 1 – Confusion matrix of the classification performance on the validation data set.

                      Vesicular      Fine crackle   Coarse crackle   Monophonic wheeze   Polyphonic wheeze
Vesicular             94.4 ± 1.5%    0              0                5.6 ± 1.5%          0
Fine crackle          0              91.9 ± 2.8%    8.1 ± 2.8%       0                   0
Coarse crackle        0              9.2 ± 3.2%     90.8 ± 3.2%      0                   0
Monophonic wheeze     0              0              0                91.9 ± 2.3%         8.1 ± 2.3%
Polyphonic wheeze     0              0              0                9.7 ± 3.3%          90.3 ± 3.3%

A Naive Bayes classifier was designed to solve this classification problem (Bayes 1). This classifier used on average 166 ± 3 cumulants and its training and validation accuracies were 94.2 ± 1.7% and 91.1 ± 2.3%, respectively. If the lung sound was classified as a crackle in the first step, another classifier (node 3) was implemented to classify the crackle sound as either a fine or coarse crackle. This task was also performed by a Naive Bayes classifier (Bayes 2). This classifier used on average 167 ± 5 cumulants and its training and validation accuracies were 95.3 ± 1.4% and 91.4 ± 2.5%, respectively.

The overall training and validation classification accuracies of the built classification tree were 95.3 ± 1.6% and 92.2 ± 2.4%, respectively. The confusion matrix obtained using the validation sets is shown in Table 1. From Table 1 we can see that the proposed lung sound classification approach achieved good classification performance and outperformed the classifiers that do not use the divide-and-conquer strategy. Considering that only one classification tree would be implemented in a practical application, we selected one of the best sets of results, which was 98.1% classification accuracy on training data and 94.6% on validation data.

The number of cumulants required to perform the classification depends on the respiratory sound. If the sound was classified as a vesicular sound, only the k-NN classifier was used for evaluation and 210 cumulants were calculated. However, if the sound was classified as a monophonic or polyphonic wheeze, the k-NN and Bayes 1 classifiers were used and 314 cumulants were extracted. Finally, if the sound was classified as a fine or coarse crackle, the k-NN and Bayes 2 classifiers were employed and 328 cumulants were calculated. Thus, in the best case, 210 cumulants have to be extracted from the sound event and, in the worst case, 328 cumulants are required. Since the calculation of cumulants is a simple computational task [19], the implementation of the proposed classifier in an embedded system is feasible.

Comparing the results obtained in this paper with the results discussed in Section 1 is difficult, as each study classifies the respiratory sounds in a different way or uses a different data set. For instance, [2,8,10] use fewer classes. The classification accuracies obtained by [2], [8], and [10] were 90%, 97.5%, and 94.2%, respectively. However, it is important to note that the proposed approach achieved competitive results even using only one feature extraction tool (HOS). Furthermore, our study dealt with more lung sound classes than are usually found in research developed in this area.
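As a minimal sketch of the divide-and-conquer dispatch described above, the following function routes one event through the tree of Fig. 8; the node classifier objects, their label strings, and the omission of the per-node GA-selected feature indices are all illustrative assumptions:

```python
def classify_lung_sound(event_features, knn_node1, bayes_node2, bayes_node3):
    """Route one normalized event through the classification tree (Fig. 8).

    Each node classifier is assumed to be already trained on its own
    GA-selected subset of cumulants (feature-index selection omitted here).
    """
    coarse_label = knn_node1.predict(event_features)       # node 1: 3-way split
    if coarse_label == "vesicular":
        return "vesicular"
    if coarse_label == "wheeze":
        return bayes_node2.predict(event_features)          # node 2: mono vs poly
    return bayes_node3.predict(event_features)              # node 3: fine vs coarse
```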

4. Conclusion

In this paper, a divide-and-conquer approach to classify lung sounds was developed. HOS-based features


(second-, third- and fourth-order cumulants) were used in combination with feature selection and classification methods. The selected cumulants were shown to be effective in representing lung sounds.

Two different classification methods were implemented to build a tree-based system to classify lung sounds: the k-Nearest Neighbors and the Naive Bayes classifiers. The k-NN classifier is simple, effective, intuitive, and robust to noisy environments. Likewise, the Naive Bayes classifier is simple and effective; it can be trained quickly and classifies quickly. HOS were used for feature extraction and GAs were used to reduce dimensionality. GA individuals were evaluated using a fitness function designed to maximize the classification performance while reducing the feature space. Consequently, the configuration of the GA parameters was very important for achieving satisfactory results, since poorly chosen parameters could negatively influence the results.

For future work, we expect to implement lung sound pattern recognition on a portable device and to classify even more types of lung sounds. Moreover, other feature extraction techniques and classification methods can be studied to improve classification performance.

Conflict of interest

The authors declare no conflict of interest.

Acknowledgments

The authors would like to thank the Brazilian agencies FAPEMIG, CNPq and CAPES for financial support, and Dr. Evelyn R. Nimmo for editing the manuscript.

References

[1] S. Lehrer, Understanding Lung Sounds, 3rd ed., WB Saunders, Philadelphia, 2002.
[2] S. Içer, S. Gengeç, Classification and analysis of non-stationary characteristics of crackle and rhonchus lung adventitious sounds, Dig. Signal Process. 28 (2014) 18–27.
[3] American Thoracic Society, Updated nomenclature for membership relation, ATS News 3 (1977) 5–6.
[4] A.R.A. Sovijärvi, F. Dalmasso, J. Vanderschoot, L.P. Malmberg, G. Righini, S.A.T. Stoneman, Definition of terms for applications of respiratory sounds, Eur. Respir. Rev. 10 (2000) 597–610.
[5] G. Charbonneau, E. Ademovic, B.M.G. Cheetham, L.P. Malmberg, J. Vanderschoot, A.R.A. Sovijärvi, Basic techniques for respiratory sound analysis, Eur. Respir. Rev. 10 (2000) 625–635.
[6] R. Palaniappan, K. Sundaraj, N.U. Ahamed, Machine learning in lung sound analysis: a systematic review, Biocybern. Biomed. Eng. 33 (3) (2013) 129–135.
[7] J.L.M. Amaral, A.J. Lopes, A.C.D. Faria, P.L. Melo, Machine learning algorithms and forced oscillation measurements to categorise the airway obstruction severity in chronic obstructive pulmonary disease, Comput. Methods Programs Biomed. 118 (2) (2015) 186–197.
[8] G. Serbes, C.O. Sakar, Y.P. Kahya, N. Aydin, Pulmonary crackle detection using time–frequency and time–scale analysis, Dig. Signal Process. 23 (3) (2013) 1012–1021.
[9] F. Jin, F. Sattar, D.Y.T. Goh, New approaches for spectro-temporal feature extraction with applications to respiratory sound classification, Neurocomputing 123 (10) (2014) 362–371.
[10] M. Bahoura, Pattern recognition methods applied to respiratory sounds classification into normal and wheeze classes, Comput. Biol. Med. 39 (9) (2009) 824–843.
[11] B.A. Reyes, S. Charleston-Villalobos, R. González-Camarena, T. Aljama-Corrales, Assessment of time–frequency representation techniques for thoracic sounds analysis, Comput. Methods Programs Biomed. 114 (3) (2014) 276–290.
[12] L.J. Hadjileontiadis, S.M. Panas, Higher-order statistics: a robust vehicle for diagnostic assessment and characterisation of lung sounds, Technol. Health Care J. 5 (5) (1997) 359–374.
[13] J.M. Mendel, Tutorial on higher-order statistics (spectra) in signal processing and system theory: theoretical results and some applications, Proc. IEEE 79 (1991) 278–305.
[14] J.H. Holland, Adaptation in Natural and Artificial Systems, The University of Michigan Press, 1975.
[15] E. Fix, J.L. Hodges, Discriminatory Analysis, Nonparametric Discrimination: Consistency Properties, USAF School of Aviation Medicine, Randolph Field, TX, 1951.
[16] T. Mitchell, Machine Learning, 1st ed., McGraw-Hill Education, New York, 1997.
[17] S. Theodoridis, K. Koutroumbas, Pattern Recognition, 4th ed., Academic Press, USA, 2009.
[18] A.R.A. Sovijärvi, L.P. Malmberg, G. Charbonneau, J. Vanderschoot, F. Dalmasso, C. Sacco, M. Rossi, J.E. Earis, Characteristics of breath sounds and adventitious respiratory sounds, Eur. Respir. Rev. 10 (2000) 591–596.
[19] M.V. Ribeiro, C.A.G. Marques, C.A. Duque, A.S. Cerqueira, J.L.R. Pereira, Detection of disturbances in voltage signals for power quality analysis using HOS, EURASIP J. Adv. Signal Process. (2007).
[20] V. Garcia, E. Debreuve, F. Nielsen, M. Barlaud, K-nearest neighbor search: fast GPU-based implementations and application to high-dimensional feature matching, in: 17th IEEE International Conference on Image Processing (ICIP), 26–29 September 2010, pp. 3757–3760.