Multi-objective binary DE algorithm for optimizing the performance of Devanagari script-based P300 speller

BBE 200 1–10 biocybernetics and biomedical engineering xxx (2017) xxx–xxx Available online at www.sciencedirect.com ScienceDirect journal homepage: ...

Download PDF

2MB Sizes 0 Downloads 44 Views

Report

PDF Reader
Full Text

BBE 200 1–10 biocybernetics and biomedical engineering xxx (2017) xxx–xxx

Available online at www.sciencedirect.com

ScienceDirect journal homepage: www.elsevier.com/locate/bbe 1 2 3

Original Research Article

Multi-objective binary DE algorithm for optimizing the performance of Devanagari script-based P300 speller

4 5 6

7 8 9 10

Q1

Rahul Kumar Chaurasiya a,*, Narendra D. Londhe b, Subhojit Ghosh b a

Department of Electronics and Telecommunication Engineering, National Institute of Technology, Raipur, Raipur, PIN-492010, India b Department of Electrical Engineering, National Institute of Technology, Raipur, India

article info

abstract

Article history:

P300 speller-based brain-computer interface (BCI) allows a person to communicate with a

Received 5 October 2016

computer using only brain signals. In order to achieve better reliability and user continence,

Received in revised form

it is desirable to have a system capable of providing accurate classiﬁcation with as few EEG

9 March 2017

channels as possible. This article proposes an approach based on multi-objective binary

Accepted 20 April 2017

differential evolution (MOBDE) algorithm to optimize the system accuracy and number of

Available online xxx

EEG channels used for classiﬁcation. The algorithm on convergence provides a set of paretooptimal solutions by solving the trade-off between the classiﬁcation accuracy and the

Keywords:

number of channels for Devanagari script (DS)-based P300 speller system. The proposed

BCI

method is evaluated on EEG data acquired from 9 subjects using a 64 channel EEG acquisition

Devanagari

device. The statistical analysis carried out in the article, suggests that the proposed method

Multi-objective Optimization

not only increases the classiﬁcation accuracy but also increases the over-all system reliabil-

Binary DE

ity in terms of improved user-convenience and information transfer rate (ITR) by reducing

P300-speller

the EEG channels. It was also revealed that the proposed system with only 16 channels was

SVM

able to achieve higher classiﬁcation accuracy than a system which uses all 64 channel's data for feature extraction and classiﬁcation. © 2017 Nalecz Institute of Biocybernetics and Biomedical Engineering of the Polish Academy of Sciences. Published by Elsevier B.V. All rights reserved.

13 11 12 14 15 16

17 18 19

1.

Introduction

P300 speller-based brain-computer interface (BCI) allows a person to communicate with a computer using only brain signals [1]. The communication does not require any physical

movements, and is particularly useful for patients suffering from severe motor disabilities but having cognitive abilities [2]. The most widely used P300 speller works in an odd-ball paradigm-based experimental environment [1,3]. In the oddball experiment, the subjects are randomly presented with two types of events, one of which rarely occurs (the odd-ball). The

* Corresponding author at: Department of Electronics and Telecommunication Engineering, National Institute of Technology, Raipur, Raipur-C.G, PIN-492010, India. E-mail addresses: [email protected] (R.K. Chaurasiya), [email protected] (N.D. Londhe), [email protected] (S. Ghosh). http://dx.doi.org/10.1016/j.bbe.2017.04.006 0208-5216/© 2017 Nalecz Institute of Biocybernetics and Biomedical Engineering of the Polish Academy of Sciences. Published by Elsevier B.V. All rights reserved. Please cite this article in press as: Chaurasiya RK, et al. Multi-objective binary DE algorithm for optimizing the performance of Devanagari script-based P300 speller. Biocybern Biomed Eng (2017), http://dx.doi.org/10.1016/j.bbe.2017.04.006

20 21 22 23 24 25

BBE 200 1–10

2 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85

biocybernetics and biomedical engineering xxx (2017) xxx–xxx

rare events generates a P300 event related potential (ERP) in the recorded electroencephalogram (EEG). Farwell and Donchin ﬁrst developed a P300 speller for English alphabetic script using a 6 6 matrix [1]. All rows and columns of the matrix were randomly intensiﬁed and the subject was asked to focus on the target character (that one, which he wants to communicate). The speller is commonly known as row/column (RC) paradigm-based speller. The multiple trails of the intensiﬁcations were repeated and the averaged signals were used to improve the signal-to-noise ratio (SNR). Several methods have been reported to further improve the system reliability and information transfer rate (ITR). The method includes improvement in display paradigm by changing matrix size, background color, font size, and inter stimulus interval [4–7]. Different classiﬁcation methods such as SWLDA, Baysian linear discriminant analysis (BLDA), support vector machines (SVMs), and artiﬁcial neural networks (ANN), have been successfully applied for classiﬁcation in P300 spellers [8–15]. Although the RC paradigm is still the most widely used paradigm for P300 spellers, experiments with single character (SC) paradigm [16], region-based (RB) paradigm [17], and check board (CB) paradigms [18] have also been tried in recent years. The performance of classiﬁers used for detection of P300 ERPs signiﬁcantly depends on the choice of features and hence only the most discriminative features should be ideally used for classiﬁcation. However, in the case of P300 spellers, the channel set providing the most relevant information varies from subject to subject [19,20]. For an EEG device having 64 EEG channels, there are total 264 possible subsets and practically it is impossible to select the best channel subset using exhaustive search. Hence, different channel selection methods such as channel selection using recursive channel elimination [21], jump-wise regression [22], Gibbs sampling [23], multi-ganglion [24], particle swarm optimization (PSO) [25] and genetic algorithm (GA), have been proposed for improving the classiﬁer performance by selecting the best channel subset. Although, most of research work on the use of visual P300 spellers to aid communication has concentrated on languages that are written with English alphabetic script, a P300 speller systems capable of communicating text in Devanagari script (DS) has been developed in [15], which aimed at improving the system reliability by maximize the classiﬁcation accuracy. A binary differential evolution (DE)-based optimization method was used for selection of channel subset with a single objective of achieving maximum classiﬁcation accuracy. In the proposed study, we have extended the work of [15] for improving the user convenience and ITR, in addition to increasing the classiﬁcation accuracy of DS-based P300 speller system. An improved ITR and user-convenience in addition to accuracy is expected to further improve the reliability of the system. The ITR and user-convenience are directly related to the number of channels used for data acquisition. In this regard, the present work aims at the multiple objectives of improved classiﬁcation accuracy and reduced number of channels. Two different approaches are generally used for solving multi-objective optimization problem, the ﬁrst approach combine all the objective functions into a single composite objective function by assigning different

weightages to different objectives, while the second approach determine an entire pareto-optimal solution set. Considering the limitations related to the selection of proper weightage function, which is not known a priori, the second approach is preferable as it provides a set of pareto-optimal solutions and user can decide about which solution he wants to use, based on the priorities of the objectives. In this article, a multiobjective binary DE (MOBDE) algorithm is proposed for ﬁnding the pareto-optimal solution set for solving the trade-off between number of channels and classiﬁcation accuracy. Due to the requirement of lesser number of algorithm speciﬁc parameters and simplicity of the algorithm, MOBDE has been preferred over other optimization approaches. The rest of the paper is organized as follows: The description of signal acquisition procedure and the dataset is given in Section 2. The character detection mechanism is also described in Section 2. The framing of the multi-objective problem of accuracy maximization & channel minimization as an optimization problem and its solution using MOBDE algorithm is described in Section 3. The results are presented in Section 4. With discussions in Section 5, the study is concluded in Section 6.

2.

Materials and methods

2.1.

The dataset

86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109

Total 9 healthy volunteer subjects of age range 21–29 (mean 26.5) were used to collect EEG responses. All subjects were able to communicate using DS. The EEG responses were recorded with a BrainAmp DC hardware which was equipped with a 64channel actiCAP. The EEG responses were collected at 500 Hz sampling frequency. At the time of recording, a digital bandpass ﬁlter of 1 to 250 Hz was also applied. General purpose BCI2000 software was used for stimulus presentation and data collection using DS-based display paradigm [26]. The DS paradigm used for stimulation is shown in Fig. 1(a) and the channel conﬁguration used for data collection is shown in Fig. 1(b). Total 100 characters were presented in 20 runs as target characters for each subject. In a particular run, for each character, the rows and the column of the display matrix were randomly and successively intensiﬁed for 120 ms, the followed by 80 ms non-intensiﬁcation period. In order to enhance the SNR of the acquired EEG signals, the sequences of intensiﬁcations of 16 rows and columns were repeated 15 times for each character. After 15 trials of one character, the recording for the next character was started with a gap of 10 s between the two characters. The 10 second gap was incorporated to make the subjects relax and to ensure that they can comfortably ﬁnd the next target character in the display matrix. Total 240 responses (16 rows/columns 15 trials) were recorded per character. Two out of 16 responses were supposed to contain P300 ERPs.

110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135

2.2.

136

Preprocessing and feature extraction

The methodology for preprocessing and feature extraction in the present work has been adopted from reported works on P300 speller [2,27,28]. The EEG samples posterior to 600 ms

Please cite this article in press as: Chaurasiya RK, et al. Multi-objective binary DE algorithm for optimizing the performance of Devanagari script-based P300 speller. Biocybern Biomed Eng (2017), http://dx.doi.org/10.1016/j.bbe.2017.04.006

137 138 139

BBE 200 1–10

3

biocybernetics and biomedical engineering xxx (2017) xxx–xxx

Fig. 1 – (a) The 8 T 8 matrix containing 13 vowels, 37 consonants, and 10 digits of DS. Total 4 special characters were also used in stimulus presentation. (b) A 64 channel configuration for acquiring EEG responses. Channels AFz and FCz were also used as ground and reference, respectively.

140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158

from the starting of each ﬂashing were extracted from all 64 channels. As the interest was to capture the P300 ERPs, the selected time window of 600 ms was sufﬁcient to capture the relevant information. The extracted samples were passed through a band-pass ﬁlter of cut-off frequencies between 1 and 10 Hz. The ﬁlter samples were than decimated with a frequency of 10 Hz. At this stage, each EEG response consists of 6 samples per channel. For these samples, normalization was carried out independently for each channel. Afterwards, feature vectors were formed by concatenation of the signal samples of all 64 channels. Thus for a single character, there are total 240 feature vectors, each of size 384 (6 samples 64 channels). In these 240 feature vectors, 30 are from class +1 (1 row and 1 column per trial 15 trials) and are expected to contain P300 ERPs. The rest of the feature vectors are in class 1. Sample variations of the average of the class +1 and 1 EEG signals for all nine subjects for channel CPz are depicted in Fig. 2. A peak in class +1 signal near 300 ms shows that the P300 ERPs were properly captured in the oddball experiment.

159

2.3.

f ðxÞ ¼

(1)

171 172 173 174 175 176 177

Srjc

J 1X ð jÞ ¼ f xrjc J j¼1

(2)

where f(xr|c) is the score assigned to features xr|c of a give row/column (r/c), with J trials (J can be chosen to be between 1 and 15). The character which is common in predicted row and column is the target character.

3. Optimizing for accuracy and number of channels

Classiﬁcation

The classiﬁcation task of predicting the row and column containing the P300 ERP is a two-class classiﬁcation problem. Because of its better generalization capability over other machine learning algorithms [29], an SVM classiﬁer was employed for this task. From the training data points Xi, i = 1, 2 . . . ..N (with respective class labels yi =1 or + 1, i = 1, 2 . . . ..N), SVM learns maximizes-margin hyper-plane while also trying to minimize the total errors in classiﬁcation. The learned separating hyper-plane then can be used to assign a score to a new test data point X is represented as:

170

i¼1

where li, i = 1, 2 . . . ..N are the Lagrange's multipliers. Since in the presented work 15 trials have been recorded for each character, the rows/column are predicted based on the scores obtained by different rows/columns, as per the formulation of Eq. (2).

3.1. 160 161 162 163 164 165 166 167 168 169

N X yi li ðX:Xi Þ þ b

Binary differential evolution

DE is a population-based evolutionary optimization method. As compared to other evolutionary approaches, it is simple yet effective algorithm and requires only two control parameters [30]. The population of DE consists of continues valued ﬂoating-point encoded vectors. The population at iteration 0 is generated as a group of NP random vectors. Mutation, crossover and selection operation are generally performed in DE for updating the positions for coming iterations. Suppose we have NP, D-dimensional target vector xti ; i ¼ 1; 2; . . .NP at

Please cite this article in press as: Chaurasiya RK, et al. Multi-objective binary DE algorithm for optimizing the performance of Devanagari script-based P300 speller. Biocybern Biomed Eng (2017), http://dx.doi.org/10.1016/j.bbe.2017.04.006

179 178 180 181 182 183 184 185 186

187 188 189 190 191 192 193 194 195

BBE 200 1–10

4

biocybernetics and biomedical engineering xxx (2017) xxx–xxx

S2

Normalized Amp.

S1

1

1

0

0

0

-1

0

100

200

300

-1

0

100

200

300

1

1

0

0

0

0

100

200

300

-1

0

100

S7

200

300

200

300

0

200

300

100 200 Sample No.

300

100

1

Class +1 Class -1 0

0

100 200 Sample No.

-1

S9

1

0

100

S8

1

-1

0

S6

1

-1

Normalized Amp.

-1

S5

S4 Normalized Amp.

S3

1

300

-1

0

0

100 200 Sample No.

300

-1

0

Fig. 2 – Averaged Sample variations of recorded EEG signals (average over all the signals for first character) of class +1 and class S1 for channel CPz. A peak in class +1 signal near 300 ms shows that the P300 ERPs were properly captured in the oddball experiment. Class S1 signal are shown by dotted red.

196 197 198

iteration number t, then, in mutation stage, a mutation vector for next iteration is generated for each bit of every target utþ1 ij vector as

199

utþ1 ¼ xtr1;j þ F xtr2;j xtr3;j ij

(3)

200 201 202 203

where index of dimensionality j varies from 1 to D. F is a positive constant and xr1,j, xr2,j and xr3,j are three bits of randomly chosen individuals with indexes r1 6¼ r2 6¼ r3 6¼ i.

204 205 206

In crossover stage, a trial individual vi is generated by crossing over the target vector xi with the corresponding mutant vector ui as follows

207

( vtþ1 ij

¼

utþ1 ij ; xtij ;

if ðrand jCRÞorð j ¼ randðiÞÞ otherwise

better. Otherwise, the target individual xti is carried forward for next stage. Mathematically, the target individual for next iteration is selected as

(4)

208 209 210 211

where CR is the crossover probability between (0,1); rand j are stochastic random number uniformly distributed within ½0; 1Þ; rand(i) are random integers within 1, 2.., D.

212 213

In selection stage, the trial individual vtþ1 replaces the i , if its ﬁtness value is target individual xti for generation of xtþ1 i

xtþ1 i

¼

tþ1 > f xti vtþ1 i ; if f vi xti ; otherwise

214 215 216 217

(5)

A number of improved variants of DE have been proposed in recent years [31]. It has also been reported that DE can perform better than other optimization for real world problems [32]. However, the standard DE operates in continuous space and is not suitable for solving binary optimization problems. Hence, its binary versions have also been proposed to solve such problems [33–35]. In BDE algorithms, each bit of target vector is represented by either a 0 or 1. The methodology used to update the population in BDE is similar to DE; involving crossover after mutation, and ﬁnally selection operations. In order to ensure that the target vector consists of only 0 s and 1 s, a probability estimation operator is used to generate mutant vector in the mutation stage as follows

Please cite this article in press as: Chaurasiya RK, et al. Multi-objective binary DE algorithm for optimizing the performance of Devanagari script-based P300 speller. Biocybern Biomed Eng (2017), http://dx.doi.org/10.1016/j.bbe.2017.04.006

219 218 220 221 222 223 224 225 226 227 228 229 230 231 232 233

BBE 200 1–10

5

biocybernetics and biomedical engineering xxx (2017) xxx–xxx

234

8 1 tþ1 > > 3 ¼2 > P xij > 2bðMO0:5Þ > > < 6 1 þ 2F 7 41 þ e 5 > > > > > > : MO ¼ xt þ F xt xt r1;j

235 236 237 238 239 240

r2;j

Initialize 100 target vectors (6)

Decode target vector to obtain channel subset

r3;j

where b is a positive valued bandwidth factor. The mutant operator MO in BDE is analogous to the mutation operation used in standard DE as in Eq. (3). A binary-coded mutant vector uijtþ1 for target vector xtij is generated as ( uijtþ1 ¼

1;

if randð ÞP xtþ1 ij

0;

otherwise

Obtain the feature subset corresponding to selected channels

Train SVM with training data

Repeat 5 times for 5-fold CV

(7)

241 242 243 244 245 246 247 248 249 250

The crossover operation and selection process in binary DE are same as in Eqs. (4) and (5) respectively. The BDE has one more algorithm parameters (i.e. bandwidth factor b) in addition to two algorithm speciﬁc parameters F and CR used in DE. In this article, the method of Wang et al. has been adopted for cannel selection using MOBDE [34]. The values for algorithm speciﬁc parameters were chosen to be same as suggested in [34].

251

3.2.

252 253 254 255 256 257 258 259 260 261 262 263 264 265 266 267 268 269 270 271 272

In this article, the reduction in number of channels and improvement in the classiﬁcation accuracy has been framed as a binary optimization problem in. A binary coded target vector in the population represents a set of channel. As the dataset was recorded from 64 channels, a 64-dimensional binary vector xi ¼ xi;1 ; xi;2 ; ::; xi;64 constituted the target vector. Total 100 target vectors were randomly generated as initial population. In target vector xi, at iteration number t, the features for classiﬁcation are extracted from channel j (1 ≤ j ≤ 64), if xtij holds a value '1'. The update mechanism of the target vectors for the next iteration is same as described in Section 3.1. Each target vector in the population is a candidate solution and represents a possible combination of set of channels. The optimization process tries to search for a solution that maximizes the classiﬁcation accuracy and at the same time minimizes the number of channels. In multi-objective optimization frame work, the two objective functions, viz. maximizing the classiﬁcation accuracy and minimizing the number of channels has been mathematically formulated as Objective 1: maxA(xi) Objective 2: minC(xi)

273

s:t: xi x

274 275 276 277 278 279 280 281 282 283 284 285 286 287 288

In objective 1, A(xi) represents the accuracy of character detection, obtained from the SVM-based classiﬁcation approach while choosing the channels for target vector xi. In objective 2, C(xi) is the count of number of 1 s present in target vector (xi). In Eq. (8), the constraint ‘‘xi x’’ indicates that target vectors xis contain a channel-set which is subset of x containing all 64 channels (i.e. all the entries in x are 1). The MOBDE algorithm was applied for each subject separately. It was observed that for each subject, there were no changes in the accuracy and number of channels after 40 iterations. Fig. 3 depicts the ﬂow of the proposed methodology, involving SVM-based character detection and MOBDE-based channel selection method.

Obtain the classification accuracy with test data

Evaluate the fitness of the two objective functions

Update the target vectors using MOBDE algorithm

Find the pareto-optimal solutions

Multi-objective optimization framework

(8)

Iteration no.>40

No

Yes End Fig. 3 – Flow of the proposed methodology.

4.

Experimental results

289

In BCI, channel selection is usually an off-line process where the data acquired from all channels is used to select the optimal channel subset. Afterwards, the optimal channel subset is used for real-time applications. The contribution of the proposed MOBDE-based channel selection method for minimizing the number of channels and maximizing the classiﬁcation accuracy has been evaluated in this section, which has been further divided into the following subsections: The classiﬁcation accuracies obtained using SVM classiﬁers are presented in ﬁrst subsection. The results of proposed MOBDE-based method for optimal channel selection have been presented in second subsection. The statistical analysis carried out in the third subsection reﬂects the effectiveness of the proposed channel selection method.

290 291 292 293 294 295 296 297 298 299 300 301 302 303

4.1.

Results for character classiﬁcation using SVM

304

Table 1 depicts the accuracy of character detection obtained for 5, 10 and 15 trials using SVM classiﬁer (for all 64 channels). A 5-fold cross validation methodology was adopted for training and testing the classiﬁer. Average accuracy of 65.3%, 80.9% and 86.9% is obtained for 5, 10 and 15 trials, respectively.

305 306 307 308 309 310

Please cite this article in press as: Chaurasiya RK, et al. Multi-objective binary DE algorithm for optimizing the performance of Devanagari script-based P300 speller. Biocybern Biomed Eng (2017), http://dx.doi.org/10.1016/j.bbe.2017.04.006

BBE 200 1–10

6

biocybernetics and biomedical engineering xxx (2017) xxx–xxx

Table 1 – Percentage classification accuracy obtained using SVM classifier for the 5, 10 and 15 trials of the dataset for all 64 channels. Subject number

minimize the number of channel. The set of pareto-optimal solutions encountered by the target vectors have been shown as a pareto-front diagram in Fig. 4. The pareto-front diagram access the trade-off relationship between the number of channels and the classiﬁcation accuracy. The pareto-front solutions shown in Fig. 4 represent a boundary at which an improvement of classiﬁcation accuracy necessarily requires an increment in number of channels or, conversely, an attempt for reduction in the number of channels results in loss of accuracy. For 5, 10 and 15 trials, the classiﬁcation accuracy obtained and the corresponding numbers of channel selected by the pareto-optimal solutions points with maximum accuracy are presented in Table 2. It can be observed from Fig. 4 that a total of 35 paretooptimal solution points were selected for 15 trials across all nine subjects. In these points, each of the 64 channels might have got selected between 0 to 35 times. Similarly, a total of 37 and 38 pareto-optimal solution points were selected for 5 and 10 trials, respectively. In order to analyze the locations of the channels that can provide more discriminatory information and better accuracy, the topographical map of channelfrequency among all the pareto-optimal solutions were plotted. Fig. 5 shows the topographical maps (considered over

Accuracy with different number of trials

1 2 3 4 5 6 7 8 9 Average

5

10

15

68 59 70 56 62 80 56 60 76 65.2

84 74 82 79 78 87 72 80 86 80.2

85 82 91 88 84 91 88 89 90 87.6

311

4.2.

312 313 314 315

The MOBDE was used with SVM-based character detection method for optimum channel selection using 5, 10 and 15 trials. As mentioned in Section 3.2, the optimization objectives were to maximize the accuracy of character detection and

Results for optimization using MOBDE

Subject 1

Subject 2

Channel numbers

64 visited places Pareto front

48

64 visited places Pareto front

48 32

32

16

16

16

75

80

85

90

95

0 100 70

75

85

90

95

0 70

visited places Pareto front

visited places Pareto front

48

32

16

16

16

80

85

90

95

100

0 70

75

Subject 7

80

85

90

95

visited places Pareto front

visited places Pareto front

48

32

16

16

16

80 85 90 Acc uracy (%)

75

95

100

0 70

100

80

85

90

95

100

75

80 85 90 Acc uracy (%)

95

100

visited places Pareto front

48

32

75

95

64

32

0 70

90

Subject 9

64

48

0 100 70

Subject 8

64

85

visited places Pareto front

48

32

75

80

Subject 6

32

0 70

75

64

64

48

100

Subject 5

Subject 4 64 Channel numbers

80

visited places Pareto front

48

32

0 70

Channel numbers

Subject 3

64

95

100 70

75

80 85 90 Acc uracy (%)

Fig. 4 – Illustration of the pareto-optimal solutions found by the MOBDE algorithmin for different subjects. The horizontal and vertical axes denote the classification accuracy and the number of selected channels, respectively. The 'visited places' points show the fitness of all the positions visited by the target particles. The 'pareto front' points represent the positions that belong to the pareto-optimal solution set. Please cite this article in press as: Chaurasiya RK, et al. Multi-objective binary DE algorithm for optimizing the performance of Devanagari script-based P300 speller. Biocybern Biomed Eng (2017), http://dx.doi.org/10.1016/j.bbe.2017.04.006

316 317 318 319 320 321 322 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337 338

BBE 200 1–10

7

biocybernetics and biomedical engineering xxx (2017) xxx–xxx

Table 2 – Percentage classification accuracy (A) and the corresponding number of channels selected (Nc) for 5, 10 and 15 trials obtained by pareto-optimal solution with maximum classification accuracy. Subject No.

Accuracy (A) and the corresponding number of channels selected (Nc) for 5, 10 and 15 trials obtained by pareto-optimal solution with maximum classiﬁcation accuracy 5

1 2 3 4 5 6 7 8 9 Average

10

15

A

Nc

A

Nc

A

Nc

75 70 83 70 68 88 73 70 87 76.0

28 28 27 32 37 22 29 22 30 28.3

89 83 87 89 85 93 87 87 91 87.9

28 33 26 36 33 22 28 20 30 28.4

96 90 96 93 89 94 92 90 95 92.8

29 24 19 33 34 21 25 21 29 26.1

Fig. 5 – The topographical maps (common for all 9 subjects) showing the channel selection frequency in a total of 37, 38 and 35 pareto-optimal solution points for 5, 10 and 15 trials, respectively.

339 340 341 342 343 344 345 346 347 348 349 350 351 352 353 354 355 356

all 9 subjects) of the frequency of channel selection for 5, 10 and 15 trials. From Fig. 5, it can be observed that the parietal (P), occipital (O) and central (C) regions on scalp are most frequently selected. However, the optimal channel selection is a subject dependent problem i.e. there is inter subject variability in the optimal set [19,20,36]. Hence, in the present work, the topographical maps for individual subjects have also been analyzed. In order to ensure a sufﬁcient number of data points are available for analysis, the pareto-optimal points obtained with 5, 10 and 15 trails were considered for each subject. As from Fig. 4, the total number of pareto-optimal points may vary in different solutions. After the experimentations for the present case, in total 110 pareto-optimal points (37 for 5 trials, 38 for 10 trials, and 35 for 15 trials) 12, 9, 13, 15, 8, 13, 14, 12 and 14 points were present in the pareto-optimal solution set of subject 1 to 9, respectively. The subject-wise topographical maps are shown in Fig. 6.

357

4.3.

358 359 360 361 362 363 364

In order to validate the statistical signiﬁcance of the proposed method at different trials, the Friedman test was employed [37]. The test was applied for two different methods (i.e. SVM with all channels and SVM with MOBDE-based channel selection) for 5, 10 and 15 trials across all 9 subjects. The maximum classiﬁcation accuracy values were used for statistical comparison. Under the test, the methods were

Statistical analysis

ranked based on the results for different subjects. The p-value was obtained as 4.48 108, which was much lesser than 0.05. Hence, the null hypothesis (there is no signiﬁcant difference among the different methods) was rejected. Afterwards, a critical difference (CD) of 2.5135 was computed using a post hoc Nemenyi test [38]. The results of the post hoc test are depicted in Fig. 7. The average ranks of two methods with different trials are shown in increasing order on horizontal axis. Two methods connected by a colored line (below the horizontal axis) shows that the there is no signiﬁcant difference between them.

365 366 367 368 369 370 371 372 373 374 375

5.

376

Discussions

From the statistical analysis presented in the previous section, it can be concluded that the classiﬁcation accuracy increases with number of trials. The improvement in the system performance with respect to number of trials can be explained by Eq. (2), where the averaging over number of trials (j = 1 to J) is used to decide the target character. The averaging over the number of trials supresses the noise component and increases the SNR. However, increasing the number trials decreases ITR and user convenience. Hence, for better ITR and more userconvenience, it is required to decrease the number of channels in addition to increase the classiﬁcation accuracy. In this regard and to further enhance the system efﬁciency, a MOBDE-based optimal channel selection method was

Please cite this article in press as: Chaurasiya RK, et al. Multi-objective binary DE algorithm for optimizing the performance of Devanagari script-based P300 speller. Biocybern Biomed Eng (2017), http://dx.doi.org/10.1016/j.bbe.2017.04.006

377 378 379 380 381 382 383 384 385 386 387 388 389

BBE 200 1–10

8

biocybernetics and biomedical engineering xxx (2017) xxx–xxx

Fig. 6 – The topographical maps showing the channel selection frequency in pareto-optimal solution points for each subject, separately.

390 391 392 393 394 395 396 397 398

proposed. The aim of the method was used to optimize the trade-off between the number of channels and classiﬁcation accuracy. The effectiveness of the proposed technique can be veriﬁed by Fig. 7 which provides the statistical comparison of different approaches. From Fig. 7, it is also evident the average rank of the proposed MOBDE-based approach with 15 trials was 1, i.e. the approach performs best for all subjects. From Fig. 7, it can be seen that the average rank of MOBDE with 10 trials was better than the approach which used 15 trials of all

Fig. 7 – Visualization of post hoc Nemenyi test for showing the effectiveness of the proposed channel selection approach.

64 channels. This suggests that the proposed approach not only increase the reliability (by increasing the accuracy) and user convenience (by reducing the channels), but also increases the ITR by reducing the number of trials. It can also be observed from Fig. 6 that for given number of trials, there is no signiﬁcant difference between the two approaches. Further, the proposed method provides a set of paretooptimal solutions for optimizing the trade-off between the number of channels and classiﬁcation accuracy. So, based on the priority of objectives, a user can choose any solution from the pareto-front for optimal performance. For example, If minimizing number of channels is the main objective, then considering the average over the pareto-optimal solutions corresponding to the minimum number of channels (across all subjects), an average accuracy of 88% can be achieved with only 16 channels (Fig. 4). As the maximum accuracy of 87.6% was achieved without channel selection, it can be concluded that the proposed system with only 16 channels was able to produce better classiﬁcation accuracy then a system which uses all 64 channel's data for feature extraction and classiﬁcation.

Please cite this article in press as: Chaurasiya RK, et al. Multi-objective binary DE algorithm for optimizing the performance of Devanagari script-based P300 speller. Biocybern Biomed Eng (2017), http://dx.doi.org/10.1016/j.bbe.2017.04.006

399 400 401 402 403 404 405 406 407 408 409 410 411 412 413 414 415 416 417 418 419

BBE 200 1–10 biocybernetics and biomedical engineering xxx (2017) xxx–xxx

420 421 422 423 424 425 426 427 428 429 430 431 432 433

From the topographical maps shown in Fig. 5, it can be observed that the location of channel frequency is almost similar for 5, 10 and 15 trials. Additionally, taking the average over all the subjects, channel Pz(25), Oz(30), O2(31), CPz(53), P2 (58), PO7(60), PO4(63), and PO8(64) were selected in top 10 channel lists for 5, 10 and 15 trails. In other words, for majority of cases, the aforementioned channels comprised the optimal channel subset with signiﬁcant information. The results obtained by the proposed algorithm are also in correspondence with the results of [24,25] for most informative channels. Hence, if it is not possible (or feasible) to apply channel selection methods for individual users, the channels from the occipital and parietal regions should be selected for more discriminative features in a P300-based BCI application.

434

6.

435 436 437 438 439 440 441 442 443 444 445 446 447 448 449 450 451 452 453 454 455 456 457 458 459 460 461 462 463 465 464

A novel method is proposed to optimize the performance of DS-based P300 speller systems. An SVM-based classiﬁcation method was used to detect the target characters on the dataset collected from 9 healthy subjects. Unlike most of the existing works on P300 spellers, which have only concentrated on improving the classiﬁcation accuracy, motivated by the signiﬁcance of reducing the number of channels for better user-convenience and improved ITR, the present work aims at optimizing the trade-off between the twin objectives of maximizing the classiﬁcation accuracy and minimizing the number of channels. The application of MOBDE algorithm is proposed to solve the aforementioned multi-objective problem. The proposed algorithm provided a set of pareto-optimal solutions with various conﬁguration of number of channels and corresponding accuracy values. With the pareto-front obtained after convergence of MOBDE, based on the requirements, the user gets a choice to select any optimal solution. The statistical analysis presented in the paper reﬂects that the proposed method not only increase the classiﬁcation accuracy, but also increases the ITR and user-convenience by reducing the number of channels. Further, the proposed system with only 16 channels achieved higher classiﬁcation accuracy than that achieved by a traditional method with all 64 channels. The concept of DS-based P300 speller presented in this paper was based on RC paradigm; exploration of differed types of paradigm such as CB paradigm is proposed as a future task on P300 spellers. It is also planned to concentrate on the application of other types of feature extraction techniques such as features from combined time-frequency domain and higher order statistics.

Conclusion

466 467

Acknowledgements

468 469 470 471 472 473 474 475

The authors would like to thank to the people associated with MILE Lab and Primates Research Lab of IISc, Bangalore, India, for providing us necessary support and facilities for recording the dataset. Authors acknowledge Department of Science and Technology, Government of India for ﬁnancial support vide Reference No. SR/CSRI/38/2015 (G) under Cognitive Science Research Initiative (CSRI) to carry out this work.

9

references

476

[1] Farwell LA, Donchin E. Talking off the top of your head: toward a mental prosthesis utilizing event-related brain potentials. Electroencephalogr Clin Neurophysiol 1988 Dec;70:510–23. [2] Akcakaya M, et al. Noninvasive brain-computer interfaces for augmentative and alternative communication. IEEE Rev Biomed Eng 2014;7:31–49. [3] Wolpaw JR, et al. Brain-computer interfaces for communication and control. Clin Neurophysiol 2002;113:767–91. [4] Allison BZ, Pineda JA. ERPs evoked by different matrix sizes: implications for a brain computer interface (BCI) system. Neural Systems and Rehabilitation Engineering IEEE Transactions on 2003;11:110–3. [5] Sellers EW, et al. A P300 event-related potential brain– computer interface (BCI): the effects of matrix size and inter stimulus interval on performance. Biol Psychol 2006;73:242–52. [6] Allison BZ, Pineda JA. Effects of SOA and ﬂash pattern manipulations on ERPs, performance, and preference: implications for a BCI system. Int J Psychophysiol 2006;59:127–40. [7] Salvaris M, Sepulveda F. Visual modiﬁcations on the P300 speller BCI paradigm. J Neural Eng 2009;6:046011. [8] Kaper M, et al. BCI Competition 2003–Data set IIb: support vector machines for the P300 speller paradigm. IEEE Trans Biomed Eng 2004;51:1073–6. [9] Hoffmann U, et al. An efﬁcient P300-based brain-computer interface for disabled subjects. J Neurosci Methods 2008;167:115–25. [10] Krusienski DJ, et al. A comparison of classiﬁcation techniques for the P300 Speller. J Neural Eng 2006;3:299–305. [11] Manyakov NV, et al. Comparison of classiﬁcation methods for P300 brain-computer interface on disabled subjects. Comput Intell Neurosci 2011;2011:519868. [12] Cecotti H. Toward shift invariant detection of event-related potentials in non-invasive brain-computer interface. Pattern Recognition Letters 2015;66:127–34. [13] Chaurasiya RK, et al. An efﬁcient P300 speller system for Brain-Computer Interface. Signal Processing, Computing and Control (ISPCC), 2015 International Conference on. 2015. pp. 57–62. [14] Bhatnagar V, et al. A modiﬁed approach to ensemble of SVM for P300 based brain computer interface. 2016 International Conference on Advances in Human Machine Interaction (HMI). 2016. pp. 1–5. [15] Chaurasiya RK, et al. Binary DE-Based Channel Selection and Weighted Ensemble of SVM Classiﬁcation for Novel Brain–Computer Interface Using Devanagari Script-Based P300 Speller Paradigm. Int J Human–Comput Interaction 2016;1–17. [16] Guan C, et al. High performance P300 speller for braincomputer interface. Biomedical Circuits and Systems, 2004 IEEE International Workshop on; 2004. pp. S3/5/INVS3/13-16. [17] Fazel-Rezai R, Abhari K. A region-based P300 speller for brain-computer interface. Canadian J Electr Comput Eng 2009;34:81–5. [18] Townsend G, et al. A novel P300-based brain–computer interface stimulus presentation paradigm: moving beyond rows and columns. Clin Neurophysiol 2010;121:1109–20. [19] Blankertz B, et al. The Berlin Brain–Computer Interface: accurate performance from ﬁrst-session in BCI-naive subjects. IEEE Trans Biomed Eng 2008 Oct;55:2452–62.

477 478 479 480 481 482 483 484 485 486 487 488 489 490 491 492 493 494 495 496 497 498 499 500 501 502 503 504 505 506 507 508 509 510 511 512 513 514 515 516 517 518 519 520 521 522 523 524 525 526 527 528 529 530 531 532 533 534 535 536 537 538 539 540 541

Please cite this article in press as: Chaurasiya RK, et al. Multi-objective binary DE algorithm for optimizing the performance of Devanagari script-based P300 speller. Biocybern Biomed Eng (2017), http://dx.doi.org/10.1016/j.bbe.2017.04.006

Q2

BBE 200 1–10

10 542 543 544 545 546 547 548 549 550 551 552 553 554 555 556 557 558 559 560 561 562 563 564 565 566 567 568

biocybernetics and biomedical engineering xxx (2017) xxx–xxx

[20] Xu M, et al. Channel selection based on phase measurement in P300-based brain-computer interface. PLoS ONE 2013;8:e60608. [21] Schröder M, et al. Robust EEG channel selection across subjects for brain-computer interfaces. EURASIP J Appl Signal Process 2005;2005:3103–12. [22] Colwell KA, et al. Channel selection methods for the P300 Speller. J Neurosci Methods 2014 Jul;232:6–15. [23] Speier W, et al. A method for optimizing EEG electrode number and conﬁguration for signal acquisition in P300 speller systems. Clin Neurophysiol 2015;126:1171–7. [24] Gao W, et al. Multi-ganglion ANN based feature learning with application to P300-BCI signal classiﬁcation. Biomed Signal Process Control 2015;18:127–37. [25] Jin J, et al. P300 Chinese input system based on Bayesian LDA. Biomed Tech (Berl) 2010 Feb;55:5–18. [26] Schalk G, et al. BCI2000: a general-purpose brain-computer interface (BCI) system. IEEE Trans Biomed Eng 2004;51: 1034–43. [27] Rakotomamonjy A, Guigue V. BCI competition III: dataset IIensemble of SVMs for BCI P300 speller. IEEE Trans Biomed Eng 2008 Mar;55:1147–54. [28] Kee C-Y, et al. Multi-objective genetic algorithm as channel selection method for P300 and motor imagery data set. Neurocomputing 2015;161:120–31. [29] Theodoridis S, Koutroumbas K. Pattern recognition. Fourth Edition. Academic Press; 2008.

[30] Storn R, Price K. Differential evolution-a simple and efﬁcient adaptive scheme for global optimization over continuous spaces vol. 3. ICSI Berkeley; 1995. [31] Das S, Suganthan PN. Differential evolution: a survey of the state-of-the-art. IEEE Trans Evolut Comput 2011;15: 4–31. [32] Vesterstrom J, Thomsen R. A comparative study of differential evolution, particle swarm optimization, and evolutionary algorithms on numerical benchmark problems. Evolutionary Computation, 2004. CEC2004. Congress on. 2004. pp. 1980–7. [33] Pampará G, et al. Binary differential evolution. Evolutionary Computation, 2006. CEC 2006. IEEE Congress on. 2006. pp. 1873–9. [34] Wang L, et al. A novel modiﬁed binary differential evolution algorithm and its applications. Neurocomputing 2012;98:55–75. [35] Chen Y, et al. A binary differential evolution algorithm learning from explored solutions. Neurocomputing 2015;149:1038–47. [36] Rivet B, et al. Impact of spatial ﬁlters during sensor selection in a visual P300 brain-computer interface. Brain Topogr 2012 Jan;25:55–63. [37] Demšar J. Statistical comparisons of classiﬁers over multiple data sets. J Mach Learn Res 2006;7:1–30. [38] Nemenyi P. Distribution-free multiple comparisons. Biometrics 1962;263.

569 570 571 572 573 574 575 576 577 578 579 580 581 582 583 584 585 586 587 588 589 590 591 592 593 594 595 596

Please cite this article in press as: Chaurasiya RK, et al. Multi-objective binary DE algorithm for optimizing the performance of Devanagari script-based P300 speller. Biocybern Biomed Eng (2017), http://dx.doi.org/10.1016/j.bbe.2017.04.006

Multi-objective binary DE algorithm for optimizing the performance of Devanagari script-based P300 speller

Multi-objective binary DE algorithm for optimizing the performance of Devanagari script-based P300 speller

Recommend Documents