Nephropathy forecasting in diabetic patients using a GA-based type-2 fuzzy regression model

Nephropathy forecasting in diabetic patients using a GA-based type-2 fuzzy regression model

BBE 186 1–9 biocybernetics and biomedical engineering xxx (2017) xxx–xxx Available online at www.sciencedirect.com ScienceDirect journal homepage: w...

1MB Sizes 0 Downloads 41 Views

BBE 186 1–9 biocybernetics and biomedical engineering xxx (2017) xxx–xxx

Available online at www.sciencedirect.com

ScienceDirect journal homepage: www.elsevier.com/locate/bbe 1 2 3

Original Research Article

Nephropathy forecasting in diabetic patients using a GA-based type-2 fuzzy regression model

4 5

6 7 8 9 10 11

Q1

Narges Shafaei Bajestani a,*, Ali Vahidian Kamyad a, Ensieh Nasli Esfahani b, Assef Zare c a

Department of Electrical Engineering, Tehran Science and Research Branch, Islamic Azad University, Tehran, Iran Diabetes Research Center, Endocrinology and Metabolism Clinical Sciences Institute, Tehran University of Medical Sciences, Tehran, Iran c Department of Electrical Engineering, Gonabad Branch, Islamic Azad University, Tehran, Iran b

article info

abstract

Article history:

Choosing a proper method to predict and timely prevent the complications of diabetes could

Received 9 May 2016

be considered a significant step toward optimally controlling the disease. Since in medical

Received in revised form

research only small sample sizes of data are available and medical data always includes high

17 January 2017

levels of uncertainty and ambiguity, a type-2 fuzzy regression model seems to be an

Accepted 19 January 2017

appropriate procedure for finding the relationship between outcome and explanatory

Available online xxx

variables in medical decision-making. In this paper, a new type-2 fuzzy regression model based on type-2 fuzzy time series concepts is used to forecast nephropathy in diabetic

Keywords:

patients. Results in two examples show model efficiency. The use of such models in diabetes

Type-2 fuzzy logic

clinics is proposed.

Fuzzy time series

© 2017 Published by Elsevier B.V. on behalf of Nałęcz Institute of Biocybernetics and

Fuzzy regression

Biomedical Engineering of the Polish Academy of Sciences.

Nephropathy Forecasting

14 12 13 15 16 17

18 19 20 21 22

1.

Introduction

Diabetes is among the most common and dangerous diseases of the modern world, causing huge loss of life and financial resources in many societies every year. It is a difficult and incurable, but controllable, disease. Properly controlling this disease prevents or postpones further complications.

Therefore, choosing a proper method to predict and timely prevent diabetes complications could be considered a significant step toward optimally controlling the disease. Among patients starting renal replacement therapy, diabetic nephropathy is the most prevalent cause of kidney disease, affecting 40% of type 1 and type 2 diabetic patients [1]. Due to the prevalence of nephropathy among diabetic patients, they are usually advised to go for check-ups several

* Corresponding author at: Department of Electrical Engineering, Tehran Science and Research Branch, Islamic Azad University, Tehran, Iran. E-mail addresses: [email protected], [email protected] (N.S. Bajestani). http://dx.doi.org/10.1016/j.bbe.2017.01.003 0208-5216/© 2017 Published by Elsevier B.V. on behalf of Nałęcz Institute of Biocybernetics and Biomedical Engineering of the Polish Academy of Sciences. Please cite this article in press as: Bajestani NS, et al. Nephropathy forecasting in diabetic patients using a GA-based type-2 fuzzy regression model. Biocybern Biomed Eng (2017), http://dx.doi.org/10.1016/j.bbe.2017.01.003

23 24 25 26 27 28 29 30

BBE 186 1–9

2 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 85 86 87 88 89 90

biocybernetics and biomedical engineering xxx (2017) xxx–xxx

times a year, yet some of these check-ups are not necessary. Unnecessary check-ups are omitted from the treatment procedure for cases in which kidney complications are predicted, while more check-ups are prescribed for patients who need them. The existing research implicates the above omissions and additions as well. To detect and follow impairment of renal function, a knowledge of glomerular filtration rate (GFR) is required to allow the correct dosage of drugs cleared by the kidneys and to use the potentially nephrotoxic radiographic contrast media [2]. In clinical research, to evaluate the severity of disease in patients, such linguistic terms like high, medium, low, etc. are used. These terms can be modeled as fuzzy sets. Moreover, the borderline between these fuzzy sets is not crisp, although they are measured in numerical scale. As an example, to define high blood glucose in diagnosing diabetic patients, a cut-off point of 140 mg/dl for two-hour plasma glucose throughout an oral glucose tolerance test is not the precise borderline. In other words, cases within the neighborhood of the borderline indicate a vague status regarding the disease [3]. For these reasons, fuzzy models have been used in medical and especially diabetes research [4–10]. Some researchers have focused on diabetic complications with fuzzy models. For instance, the study of Bin Mansour et al. presented an algorithm based on fuzzy morphology for the computer-assisted improvement of exudates in fundus images of the human retina for the diagnosis of diabetic retinopathy [11]. The study by Narasimhan et al. researched the risk classification of diabetic nephropathy using fuzzy logic [12]. Rama Devi et al. used the design methodology of a fuzzy knowledge-based system to predict the risk of diabetic nephropathy. In their paper, the manageable risk factors like hyperglycemia, insulin, ketones, lipids, obesity, blood pressure and protein/creatinine ratio were considered as independent parameters, and the stages of renal disorder became the output parameter [13]. Since in medical research only small sample sizes of data are available, and as previously mentioned, medical data always includes some levels of uncertainty and ambiguity, some researchers have used a kind of fuzzy regression model to find the relationship between outcome and explanatory variables in medical decision-making. In 2005, Bolotin modeled two examples with fuzzy regression: one associated the quality of life with BMI categories, and the other modeled the analysis of high hemoglobin HbA1c levels among diabetic patients [14]. Pourahmad et al. predicted the existence of diabetes with respect to participants' sex, age, BMI, family history, and two-hour plasma glucose [3]. There are high levels of uncertainty in diabetes data (uncertainty in measurement device, doctor decision, patient body, patient life style, etc.) which encourage the use of type-2 fuzzy logic which can handle high levels of uncertainty. Some researchers have used type-2 fuzzy regressions as the modeling structure. Yicheng Wei and Junzo Watada built a type-2 fuzzy qualitative regression model. They implied that they used a general type-2 fuzzy number, while it seems that they used an interval type-2 fuzzy regression model. Poleshchuk and Komarov presented a regression model for

interval type-2 fuzzy sets based on the least squares estimation technique [15]. Hosseinzadeh et al. presented a weighted goal programming approach to fuzzy linear regression with crisp inputs and type-2 fuzzy outputs (WGP) [16]. This model, however, only tried to close the membership functions of observed and estimated responses by closing some of their parameters. It seems that none of the studies mentioned above could adequately model the type-2 fuzzy regression; instead, they reduced their models to only some points of type-2 fuzzy numbers. Fuzzy time series models have been applied to real life phenomena. Different fuzzy methods have been proposed to solve fuzzy time series problems. Watada applied fuzzy regression to solve the problems of fuzzy time series [17]. Song and Chissom proposed novel definitions for fuzzy time series [18]. Chen improved Song and Chissom's model [19]. Some other researchers also improved fuzzy time series models [20–22]. Fuzzy time series models have been proposed for various applications, such as enrollment, stock indexes, load forecasting, tourism demand forecasting, etc. [23–27]. Huarng and Yu proposed a framework for a type-2 fuzzy time series model to improve forecasting results [28]. Shafaei Bajestani et al. have optimized Huarng and Yu's model to forecast the Taiwan Stock Index based on optimized highorder type-2 fuzzy time series [29,30]. The GA algorithm was also used in fuzzy regression and fuzzy time series demands for optimizing models and their results [22,31–33]. The incorporation of fuzzy regression models and type-2 fuzzy time series models is named T2FRFTS, and using the benefits of both allowed the presentation in this research of a different viewpoint to type-2 fuzzy regression models that could predict GFR efficiently in two diabetic patients. In this study, our aim is to predict nephropathy due to priori of diabetic patient. It is expected that past data of GFR models could predict future GFR with good accuracy. In fact, the condition of kidneys in future referrals could be predicted using the proposed model. GFR (t) and GFR (t + 1) are the inputs and outputs of the model, respectively. Uncertainties of measurement devices make the data recorded in the clinics have a high level of ambiguity and uncertainty. These uncertainties confirm the use of type-2 fuzzy for modeling this data. GFR has three common formulas. These three formulas are another reason for using type-2 fuzzy sets to consider the effects of all three formulas in proper ways. It is expected that the proposed model, considering the effect of uncertainty, will be able to present a good prediction of GFR to predict nephropathy in a diabetic patient. Our motivation and final objective is to offer software which can predict diabetic complications to specialists and diabetes clinics as a large number of the world population suffers from diabetes. This prediction can help both the patients and the doctors to prevent unnecessary experiments and check-ups. Moreover, doctors can make better decisions for the type and dosage of medication they prescribe to postpone retinopathy among diabetic patients. The rest of current paper is organized as follows. The studies relevant to this research are reviewed in Section 2. In Section 3, the proposed model is described using two examples. Finally, Section 4 presents the conclusion.

Please cite this article in press as: Bajestani NS, et al. Nephropathy forecasting in diabetic patients using a GA-based type-2 fuzzy regression model. Biocybern Biomed Eng (2017), http://dx.doi.org/10.1016/j.bbe.2017.01.003

91 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150

BBE 186 1–9

3

biocybernetics and biomedical engineering xxx (2017) xxx–xxx

151

2.

are minimized. This can be achieved by using the following linear program:

Review

152 153 154 155 156

As mentioned in the previous section, the proposed model was constructed based on some type-2 fuzzy time series and fuzzy regression concepts. Therefore, in this section type-1 fuzzy regression and type-2 fuzzy time series concepts are presented.

157

2.1.

158 159 160 161

At first, a formulation for type-1 fuzzy regression that proposed by Tanka is presented in this section. This model is the basis of most of type-1 fuzzy regression model. The model is assumed to be as below:

162

~ 1 xi1 þ A ~ 2 xi2 þ    þ A ~ n xin ~i ¼ A Y

163 164 165 166 167

~ j ð j ¼ 1; 2; :::; nÞ are the fuzzy coefficients in the form of where A (aj, cj) where aj is the middle and cj is the spread. The membership functions are assumed triangular membership as:   8 xaj  < 1 ; aj cj xaj þ cj Ai ðxÞ ¼ (2) cj : 0 otherwise

168

169 170 171 172

173 174 175 176 177 178 179 180 181

i¼1

j¼0 k X  ðajð1HÞ: cj Þxij þdþ iL þ diL ¼ yi ð1HÞei  þ  dþ iU ; diU ; diL ; diL  0 i ¼ 1; :::; n

(3)

~ j that minimize the Fuzzy regression model seeks to find A spread of the fuzzy estimated output for all data sets subject to a H-cut of observed output yi ¼ ðyi ; ei Þ is included in estimated output. ~ j which are the More specially, problem is to find out A solution of following linear programming problem: m X n n X X c0 þ cj xij min J ¼ subject to :

i¼1 j¼1

j¼1

n X

X

j¼1

j

aj xij þ ð1HÞ

cj jxij j  yi þ ð1HÞei

n X X  aj xij þ ð1HÞ cj jxij j  yi þ ð1HÞei j¼1

i ¼ 1; :::; n

j¼0

(1)

From (1) and (2), we get:  P  8   n > aj xij  y > > < 1 P j¼1 x 6¼ 0 n mY ðyÞ ¼ j¼1 cj xij j > > x ¼ 0; y ¼ 0 > : 1 0 x ¼ 0; y 6¼ 0

200

n X  þ  ðdþ iU þ diU þ diL þ diL Þ

k X  subject to : ðaj þð1HÞ: cj Þxij þdþ iU þdiU ¼ yi þð1HÞei i ¼ 1; :::; n

Type-1 fuzzy regression

a;c

(4)

j

c0 i ¼ 1; :::; m: 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197

Minimize

198 199

One of the proper models in type-1 fuzzy regression that shows its ability is Hojati's model used in this research. Hojati's model is briefly explained here. For more details please read the reference [34] which contains all necessary definition and literature review. ~ j ¼ ðaj ; cj Þ in In Hojati's model to find the fuzzy coefficients A ~ ~ ~ 2 xi2 þ    þ the fuzzy linear regression model, Y i ¼ A1 xi1 þ A ~ n xin , the following simple goal programming-like approach is A proposed. The observed value is assumed to be a symmetric triangular fuzzy number with center yi and half-width ei. In this model the fuzzy regression coefficients are chosen so that the total deviation of the upper points of H-certain predicted and associated observed intervals and the deviation of the lower points of H-certain predicted and associated observed intervals

aj ¼ free;

cj  0 j ¼ 0; :::; k (5)

where jdþ d iU j refers the distance between the upper point of iU the H-certain predicted interval and the upper point of the H certain observed interval, and jdþ iL diL j presents the distance between the lower point of H-certain predicted interval and the lower point of the H-certain observed interval. Minimizing the summation of these two distances is objective function [34].

201 202 203 204 205 206 207 208

2.2.

209

Required concepts

To understand the proposed model, it is necessary to know some definitions that are presented here.

210 211

Definition 1. (Type-2 fuzzy set) type-2 fuzzy set is an extension of type-1 fuzzy set that there can be a fuzzy set for any degree of membership. For example, there is a triangular fuzzy set (0.4, 0.5, 0.6) as a degree of membership for x = 1 in Fig. 1.

212 213 214 215

Definition 2. (Fuzzy time series) Y(t) (t = . . ., 0, 1, 2,. . .), is a subset of R. Let Y(t) be the universe of discourse defined by fuzzy set fi(t). If FðtÞ consists of fi(t) (i = 1,2,. . .), FðtÞ is defined as a fuzzy time series on Y(t).

216 217 218 219 220

Definition 3. Suppose F(t) is caused by F(t  1) only. The relationship between F(t  1) = Ai and F(t) = Aj can be represented by Ai ! Aj, if F(t  1) = Ai (a fuzzy set) and F(t) = Aj, where Ai and Aj are the LHS (left-hand side) and the RHS (right-hand side) of the fuzzy logic relationships (FLR), respectively [29].

221 222 223 224 225 226

Definition 4. Assume there are the subsequent FLRs:

227 228 229

Ai ! Aj1 Ai ! Aj1 .. . Ai ! Ajl

(6)

Fig. 1 – Type-2 fuzzy set.

Please cite this article in press as: Bajestani NS, et al. Nephropathy forecasting in diabetic patients using a GA-based type-2 fuzzy regression model. Biocybern Biomed Eng (2017), http://dx.doi.org/10.1016/j.bbe.2017.01.003

230

BBE 186 1–9

4

biocybernetics and biomedical engineering xxx (2017) xxx–xxx

233 231 232 234

Following Chen's model [19], these FLRs can be set into a fuzzy logic relationship group (FLRG) as: Ai ! Aj1, Aj2, . . ., Ajl

in the population that are selected for deletion. For more details of the GA algorithm please read reference [36].

235 236 237

Definition 5. (Type-2 fuzzy time series) in definition 1, if fi(t) is type-2 fuzzy set, F(t) is defined as type-2 fuzzy time series.

3.

238 239 240 241 242 243 244 245 246 247

The proposed type-2 fuzzy regression model utilizes the fuzzy relationships established by a type-1 fuzzy regression model based in type-1 and type-2 observations. Operators are used to include or screen out fuzzy relationships obtained from types 1 and 2 observations. Type-2 forecasts are then calculated from these fuzzy relationships. Two operators were defined. The first involves including and the other screening out fuzzy relationships. Hence, we propose union and intersection operators in type 2 model accordingly.

248 249

Definition 6. The relationships between two FLR or FLRGs are obtained by defining operators of union (_) and intersection (^):

250 251

_ ðLHSd ; LHSe Þ ¼ RHSd [ RHSe ^ ðLHSd ; LHSe Þ ¼ RHSd \ RHSe

(7)

252 253 254 255 256

where [ and \ refer the union the intersection operator for set theory, respectively. The LHSd indicates the LHS of an FLRG and RHSd indicates RHS of an FLRG.

257 259 258 260

Definition7. The definition of operators of union and intersection for multiple FLRGs, _m and ^m are as follows: _ m ðLHSc ; LHSd ; LHSe ; . . .Þ ¼ _ . . .ð _ ð _ ðLHSc ; LHSd Þ; LHSe Þ; . . .Þ ^ m ðLHSc ; LHSd ; LHSe ; . . .Þ ¼ ^ . . .ð ^ ð ^ ðLHSc ; LHSd Þ; LHSe Þ; . . .Þ (8)

262 261 263 265 264 266 267 268 269 270 271 272 273 274 275 276 277 278 279 280 281 282 283 284 285 286 287 288 289 290

_ m ðLHSc ; LHSd ; LHSe ; . . .Þ ¼ ðRHSc [ RHSd [ RHSe [ . . .Þ ^ m ðLHSc ; LHSd ; LHSe ; . . .Þ ¼ ðRHSc \ RHSd \ RHSe \ . . .Þ

(9)

_ m ðLHSc ; LHSd ; LHSe ; . . .Þ ¼ ;, then Definition 8. (a) If _ m ðLHSc ; LHSd ; LHSe ; . . .Þ ¼ LHSx , where LHSx is acquired from the FLRG resulted through observations of type-1. (b) If ^ m ðLHSc ; LHSd ; LHSe ; . . .Þ ¼ ;, then ^ m ðLHSc ; LHSd ; LHSe ; . . .Þ ¼ LHSx , where LHSx, is obtained from the FLRG resulted through observations of type-1. For more details please read Refs. [28,29,35] that are type-2 fuzzy time series model. The GA algorithm, which is a powerful random global search technique to handle optimization problems and can help find the globally optimal solution over a domain, was used for optimizing in the proposed model. GA consists of gradient free parallel-optimization algorithms that use a performance criterion for evaluation and a population of possible solutions to search for a global optimum. These structured random search techniques are capable of handling complex and irregular solution spaces. GAs is inspired by the biological process of Darwinian evolution where selection, mutation, and crossover play a major role. Good solutions are selected and manipulated to achieve new and possibly better solutions. The manipulation is done by the genetic operators that work on the chromosomes in which the parameters of possible solutions are encoded. In each generation of the GA, the new solutions replace the solutions

293

Proposed type-2 fuzzy regression model

In this section, first the proposed model steps have been listed, and then the model is explained in two examples. Fig. 2 shows a flowchart of this model. The proposed algorithm for the type2 fuzzy regression models is listed below. 1. Selecting a model of type-1 fuzzy regression (Hojati's model). 2. Choosing observations of type-1. 3. Selecting observations of type-2. 4. Applying the model of type-1 fuzzy regression to observations of type-1 and type-2. 5. Using operators to the FLRGs for each of observations. 6. Defuzzifying the predictions. 7. Computing predictions for model of type-2. 8. Evaluating the model efficiency. Two examples are used to describe this model. GFR(t) is the input of model and GFR(t + 1) is the output of model.

3.1.

291 292

GFR forecasting

In this research about 500 files of diabetic patients were studied and checked. 106 patients that had attained some level of nephropathy were chosen for this research. The clinical records of these patients were followed for about 5 years. Of the 106 patients, 47 who regularly went to the diabetic clinic were used for this research as GFR values must be recorded orderly. The proposed model in more than 90% of patients presents the best results compared to other models. Two patients that have the best results with the proposed model were reported in this paper as examples. In this study, our aim is to predict nephropathy due to priori of diabetic patient. It is expected that using past GFR data models could predict future GFR with good accuracy. In fact, the condition of kidney could be predicted using proposed model. GFR(t) and GFR(t + 1) is input and output of model respectively. In the proposed model, 70% of the data was used as train data and 30% as the test value. GFR has three common formulas. As mentioned previously, these three formulas are one reason for using type-2 fuzzy sets to consider the effects of all three formulas.

294 295 296 297 298 300 299 301 300 303 302 301 305 304 302 307 306 303 308 304 310 309 305 312 311 306 314 313 307 316 315 308 309 317 310 318 311 319 312 320 313 314 315 321 322 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337 338 339 340

  Weight GFR1 ¼ ð140AgeÞ 72 pcr

(10)

GFR2 ¼ 175ðpcrÞ1:154 ðAgeÞ:203 ð:742 if femaleÞ

(11)

344 346 345

(12)

347

Chronic kidney disease (CKD) stages based on GFR level are shown in Table 1. In proposed model, type-1 and type-2 observations were required. The mean of three formulas in every t (every referral

348 349 350 351 352 353

GFR3 ¼

ð0:41HeightÞ pcr

341 343 342

Please cite this article in press as: Bajestani NS, et al. Nephropathy forecasting in diabetic patients using a GA-based type-2 fuzzy regression model. Biocybern Biomed Eng (2017), http://dx.doi.org/10.1016/j.bbe.2017.01.003

BBE 186 1–9 biocybernetics and biomedical engineering xxx (2017) xxx–xxx

5

Fig. 2 – Flowchart of T2FRFTS model.

354 355 356 357 358 359 360 361 362 363 364 365 366 367 368 369 370

of patients) was used as a type-1 observation because the mean of three formulas is a good choice for GFR prediction if type-1 fuzzy regression is used. For the type-2 fuzzy model, type-2 observations were required. Minimum and maximum of GFR1, GFR2, GFR3 were chosen as the type-2 observation. These type-1 and type-2 observations make type 2 fuzzy model in steps that are explained in the following section. Proposed model steps have been applied for GFR forecasting. Step 1: Choosing a model of type-1 fuzzy regression (Hojati's model). In this study, Hojati's model was chosen. It provides relatively simpler calculations and better forecasting results. Step 2: Selecting a variable as observation of type-1. The mean of three GFR equation results has been chosen as the main value in the type-1 fuzzy model. This value was selected as type-1 observation.

Table 1 – CKD Stage based on GFR level. CKD Stage Stage Stage Stage Stage Stage

1 2 3 4 5

GFR level (mL/min/1.73 m2) ≥90 60–89 30–59 15–29 <15

Step 3. Selecting observations of type-2 In three equations, the minimum and maximum outcomes are selected as observations of type-2. For example, in the fourth value, GFR1 = 56.07, GFR2 = 71.65, GFR3 = 82.00; the mean of them is 69.91, and that has been chosen as the type-1 observation. Low value is 56.07 and high value is 82. Step 4: Applying the model of type-1 fuzzy regression to the observations of type-1 and type-2. The type-1 fuzzy regression model can then be applied to type-1 and type-2 observations and the forecasts can be obtained. For this goal, these steps should be followed: Step 4-1: Defining fuzzy sets for observations With reference to the authors' prior model [29], triangular fuzzy sets with indeterminate supports were used. This means that triangular legs were not fixed, but were determined after the solution of an optimization problem. The GA algorithm, a powerful random global search technique to handle optimization problems and which can help find the globally optimal solution over a domain, was used for optimizing fuzzy sets. The GA tool of MATLAB was used to find the optimized fuzzy sets support and parameter / that is defined in the following steps. Initial fuzzy sets supports, [0 10], [10 20],. . ., [90 100], and /=0.5 as initial population, uniform mutation function, scattered crossover function, and stochastic uniform selection function are used in the GA tool setting. Stopping criteria was number of generations 100. Other parameters

Please cite this article in press as: Bajestani NS, et al. Nephropathy forecasting in diabetic patients using a GA-based type-2 fuzzy regression model. Biocybern Biomed Eng (2017), http://dx.doi.org/10.1016/j.bbe.2017.01.003

371 372 373 374 375 376 377 378 379 380 381 382 383 384 385 386 387 388 389 390 391 392 393 394 395 396 397

BBE 186 1–9

6

biocybernetics and biomedical engineering xxx (2017) xxx–xxx

Table 2 – Parameters of GA algorithm.

Table 3 – GFR Forecast after ^m and _m.

Parameter

Value

Population size Mutation rate Crossover rate generations

40 0.01 0.8 100

No 4

GFR1 GFR2 GFR3

Forecast after ^m

Forecast after _m

A8

A6, A8

A6 ! A6 A8 ! A8 A9 ! A8

Then, ^m and _m are both used to the predictions of all date. In Table 3, ^m and _m are applied to all predictions, including those of observation of type-1 and type-2. Step 6. Defuzzifying the predictions The defuzzyfied prediction for ^m is equivalent to:

420 421 422 423 424 425

deffuzzificationintersection ðno:4Þ ¼ 74:79 Correspondingly, the defuzzyfied prediction for _m is as follows: Fig. 3 – Fuzzy sets after optimization with GA.

398 399 400 401 402 403 404 405 406 407 408 409 410 411 412 413 414 415 416 417 418 419

were used show in Table 2. After no changes in parameters, minimization was robust between about 30 runs. Min, max, mean and variance of optimal values were 3.9, 5.1, 4.38 and 0.2 respectively for patient 1 and 1.33, 2.35, 1.77 and 0.14 for patient 2. Optimized fuzzy set supports found with GA in patient 1 are (Fig. 3): [0 10.10], [0.04 20.00], [19.89 32.16], [27.96 40.00], [39.98 51.75], [46.85 60.00], [59.99 70.30], [68.56 80.01], [79.62 90.07], [90.00 100.00]. Step 4-2: Fuzzifying the observations Different GFR values can be fuzzified into the corresponding fuzzy sets. All observation of type-1 and type-2 are fuzzified in this step. For example, the GFR for the fourth value was 71.65, which was fuzzified to A8. Step 4-3: Applying model of type-1 fuzzy regression to observations of type-1 and type-2 and acquiring predictions. Hojati's model was applied to the main, high, and low values that were fuzzified before. Fig. 4 shows the results of the type-1 fuzzy regression model for type-1 and type-2 observations (second column of Table 3). Step 5: Using operators to the FLRGs for each of observations.

426 427 428 429 430

defuzzificationunion ð11=8Þ ¼ ð53:42 þ 74:79Þ=2 ¼ 64:11 Step 7. Obtaining predictions for model of type-2 / was used as a coefficient in calculating type-2 forecasting [29]:

431 432 433 434 435 436

defuzzificationðtÞ ¼ a ðdefuzzificationintersection ðtÞ þ ð1aÞðdefuzzificationunion ðtÞÞ

(13)

where a can be achieved through solving the optimization problem. This means that there is another parameter in the optimization problem. The best value of a is 0.126. Thus, it is used for calculating type-2 forecasting:

437 438 439 440 441 442 443

defuzzificationðtÞ ¼ 0:874ðdefuzzificationintersection ðtÞÞ þ 0:126ðdefuzzificationunion ðtÞÞ

(14)

Step 8: Evaluating the model efficiency. The assessment of the efficiency of the model is performed by applying the RMSE:

RMSE ¼

vffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi uX n u u ðactual valueðtÞforecasted valueðtÞÞ2 t i¼1

n

444 445 446 447 448 449

(15)

Fig. 4 – Apply type-1 fuzzy regression model to type-1 and type-2 values in patient 1. Please cite this article in press as: Bajestani NS, et al. Nephropathy forecasting in diabetic patients using a GA-based type-2 fuzzy regression model. Biocybern Biomed Eng (2017), http://dx.doi.org/10.1016/j.bbe.2017.01.003

BBE 186 1–9

7

biocybernetics and biomedical engineering xxx (2017) xxx–xxx

Fig. 5 – GFR forecasting results in patient 1. Fig. 7 – GFR forecasting results in patient 2.

Table 4 – Forecasts of the GFR in patient 1. No.

Actual

Huarng and Yu

WGP

Shafaei

T2FRFTS

.. . 37.28 11.28 9.46 8.16 7.48 .. .

.. . 33.3 20 15 15 15 .. .

.. . 31.5 18.3 11.2 10.3 10.3 .. .

.. . 35 15 7.87 7.87 7.87

.. . 33.98 11.02 10.02 10.02 10.02

.. . 8 9 10 11 12 .. .

Table 6 – Comparison of RMSE of GFR forecasting in patient 2. Model Train-RMSE Test-RMSE

Hojati

Huarng and Yu

WGP

Shafaei

T2FRFTS

7.32 8.88

4.12 5.25

2.88 5.17

2.50 4.53

1.33 2.65

Table 5 – Comparison of RMSE of GFR forecasting in patient 1. Model Train-RMSE Test-RMSE

452 450 451 453 454 455

Hojati

Huarng and Yu

WGP

Shafaei

T2FRFTS

15.10 24.42

13.29 23.50

13.01 23.11

8.97 15.57

3.91 6.85

Figs. 4 and 5 and Tables 3 and 4 show the results for patient 1. Figs. 6 and 7 show the results for patient 2. Tables 5 and 6 show a comparison of the RMSE of several models. The results indicate that the suggested model was able to perform

predictions relatively more accurately in two patients. The RMSE in both train and test values is better than other models. Such an accurate prediction is very helpful for diabetic patients and diabetes clinics. As mentioned before, 47 patients were used in this research. Min, max, mean and variance of RMSEs between 47 patients were 1.33, 20.20, 8.67, and 19.86 respectively. In more than 90% of patient proposed model has better performance and minimum RMSE. The two patients that were reported had the best results with proposed model.

Fig. 6 – Apply type-1 fuzzy regression model to type-1 and type-2 values in patient 1. Please cite this article in press as: Bajestani NS, et al. Nephropathy forecasting in diabetic patients using a GA-based type-2 fuzzy regression model. Biocybern Biomed Eng (2017), http://dx.doi.org/10.1016/j.bbe.2017.01.003

456 457 458 459 460 461 462 463 464 465

BBE 186 1–9

8

biocybernetics and biomedical engineering xxx (2017) xxx–xxx

466

4.

467 468 469 470 471 472 473 474 475 476 477 478 479 480 481 482 483

This study presents a GFR forecasting with different perspectives to type-2 fuzzy regression models based on type-2 fuzzy time series concepts (T2FRFTS) to forecast nephropathy in diabetic patients. GFR forecasting clinically helps patients and society decrease the cost of control and treatment by optimizing the number of check-ups. Unnecessary regular check-ups worry patients, even though they may not suffer from the illness for years to come. In addition, GFR forecasting can help doctors make better decisions regarding the type and dosage of medication needed to postpone retinopathy among diabetic patients. The proposed type-2 fuzzy regression model avoids additional complexity and helps model more uncertainty in diabetic values. Therefore, the authors propose the use of such models in diabetes clinics. From the empirical analysis, it can be inferred that, in addition to its intelligible steps, this model can adequately forecast GFR.

484

Conflict of interest

485

None declared.

486

references

487 488 489 490 491 492 493 494 495 496 497 498 499 500 501 502 503 504 505 506 507 508 509 510 511 512 513 514 515 516 517 518 519 520

Calculation

[1] Study PD, Americans A, Americans N, Asso- AD. Diabetic nephropathy: diagnosis, prevention, and treatment. Diabetes Care 2005;28:176–88. [2] Grubb A. Non-invasive estimation of glomerular filtration rate (GFR). The Lund model: simultaneous use of cystatin C- and creatinine-based GFR-prediction equations, clinical data and an internal quality check. Scand J Clin Lab Invest 2010;70:65–70. http://dx.doi.org/10.3109/00365511003642535 [3] Pourahmad S, S.M.T. ayatollahi SMT. Fuzzy logistic regression: a new possibilistic model and its applivation in clinical vague status. Iran J Fuzzy Syst 2011;8:1–17. [4] Stavros Lekkas LM. Evolving fuzzy medical diagnosis of Pima Indians diabetes and of dermatological diseases. Artif Intell Med 2010;50:117–26. [5] Cosenza B. Off-line control of the postprandial glycemia in type 1 diabetes patients by a fuzzy logic decision support. Expert Syst Appl 2012;39:10693–9. http://dx.doi.org/10.1016/j.eswa.2012.02.198 [6] Lee C-S, Wang M-H. A fuzzy expert system for diabetes decision support application. IEEE Trans Syst Man Cybern Part B 2011;41:139–53. http://dx.doi.org/10.1109/TSMCB.2010.2048899 [7] Doostparast Torshizi A, Fazel Zarandi MH. Alpha-plane based automatic general type-2 fuzzy clustering based on simulated annealing metaheuristic algorithm for analyzing gene expression data. Comput Biol Med 2014;64:347–59. http://dx.doi.org/10.1016/j.compbiomed.2014.06.017 [8] Dazzi D, Taddei F, Gavarini A, Uggeri E, Negro R, Pezzarossa A. The control of blood glucose in the critical diabetic patient: a neuro-fuzzy method. J Diabetes Compl 2001;15:80–7. http://dx.doi.org/10.1016/S1056-8727(00)00137-9

[9] Mahmoodian H, Ebrahimian L. Using support vector regression in gene selection and fuzzy rule generation for relapse time prediction of breast cancer. Biocybern Biomed Eng 2016;1–7. http://dx.doi.org/10.1016/j.bbe.2016.03.003 [10] Paramasivam V, Yee TS, Dhillon SK, Sidhu AS. A methodological review of data mining techniques in predictive medicine: an application in hemodynamic prediction for abdominal aortic aneurysm disease. Biocybern Biomed Eng 2014;34:139–45. http://dx.doi.org/10.1016/j.bbe.2014.03.003 [11] Mansoof A Bin, Khan Z, Khan A, Khan SA. Enhancement of exudates for the diagnosis of diabetic retinopathy. IEEE Int Multitopic Conference. 2008. pp. 128–31. http://dx.doi.org/10.1109/INMIC.2008.4777722 [12] Narasimhan B, Malathi A. Fuzzy logic system for risk-level classification of diabetic nephropathy. Green Comput. Commun. Electr. Eng. (ICGCCEE), 2014 Int. Conference. 2014. pp. 1–4. [13] RD E, Nagaveni N. Design methodology of a fuzzy knowledgebase system to predict the risk of diabetic nephropathy. Int J Comput Sci 2010;7:239–47. [14] Bolotin A. Fuzzification of linear regression models with indicator variables in medical decision making. IEEE Int Conf Comput Intell Model Control Autom; 2005. [15] Poleshchuk O, Komarov E. A fuzzy linear regression model for interval type-2 fuzzy sets. Annu Meet North Am Fuzzy Inf Process Soc. 2012. pp. 1–5. http://dx.doi.org/10.1109/NAFIPS.2012.6290970 [16] Hosseinzadeh E, Hassanpour H, Arefi M. A weighted goal programming approach to fuzzy linear regression with crisp inputs and type-2 fuzzy outputs. Soft Comput 2014;19:1143–51. http://dx.doi.org/10.1007/s00500-014-1328-3 [17] Watada J. Fuzzy time series analysis and forecasting of sales volume. Fuzzy Regres. Anal.. Pmnitec Press; 1992. p. 211–27. [18] Song Q, Chissom BS. Fuzzy time series and its models. Fuzzy Sets Syst 1993;54:269–77. [19] Chen S. Forecasting enrollments based on fuzzy time series. Fuzzy Sets Syst 1996;81:311–9. [20] Cai Q, Zhang D, Zheng W, Leung SCH. A new fuzzy time series forecasting model combined with ant colony optimization and auto-regression. Knowledge-Based Syst 2015;74:61–8. http://dx.doi.org/10.1016/j.knosys.2014.11.003 [21] Askari S, Montazerin N. A high-order multi-variable fuzzy time series forecasting algorithm based on fuzzy clustering. Expert Syst Appl 2015;42:2121–35. http://dx.doi.org/10.1016/j.eswa.2014.09.036 [22] Bas E, Uslu VR, Yolcu U, Egrioglu E. A modified genetic algorithm for forecasting fuzzy time series. Appl Intell 2014;41:453–63. http://dx.doi.org/10.1007/s10489-014-0529-x [23] Sadaei HJ, Enayatifar R, Abdullah AH, Gani A. Short-term load forecasting using a hybrid model with a refined exponentially weighted fuzzy time series and an improved harmony search. Int J Electr Power Energy Syst 2014;62:118–29. http://dx.doi.org/10.1016/j.ijepes.2014.04.026 [24] Tsaur R, Kuo T. Tourism demand forecasting using a novel high-precision fuzzy time series model. Int J Innov Comput Inf Control 2014;10:695–701. [25] Egrioglu E. PSO-based high order time invariant fuzzy time series method: application to stock exchange data. Econ Model 2014;38:633–9. [26] Sun B, Guo H, Reza Karimi H, Ge Y, Xiong S. Prediction of stock index futures prices based on fuzzy sets and multivariate fuzzy time series.

Please cite this article in press as: Bajestani NS, et al. Nephropathy forecasting in diabetic patients using a GA-based type-2 fuzzy regression model. Biocybern Biomed Eng (2017), http://dx.doi.org/10.1016/j.bbe.2017.01.003

521 522 523 524 525 526 527 528 529 530 531 532 533 534 535 536 537 538 539 540 541 542 543 544 545 546 547 548 549 550 551 552 553 554 555 556 557 558 559 560 561 562 563 564 565 566 567 568 569 570 571 572 573 574 575 576 577 578 579 580 581 582 583 584 585 586 587 588 589

BBE 186 1–9 biocybernetics and biomedical engineering xxx (2017) xxx–xxx

589 590 591 592 593 594 595 596 597 598 599 600 601 602 603 604 605 606 607 608

[27]

[28]

[29]

[30] [31]

[32]

Neurocomputing 2015;151:1528–36. http://dx.doi.org/10.1016/j.neucom.2014.09.018 Li S, Cheng Y. Deterministic fuzzy time series model for forecasting enrollments. Comput Math with Appl 2007;53:1904–20. http://dx.doi.org/10.1016/j.camwa.2006.03.036 Huarng K, Yu H. A Type 2 fuzzy time series model for stock index forecasting. Physica A 2005;353:445–62. http://dx.doi.org/10.1016/j.physa.2004.11.070 Bajestani NS, Zare A. Forecasting TAIEX using improved type 2 fuzzy time series. Expert Syst Appl 2011;38:5816–21. http://dx.doi.org/10.1016/j.eswa.2010.10.049 Bajestani NS. Application of optimized type 2 fuzzy time series for forecasting Taiwan Stock Index n.d. Aliev RA, Fazlollahi B, Vahidov R. Genetic algorithmsbased fuzzy regression analysis. Soft Comput A Fusion Found Methodol Appl 2002;6:470–5. http://dx.doi.org/10.1007/s00500-002-0163-0 Ling SSH, Nguyen HT. Genetic-algorithm-based multiple regression with fuzzy inference system for

[33]

[34]

[35]

[36]

detection of nocturnal hypoglycemic episodes. IEEE Trans Inf Technol Biomed 2011;15:308–15. http://dx.doi.org/10.1109/TITB.2010.2103953 Aladag CH, Yolcu U, Egrioglu E, Bas E. Fuzzy lagged variable selection in fuzzy time series with genetic algorithms. Appl Soft Comput J 2014;22:465–73. http://dx.doi.org/10.1016/j.asoc.2014.03.028 Hojati M, Bector CR, Smimou K. A simple method for computation of fuzzy linear regression. Eur J Oper Res 2005;166:172–84. http://dx.doi.org/10.1016/j.ejor.2004.01.039 Narges Shafaei Bajestani, Ali Vahidian Kamyad AZ. An interval type-2 fuzzy regression model with crisp inputs and type-2 fuzzy outputs for TAIEX forecasting. IEEE Int Conf Inf Autom China; 2016. Michalewicz Z. Genetic algorithms+data structures=evolution programs. New York: SpringerVerlag; 1996.

9 609 610 611 612 613 614 615 616 617 618 619 620 621 622 623 624 625 626

627

Please cite this article in press as: Bajestani NS, et al. Nephropathy forecasting in diabetic patients using a GA-based type-2 fuzzy regression model. Biocybern Biomed Eng (2017), http://dx.doi.org/10.1016/j.bbe.2017.01.003