Wind speed forecasting approach based on Singular Spectrum Analysis and Adaptive Neuro Fuzzy Inference System

Wind speed forecasting approach based on Singular Spectrum Analysis and Adaptive Neuro Fuzzy Inference System

Accepted Manuscript Wind speed forecasting approach based on Singular Spectrum Analysis and Adaptive Neuro Fuzzy Inference System Sinvaldo Rodrigues M...

1MB Sizes 1 Downloads 32 Views

Accepted Manuscript Wind speed forecasting approach based on Singular Spectrum Analysis and Adaptive Neuro Fuzzy Inference System Sinvaldo Rodrigues Moreno, Leandro dos Santos Coelho PII:

S0960-1481(17)31190-4

DOI:

10.1016/j.renene.2017.11.089

Reference:

RENE 9491

To appear in:

Renewable Energy

Received Date: 6 August 2016 Revised Date:

13 October 2017

Accepted Date: 29 November 2017

Please cite this article as: Moreno SR, dos Santos Coelho L, Wind speed forecasting approach based on Singular Spectrum Analysis and Adaptive Neuro Fuzzy Inference System, Renewable Energy (2018), doi: 10.1016/j.renene.2017.11.089. This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

ACCEPTED MANUSCRIPT

RI PT

Sinvaldo Rodrigues Moreno1 and Leandro dos Santos Coelho1,2*

1 Department of Electrical Engineering, Federal University of Parana Zip code 81531-980, Curitiba, PR, Brazil, 2 Industrial and Systems Engineering Graduate Program (PPGEPS)

SC

Pontifical Catholic University of Parana (PUCPR) Imaculada Conceicao, 1155, Zip code 80215-901, Curitiba, PR, Brazil * Corresponding author Tel/fax numbers: (55) 41 32711345 Email: [email protected]

M AN U

3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31

Abstract: As a promising renewable energy source, wind power has environmental benefits, as well as economic and social ones. Due these characteristics, wind farm has grown fast in the last five years, and in some countries, it has already surpassed conventional sources, such as hydro and coal plants. However, owing to the uncertainty of wind speed, it is essential to build an accurate forecasting model for large-scale wind power penetration. This study proposes a hybrid approach that combines the Singular Spectrum Analysis (SSA), which rarely presents application in literature on wind speed forecasting, and a Computing Natural paradigm called Adaptive Neuro Fuzzy Inference System (ANFIS). The SSA decomposes the original wind speed into various components, so these components are pre-processed regarding to the level of original wind series information remained. The main components selected to reconstruct the original series have in their structure the information about trend and harmonic components. The final remaining components grouped are labeled as noise. The ANFIS model uses these two information to construct the model applied to forecasting the next wind speed value. In this way, the hybrid model can learn the trend and the harmonic structure of the wind time series. Experimental results show that prediction errors are significantly reduced using the proposed technique to perform 10min one-step-ahead and -step-ahead wind speed forecast.

TE D

2

Wind Speed Forecasting Approach Based on Singular Spectrum Analysis and Adaptive Neuro Fuzzy Inference System

EP

1

Keywords: wind speed, hybrid forecasting approach, wind farm, machine learning, singular spectrum

33

analysis, neuro fuzzy inference system, time series forecasting.

34

1 Introduction

35

The renewable energy sources, particularly wind power is becoming more popular because of

36

ecological concerns and continuously rising prices of rapidly depleting conventional energy

37

sources. However, several drawbacks complicate the large-scale penetration into large fully

38

connected systems, as the intermittent nature of wind power. Mostly due to wind speed related

39

uncertainties affecting the reliability of energy supply, reliable forecasts methods can effectively

AC C

32

1

ACCEPTED MANUSCRIPT

reduce the risk to the power system. Machine Learning methods have attracted more attention than

41

traditional methods (such as numerical weather predictions and statistical approaches) when the

42

forecasting horizon are comprised between 1 to 3 hours or steps ahead [1-3].

RI PT

40

43

Many approaches applying Natural Computing and Machine Learning concepts have been proposed

45

recently to predicted wind speed. The approaches include promising mathematical techniques such

46

as Support Vector Machine (SVM), Wavelet Transform Method, Artificial Neural Networks,

47

Adaptive Neuro Fuzzy Inference System (ANFIS), Extreme Learning Machine (ELM), among

48

others. Despite the quality of the results, each technique has its own advantages and limitations.

49

Recent work conducted by [4] presents a combination of ARIMA (Autoregressive Integrated

50

Moving Average) model, ELM, SVM and LSSVM (Least Square SVM) to build a model of inputs,

51

and applied it a GPR (Gaussian Process Regression) to perform short-term wind speed forecasting.

52

In [5] a novel bidirectional mechanism and a backward forecasting model based on ELM for wind

53

power time series forecasting have been introduced. This model, as reported by the authors, can

54

achieve good results when applied to forecast wind farm power production for six time horizons as

55

1 up to 6 h.

M AN U

TE D

56

SC

44

A hybrid intelligent forecasting model based on Least Square Support Vector Machine (LSSVM)

58

and the Markov model is proposed in [6]. The LSSVM get the parameters optimized by PSOGA

59

algorithm (Particle Swarm Optimization combined with Gravitational Search Algorithm).

60

Moreover, the reported results shown that the approach outperformed other techniques in terms of

61

forecasting quality. Other approach, which applies Support Vector Regression (SVR) with

62

parameters tuned by Genetic Algorithm is proposed in [7] to forecast wind speed and outperformed

63

the persistence model and well-known autoregressive models: Autoregressive (AR), Autoregressive

64

Moving Average (ARMA), and Autoregressive Integrated Moving Average (ARIMA). An

65

ensemble of Mixture Density Neural Networks developed by [8] to forecasting wind speed and

66

wind power with 72h ahead shown promising results. The model was applied to one wind farm

67

located in Taiwan and achieved good results when compared to Nonlinear Autoregressive Neural

68

Network with eXogenous inputs (NARX), ARMA and GARCH approaches.

AC C

EP

57

69 2

ACCEPTED MANUSCRIPT

In the context of time series forecasting, Artificial Neural Networks (ANNs) is the approach, which

71

have been numerous types of new models proposed [9-13], such as Radial Basis Function neural

72

network (RBF), the Multi-Layer Perceptron network (MLP) and the Support Vector Machine

73

(SVM). In addition, approaches applying hybrid models also have been grown fast due to best

74

results presented. As example can be cited [14], which applies a hybrid model that combines the

75

Extreme Learning Machine (ELM), the Ljung-Box Q-test (LBQ) and the Seasonal Autoregressive

76

Integrated Moving Average (SARIMA) on wind forecasting. Other example presented by [15],

77

applies ANNs, and Kalman filter (KF) based on the ARIMA model.

SC

RI PT

70

78

Various authors recently have pointed out the efficiency and quality of machine learning approach

80

to perform wind speed forecasting. In the present work, we intend to show a new hybrid forecasting

81

approach applied to wind speed forecast. The accuracy of the method is analyzed in terms of Mean

82

Squared Error (MSE), Root Mean Squared Error (RMSE), Mean Absolute Error (MAE) and

83

Standard Deviation (STD). According to the analysis of the obtained results, the proposed hybrid

84

model show better results on forecasting wind speed than ANFIS-FCM and Group Method Data

85

Handling (GMDH).

86

TE D

M AN U

79

The remainder of this paper is organized as follows. Section 2 describes Singular Spectrum

88

Analysis (SSA) theoretical approach and common uses for trend extraction and time series

89

decomposing. In addition, this section also presents the ANFIS and GMDH theoretical foundation.

90

Section 3 presents the proposed method. Section 4 presents the data description, the experimental

91

setup, and our numerical results. After, Section 5 contains the conclusions of this work.

92

2 Background

93

Singular Spectrum Analysis (SSA) is a technique of time series analysis and forecasting [16]. SSA

94

aims at decomposing the original series into a sum of a small number of interpretable components

95

such as a slowly varying trend, oscillatory components and a ‘structure less’ noise [17]. It is based

96

on the singular value decomposition (SVD) of a specific matrix constructed upon the time series.

AC C

EP

87

3

ACCEPTED MANUSCRIPT

Neither a parametric model nor stationary-type conditions have to be assumed for the time series.

98

This makes SSA a model-free method and hence enables SSA to have a wide range of applicability

99

[18]. In this section, we present the foundation of SSA and ANFIS techniques, with detailed

100

presentation, explore SSA foundations to facilitate the clear understanding. Regarding the ANFIS

101

technique, due the popularity and numerous works found in literature applying it, only a general

102

idea of the ANFIS framework is presented in this work. A briefly introduction of the GMDH

103

concepts also is provided.

104

2.1 Singular Spectrum Analysis - SSA

105

Singular Spectrum Analysis was published for the first time in 1986, but this approach became

106

popular only after the book published in 2000 by [17]. The SSA performs four steps:

110 111 112 113

SC M AN U

109

1. Embedding step: the one-dimensional time series is represented as a multidimensional series whose dimension is called the window length. The multidimensional time series forms the trajectory matrix. The only parameter of this step is the window length . Usually  has value as  <  , where  is the time series length; 

TE D

107 108

RI PT

97

2. SVD step: is the singular  value decomposition of the trajectory matrix into a sum of rankone bi-orthogonal matrices;

3. Grouping step: corresponds to splitting the matrices, computed at the SVD step, into

115

several groups and summing the matrices within each group. The result of the step is a

116

representation of the trajectory matrix as a sum of several resultant matrices;

AC C

EP

114

117

4. Diagonal averaging step: It is a linear operation and maps the trajectory matrix of the

118

initial series into the initial series itself. In this way, we obtain a decomposition of the initial

119

series into several additive components.

120 121

The Embedding and SDV steps form the Decomposition Stage and the last two steps form the

122

Reconstruction Stage. The power of the SSA technique lays on the analysis, as example, if we are

123

not interested in certain aspects of the components of the time series; we subsume them under the

124

noise components. The mathematical procedure to perform the two stages described below, 4

ACCEPTED MANUSCRIPT

125 126

considering a real-valued time series  =  = ( , … ,  ) of length . The full detailed explanation can be found in [17], please refers to it for detailed mathematical foundation.

RI PT

127 128 129

2.1.1 Decomposition Stage

130 131

The Decomposition Stage starts with the Embedding Step, which maps the original time series to a

132

 = (  , … ,  ) , which have dimension  and where 1 ≤  ≤ . The matrix formed by these vectors is the trajectory matrix presented in Equation 1

135

139 140 141 142 143 144 145 146 147 148 149 150 151 152



$

% ⋮



$

%

& ⋮



…  …  + …  * ⋱ ⋮ …  )

(1)

The matrix  is a Hankel matrix, it means, has equal elements on the diagonals  + - =

TE D

138



#  = " $ ⋮ ! 

./012302. The next step is the singular value decomposition (SVD) of the trajectory matrix  into a sum of biorthogonal elementary matrices of rank one. Take 4 =  and the eigenvalues of 4 in decreasing order of magnitude (5 ≥ 5 ≥ ⋯ ≥ 5 ≥ 0), also the orthonormal system of the

eigenvectors denoted by 9 , … , 9 corresponding these eigenvalues. Let : = max>, 1?.ℎ 5 >

EP

137

 =  : … :   = (  ), ,

0B = C30 , so if we denote D =  9 /F5 and  = F5 9 D , where  = 1, … , :, then the SVD of the trajectory matrix  can be write as the sum

AC C

136

SC

134

M AN U

133

sequence of multidimensional lagged vectors of size  by forming  =  −  + 1 lagged vectors

 =  +  + ⋯ + G

(2)

This work applies the useful property of Equation 2 to evaluate the contribution of each component

 in the expansion of the original time series. It can be done as follows: among all the matrices

(H) of rank C < : (: is the number of 5 > 0), the matrix ∑H  provides the best approximation to the trajectory matrix , it means that the Frobenius norm is minimum ‖ − H ‖ℳ . In practical 5

ACCEPTED MANUSCRIPT

terms, it means that the components contribution to form the original time series can be measure by

154

Equation 3. Therefore, we can decide which components select to perform the Reconstruction Stage

155

by selecting the first values sorted in decreasing order of their contribution

156

RI PT

153

%./02CM = ∑P

NO

OQR NO

157 158

2.1.2 Reconstruction Stage

SC

159

(3)

160 161

The Reconstruction stage is the following step after the expansion described in Equation 2. It starts

162

Now, let T = V , … , W X, so the matrix Y corresponding to the group T is defined as Y =  +

164 165

⋯ + W , and these matrices are computed for T = T , … , TU . By applying the expansion described in Equation 2 leads to the decomposition

 = Y + Y + ⋯ + YU

170 171 172 173 174

175

176

(4)

In this final step, the matrices Y , … , YU are converted into a new series of length , through a process called diagonal averaging. For step in this procedure, let Z be a   matrix with elements

[ , where 1 ≤  ≤  and 1 ≤ - ≤ . Then set ∗ = min (, ),  ∗ = max (, ) and  =  +  −

EP

169

∗ ∗ 1. Finally, let [ = [ if  <  otherwise [ = [ . The diagonal averaging procedure transforms

the   matrix Z into the time series [ , … , [ using the formula

AC C

168

TE D

166 167

M AN U

163

with the grouping procedure partitions the set of indices 1, … , : into S disjoint subsets T , … , TU .

_ ∗ ∗ c _ ∑U [U,_U d/C 0 ≤  <  , a ∗ ∗ d/C ∗ ≤  <  ∗ , [_ = ∗ ∑U [U,_U b ∗ a ∑  [ ∗ d/C  ∗ ≤  <  ` _ U _∗  U,_U

(5)

6

ACCEPTED MANUSCRIPT

179

equivalent to the decomposition of the initial series , … ,  into the sum of reconstructed S series

180

g = ∑U f (0 = 1,2, … , ) _

(_)

181 182

RI PT

178

(_) (_) Equation 5 applied a resultant matrix Y_ , produces the series e (_) = ( f , … , f ). This is

(6)

SC

177

2.2 Adaptive Neuro Fuzzy Inference System - ANFIS

184

Adaptive Neuro Fuzzy Inference System (ANFIS) is able to combine a fuzzy system’s ability to

185

model a reasoning process and to handle uncertainty, with the learning ability and adaptivity of a

186

neural network [19]. The ANFIS is a type of artificial neural network that is based on Takagi–

187

Sugeno fuzzy and it uses a hybrid learning algorithm. The main structure consists of five layers,

188

which can be called inputs, if part, rules and normalization, then part and output. For simplification, it is assumed that the framework of ANFIS has two inputs , [ and one output i as presented in

TE D

189

M AN U

183

190

Figure 1. Then, the corresponding rule set with two fuzzy if–then rules for a first-order Sugeno

191

consequent parameters. l , m and l , m are the linguistic labels. Each layer is described below.

193

AC C

194

EP

192

fuzzy model can be expressed as shown in Equation 7, where j , k , C and j , k , C are the

195 196

Figure 1 – ANFIS structure for a two-input , [ and one output i [20] 7

ACCEPTED MANUSCRIPT

nopq r: Td 1 l 30: [ 1 m 2ℎs0 i = j + k [ + C nopq t: Td 1 l 30: [ 1 m 2ℎs0 i = j + k [ + C

197 198

201

Layer 1: This is the fuzzification layer. Every node  in this layer is an adaptive node with a node output defined by u , = vwO ( ) d/C  = 1,2

202 203

u , = vxOyz ([)

204

206 207 208

output is the product of all the incoming signals. The output of this layer is given by u, = ~ = vwO ( )vxO ([) d/C  = 1,2

210

213

the normalized firing strength as

217 218 219 220 221 222 223

€O

€R €z

d/C  = 1,2

(11)

Layer 4: This layer is the defuzzification layer. Every node  in this layer is an adaptive node with a

AC C

216

u$, = ~  =

EP

214 215

(10)

Layer 3: This layer is the normalization layer. The 2ℎ node of this layer, labeled as , calculates

TE D

212

(9)

Layer 2: This layer is the rule layer. The nodes  in this layer are fixed nodes, labeled as Π, whose

209

211

(8)

d/C  = 3,4

M AN U

205

RI PT

200

SC

199

(7)

node function

u%, = ~   d = ~   (j + k [ + C )

(12)

where j , k , C is the parameter set of this node. Layer 5: This is the output layer. The single node in this layer is a fixed node labeled as Σ, which computes the overall output as the summation of all incoming signals. Overall output is given by

224 8

ACCEPTED MANUSCRIPT

u&, = ∑ ~   d =

225

∑O €O ‚O ∑O €O

(13)

2.3 GMDH

227

The GMDH was introduced by [21] in the 1966’s. The main idea of the GMDH algorithm is

228

RI PT

226

searching for optimal structure within the space of multipolynomial functions d: ƒ g → ƒ, which it

realizes as a multilayered polynomial network. An important characteristic of GMDH it is to be a

230

self-organizing neural network model. This means that in these networks, the most important input

231

variables, number of layers, neurons in hidden layers and optimal model structure are determined

232

automatically through an iterative process, in which the model structure is modified in order to find

233

the best data prediction [22-24].

234

M AN U

SC

229

The GMDH algorithms start by creating simple polynomials that roughly approximate the aimed

236

systems and incrementally expand the complexity of the polynomials in order to create models that

237

are more accurate [25]. The connections between input and output variables can be expressed by

238

Volterra functional series, discrete analog of which is Kolmogorov–Gabor polynomial [25], as

239

presented in Equation 14

242 243 244 245

† † † † † [ = 3… + ∑†  3  + ∑ ∑ 3   + ∑ ∑ ∑_ 3_   _

EP

241

(14)

where ( ,  , … , † ) is the input vector of variables, (3 , 3 , … , 3† ) is the vector of coefficients or weights and ‡ is the number of input variables.

AC C

240

TE D

235

246

3 Proposed hybrid SSA-ANFIS approach

247

The proposed hybrid model combines SSA and ANFIS. In literature can be found one hybrid model

248

proposed by [26], which resembles our approach, but the cited model differ by the hybridization

249

form applied in ANFIS and SSA. In the cited work, the authors apply SSA to decompose a daily 9

ACCEPTED MANUSCRIPT

250

average wind power time series, obtaining (0 − 9) components. Each component has the upper and

251

lower bound determined by Firefly Algorithm combined to Subtractive Cluster, and the final step

252

bounds on -step-ahead horizon by applying (0 − 9) ANFIS structures. Finally, the time series

254

RI PT

253

comprises to do the forecasting of the (0 − 9) time series components, also the upper and lower reconstruction is performed by summing the (0 − 9) components. The cited work was developed to forecast the uncertainty of the average wind power in daily basis, besides the complexity of the

256

model, the authors reported promising results.

257

The main context of SSA application is slightly different in our approach; where the main purpose

258

of SSA is, decompose the wind speed series into additive components. After this, we evaluate the

259

analysis of the contribution of each component, discarding those components that have no

260

significant contribution to the original signal recovery, and then the reconstruction of the original

261

series is made by grouping the components in two major components: trend plus harmonic and

262

noise components. These two components are the input for two different ANFIS model, which

263

performs one-step-ahead forecasting for each one.

M AN U

SC

255

264

266

Figure 2 shows the main process of the proposed model, where the trend plus harmonic component

TE D

265

is grouped as ( ) , and the noise component as () , both are applied as input for the two instances of ANFIS algorithm. We always implicitly assume that the wind speed series is a sum of several

268

simpler series. The main purpose is training two ANFIS models, one for each component.

269

Therefore, the goal is at the end of the training process, have a final model able to forecasting each

270

one of these components and their structure. Based on aforementioned definition, the original time

271

series can be obtained by summing these two components, the final stage is the summation of the

272

forecasted trend plus harmonic and noise components to get one-step-ahead forecasted value for the

273

original time series.

AC C

274

EP

267

275

Direct Recursive forecasting procedure, as presented in [24], is the approach applied to perform k-

276

step-ahead forecasting in this work. It means that we use in the recursive way the one-step-ahead

277

forecasting method, and each forecasted value is added to the original time series, so the new time

278 279

series length become  + 1. After forecasting, the decomposition stage is performed again, also all

the algorithm steps, as presented in Figure 2. The entire procedure need be repeated after one-step10

ACCEPTED MANUSCRIPT

281 282 283

ahead value be forecasted, until perform all the k-step-ahead forecasting. At the end of the forecasting process, the length of the time series resulting is equals to  + .

The SSA has only two parameters that need to be set, the first parameter is the window length ,

RI PT

280

and the second parameter is the way of grouping components. In [22] can be found explanation

285

about how to do the parameters choice by different approaches. A simple way to do this selection

286

can be done by taking into account the length, seasonality and periodicity of the time series. In this

287

work, we will take in consideration this former procedure, and in the next section provides detailed

288

explanation.

SC

284

TE D

Load the Time Series ; Get the Time Series length ; Plot the Time Series to visualize possible trend and seasonality; Set window length L by applying the Equation (15); Apply SSA-Decomposition Stage (Embedding and SVD step); Evaluate the contribution of each component by Eq.(3); Apply SSA-Reconstruction Stage (Grouping and Diagonal Averaging step) to get the two components: ( ) and () ; Apply the desired Lag (usually all lags containing autocorrelation values above a given threshold (0.2 as example) is selected); Split the ( ) in Training and Test Set and apply ANFIS algorithm; Split the () in Training and Test Set and apply ANFIS algorithm; Evaluated the model quality, if necessary improve the ANFIS parameters to achieve the minimal error desired and retrain the ANFIS Do the one-step-ahead forecast applying all ( ) data into ANFIS model; Do the one-step-ahead forecast applying all () data into ANFIS model; ( ) () Reconstruct the Time Series Š‹‰ (‰ ) = Š‹‰(‰ ) + Š‹‰(‰ ) Complete the Time Series with the forecasted value; Update the new Time Series length  = gŒ€ If k-step-forecasting then While (t < k) Apply SSA-Decomposition Stage (Embedding and SVD step) to the new Time Series with length gŒ€ ; Apply SSA-Reconstruction Stage (Grouping and Diagonal Averaging step) to get the two components: ( ) and () ;

EP

291

The pseudo-code presented below introduces the main idea of the proposed approach.

AC C

290

M AN U

289

11

ACCEPTED MANUSCRIPT

model;

Apply the Lag; Do the one-step-ahead forecast applying all ( ) data into ANFIS

Do the one-step-ahead forecast applying all () data into ANFIS

Reconstruct the Time Series Š‹‰ (‰ ) = Š‹‰(‰ ) + Š‹‰(‰ ) Complete the Time Series with the forecasted value; Update the new Time Series length  = gŒ€ End While End if ()

RI PT

( )

SC

model;

Pseudo Code 1 – Hybrid SSA-ANFIS

AC C

EP

TE D

M AN U

292

12

AC C

EP

TE D

M AN U

SC

RI PT

ACCEPTED MANUSCRIPT

13

293

ACCEPTED MANUSCRIPT

Figure 2 – Proposed approach hybrid SSA-ANFIS

294

4 Numerical Experiments

296

As previously mentioned, the aim of this paper is to develop an approach to perform a wind speed

297

forecasting by applying a hybrid strategy, which allows the hybrid model to learn the structure of

298

trend and harmonic behavior of the original wind time series. Therefore, the main goal is allow

299

better energy production by uncertainty wind speed reducing. In this section, the potential benefit of

300

developing this strategy is analyzed from a quantitative point of view.

301

4.1 SSA - Parameters choice and Wind Time Series

305 306 307 308 309 310 311 312 313 314

SC

M AN U

which help to capture the periodic components with period   =  ~ℎsCs Ž 

TE D

304

grouping components. The choice of  considering wind time series consider the equation (15),  = 240 d 2ℎs 0?SMsC /d 13Sjs1 jsC ℎ/?C 0 > 1  = 24 d 2ℎs 0?SMsC /d 13Sjs1 jsC ℎ/?C 0 = 1

(15)

where  is the length of the time series, 0 is the number of samples observed on interval within one

EP

303

The two parameters of SSA as discussed in Section 3 are the window length  and the way of

hour. The result of this equation must result on  that need be an integer, if it is not, a rounded

AC C

302

RI PT

295

process must be applied. The only restriction that must be observed, as presented in [17],  must lays into a close interval given by 2 ≤  ≤

. As example, if the time series has length  = 1000

 

and the sampling occurs at each 10 minutes, then we have 6 samples per hour; it means that 0 = 6 , in addition  =

………

%‘’

=

……… %%

= 6.94 ≅ 7, which can help capture the seasonality for multiple

periods, if the series has multiple periods.

315 316

The way of grouping, the last parameter that need be choose to perform the SSA algorithm, is done

317

by applying the Equation 3. The result of this equation is the contribution rate of each component 14

ACCEPTED MANUSCRIPT

into the original time series formation. The Table 1 presents an example of this result, when a time

319

series of 1000 data points of wind speed measured in a Wind Farm located in Rio Grande do Norte,

320

Northeast of Brazil, sampled by each 10 min on the first week of May 2016, is decomposed in 7

321

components.

RI PT

318

322 323 324

Table 1 – Component Contribution for original Time Series

326 327

Contribution [%] 99.36685 0.32204 0.13847 0.06714

Component #5 #6 #7

M AN U

Component #1 #2 #3 #4

SC

325

Contribution [%] 0.04531 0.03554 0.02465

Per the results of Table 1, the components 1 to 4 are the best choice to group them into ( ) component, mostly due to the level of contribution on the reconstruction of the original time series

329

as presented in Figure 3.

AC C

EP

330

TE D

328

331 332

Figure 3 – Contribution of each component to recovery the original time series 15

ACCEPTED MANUSCRIPT

333

To select which components can be grouped or discarded during the reconstruction step, the main

335

procedure described in [17] consists by analyzing the decreasing behavior of the singular values for

336

each component; a pure noise series produces a slowly decreasing sequence of singular values.

337

Because the axis scale, this slow variation cannot be identified after the second component in Figure

338

3, but the same information of Figure 3 are presented on numerical values form in Table 1.

339

Therefore, after the forth component, we can identify that moving from the fifth to the next

340

component; the decreasing behavior is very slow (decaying less from 0.01).

341

Considering the aforementioned procedure, the four initial components were grouped as ( ) ,

343 344

SC

M AN U

342

RI PT

334

representing all information related to the trend and harmonic time series behavior. After the forth component, all remaining information were grouped as noise and represented as () . In practical

terms, if the percentage of contribution is less than 0.1%, considering the insights found in [17],

346

which states that after this value a slowly decreasing sequence of singular values is observed; the

347

remaining components can be grouped as noise without loss information.

348

In this work, the wind time series have length  = 1000 and the sampling occurs at each 10

351 352 353 354

minutes, then  ≅ 7. For the grouping step, the components 1 to 4 was choose to form the

component ( ) and the remaining components will form the noise component () . Also, the 3– = 6 was adopted. The Figure 4 presents the original wind series and the Figure 5 the

EP

350

decomposed time series.

AC C

349

TE D

345

16

M AN U

SC

RI PT

ACCEPTED MANUSCRIPT

355

Figure 4 – Wind Time Series to be forecasted.

357

Figure 5 – Time Series decomposed: Components 1 to 4 reconstruced as ( ) and Noise as () .

AC C

EP

TE D

356

358



17

ACCEPTED MANUSCRIPT

4.2 ANFIS - Parameters choice

360

The ANFIS applied is the variant called Sugeno ANFIS-FCM (FCM - Fuzzy c-means for short).

361

Fuzzy c-means is a data clustering technique wherein each data point belongs to a cluster to some

362

degree that is specified by a membership grade. It provides a method that shows how to group data

363

points that populate some multidimensional space into a specific number of different clusters. The

364

number of clusters determine the number of rules and membership functions in the generated FIS.

365

The parameters was chosen by trial and error, also the initial set of values followed the

366

recommendation found in [19].

SC

M AN U

367 368

RI PT

359

Table 2 – Sugeno ANFIS – FCM parameters Value 20 4 200 1e-5

Parameter Max. # Epochs Error Goal Initial Step Size Step Decrease Rate Step Increase Rate

Value 200 0 0.01 0.9 1.1

TE D

Parameter Number of Cluster Partition Matrix Exponent Max. Number of Iterations Minimal Improvement

4.3 Training, Validation and Test Sets

370

The time series presented in Figure 4 must be split in three subsets before performing the model

371

fitting. The subset of training was composed of 70% of original length and 15% composes the

372

validation set of ANFIS one-step-ahead model. The remaining 15% was applied as test set to

compare the quality of the -steps-ahead forecasted values. Also, after the training and validation

AC C

373

EP

369

374

process, the 85% of the original time series is presented to the ANFIS model to evaluate the model

375

performance. The quality indicators to measure the trained and the validation for the one-step-ahead

376

forecasting model are the Mean Squared Error (MSE), Root Mean Squared Error (RMSE), Mean

377 378 379 380

Absolute Error (MAE), Standard Deviation (StD) and Coefficient of Determination (ƒ  ). ‰H‹gg™ ‡—˜ =  ∑ −  HŒ‹š )  (

(16)

381 18

ACCEPTED MANUSCRIPT

‰H‹gg™ ƒ‡—˜ = › ∑ −  HŒ‹š )  (



383





385

R

 ¡¢O£O£¤ Ÿ ¡¥¢¦ )z ∑ž O OQR(ŸO

ƒ  = 1 − ž∑ž

386

RI PT

‰H‹gg™ ‡l˜ = ∑ −  HŒ‹š œ  œ

384

OQR(ŸO

387

 ¡¢O£O£¤ Ÿ ¡¥¢¦ )z O

§

(17)

(18)

(19)

SC

382

4.4 Results for Training and Validation Set

389

Figure 6, 7, and 8 present the results for the training, validation set and for both data, resulting in

390

M AN U

388

85% of the all data related to the component ( ) , when applied the hybrid SSA-ANFIS-FCM

391

model to forecast one-step-ahead.

392 393

The quality of the model was evaluated regarding the correlation factor ƒ  for training, validation

397 398 399 400

TE D

396

Figure 9 presents these results. The hybrid SSA-ANFIS-FCM model learned on the training set and the correlation coefficient for the one-step-ahead forecasting achieved ƒ  = 0.99472. The

EP

395

and for both data set merged as a single one (training and validation) related to Component  ( ) .

correlation achieved on Forecasting model related to the validation set was lower than the correlation for the training set, and its value ƒ  = 0.86626. The last quality measure was applied to evaluate the result when both data set as presented as one time series to the hybrid model, with 85% length, the value achieved was very good: ƒ  = 0.97458.

AC C

394

401 402

The same procedure was applied to the component () . It worth remember this component has in

403

its structure only noise and few information about the original wind time series. The Figures 10 to

404

12 present the results for one-step-ahead forecasting.

405 406

The quality indices of the model was evaluated regarding the correlation factor ƒ  for the training,

407

validation and for the combination of both data set related to Component  () . The Figure 13 19

ACCEPTED MANUSCRIPT

408

presents these results. The hybrid SSA-ANFIS-FCM model learned the structure of the training set

409

set ƒ  = 0.47734. The last performance analysis was done by presenting all data of both set to the

411

model, and the ƒ  = 0.72985.

RI PT

410

and the correlation coefficient for the forecasted values achieved ƒ  = 0.97158, for the validation

Figure 6 – Results for Training Set - 70% length component ( )

EP

414 415

AC C

413

TE D

M AN U

SC

412

20

TE D

Figure 7 – Results for Validation Set – 15% length component ( )

AC C

EP

416 417 418 419 420 421

M AN U

SC

RI PT

ACCEPTED MANUSCRIPT

422 423

21

ACCEPTED MANUSCRIPT

430 431 432

Figure 9 – ƒ  coefficient for training, validation and for both data set merged as single one for

AC C

429

EP

TE D

M AN U

SC

425 426 427 428

Figure 8 – Results for both data set (training and validation merged as single one) for Component ( )

RI PT

424

Component ( )

22

433

Figure 10 – Results for Training Set – 70% length Component ()

AC C

EP

TE D

434

M AN U

SC

RI PT

ACCEPTED MANUSCRIPT

435 436 437 438

Figure 11 – Results for Validation Set – 15% length Component ()

23

M AN U

SC

RI PT

ACCEPTED MANUSCRIPT

439

TE D EP

441 442 443

Figure 12 – Results for both data set (training and validation merged as single one) for Component ()

AC C

440

24

444 445

TE D

M AN U

SC

RI PT

ACCEPTED MANUSCRIPT

Figure 13 – ƒ  coefficient for training, validation and for both data set merged as single one for

EP

446 447

449

As presented in Figure 2 and in Pseudo-Code 1, after obtain the one-step-ahead forecast for each

component ( ) and () , we need to perform the summation of these two components to

AC C

448

Component ()

450

reconstruct the original time series and the continuation of the series with the forecasted values, it

451

means that

452

(Š‹‰) (‰ )

= Š‹‰(‰ ) + Š‹‰(‰ ). The Figure 14 presents the result for the original time ( )

()

series reconstructed after forecasting one-step-ahead process. Therefore, Figure 15 presents the ƒ 

453

coefficient for all data after performs one-step-ahead forecasting.

454

The results for -steps-ahead forecasting are presented in Figure 16. Where the forecasting was

455 456

performed in recursive way, it means the first value forecasted became an input for the next value in 25

ACCEPTED MANUSCRIPT

460 461 462

463

hours forecast horizon. The 15% remaining of the original time series was applied to test the quality of the forecasted values, and the ƒ  = 0.97261 was the coefficient resulting when the forecasted time series was compared to the test data set.

RI PT

459

850 points, 3– = 6, sampled by each 10min, were applied to obtain 144 steps resulting in 24

ª(2 + 1) = d( (2), (2 − 1), (2 − 2), (2 − 3), (2 − 4), (2 − 5))

ª(2 + 2) = d( ª(2 + 1), (2), (2 − 1), (2 − 2), (2 − 3), (2 − 4))

ª(2 + 3) = d« ª(2 + 2), ª(2 + 1), (2), (2 − 1), (2 − 2), (2 − 3)¬ ⋮

ª(2 + 6) = d« ª(2 + 5), ª(2 + 4), ª(2 + 3), ª(2 + 2), ª(2 + 1), (2)¬

SC

458

the forecasting process as Equation 20. As input for the forecasting model, a wind time series with

(20)

M AN U

457

AC C

EP

TE D

464

465 466 467 468

Figure 14 – Final Time Series reconstructed after forecasting one-step-ahead each component ( ) 30: () by SSA-ANFIS-FCM model

26

EP

TE D

Figure 15 – ƒ  for the reconstructed series after forecasting one-step-ahead by SSA-ANFIS-FCM model

AC C

469 470 471 472 473

M AN U

SC

RI PT

ACCEPTED MANUSCRIPT

27

EP

TE D

Figure 16 – Wind Time Series forecasting 144-step-ahead

AC C

474 475 476 477 478

M AN U

SC

RI PT

ACCEPTED MANUSCRIPT

28

Figure 17 – ƒ  after forecasting 144-step-ahead by SSA-ANFIS-FCM model

TE D

479 480

M AN U

SC

RI PT

ACCEPTED MANUSCRIPT

4.5 Comparisons Results

482

In this section, we present comparisons related to the proposed model with the ANFIS-FCM and

483

GMDH. For keeping the validity of comparisons between hybrid SSA-ANFIS and the ANFISFCM, the same parameters were used as presented in Table 2. In addition, 3– = 6 for the Time

AC C

484

EP

481

485

Series as input for all models also was kept. For GMDH algorithm, the Table 3 presents the

486

parameters applied, which the values was obtained considering the recommendation found in [24-

487

and Figure 19 the results for one-step-ahead forecasting. Figures 20 and 21 the results for -steps-

488 489

25, 27] . Results related to ANFIS-FCM model are depicted in Figure 18, which presents ƒ  values ahead forecasting by ANFIS-FCM model.

490 491 492

The results for GMDH model are presented in Figures 22 to 25. For the one-step-ahead forecasting

the ƒ  = 0.93664, decreasing to 0.89661 for -steps-ahead forecasting. In Table 4 the quality 29

ACCEPTED MANUSCRIPT

indices for all approaches, is presented and through it can be confirmed that the Hybrid SSA-ANFIS

494

outperformed the other two approaches.

495

On the results for -steps-ahead forecasting related to ANFIS-FCM and GMDH depicted on Figure

496

RI PT

493

20 and 24, respectively, a lag is observed between the target values and those that were forecasted.

498

We believe the main cause is due to recursive process applied on forecasting model. As presented in

499

Equation 20, after the maximum lag value, all inputs for the model are previous outputs forecasted,

500

causing some instability. For the proposed hybrid model, due to the filtering process into SSA step,

501

the outputs became more stable achieving better results on the forecasting process.

SC

497

M AN U

502

Table 3 – GMDH parameters

503

EP

505

AC C

504

Value 25 5 0 0.7

TE D

Parameter Max. Number of Neurons in a Layer Maximum Number of Layers Selection Pressure (­) Train Ratio

30

508

Figure 18 –ƒ  for ANFIS-FCM model one-step-ahead forecasting

EP

507

AC C

506

TE D

M AN U

SC

RI PT

ACCEPTED MANUSCRIPT

31

M AN U

SC

RI PT

ACCEPTED MANUSCRIPT

509 510

Figure 19 – One-step-forecasting applying ANFIS-FCM

AC C

EP

TE D

511

512 513

Figure 20 – 144-step-forecasting applying ANFIS-FCM 32

ACCEPTED MANUSCRIPT

M AN U

SC

RI PT

514

TE D

518

Figure 21 - ƒ  after forecasting 144-steps-ahead by ANFIS-FCM model

EP

516 517

AC C

515

33

521

Figure 22 –ƒ  for one-step-ahead forecasting GMDH model

EP

520

AC C

519

TE D

M AN U

SC

RI PT

ACCEPTED MANUSCRIPT

34

M AN U

SC

RI PT

ACCEPTED MANUSCRIPT

522 523

Figure 23 – One-step-forecasting applying GMDH

AC C

EP

TE D

524

525 526

Figure 24 – 144-steps-forecasting applying GMDH 35

ACCEPTED MANUSCRIPT

531 532

Figure 25 - ƒ  after forecasting 144-steps-ahead by GMDH model

EP

529 530

Table 4 – Quality Indices for one and k-step-ahead forecasting Technique

AC C

528

TE D

M AN U

SC

RI PT

527

SSA-ANFIS-FCM

ANFIS-FCM

Quality Indices MAE MSE RMSE St.D R2 MAE MSE RMSE St.D

Values for One-step-ahead 0.24053 0.20449 0.45221 0.45238 0.96886 0.48044 0.78746 0.88739 0.88770

Values for -steps-ahead 0.20583 0.08776 0.29625 0.29646 0.97261 0.51360 0.46828 0.68431 0.66950 36

ACCEPTED MANUSCRIPT

GMDH

533

0.88125 0.46025 0.38117 0.61739 0.61774 0.93664

0.85195 0.44280 0.31985 0.56555 0.55554 0.89661

RI PT

R2 MAE MSE RMSE St.D R2

The results provided by SSA-ANFIS-FCM are very close to the real values, we can infer through it

535

that the model learned the structure and of the peaks and trend of the time series. Otherwise, the

536

ANFIS-FCM forecasted quality indices presented in Table 4 were worse than GMDH approach, but

537

when the SSA is combined to ANFIS-FCM, the improvement are really promising.

M AN U

SC

534

538

5 Conclusion

540

In this study, a novel hybrid modeling method is proposed for short-term wind speed forecasting by

541

integrating SSA to ANFIS-FCM approach. This hybrid model combines the power of components

542

extraction and reconstruction provided by SSA algorithm. Moreover, the ANFIS characteristics of

543

learning a nonlinear system behavior introduce the precision and stability of the proposed approach

544

in forecasting wind speed time series. Since the dynamic behavior is recursively updated in the

545

proposed SSA–ANFIS approach, when more than one-step-ahead forecast were performed, the

546

stochastic uncertainty and fluctuations of wind speed can be better accounted for by the presented

547

method than by applying only the ANFIS-FCM and GMDH strategies. The better performance of

EP

hybrid SSA-ANFIS-FCM in both one-step-ahead and -steps-ahead wind speed forecasting

AC C

548

TE D

539

549

indicates the robustness of the proposed approach. Future researches will comprise to derive a

550

hybrid model by applying SSA into GMDH algorithm; also, the SSA-ANFIS-FCM model will be

551

compared, aiming to achieve the high accuracy on wind power forecasting.

552

37

ACCEPTED MANUSCRIPT

References

554

[1] O. Kramer, F. Gieseke, B. Satzger. Wind energy prediction and monitoring with neural

555

computation. Neurocomputing, 109 (2013), pp. 84–93

556

RI PT

553

[2] S. Salcedo-Sanz, J. Rojo-Álvarez, M. Martínez-Ramón, G. Camps-Valls. Support vector

558

machines in engineering: an overview. Wiley Interdiscip. Rev. Data Min. Knowl. Discov. 4

559

(2014), pp. 234–267

SC

557

560

562

[3] J. Heinermann, O. Kramer, Machine learning ensembles for wind power prediction, Renewable

M AN U

561

Energy, Volume 89, April 2016, Pages 671-679, ISSN 0960-1481.

563

[4] J. Wang, J. Hu, A robust combination approach for short-term wind speed forecasting and

565

analysis – Combination of the ARIMA (Autoregressive Integrated Moving Average), ELM

566

(Extreme Learning Machine), SVM (Support Vector Machine) and LSSVM (Least Square SVM)

567

forecasts using a GPR (Gaussian Process Regression) model, Energy, Volume 93, Part 1, 15

568

December 2015, Pages 41-56, ISSN 0360-5442.

569

TE D

564

[5] Y. Zhao, L. Ye, Z. Li, X. Song, Y. Lang, J. Su, A novel bidirectional mechanism based on time

571

series model for wind power forecasting, Applied Energy, Volume 177, 1 September 2016,

572

Pages 793-803, ISSN 0306-2619.

AC C

573

EP

570

574

[6] Y. Wang, J. Wang, X. Wei, A hybrid wind speed-forecasting model based on phase space

575

reconstruction theory and Markov model: A case study of wind farms in northwest China,

576

Energy, Volume 91, November 2015, Pages 556-572, ISSN 0360-5442.

577 578

[7] G. Santamaría-Bonfil, A. Reyes-Ballesteros, C. Gershenson, Wind speed forecasting for wind

579

farms: A method based on support vector regression, Renewable Energy, Volume 85, January

580

2016, Pages 790-809, ISSN 0960-1481.

581 38

ACCEPTED MANUSCRIPT

[8] Z. Men, E. Yee, F.-S. Lien, D. Wen, Y. Chen, Short-term wind speed and power forecasting

583

using an ensemble of mixture density neural networks, Renewable Energy, Volume 87, Part 1,

584

March 2016, Pages 203-211, ISSN 0960-1481.

RI PT

582

585

[9] Y-L. Tu, T.-J. Chang, C.-L. Chen, Y.-J. Chang, Estimation of monthly wind power outputs of

587

WECS with limited record period using artificial neural networks, Energy Conversion and

588

Management, Volume 59, July 2012, Pages 114-121, ISSN 0196-8904.

SC

586

589

[10] K. G. Sheela, S.N. Deepa, Neural network based hybrid computing model for wind speed

591

prediction, Neurocomputing, Volume 122, 25 December 2013, Pages 425-429, ISSN 0925-2312.

592

M AN U

590

593

[11] Q. Cao, B. T. Ewing, Mark A. Thompson, Forecasting wind speed with recurrent neural

594

networks, European Journal of Operational Research, Volume 221, Issue 1, 16 August 2012,

595

Pages 148-154, ISSN 0377-2217.

596

[12] D.A. Fadare, The application of artificial neural networks to mapping of wind speed profile for

598

energy application in Nigeria, Applied Energy, Volume 87, Issue 3, March 2010, Pages 934-942,

599

ISSN 0306-2619.

TE D

597

EP

600

[13] J. Zhou, J. Shi, G. Li, Fine tuning support vector machines for short-term wind speed

602

forecasting, Energy Conversion and Management, Volume 52, Issue 4, April 2011, Pages 1990-

603

1998, ISSN 0196-8904.

604 605 606

AC C

601

[14] J. Wang, J. Hu, K. Ma, Y. Zhang, A self-adaptive hybrid approach for wind speed forecasting, Renewable Energy, Volume 78, June 2015, Pages 374-385, ISSN 0960-1481.

607 608 609

[15] O. B. Shukur, M. H. Lee, Daily wind speed forecasting through hybrid KF-ANN model based on ARIMA, Renewable Energy, Volume 76, April 2015, Pages 637-647, ISSN 0960-1481.

610

39

ACCEPTED MANUSCRIPT

611 612

[16] A.D.D. Zhigljavsky (eds) (1997) Principal components of time series: the “Caterpillar” method. St.Petersburg Press, St. Petersburg (in Russian).

613

615

[17] N. Golyandina, V. Nekrutkin, A. Zhigljavsky, Analysis of time series structure: SSA and related techniques, (2001) Chapman & Hall/CRC, 305 pgs.

616

618

[18] J.B. Elsner, A.A.Tsonis, (1996) Singular spectrum analysis: a new tool in time series analysis. Plenum, New York.

SC

617

619

621

[19] J. S. R. Jang, “ANFIS: adaptive-network-based fuzzy inference system,” IEEE Transactions on

M AN U

620

Systems, Man and Cybernetics, vol. 23, pp. 665-685, 1993.

622 623 624

[20] E. Akkaya, ANFIS based prediction model for biomass heating value using proximate analysis components, Fuel, Volume 180, 15 September 2016, Pages 687-693, ISSN 0016-2361.

625

627

[21] A. G. Ivakhnenko, Polynomial Theory of Complex Systems. IEEE Transactions on Systems,

TE D

626

Man, and Cybernetics, Vol. SMC-1, No. 4, pp. 364-378 (1971).

628

630

[22] A.G. Ivakhnenko, A.J. Müller, Self-organization of neuron nets with active neurons. Pattern Recognition Image Analysis 1994;4(2):177e88.

EP

629

631

633 634

[23] A.G. Ivakhnenko, D. Wunsch, Inductive sorting/out GMDH algorithms with polynomial

AC C

632

RI PT

614

complexity for active neurons of neural network. IEEE 1999;6 (99): 1169e73.

635

[24] M.G. De Giorgi, M. Malvoni, P.M. Congedo, Comparison of strategies for multi-step ahead

636

photovoltaic power forecasting models based on hybrid group method of data handling networks

637

and least square support vector machine, Energy, Volume 107, 15 July 2016, Pages 360-373,

638

ISSN 0360-5442.

639 640

[25] M. Dorn, A.L.S. Braga, C.H. Llanos, L.S. Coelho, A GMDH polynomial neural network-based

641

method to predict approximate three-dimensional structures of polypeptides, Expert Systems 40

ACCEPTED MANUSCRIPT

642

with Applications, Volume 39, Issue 15, 1 November 2012, Pages 12268-12279, ISSN 0957-

643

4174.

644

[26] Z. Zhang, Y. Song, F. Liu, J. Liu, Daily Average Wind Power Interval Forecasts Based on an

646

Optimal Adaptive-Network-Based Fuzzy Inference System and Singular Spectrum Analysis.

647

Sustainability 2016, 8, 125.

RI PT

645

648

[27] S.-Kwun Oh, W. Pedrycz, B.-Jun Park, Polynomial neural networks architecture: analysis and

650

design, In Computers & Electrical Engineering, Volume 29, Issue 6, 2003, Pages 703-725, ISSN

651

0045-7906, https://doi.org/10.1016/S0045-7906(02)00045-9.

M AN U

SC

649

AC C

EP

TE D

652

41

ACCEPTED MANUSCRIPT Research highlights •

An hybrid SSA-ANFIS-FCM approach for wind speed forecasting is proposed



The hybridization of SSA and ANFIS-FCM is an original idea on context of Machine Learning applied to wind speed forecast The model presents excellent results with stable performance

AC C

EP

TE D

M AN U

SC

RI PT