Journal Pre-proof A Novel Approach for Harmonic Tidal Currents Constitutions Forecasting using Hybrid Intelligent Models based on Clustering Methodologies
Hamed H.H. Aly PII:
S0960-1481(19)31445-4
DOI:
https://doi.org/10.1016/j.renene.2019.09.107
Reference:
RENE 12329
To appear in:
Renewable Energy
Received Date:
06 July 2018
Accepted Date:
21 September 2019
Please cite this article as: Hamed H.H. Aly, A Novel Approach for Harmonic Tidal Currents Constitutions Forecasting using Hybrid Intelligent Models based on Clustering Methodologies, Renewable Energy (2019), https://doi.org/10.1016/j.renene.2019.09.107
This is a PDF file of an article that has undergone enhancements after acceptance, such as the addition of a cover page and metadata, and formatting for readability, but it is not yet the definitive version of record. This version will undergo additional copyediting, typesetting and review before it is published in its final form, but we are providing this version to give early visibility of the article. Please note that, during the production process, errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain. Β© 2019 Published by Elsevier.
Journal Pre-proof
A Novel Approach for Harmonic Tidal Currents Constitutions Forecasting using Hybrid Intelligent Models based on Clustering Methodologies Hamed H. H. Aly Electrical and Computer Engineering Department, Dalhousie University and Electrical Power and Machines Engineering, Zagazig University, Mathematics and Statistics Department, Acadia University
Abstractβ Forecasting of renewable energy resources and their output power is playing a key role to improve the grid energy efficiency by making some load generation management. Tidal currents output power is depending on the tidal currents constitutions (speed magnitude and direction) forecasting. The accuracy of the tidal currents forecasting models is very important especially when we deal with smart grid and renewable energy integration. Many models are proposed in the literature for tidal currents forecasting but most of the models are not able to control the requirements of the smart grid due to their accuracy. This paper is proposing hybrid approaches for harmonic tidal currents constitutions forecasting based on clustering approaches to improve the system accuracy. These hybrid models involve various combinations of Wavelet and Artificial Neural Network (WNN and ANN) and Fourier Series Based on Least Square Method (FSLSM) techniques. The proposed work is validated by using two different datasets one for tidal currents speed magnitude and the other one for tidal currents direction as well as K-fold cross validation. Simulations results prove the importance of the proposed models to improve the system performance. The proposed models are tested based on actual tidal currents data collected from the Bay of Fundy, Canada in 2008.
Keywords - Tidal currents forecasting; Smart grid; ANN; FSLSM; WNN; Clustering techniques. 1
Journal Pre-proof
1. Introduction Tidal currents energy is playing an important role these days as it is pollution free energy, but its cost is still high compared to other renewable energy resources. Tidal currents energy is promising to be one of the most valuable energies used in the future as it is somewhat predictable energy compared to wind energy. Prediction of tidal currents output power is based on the prediction of tidal currents speed and direction. Forecasting tidal currents using data gathered for short periods of time is predictable to within 98% accuracy. The accuracy of models used for tidal currents forecasting is very important as the integration of the tidal currents power into the grid is depending on its forecasting accuracy. Many works done in the literature for tidal currents forecasting but most of the work done was depending on using an artificial technique for the tidal forecasting and rarely used hybrid models [1-5]. In this paper, we propose hybrid models based on clustering techniques to improve the overall system accuracy; hence improve the integration and increase the overall system stability.
2. Previous Work Done on Tidal Currents Forecasting The idea of representing the tidal currents by a number of simple harmonic constitutions came by Darwin et al. They stated that βThe tidal oscillation, of the ocean may be represented as the sum of a number of simple harmonic wavesβ [6]. Harmonic component theory of tidal currents constitutions is modified by Doodson who suggested to use least squares estimation to estimate the parameters of the series [7-9]. Aly and El-Hawary proposed different models based on ANN and Fourier series based on Least Square method for the tidal currents speed forecasting. They proposed a hybrid model of ANN and FSLSM for improving the system accuracy [1, 2]. Lee and Jeng proposed a model of ANN for tidal forecasting using data from three harbors in Taiwan [10]. 2
Journal Pre-proof
Lee used the ANN backpropagation with descent [11]. Vijay and Govil used radial basis function of ANN and WNN approaches. They found that both ANN and WNN are effective but WNN requires longer time for training [12]. Chen et al. and Adamowski used hybrid model of WNN and ANN. The proposed model improved the accuracy compared to the ANN or WNN alone [13, 14]. An interval-based model constructs the optimal prediction intervals based on support vector regression and a non-parametric method is used for tidal currents forecasting. The fuzzy membership functions are used for that model to provide appropriate balance between the coverage probability and normalized average width [15]. A model based on WNN and support vector regression along with an optimization method based on the bat algorithm is used to train the support vector regression model [16]. Most of the previous used techniques were using the data as it is without any modification or analysis to improve the system accuracy. Due to uncertainties and fluctuation of the data the models were not accurate especially if we are going to use the models to predict the output power for the purpose of integration into the smart grid. In this paper we use clustering techniques to generate different segments of data (we use different number of cluster segments) to improve the overall system accuracy and executable time needed for the training. As well as we use hybrid models of WNN, ANN and FSLSM to add many desired features to the proposed models. Different calibrations methods are used to compare the accuracy of the work to the previous work done. From these methods we use mean absolute error (MAPE). MAPE is defined as the absolute average difference between the actual and predicted values as shown in equation 1[17].
ππ΄ππΈ =
1 π
β|
|
π
π₯π β π¦π
1
π¦π
3
(1)
Journal Pre-proof
Another calibration method is the index r which is describing the strength of the relation between the measured and predicted and is defined in equation 2[17].
π=
π(βπ₯ππ¦π) β (βπ¦π)(βπ₯π)
(πβπ₯π2 β (βπ₯π)2)(πβπ¦π2 β (βπ¦π)2)
(2)
The most commonly used factor also is normalized root mean square error (nRMSE) which is defined in equation 3 [18, 19]. 1 π β1 (π₯π β π¦π)2 π ππ
πππΈ = π¦
(3)
where, x is the predicted value, y is the actual value, N is the number of observations and Μ
y is mean of the actual data. In this paper we use a clustering technique to discriminate between different shapes for the data and put them in different segments based on their common characteristics. Then we use six different hybrid models of WNN and ANN; ANN and WNN; ANN and FSLSM; FSLSM and ANN; WNN and FSLSM and finally FSLSM and WNN to choose the best one. The clustered data is fed first to the first technique and the error (which is the difference between the actual and the predicted data) is fed to the second technique. Then the output is the summation of the output from the first technique and the output from the second technique. The paper is organized as follows. In Sections 3, 4 and 5 the concept of clustering, ANN, WNN and FSLSM methodologies and the proposed models constructions are introduced. In section 6 we define six different hybrid models proposed as well as the results of the proposed hybrid models along with the simulations for the tidal currents speed and direction forecasting as a way of validation as well as using k-fold for cross validation. The new contribution in this paper is the hybrid models of different techniques 4
Journal Pre-proof
based on Artificial Intelligent as well as the feedback signals from the residuals to improve the overall system accuracy with the help of clustering techniques. Different algorithms with different learning rates, different training functions and epochs are tested based on the clustered data.
3. Clustering Techniques for the Tidal Currents Constitutions Forecasting Model One of the best mathematical methods for improving the forecasting models is to use clustering techniques. K-means is unsupervised learning algorithm used for solving clustering problems. The clustering algorithm could be described by the following steps: 1.The data are clustered into k groups randomly. 2.The k centroid should be defined for each cluster and then take each point near to these centroids. 3. Recalculate new centroids. K centroids are changing their locations repeatedly until no more change takes place. 4.This way is depending on minimizing a certain objective function which is the error as shown in Fig. 1.
xβ¦β¦.. β¦.. Start m2 β¦β¦.. x
β¦β¦..x β¦.. Start m3 β¦xβ¦..
β¦β¦..x β¦.. Start m1 xβ¦β¦..
x β¦β¦.. β¦.. Start mn β¦β¦..x
x β¦β¦.. β¦.. Start m4 β¦β¦.. β¦β¦.. x Final Boundary
Fig. 1 K-mean clustering technique
5
Journal Pre-proof
The tidal currents constitutions features are varying from location to another and from time to time, so it is a good idea to make some analysis to examine the behaviour of the tidal currents constitutions and create some common segments that has the same characteristics. This is what we call clustering. This could be done using various statistical analysis by different analytical methods and different programs. In this paper we use a clustering technique based on k-mean shown Fig. 1 with the help of R-studio and Matlab softwares. After creating the clustering segments, the data is fed to ANN, WNN and FSLSM for forecasting. This technique is reducing the error profile for the predicted data. The number and size of clusters segments are playing a crucial role for improving the characteristics of the forecasting methods. As the size is increased, the performance is improved but at the same time the number of clusters segments will be decreased, and the overall improvement will be affected. So, this is a nonlinear problem and it is hard to find the best size and segments numbers. By increasing the clustering segment size, the performance will be improved but at the same time will minimize the clusters numbers which has a negative impact on the performance and vise versa. Many clustering sizes and numbers are used in this paper to choose the best one depending on the minimum error. After making some statistical studies we found that the improvement is not recognized if the clusters numbers are above eight and below four, but the best one is when the clusters numbers are eight. In this paper we use different forecasting techniques to predict the tidal currents constitutions after for the clustered data. In the k-mean clusters approach we use z as data with t instances, and S1, S2, . . . ,Sk are the k disjoint clusters of z. The error function is defined by equation 4.
π
Error = βπ = 1βπ₯ β πππ·(π¦, π(ππ))
6
(4)
Journal Pre-proof
where ΞΌ(Si) is the centroid of cluster Si, D(y,ΞΌ(Si)) is the distance between y data point and ΞΌ(Si), the Euclidean distance for any data points yi = (yi1, yi2, . . . , yin) and yj =(yj1, yj2, . . . , yjn) is defined by equation (4) and shown in Fig. 1 [20, 21]. π
Deuc(yi, yj) = (βπ = 1(π¦ππ β π¦ππ) 2)
1/2
(5)
4. Proposed Network Construction
We use hybrid models of WNN, ANN and FSLSM and we aggregate the clustered data into aggregated forecasting hybrid model. Fig. 2 shows the proposed clustering forecasting aggregated model. The data is fed first to the clustering algorithm and the clustered data is fed to different forecasting models to choose the best forecasting model and then data is aggregated to have one model to have an output for each input. Fig. 3 shows the proposed models. The data is fed first to R-studio for creating clustering segments based on the common features. The clustered data is fed to ANN then compare the forecasted data by the actual one to calculate the error data (residuals). The error data is fed to WNN to find the forecasted residuals. The forecasted data is equal to the summation of output from ANN and WNN. In each case the number of neurons, hidden layers and epochs is changed regularly to minimize the error as well as the trained functions. Then we choose the best case for the forecasted data. The forecasted data and the actual data are compared to calculate MAPE and nRMSE. The whole process is repeated for six different approaches (WNN+ANN, ANN+ FSLSM, FSLSM + ANN, WNN + FSLSM, and FSLSM + WNN). The last step is to compare between different approaches and choose the best one that has minimum MAPE, and nRMSE.
7
Journal Pre-proof
Data input
R-studio for having different clusters numbers and sizes
Cluster 1
Hybrid forecasting model 1
Hybrid forecasting model 4
Cluster 2
Cluster β¦
Hybrid forecasting model 2
Hybrid forecasting model 5
Cluster n
Hybrid forecasting model 3
Hybrid forecasting model 6
Aggregated forecasting model
Fig. 2 Input output clustered forecasting aggregated model
8
Journal Pre-proof Start
Load data file β two months ahead Use R-studio for creating clustering segments
Feed clustered data to ANN; compare model to actual data to find the error.
Feed clustered data to ANN then compare model to actual data to calculate the error data.
Feed clustered data to WNN; compare the model data to actual data to find the error.
Feed clustered data to FSLSM then compare model data to actual data calculate the error data.
Feed clustered data to WNN then compare the model data to actual data to find the error.
Feed clustered data to FSLSM then compare model data to actual data to calculate the error data.
Feed the error data to WNN
Feed the error data to FSLSM
Feed the error data to ANN
Feed the error data to FSLSM
Feed the error data to ANN
Feed the error data to WNN
ANN+ WNN
ANN+ FSLSM
WNN+ FSLSM
WNN+ ANN
FSLSM +ANN
FSLSM +WNN
Calculate MAPE, and ππ
πππΈ for each model and choose the best one.
Stop
Fig. 3 Tidal currents forecasting proposed models
9
Journal Pre-proof
Algorithm 1. Load data file. 2. Use R-studio for creating clustering segments. 3. Feed the clustered data to ANN then compare the forecasted data by the actual one to calculate the error data. 4. Feed the error data to WNN and find the forecasted error. 5. The forecasted data is equal to the summation of output from ANN and WNN. 6. Compare between the forecasted data and the actual data and calculate MAPE and ππ
πππΈ. 7. Repeat from 3 to 6 by using different approaches (WNN+ANN, ANN+ FSLSM, FSLSM + ANN, WNN + FSLSM, and FSLSM + WNN). 8. Compare between different models and choose the best one that has minimum MAPE, and ππ
πππΈ.
5. ANN, WNN and FSLM Approaches Back propagation ANN is a supervised training approach and is used to improve the error derivative of the weights and biases. The training is done in two modes, the online and the batch modes. The Radial Basis Function network is practical application model. Linear transfer functions are used for layers and nonlinear transfer functions are used for hidden layers. It requires more neurons compared to BP but it needs less time for the training. FSLSM model can be described by equation (6).
π
Zestimated(k) = DC + βπ = 1ππsin (π€ππ + ππ)
10
(6)
Journal Pre-proof
where DC is the constant value and is depending on the data; k is discrete time; a is amplitude; i is the number of harmonics in the wave; and ΞΈi is the phase shift. The actual data is defined by: Z = DC + HX + Ζ(k)
(6.a)
where Ζ(k) is the error, and then we apply the least squares model, as shown in the equations (711), to estimate the Fourier series parameters Xhat = (HTH)-1HTZ
(7)
Zsetimate = DC + HXhat
(8)
Zerror = Z - Zsetimate
(9)
[
H=
π πππ€1 πππ π€1 β¦β¦β¦β¦β¦β¦. π πππ€π πππ π€π
]
π ππ2π€1 πππ 2π€1β¦β¦β¦β¦β¦β¦.β¦β¦β¦.πππ 2π€π β¦β¦β¦β¦ β¦β¦β¦β¦β¦. β¦β¦β¦..β¦β¦β¦.β¦β¦β¦..β¦β¦β¦β¦. π ππππ€1 β¦β¦β¦β¦β¦..β¦β¦β¦.. β¦β¦.. β¦β¦β¦. πππ ππ€π
[ ]
(10)
π1πππ π1
X=
β¦β¦β¦. β¦β¦.. πππππ ππ
(11)
6. Harmonics Tidal Currents Constitutions Forecasting Models 6.1. Tidal Currents Hybrid Model of ANN and WNN In this model we use ANN feed forward back propagation function to train the un-clustered (row data) and clustered data. The input to the model is the time for each ten minutes for two months and the output is the tidal currents speed in meters per second. The forecasted data for the unclustered data is compared to the actual data and we calculate the residuals which is the forecasted minus the actual. We do the same with the clustered data, but the data is aggregated before 11
Journal Pre-proof
calculating the residuals. The residuals are then fed to WNN for both cases (clustered and unclustered data). The model is validated by using different set of data which is the tidal currents direction forecasting. The MAPE and nRMS are calculated for that model for tidal currents speed and direction forecasting as shown in table 1 and 2. As we see from table 1 the MAPE for unclustered data is 1.55791 and for clustered data is 1.1578 and nRMS is 0.0406 for un-clustered data and is 0.0202 for clustered data. Fig. 4 shows the relationship between the actual and forecasted data with and without clustering for tidal currents speed forecasting and the time. Fig. 5 shows the same relationship for the tidal currents direction as a way of validation of the proposed model. As we see the clustering technique is improving the performance of the forecasting model.
4 Forecasted with clustering Actual data Forecasted without clustering
3.5
3
Speed (m/s)
2.5
2
1.5
1
0.5
0
0
500
1000
1500
2000
2500
3000
3500
4000
4500
Smaple point time
Fig. 4 The relationship between the actual data, forecasted data with and without clustering technique for tidal currents speed forecasting and the time in minutes based on ANNWNN Hybrid model. 12
5000
Journal Pre-proof
400 Forecasted with clustering Actual data Forecasted without clustering
350
Direction (degrees)
300
250
200
150
100
50
0
0
500
1000
1500
2000
2500
3000
3500
4000
4500
5000
Sample point time
Fig. 5 The relationship between the actual data, forecasted data with and without clustering for tidal currents direction forecasting and the time based on ANNWNN Hybrid model. 2.6
Forecasted with clustering Actual data Forecasted without clustering
2.4
Speed (m/s)
2.2
2
1.8
1.6
3.5
1.4
Forecasted with clustering Actual data Forecasted without clustering
1.2
3 1200
1400
1600
1800
2000
2200
Sample point time
Speed (m/s)
2.5
2
1.5
1
0.5
0
0
500
1000
1500
2000
2500
3000
3500
4000
4500
Sample point time
Fig. 6 The relationship between the actual data, forecasted data with and without clustering technique for tidal currents speed forecasting and the time in minutes based on based on WNNANN Hybrid model.
13
5000
Journal Pre-proof
320
Forecasted with clustering Actual data Forecasted without clustering
Direction (degrees)
300
280
260
240
400
350
600
300
Direction (degrees)
Forecasted with clustering Actual data Forecasted without clustering
220
800
1000
1200
1400
1600
1800
2000
Sample point time
250
200
150
100
50
0
0
500
1000
1500
2000
2500
3000
3500
4000
4500
Sample point time
Fig. 7 The relationship between the actual data, forecasted data with and without clustering technique for tidal currents direction forecasting and the time based on WNNANN Hybrid model. 6.2. Tidal Currents Hybrid Model of WNN and ANN In this model we use WNN to train the un-clustered (row) and clustered data. The input to that model is the time for each ten minutes for two months and the output is the tidal currents speed in meter per second. The forecasted data for the un-clustered data model is compared to the actual data and we calculate the residuals which is the forecasted minus the actual. We do the same with the clustered data, but the data is aggregated before calculating the residuals. The residuals are then fed to ANN for both cases (clustered and un-clustered data). The model is validated by using different set of data from which is the tidal currents direction forecasting. The MAPE and nRMS are calculated for that model for tidal currents speed and direction forecasting and it is shown in table 1 and 2. As we see from table 1 the MAPE for un-clustered data is 1.2522 and for clustered 14
5000
Journal Pre-proof
data is 1.0322 and nRMS is 0.0176 and for un-clustered data and is 0.0091 for clustered data. As we see the clustering technique is improving the performance of the forecasting model. Fig. 6 shows the relationship between the actual and forecasted data with and without clustering for tidal currents speed forecasting and the time. Fig. 7 shows the same relationship for the tidal currents direction as a way of validation of the proposed model.
This model is considered as the best
model for both tidal currents speed magnitude and direction forecasting. 6.3. Tidal Currents Hybrid Model of ANN and FSLSM In this model we use ANN to train the un-clustered (row) and clustered data. The forecasted data for the un-clustered data is compared to the actual data and we calculate the residuals. We do the same with the clustered data, but the data is aggregated before calculating the error. The residuals are fed then to FSLSM for both cases (clustered and un-clustered data). 4 Forecasted with clustering Actual data Forecasted without clustering
3.5
3
Speed (m/s)
2.5
2
1.5
1
0.5
0
0
500
1000
1500
2000
2500
3000
3500
4000
4500
Sample point time
Fig. 8 The relationship between the actual data, forecasted data with and without clustering for tidal speed forecasting and the time in minutes based on based on ANNFSLSM Hybrid model. The model is validated by using different set of data from which is the tidal currents direction forecasting. The MAPE and nRMS are calculated for that model for tidal currents speed and
15
5000
Journal Pre-proof
direction forecasting and it is shown in table 1 and 2. As we see from table 1 the MAPE for unclustered data is 1.4945 and for clustered data is 1.1945 and nRMS is 0.0421 for un-clustered data and is 0.0292 for clustered data. Fig. 8 shows the relationship between the actual and forecasted data with and without clustering for tidal currents speed forecasting and the time. Fig. 9 shows the same relationship for the tidal currents direction as a way of validation of the proposed model.
400 Forecasted with clustering Actual data Forecasted without clustering
350
Driection (degrees)
300
250
200
150
100
50
0
0
500
1000
1500
2000
2500
3000
3500
4000
4500
Sample point time
Fig. 9 The relationship between the actual data, forecasted data with and without clustering technique for tidal currents direction forecasting and the time based on FSLSMANN Hybrid model.
6.4. Tidal Currents Hybrid Model of FSLSM and ANN In this model we use FSLSM to train the un-clustered (row) and clustered data. The forecasted data for the un-clustered data is compared to the actual data and we calculate the error. We do the same with the clustered data, but the data is aggregated before calculating the error. The error 16
5000
Journal Pre-proof
is fed then to ANN for both cases (clustered and un-clustered data). The model is validated by using different set of data from which is the tidal currents direction forecasting.
4 Forecasted with clustering Actual data Forecasted without clustering
3.5
3
Speed (m/s)
2.5
2
1.5
1
0.5
0
0
500
1000
1500
2000
2500
3000
3500
4000
4500
Sample point time
Fig. 10 The relationship between the actual data, forecasted data with and without clustering technique for tidal currents speed forecasting and the time in minutes based on FSLSMANN Hybrid model. The MAPE and nRMS are calculated for that model for tidal currents speed and direction forecasting and it is shown in table 1 and 2. As we see from table 1 the MAPE for un-clustered data is 1.686 and for clustered data is 1.306 and nRMS is 0.0361 for un-clustered data and is 0.0212 for clustered data. As we see the clustering technique is improving the performance of the forecasting model. Fig. 10 shows the relationship between the actual data, forecasted data with and without clustering technique for the tidal currents speed and the time. Fig. 11 shows the same relationship between tidal currents direction and the time.
17
5000
Journal Pre-proof
400 Forecasted with clustering Actual data Forecasted without clustering
350
Direction (degrees)
300
250
200
150
100
50
0
0
500
1000
1500
2000
2500
3000
3500
4000
4500
Sample point time
Fig. 11 The relationship between the actual data, forecasted data with and without clustering for tidal currents direction forecasting and the time based on FSLSMANN Hybrid model. 6.5. Tidal Currents Hybrid Model of WNN and FSLSM WNN is used first to train the un-clustered (row) and clustered data. The input to that model is the time for each ten minutes for two months and the output is the tidal currents speed in meter per second. We do the same with the clustered data. The error is then fed to FSLSM for both cases (clustered and un-clustered data). The model is validated by using the tidal currents direction data. The MAPE and nRMS are calculated for that model for tidal currents speed and direction forecasting and it is shown in table 1 and 2. As we see from table 1 the MAPE for un-clustered data is 1.5212 and for clustered data is 1.2212 and nRMS is 0.0298 for un-clustered data and is 0.0102 for clustered data. As we see the clustering technique is improving the performance of the forecasting model. Fig. 12 shows the relationship between the actual data, forecasted data with and without clustering technique for tidal currents speed and the time while Fig. 13 shows the same relationship for the tidal currents direction forecasting.
18
5000
Journal Pre-proof
4 Forecasted with clustering Actual data Forecasted without clustering
3.5
3
Speed (m/s)
2.5
2
1.5
1
0.5
0
0
500
1000
1500
2000
2500
3000
3500
4000
4500
5000
Sample point time
Fig. 12 The relationship between the actual data, forecasted data with and without clustering for tidal currents speed forecasting and the time in minutes based on for WNNFSLSM Hybrid model.
400 Forecasted with clustering Actual data Forecasted without clustering
350
Direction (degrees)
300
250
200
150
100
50
0
0
500
1000
1500
2000
2500
3000
3500
4000
4500
Sample point time
Fig. 13 The relationship between the actual data, forecasted data with and without clustering for tidal currents direction forecasting and the time based on WNNFSLSM Hybrid model. 6.6. Tidal Currents Hybrid Model of FSLSM and WNN In this model we feed the un-clustered data to FSLSM first. We do the same with the clustered data, but the data is aggregated before calculating the error. The error is fed then to WNN for both
19
5000
Journal Pre-proof
cases. The model is validated by using different set of data which is the tidal currents direction forecasting. The MAPE and nRMS are calculated for that model for tidal currents speed and direction forecasting and it is shown in table 1 and 2. As we see from table 1 the MAPE for unclustered data is 1.7069 and for clustered data is 1.4068 and nRMS is 0.0421 for un-clustered data and is 0.0299 for clustered data. As we see the clustering technique is improving the performance of the forecasting model. Fig. 14 shows the relationship between the actual and forecasted data with and without clustering for tidal currents speed forecasting and the time a certain period. Fig. 15 shows the same relationship for the tidal currents direction forecasting.
4 Forecasted with clustering Actual data Forecasted without clustering
3.5
3
Speed (m/s)
2.5
2
1.5
1
0.5
0
0
500
1000
1500
2000
2500
Sample point time
3000
3500
4000
4500
Fig. 14 The relationship between the actual data, forecasted data with and without clustering technique for tidal currents speed forecasting and the time in minutes based on based on FSLSMWNN Hybrid model.
20
5000
Journal Pre-proof
400 Forecasted with clustering Actual data Forecasted without clustering
350
Direction (degrees)
300
250
200
150
100
50
0
0
500
1000
1500
2000
2500
3000
3500
4000
4500
Sample Point time
Fig. 15 The relationship between the actual data, forecasted data with and without clustering technique for tidal currents direction forecasting and the time based on WNNFSLSM Hybrid model. Another way of validation is to use K-fold cross validation. In k-fold cross validation the data is first partitioned into k approximately equally sized segments. Data is commonly stratified prior to being split which means rearranging the data to ensure each fold is a good representative of the whole. Table 3 shows a comparison between 8-fold 6-fold and 4-fold cross validation. Table 1. MAPE and nRMS for different models with and without clustering technique for tidal current speed magnitude forecasting Different techniques for direction forecasting ANN+WNN ANN+FSLSM WNN+ANN WNN+FSLSM FSLSM+ANN FSLSM+WNN
Models without clustering MAPE nRMSE 1.55791 0.0406 1.4945 0.0421 1.2522 0.0176 1.686 0.0361 1.5212 0.0298 1.7069 0.0421
21
Models with clustering MAPE nRMSE 1.1578 0.0202 1.1945 0.0292 1.0322 0.0091 1.306 0.0212 1.2212 0.0102 1.4068 0.0299
5000
Journal Pre-proof
Table 2. MAPE and nRMS for different models with and without clustering technique for tidal currents direction forecasting Different techniques for direction forecasting
Models without clustering MAPE 1.6793 1.5452 1.322 1.605 1.705 1.823
ANN+WNN ANN+FSLSM WNN+ANN WNN+FSLSM FSLSM+ANN FSLSM+WNN
nRMSE 0.0505 0.0501 0.0215 0.0320 0.0467 0.0542
Models with clustering MAPE nRMSE 1.191 0.0321 1.245 0.0395 1.0413 0.0102 1.321 0.0205 1.306 0.0322 1.376 0.0375
Table 3: Comparison between 8-fold 6-fold and 4-fold Cross Validation WNN+ANN techniques for 8-fold 6-fold and 4fold 4-fold 6-fold 8-fold
Models with clustering MAPE nRMSE 1.191 1.160 1.0413
0.0221 0.0215 0.0102
Conclusions Harmonic tidal currents constituents forecasting is considered as one of the highest important topics to be discussed these days and in the near future. Many models are proposed in the literature but most of them were focusing on either using one forecasting technique or two as a hybrid model. In this paper we use clustering techniques to modify the shape of the wave that will be forecasted to improve the accuracy of the forecasted model. As well as six different hybrid models are proposed based on different techniques. Due to fluctuation and nonlinearity of the forecasting data and the crucial requirements of new generation of renewable energy; forecasting of the renewable energy generated became very important key for load management and system coordination. This paper is not recommended to use only one technique for forecasting, but it is better to use hybrid
22
Journal Pre-proof
techniques to take the advantages of each one. Also, it is recommended to use clustering techniques to have common segments for improving the system variance and as a result improving the overall system performance and accuracy. In this paper six different hybrid models based on clustering techniques are proposed. The procedure of clustering along with the hybrid models are repeatedly adjusts the network, so as to minimize the measure of the difference between the actual and the forecasted data. All proposed models are accurate compared to the previous conventional models without using clustering techniques, but the best model is the hybrid model of WNN and ANN with MAPE of 1.0322 compared to 1.2522 without clustering. The work is validated by applying the proposed models on two different datasets; one for tidal current speed magnitude and the other one is for tidal currents direction. Also, we use K-fold cross validation. From table 3 we conclude that the best size is 8 as it has the smallest (MAPE and nRMSE) error compared to 6-fold and 4fold cross validation. The simulation results prove the effectiveness of the proposed hybrid models based on clustering techniques compared to conventional models used without clustering techniques. The system accuracy is improved when the clustering techniques with hybrid models are used as the advantages of all techniques are added together in one model.
References [1] Hamed Aly, βForecasting, Modeling and Control of Tidal Currents Electrical Energy Systemsβ Ph.D dissertation, Dept. Elect. Eng., Dalhousie Univ., Halifax, N.S., 2012. [2] Hamed H. H. Aly, M. E. El-Hawary, βA Proposed ANN and FLSM Hybrid Model for Tidal Current Magnitude and Direction Forecastingβ IEEE Journal of Oceanic Engineering, Volume: 39, Issue: 1, Jan. 2014. [3] Hamed H. H. Aly, M. E. El-Hawary, A Proposed Algorithms for Tidal in-Stream Speed Model, American Journal of Energy Engineering. Vol. 1, No. 1, 2013, pp. 1-10. 23
Journal Pre-proof
[4] Hamed H. H. Aly, M. E. El-Hawary, The Current Status of Wind and Tidal in-Stream Electric Energy Resources, American Journal of Electrical Power and Energy Systems. Vol. 2, No. 2, 2013, pp. 23-40. [5] Hamed H. H. Aly, and M. E. El-Hawary "State of the Art for Tidal Currents Electrical Energy Resources", 24th Annual Canadian IEEE Conference on Electrical and Computer En-gineering, Niagara Falls, Ontario, Canada, 2011. [6] G. H. Darwin, βOn an apparatus for facilitating the reduction of tidal observations,β Proc. Roy. Soc. A, vol. 52, pp. 345β376, 1892. [7] A. T. Doodson, βThe harmonic development of the tide-generating potential,β Proc. Roy. Soc. Lond., vol. 100, pp. 305β329, 1923. [8] A. T. Doodson, βThe analysis and predictions of tides in shallow water,β Int. Hydrographic Rev., vol. 33, pp. 85β126, 1958. [9] V. John, βHarmonic tidal current constituents of the Western Arabian Gulf from moored current measurements,β Coastal Eng., vol. 17, pp. 145β151, 1992. [10] T. L. Lee and D. S. Jeng, βApplication of artificial neural networks in tide forecasting,β Ocean Eng., vol. 29, no. 9, pp. 1003β1022, Aug. 2002. [11] T.-L. Lee, βBack-propagation neural network for long-term tidal predictions,β Ocean Eng., vol. 31, no. 2, pp. 225β238, Feb. 2004. [12] R. Vijay and R. Govil, βTidal data analysis using ANN,β Journal of World Academic Science Engineering Technology, no. 24, pp. 872β875, 2006. [13] B.-F. Chen, H.-D. Wang, and C.-C. Chu, βWavelet and artificial neural network analyses of tide forecasting and supplement of tides around Taiwan and South China Sea,β Ocean Eng., vol. 34, no. 16, pp. 2161β2175, Nov. 2007.
24
Journal Pre-proof
[14] J. F. Adamowski, βRiver flow forecasting using wavelet and cross wavelet transform models,β J. Hydrol. Processes, vol. 22, no. 25, pp. 4877β4891, 2008. [15] Abdollah Kavousi-Fard βModeling Uncertainty in Tidal Current Forecast Using Prediction Interval-Based SVRβ IEEE Transaction on Sustainable Energy, vol. 8, no. 2, April 2017. [16] Abdollah Kavousi-Fard and Wencong Su βA Combined Prognostic Model Based on Machine Learning for Tidal Current Predictionβ IEEE Transactions on Geoscience and Remote Sensing, vol. 55, no. 6, June 2017. [17] Y. Jie, Z. Mingzhan, and W. Qinglin. "Correlative analysis of measured data between anemometer tower and WTG," 8th International Conference on Computing and Networking Technology (ICCNT), Gueongju, 2012. [18] T. Senjyu, H. Takara, K. Uezato, and T. Funabashi, βOne-Hour-Ahead load forecasting using neural network,β IEEE Trans. Power Syst., vol. 17, pp. 113β118, February 2002. [19] Tri Kurniawan Wijaya, Matteo Vasirani, Samuel Humeau, Karl Aberer, βCluster-based Aggregate Forecasting for Residential Electricity Demand using Smart Meter Dataβ, IEEE International Conference on Big Data, Santa Clara, CA, USA, 2015. [20] J. B. MacQueen "Some Methods for classification and Analysis of Multivariate Observations, Proceedings of 5-th Berkeley Symposium on Mathematical Statistics and Probability", Berkeley, University of California Press, 1976. [21] Franklin L. Quilumba, Wei-Jen Lee, Heng Huang, David Y. Wang, Robert L. Szabados βUsing Smart Meter Data to Improve the Accuracy of Intraday Load Forecasting Considering Customer Behavior Similaritiesβ, IEEE Transactions on Smart grid, vol. 6, no. 2, March 2015.
Biography
25
Journal Pre-proof
Hamed H. Aly received the B.Eng. and M.ASc. degrees in electrical engineering with distinction, in 1999 and 2005, respectively, from Zagazig University, Egypt and the Ph.D. degree from Dalhousie University, Canada, in 2013. He was working as Postdoctoral Research Associate for one year and as a lecturer for four years at Dalhousie University. He worked as a lecturer at Zagazig University for eight years. His research interests include: Power System Dynamics and Stability; Power Electronics; Electric Machines, Power Quality Issues; Applications of Artificial Intelligence in Power Systems; Smart Grid; and the Integration of Renewable Energy into the Electrical Grid. He taught in different universities in Canada. He pioneered many computational and artificial intelligence solutions to problems in economic/environmental operation of power systems. He has written one textbook, three chapters books and more than 50 refereed journal and conference articles. He has consulted and taught for more than 16 years. He is a senior member of the Institute of Electrical and Electronics Engineers (IEEE). Dr. Aly is a faculty member at Acadia University since 2013. Dr. Aly is currently an Assistant Professor at Faculty of Engineering, Zagazig University and an adjunct Assistant Professor at Dalhousie University.
26
Journal Pre-proof
Highlights β’Six different hybrid models are proposed for harmonic tidal currents forecasting. β’The hybrid approaches are combinations of Wavelet, ANN and Least Square Method. β’The most accurate hybrid model is the model of WNN and ANN with MAPE of 1.0322. β’Two different datasets used (speed magnitude and direction) for work validation. β’The proposed models are tested using data collected from the Bay of Fundy in 2008.