Accepted Manuscript Intelligent Techniques for Forecasting Electricity Consumption of Buildings
K.P. Amber, R. Ahmad, M.W. Aslam, A. Kousar, M. Usman, M.S. Khan PII:
S0360-5442(18)30999-X
DOI:
10.1016/j.energy.2018.05.155
Reference:
EGY 12990
To appear in:
Energy
Received Date:
27 December 2017
Accepted Date:
24 May 2018
Please cite this article as: K.P. Amber, R. Ahmad, M.W. Aslam, A. Kousar, M. Usman, M.S. Khan, Intelligent Techniques for Forecasting Electricity Consumption of Buildings, Energy (2018), doi: 10.1016/j.energy.2018.05.155
This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
ACCEPTED MANUSCRIPT
1
Intelligent Techniques for Forecasting Electricity Consumption of Buildings
2
K. P. Ambera,b*, R. Ahmadc, M. W. Aslamd, A. Kousare, M. Usmanc, M.S. Khana
3 4
aDepartment
5 6
bFaculty
of Mechanical Engineering, Mirpur University of Science and Technology (MUST), Mirpur10250 (AJK), Pakistan
of Engineering, Science and the Built Environment, London South Bank University, London SE1 0AA, UK
7 8
cSchool
9 10
dDepartment
of Computer Systems Engineering, Mirpur University of Science and Technology (MUST), Mirpur-10250 (AJK), Pakistan
11 12
eDepartment
of Electrical (Power) Engineering, Mirpur University of Science and Technology (MUST), Mirpur-10250 (AJK), Pakistan
13 14
of Electrical Engineering and Computer Science, National University of Sciences and Technology, Islamabad, Pakistan
[email protected]*,
[email protected],
[email protected],
[email protected],
[email protected],
[email protected]
15
Abstract
16
The increasing trend in building sector’s energy demand calls for reliable and robust energy
17
consumption forecasting models. This study aims to compare prediction capabilities of five
18
different intelligent system techniques by forecasting electricity consumption of an administration
19
building located in London, United Kingdom. These five techniques are; Multiple Regression
20
(MR), Genetic Programming (GP), Artificial Neural Network (ANN), Deep Neural Network
21
(DNN) and Support Vector Machine (SVM). The prediction models are developed based on five
22
years of observed data of five different parameters such as solar radiation, temperature, wind
23
speed, humidity and weekday index. Weekday index is an important parameter introduced to
24
differentiate between working and non-working days. First four years data is used for training the
25
models and to obtain prediction data for fifth year. Finally, the predicted electricity consumption
26
of all models is compared with actual consumption of fifth year. Results demonstrate that ANN
27
performs better than all other four techniques with a Mean Absolute Percentage Error (MAPE) of
1
ACCEPTED MANUSCRIPT
28
6% whereas MR, GP, SVM and DNN have MAPE of 8.5%, 8.7%, 9% and 11%, respectively. The
29
applicability of this study could span to other building categories and will help energy management
30
teams to forecast energy consumption of various buildings.
31
Keywords: Electricity forecasting, ANN, DNN, GP, MR, SVM
32
1. Introduction
33
Energy demand and consumption is running in parallel to increasing population across the globe [1].
34
However, the building sector is the largest energy-consuming sector in the world and accounts for over one-
35
third of final energy consumption globally and nearly 40% of the total primary energy consumption in
36
European Union (EU) and United States of America (USA) [2]. With increasing population, building stocks
37
and fast-paced economic growth, energy consumption in this sector is projected to increase by an average
38
of 1.5% per year from 2012 to 2040 worldwide [3]. This increasing trend in buildings energy demand calls
39
for reliable and robust energy consumption forecasting that should help in effective planning, long term
40
strategies, efficient initiatives to reduce carbon emissions and controlling energy usage in the building
41
sector [4]. Further, a robust forecasting is indispensable for efficient utilization of energy in the buildings
42
and helps facility managers to investigate any unusual jumps or drops in their buildings energy
43
consumption. Another key benefit of forecasting is that it helps facility managers in preparing reliable
44
energy budget forecasts which is a vital component of successful planning [5]. In recent decades, the
45
building energy consumption forecasting has gained momentum with the emergence of new building energy
46
management systems.
47
The total Building Energy Consumption (BEC) of higher education institutions and commercial buildings
48
is 45% and 30% higher compared to residential buildings [6]. Hence, this field entails researchers to
49
concentrate and work for state of the art energy consumption prediction models/techniques which should
50
be more accurate, reliable and robust [7-8]. Energy usage prediction has remained a major focus of many
2
ACCEPTED MANUSCRIPT
51
researchers [9-11]. BEC forecasting involves three basic approaches; i.e. statistical, engineering and
52
artificial intelligence. Engineering approach investigates architectural and climatic behavior of buildings as
53
weather factors, building design, thermal characteristics of construction materials, occupants’ activities and
54
their heating, ventilation and air conditioning parameters based on laws of physics and thermodynamics.
55
Statistical approach uses historical data to predict BEC considering the most influential parameters.
56
Regression models, Auto regressive moving average, Conditional demand analysis and Gaussian mixture
57
models are some examples of statistical forecasting approach [12-15]. Artificial intelligence (AI) offers
58
different processes exhibiting behavior of phenomena being modelled using historical data like statistical
59
approach. AI has gained remarkable attention in the circle of analysts and forecasters efficiently reflecting
60
non-linear behavior in BEC. Among all AI models, neural networks have shown significant capability to
61
work on varying, non-linear functions between inputs and outputs. Artificial Neural Networks (ANN),
62
Support Vector Machines (SVM), Fuzzy Logic (FL), Genetic Algorithms (GA) and Genetic Programming
63
(GP) are some examples of AI largely used by researchers [16-18].
64
In the recent past energy consumption has increased at exponential rates following basic changes in industry
65
and economy. The forecasting of future energy needs has become an integral part of the power industry
66
with the increased generation to fulfil the energy requirements. Oliveira et al. [19] in 2018 predicted
67
electricity demand for the next year using monthly electricity consumption pattern from different countries.
68
Their approach was based on exponential smoothing methods and Bootstrap aggregating (bagging) ARIMA
69
method. In a similar study, Ding et al. [20] designed grey model to forecast electricity consumption in China
70
up to 2020. A number of other researchers have adopted various forecasting approaches for predicting
71
energy consumption of buildings. Of these forecasting techniques Genetic Programming (GP), Artificial
72
Neural Networks (ANN), Deep Neural Networks (DNN), Support Vector Machines (SVM) and Multiple
73
Regression (MR) have been widely used across the globe. For analyzing the accuracy of forecasting models,
74
different researchers have used different error metrics such as Relative Error (RE), Mean Absolute
75
Percentage Error (MAPE), Root Mean Square Error (RMSE), and Mean Square Error (MSE).
3
ACCEPTED MANUSCRIPT
76
GP has been widely used for estimating energy consumption in different regions. Kaboli et al. [18] used
77
long term electricity consumption data (1971 to 2011) for five Asian countries i.e. Philippines, Thailand,
78
Indonesia, Malaysia and Singapore. They developed forecasting models using three different forecasting
79
methods i.e. GP, ANN and SVM. Their results showed that GP outperformed ANN and SVM offering
80
lower MAPE values for all five countries. Kovacic and Sarler [21] used GP and MR methods to forecast
81
natural gas consumption in Slovenia for a Store Steel Company which consumes nearly 1.1% of the
82
country’s total gas consumption. Testing of the models revealed that GP performs better than MR. In a
83
similar study, Silva et al. [22], using GP, investigated various influential factors such as political, social and
84
economic, and proposed long term energy consumption forecasting model (up to 2050) for the industrial
85
sector in Brazil.
86
Biswas et al. [23] used Artificial Neural Networks (ANN) and predicted future energy utilization in
87
residential buildings using data from TxAIRE Research Houses in USA. The accuracy of the model was
88
analyzed using the coefficients of determination (R2). They found R2 values between 0.87 to 0.91. ANN
89
has also found significant applications in oil industry [24-25]. Zeng et al. [25] introduced three different
90
machine learning techniques to predict electricity usage on daily basis for oil driven pumps in China. Their
91
models were optimized using a step by step practice of trial and error process and comparison results from
92
ANN, SVM and MR were based on MAPE and RE. This comparison presented better performance for
93
ANN demonstrating an error of ±5%. Szoplik [26] used ANN to forecast diurnal and seasonal gas
94
consumption in Poland by considering various environmental and calendar parameters. This study
95
concluded that ANN could be the most viable option to predict the hourly and daily gas consumption. The
96
MAPE of this ANN model was observed 9%.
97
Support Vector Machines (SVM) offers diversified applications in forecasting of electricity load [27-28],
98
electricity consumption [29], heating demand [30] and hydropower consumption [31]. Photovoltaic
99
installations are also supported by SVM in predicting solar irradiance of a particular site. Jiang et al. [32]
100
applied SVM to forecast solar irradiance levels in China. Yang et al. [27] presented incremental model 4
ACCEPTED MANUSCRIPT
101
based on SVM to predict electricity load for China. Along with electricity consumption forecasting,
102
investigation of heating demand encompasses important place in the studies of forecasting. Izadyar et al.
103
[30] proposed a thermal model for district heating system in Iran based on SVM. The authors also compared
104
the acquired results with ANN and GP models based on RMSE, Pearson Coefficient (R) and coefficient of
105
determination (R2). They found that SVM performs better than ANN and GP in this particular study.
106
Deep Neural Networks (DNN) is considered as the most efficient method to forecast systems which
107
demonstrate nonlinear and dynamic behaviours. Fu [33] developed a DNN model to forecast the cooling
108
energy demand for the buildings sector of China and Hong Kong in 2018. Testing of real data with the
109
predicted data using the standard performance metrics of MAPE, RMSE, MAE, CV-RMSE and R2 proved
110
the validity of the model. Dedinec et al. [34] also used DNN to forecast short term (24 hour ahead and daily
111
peak) electricity load for Macedonia using historical data for the period 2008 to 2014. Suryanarayana et al.
112
[35] applied three regression models and a DNN model for thermal load forecasting of two district heating
113
schemes in Sweden namely Rottne and Karlshamn. It was found that despite being highly intensive, DNN
114
offers better results on the basis of MAPE in both district heating schemes (8.08% and 4.15%).
115
Multiple Regression (MR) is another forecasting method which is a simple statistical technique and has
116
been widely used for forecasting energy consumption. It further helps researchers in identifying the
117
significant and insignificant variables that have a direct linear relationship with the energy consumption.
118
Pino-Mejías et al. [36] developed MR and ANN models and forecasted the energy consumption of office
119
buildings in Chile. They found that MR performs better (R2 = 0.85) when the input variables are
120
transformed. With no transformation of input variables, ANN outperforms MR with a R2 value of 0.998.
121
Bianco et al. [37] used MR method to investigate the effect of different economic and demographic
122
variables, i.e. gross domestic product (GDP), gross domestic product per capita (GDP per capita), electricity
123
price and population on the electricity consumption of residential sector of Italy. They developed MR
124
models and found that electricity price is not a significant variable for the electricity sector of Italy but GDP
125
and GDP per capita have direct relationship with the electricity consumption. Their study concluded that 5
ACCEPTED MANUSCRIPT
126
Italy’s electricity consumption will increase at an average rate of 2% per year in next 20 years. Tratar and
127
Strmcnik [38] collected data of heat load from a company called Energetika Ljubljana for the period
128
September 2008 to February 2013 and develop MR models to predict daily, weekly and monthly heat load
129
in Ljubljana, Slovenia. They observed MAPE of 3.93%. 2.77% and 1.89% for daily, weekly and monthly
130
forecasting models respectively.
131
This comprehensive literature review confirms that GP, ANN, DNN, SVM and MR methods have been
132
widely applied by the researchers for forecasting energy consumption. There are many more similar studies
133
where these methods have been applied. Fig. 1 summarizes such forecasting studies performed in different
134
countries and helps in understanding which forecasting techniques were applied and which error matrix
135
was used to analyze the accuracy of these forecasting models.
136
This current research is inspired by the work done by Amber et al. [39] where they employed two different
137
forecasting techniques:.1) MR and 2) GP for forecasting the daily electricity consumption of an
138
administration building called “South Bank Technopark” located in London, United Kingdom. Fig.2 shows
139
the front view of this building. Details of this building are presented in Table 2 later in Section 3.
140
In this work, three new machine learning models, i.e. SVM, ANN and DNN are used for the energy
141
consumption prediction of the South Bank Technopark building and their results have been compared with
142
the results of MR and GP methods. These models are trained on data which contain different parameters
143
such as solar radiation, temperature, wind speed, humidity and weekday index. Weekday index is an
144
important parameter introduced to differentiate between WD and NWD. Its value is kept either 0 or 1,
145
where 0 indicates NWD and 1 indicates WD. The observed data is spread over a period of five years. First
146
four years data is used for training the prediction models and to obtain prediction data for fifth year. Finally
147
the predicted data is compared with real data of fifth year and error analysis is performed.
6
ACCEPTED MANUSCRIPT
148 149
Fig. 1 Summary of Buildings energy forecasting studies
150 151
Fig. 2 South Bank Technopark Building at LSBU
152
This study follows the following structure. Section 1 presents related work on energy consumption
153
forecasting for buildings using different techniques. A brief description of different modeling techniques 7
ACCEPTED MANUSCRIPT
154
and their architecture and different parameters used in this work are presented in Section 2. In Section 3,
155
results are compared for all the five models and error analysis has been performed in terms of statistical
156
parameters. Conclusion are drawn in Section 4.
157
2. Methods
158
Five energy forecasting models viz. ANN, DNN, SVM, MR and GP are selected for this study and their
159
specific working as done in this research is discussed.
160
2.1. Artificial Neural Network (ANN)
161
Artificial Neural Networks are inspired by a mathematical model of biological neural networks and
162
represent a model which mimics human brain working [11]. ANN was not in used in beginning due to
163
enormous computations. But due to availability of high performance computing they became popular. ANN
164
can solve complex non-linear input-output relationships that are difficult for other techniques. It is an
165
adaptive system that learns and changes its structure as the input is fed to it. The basic layout of ANN
166
consists of many neurons and these are linked together according to a specific network structure. These
167
neurons could be divided into several layers. There are different architectures of ANN depending on the
168
number of layers and the flow of information. It could be single layer, multi-layer, supervised and
169
unsupervised ANN. A simple structure of ANN is shown in Fig. 3.
170 171
Fig. 3 Simple structure of ANN
8
ACCEPTED MANUSCRIPT
172
There are two phases of ANN learning, training and testing. During training an input is fed into ANN’s
173
architecture (with many layers) and outputs of layers are computed using random weights and biases. Upon
174
every layer an activation function is applied and output of one layer is passed as input to next layer. For this
175
research we have applied sigmoid activation function which limits the output values between 0 and 1. In
176
this research ten hidden layers with 100 neurons have been used. The final output of ANN is fed back to
177
adjust the weights and biases of different layers to match the target output. In this research Gradient descent
178
of the loss is back propagated to adjust the weights. Mathematically if
179
we can write.
180
1
ℎ𝑖𝑥 =
‒
1+𝑒
(∑𝑤 𝑥 + 𝑏) 𝑗 𝑖 𝑖
ℎ𝑖𝑥 is output of ith hidden layer then
𝐸𝑞. (1)
181
Where 𝑤𝑖weight at ith is layer and is the input at ith layer with j variables. This feedback process is carried
182
out until the output is within a required range. A validation phase is also used in this research to overcome
183
the over fitting problem.
184
2.2. Deep Neural Network (DNN)
185
Deep learning is an effective machine learning tool which is imparting quality results almost in every field
186
of research by automatic selection and extraction of features without human intervention [50]. It is an
187
extension to ANN with deeper layers and large number of neurons. Prediction is done using vague to fine
188
feature extraction. Similar to ANN, recent research on DNN is possible due to availability of high
189
performance computing. After the development of tensor flow library by Google and making it available
190
for general public as an open source library put plethora of research in different fields [51]. Therefore, use
191
of DNN for energy forecasting is carried out to establish how well DNN can perform in this field. A two
192
layer Deep Model is shown in Fig. 4.
9
ACCEPTED MANUSCRIPT
193 194
Fig. 4 A two hidden layer Deep Neural Network
195
Dependent variables or inputs are fed into the DNN from left and each variable is multiplied by some
196
random weight and a bias is added to it at each layer. After this an activation function is applied to get some
197
output e.g. sigmoid, tanh, etc. The output of this layer is the input to the next layer. In this way, more layers
198
are introduced and computations performed using weights and biases. Sometime during layers we also
199
have to apply different functions like pooling to downsize our input size. Finally for classification or
200
regression we apply another function like softmax to map upon certain categories or values and linear
201
function for continuous values. This phase is known as feed forward phase. In this way, all inputs are
202
processed and results tweaked after certain batches of inputs and make more iteration for better results.
203
Finally, weights and biases which are used to predict unknown input by passing them in feed forward
204
network are determined. Mathematically the output at each layer will be; ℎ𝑖(𝑥) = 𝑚𝑎𝑥(𝑤1𝑥 + 𝑏,𝑤2𝑥 + ………….)
205
𝐸𝑞. (2)
206
Similarly for classification Problems we can use softmax function in last layer, which uses normalized score
207
and is given by
208
209
𝑓𝑧 =
𝑒
𝑧
∑ 𝑒𝑧
𝑎𝑛𝑑 𝑧 = 𝑤𝑥 + 𝑏
𝐸𝑞. (3)
2.3. Support Vector Machine (SVM)
10
ACCEPTED MANUSCRIPT
210
Support vector machines are well known supervised and unsupervised learning algorithms which are very
211
effective for both linearly separable and not separable data sets. When data is not linearly separable then a
212
transformation is applied from one dimension to another dimension so that now data is easily separable
213
using hyperplane. Support vector machines are used both for classification and regression problems. In
214
Support Vector regression the data set is separated by drawing a virtual line which separates each class or
215
cluster among data points of a cluster or same class at a maximum distance. These points are known as
216
support vectors. Then for any unknown point the previous learned weights are applied to the dependent
217
variables or the input to calculate its predicted position and this will predict the output class or mapped to
218
a continuous value in case of regression.
219
In SVM we can use different kernels according to dataset but in this study we used dot kernel which is
220
widely used kernel and implemented it using a library libSVM. SVM works in a manner that for a given
221
input space of x it finds out a function which only accepts the predictions which are at maximum distance
222
from
223
, the support vector [39]. In mathematical form, SVM is given by Eq. (4). 𝑓(𝑥) = (𝑊,𝑥) + 𝑏
𝐸𝑞. (4)
224
2.4. Genetic Programming (GP)
225
GP is an evolutionary computing algorithm which was first used in 1992 by Koza [52]. The GP computer
226
programs are called GP individuals. These GP individuals are in the form of mathematical formulas. Each
227
individual is a solution of the given problem. GP is preferred over other methods as it has some inherent
228
advantages; (1) Preprocessing of data not required, (2) Built in feature selection capability, (3) Follows
229
white box model (final solution is interpretable).
230
GP algorithm uses the following steps (1) A random population of GP individual is created, (2) Fitness of
231
each individual is calculated according to a predefined formula, (3) New generation is created from existing
232
generation using genetic operators, (4) Fitness of new generation is calculated, 5) Repeat steps 3 and 4 until
233
a perfect solution is achieved.
234
Important Parameters of GP
11
ACCEPTED MANUSCRIPT
235
a) Structure of GP individuals
236
A GP individual is in the form of a mathematical formula and the most common structure to represent this
237
formula is tree structure. A tree is composed of the root, intermediate nodes and the leaf node. The inputs
238
(terminals) are at leaf nodes, the intermediate nodes represent functions operating on terminals and the root
239
is the output of a GP individual.
240 241 242
b) Terminal Pool Terminals represent the inputs. The independent variables constitute the terminal pool. c)
Function Pool
243
Function pool consists of all the function that are applied on the inputs and these are intermediate nodes in
244
a tree structure. The choice of functions is purely dependent on the problem. The functions used in this
245
study are given in Table 1.
246
Table 1: Parameters used for GP
247 248
d)
Fitness Function
249
A fitness function tests the ability of any GP solution in solving the problem. The choice of fitness function
250
varies from problem to problem. In our study the fitness function used is the sum of absolute error. The
251
fitness function used is given below in Eq. (5). 12
ACCEPTED MANUSCRIPT
𝑓𝑖𝑡𝑛𝑒𝑠𝑠 = ∑|𝑔𝑜 ‒ 𝑡𝑜|
252
Eq. (5)
253
where 𝑔𝑜 is the output generated by GP individual and 𝑡𝑜 is the output of the target. As per the above fitness
254
function, a lower value of fitness means a better individual.
255
e)
Genetic Operators
256
Genetic operators operate are used to create a new generation using existing generation. Crossover and
257
mutation are two popular genetic operators and the same operators have been used in this study. Crossover
258
creates two children by swapping branches of two parents while mutation replaces the branch of one parent
259
with a randomly generated branch to create a child.
260
2.5. Multiple Regression (MR)
261
MR is a statistical technique widely used for researchers for energy consumption forecasting. It establishes
262
relationship among two or more variables. General form of a multiple regression model is shown in Eq. (6):
263
𝑍 = ∝ 0 + ∝ 1𝐴1 + ∝ 2𝐴2 + ……………… ∝ 𝑛𝐴𝑛 + ∈
Eq. (6)
264
Where Z is the dependent variable, ∝ i are the regression coefficients (i = 0, 1, 2….n), 𝐴𝑖 are the explanatory
265
variables (i =1, 2, 3 …n) and ∈ is the random error term which represents the effect of all omitted
266
independent variables on the dependent variable “Z” [4].
267
The fitness of a MR model could be judged by its 𝑅 value which is between 0 and 1. 𝑅 is called the
268
Coefficient of determination. Higher value of R2 reflects a strong association among dependent and
269
independent variables and vice versa. Significance of different independent variables is judged through p-
270
value or t-stat value. A variable having t-stat values greater than +1.96 or ∝ - value > 0.05 are
271
considered to be significant and vice versa.
2
272
13
2
ACCEPTED MANUSCRIPT
273
3. Result Analysis
274
3.1. Data Collection and Analysis:
275
The building data is composed of the following parameters measured/ observed for the period Jan. 2007-
276
Dec. 2011: a) Daily electricity usage (Wh/m2), b) Daily mean surrounding temperature (K), c) Daily mean
277
global irradiance (W/m2), d) Daily mean humidity (%), e) Daily mean wind velocity (m/s) and f) Weekday
278
Index (0/1).
279
For data analysis phase, building description, electricity consumption data, weather data and building
280
occupancy data were considered. All the relevant information such as area of the building, hours of
281
operation, year of construction etc. were collected from the office of energy manager through on site
282
interview with the London South Bank University’s (LSBU’s) energy manager. Table 2 presents building
283
related information obtained from energy manager having major impact on the energy consumption of the
284
building.
285
Table 2: Building related information Description
Building
Building type
Offices
Gross Internal Area, m2
7,811
Year of Construction
2003
Building Orientation
South-West
Hours of Operations
8 am to 6 pm
Closing time
10 pm.
Cooling method
Natural ventilation and few dedicated split AC units
Heating equipment
Two gas fired boilers
Lighting type
CFL
Number of lifts
2
Number of LV supplies
2
Number of floors
3
286 14
ACCEPTED MANUSCRIPT
287
Table 3 shows the building electricity consumption statistics for non-working days (NWD) and working
288
days (WD). Electricity consumption of building drops during summer mainly due to low heating
289
requirements. In winter, electricity consumptions increases due to use of heating equipment such as pumps,
290
boilers, air handling units etc. The building electricity consumption patterns show that temperature plays
291
an important role in the daily electricity consumption of the building. The building is naturally ventilated,
292
therefore, heating is only required in winter. The building occupancy factor is somewhat constant during
293
both winter and summer periods.
294
Table 3: Electricity consumption (Wh/m2) statistics of WD and NWD Statistics
NWD
WD
Min
155
229
Max
311
479
Mean
222
374
Median
221
375
N
577
1249
295 296
For our study, realistic daily mean values of temperature, irradiance, wind velocity and humidity for the
297
Central London region were available from the Kings College Environment Research Group’s website [53].
298
Weekday index was selected to represent the building’s occupancy. It has a value of 1 and 0 for WD and
299
NWD respectively. For this variable, it was important to have an accurate data of WD and NWD for years
300
2007 to 2011. A list of public holidays was available for this period [54].
301
3.2. Performance Evaluation
302
All models are trained on same data set from year 2007 to 2010 while data of 2011 is used for testing. Both
303
the training and the testing data set is for the same building. Comparison of actual and predicted electricity
304
consumption by five models is presented in the form of box plots. Further comparison of actual and
305
predicted electricity consumption during working and non-working days is also presented and discussed.
306 15
ACCEPTED MANUSCRIPT
307
3.3. Comparison between real and predicted electricity consumption
308
Figure 5 shows a comparison between the spread of predicted energy consumption data by different
309
methods and the real data in the form of box plots.
310 311
Fig. 5 Comparison of outputs of different modelling techniques
312
It can be seen from the figure that none of the methods was able to correctly predict the minimum or
313
maximum value although prediction of some of the methods was closer to actual than others. Looking at
314
the figure it can be said that results of ANN and GP for median, second and third quartile are very close to
315
the actual values and these two methods seem to perform better than others in overall prediction.
316
Figures 6 and 7 show the comparison between the actual and predicted electricity consumption for the
317
working and non-working days of different months.
16
ACCEPTED MANUSCRIPT
318 319
Fig. 6 Comparison of actual and predicted electricity consumption on working days
320
It is apparent that predicted electricity consumption of working days of different months was almost similar
321
for all the methods except for GP which under predicted the consumption from January to March. The
322
prediction is most accurate from March to July while the prediction is worst for the month of December.
323
The poor performance for December is due to Christmas holidays where the electricity consumption
324
suddenly dropped and the predicted consumption is higher than actual. A similar trend is seen for the non-
325
working days as shown in Fig. 7 except for DNN which predicts slightly higher electricity consumption
326
during the summer months. Again, all five models predicted higher electricity consumption (average 12%)
327
for the month of December for the same reason.
17
ACCEPTED MANUSCRIPT
328 329
Fig. 7 Comparison of actual and predicted electricity consumption on non-working days
330 331
Fig. 8 Comparison of actual and predicted monthly electricity consumption
332
For comparing the monthly electricity consumption figures, daily electricity consumption and predicted
333
consumption were converted into monthly electricity consumption figures (kWh). Figure 8 shows a 18
ACCEPTED MANUSCRIPT
334
comparison between the monthly actual and predicted electricity consumption data by five models. Again
335
it is observed that all the model over predict December’s electricity consumption. Overall, it could be seen
336
that all five models predict monthly electricity consumption of South Bank Technopark building with an
337
error less than 10%.
338
3.4. Error Analysis
339
The performance of five different forecasting models has been analyzed using Root mean Square Error
340
(RMSE), Mean Absolute Error (MAE), Mean Relative Error (MRE), Mean Absolute Percentage Error
341
(MAPE) and Normalized Root Mean Square Error (NRMSE) for all five techniques. The formulas for these
342
errors are given in Eq. 7 to 11.
R𝑀𝑆𝐸 =
∑𝑚
Eq. (7)
(𝑍𝑜𝑏𝑠𝑒𝑟𝑣𝑒𝑑, 𝑖 ‒ 𝑍𝑃𝑟𝑒𝑑𝑖𝑐𝑡𝑒𝑑, 𝑖)2
𝑖=1
𝑚
343 𝑚
1 |𝑍 ‒ 𝑍𝑝𝑟𝑒𝑑𝑖𝑐𝑡𝑒𝑑,𝑖| 𝑀𝐴𝐸 = 𝑚𝑖 = 1 𝑜𝑏𝑠,𝑖
∑
𝑀𝑅𝐸 =
|
Eq. (9)
|
𝑍𝑜𝑏𝑠 ‒ 𝑍𝑝𝑟𝑒𝑑𝑖𝑐𝑡𝑒𝑑 𝑍𝑜𝑏𝑠
𝑚 𝑍 100 𝑜𝑏𝑠, 𝑖 ‒ 𝑍𝑝𝑟𝑒𝑑𝑖𝑐𝑡𝑒𝑑, 𝑖 𝑀𝐴𝑃𝐸 = 𝑚 𝑖=1 𝑍𝑖
∑
|
Eq. (8)
|
Eq. (10)
344 𝑁𝑅𝑀𝑆𝐸 = 345
𝑅𝑀𝑆𝐸
Eq. (11)
(𝑍𝑜𝑏𝑠, 𝑚𝑎𝑥 ‒ 𝑍𝑜𝑏𝑠, 𝑚𝑖𝑛)
346
Where 𝑍𝑜𝑏𝑠 is the actual value and 𝑍𝑝𝑟𝑒𝑑𝑖𝑐𝑡𝑒𝑑 is the forecasted value. “m” represents the sample size and
347
“i” has values from 1 to m.
19
ACCEPTED MANUSCRIPT
348
Table 4 presents the error analysis in terms of these errors.
349
It is apparent that ANN outperforms all other methods in terms of all the error measures. It achieves a
350
NRMSE of 10 % which is much better than all contemporary models. The performance of MR and GP is
351
same while the performance of SVM and DNN is also almost equal in terms of NRMSE. Similarly if we
352
analyze MAPE, it is obvious that ANN performs better compared to all other methods. In the same manner
353
MRE also tells the efficacy of ANN over all other classifiers.
354 355 356 357
Table 4: Error analysis of different prediction models Error Analysis RMSE MAE MRE MAPE NRMSE
W/m² W/m² % % %
MR
GP
34.50 34.31 21.47 24.37 7.5% 8.5% 8.5% 8.8% 12.7% 12.7%
ANN 26 17 6% 6% 10%
DNN
SVM
35 35.39 27 24.11 9.56% 8% 11.15% 9% 13.02% 13%
358
359
Figure 9 gives a comparison of MRE (%) of different methods. ANN outperforms all other methods with
360
6% error. The second-best performance is shown by MR followed by SVM, GP and DNN.
20
ACCEPTED MANUSCRIPT
361 362
The results of DNN are not as promising as expected. The reason behind this is limited amount of data used
363
in this study. The accuracy of DNN relies heavily on enormous amount of training data along with proper
364
analysis of hyper parameter selection. Although the data used in this study is limited, the results of DNN
365
are almost similar to statistical and data centric classifier. If we had more amount of training data, the
366
performance of DNN could be expected to be superior.
367
4. Conclusions
368
In this work, real historical building energy consumption data of five years is compared for different
369
intelligent forecasting techniques including MR, GP, ANN, DNN and SVM. The predicted consumption
370
figures for the weekdays, weekends and different months were compared with the actual consumption
371
figures. It was observed that all five models predicted electricity consumption of working days of different
372
months within a range of 3%. However, for the non-working days, DNN predicts slightly higher electricity
373
consumption during the summer months whereas all five models predicted higher electricity consumption
374
(average 12%) for the month of December. In terms of monthly comparison, in the first three months,
375
except GP model, all other four models predicted electricity consumption very well. Overall, all five models 21
ACCEPTED MANUSCRIPT
376
predicted monthly electricity consumption of South Bank Technopark building with a difference of less
377
than 10%.
378
In a nutshell, we conclude that ANN with the building data set under consideration performs better than all
379
other four methods. The results of ANN and GP for median, second and third quartile are very close to the
380
actual values and these two methods seem to perform better than others in overall prediction. In terms of
381
MRE, ANN outperforms all other methods with 6% error. This is followed by MR, SVM, GP and DNN
382
respectively. The performance of ANN is better than all other methods irrespective of performance
383
measures. ANN performs well with reasonable complexity compared to other exhaustive methods such as
384
DNN and SVM. Since DNN requires enormous amount of data for training along with proper analysis of
385
hyper parameter selection and the dataset in this research is rather limited, the results of DNN are not as
386
promising as expected but even with a limited data set DNN perform close to other methods. The
387
applicability of this study could span to other building categories and will help energy management teams
388
to forecast energy consumption of their buildings.
389
Acknowledgments
390
Authors would like to pay special gratitude to the Energy Manager, Mr. Anuj Saush at London South Bank
391
University for providing information and electricity data.
392
List of Acronyms
393
ANN
Artificial Neural Network
394
BEC
Buildings Energy Consumption
395
DNN
Deep Neural Network
396
GP
Genetic Programming
397
GIA
Gross Internal Area 22
ACCEPTED MANUSCRIPT
398
HE
Higher Education
399
kWh
Kilo Watt Hour
400
LSBU
London South Bank University
401
MAE
Mean Absolute Error
402
MAPE
Mean Absolute Percentage Error
403
MR
Multiple Regression
404
MRE
Mean Relative Error
405
MSE
Mean Square Error
406
NMBE
Normalized Mean Biased Error
407
NRMSE
Normalized Root Mean Square Error
408
NWD
Non-Working Day
409
RE
Regression Error
410
RMSE
Root Mean Square Error
411
RMSPE
Root Mean Square Percentage Error
412
SSE
Sum of Square Error
413
WD
Working Day
414 415
23
ACCEPTED MANUSCRIPT
416 417 418 419 420 421 422 423
References
424 425 426 427
[1] L. Wang, H. Hu, X.Y. Ai, H. Liu, “Effective electricity energy consumption forecasting using echo state network improved by differential evolution algorithm,” Energy, vol. 153, pp. 801-815, 2018. [2] X. Cao, X. Dai, and J. Liu, “Building energy-consumption status worldwide and the state-of-the-
428
art technologies for zero-energy buildings during the past decade.” Energy and Buildings, vol. 128,
429
pp.198-213, 2016.
430 431
[3] International
Energy
Outlook
2016
–
EIA,
Available
at:
https://www.eia.gov/outlooks/ieo/pdf/0484(2016).pdf
432
[4] R. Ghedamsi, N. Settou, A. Gouareh, A. Khamouli, N. Saifi, B. Recioui, and B. Dokkar, “Modeling
433
and forecasting energy consumption for residential buildings in Algeria using bottom-up
434
approach.” Energy and Buildings, vol. 121, pp.309-317, 2016.
435
[5] K. P. Amber, M.W. Aslam, A. Mahmood, A. Kousar, M.Y. Younis, B. Akbar, G.Q. Chaudhary,
436
and S.K. Hussain, “Energy Consumption Forecasting for University Sector Buildings.” Energies,
437
vol. 10(10), p.1579, 2017.
24
ACCEPTED MANUSCRIPT
438 439 440 441 442 443 444 445
[6] M. S. Gul and S. Patidar, “Understanding the energy consumption and occupancy of a multi-purpose academic building,” Energy Build., vol. 87, pp. 155–165, 2015. [7] U. Berardi, “A cross-country comparison of the building energy consumptions and their trends,” Resources, Conservation and Recycling, vol. 123, pp. 230–241, 2017. [8] C. De Cruz-lovera, A. P. Id, and D. Cruz-fern, “Worldwide Research on Energy Efficiency and Sustainability in Public Buildings,” 2017. [9] C. Li, Z. Ding, D. Zhao, J. Yi, G. Zhang, “Building energy consumption prediction: an extreme deep learning approach” Energies, vol. 10, p. 1525, 2017.
446
[10] B. Zeng, M. Zhou, J. Zhang, “Forecasting the energy consumption of China’s
447
manufacturing using a homologous grey prediction model” Sustainability, vol. 9, p. 1975,
448
2017.
449
[11] A. S. Ahmad, M. Y. Hassan, M. P. Abdullah, H. A. Rahman, F. Hussin, H. Abdullah, R.
450
Saidur, “A review on applications of ANN and SVM for building electrical energy
451
consumption forecasting.” Renewable and Sustainable Energy Reviews, vol. 33, pp. 102–
452
109, 2014.
453
[12] X. Lu, T. Lu, C. J.Kibert, M. Viljanen, “Modeling and forecasting energy consumption
454
for heterogeneous buildings using a physical-statistical approach.” Applied Energy, vol.
455
144, pp. 261–275, 2015.
456 457
[13] A. Lomet, F. Suard, D. Cheze, “Statistical modeling for real domestic hot water consumption forecasting.” Energy Procedia, vol. 70, pp. 379–387, 2015.
458
[14] Z. Ma, H. Li, Q. Sun, C. Wang, A. Yan, F. Starfelt, “Statistical analysis of energy
459
consumption patterns on the heat demand of buildings in district heating systems.” Energy
460
and Buildings, vol. 85, pp. 464–472, 2014.
25
ACCEPTED MANUSCRIPT
461 462
[15] N. Fumo, M. A. R. Biswas, “Regression analysis for prediction of residential energy consumption.” Renewable and Sustainable Energy Reviews, vol. 47, pp. 332–343, 2015.
463
[16] X. Wang, D.L. Xu, Z.Z. Sun, “Estimates of energy consumption in China using a self-
464
adaptive multi-verse optimizer-based support vector machine with rolling cross-
465
validation,” Energy, vol. 152, pp.539-548, 2018.
466
[17] Y.R. Zeng, Y. Zeng, B. Choi, L. Wang, “Multifactor-influenced energy consumption
467
forecasting using enhanced back-propagation neural network,” Energy, vol.127, pp. 381-
468
396, 2017.
469
[18] S.H.A. Kaboli, A. Fallahpour, J. Selvaraj, N.A.Rahim, “Long-term electrical energy
470
consumption formulating and forecasting via optimized gene expression programming,”
471
Energy, vol. 126, pp. 144-164, 2017.
472
[19] E.M. Oliveira, F.L.C. Oliveira, “Forecasting mid-long term electric energy consumption
473
through bagging ARIMA and exponential smoothing methods,” Energy, vol. 144, pp. 776-
474
788, 2018.]
475 476 477 478
[20] S. Ding, K.W. Hipel, Y.G. Dang, “Forecasting China's electricity consumption using a new grey prediction model,” Energy, vol. 149, pp. 314-328, 2018. [21] M. Kovacic, B. Sarler, “Genetic programming prediction of the natural gas consumption in a steel plant,” Energy, vol. 66, pp. 273-284, 2014.
479
[22] F.L.C. Silva, R.C. Souza, F.L.C. Oliveira, P.M. Lourenco, R.F. Calili, “A bottom-up
480
methodology for long term electricity consumption forecasting of an industrial sector -
481
Application to pulp and paper sector in Brazil” Energy, vol. 144, pp. 1107-1118, 2018.
482
[23] M.A.R. Biswas, M.D. Robinson, N. Fumo, “Prediction of residential building energy
483
consmption: A neural network approach,” Energy, vol. 117, pp. 84-92, 2016.
26
ACCEPTED MANUSCRIPT
484 485
[24] J. Wang, J. Wang, “Forecasting energy market indices with recurrent neural networks: Cast study of crude oil price fluctuations,” Energy, vol. 102, pp. 365-374, 2016]
486
[25] C. Zeng, C. Wu, L. Zuo, B. Zhang, X. Hu, “Predicting energy consumption of
487
multiproduct pipeline using artificial neural networks,” Energy, vol. 66, pp. 791-798, 2014.
488
[26] J. Szoplik, “Forecasting of natural gas consumption with artificial neural networks,”
489 490 491
Energy, vol. 85, pp. 208-220, 2015. [27] Y. Yang, J. Che, Y.Y. Li, Y.Y. Zhao, S. Zhu, “An incremental electric load forecasting model based on support vector regression,” Energy, vol. 113, pp. 796-808, 2016.
492
[28] M. Barman, N.B.D. Choudhary, S. Sutradhar, “A regional hybrid GOA-SVM model based
493
on similar day approach for short-term load forecasting in Assam, India,” Energy, vol. 145,
494
pp. 710-720, 2018.
495 496
[29] G. Cao, L. Wu, “Support vector regression with fruit fly optimization algorithm for seasonal electricity consumption forecasting,” Energy, vol. 115, pp. 734-745, 2016
497
[30] N. Izadyar, H. Ghadamian, H.C. Ong, Z. Moghadam, C.W. Tong, S. Shamshirband,
498
“Appraisal of the support vector machine to forecast residential heating demand for the
499
district heating system based on monthly overall natural gas consumption,” Energy, vol.
500
93, pp. 1558-1567, 2015.
501
[31] S. Wang, L. Yu, L. Tang, S. Wang, “A novel seasonal decomposition based least squares
502
support vector regression ensemble learning approach for hydropower consumption
503
forecasting in China,” Energy, vol. 36, pp. 6542-6554, 2011.
504
[32] H. Jiang, Y. Dong, “Global horizontal radiation forecast using forward regression on a
505
quadratic kernel support vector machine: Case study of the Tibet autonomous region in
506
China,” Energy, vol. 133, pp. 270-283, 2017.
27
ACCEPTED MANUSCRIPT
507 508
[33] G. Fu, “Deep belief network based ensemble approach for cooling load forecasting of airconditioning system,” Energy, 148, pp. 269-282, 2018.
509
[34] A. Dedinec, S. Filiposka, A. Dedinec, L. Kocarev, “Deep belief network based electricity
510
load forecasting: An analysis of Macedonian case,” Energy, vol. 115, pp. 1688-1700, 2016.
511
[35] G. Suryanarayana, J. Lago, D. Geysen, P. Aleksiejuk, C. Johansson, “Thermal load
512
forecasting in district heating networks using deep learning and advanced feature selection
513
methods,” Energy, DOI: 10.1016/j.energy.2018.05.111, 2018.
514
[36] R. Pino-Mejías, A. Pérez-Fargallo, C. Rubio-Bellido, J.A. Pulido-Arcas, “Comparison of
515
linear regression and artificial neural networks models to predict heating and cooling
516
energy demand, energy consumption and CO2 emissions,” Energy, vol. 118, pp.24-36,
517
2017.
518 519 520 521
[37] V. Bianco, O. Manca, S. Nardini, “Electricity consumption forecasting in Italy using linear regression model,” Energy, vol. 34(9), pp.1413-1421, 2009. [38] L.F. Tratar, E. Strmcnik, “The comparison of Holt–Winters method and Multiple regression method: A case study,” Energy, vol. 109, pp.266-276, 2016.
522
[39] K. P. Amber, M. W. Aslam, and S. K. Hussain, “Electricity consumption forecasting
523
models for administration buildings of the UK higher education sector,” Energy and
524
Buildings, vol. 90, pp. 127–136, 2015.
525
[40] L. Bingchun, F. Chuanchuan, B. Arlene and Q. L. Yan, “Forecasting of Chinese Primary
526
Energy Consumption in 2021 with GRU Artificial Neural Network,” Energies, vol. 10, pp.
527
1453-1467, 2017.
28
ACCEPTED MANUSCRIPT
528
[41] C. L. Joseph, K. W. W. Kevin, L. Dalong and L. T. C, “Multiple regression models for
529
energy use in air-conditioned office buildings in different climates,” Energy Conversion
530
and Management, vol. 51, pp. 2692-2697, 2010.
531
[42] F. Yangyang, L. Zhengwei, Z. Hao and X. Peng, “Using Support Vector Machine to
532
Predict Next Day Electricity Load of Public Buildings with Sub-metering Devices,”
533
Procedia Engineering, vol. 121, pp. 1016-1022, 2015.
534
[43] O. Gamze, F. D. Omer and Z. Selim, “Forecasting Electricity Consumption with Neural
535
Networks and Support Vector Regression,” Procedia - Social and Behavioral Sciences, vol.
536
58, pp. 1576-1585, 2012.
537
[44] M. Çunkaş, U. Taşkiran, “Turkey's Electricity Consumption Forecasting Using Genetic
538
Programming,” Energy Sources, Part B: Economics, Planning, and Policy, vol. 6, no. 4,
539
pp. 406-416, 2011.
540
[45] K. Korhan, A. Ahmet and S. Y. Ahmet, “Long Term Energy Consumption Forecasting
541
Using Genetic Programming,” Mathematical and Computational Applications, vol. 13, no.
542
2, pp. 71-80, 2008.
543
[46] M. Braun, S. Beck, P. Walton, M. Mayfield. “Estimating the impact of climate change
544
and local operational procedures on the energy use in several supermarkets throughout
545
Great Britain,” Energy and Buildings, vol. 111, pp. 109-119, 2016.
546
[47] K. Alireza, A. R. Reza and M. Dominic, “ANNSTLF - Artificial Neural Network Short-
547
Term Load Forecaster - Generation Three,” IEEE Transactions on Power Systems, vol. 13,
548
no. 4, pp. 1413-1422, 1998.
29
ACCEPTED MANUSCRIPT
549
[48] M. Mohammad, M. Atefeh, S. A. Shideh, R. David and A. Somayeh, “Multi-linear
550
Regression Models to Predict the Annual Energy Consumption of an Office Building with
551
Different Shapes,” Procedia Engineering, vol. 118, pp. 622-629, 2015.
552
[49] K. Amarasinghe, D. L. Marino, M. Manic, “Deep Neural Networks for Energy Load
553
Forecasting,” in Proceedings of the IEEE 26th International Symposium on Industrial
554
Electronics (ISIE), Edinburgh, UK, 19–21 June 2017; pp. 1483–1488.
555 556
[50] Y. Le Cun, Y. Bengio, and G. Hinton, “Deep learning,” Nature, vol. 521, pp. 436–444, 2015.
557
[51] P. Goldsborough, “A Tour of Tensor Flow Proseminar Data Mining,” Arxiv, 2016.
558
[Online] Available at: https://arxiv.org/abs/1610.01178, (accessed on: 12 November 2017)
559
[52] J. R. Koza. Genetic Programming: On the programming of Computers by Means of
560 561
Natural Selection. MIT Press. 1992. [53] ERG, Environmental Research Group, KCL London Air, ERG, Environmental
562
Research Group, 2013, [Online]. Available at: http://www.londonair.org.uk/ (accessed: 2
563
February 2013).
564 565
[54] Project
Britain
Bank
Holidays.
[Online].
Available
http://projectbritain.com/bankholidays.html (accessed on: 18 January 2013), 2012
30
at:
ACCEPTED MANUSCRIPT 1. Manuscript Title : Intelligent Techniques for Forecasting Electricity Consumption of Buildings
2. Authors List :Khuram Pervez Amber, Rizwan Ahmad, Muhammad Waqar Aslam, Anila Kousar, Muhammad Usman, Muhammad Sajid Khan
3. Highlights :
i)
Forecasting of daily electricity consumption for administration buildings
ii)
Use of different intelligent forecasting techniques, i.e. MR, GP, ANN, DNN, SVM
iii)
ANN outperforms all other forecasting techniques