Accepted Manuscript Title: The diversity of residential electricity demand - A comparative analysis of metered and simulated data Author: Jos´e Luis Ram´ırez-Mendiola Philipp Grunewald ¨ Nick Eyre PII: DOI: Reference:
S0378-7788(16)31398-6 http://dx.doi.org/doi:10.1016/j.enbuild.2017.06.006 ENB 7669
To appear in:
ENB
Received date: Revised date: Accepted date:
2-11-2016 22-5-2017 10-6-2017
Please cite this article as: Jos´e Luis Ram´irez-Mendiola, Philipp Grddotunewald, Nick Eyre, The diversity of residential electricity demand - A comparative analysis of metered and simulated data, (2017), http://dx.doi.org/10.1016/j.enbuild.2017.06.006 This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
The diversity of residential electricity demand - A comparative analysis of metered and simulated data Jos´e Luis Ram´ırez-Mendiolaa,∗, Philipp Gr¨ unewalda , Nick Eyrea a Environmental
Change Institute, University of Oxford,UK
ip t
Abstract
A comparative study between simulated residential electricity demand data and metered data from the UK
cr
Household Electricity Survey is presented. For this study, a high resolution probabilistic model was used to test whether this increasingly widely used modelling approach provides an adequate representation of the statistical
us
characteristics the most comprehensive dataset of metered electricity demand available in the UK. Both the empirical and simulated electricity consumption data have been analysed on an aggregated level, paying special
an
attention to the mean daily load profiles, the distribution of households with respect to the total annual demands, and the distributions of the annual demands of particular appliances. A thorough comparison making use of both qualitative and quantitative methods was made between simulated datasets and it’s metered counterparts.
M
Significant discrepancies were found in the distribution of households with respect to both overall electricity consumption and consumption of individual appliances. Parametric estimates of the distributions of metered
ed
data were obtained, and the analytic expressions for both the density function and cumulative distribution are given. These can be incorporated into new and existent modelling frameworks, as well as used as tools for further analysis.
Ac ce
pt
Keywords: Domestic electricity use, Energy demand modelling, Electricity demand load profiles
∗ Corresponding
author: Email address:
[email protected] (Jos´ e Luis Ram´ırez-Mendiola )
Preprint submitted to Energy and Buildings
Page 1 of 21
1
1. Introduction
41
progresses, robust modelling tools become even more
42
relevant. The effectiveness of low-carbon measures
It is a well known fact that the residential sec-
43
such as the implementation of Demand-Side Man-
3
tor is a major contributor to overall electricity con-
44
agement (DSM) strategies relies heavily on the ex-
4
sumption in most countries. Residential electricity
45
tent to which we understand residential electricity
5
consumption in the UK accounts for over a third of
46
consumption. A better understanding of the electric-
6
the national total [5]. Moreover,it is also well known
47
ity consumption patterns, and the potential changes
7
that the domestic sector contributes significantly to
48
these may undergo, would allow us to devise the most
8
peak demand, especially during winter. Over the
49
suitable measures.
9
last decade, however, the potential of this sector to
50
As new, relevant data become available, opportu-
10
contribute towards reductions in energy consumption
51
nities for further improvement in our models arise.
11
and CO2 emissions has been increasingly recognised.
52
The lack of information about consumer behaviour
12
This, in turn, has sparked the interest in this research
53
and appliance usage during the development of mod-
13
area.
54
els such as the one used in this study (see Section 3)
us
cr
ip t
2
In particular, there is an increasing interest in un-
55
may lead to problems which could not have been re-
15
derstanding the consumption patterns of the resi-
56
vealed with the data available at the time. However,
16
dential sector, and how these might change in re-
57
17
sponse to changes in climate and energy prices, and
58
18
the implementation of new supply-demand balancing
59
19
strategies and other low-carbon measures.
60
sources of data currently available provide us with an opportunity to re-assess the models’ performance.
an
14
by applying these models in a different context to the one in which they were validated, these issues can
M
now be revealed and explored. Moreover, the new
To this end, several efforts have been made to
61
21
model electricity demand, seeking to quantify the
62
In particular, it is worth verifying that the simulated
22
energy requirements at different levels of analysis.
63
electricity consumption data produced by the mod-
23
Two fundamentally different approaches divide these
64
els aggregate such that the statistical characteristics
24
modelling efforts: top-down and bottom-up. This
65
observed in the metered data are well represented,
25
terminology refers to the hierarchical level of the
66
as this is key to a robust bottom-up model.
26
data inputs. Top-down models focus on describing
67
In the following section brief descriptions of the
27
the overall trends observed in historical records, with
68
evolution of residential energy demand modelling
28
little or no interest in individual end-uses. In con-
69
and the state of the art are given. The next two
29
trast, bottom-up models focus on representing the
70
sections concern a description of the key elements
30
contribution of each end-use towards the aggregate
71
of the analysis presented, namely the probabilistic
31
consumption and overall trend. A more detailed de-
72
demand model used and the sub-metered electricity
32
scription of these two approaches can be found in
73
consumption data. The methodology for the com-
33
[19].
74
parative analysis, which encompasses both qualita-
Ac ce
pt
ed
20
34
Electricity demand modelling has proved very im-
75
tive and quantitative methods, is discussed in Sec.
35
portant in both academia and industry. In particu-
76
5.1. This is followed by a discussion of the analy-
36
lar, bottom-up models’ ability to incorporate many
77
sis results, where the critical shortcomings identified
37
different factors into the modelling has been key to
78
are highlighted. Finally, a summary of concluding
38
identifying and understanding the essential elements
79
remarks based on the results previously described is
39
associated with the production of demand loads. As
80
given, as well as further details about potential di-
40
the transition towards low-carbon energy systems
81
rections of future work.
2
Page 2 of 21
82
2. Residential electricity demand modelling
122
ability to represent the diversity of energy demand
123
across time and populations.
In terms of demand for electricity, the variability
124
The model developed by Yao and Steemers [26]
84
across households is high. In the UK, for instance,
125
is a good example of a deterministic model. The
85
according to data from the Office of Gas and Electric-
126
simulation of demand loads is based on fixed, pre-
86
ity Markets (Ofgem), ‘Typical Domestic Consump-
127
determined household activity patterns characterised
87
tion Values’ for electricity range from 2000 to 7200
128
mainly by the periods of absence. Appliances are al-
88
kWh/year [13].
The results of the most detailed
129
located based on national ownership statistics. Their
89
study (to date) of electricity use in English house-
130
associated demand is calculated based on average
90
holds would appear to indicate that there is an even
131
ratings and frequency of use estimates. The model is
91
higher variability, with values ranging from 560 to
132
able to produce rough estimates of half-hourly daily
92
14580 kWh/year [16].
133
load profiles. Another example of a deterministic model is presented in [23].
cr
ip t
83
There are many factors associated with this vari-
134
94
ability. The variations in energy consumption associ-
135
Recognising the limitations of the deterministic
95
ated with differences in dwelling characteristics and
136
approach, several attempts to capture the variability
96
the impact of ambient temperature/climate condi-
137
observed in real households were made. An emerg-
97
tions have have been extensively studier. However,
138
98
the impacts of the differences in dwellings’ appliance
139
99
content, and the usage patterns of such appliances
140
have not been studied as much.
141
patterns. These, in turn, are used in the development of probabilistic models. Page et al. [15] devel-
an
ing approach, which continues to attract interest, is based on the use of data from Time-Use Surveys (TUS) for the extraction of characteristic activity
M
100
us
93
Accounting for the impact different combinations
142
102
of these factors have is critical in representing the
143
oped a general methodology for the use of Markov
103
range of consumptions currently observed. Conse-
144
chain modelling techniques for the simulation of ac-
104
quently, the bottom-up approach to demand mod-
145
tivity sequences. Several variations of this method-
105
elling has been favoured for this purpose. Method-
146
ology, supported by the use of country-specific TUS
106
ologies that follow this approach offer the advantage
147
data were implemented.
107
of modelling demand with greater level of detail. De-
148
monly cited examples are the models developed by
108
mand estimates can be calculated at the household
149
Wid´en and W¨ackelg˚ ard [24] and Richardson et al.
109
or even end-use level.
150
[18], which are based on Swedish and UK data, re-
Ac ce
pt
ed
101
Some of the most com-
110
A branch of bottom-up modelling is characterised
151
spectively. The reader is referred to [20] for a more
111
by a focus on user activity-based simulations of de-
152
comprehensive review of models based on this ap-
112
mand patterns. Models falling within this group can
153
proach.
113
be further divided into deterministic and probabilis-
154
A few attempts at improving this widely adopted
114
tic models. Deterministic models are based on as-
155
modelling approach have been made. These, how-
115
sumptions about seemingly direct causal links be-
156
ever, have focused on the refinement of the simu-
116
tween chosen drivers and expected outcomes.
In
157
lation of occupancy profiles. To this end, different
117
contrast, probabilistic models make use of stochastic
158
methods have been applied.
118
methods for the simulation of electricity consump-
159
One of these approaches has focused on improv-
119
tion patterns. This kind of models make predictions
160
ing the representation of the duration of the occu-
120
about different outcomes based on the stochasticity
161
pancy states through the use of semi-Markov models.
121
of the input data. These models have, therefore, the
162
In the model presented in [25] the transition prob-
3
Page 3 of 21
163
abilities are based on the French TUS, and Weibull
204
of country-specific TUS data and statistics on appli-
164
distributions are used to model the duration of the
205
ance ownership and average ratings. Despite these
165
occupancy states. In the model presented in [1] the
206
differences, the approach to converting activity pat-
166
transition probabilities are based on the Belgian TUS
207
terns into electricity demand loads has remained es-
167
and non-parametric estimates are used to model the
208
sentially the same. Therefore, although this model is
168
duration of the occupancy states.
209
based on UK-specific data, the results of the analy-
Another approach has focused on the identifica-
210
sis presented in this paper are relevant to any model
170
tion of characteristic activity patterns which exhibit
211
using a similar approach. In the following section,
171
statistically significant differences. In the model pre-
212
a more detailed description of the CREST model is
172
sented in [4] a Bayesian clustering technique is used
213
presented.
173
for the identification of household-level activity pro-
174
files. In the model presented in [1], a hierarchical
214
3. The CREST model
175
agglomerative clustering approach is used for iden-
176
tifying characteristic individual occupancy patterns.
177
In both models, the clustering approach is comple-
178
mented by the development of a Markov model for
179
the identified clusters.
216
demand was developed at CREST (Centre for Re-
217
newable Energy Systems Technology, Loughborough
218
University). This is a probabilistic model that pro-
181
model developed by Richardson et al. [18]. In addi-
182
tion to the extension of the model to include ther-
183
mal demand loads, the simulation of occupancy se-
vides an estimate of the electricity consumption of individual households based on the number of res-
M
220
an
The model presented in [10] is an extension of the
cr
In 2010 a new model for residential electricity
us
215
219 180
ip t
169
quences was refined. The original model was based
185
on two-state occupancy patterns simulations. The
186
refined version is based on the simulation of four-
187
state occupancy patterns. Both models, however,
188
make use of the first-order Markov chain technique
189
for the production of the occupancy sequences. The
190
electricity demand model was left largely unaltered.
191
The only modification being the removal of end-uses
192
associated with thermal demand loads. This original
193
model for electricity demand has found widespread
194
applications in academia and industry, for instance
195
[6, 11, 3, 2, 8].
idents, occupancy patterns and dwelling appliance
222
content. Based on the approach developed by Page
223
et al. [15] for the implementation of Markov chain
Ac ce
pt
ed
184
221
224
models based on time-use data, Richardson et al. de-
225
veloped a residential building occupancy model [17].
226
Later on, this occupancy model was combined with
227
the approach developed by Yao and Steemers [26]
228
for generating switch-on events of electric appliances
229
to create a new probabilistic model that has found
230
widespread application in the literature [18]. The
231
model is capable of generating electricity load pro-
232
files of individual households as derived from the sim-
233
ulated use of individual appliances.
234
3.1. Structure
196
The structure of the model developed by Richard-
235
The model is divided into two modules. The first
197
son et al. [18], also known as the CREST model,
236
one is responsible for the simulation of dwelling oc-
198
shows great similarity to that of other, more re-
237
cupancy patterns.
199
cent models.
There are, however, differences be-
238
for the conversion of the simulated activity patterns
200
tween some of the key elements of the structure.
239
into electricity demand loads. Occupancy profiles
201
The main difference concerns the approach to simu-
240
are simulated for each household, for each day of the
202
lating occupancy patterns, as discussed in the para-
241
simulated period. These profiles are based upon data
203
graphs above. A lesser difference concerns the use
242
from the UK 2000 Time-Use survey [9]. The data
4
The second one is responsible
Page 4 of 21
from this survey is representative of national time
284
average). For the simulations used in the original
244
budgets [12]. The version of the model used for the
285
validation analysis this value was set between 4100
245
simulations described in this paper features a two-
286
and 4300 kWh; this in order to represent dwellings
246
state occupancy model. Individual occupancy pro-
287
from a specific region in the UK. The calibration of
247
files are essentially binary sequences of states corre-
288
the model is meant to ensure that the overall average
248
sponding to periods of active and inactive occupancy.
289
annual consumption of the simulated set of house-
249
The active occupancy states also take account of the
290
holds is around a particular value, which is one of
250
number of household members currently active.
291
the input parameters for the model.
292
3.2. Validation
ip t
243
Each simulated household is allocated a particular
252
set of appliances. Appliances are taken from a pre-
253
defined set of 33 common appliances and randomly
293
For the purpose of validating the CREST model,
254
allocated based on national ownership statistics. The
294
household-level electricity consumption data was me-
255
simulated appliances are configured using statistics
295
tered from 22 households in the East Midlands [18].
256
that include their mean total annual electricity con-
296
The households’ electricity consumption was moni-
257
sumption, average power rating, and average cycle
297
tored for a full year. Thus, the resulting metered
258
length. This configuration is meant to ensure that
298
dataset contained around eight thousand 1 min res-
259
the appliances represent some general behaviour or
299
260
pattern rather than emulate some specific real-life
300
261
appliance (a specific make, or any other specific char-
301
262
acteristics). In this sense, CREST’s ‘appliances’ are
302
itored households. The simulations runs covered the
263
rather categories of sorts into which many real-life
303
equivalent of a full year period, and generated data
264
appliances are expected to fit.
304
an
us
cr
251
olution daily load profiles. The model was then calibrated against the collected data, and used for simu-
ed
M
lating the electricity consumption from the 22 mon-
with the same resolution as the metered data.
The activation of some of these ‘appliances’ dur-
305
The simulation output was compared against the
266
ing the simulations depends on the presence of active
306
original metered data. The validation analysis was
267
occupants in the dwelling. Therefore, the variation
307
based on comparisons of different aspects, including
268
in appliance usage is based upon the activity of the
308
the variation of annual demand between dwellings.
269
members of the household and their number. A de-
309
One of the aims was to formally assess the extent
270
mand load profile is generated for each allocated ap-
310
to which these datasets differ from one another in
271
pliance for each day of the simulated period. The
311
terms of this variation. To this end, a statistical
272
loads generated depend on the occupancy profiles
312
significance test, namely the Mann-Whitney U test,
273
previously generated. The total daily load profile is
313
was used [18]. The purpose of this test is to de-
274
the result of aggregating all of the appliances’ daily
314
termine how likely it is that the simulated dataset
275
demand loads.
315
corresponds to a sample drawn from the same pop-
Ac ce
pt
265
276
The variation in electricity consumption between
316
ulation as the reference sample (metered dataset).
277
different households is achieved through changes in
317
One of the advantages of this test is that it can be
278
the household composition, set of appliances, and oc-
318
applied even when the distributions of the data are
279
cupancy patterns of each simulated dwelling. More-
319
not known. The resulting measured and simulated
280
over, a calibration factor provides a means to align
320
annual consumptions are shown below in Fig 1(A).
281
the overall average household consumption of the
321
In summary, the comparison between metered and
282
simulated sample to the relevant average annual con-
322
simulated datasets appeared to show that there was
283
sumption (e.g. sample group, regional or national
323
no significant statistical difference between them,
5
Page 5 of 21
thus validating the model. The CREST model is
362
day of the monitoring period. For some households
325
a good example of the attempts to link activity pro-
363
total demand load, as read from the mains, was
326
files to electricity consumption via the simulation of
364
also recorded. The overall length of the monitoring
327
appliance usage. For this reason, we were particu-
365
period varies across households. 26 of them were
328
larly interested in looking at how well represented
366
monitored for a full year. The rest were monitored
329
is this link by the simulations when compared with
367
for “one-month-long” periods with effective lengths
330
a larger, highly disaggregated metered dataset. A
368
varying between 20 to 30 days, and covering different
331
more detailed description of the data used for this
369
parts of the trial period.
332
purpose is given below in Sec. 4.
370
The resolution of the metered electricity consump-
371
tion profiles also varies. For households monitored
372
for a full year, electricity consumption was metered
4. The UK Household Electricity Survey
cr
333
ip t
324
every 10 min. For the rest of the households, it
the result of a study jointly commissioned by Defra,
374
335
was metered every 2 min. The resulting dataset is
DECC and the Energy Saving Trust [7]. The study
375
336
a collection of highly detailed electricity consump-
had four main objectives:
376
337
tion profiles, which already has provided very im-
377
portant insights into the way electricity is used in
• To identify the range and quantity of electrical 378
appliances found in the typical home.
339
379
• To understand their patterns of use and their impact on peak electricity demand.
341
• To monitor total electricity consumption of the
343
homes as well as that of individual major appli-
344
ances in the household.
pt
• To collect user habit data when using the range
exhausted. Further analysis would allow us to have a better understanding of some of the issues associ-
382
ated with residential electricity consumption, such as
383
the scope for demand shifting, estimation of stand-by
384
consumption, the effects of changes in the size and
385
efficiency of appliances, and differences in the way
386
different socio-economic groups use electricity.
of appliances present in the households.
346
nities for exploiting such a rich source are far from
381
ed
342
345
380
the UK’s residential sector. However, the opportu-
M
340
an
338
us
The UK’s Household Electricity Survey (HES) was
373
334
387
The characteristics of this dataset make it ideal for
For this study 250 owner-occupier households were
388
testing current approaches to modelling of residen-
348
monitored over the period May 2010 to July 2011.
389
tial electricity use. In particular, we are interested
349
The final report on this study observes that the mon-
390
in testing whether a model such as CREST could
350
itored households were chosen such that the statisti-
391
provide us with simulated data that adequately rep-
351
cal characteristics of the sample would match closely
392
resents the statistical characteristics observed in HES
352
those of the typical socio-economic mix [27]. Only
393
data.
353
owner-occupiers were recruited for the study. The
354
households, however, were still fairly typical in terms
394
5. Comparison of CREST model’s simula-
355
of socio-demographic factors. The average annual
356
electricity consumption across the sample was 4093
357
kWh/year, which compares very well to the 4115
396
The information about the relationship between
358
kWh/year average (as of 2015) across all UK homes
397
consumer behaviour and appliance usage was more
359
[5].
398
limited at the time models such as CREST’s were
was
399
developed. The issues caused by such lack of in-
recorded for each of the 250 households for each
400
formation could not be revealed using the metered
360
361
Ac ce
347
Appliance-level
electricity
consumption
395
6
tions against HES data
Page 6 of 21
electricity consumption data available at the time.
440
final sample resulting from excluding 6+ member
402
However, the arrival of richer datasets allows us to
441
households consisted of a total of 243 households,
403
explore these issues now by applying these models in
442
of which 24 had annual records. We will refer to this
404
a different context to the one in which they were val-
443
dataset as HES243. The extent to which this filtering
405
idated. Now that more comprehensive datasets are
444
affects the analysis will be discussed in Sec. 7.
406
available, such as the one provided by HES, this pa-
445
In order to test the simulation outputs against
407
per re-assesses CREST model’s assumptions on ap-
446
the whole HES dataset, the comparative analysis
408
pliance usage and their corresponding impacts on
447
between simulated and metered data was divided
409
daily electricity load profiles and the variability of
448
into two parts. Firstly, we focused on the 24 HES
410
annual consumption across households.
449
households that were monitored for a full year. The
ip t
401
The interest in testing this model against the HES
450
corresponding dataset will be referred to as HES24
412
data resides in great measure in the fact that the
451
throughout the rest of the paper. The existence of
413
group of monitored households is much more diverse
452
this sub-set of households with annual records proves
414
than any previous group. Moreover, the appliance-
453
important, as it gives us the opportunity to make a
415
level electricity consumption profiles provide us with
454
closer comparison between the results of the analysis
416
a further point of comparison between simulated and
455
of this HES24 sub-set and the set of 22 households
417
metered data.
456
5.1. Methodology
458
us
used for the original model validation (See Sec. 3.2). We will refer to the latter dataset as CREST22. Secondly, we then extended the analysis to the
M
418
an
457
cr
411
We wanted to reproduce, to some extent, the orig-
459
rest of the HES sample so as to be able to assess
420
inal model validation. For the original validation a
460
the model’s ability to represent the statistical char-
421
relatively small sample consisting of 22 households
461
acteristics observed in a larger, more comprehensive
422
was used (see Sec. 3.2). Total daily demand for
462
metered dataset. However, the sub-set of 219 house-
423
electricity was monitored for a full year. These me-
463
holds for which only monthly records were present
424
tered data allowed for comparisons of total annual
464
had to be processed in a different way to the set of
425
demands and aggregated load profiles between me-
465
households with full year records.
426
tered and simulated households. In contrast, HES
466
Total household annual consumptions are not ex-
427
data offers the opportunity to compare total an-
467
plicitly included in the HES data. This informa-
428
nual consumption levels, individual appliances’ an-
468
tion had to be extracted from the records available
429
nual consumption and aggregated load profiles of a
469
for each household. Most households had monthly
430
group of households over 10 times larger.
470
records only, so these had to be used for producing
471
an estimate of the household’s total annual consump-
472
tion. For each household with monthly records only
431
Ac ce
pt
ed
419
5.1.1. Data processing
432
Preliminary analysis of HES data revealed that
473
the available data was used for calculating total daily
433
some of the households monitored during the study
474
consumptions for each day in the monitoring period.
434
are composed of 6+ members. Based on its present
475
Then, based on the distribution of total daily con-
435
configuration, the CREST model can only simulate
476
sumptions, the total daily consumptions for the miss-
436
households of up to 5 members. For this reason and
477
ing days needed to complete a full year were gener-
437
the sake of consistency, the original HES sample was
478
ated stochastically to add some variation. The gener-
438
“filtered” so as to be able to make a direct compar-
479
ated daily consumptions were then grouped into the
439
ison between metered and simulated samples. The
480
months corresponding to the same monitoring period
7
Page 7 of 21
as that of households with annual records, and the
522
tions compare to those of the corresponding metered
482
corresponding monthly seasonal factor was applied.
523
appliances. To this end, we also extracted the total
483
Finally, all daily loads were added up thus provid-
524
annual consumptions of individual appliances for the
484
ing an estimate of the total annual consumption of
525
households in the HES243 dataset.
485
that particular household. The case of the 24 house-
526
As observed in Section 3.1, the CREST model gen-
486
holds with annual records was more straightforward
527
erates artificial electricity consumption data based
487
as we only had to add up the consumptions recorded
528
on the simulation of appliance usage for a pre-defined
488
throughout the year. In addition to the total annual
529
set of appliances. Among the model’s pre-defined
489
consumption, a mean daily load profile for the whole
530
set, there are ‘appliances’ which are meant to rep-
490
HES sample was calculated based on all available
531
resent the consumption of the appliances listed in
491
data.
532
Table 1. This pre-defined set includes some other
cr
ip t
481
For the simulation of electricity consumption data
533
‘appliances’ which represent the electricity consump-
493
the CREST model was used. A full version of the
534
tion of some other devices with more predictable con-
494
CREST model is publicly available in the form of
535
sumption patterns (e.g. cold appliances). However,
495
an Excel spreadsheet model (See [18]). However, for
536
we focus the analysis on the appliances listed in Ta-
496
the purpose of simulating the electricity consump-
537
ble 1, as the electricity loads generated by the use
497
tion of the same number of households as in the HES
538
498
sample, the CREST model was re-implemented in
539
499
Python in order to allow for faster, multiple auto-
540
500
mated runs. Features, functionality and results of
541
501
Python and Excel versions of the model are equiva-
542
502
lent. We used this Python implementation for gen-
543
on the activities they are associated with: cooking,
503
erating a year’s worth of artificial electricity con-
544
washing, TV watching, and ICT related activities.
504
sumption data for 243 households. Each simulated
545
Both CREST and HES appliances are grouped in
505
household profile was matched to a metered counter-
546
these more general categories, as shown in Table 1.
506
part in the HES243 dataset. Since it is possible to
547
In most cases it is possible to make a one-to-one com-
507
specify both the day and the month when running
548
parison between HES and CREST appliances in spite
508
simulations, the simulated period was also matched
549
of the differences that may exist between them.
509
to the monitoring period of the corresponding me-
550
In order to obtain the distributions of the simu-
510
tered households. Total annual consumptions corre-
551
lated total annual consumption of individual appli-
511
sponding to each household were calculated, as well
552
ances, it was necessary to run a new set of simu-
512
as the mean daily load profile for the whole simu-
553
lations. Based on the original model configuration,
513
lated HES243 sample. For the first part of the analy-
554
appliances are allocated to the simulated households
514
sis, which focused on the HES24 dataset, simulations
555
on a random basis. Therefore, the original model
515
were run based on the specifications of this sub-set
556
implementation had to be modified so as to ensure
516
of households only.
557
that the appliances of interest were allocated to the
an
us
492
of these appliances are the ones with direct links to households’ activities. We compare the simulated
Ac ce
pt
ed
M
demand of these appliances with the data from their corresponding metered counterparts. The appliances of interest were grouped based
517
As the electricity consumption of individual appli-
558
relevant households. The goal was to match the ap-
518
ances is available from both metered and simulated
559
pliance configuration of the simulated households to
519
data, there was an interest in looking at how the es-
560
that of the metered counterparts.
520
timates of the annual consumptions of the simulated
561
HES records show which appliances are owned by
521
appliances are distributed and how these distribu-
562
which households. Based on this information, house-
8
Page 8 of 21
holds were grouped accordingly. Using the modified
586
is not simply a smoothing method. This method of-
564
model implementation, simulation runs correspond-
587
fers a way of obtaining an accurate non-parametric
565
ing to one year long periods were carried out.
588
estimate of the PDF of a given dataset. This esti-
589
mate is in fact the best possible estimate of the dis-
590
tribution of the original population from which the
591
sample was drawn. It can be shown that the kernel
592
density estimate converges to the real distribution as
593
the sample size increases, and it’s convergence rate
594
is the highest among all estimates [22].
hob, oven, microwave, kettle
Washing
Washing machine, Tumble dryer, Dish washer
Washing machine, washer dryer, Tumble dryer, Dish washer
TV1, TV2, TV3
TV1, TV2, TV3, CRT TV, LCD TV, plasma TV, Audio-visual site
PC
Desktop PC 1, Desktop PC 2, Computer site, Laptop 1, Laptop 2
TV watching ICT related
595
As observed above, some statistical significance
596
tests are based on the properties of ECDFs. This
597
leads naturally to the quantitative part of this com-
598
parative analysis, which focuses on the use of such
599
tests. For the quantitative comparison we will use
600
two tests, namely, the Kolmogorov-Smirnov test and
601
the Mann-Whitney U test.
602
566
603
5.1.2. Comparative analysis
Both tests are non-
parametric, which means they do not make assumptions about the distribution of the data. Both tests can be used to compare two unpaired datasets, both
M
604
cr
Cooking
cooker1, cooker2, hob1, hob2, oven1, oven2, hob+oven, microwave1, microwave2, kettle1, kettle2
us
Appliances contained CREST HES
Categories
an
Table 1: Categories of appliances and the appliances contained in each one, according to the appliances present in both metered and simulated datasets.
ip t
563
In order to assess whether the simulated data is
568
consistent with the metered one, both qualitative
569
and quantitative methods were used.
605
focus on comparing the statistical characteristics of
606
the distributions of the two datasets being compared,
607
and both are robust to the presence of outliers. How-
608
ever, the two tests work in a very different way.
ed
567
570
The qualitative comparison between datasets fo-
571
cused on the Empirical Cumulative Distribution
572
Function (ECDF) of the different datasets.
573
ECDF is a formal direct estimate of the Cumulative
574
Distribution Function1 from which simple statisti-
575
cal properties can be derived. Moreover, the ECDF
576
properties form the basis of various statistical signi-
577
ficance tests.
pt
Ac ce
578
The Probability Density Function (PDF) of the to-
579
tal annual consumptions was obtained as well, as it
580
provides a better graphical representation of some of
581
the statistical characteristics of the metered HES243
582
dataset. In order to obtain a reliable representation
583
of the PDFs, the Kernel Density Estimation (KDE)
584
method was used. Kernel Density estimation is often
585
used as a data smoothing technique. However, KDE 1 Cumulative
609
The Kolmogorov-Smirnov (KS) test compares the
610
cumulative distribution of the two data sets in ques-
611
tion, and calculates a p-value that depends on the
612
largest difference between the two distributions. The
613
test is sensitive to any differences in the two distri-
614
butions. Significant differences in shape, spread or
615
median values will result in a small p-value. The KS
616
test can also be used to test whether an empirical
617
distribution is consistent with an ideal distribution,
618
and therefore, it is commonly used as a test of good-
619
ness of fit, or for testing for normality. In essence,
620
this test answers the question: If the two samples
621
were randomly drawn from different parent popula-
622
tions, how likely it is that the two parent populations
623
have the same statistical distribution?
The
distribution function (CDF) - It is a function whose value is the probability that a corresponding continuous random variable has a value less than or equal to the argument of the function.
624
The Mann-Whitney (MW) test compares two
625
datasets by ranking the elements in the dataset from
626
smallest to largest, and calculating the average of
9
Page 9 of 21
the rank scores of the elements in the two datasets.
663
6. Analysis results
628
A p-value is calculated which depends on the differ-
629
ence between the average ranks of the two datasets.
664
6.1. Variations in annual demand levels
630
Compared to the KS test, the MW test is mostly
665
The high variability observed in annual demand
631
sensitive to changes in the median value. The MW
666
levels across the monitored households clearly re-
632
test is closely related to the t-test. However, the MW
667
flects the fact that different households mean differ-
633
test is more widely applicable than other more pop-
668
ent needs, lifestyles and consumption patterns. In
634
ular tests such as the t-test, as it does not require
669
this context, when we talk about variability, we re-
635
the assumption of normally distributed data, and it
670
fer to the natural variation that is observed in the
636
is much more robust to the presence of outliers. In
671
demand levels as a result of the differences in the
637
essence, the MW test answers the question: If both
672
underlying causes of said demand.
638
samples come from the same population, how likely
673
Fig. 1 shows both simulated and metered annual
639
it is that random sampling would result in the dif-
674
demands, obtained from the different datasets. Both
640
ferences observed in this particular case?
675
the variability of annual demand among households
676
and the differences between the metered and simu-
677
lated levels can be appreciated, as well as how they
As we are interested in determining whether the
642
simulated data generated by the model accurately
643
capture the statistical characteristics of the metered
644
data, we use these test to obtain a quantitative mea-
645
sure of how closely related the two datasets are. In
646
other words, we use these tests to assess whether
647
both datasets are consistent enough with statistical
648
samples drawn from the same population, or whether
649
the two datasets are consistent with samples drawn
650
from populations which follow the same statistical
651
distribution. The MW test provides us with a quan-
652
titative measure of how likely the former situation is,
653
whereas the KS test provides us with a measure of
654
how likely it is the latter.
679
cr
us
compare across datasets. This figure starts to show that the empirical data has greater diversity than the model simulations would suggest. The red horizontal
M
680
an
678 641
ip t
627
681
lines indicate the overall average annual consumption
682
of the corresponding metered dataset. Only one line per section is needed because, as observed in section
684
3, an adequate calibration of the CREST model en-
685
sures that the overall average annual consumption
686
of the simulated sample is around a specified value.
687
The corresponding values are listed in Table 2 which
688
shows, in numbers, how the variability of both simu-
689
lated and metered data compares across the different
690
datasets.
Ac ce
pt
ed
683
691
The mean annual electricity demands per dwelling
692
calculated from metered and simulated data for each
693
of the analysed datasets differ by around 1%. The
694
fact that this difference is small only shows that the
655
The MW test was also used in the original val-
695
model was appropriately calibrated. Despite this, it
656
idation of the CREST model (see Sec. 3.2). The
696
is observed that the differences in standard devia-
657
positive results of the test were one of the arguments
697
tions are considerably larger. For both HES24 and
658
in favour of the validation of the model. We take this
698
HES243 datasets the standard deviation of the me-
659
as an opportunity to show the effect that sample size
699
tered annual consumptions is about 40% higher than
660
may have when using this kind of test by applying
700
the standard deviation of the simulated ones.
661
the test to the datasets HES24 and HES243 sepa-
701
The results of the first part of the analysis, con-
662
rately and comparing the results.
702
cerning the HES24 dataset, already appeared to sug-
10
Page 10 of 21
gest that the variation in annual demand levels was
709
the standard deviation of the metered annual con-
704
being under-represented.
However, the simulated
710
sumptions was about 30% higher than the standard
705
HES24 dataset used for this study appears to be con-
711
deviation of the simulated ones. Despite this seem-
706
sistent with the simulated CREST22 dataset used in
712
ingly large difference, the result of applying a statis-
713
tical significance test would appear to indicate that
714
the datasets were not significantly different (See Sec.
715
3.2).
716
6.2. Empirical cumulative distributions and signifi-
707
2
the original validation analysis (See Table 2).
cance tests
717
ip t
703
The HES24 dataset is very similar in size to
719
the CREST22 dataset used for the original valida-
720
tion. As part of the original validation analysis, a
721
Mann-Whitney (MW) test with a 5% level of signi-
722
ficance was used to compare metered and simulated
723
CREST22 datasets. Based on the test results, it was
725
us
concluded that there was no significant difference between them (Sec. 3.2). In addition to verifying this, and in order to further test how well represented was
M
726
an
724
cr
718
727
the metered data by the simulations in the original
728
analysis, we extracted the annual consumption data from the original validation results and performed
730
a Kolmogorov-Smirnov (KS) test on these datasets.
731
The results of this test also supported the hypothesis
732
that the two datasets are consistent with each other.
733
The same tests were then applied to metered and
734
simulated HES24 datasets. With a p-value of 0.38,
735
the MW test with 5% significance level suggests that
736
it is likely that these two samples were drawn from
737
the same population. Moreover, with a p-value of
738
0.44, the KS test with 5% significance level suggests
739
that it is likely that these two samples come from
740
populations equally distributed. The results of this
741
part of the analysis seem to be well in agreement
742
with those of the original validation, thus confirming
743
that the model produces reasonable estimates when
744
dealing with small samples (∼ 20 households).
pt
ed
729
Ac ce
Figure 1: Annual electricity demands by household, ranked by magnitude: A) Annual demands of both metered and simulated households in CREST22 dataset (adapted from Ref. [18, Fig. 6]). B) Annual demands of both metered and simulated households in HES24 dataset. C) Annual demands of both metered and simulated households in HES243 dataset. Households were grouped in sets of five. The levels observed correspond to the average consumption of each set.
Table 2: Mean values and standard deviations of annual electricity consumptions datasets. CREST22
708
HES24
HES243
metered
simulated
metered
simulated
metered
simulated
Mean annual demand (kWh/y)
4172
4124
4624.6
4629.7
3944.5
3942.9
Standard Deviation (kWh/y)
1943
1372
2510
1545
2348
1405
Coefficient of variation
0.47
0.33
0.54
0.33
0.60
0.36
The analysis of CREST22 datasets showed that 2 CREST22
data was taken from [18])
745
The second part of the analysis, which focused
746
on the HES243 datasets, was aimed at determining
747
whether the model simulations produce a reasonable
748
representation of the statistical characteristics of a
11
Page 11 of 21
larger, more diverse set of households. Both MW
772
ciated in Fig. 2. The Kernel Density Estimate of
750
and KS test were used again to provide a quanti-
773
the distribution of metered annual consumptions re-
751
tative measure of the differences between metered
774
vealed the existence of a more complex underlying
752
and simulated datasets. On this occasion, however,
775
distribution. Based on this estimate, three clusters
753
the tests results revealed the existence of important
776
can be readily identified. What this shows is that the
754
differences in the statistical characteristics of both
777
distribution of households with respect to total an-
755
datasets.
With a p-value of 0.049, the MW test
778
nual consumption is concentrated in three clusters,
756
with 5% significance level now suggests that it is un-
779
centred around the values (modes): 1.58 MWh, 3.92
757
likely that the metered and simulated HES243 sam-
780
MWh and 7.85 MWh. Based on this information,
758
ples come from the same population. In a similar
781
and on the same principles behind the Kernel Den-
759
manner, with a p-value of 0.001, the KS test with
782
sity Estimation, we produced a parametric estimate
760
5% significance level suggests that it is very unlikely
783
for the empirical PDF observed in Fig. 2. This es-
761
that these two samples come from populations with
784
timate can be expressed by the following trimodal
762
the same statistical distribution. Moreover, by us-
785
Gaussian mixture:
763
ing the KS test to test for normality, we found that
764
the simulated HES243 dataset appears to be consis-
765
tent with normally distributed data (p-value of 0.81).
766
The metered HES243 sample is not. These results
767
are best appreciated graphically. As Fig. 2 shows,
768
the simulated HES243 dataset is well in agreement
786
where Ni (x | µi , σi ) =
769
with its corresponding normal fit (dashed line).
787
eral normal distribution with mean µi and standard
788
deviation σi , and Ci is a proportionality constant.
us
cr
ip t
749
Ac ce
pt
ed
M
an
f (x) ∼ C1 N1 (x | µ1 , σ1 ) + C2 N2 (x | µ2 , σ2 )
(1)
σi
1 √
e 2π
− 21
x−µi σi
2
is the gen-
789
A summary of the parameters of this estimate and
790
their corresponding values can be found in Table 3.
791
Fig. 3(A) shows how this trimodal Gaussian model
792
compares to the empirical PDF.
793
The ability to produce a parametric estimate for
794
the PDF means that it is also possible to find an ex-
795
pression (F (x)) for the CDF, as these two functions
796
797
798
Figure 2: Distribution of total annual consumptions: A) Probability Density Functions of metered and simulated HES243 datasets. B) Empirical Cumulative Distribution of metered and simulated data. The dotted lines around the ECDF of the metered HES243 dataset are the 95% confidence bounds.
+C3 N3 (x | µ3 , σ3 )
are related analytically by the equation: F (x) = Rx f (t)dt. Therefore, by integrating f (x) we ob−∞ tain that the CDF is given by:
3 X Ci x − µi √ F (x) = 1 + erf 2 σi 2 i=1
(2)
799
where erf(x) is the error function, and Ci , µi and σi
800
are the same parameters used in equation (1) (see
770
Probability density functions corresponding to
801
Table 3). The way this estimate compares to the
771
each set of annual consumptions can also be appre-
802
ECDF is shown in Fig. 3(B).
12
Page 12 of 21
tion falls within the interquartile range of the cluster
821
in question. Households in each cluster were further
822
grouped according to size. The percentage composi-
823
tion of each cluster is shown in Fig. 4(B).
us
cr
ip t
820
N2 (µ2 , σ2 ) 0.622 3.92 1.28
N3 (µ3 , σ3 ) 0.115 7.85 1.0
ed
Ci µi σi
N1 (µ1 , σ1 ) 0.261 1.575 0.78
Figure 4: Distribution of households according to size and annual consumption range: A) EPDFs corresponding to households grouped by size. The vertical line in each distribution indicates the mean value. B) Percentage composition of household sizes for each cluster.
M
Table 3: Trimodal Gaussian mixture model’s parameters.
an
Figure 3: Total annual consumptions distributions: A) Probability Density Function of metered HES243 dataset and trimodal Gaussian mixture fit. Each vertical bar at the bottom of the plot represents a data point. B) Empirical Cumulative Distributions of metered annual consumptions and trimodal Gaussian mixture fit. The dotted lines around the ECDF of the metered HES243 dataset are the 95% confidence bounds.
824
825
6.3. Individual appliances: Total annual consumptions
In order to gain a better insight into the relation-
826
We looked at the appliance content of the HES243
804
ship between household size and total annual con-
827
households and, based on the list of appliances of
805
sumption, we looked at the sub-groups of households
828
interest presented in Sec. 5.1, we grouped the house-
806
characterised by these features in two different ways.
829
holds that contained each one of those appliances.
807
Firstly, we grouped the households according to size,
830
We then calculated the ECDF of both simulated and
808
and used the KDE method to obtain an estimate
831
metered data for each one of those groups. Examples
809
of the empirical PDF corresponding to each group.
832
of the obtained distributions are shown in Fig. 5.
810
These distributions can be observed in Fig. 4(A).
833
The results of this analysis made evident that the
811
The number of households with 5 members is not
834
variability observed in the annual consumption of in-
812
large enough as to provide a reliable estimate of the
835
dividual appliances is completely misrepresented by
813
overall distribution of annual consumptions for this
836
the simulated data. In some cases (see Fig. 5(A)
814
sub-group. Therefore, this distribution was omitted.
837
and (D)) the average annual consumption of both
815
Secondly, based on the parametric estimate we ob-
838
datasets is well in agreement. In some others (Fig.
816
tained for the overall distribution, we grouped the
839
5(B) and (C)), the differences between the average
817
households according to the three observed clusters.
840
annual consumptions are evident as well.
818
The criteria for the allocation of households to the
819
different clusters was whether their annual consump-
Ac ce
pt
803
13
Page 13 of 21
ip t cr us an
Ac ce
pt
ed
M
Figure 5: Annual consumptions ECDFs of individual appliances: A) Main cooking appliances: cooker, or equivalently, hob+oven. Present in 70% of households. B) ECDFs of simulated and monitored kettles. Present in 94% of households. C) ECDFs of simulated and monitored washing machines. Present in 83% of households. D) ECDFs of simulated and monitored main TVs. Present in 98% of households.
14
Page 14 of 21
843
tions were skewed. Log-normal and Weibull models
844
were tested for possible fits, as these models are par-
845
ticularly good at representing skewed data.3 The
846
analysis revealed that the best fits were provided by
847
Weibull models. The summary of parameters of the
848
distributions fitted to the ECDFs of the annual con-
849
sumptions of individual appliances is presented in
850
Table 4. Table 4: Summary of parameters of the Weibull distributions fitted to the annual consumptions ECDFs of individual appliances.
Both the metered and simulated mean daily load
860
profiles have, generally speaking, similar shapes to
861
that of the typical profile. However, when the profiles
862
are compared in more detail, marked contrasts are
863
observed:
864
865
• There are significant differences between the values of the lowest and highest demands observed in the two profiles.
M
866
6.4. Mean daily load profile general features
pt
851
859
867
868
Mean daily load profiles were obtained from both
853
metered and simulated HES243 datasets. These can
854
be observed in Fig. 6, along with the typical UK pro-
855
file. This typical UK load profile corresponds to the
856
half hour load profile for the average Profile Class 1
857
customers4 [21], and is presented for comparison pur-
858
poses only.
Ac ce
852
3 A more empirical argument for the use of these models is rooted in the fact that they are generally applicable for modelling size/magnitude distributions (e.g. particle size, wind speed). 4 Profile Class 1 customers are domestic customers supplied on unrestricted tariffs, with a maximum demand below 100 kW as measured by metering systems containing only one meter register. There are eight profile classes, and for each profile class a sample group of electricity supply customers is randomly drawn from the population of electricity supply market customers. These samples are designed to provide an accurate estimate of the load pattern for each class of customers for use in electricity settlement. Consumption data is obtained by either installing half-hourly meters at the sites or getting half-hourly consumption data directly from suppliers.
• The simulated profile appears to overestimate the duration of the peak period.
869
• The variations observed in the CREST pro-
870
file, such as the transitions between periods of
871
low/high demand and periods of high/low de-
872
mand, correspond to more abrupt changes than
873
those observed in the other two profiles.
ed
Main cooking cooker oven microwave toaster kettle washing machine tumble dryer washer dryer dishwasher TV1 TV2 laptop desktop computer site
Weibull fit parameters Scale Shape 356.54 1.26 344.72 1.28 248.34 1.04 57.66 1.19 21.90 0.94 188.20 1.82 164.85 1.20 367.87 0.96 262.81 1.06 301.57 1.47 174.90 0.86 109.22 0.69 24.34 0.87 156.37 0.89 213.01 0.98
Figure 6: Mean daily load profiles corresponding to metered and simulated HES243 datasets. Typical UK profile corresponds to the load profile of the average class 1 (unrestricted domestic tariff) customer.
ip t
tributions of individual appliances’ annual consump-
cr
842
us
In general, it was observed that the empirical dis-
an
841
874
Further analysis of the characteristics and differ-
875
ences of these profiles may be desirable, and indeed,
876
necessary. In particular, it might be interesting to
877
pay a closer look at how the activity profiles and
878
the use of appliances are linked, and how well repre-
879
sented the duration of certain activities is. However,
880
a more detailed analysis of these profiles falls beyond
881
the scope of this paper. The prospects for a more de-
882
tailed study on the intra-day variability of demand
883
are left to be discussed in the next section.
884
7. Discussion and Future work
885
The primary aim of the analysis presented in
886
this paper was to determine whether a probabilistic
15
Page 15 of 21
model based on currently used approaches is capable
928
consumption across the whole 250 households HES
888
of providing a good representation of the variability
929
sample is just over 3% higher than the average across
889
observed in the electricity consumption levels of a
930
the HES243 dataset.
890
highly detailed dataset. The choice of the CREST
931
Great part of the analysis presented in this pa-
891
model for this study was due in great measure to
932
per focused on the distribution of households with
892
the fact that this model seeks to emulate the vari-
933
respect to their annual consumptions. The analy-
893
ability of household electricity use [18]. According
934
sis of the annual consumption data extracted from
894
to the way in which the model was constructed, the
935
the HES dataset revealed the existence of a complex
895
simulated appliances are configured such that their
936
trimodal distribution (see Sec. 6.2). Further analy-
896
annual consumptions result in figures typical of an
937
sis revealed that the underlying distribution appears
897
average household. Moreover, the overall average
938
to be very well estimated by a trimodal Gaussian
898
annual consumption of the simulated households was
939
mixture with analytical expression given by equa-
899
also expected to be in agreement with the national
940
tion (1).
900
figures. The model calibration involves the scaling of
941
the empirical distribution is consistent with a nor-
901
the simulated sample’s annual consumptions. This is
942
mal distribution. The resultant composite distribu-
902
meant to ensure that the simulated average annual
943
tion, however, does not exhibit normal-like features.
903
consumption matches the average annual consump-
944
904
tion of the real-life counterpart (see Sec. 3).
945
cr
ip t
887
an
us
Each of the three clusters identified in
A breakdown into sub-groups of households characterised by size revealed that: 1) Over 50% of the households concentrated around the first cluster are
The high level of detail offered by HES data justi-
946
906
fies its use for the analysis. The appliance-level me-
947
one-person households. The first cluster is also the
907
tered electricity consumption records make it ideal
948
largest one, which means that the majority of the
908
for a comparison against the simulated appliance
949
one-person households are found in this consump-
909
consumption data. Due to the technical limitations
950
tion range.
910
of the model discussed in Section 6, the original HES
951
centrated around the second cluster correspond to
911
dataset had to be filtered so as to allow for a con-
952
two-people households, making this sub-group the
912
sistent analysis. The filtering consisted in removing
953
predominant one. 3) Over 33% of the households
913
the households with more than five members from
954
concentrated around the third cluster are 4-people
914
the sample. However, we believe this filtering had
955
households. The second biggest group corresponds
915
little impact on the overall statistical characteristics
956
to two-people households (see Fig. 4). It appears to
916
of the dataset and the analysis results. The propor-
957
be a linear correlation between the household size of
917
tions of households in the HES sample correspond-
958
the predominant group and the mean value of the
918
ing to household sizes ranging from 1 to 4 are well
959
corresponding cluster. However, further analysis is
919
in agreement with the proportions observed at the
960
needed in order to determine whether this is a gen-
920
national level [14]. If we extrapolate this, by assum-
961
eral trend or just an interesting coincidence.
921
ing that the proportion of households with five and
962
The distributions of the simulated annual con-
922
6+ residents are equally well represented in the sam-
963
sumptions were found to be consistently normal. It
923
ple, this would mean that households with 6 or more
964
would appear that the current simplifying modelling
924
members account for just over 2% of households in
965
assumptions are causing simulation outputs consis-
925
the UK. A sense of the scale of the effect of removing
966
tent with normally distributed data. These assump-
926
said households can be given by comparing the over-
967
tions would appear to lead to reasonable first-order
927
all average annual consumptions; the average annual
968
estimates as misrepresentation issues appear less se-
Ac ce
pt
ed
M
905
16
2) Over 40% of the households con-
Page 16 of 21
969
vere when simulations are restricted to small sets of
1010
transform sampling. Given a cumulative distribu-
970
households. However, the differences between simu-
1011
tion function, this method allows for the generation
971
lated and metered data become evident when simu-
1012
of samples consistent with data drawn from said dis-
972
lating larger samples.
1013
tribution. A better idea of the statistical characteris-
Modelling assumptions concerning the conversion
1014
tics of the distribution of annual consumptions gives
974
of activity profiles into electricity consumption pat-
1015
us the opportunity to fine tune demand models. The
975
terns have remained essentially the same over the
1016
improvements derived from this would be reflected in
976
last decade. It is therefore necessary to reassess these
1017
simulated datasets that are more in agreement with
977
assumptions. Assumptions leading to normally dis-
1018
the kind of distribution and levels of variability ob-
978
tributed data will need to be modified or replaced
1019
served across real-life households.
979
entirely. If replacing these assumptions in their en-
1020
Total annual consumptions from individual appli-
980
tirety proves overly complicated, then the implemen-
1021
ances were used as a further point of comparison
981
tation of mixture models based on normal compo-
1022
between simulated and metered datasets. Based on
982
nents might be a good alternative.
1023
each dataset, distributions of total annual consump-
us
cr
ip t
973
In addition to the differences in the overall shape
1024
tions were obtained for the appliances listed in Ta-
984
of the distributions, it was observed that the variabil-
1025
ble 1. The choice of the appliances of interest was
985
ity in the levels of annual consumption is consistently
1026
986
misrepresented in the simulated data. Regardless of
1027
987
sample size, it was found that the level of variabil-
1028
988
ity observed in the metered data is consistent across
1029
tor is calculated for each appliance. According to the
989
independent datasets. Moreover, the variability ob-
1030
model documentation, this factor is adjusted so that
990
served in the metered data is about 20% higher than
1031
over a large number of runs the average annual con-
991
in the simulated data (see Table 2). The HES sample
1032
sumption of the appliance would match typical levels
992
is still arguably small, statistically speaking. How-
1033
[18]. In some cases it was found that the average an-
993
ever, we presume it provides a better insight into the
1034
nual consumption of the simulated appliances was
994
kind of distribution and variability that would be
1035
well in agreement with that of their metered coun-
995
observed in larger samples. Given the method used
1036
terparts (see Fig. 5(A) and (D)). In most cases, how-
996
for estimating the overall distribution, it is also very
1037
ever, discrepancies were found. In general, the sim-
997
likely that the statistical characteristics observed in
1038
ulated values are consistently higher than those ob-
998
the estimate are close to those of the true population
1039
tained from the metered data. In terms of the distri-
999
distribution. It is important that a refined represen-
1040
bution, similar problems to those of observed in the
1000
tation of the characteristics of the distributions ob-
1041
distributions of total household annual consumptions
1001
served in Fig. 3 is achieved, as this is key to a robust
1042
were found. The variability observed in the metered
1002
approach to residential electricity demand modelling.
1043
data is consistently under-represented by the simula-
1003
The analysis of the distribution of metered annual
1044
tions’ output. Moreover, the annual consumptions of
1004
consumptions generated many interesting results,
1045
individual simulated appliances present skewed dis-
1005
which include analytic expressions for the probability
1046
tributions. These distributions were found to be con-
1006
density function (PDF) and the cumulative distribu-
1047
sistent with Weibull distributions. A summary of the
1007
tion function (CDF) (see Sec. 6.2). Knowing the
1048
parameters for the fitted models is presented in Table
1008
analytical expression for the cumulative distribution
1049
4.
1009
allows for the use of methods such as the inverse
1050
an
983
based on the reasonable expectation that their demand loads would be linked to households activities.
Ac ce
pt
ed
M
In the CREST model’s simulations, a calibration fac-
17
A comparison was also made between metered and
Page 17 of 21
simulated mean daily load profiles extracted from
1090
ancies were found when the corresponding distribu-
1052
the corresponding HES243 datasets. Important dif-
1091
tions were compared (see Figs. 2 & 5). In particu-
1053
ferences in the overall features of both profiles were
1092
lar, it was found that metered household annual con-
1054
identified. Further in-depth analysis is needed in or-
1093
sumptions follow a complex distribution from which
1055
der identify the underlying causes of these issues.
1094
three clusters can be identified. The distributions ob-
1056
However, a more detailed characterisation of said
1095
served in the simulated data are much simpler. Cur-
1057
differences falls beyond the scope of this paper. In
1096
rent underlying assumptions about electricity use ap-
1058
particular, it would be interesting to investigate in
1097
pear to lead to normally distributed data with consis-
1059
more detail the relationship between household ac-
1098
tently limited variability. Therefore, these assump-
1060
tivity patterns, appliance usage and the intra-day
1099
tions need to be rectified so as to prevent the misrep-
1061
variations in demand levels. This will be the subject
1100
resentation of the statistical characteristics observed
1062
of future work.
1101
in metered data.
1063
8. Conclusions
us
cr
ip t
1051
1102
The diversity of real-life households is reflected by the complexity of the distribution of total an-
In this paper, a comparative analysis of the sta-
1103
1065
tistical characteristics of metered and stochastically
1104
1066
simulated electricity consumption datasets was pre-
1105
1067
sented. Metered household- and appliance-level elec-
1106
1068
tricity consumptions were extracted from the UK’s
1107
a trimodal Gaussian mixture model which provides
1069
Household Electricity Survey dataset (see Sec. 4).
1108
a very good estimate of the observed distribution.
1070
Corresponding electricity consumptions were simu-
1109
Analytical expressions for both the density function
1071
lated using a widely implemented probabilistic model
1110
and cumulative distribution are given. In a similar
1072
based on the UK residential sector (see Sec. 3). The
1111
manner, the distributions of annual consumptions of
1073
model was configured such that the simulated house-
1112
individual appliances were found to be well repre-
1074
holds matched the relevant features of the metered
1113
sented by Weibull models. As this paper demon-
1075
counterparts. The analysis of the differences between
1114
strates, it is possible to use estimates that capture
1076
datasets allowed us to provide a measure of how well
1115
reasonably well the complexity of the observed dis-
1077
represented are the observed features by probabilis-
1116
tribution while preserving models’ simplicity.
1078
tic models based on currently used approaches and
1079
to identify key shortcomings.
an
1064
nual consumptions. A more complex distribution does not necessarily mean over-complicated assump-
Ac ce
pt
ed
M
tions need to be used in the models. We propose
1080
In particular, we compared the way simulated and
1117
The analysis presented shows that residential de-
1081
metered households and individual appliances are
1118
mand for electricity is more diverse than hitherto
1082
distributed with respect to total annual consump-
1119
assumed. A better idea of how households and in-
1083
tions. An element of the assessment of the model’s
1120
dividual appliances are distributed with respect to
1084
performance concerns how well represented are these
1121
annual consumption provides us with opportunities
1085
distributions by the simulated output.
Based on
1122
for further refinement of current probabilistic mod-
1086
the analysis presented in this paper we can con-
1123
els. In addition, the results further support the fact
1087
clude that current probabilistic models fail to cap-
1124
that current and future modelling approaches would
1088
ture some of the key characteristics observed in the
1125
benefit greatly from the use of larger, highly resolved
1089
metered datasets (see Sec. 6.2). Significant discrep-
1126
metered datasets.
18
Page 18 of 21
[6] EA Technology, 2012. Assessing the impact
Acknowledgements
of low carbon technologies on Great Britain’s
This work was partially supported by the Consejo
power distribution networks. Tech. rep., Ofgem.
Nacional de Ciencia y Tecnolog´ıa (CONACyT).
URL
https://www.ofgem.gov.uk/
publications-and-updates/assessing-
References
impact-low-carbon-technologies-great[1] Aerts, D., Minnen, J., Glorieux, I., Wouters,
britains-power-distribution-networks
I., Descamps, F., 2014. A method for the [7] Energy Saving Trust, DECC, DEFRA, Owen,
ip t
identification and modelling of realistic domes-
P., 2012. Powering the nation:
Household
demand simulations and peer comparison.
electricity-using habits revealed. Tech. rep.,
Build. Environ. 75, 67–78.
Energy Saving Trust, Department of Energy
URL
and Climate Change, Department for Environ-
cr
tic occupancy sequences for building energy
http://dx.doi.org/10.1016/j.
ment Food and Rural Affairs, London, United
us
buildenv.2014.01.021
Kingdom. [2] Ballanti, A., Ochoa, L. F., 2015. Initial assess-
URL
ment of voltage-led demand response from UK
an
uk/Publications2/Corporate/Research-
residential loads. In: 2015 IEEE Power Energy
and-insights/Powering-the-nation-
Soc. Innov. Smart Grid Technol. Conf. IEEE,
URL
household-electricity-using-habits-
M
pp. 1–5.
http://www.energysavingtrust.org.
revealed
http://dx.doi.org/10.1109/ISGT.
[8] Good, N., Zhang, L., Navarro-Espinosa, A.,
ed
2015.7131821
[3] Ballanti, A., Pilo, F., Navarro-Espinosa, A.,
Ochoa, L. F., 2013. Assessing the benefits of
pt
PV var absorption on the hosting capacity of
Mancarella, P., 2015. High resolution modelling of multi-energy domestic demand profiles. Appl. Energy 137, 193–210. URL
http://dx.doi.org/10.1016/j.
LV feeders. In: IEEE PES ISGT Eur. 2013.
URL
apenergy.2014.10.028
Ac ce
IEEE, pp. 1–5.
[9] Ipsos-RSL, September 2003. Ipsos-RSL and Of-
http://dx.doi.org/10.1109/
fice for National Statistics, United Kingdom
ISGTEurope.2013.6695423
Time Use Survey, 2000 [computer file]. 3rd Edi-
[4] Bondu, A., Dachraoui, A., 2015. Realistic and
tion. Colchester, Essex: UK Data Archive [dis-
very fast simulation of individual electricity
tributor], SN: 4504, URL http://dx.doi.org/
consumptions. In: 2015 Int. Jt. Conf. Neural
10.5255/UKDA-SN-4504-1.
Networks. IEEE, pp. 1–8. URL
[10] McKenna, E., Thomson, M., 2016. High-
http://dx.doi.org/10.1109/IJCNN.
resolution stochastic integrated thermalelectri-
2015.7280339 [5] DECC, 2015. Chapter 3:
cal domestic demand model. Appl. Energy 165,
Domestic energy
445–461.
consumption. In: Energy Consum. UK. Depart-
URL
ment of Energy and Climate Change, London,
http://dx.doi.org/10.1016/j.
apenergy.2015.12.089
United Kingdom, Ch. 3, p. 17. https://www.gov.uk/government/
[11] Navarro, A., Ochoa, L., Mancarella, P., Ran-
statistics/energy-consumption-in-the-uk
dles, D., 2013. Impacts of photovoltaics on low
URL
19
Page 19 of 21
voltage networks: a case study for the north
Cambridge, United Kingdom.
west of England. In: 22nd Int. Conf. Exhib.
URL
Electr. Distrib. (CIRED 2013). Institution of
publications/household-electricity-
Engineering and Technology, pp. 1419–1419.
survey--2
URL http://dx.doi.org/10.1049/cp.2013.
https://www.gov.uk/government/
[17] Richardson, I., Thomson, M., Infield, D., 2008.
1229
A high-resolution domestic building occupancy model for energy demand simulations. Energy
[12] Office for National Statistics, 2003. The United
Build. 40 (8), 1560–1566.
port. Tech. rep., Office for National Statistics,
URL
London, United Kingdom.
enbuild.2008.02.006
http://www.ons.gov.uk/ons/rel/
https://dx.doi.org/10.1016/j.
cr
URL
ip t
Kingdom 2000 Time Use Survey: Technical Re-
[18] Richardson, I., Thomson, M., Infield, D., Clifford, C., 2010. Domestic electricity use: A
use-survey--technical-report.pdf
high-resolution energy demand model. Energy
us
lifestyles/time-use/2000-edition/time-
Build. 42 (10), 1878–1887.
[13] Ofgem, 2015. Typical Domestic Consumption
URL
URL
https://dx.doi.org/10.1016/j.
an
Values for gas and electricity 2015.
enbuild.2010.05.023
https://www.ofgem.gov.uk/gas/
retail-market/monitoring-data-
[19] Swan, L. G., Ugursal, V. I., 2009. Modeling of
and-statistics/typical-domestic-
M
end-use energy consumption in the residential
consumption-values
sector: A review of modeling techniques. Renew. Sustain. Energy Rev. 13 (8), 1819–1835.
[14] ONS, 2015. Families and Households: 2015.
URL https://dx.doi.org/10.1016/j.rser.
don, United Kingdom.
2008.09.033
URL
ed
Tech. rep., Office for National Statistics, Lon-
pt
http://www.ons.gov.uk/
[20] Torriti, J., 2014. A review of time use models of residential electricity demand. Renew. Sustain.
birthsdeathsandmarriages/families/
Energy Rev. 37, 265–272.
Ac ce
peoplepopulationandcommunity/
bulletins/familiesandhouseholds/2015-
URL
11-05
2014.05.034
http://dx.doi.org/10.1016/j.rser.
[15] Page, J., Robinson, D., Morel, N., Scartezzini,
[21] UKERC Energy Data Centre, Electricity Asso-
J.-L., 2008. A generalised stochastic model for
ciation, Elexon Ltd., 2013. Electricity user load
the simulation of occupant presence. Energy
profiles by profile class. Tech. rep., Electricity
Build. 40 (2), 83–98.
Association, London, United Kingdom.
URL
URL http://data.ukedc.rl.ac.uk/browse/
http://dx.doi.org/10.1016/j.
enbuild.2007.01.018
edc/Electricity/LoadProfile/doc
[16] Palmer, J., Terry, N., Kane, T., Firth, S.,
[22] Wahba, G., 1975. Optimal Convergence Prop-
Hughes, M., Pope, P., Young, J., Knight, D.,
erties of Variable Knot, Kernel, and Orthogonal
Godoy-Shimizu, D., 2013. Electrical appliances
Series Methods for Density Estimation. Ann.
at home: tuning in to energy savings. Tech.
Stat. 3 (1), 15–29.
rep., Cambridge Architectural Research —
URL
Element Energy — Loughborough University,
1176342997 20
https://dx.doi.org/10.1214/aos/
Page 20 of 21
[23] Wid´en, J., Molin, A., Elleg˚ ard, K., 2012. Models of domestic occupancy, activities and energy use based on time-use data:
deterministic
and stochastic approaches with application to various building-related simulations. J. Build. Perform. Simul. 5 (1), 27–44. URL
http://dx.doi.org/10.1080/
ip t
19401493.2010.532569 [24] Wid´en, J., W¨ ackelg˚ ard, E., 2010. A highresolution stochastic model of domestic activity
cr
patterns and electricity demand. Appl. Energy 87 (6), 1880–1892. URL
us
https://dx.doi.org/10.1016/j.
apenergy.2009.11.006
an
[25] Wilke, U., Haldi, F., Scartezzini, J.-L., Robinson, D., 2013. A bottom-up stochastic model
activities. Build. Environ. 60, 254–264. URL
M
to predict building occupants’ time-dependent
http://dx.doi.org/10.1016/j.
ed
buildenv.2012.10.021
[26] Yao, R., Steemers, K., 2005. A method of formulating energy load profile for domestic
663–671.
https://dx.doi.org/10.1016/j.
Ac ce
URL
pt
buildings in the UK. Energy Build. 37 (6),
enbuild.2004.09.007
[27] Zimmermann, J.-P., Evans, M., Griggs, J., King, N., Harding, L., Roberts, P., Evans, C., 2012. Household Electricity Survey: A study of domestic electrical produc usage - Final report. Tech. rep., Intertek, London, United Kingdom. URL http://randd.defra.gov.uk/Default. aspx?Menu=Menu&Module=More&Location= None&ProjectID=17359&FromSearch= Y&Publisher=1&SearchText=EV0702& SortString=ProjectCode&SortOrder=Asc& Paging=10#Description
21
Page 21 of 21