Generation of synthetic video traffic using time series

Simulation Modelling Practice and Theory 75 (2017) 127–145 Contents lists available at ScienceDirect Simulation Modelling Practice and Theory journa...

Download PDF

2MB Sizes 0 Downloads 38 Views

Report

PDF Reader
Full Text

Simulation Modelling Practice and Theory 75 (2017) 127–145

Contents lists available at ScienceDirect

Simulation Modelling Practice and Theory journal homepage: www.elsevier.com/locate/simpat

Generation of synthetic video traﬃc using time series Christos Katris∗, Sophia Daskalaki Department of Electrical & Computer Engineering, University of Patras, Rio 26504 Greece

a r t i c l e

i n f o

Article history: Received 9 June 2016 Revised 5 February 2017 Accepted 13 April 2017

Keywords: Synthetic video traﬃc Time series models FARIMA FARIMA-GARCH Non-normality FARIMA-to-LN

a b s t r a c t Generating traﬃc has always been an important part of network simulations but has turned to an even more challenging task with modern networks. The statistical properties of the input stochastic processes traced in the networks used all along Information Era turned out to be complicated and diﬃcult to reproduce. Taking into account successful efforts in modeling Internet traﬃc with FARIMA time series models, this paper attempts to extend their applicability and employ them to generate synthetic video traﬃc. It is known that FARIMA can model both the Short Range (SRD) and Long Range Dependence (LRD) existing in video traﬃc; however the traces it produces fail to describe correctly the moments (mean, standard deviation, skewness, kurtosis) of the distribution behind the data. Since an eﬃcient traﬃc generator should capture both the statistical properties and queuing behavior of video traﬃc we experiment with models such as FARIMA with Student’s t errors and FARIMA-GARCH with Normal and Student’s t errors, improving somewhat the accuracy of the generated traﬃc. Furthermore, the paper suggests the projection of the traces generated by a FARIMA model to values of a Lognormal distribution. It is shown that such a methodology produces synthetic traces that can emulate very closely the behavior of real traces. In order to quantify closeness the generated traces are fed into a simple FIFO queuing system with ﬁnite buffers, where loss probability is calculated and compared to that experienced by the corresponding real traces. Using ﬁve different real traces, MPEG-4 or H.263, it is shown that the proposed methodology produces traﬃc generators that can capture satisfactorily several statistical properties of the real traﬃc and also its queuing behavior for a wide range of buffer sizes and service rates. © 2017 Elsevier B.V. All rights reserved.

1. Introduction Generation of synthetic traﬃc is an important procedure quite necessary for performance evaluation when designing a network and for network management. Its necessity has become even more pressing due to the remarkable growth of video applications and multimedia users reported during the recent years. When reliable, synthetic traﬃc generators can be used with simulations to resolve problems in multimedia traﬃc management, such as bandwidth allocation, design of protocols and admission controls [1], while testing real networks for their performance can be expensive and sometimes prohibitive. For this reason it is preferable to use mathematical models and/or simulations to evaluate the performance of a network. In general, statistical models are considered better choice than trace-driven simulations since they can shed light to traﬃc characteristics, they are stochastic and not static, and can describe different situations when tuning the parameters in the models. ∗

Corresponding author. E-mail addresses: [email protected] (C. Katris), [email protected] (S. Daskalaki).

http://dx.doi.org/10.1016/j.simpat.2017.04.005 1569-190X/© 2017 Elsevier B.V. All rights reserved.

128

C. Katris, S. Daskalaki / Simulation Modelling Practice and Theory 75 (2017) 127–145

In order to generate synthetic video traﬃc using time series one should consider models that are suitable for this kind of traﬃc, such as FARIMA (Fractional ARIMA), since these models capture properties like SRD (Short Range Dependency) and LRD (Long Range Dependency) that normally video traﬃc appear to carry. According to Alheraish [2] a successful video traﬃc model should be able to capture the statistical characteristics of a real trace (mean, standard deviation, coeﬃcient of variation, mode and autocorrelation) and also its dynamical behavior, e.g. to expose similar queuing behavior. It should be as simple as possible, generate traﬃc with low computational complexity and if possible characterize a wide range of video sources. According to Casilari et al. [3] approximating the probability density function (pdf) of the underlined random process is of utmost importance for matching the queuing behavior of a real traﬃc trace, while SRD is of secondary importance and LRD of even less. The method proposed in this paper satisﬁes relatively well the above criteria and certainly improves proximity to real traﬃc compared to the classical FARIMA models. The main scope of the paper is to present a new procedure that is based on time series models and can be used for generating synthetic traﬃc for full motion video. Several VBR source models that have been used in the past are based on Autoregressive (AR) processes and a comprehensive description can be found in [4], along with a thorough and most recent survey on models for video traﬃc. Here we brieﬂy refer only to those that have been used for video traﬃc generation. In general the AR approach is simple and requires few parameters. For full motion video, however, which could comprise frames of different types (I, B, or P), single AR models are not suitable and usually a combination of AR processes with different residual distributions for each frame type is required. As a consequence frame-based [5,6], scene-based and nested AR models [7] have been adopted. Of course this increases complexity to modeling and formulation. Other alternative variations of AR models include the DAR (Discrete Autoregressive) model of order p [8,9], GAR (AR model with Gamma residual process), GBAR (Gamma-Beta Autoregressive model) [10] and the General AR model that also includes GACS (Gaussian Autoregressive and Chi-Square) [11,12]. The projected AR (PAR) models form yet another interesting category of models for generating video traﬃc. They attempt to preserve autocorrelation and at the same time to ﬁt the frame-size histogram. Previous work on such traﬃc models uses different types of AR processes according to the type of video and encoding scheme [13]. They appear to capture in general the autocorrelation behavior of a video and estimation of their coeﬃcients from empirical data is relatively simple. On the other hand, it is not possible to ﬁnd a single AR model that can capture different statistical characteristics and also there is no single video model that is suitable for all video sequences and for all purposes. Moreover, self-similar models have been used successfully for video traﬃc modeling and generation. FARIMA models are the most widely used, individually [14,15] and in more composite frameworks [16,17,3]. Casilari et al. [3] considered brieﬂy projections of the FGN (Fractional Gaussian Noise) and FARIMA models to a desired distribution. Apart from the time series models there has been a long standing effort for modeling traﬃc using Markov and Markov modulated models, most of them quite successfully for some type of video [e.g. [18,19]]. Their level of complexity is considered much higher than the level required from the AR family of models. Lastly, wavelets have formed another group of successful models for video traﬃc. Their success has been reported in a number of articles [e.g. [20,21]]. The ultimate goal of our approach is to provide a general framework, which can be adapted to several circumstances and is able to generate traﬃc that mimics real video traces, using only a certain time unit and very little information of the empirical traﬃc, such as the mean and standard deviation. In the core of our approach is the dependence structure (long and short range) that exists in video traﬃc, and for this reason we employ FARIMA models. On the other hand, for capturing the queuing behavior of real traﬃc, the marginal distribution of the underlying process should be matched. For this reason, we consider a 2-stage approach where an initially ﬁtted FARIMA model to a video trace is transformed to a data series with a pre-determined target distribution. It builds upon the PAR types of models using FARIMA modeling in conjunction with the Lognormal distribution, which requires only two parameters and its CDF is easily invertible. With this approach we can accurately capture the marginal distribution of the traﬃc. Although this approach is superior to other FARIMA and ARIMA alternatives in capturing queuing performance of real traﬃc, we also have to assess its ability to preserve the SRD and LRD property of the FARIMA model. We perform this assessment experimentally. The classical FARIMA model is usually very successful in capturing the LRD and SRD properties of a real traﬃc trace. However our experimentation showed that it cannot capture its marginal distribution, when it is far from normal, neither its non-linear dependencies, when they are present. To overcome this we considered as traﬃc generators FARIMA models with Student’s t and Normal Inverse Gaussian error distributions, and also FARIMA-GARCH models with Normal, Normal Inverse Gaussian and Student’s t errors. A number of these models achieved slight improvements for some traces, but none achieved a clear and deﬁnite improvement. As an alternative we develop a two-step traﬃc generating procedure which combines FARIMA models with Lognormal distribution through a transformation of percentiles. Such a procedure ﬁrst embodies the desired SRD and LRD characteristics with the help of a FARIMA model and then attempts to approximate the pdf of a recommended distribution, the Lognormal in this case. The method is analogous to ARTA (AR To Anything) process [22,23] which combines an AR model with a desired marginal distribution and generates a series that displays the autocorrelation of the AR model and at the same time achieves a desired marginal distribution. The method is tested with simulated traces which include different characteristics, and then applied to real traces successfully compared with the aforementioned models. Next, we describe analytically the procedure, argue on its eﬃcacy and set up a queuing framework for evaluating its performance. The evaluation is performed experimentally and shows that transforming a FARIMA generated trace to variates of a Lognormal distribution matches suﬃciently well real video traces. Section 2 presents the time series models that will

C. Katris, S. Daskalaki / Simulation Modelling Practice and Theory 75 (2017) 127–145

129

be used in this paper, i.e. FARIMA and FARIMA-GARCH. Section 3 gives the details for the proposed procedure which can be used either to generate video traﬃc when signiﬁcant amount of information is known about a video trace but also when limited information is given. In Section 4 a performance evaluation scheme based on a FIFO single server queuing system is proposed for the comparison of the produced traces. Section 5 renders an extensive experimentation of the procedure using ﬁve different video traces and lastly Section 6 summarizes and concludes this work. 2. Synthetic traﬃc with time series models In this section we present the time series models that will be used for generating traﬃc. It is assumed that traﬃc is expressed with a random process {Xt , t = 1, 2, . . .} which represents the traﬃc load that arrives at time unit t. We consider discrete-time time series models and focus on those that match the statistical characteristics of video traﬃc, i.e. FARIMA and FARIMA-GARCH. 2.1. FARIMA models The Fractional ARIMA (FARIMA) models introduced by Granger and coworkers [24,25], are extensions of ARIMA and have the ability to describe both short and long range dependencies of the data. FARIMA models have been used previously to describe internet traﬃc in general and video traﬃc in particular. In [26] FARIMA models were used to model broadband traﬃc and in several situations they were found to improve pure LRD models. In [27] FARIMA models were found to be suitable for estimating the queuing performance under heavy-load conditions. In [28] a FARIMA based methodology for bandwidth allocation was reported to be more successful compared to other models. Similarly, in [29] FARIMA models were used as basis in a bandwidth allocation scheme for different types of traﬃc, including JPEG and MPEG traﬃc. Lastly, in [1] and [3] FARIMA models are compared with White Noise, Autoregressive (AR) and Fractal Gaussian Noise (FGN), and found to be superior in modeling VBR traﬃc. The formulation of a FARIMA time series model is given as:

p (L )(1 − L )d (Xt ) = q (L )εt , where L is the lag operator, p (L ) = 1 − ϕ1 L − . . . − ϕ p L p , q (L ) = 1 + θ1 L + . . . + θq Lq , (1 − L )d =

d j

(−1 ) =

∞

j=0

d j

(1)

(−1 ) j L j and

(−d+ j ) (−d )( j+1 ) .

Long memory is displayed by the process, when 0 < d < 0.5 . The classical FARIMA model assumes that the error terms ɛt are normally distributed with zero mean and constant variance σ 2 . However, since studies with Internet traﬃc datasets suggest mostly non-symmetric leptokurtic distributions, for several applications the classical model carries an obvious weakness. For this reason, we consider FARIMA models with Student’s t innovations, i.e. the error terms are assumed to follow Student’s t distribution with zero mean, constant variance σ 2 and v degrees of freedom, where v > 2. The pdf of the location-scale version of Student’s t distribution is known to be:

v+1

( x − α )2 f (x; a, β , v ) = v 1 + βv βvπ 2 2

−( v+1 2 )

where α , β and v are the location, scale and shape parameters, respectively, and (.) is the Gamma function. The mean of the distribution is the location parameter α and the variance is given as (vβv . We should note here that for this study we −2 ) additionally experimented with Normal Inverse Gaussian distribution for the error terms but experienced large deviations for several traﬃc traces, so there is no need reporting their performance. The ﬁtting procedure for a FARIMA model to a dataset is not trivial in general and the one followed here is described in [30]. The estimation of the parameters (except d) is realized through Maximum Likelihood (MLE), however a crucial step is to decide upon the value of d, the fractional parameter, and for this paper it is estimated using the Geweke and PorterHudak estimator [31]. When the estimator fails to achieve a value between 0 and 0.5, then we conventionally use the value 0.25. 2.2. FARIMA-GARCH models The FARIMA-GARCH models are extensions of the FARIMA models, in the sense that they allow heteroskedasticity for the errors and in some cases may achieve better results than plain FARIMA. These models assume conditionally heteroskedastic errors, quite a rational assumption for video traﬃc, since in a video there are scenes that create lumps of heavy traﬃc (such as action scenes, explosions etc.) and others where traﬃc is much lighter. FARIMA-GARCH models have been used for the prediction of inﬂation in [32] and when variance cannot be assumed as stationary, we expect that their usage for generating synthetic traﬃc will improve both modeling and queuing performance. Examples of articles where linear models such as ARIMA are combined with the non-linear GARCH models for internet traﬃc modeling are [33], where the author concludes that the seasonal AR-GARCH models are superior over the plain seasonal ARIMA models in forecasting accuracy;

130

C. Katris, S. Daskalaki / Simulation Modelling Practice and Theory 75 (2017) 127–145

[34], where ARIMA-GARCH is found to have larger forecasting accuracy than FARIMA; and [35] where it is stated that in that study ARIMA-GARCH model was superior than Seasonal ARIMA (SARIMA) model for forecasting HTTP traﬃc. In this paper, we consider FARIMA-GARCH models, because they have the ability to capture SRD and LRD and because of conditional heteroskedasticity. A FARIMA-GARCH model follows the equation:

p (L )(1 − L )d (Xt − μ ) = q (L )εt where

εt | t−1 ∼ probabilit y dist ribut ion ( p1,..., pk ) and σt2 = ω +

(2)

q

2 αi εt−1 + i=1

p i=1

2 βi σt−1 .

Comparing with the FARIMA model in Eq. (1), the new element here is the second line, where it implies that the error term has a conditional PDF with parameters ( p1,..., pk ). Most commonly it is assumed that the distribution conditional on the information set t−1 is normal with zero mean and σt2 variance. In the last equation, the conditional variance follows the GARCH model of [36]. The ﬁtting procedure we use in this paper is the following. First, we consider the same order and same fractional parameter d as for the corresponding FARIMA process. Additionally, for the conditional variance we consider a GARCH(1,1) model, which is generally accepted to be a suitable approximation for most applications. These two steps are performed for errors that follow Normal or Student’s t distributions. After the ﬁtting procedures are ﬁnalized the resulting models are used for traﬃc generation through simulation. For the ﬁrst part of our experimental work, shown in Section 5, we attempt generating traﬃc using FARIMA and FARIMAGARCH models with Normal and Student’s t errors. The results indicate that FARIMA with t errors prevail and the generated traces follow relatively closer the behavior of the real traces. As a next step and in order to approximate even further we introduce a 2-step methodological procedure that in addition involves the marginal distribution of Xt . 3. Proposed approach for generating video traﬃc The 2-step approach to be introduced here aims at generating streams of synthetic traﬃc that behave similarly to some existing video stream and for this we attempt to approximate satisfactorily both the dependence structure observed in the data and its marginal PDF. First, we create a sequence of values with the help of a time series model which is suitable for describing the dependence structure found in an actual video stream; and then through a statistical transformation, which is equivalent to the Inverse Sampling method, we project the simulated values to a suitable distribution so to approximate the marginal distribution hypothesized for the actual data. The rationale behind this procedure stems from the fact that the initial simulated data from the time series model will somehow exhibit the dependence structure traced in the actual video stream; however, as it became clear from our study, they will not normally ﬁt well the PDF of the real stream. In order to handle such a deﬁciency, the simulated data are projected to a new corresponding trace carrying the desired distribution. The projection is realized using the percentile ranks of the data simulated previously and then through the inverse CDF that corresponds to some distribution assumed for the real traﬃc trace. As it turns out this procedure is capable of creating streams that exhibit approximately a desired dependence structure and at the same time ﬁt well to some desired distribution so it can ﬁnally behave very closely to the original trace. 3.1. Description of the procedure Given a real video traﬃc trace let our goal be the generation of a synthetic trace of length N that behaves similarly to a real trace. Step 1. Simulate N observations from a suitable time series model. We consider a time series model that can describe successfully the dependence structure of the data. Such a model could be either a FARIMA or FARIMA-GARCH and initially it is applied as generator for the synthetic traﬃc (yi , i = 1, . . . , N). Based on theory it is expected that the simulated data will exhibit a dependence structure similar to that of the real observations (i.e. SRD and LRD dependencies). Regarding the distribution however, when we use the classical FARIMA model the generated data will have approximately a normal distribution: (yi ∼ N(μ, σ 2 )). Step 2. Transform simulated observations from step 1 to a target distribution. Given a “target” marginal distribution D, the second step of the traﬃc generation is to transform the simulated data to follow its PDF and at the same time retain the original dependence structure. The proposed transformation is realized through percentiles. More speciﬁcally, the probabilities pi s that correspond to theyi s (traﬃc from step 1) are all calculated using the Normal distribution ( pi = FN (yi )). Then using the inverse cumulative function of D the percentiles xi = FD−1 ( pi ) that correspond to the pi s are calculated. The values xi produced form a new trace of synthetic traﬃc distributed according to D with mean μ ˆ D and standard deviation σˆ D and carry approximately the same dependence structure as the yi s, both essential ingredients for capturing the dynamics of the behavior of the real traces in a queuing situation.

C. Katris, S. Daskalaki / Simulation Modelling Practice and Theory 75 (2017) 127–145

131

3.2. Selecting the target distribution Based on previous studies the marginal distribution of internet traﬃc cannot be described by a normal distribution. For video traﬃc, and in general for network traﬃc, the lognormal has been suggested instead [4,37]. In order to evaluate the suitability of Lognormal for our case, the empirical distribution is constructed ﬁrst using kernels [38,39]. The kernel x−x density estimate for the empirical function of a set of data is given by fˆh (x ) = 1n ni=1 1h K ( h i ), where K(.) is the kernel, a non-negative function that has mean zero and integrates to 1, and h is a smoothing parameter called bandwidth. There are several kernel functions such as uniform, triangular, Epanenchnikov and normal. For this paper we used the normal (Gaussian) kernel and the univariate plug-in selection method of [40] for the selection of the bandwidth. In Section 5.3 the empirical distributions of ﬁve different real video traces will be calculated and compared to Lognormal models to conclude that among different choices the Lognormal distribution prevails; this is due to its goodness of ﬁt but also because it requires only two parameters that are easily estimated with the knowledge of a targeted mean and variance. 3.3. Evaluation of dependence structure The 2-step methodology previously described is based on the assumption that the traces generated from such a procedure will retain, at some signiﬁcant level, the dependence structure originally held from the time series. The dependence structure here refers mainly to short and long range dependencies, i.e. autocorrelation and Hurst exponent, and to possible non-linearities. In order to test such a premise, in Section 5 we proceed with an additional experimentation. At ﬁrst, data are simulated from ARIMA, FARIMA, ARIMA-GARCH and FARIMA-GARCH models. It is expected that the simulated data carry the SRD property and this can be diagnosed with a Ljung-Box test indicating signiﬁcant autocorrelation. The Hurst exponent (H), on the other hand, can indicate the presence of LRD. It is a measure of long memory and takes values from 0 to 1. If H ∈ (0, 0.5) the series is expected to be antipersistent (i.e. high values tend to follow low values and vice versa), if H ∈ (0.5, 1) the series is expected to exhibit long-memory (i.e. high values tend to follow high values for some time into the future) and if H = 0.5 then absence of long memory is expected. There are a few different methods to estimate the H exponent in a time series, however there is no general agreement about the most accurate and/or suitable method. In this paper we will employ the Aggregated Variance method [41]. However, towards the end of our experimental work in Section 5, we additionally consider other alternatives for the calculation of Hurst exponent, including the more recent and according to studies accurate wavelet method [42]. Lastly, for the property of non-linear dependencies one can use the White Neural Network test. With these three tools it is possible to trace the dependencies existing in each one of the simulated series. Then using the inverse cumulative function of a Lognormal model the aforementioned transformation can be applied to the simulated data and thus generate traces that follow Lognormal distributions of desired parameters. Since it is not certain at which level the dependence structure is carried over, the transformed data will be re-evaluated using the same tools as previously. 3.4. Application to a new video trace The procedure presented in Section 3.1 assumes availability of a real trace which is actually utilized to ﬁt the initiating time series model. In case of an unknown video trace this step will not be possible; however the proposed procedure can still be used as long as there is an estimate or a guess for its mean and variance. Given there is no trace to ﬁt the model, a FARIMA(1,1) can be considered as reference model, with coeﬃcients p1 = 0.5, q1 = 0.25 and fractional parameter d = 0.3. This value of d corresponds to a Hurst exponent of approximately H = 0.8 and establishes long range dependencies in the process, quite a realistic assumption for real video traces. At the second step of the procedure an additional problem is related to the parameter estimation (μ ˆ logyi , σˆ logyi ). Since it is assumed that there is an estimate (or at least a target value) for the mean and standard deviation of the trace, then it is possible to estimate the Lognormal parameters through the Bayesian Monte Carlo Markov Chain (MCMC) approach. In Section 5.5 the modiﬁed procedure just described is applied to generate traﬃc using only the means and standard deviations of several traces. The procedure’s eﬃcacy is then evaluated by comparing the performance of the simulated and real traces in the queuing framework described in the next section. 4. Empirical performance evaluation of the approach In traﬃc generation it is well understood that similarity of traces can be achieved when their corresponding descriptive statistics, PDF and time-dependent characteristics are matched, since they all inﬂuence somehow the performance of a trace. However, since it is not easy to set the level at which all such characteristics should be matched throughout the empirical analysis we will follow a 3-fold comparison. One refers to the descriptive statistics, i.e. mean, standard deviation, skewness, kurtosis, coeﬃcient of variation (CV), the other to certain time-dependent characteristics, i.e. stationarity, autocorrelation, LRD, non-linearity, and the third to the dynamical behavior of a trace which is formed by the video sequences. While the ﬁrst two are pretty standard to handle, for the third we propose a queuing framework where the performance of traﬃc traces can be evaluated and compared. If the queuing performance of the synthetic traﬃc in this queue is close to the performance of the traﬃc it imitates the generating procedure is validated to be successful.

132

C. Katris, S. Daskalaki / Simulation Modelling Practice and Theory 75 (2017) 127–145 Table 1 Overview of datasets.

Trace Tr1 Tr2 Tr3 Tr4 Tr5

-

Silence of the Lambs Mr. Bean Star Trek Die Hard III The Firm

Type

Length (in 12-frame aggregates)

VBR / H.263 VBR / H.263 VBR / H.263 MPEG4 (HQ) VBR / H.263

40 0 0 40 0 0 40 0 0 40 0 0 40 0 0

Table 2 Descriptive statistics of real traces. Trace

Mean

Sd

Skewness

Kurtosis

CV

Tr1 Tr2 Tr3 Tr4 Tr5

32.108 31.608 17.640 42.528 14.309

31.343 16.977 12.799 22.150 10.915

3.256 1.922 2.518 1.363 2.143

19.438 10.453 14.158 5.325 10.112

0.976 0.537 0.726 0.521 0.763

Let us consider a single server FIFO queue with deterministic service rate and ﬁnite buffer. The queue is fed with traﬃc which is either a real trace or traﬃc generated from one of the procedures described previously. At each time period some traﬃc load (in kBytes) arrives and requests bandwidth. If the load exceeds availability (service capacity), the excess is placed in the buffer; however if the buffer size is also exceeded then loss will be realized. The traﬃc that should be served at time t (TRt ), at a FIFO basis, is the buffered load up until time t-1 (QTt -1 ) plus the new traﬃc that arrives during time period t (NTt ).

T Rt = N Tt + Q Tt−1 The performance measure that will be used for the queuing system under study is the loss probability: Pr[T Rt > SC + BS], where SC and BS are the service capacity and buffer size, respectively. The evaluation will be performed in two ways. Firstly the generators will be compared for their ability to capture the queuing behavior of real traﬃc when service rate is ﬁxed and buffer sizes vary, and secondly when the buffer size is set to some ﬁxed value and the service rates vary. Using these two schemes for comparison the traces generated by the different models will be compared for their performance with the actual trace and among themselves. For each traﬃc model the goal is to generate traces that are as close as possible to the real traﬃc traces, where closeness refers not only to the descriptive statistics, but also to the dependence structure characteristics, i.e. SRD and LRD, and also its queuing behavior for a wide range of service rates and buffer sizes. 5. Empirical analysis with real video traces In this section, the methodology described previously is realized to generate traﬃc that successfully emulates the behavior of real video traces. The focus is on capturing, as much as possible, the key statistical characteristics found in a collection of real traces, so that in a typical queuing situation the generated synthetic traces would perform similarly to their corresponding real traces. 5.1. Description of the real traces The data which are used throughout this analysis consist of ﬁve traces, four VBR with H.263 coding and one MPEG-4 of high quality. The ﬁve datasets are excerpts from publicly available video traces in TU-Berlin [43], while the aggregation level was decided to be the same for all of them and equal to kBytes per 12 frames (approximately a GOP level). Tables 1 and 2 display an overview of the datasets and their descriptive statistics, respectively. In Table 1 we give the names of the videos where the traces came from, the types and coding schemes, and the length considered for each dataset. In Table 2, for each trace we report the mean, standard deviation, skewness, kurtosis and coeﬃcient of variation (CV). All datasets indicate clearly right skewed and leptokurtic distributions, with Tr4 to display the smallest discrepancy from symmetry and possibly normality. It is also interesting to note that the CV, which is a measure of relative dispersion, varies among the traces and it is possible that larger values indicate increased diﬃculty in modeling the trace. Moreover, Table 3 presents certain time-dependent properties observed for the ﬁve traces, i.e. stationarity, autocorrelation, long range dependence and non-linearity. According to the ADF (Augmented Dickey-Fuller) and Ljung-Box tests all traces can be assumed stationary and also display strong autocorrelation. Additionally all traces indicate the existence of LR dependency, with the estimated Hurst exponents to be in the range (0.5, 1.0) or, as in the case of Tr4, barely over 1. Finally, using the White NN test [44], the hypothesis of linearity is rejected in some traces (e.g. Tr5) and not in others (e.g. Tr3 and Tr4).

C. Katris, S. Daskalaki / Simulation Modelling Practice and Theory 75 (2017) 127–145

133

Table 3 Time dependent properties of real traces.

Trace

Stationaritya (ADF-test)

Tr1

Lag = 15 None: −5.6875 Intercept: −8.7382 Intercept+trend: −8.9462 (p-value < 0.01)

Tr2

Tr3

Tr4

Tr5

a

Lag = 15 None: −2.9672 Intercept: −8.4511 Intercept+trend: −8.8663 (p-value < 0.01) Lag = 15 None: −4.5573 Intercept: −9.8228 Intercept+trend: −9.9472 (p-value < 0.01) lag= 15 None: −3.0013 Intercept: −7.8074 Intercept + trend: −8.3516 (p-value< 0.01) Lag = 15 None: −4.0049 Intercept: −7.7504 Intercept+trend: −8.4254 (p-value< 0.01)

Autocorrelation (Ljung-Box)

LRD Hurst Exponent (H)

Non-linearity (White-NN test)

3496.5 (<0.01)

0.89767

7.2233 (0.07487)

2780 (<0.01)

0.82712

5.7184 (0.05731)

3026.6 (<0.01)

0.75789

3.0149 (0.2215)

3382.2 (<0.01)

0.86057

1.538 (0.4635)

3240.3 (<0.01)

0.82746

9.4833 (<0.01)

Stationarity is the alternative hypothesis.

Given that the data used for calculating the descriptive statistics are not i.i.d. and instead they are correlated, an effort has been made to estimate the bias introduced by the presence of autocorrelation using the block bootstrap approach [45]. This is a method for correcting estimations of moments when data are autocorrelated and stationary, like in our case. According to the block bootstrap the time series data yi are split in overlapping blocks of length b. Each block is represented as blockt = {yt , yt+1 , . . . , yt+b−1 }, where t = 1,…, N-b + 1 and N is the total number of observations of the time series, while resampling during bootstrap is realized using whole blocks. The crucial question for the block bootstrap technique is the decision about the length b of the blocks. In [46] some rules about the optimal block size are provided, however in this paper we borrow the concept of embedding dimension from dynamical systems. We consider a time series as dynamical system, where the value of each observation yi is a function of the m previous ones yt−1 , . . . , yt−m . Following the Embedding Theory of Takens and Mañé a suﬃciently large embedding dimension m can capture the original time series and a widely used heuristic for calculating m is the False Nearest Neighbor (FNN) [47]. Detailed description for the time series as dynamical systems can be found in [48] and for their application on internet traﬃc forecasting in [30,49]. Speciﬁcally for the block bootstrap estimation technique the embedding dimension will be used as the size of the blocks. Using this method with 10,0 0 0 samples, we can estimate the bias in the time series and then subtract it from the initial biased estimations. Table 4 displays estimations for the mean, standard deviation, skewness and kurtosis both the initial (biased) and the block bootstrap ones, for all ﬁve traces examined previously. The last column gives the 90% conﬁdence intervals calculated by the bootstrap method. We observe that for all cases examined the initial biased estimations lie inside the 90% bootstrap conﬁdence intervals, suggesting that there are no signiﬁcant differences between biased and corrected estimations. For this reason and due to the high computational effort required by the block bootstrap this step is skipped for the next steps.

5.2. Synthetic traﬃc using FARIMA models As a ﬁrst step in creating and comparing traﬃc generators for the ﬁve traces just examined, we are ﬁtting the FARIMA and FARIMA-GARCH models to the traﬃc traces. This task involves decisions on (i) the order of the corresponding ARIMA model, using BIC (Bayes Information criterion), (ii) the fractional parameter d, using the Geweke and Porter-Hudak (GPH) estimator, and (iii) the rest of the parameters of each model, using Maximum Likelihood. For the FARIMA-GARCH models, a GARCH (1,1) component is additionally appended to account for conditional heteroskedasticity. The ﬁtting of the models has been realized with the help of R software [50,51] and for each video trace Table 5 displays the values adopted for the model parameters along with the BIC value for each one of the ﬁtted model. For Tr4, the parameter d was set manually equal to 0.25 since the GPH estimator failed to indicate a value in the desired interval (0, 0.5).

134

C. Katris, S. Daskalaki / Simulation Modelling Practice and Theory 75 (2017) 127–145

Table 4 Block bootstrap estimations of moments and Hurst exponent. Silence of The Lambs (Embedding Dimension: 13) Moment Mean Sd Skewness Kurtosis

Biased 32.108 31.343 3.2556 19.438

Block bootstrap 31.396 29.071 2.942 17.130

90% Bootstrap CI 28.276- 34.232 23.849–33.92 2.103–3.591 9.250–23.815

Mr.Bean (Embedding Dimension: 5) Moment Biased Mean 31.608 Sd 16.9766 Skewness 1.922 Kurtosis 10.453

Block bootstrap 31.529 16.813 1.842 9.802

90% Bootstrap CI 30.026–33.055 15.097–18.51 1.317–2.350 6.058–13.459

Star Trek (Embedding Dimension: 8) Moment Biased Mean 17.640 Sd 12.799 Skewness 2.518 Kurtosis 14.158

Block bootstrap 17.736 12.738 2.436 13.520

90% Bootstrap CI 16.662–18.801 11.269–14.237 1.958–2.893 9.478–17.794

Die Hard III (Embedding Dimension: 7) Moment Biased Mean 42.528 Sd 22.150 Skewness 1.363 Kurtosis 5.325

Block bootstrap 42.759 22.055 1.354 5.293

90% Bootstrap CI 40.549–45.106 20.345–23.699 1.162–1.547 4.407–6.256

The Firm (Embedding Dimension: 7) Moment Biased Mean 14.309 Sd 10.915 Skewness 2.143 Kurtosis 10.112

Block bootstrap 14.299 10.829 2.107 9.838

90% Bootstrap CI 13.195–15.406 9.720–11.917 1.734–2.464 7.102–12.682

Table 5 Optimal parameter values for the models ﬁtted to traces. Trace

Order (p, q)

Fractional parameter (d)

Tr1 FARIMA-N FARIMA– t FARIMA-GARCH-N FARIMA-GARCH – t

(3,1)

0.47113

Tr2 FARIMA-N FARIMA– t FARIMA-GARCH-N FARIMA-GARCH – t

(1,4)

Tr3 FARIMA-N FARIMA– t FARIMA-GARCH-N FARIMA-GARCH – t

(1,3)

Tr4 FARIMA-N FARIMA– t FARIMA-GARCH-N FARIMA-GARCH – t

(1,2)

Tr5 FARIMA-N FARIMA– t FARIMA-GARCH-N FARIMA-GARCH – t

(1,2)

a

BIC 7.629 6.837 6.970 6.503

0.32896 7.299 7.036 7.042 6.916 0.31316 6.487 6.165 6.216 6.039 0.2500a 7.159 6.725 6.945 6.631 0.49882

Parameter d was set manually equal to 0.25.

5.933 5.608 5.594 5.430

C. Katris, S. Daskalaki / Simulation Modelling Practice and Theory 75 (2017) 127–145

135

Table 6 Statistical characteristics of generated traﬃc traces. Trace

# of data

Mean

Sd

Skewness

Kurtosis

H

Ljung-Box test

White NN test

T1 FARIMA-N FARIMA– t FARIMA-GARCH–N FARIMA-GARCH – t

40 0 0 16,845 25,055 2597 24,559

32.108 22.820 33.801 29.440 48.710

31.343 17.605 33.565 31.813 60.164

3.256 1.158 3.381 2.561 3.425

19.438 4.242 26.634 10.001 17.260

0.898 0.785 0.816 0.760 0.975

3496.5 (<0.01) 11,717 (<0.01) 22,494 (<0.01) 1759.3 (<0.01) 24,013 (<0.01)

7.223 (0.075) 504.29 (<0.01) 315.42 (<0.01) 45.955 (<0.01) 96.053 (<0.01)

T2 FARIMA-N FARIMA– t FARIMA-GARCH–N FARIMA-GARCH – t

40 0 0 91,527 86,672 82,336 95,839

31.608 27.518 28.983 23.554 35.356

16.977 14.572 17.544 14.576 18.211

1.922 0.451 1.166 1.067 1.787

10.453 2.863 7.261 2.379 13.331

0.827 0.803 0.823 0.899 0.864

2780 (<0.01) 56,970 (<0.01) 63,984 (<0.01) 53,120 (<0.01) 75,307 (<0.01)

5.7184 (0.057) 400.06 (<0.01) 330.73 (<0.01) 481.4 (<0.01) 188.88 (<0.01)

T3 FARIMA-N FARIMA– t FARIMA-GARCH–N FARIMA-GARCH – t

40 0 0 83,481 75,075 81,216 87,876

17.640 17.002 19.128 15.344 19.689

12.799 10.172 14.356 9.978 13.415

2.518 0.619 2.861 1.278 3.831

14.158 3.063 31.820 3.840 58.231

0.758 0.770 0.780 0.811 0.848

3026.6 (<0.01) 55,104 (<0.01) 59,282 (<0.01) 54,165 (<0.01) 73,239 (<0.01)

3.0149 (0.221) 741.26 (<0.01) 373.42 (<0.01) 451.34 (<0.01) 269.4 (<0.01)

T4 FARIMA-N FARIMA – t FARIMA-GARCH-N FARIMA-GARCH – t

40 0 0 95,914 84,745 96,153 94,273

42.528 39.960 44.355 39.865 42.527

22.150 18.840 29.392 18.049 25.548

1.363 0.303 2.202 0.541 6.576

5.325 -0.221 15.225 0.915 139.406

0.861 0.762 0.785 0.767 0.822

3382.2 (<0.01) 76,093 (<0.01) 76,128 (<0.01) 74,702 (<0.01) 85,437 (<0.01)

1.538 (0.464) 178,79 (<0.01) 174.3 (<0.01) 217.92 (<0.01) 100.87 (<0.01)

T5 FARIMA-N FARIMA – t FARIMA-GARCH-N FARIMA-GARCH – t

40 0 0 11,286 12,319 18,447 88,520

14.309 8.425 13.010 9.232 34.873

10.915 6.476 12.499 8.199 20.564

2.143 1.184 2.253 2.146 1.031

10.112 1.328 8.170 7.724 4.287

0.827 0.805 0.854 0.849 0.973

3240.3 (<0.01) 6982.1 (<0.01) 9665.2 (<0.01) 12,525 (<0.01) 85,103 (<0.01)

9.4833 (<0.01) 303.03 (<0.01) 290.81 (<0.01) 215.51 (<0.01) 48.513 (<0.01)

Given the ﬁtted models, we next simulate 10 0,0 0 0 values from each model, out of which we keep only those that are larger than the minimum value of each trace. The procedure is repeated four times for each trace, in order to generate from FARIMA and FARIMA-GARCH with Normal and Student’s t errors. In Table 6 we present the ﬁnal size of the simulated traces, their descriptive statistics (mean, standard deviation, skewness and kurtosis), values of Hurst exponents (H) using the Aggregated Variance method, the results from the Ljung-Box tests for autocorrelation and White’s NN test for non-linearity. For readability purposes we also report the corresponding values for the real traces shown in Table 2. Using the BIC values in Table 5, one could conclude that FARIMA-GARCH with Student’s t distribution for the residuals gives the best model for all ﬁve datasets. However, in Table 6, this conclusion is overturned with the calculated descriptive statistics. It is further observed that the traces generated from the classical FARIMA models (FARIMA-N) underestimate the mean, variance, skewness and kurtosis, but capture reasonably well the estimations for H. On the other hand, the traces generated by FARIMA with Student’s t innovations (FARIMA-t) give descriptive statistics closer to those of real traces, even though sometimes excessively overestimated, especially with kurtosis. Traces generated from classical FARIMA-GARCH models (FARIMA-GARCH-N) in some cases display a slight improvement over the FARIMA-N in capturing descriptive statistics; however they still underestimated most parameters. Finally, FARIMA-GARCH with Student’s t innovations (FARIMA-GARCH-t) generated traces that signiﬁcantly overestimated the descriptive statistics, for example the mean and sd for Silence of the Lambs (Tr1) or the kurtosis in Star Trek (Tr3) and Die Hard III (Tr4). We should also notice that all models generated traﬃc for which the assumption of linearity is rejected, while this was not the case for at least two of the traces. On the other hand, the Ljung-Box tests for autocorrelation applied on the synthetic traces agree with the corresponding results on the real traces. Judging from the statistics calculated for the synthetic traces, FARIMA with Student’s t innovations and FARIMA-GARCH with normal innovations appear to be closer to those calculated from the real traces. Therefore we expect that they would be more successful in generating synthetic traﬃc that mimic closely the behavior of each corresponding video. The next step was to verify this assertion by letting real and synthetic traces compete in the performance evaluation scenario presented in Section 4, compare them and then draw conclusions about the accuracy and appropriateness of the models. Each trace is evaluated based on its performance in queuing situations where either the buffer size is held constant at 100 (kbytes) and the service rate varies as percentage of a predeﬁned reference value equal to the estimated mean of the trace plus one standard deviation or where the service rate is held constant at a level equal to the 80% of the reference value and the buffer size varied from 50 to 450 (kbytes). For both evaluations the performance measure is the loss probability measured from the simulation run with the corresponding queuing system. Fig. 1 displays the results of those simulations with the queuing systems, when each one of the ﬁve real traces was used as input process as well as each of the corresponding four synthetic traces generated from the different time series models. According to Fig. 1, the traces generated by the FARIMA-N models always underestimated the loss probability and captured the queuing behavior of the actual trace in very few situations, e.g. the Star Trek trace and only for high service rates.

136

C. Katris, S. Daskalaki / Simulation Modelling Practice and Theory 75 (2017) 127–145

Fig. 1. Performance evaluation of FARIMA and FARIMA-GARCH traﬃc generators.

C. Katris, S. Daskalaki / Simulation Modelling Practice and Theory 75 (2017) 127–145

137

Fig. 2. Empirical distributions for the real traces compared to Lognormal models.

The behavior of the traces generated by the FARIMA-GARCH-N models in most cases is close to that of the FARIMA-N models, and only in the case of Silence of the Lambs was very close to the actual trace. On the contrary, FARIMA-GARCH-t overestimated signiﬁcantly the loss probability (except of the trace for Die Hard III) and so it appears to be unsuitable as a model, despite of the smallest BIC values it gave in ﬁtting. The model with the best overall generators appears to be the FARIMAt, since it gives better estimates compared to the others. Still its performance is far from being considered as completely satisfying. Speciﬁcally, when buffer size varies loss probabilities almost always are overestimated with FARIMA-t, and when service rate varies then FARIMA-t follows the general trend of loss probabilities but only in few occasions it fully captures the queuing behavior of the real traﬃc. For this reason in the next section we attempt a new approach that involves ﬁrst the generation of a data series by a FARIMA model and afterwards its transformation to lognormal variates through its inverse CDF function. 5.3. Synthetic traﬃc using the 2-step procedure The next stage of our analysis was to compare the “best” so far model, shown previously to be the FARIMA-t, with the 2step procedure for generating traﬃc presented in Section 3. Before doing so, however, the empirical distribution is calculated for all traces and compared to Lognormal as depicted in Fig. 2. For the construction of the empirical distributions there are a few different kernel functions that can be used, such as uniform, triangular, Epanenchnikov and Gaussian. For this work, the Gaussian kernels were used and for the bandwidth h the univariate plug-in selection method of [40], respectively. The ﬁtting of the Lognormal models for each trace was realized using the PDF of LN (μ ˆ log yi , σˆ log yi ). It is quite clear from Fig. 2 that the Lognormal models for all ﬁve traces ﬁt very satisfactorily on the empirical distributions plotted, so these were our choices as target distributions for the 2nd step of the procedure that follows. The procedure starts by generating a trace using a classical FARIMA model and afterwards all values are transformed into Lognormal variates, which form a new synthetic trace with different characteristics compared to the original one. We now want to compare the statistical characteristics and queuing performance of the new traces (named FARIMA-toLN) with the most successful model of the FARIMA-based models (FARIMA-t) and two benchmarks (ARIMA and Lognormal). We consider as benchmark models the ARIMA, which fails to capture the LRD property but it is considered as a very good choice for capturing SRD, and the lognormal distribution which generates independent data. For each model all parameters

138

C. Katris, S. Daskalaki / Simulation Modelling Practice and Theory 75 (2017) 127–145 Table 7 Statistical characteristics of synthetic traces. Trace

BIC

# of data

Mean

Sd

Skewness

Kurtosis

H

Ljung-box test

White NN test

T1 Lognormal ARIMA FARIMA- t FARIMA-to-LN

8.649 7.668 6.837 6.612

40 0 0 10 0,0 0 0 82,060 25,055 99,997

32.108 31.563 40.669 33.801 31.750

31.343 29.536 24.551 33.565 29.310

3.256 3.707 0.619 3.381 3.010

19.438 29.114 0.033 26.634 15.676

0.898 0.488 0.626 0.816 0.913

3496.5 (<0.01) 1.071 (0.301) 66,620 (<0.01) 22,494 (<0.01) 89,062 (<0.01)

7.223 (0.075) 4.367 (0.113) 680.44 (<0.01) 315.42 (<0.01) 688.34 (<0.01)

T2 Lognormal ARIMA FARIMA- t FARIMA-to-LN

8.191 7.304 7.036 5.440

40 0 0 10 0,0 0 0 96,031 86,672 10 0,0 0 0

31.608 31.746 32.590 28.983 31.813

16.977 17.929 15.472 17.544 17.879

1.922 1.925 0.295 1.166 1.895

10.453 7.276 -0.246 7.261 7.129

0.827 0.489 0.699 0.823 0.816

2780 (<0.01) 0.315 (0.574) 62,327 (<0.01) 63,984 (<0.01) 65,906 (<0.01)

5.7184 (0.057) 2.934 (0.231) 190.79 (<0.01) 330.73 (<0.01) 718.37 (<0.01)

T3 Lognormal ARIMA FARIMA- t FARIMA-to-LN

7.381 6.494 6.165 5.015

40 0 0 10 0,0 0 0 90,480 75,075 10 0,0 0 0

17.640 17.835 19.726 19.128 17.876

12.799 13.930 10.827 14.356 13.889

2.518 2.898 0.462 2.861 2.905

14.158 17.238 -0.147 31.820 18.605

0.758 0.489 0.616 0.780 0.790

3026.6 (<0.01) 0.736 (0.391) 62,065 (<0.01) 59,282 (<0.01) 70,896 (<0.01)

3.0149 (0.221) 3.789 (0.15) 447.12 (<0.01) 373.42 (<0.01) 925.39 (<0.01)

T4 Lognormal ARIMA FARIMA-t FARIMA-to-LN

8.745 7.169 6.725 5.281

40 0 0 10 0,0 0 0 96,449 84,745 10 0,0 0 0

42.528 42.626 43.670 44.355 42.705

22.150 23.415 20.314 29.392 23.279

1.363 1.862 0.269 2.202 1.8064

5.325 6.789 -0.253 15.225 6.330

0.861 0.489 0.666 0.785 0.769

3382.2 (<0.01) 0.290 (0.59) 79,090 (<0.01) 76,128 (<0.01) 79,992 (<0.01)

1.538 (0.464) 2.8783 (0.237) 160.21 (<0.01) 174.3 (<0.01) 432.14 (<0.01)

T5 Lognormal ARIMA FARIMA-t FARIMA-to-LN

6.954 5.949 5.608 4.328

40 0 0 10 0,0 0 0 89,059 12,319 10 0,0 0 0

14.309 14.291 16.310 13.010 14.414

10.915 11.375 9.072 12.499 11.596

2.143 2.972 0.491 2.253 2.541

10.112 18.184 -0.124 8.170 10.661

0.827 0.489 0.631 0.854 0.940

3240.3 (<0.01) 0.768 (0.381) 66,360 (<0.01) 9665.2 (<0.01) 88,568 (<0.01)

9.4833 3.8488 503.25 290.81 705.23

(<0.01) (0.146) (<0.01) (<0.01) (<0.01)

are determined with a typical ﬁtting procedure to the actual traces and then from each model we simulate 10 0,0 0 0 values to form synthetic traﬃc. Table 7 displays the statistical characteristics of the synthetic traces generated by the different models. Speciﬁcally it shows the BIC values from the model ﬁtting procedure, the ﬁnal size of each trace after cleaning the simulated data, the descriptive statistics, H exponents, and outcomes of Ljung-Box and White NN tests for autocorrelation and nonlinearity. The BIC value for the FARIMA-to-LN models is calculated approximately as a function of sample size instead of the log-likelihood formula used in all other cases. We observe that the FARIMA-to-LN models always give the smallest BIC value, however since this is only an approximation we cannot extract any useful conclusion from it. Comparison of the statistical characteristics indicates that the traces generated from the FARIMA-to-LN models in most cases approximate fairly well the corresponding values of the actual traces, but so do also the traces generated from the FARIMA-t. An additional comparison is presented Fig. 3, where we compare again the four models for their ability to capture the queuing behavior of real traces across a range of buffer sizes and service rates. We observe that the synthetic traces generated from the procedure FARIMA-to-LN emulate closer the queuing behavior of the real traces by estimating quite accurately the loss probability in most cases. An exception occurs at The Firm trace, where FARIMA-to-LN overestimates the loss only across buffer sizes and then FARIMA-t is more accurate. This is not the case, however, for lower service rates where again the FARIMA-to-LN trace is closer to the real trace. In Fig. 3 we observe that overall the FARIMA-to-LN approximates the queuing behavior of the real traﬃc consistently and is more accurate than the other models in almost all cases. With Lognormal distribution, we take into account only the PDF of the data and we omit the dependence structure, i.e. SRD and LRD. In this case the model underestimates loss probability, and the accuracy is very low. FARIMA-t and ARIMA models only in rare occasions seem to approach the real traﬃc, e.g. ARIMA is close with Mr. Bean and Star Trek traces only for large buffer sizes and FARIMA-t is close to the Firm trace across service rates. 5.4. Experimental evaluation of dependence structure In this section we experiment further with the 2-step procedure applied previously. More speciﬁcally our interest is focused on whether the projection of a data series to a Lognormal distribution through its inverse CDF maintains, and at what level, the dependence structure that existed in the original series. For this reason, we consider simulated traces either completely random from a Normal(0, 1) distribution, or with some short range dependence structure from an ARIMA(1, 1) and an ARIMA(1, 1)-GARCH(1, 1) model, or with some long range dependence structure from a FARIMA(1, d, 1) and a FARIMA(1, d, 1)-GARCH(1, 1)model. For each one of these models we simulated eight traces of different sizes and then applied the transformation to the Lognormal with parameters 0 and 1. The sizes of the traces varied from 10 0 0 to 50 0,0 0 0 observations. The means and standard deviations are calculated for all transformed traces and are depicted in Fig. 4. It is obvious that samples are required to be of size approximately 10 0,0 0 0 in order for the ﬁrst two moments to converge to certain

C. Katris, S. Daskalaki / Simulation Modelling Practice and Theory 75 (2017) 127–145

139

Silence of the Lambs (Tr1) 0,6

0,3 0,25

0,4

Loss

Loss

0,2 0,15

0,2

0,1 0,05

0

0 50

100

150

200

250

300

350

400

0,4

450

0,5

0,6

0,7

0,8

0,9

1

0,8

0,9

1

Service Rates

Buﬀer Size

Mr. Bean (Tr2) 0,8

0,30 0,25

0,6

Loss

Loss

0,20 0,15 0,10

0,4

0,2

0,05 0

0,00 50

100

150

200

250

300

350

400

0,4

450

0,5

0,6

0,7 Service Rates

Buﬀer Size

Star Trek (Tr3) 0,6

0,30 0,25

0,4

Loss

Loss

0,20 0,15

0,2

0,10 0,05

0

0,00 50

100

150

200

250

300

350

400

0,4

450

0,5

0,6

0,7

0,8

0,9

1

Service Rates

Buﬀer Size

Die Hard III (Tr4) 0,8

0,30 0,25

0,6

Loss

Loss

0,20 0,15 0,10

0,4

0,2

0,05 0

0,00 50

100

150

200

250

300

350

400

0,4

450

0,5

0,6

0,7

0,8

0,9

1

0,8

0,9

1

Service Rates

Buﬀer Size

The Firm (Tr5) 0,30

0,6

0,25 0,4

Loss

Loss

0,20 0,15

0,2

0,10 0,05

0

0,00 50

100

150

200

250 Buﬀer Size

300

350

400

450

0,4

0,5

0,6

0,7 Service Rates

Fig. 3. Performance evaluation of FARIMA-to-LN, benchmark models and FARIMA-t.

140

C. Katris, S. Daskalaki / Simulation Modelling Practice and Theory 75 (2017) 127–145

1,68

3

1,67

2,5 Standard Deviation

1,66

Mean

1,65 1,64 1,63 1,62 1,61 1,6

2 Normal

1,5

ARIMA

1

ARIMA/GARCH FARIMA

0,5

1,59

FARIMA/GARCH

1,58

0

Observations

12

600

10

500

8

400 Kurtosis

Skewness

Observations

6

Normal

300

ARIMA

4

200

ARIMA/GARCH

2

100

FARIMA FARIMA/GARCH

0

0

observations

observations Fig. 4. Moments of the transformed traces.

values. On the contrary, skewness and kurtosis, require samples of larger size in order to converge to some value, especially for the FARIMA-GARCH model. In addition, the Hurst exponents are calculated for the original samples and for the transformed ones and their values, along with the % change between them, are reported in Table 8 and depicted in Fig. 5. One may observe that the random generator for Normal variates produces a sample of independent numbers, so the H exponent is approximately 0.5 both for the original and the transformed traces. Similarly, the ARIMA (1,1) and ARIMA-GARCH (1,1)/(1,1) simulators produce traces that do not carry long memory and H converges to some value around 0.51, while the transformed traces follow very close the original values. Lastly, the FARIMA (1,0.25,1) and FARIMA-GARCH (1,0.25,1) /(1,1) simulators produce traces that display LRD with the H exponent to converge to some value close to 0.75. When the transformation to lognormal distribution is applied the new traces lose some of it and converge to a smaller value, around 0.71 and 0.72, respectively, however still indicating LRD. Looking at Fig. 5(a) and (b) it is obvious that the three ﬁrst simulators converge to their corresponding H values with samples 10,0 0 0 observations and above, however for FARIMA and FARIMA/GARCH it requires at least 50,0 0 0. The detailed values for the aforementioned comparison are given in Table 8, where the percent differences between the H estimates for the original and transformed data series are also given, where Fig. 5(c) depicts the corresponding absolute percent differences. The experimentation just performed indicates that the transformed traces do maintain a major part of the dependence structure they carried prior to their projection. Apparently in order to generate traces that mimic closely real traﬃc it is necessary to start with a trace that carries the desired dependencies, measured by H, and the transformation through the inverse CDF of a suitable Lognormal distribution can do the rest. Another concern that this experimentation raised was the characterization of the difference observed between the H estimates for the original and the transformed data series. For this reason, and since it is known that there is not a single “best” technique for estimating the Hurst exponent, we extended the investigation to different techniques for estimating it. More speciﬁcally, we estimated H for the actual and the synthetic traces generated from the 2-step procedure using 6 different techniques: R/S, Aggregate Variance, Periodogram, Higutchi and Discrete Wavelet transform. Table 9 displays the estimation for the actual traces, for the simulated traces with FARIMA-to-LN and their absolute difference for each one technique, while Table 10 displays the average absolute differences between actual and simulated traces recorded for each estimation technique. It is observed that the average absolute differences between the estimations of H for the original and the transformed traces vary between 0.0027 (Higuchi method) and 0.0919 (R/S) for the different techniques. Using the Higuchi method the absolute difference never exceeded 0.009, using the Wavelet transforms method never exceeded 0.11,

C. Katris, S. Daskalaki / Simulation Modelling Practice and Theory 75 (2017) 127–145

141

Table 8 Statistical characteristics of original and transformed traces. Size of dataset Normal (0,1)

H (original)

H (transformed)

% change

10 0 0 10,0 0 0 50,0 0 0 10 0,0 0 0 20 0,0 0 0 30 0,0 0 0 40 0,0 0 0 50 0,0 0 0

0.54475 0.49554 0.48607 0.49257 0.49555 0.49547 0.49532 0.49599

0.62510 0.48491 0.50024 0.49502 0.49589 0.49710 0.49626 0.49775

14.75% -2.15% 2.91% 0.50% 0.07% 0.33% 0.19% 0.35%

0.53353 0.48902 0.49100 0.50228 0.50807 0.51003 0.50856 0.51016

14.08% −0.73% −2.50% −1.37% −1.09% −0.37% −0.60% −0.38%

ARIMA (1,1) – ar1 = 0.3, ma1 = 0.2, 10 0 0 10,0 0 0 50,0 0 0 10 0,0 0 0 20 0,0 0 0 30 0,0 0 0 40 0,0 0 0 50 0,0 0 0

μ = 0, σ = 1

ARIMA/GARCH (1,1) / (1,1) -μ = 0, 10 0 0 10,0 0 0 50,0 0 0 10 0,0 0 0 20 0,0 0 0 30 0,0 0 0 40 0,0 0 0 50 0,0 0 0

ar1 = 0.3, ma1 = 0.2, ω = 0.0 0 0 01, 0.47748 0.49210 0.50371 0.50946 0.51388 0.51187 0.51169 0.51235

0.46769 0.49261 0.50357 0.5092 0.51369 0.51194 0.51162 0.51209

FARIMA (1,d,1) - ar1 = 0.3, ma1 = 0.2, d = 0.25, μ = 0, σ = 1 10 0 0 0.73209 10,0 0 0 0.71606 50,0 0 0 0.73352 10 0,0 0 0 0.74469 20 0,0 0 0 0.74043 30 0,0 0 0 0.74923 40 0,0 0 0 0.75011 50 0,0 0 0 0.75097 FARIMA/GARCH (1,d,1) / (1,1) - ar1 = 0.3, ma1 = 0.2, μ = 0, α 1 = 0.05, 10 0 0 0.72709 10,0 0 0 0.71483 50,0 0 0 0.73369 10 0,0 0 0 0.74511 20 0,0 0 0 0.74082 30 0,0 0 0 0.74943 40 0,0 0 0 0.75020 50 0,0 0 0 0.75114

α 1 = 0.05, β 1 = 0.05 0.53634 0.49384 0.49292 0.50448 0.50870 0.51046 0.50919 0.51112

12.33% 0.35% −2.14% −0.98% −1.01% −0.28% −0.49% −0.24%

0.71466 0.66313 0.69505 0.70963 0.70816 0.71526 0.71384 0.71606

−2.38% −7.39% −5.25% −4.71% −4.36% −4.53% −4.83% −4.65%

β 1 = 0.05, ω = 0.00001, d = 0.25 0.71313 0.65209 0.68631 0.69811 0.69879 0.70535 0.70449 0.70612

−1.92% −8.78% −6.46% −6.31% −5.67% −5.88% −6.09% −5.99%

Table 9 Hurst Exponent estimations with different methods (actual traces, simulated traces, absolute difference). Trace

Wavelet

R/S

Aggregated variance

Periodogram

Higuchi

Average

Silence of The Lambs

1.0473 1.0475 0.0 0 02

0.9173 0.8424 0.0749

0.8977 0.9125 0.0148

1.0602 0.9962 0.0640

0.9602 0.9692 0.0090

0.97654 0.95356 0.0326

0.8549 0.9362 0.0813

0.7148 0.7938 0.0790

0.8271 0.8164 0.0107

0.9088 0.9745 0.0657

0.9702 0.9690 0.0012

0.85516 0.89798 0.0476

0.9860 0.9100 0.0760

0.8156 0.7822 0.0334

0.7579 0.7905 0.0326

0.9783 0.8571 0.1212

0.9704 0.9695 0.0 0 09

0.90164 0.86186 0.0528

1.1142 1.0069 0.1073

1.0 0 06 0.8530 0.1476

0.8606 0.7692 0.0914

1.0193 0.9631 0.0562

0.9700 0.9692 0.0 0 08

0.99294 0.91228 0.0807

1.1127 1.0360 0.0767

0.7049 0.8297 0.1248

0.8275 0.9398 0.1123

1.0222 1.0055 0.0167

0.9671 0.9688 0.0017

0.92688 0.95596 0.0664

Abs. Diff. Mr. Bean Abs. Diff. Star Trek Abs. Diff. Die Hard III Abs. Diff. The Firm Abs. Diff.

142

C. Katris, S. Daskalaki / Simulation Modelling Practice and Theory 75 (2017) 127–145

(b)

0,8

0,8

0,7

0,7

0,6

0,6

H (transformed)

H (original)

(a)

0,5 0,4 0,3 0,2

0,5 0,4 0,3 0,2

0,1

0,1

0

0

Observaons

Observaons

(c) 16,00%

H (% change)

14,00% 12,00% 10,00%

Normal

8,00%

ARIMA

6,00%

ARIMA/GARCH

4,00% 2,00%

FARIMA

0,00%

FARIMA/GARCH

Observaons Fig. 5. Hurst exponent values for (a) initial and (b) transformed data; (c) absolute % change in H. Table 10 Absolute Differences per method. Method

Average Absolute Difference

Wavelet R/S Aggregated Variance Periodogram Higuchi Average of all Methods

0.0683 0.0919 0.0524 0.0648 0.0027 0.0560

while the worst case was 0.15 (with R/S). The average of all absolute differences was found to be 0.056, so we conclude that the LRD measured in the simulated traces was very close to that of the original. 5.5. Generating traﬃc for a new video The methodology that was described in Section 3 and used for the experimentation presented previously has been shown to capture satisfactorily the static and time-dependent statistical characteristics of the corresponding real traces, resulting eventually to similar queuing behaviors when fed to a FIFO queuing system. The challenge now is to use the methodology in more practical situations where the real trace is not available and instead there is only an estimate (or a target value) for the mean and standard deviation. Since it is not necessary for the procedure to track down the exact LRD structure in a video trace in order to be successful then adopting a “typical” FARIMA model as reference would probably be suﬃcient. As discussed earlier, in our case we adopted the FARIMA (1,0.3,1) as generator for the initial trace and then projected it to

C. Katris, S. Daskalaki / Simulation Modelling Practice and Theory 75 (2017) 127–145

Fig. 6. Queuing performance of FARIMA(r)-to-LN.

143

144

C. Katris, S. Daskalaki / Simulation Modelling Practice and Theory 75 (2017) 127–145 Table 11 Statistical characteristics of the traces generated by the FARIMA(r)-to-LN procedure. Trace

Mean

Sd

Skewness

Kurtosis

H

Ljung-Box test

White NN test

T1 FARIMA(r)-to-LN Abs. Difference

32.108 32.138 0.030

31.343 31.663 0.320

3.256 4.147 0.891

19.438 44.568 25.130

0.898 0.771 0.127

3496.5 (<0.01) 67,914 (<0.01)

7.223 (0.075) 1296.2 (<0.01)

T2 FARIMA(r)-to-LN Abs. Difference

31.608 31.610 0.002

16.977 17.023 0.046

1.922 1.822 0.100

10.453 9.678 0.775

0.827 0.786 0.041

2780 (<0.01) 72,592 (<0.01)

5.7184 (0.057) 594.64 (<0.01)

T3 FARIMA(r)-to-LN Abs. Difference

17.640 17.644 0.004

12.799 12.867 0.068

2.518 2.678 0.160

14.158 18.582 4.424

0.758 0.780 0.022

3026.6 (<0.01) 70,896 (<0.01)

3.0149 (0.221) 925.39 (<0.01)

T4 FARIMA(r)-to-LN Abs. Difference

42.528 42.531 0.003

22.150 22.207 0.057

1.363 1.756 0.393

5.325 9.166 3.841

0.861 0.786 0.074

3382.2 (<0.01) 72,734 (<0.01)

1.538 (0.464) 568.35 (<0.01)

T5 FARIMA(r)-to-LN Abs. Difference

14.309 14.312 0.003

10.915 10.979 0.064

2.143 2.870 0.727

10.112 21.192 11.080

0.827 0.778 0.049

3240.3 (<0.01) 70,361 (<0.01)

9.4833 (<0.01) 924.2 (<0.01)

a Lognormal distribution with parameters that are estimated using the MCMC routine from the bayescount R package [52]. Using the ﬁve traces T1-T5 we apply this idea and compare the performance of the newly generated traces (FARIMA(r)-to-LN) with the actual ones. Table 11 displays the descriptive statistics, Hurst exponents for the traces generated by the FARIMA(r)-to-LN procedure and for the real traces for comparison; in addition, the outcomes from the Ljung-Box and White NN tests. With the exception of kurtosis, in most cases the descriptive statistics calculated for the synthetic traces are very close to the corresponding values calculated for the real traces. The H exponent in most cases is different than the values we saw in Table 7, since now the FARIMA model is just a reference model, however it still indicates signiﬁcant LRD for all traces. Lastly, the outcomes of the Ljung-Box and White NN tests for the simulated traces indicate autocorrelation and non-linear dependencies for all traces. With such proximity in most values of the statistical parameters observed in Table 11 it is expected that the synthetic traces will emulate closely the real traces in a queuing situation. This expectation is veriﬁed in Fig. 6 where the synthetic traces FARIMA(r)-to-LN are fed as input to the FIFO queuing system described in Section 4. Again the performance measure is the loss probability and is compared with the performance of the real traces. Their absolute difference in estimating loss probability never exceeded the value of 0.06, indicating that the two traces have very similar performance. 6. Summary and conclusions In this paper a 2-step procedure based on time series and the Inverse Sampling method is proposed in order to generate synthetic video traﬃc. The basis for the procedure is the classical FARIMA model and the simulated traces are transformed using the inverse CDF of a “target” marginal distribution, Lognormal in our case. The result is synthetic traﬃc with features such as autocorrelation, SRD, LRD and marginal distribution that is close to distributions observed in real video traﬃc. The procedure has been applied to ﬁve different publicly available video traces and the synthetic traces produced displayed behavior closer to that of the real traces, when compared to other generators in the FARIMA and FARIMA-GARCH family with normal or Student’s t innovations, or to other benchmarking models such as the and the Lognormal distribution. Finally, we considered the case of a new video trace for which we may only have estimation or a guess for its mean and standard deviation and not the actual trace. In such a case the proposed procedure is adjusted to start with a typical FARIMA model and then be projected to the desired Lognormal distribution. Apparently such an adjustment does not nullify the usefulness of the suggested procedure, on the contrary it is shown experimentally that the synergy between the FARIMA reference model and the Lognormal distribution can provide traces that emulate quite satisfactorily the behavior of the real traces. References ´ [1] E. Casilari, A. Reyes, A. Diaz-Estrella , F. Sandoval, Classiﬁcation and comparison of modelling strategies for VBR video traﬃc, in: Proceedings of International Teletraﬃc Congress (ITC-16)’99, 1999. [2] A. Alheraish, Autoregressive video conference models, Int. J. Netw. Mgmt. 14 (5) (2004) 329–337. ´ [3] E. Casilari, A. Reyes, A. Diaz-Estrella , F. Sandoval, Characterisation and modelling of VBR video traﬃc, Electron. Lett. 34 (10) (1998) 968–969. [4] S. Tanwir, H. Perros, A survey of VBR video traﬃc models, IEEE Commun. Surv. Tutorials 15 (4) (2013) 1778–1802. [5] M. Krunz, S.K. Tripathy, On the characterization of VBR MPEG streams, Perform. Eval. Rev. 25 (1) (1997) 192–202. [6] H. Koumaras, C. Skianis, A. Kourtis, Analysis and modeling of H.264 unconstrained VBR video traﬃc, Int. J. Mob. Comput. Multimedia Commun. 1 (4) (2009) 14–31 IGI-Global. [7] D. Liu, E.I. Sára, W. Sun, Nested auto-regressive processes for MPEG-encoded video traﬃc modeling. IEEE Transactions on Circuits and Systems for Video Technology, 11(2), 169-183. [8] D.P. Heyman, T.V. Lakshman, Source models for VBR broadcast-video traﬃc, IEEE/ACM Trans. Netw. 4 (1) (1996) 40–48. [9] A. Lazaris, P. Koutsakis, Modeling multiplexed traﬃc from H.264/AVC videoconference streams, Comput. Commun. 33 (10) (2010) 1235–1242.

C. Katris, S. Daskalaki / Simulation Modelling Practice and Theory 75 (2017) 127–145

145

[10] M. Frey, S. Nguyen-Quang, A gamma-based framework for modeling variable-rate MPEG video sources: the GOP GBAR model, IEEE/ACM Trans. Netw. 8 (6) (20 0 0) 710–719. [11] Q.T. Zhang, A general AR-based technique for the generation of arbitrary gamma VBR video traﬃc in ATM networks, IEEE Trans. Circuits Syst. Video Technol. 9 (7) (1999) 1130–1137. [12] A. Alheraish, S.A. Alshebeili, T. Alamri, A GACS modeling approach for MPEG broadcast video, IEEE Trans. Broadcast. 50 (2) (2004) 132–141. [13] J.-L.C. Wu, Y.-W. Chen, C.-C. Shiu, Traﬃc modeling and bandwidth allocation for MPEG video sources in ATM networks, in: IEEE Global Telecommunications Conference, 3, 1995, pp. 2237–2241. [14] M.W. Garrett, W. Willinger, Analysis, modeling and generation of self-similar VBR video traﬃc, in: ACM SIGCOMM, 1994, pp. 269–280. [15] C. Huang, M. Devetsikiotis, I. Lambadaris, A.R. Kaye, Modeling and simulation of self-similar variable rate compressed video: a uniﬁed approach, in: ACM SIGCOMM, 1995, pp. 114–125. [16] H. Liu, N. Ansari, Y.Q. Shi, Modeling VBR video traﬃc by Markov-modulated self-similar processes, in: IEEE 3rd Workshop on Multimedia Signal Processing, 1999, pp. 363–368. [17] N. Ansari, H. Liu, Y.Q. Shi, H. Zhao, On modeling MPEG video traﬃcs, IEEE Trans. Broadcast. 48 (4) (2002) 337–347. [18] U.K. Sarkar, S. Ramakrishnan, D. Sarkar, Modeling full-length video using Markov-modulated Gamma-based framework, IEEE/ACM Trans. Netw. 11 (4) (2003) 638–649. [19] Y. Sun, J.N. Daigle, A source model of video traﬃc based on full-length VBR MPEG4 video traces, in: GLOBECOM-IEEE Global Telecommunications Conference, 2, 2005, pp. 766–770. [20] M. Dai, Y. Zhang, D. Loquinov, A uniﬁed traﬃc model for MPEG-4 and H.264 video traces, IEEE Trancs. Multimedia 11 (5) (2009) 1010–1023. [21] M. Sheng, C. Ji, Modeling heterogeneous network traﬃc in wavelet domain, IEEE/ACM Trans. Netw. 9 (5) (2001) 634–649. [22] M. Cario, B. Nelson, Autoregressive to anything: time-series input processes for simulation, Oper. Res. Lett. 19 (1996) 51–58. [23] B. Biller, B. Nelson, Fitting time-series input processes for simulation, Oper. Res. 53 (2005) 549–559. [24] C. Granger, R. Joyeux, An introduction to long-memory time series models and fractional differencing, J. Time Ser. Anal. 1 (1980) 15–29. [25] J. Hosking, Fractional differencing, Biometrika 68 (1981) 165–176. [26] M. Corradi, R. Garroppo, S. Giordano, M. Pagano, Analysis of F-ARIMA processes in the modelling of broadband traﬃc, in: International Conference on Communications (ICC 2001), 3, IEEE, 2001, pp. 964–968. [27] R. Garroppo, S. Giordano, S. Lucetti, F. Russo, Comparison of LRD and SRD traﬃc models for the performance evaluation of ﬁnite buffer systems, in: International Conference on Communications, 2001 (ICC 2001), 9, IEEE, 2001, pp. 2681–2686. [28] G. Chiruvolu, R. Shankar, An approach towards resource management and transportation of VBR video traﬃc, in: International Conference on Communications (ICC ’97), IEEE, 1997, pp. 550–554. [29] N. Sadek, A. Khotanzad, T. Chen, ATM dynamic bandwidth allocation using F-ARIMA prediction model, in: International Conference on Computer Communications and Networks (ICCCN), IEEE, 2003, pp. 359–363. [30] C. Katris, S. Daskalaki, Combining time series forecasting methods for internet traﬃc, in: Stochastic Models, Statistics and their Applications, 2015, pp. 309–317. [31] J. Geweke, S. Porter-Hudak, The estimation and application of long memory time series models, J. Time Ser. Anal. 4 (1983) 221–238. [32] R. Baillie, C. Chung, M. Tieslau, Analysing inﬂation by the fractionally integrated ARFIMA-GARCH model, J. Appl. Econ. 11 (1996) 23–40. [33] S. Kim, Forecasting internet traﬃc by using seasonal GARCH models, J. Commun. Netw. 13 (2011) 621–624. [34] B. Zhou, D. He, Z. Sun, W. Ng, Network traﬃc modeling and prediction with ARIMA/GARCH, in: Proceedings Of HET-Nets Conference, 2005, pp. 1–10. [35] A. Syed, H. Saleem, H. Syed, MCMC simulation of GARCH model to forecast network traﬃc load, Int. J. Comput. Sci. Inf. 9 (2012) 277–284. [36] R. Engle, T. Bollerslev, Modelling the persistence of conditional variances, Econometric Rev. 5 (1986) 1–50. [37] I. Antoniou, V. Ivanov, V. Ivanov, P. Zrelov, On the log-normal distribution of network traﬃc, Physica D 167 (2002) 72–85. [38] B. Silverman, Density Estimation for Statistics and Data Analysis, Chapman and Hall, London, 1986. [39] T. Duong, ks: Kernel Smoothing, 2015. [40] M.P. Wand, C. Jones, Multivariate plug-in bandwidth selection, Comput. Stat. 9 (2) (1994) 97–116. [41] M.S. Taqqu, V. Teverovsky, W. Willinger, Estimators for long-range dependence: an empirical study, Fractals 3 (4) (1995) 785–798. [42] P. Abry, D. Veitch, Wavelet analysis of long-range-dependent traﬃc, IEEE trans. Inf. Theory 44 (1) (1998) 2–15. [43] F. Fitzek, M. Reisslein, MPEG-4 and H.263 video traces for network performance evaluation, IEEE Netw. 15 (2001) 40–54. [44] T. Lee, H. White, C. Granger, Testing for neglected nonlinearity in time series models, J. Econometrics 56 (1993) 269–290. [45] H. Kunsch, The jackknife and the bootstrap for general stationary observations, Ann. Statist. 17 (1989) 1217–1241. [46] P. Hall, J. Horowitz, B. Jing, On blocking rules for the bootstrap with dependent data, Biometrika 82 (1995) 561–574. [47] H. Abarbanel, M. Kennel, Local false nearest neighbors and dynamical dimensions from observed chaotic data, Phys. Rev. E 47 (1993) 3057–3068. [48] R. Frank, N. Davey, S. Hunt, Time series prediction and neural networks, J. Intell. Rob. Syst. 31 (2001) 91–103. [49] C. Katris, S. Daskalaki, Comparing forecasting approaches for Internet traﬃc, Expert Syst. Appl. 42 (2015) 8172–8183. [50] A. Ghalanos, rugarch: univariate GARCH models, 2015. [51] C. Fraley, F. Leisch, M. Maechler, V. Reisen, A. Lemonte, fracdiff: fractionally differenced ARIMA aka ARFIMA(p,d,q) models, 2012. [52] M. Denwood, Bayescount: power calculations and bayesian analysis of count distributions and fecrt data using mcmc, 2015

Generation of synthetic video traffic using time series

Generation of synthetic video traffic using time series

Recommend Documents