Modeling and simulation of data transmission on a hybrid fiber coax cable network


Simulation Modelling Practice and Theory 12 (2004) 239–261 www.elsevier.com/locate/simpat

M. Garcia a,*, D.F. Garcia a, V.G. Garcia a, R. Bonis b

a Department of Computer Science, University of Oviedo, Office 1.2.15, Campus de Viesques, 33204 Gijon, Spain
b TeleCable, s.a., Scientific Park, 33206 Gijon, Spain

Received 9 April 2003; received in revised form 15 September 2003; accepted 29 October 2003 Available online 5 June 2004

Abstract

This paper describes the steps followed in the development, simulation, and later validation of a model for a cable network based on hybrid fiber-coax (HFC) technology, used for data transmission. This work presents a representation of a communication system which has been growing dramatically over recent years and will continue to do so in the near future. The modeling process is based on the analysis of measurements of a cable operator, establishing a direct relationship between model parameters and network characteristics. The modeling technique produces a scalable model capable of simulating the evolution of the real cable network.

© 2004 Elsevier B.V. All rights reserved.

Keywords: Hybrid fiber-coax network; Traffic modeling; Network simulation

* Corresponding author. Tel.: +34-985-182519; fax: +34-985-181986. E-mail address: [email protected] (M. Garcia).

1569-190X/$ - see front matter © 2004 Elsevier B.V. All rights reserved. doi:10.1016/j.simpat.2003.10.005

1. Introduction

This paper describes the steps followed in the development of a simulation model for a cable network providing data transmission services. Cable networks have traditionally been associated with television broadcasts; however, over the last decade the explosive growth of the Internet has sparked an interest in alternatives to traditional telephone lines for Internet access. The advantages of cable networks over telephone lines are their broader bandwidth and their great number of subscribers. Their great disadvantage is the absence of a return path, necessary to make the cable network bidirectional. In consolidated and widely extended cable networks, overcoming this disadvantage requires great investments, making them a less viable alternative as an Internet access medium.

However, there is a new group of cable networks, based on hybrid fiber-coax (HFC) technology, which represents a real alternative. The HFC technology improves network capacity and reliability, which facilitates the implementation of a return path. These characteristics permit the cable operator using this kind of network to act as a global telecommunication operator, providing television, voice and Internet services using the same network.

Currently, the main problems faced by companies using HFC technology are coping with the rapid growth of new subscribers, and the demand for new data services, such as multimedia interactive services. Cable operators must be able to predict how many users could potentially demand these services simultaneously without affecting the performance of the network, and to determine the influence of new services on the HFC network. The best way to answer these and other questions is to use a model of the cable network which includes the new service requirements. At the same time the cable network model can be used by the cable operator to support tuning and capacity planning decisions.

In this paper, the development of a simulation model for data transmission on an HFC cable network is described. This model represents a generic HFC architecture, and is validated in use by a cable operator. The main characteristics of the cable network model are:

• It captures the evolution of network performance over time, rather than at a fixed moment.
• The performance provided by the model is directly related to the network capacity (the percentage of utilization of the network channels).
• The parameters of the model are obtained from network characteristics, i.e. the number of network subscribers and the traffic measurements.
• The relationships established between the traffic measurements and the number of subscribers assigned to each channel make the use of the model very simple.

This paper follows the development of the simulation model. Section 2 provides a general description of the system to be represented. Other related works are summarized in Section 3. The development of the cable network model is divided into two parts: Section 4 describes the way the cable network is used, that is, the traffic model, and Section 5 describes the cable network model. In Section 6, the results obtained are presented and compared with the results obtained from the real cable network. Finally, Section 7 summarizes the main characteristics of the developed model and the conclusions obtained.

2. Description of the system

The simulation model represents a generic cable network based on HFC technology. This technology combines traditional coax cables with fiber optics, establishing a hierarchical architecture. The following is a description of this architecture from the bottom to the top.

The coax cable, also called the "last mile", is the part nearest to the subscribers. In the subscribers' homes, data services are accessed through a cable modem connected to a home PC. This cable modem contains the hardware to receive and transmit signals over the HFC network. It also negotiates access to the HFC network, determining the maximum speed at which it can transmit. Each coax cable is shared by several subscribers and ends at a local optical node. Each optical node supports between 200 and 300 subscribers, in an area of 400–800 m in diameter. In the local optical node the electrical signal transmitted by the coax cable is transformed into an optical signal, to be transmitted over optical fibers towards the head-end channel switch (HCX). This structure constitutes an HFC branch.

The next level in the hierarchical structure is the connection between the different branches. This connection is made using fiber trunks with different topologies.

The model developed is based on the data network of the cable operator TELECABLE, INC., one of the most important cable operators in Spain. This company, created in 1995, began by providing cable TV services; later, with the growth in the demand for telecommunications, it evolved and became a global provider. This operator provides TV, voice and data services in an area including three cities, with half a million potential subscribers. Fig. 1 shows the general architecture of its cable network.

The operator's network is organized following an architecture very similar to the generic architecture previously described. However, it presents particular characteristics related to the technology it implements. The network is also organized in HFC branches, as can be seen in Fig. 1, and there are several HFC branches in each city. All the HFC branches in all the cities are connected by an ATM backbone. In one of the cities, and also connected to the ATM backbone, is the main head-end, where the servers and the Internet access are located.

Fig. 1. Architecture of the cable network.


The cable modem used in the subscribers' homes transforms Ethernet packets into ATM cells and vice versa; it controls access to the channel using a proprietary protocol very similar to the DOCSIS (Data Over Cable Service Interface Specifications) protocol. Data transmission on the HFC branch is bidirectional; there are two channels with very different characteristics: the upstream channel, used by the subscribers to send data requests, and the downstream channel, through which the subscribers receive data. The downstream channel is shared by all the subscribers and has a bandwidth of 30 Mbps. On the other hand, there are up to six upstream channels in each HFC branch, each one with a bandwidth of 1.9 Mbps. Each subscriber is assigned to one of these upstream channels.

All the traffic originating in the HFC branch is sent towards the branch controller (HCX in Fig. 1). This element distinguishes between the traffic whose destination is in the same branch (internal traffic) and the rest of the traffic. The internal traffic is sent back, and the rest of the traffic is sent towards the ATM switch to reach its destination.

The model of the cable network developed in this work is based on the analysis of the traffic measurements taken on all the channels of this network. When the measurements were taken, the cable network providing data services had 17 HFC branches and 17,369 subscribers. These dimensions are adequate to obtain representative information. The cable network works on a 24 h "flat rate" with "best effort" quality of service.

3. Related work

The aim of the developed model is to evaluate the cable network performance, represented mainly by channel bandwidth requirements, as the time of day and number of subscribers change. The use of HFC architecture for data transmission has not been explored in any depth; the main studies in this field are related to standardization efforts for the proposal of an upstream media access control (MAC) protocol. Apart from these studies, only a few papers are devoted to the performance of HFC cable networks. The most representative works are: [1], concerning traffic modeling and analysis of a real HFC branch devoted to telephone applications; [2], an analysis of the support for multimedia applications using ATM technology in an experimental HFC network; and finally [3], in which the performance perceived by the subscriber for web-browsing and interactive applications is evaluated using analytic modeling and simulation.

Like the first two of these three works, the model described in this paper is based on a real system, and it evaluates performance. The differences between this work and related projects, however, are significant: instead of just a part of the cable network, the whole network is represented; performance is represented as a percentage of channel utilization and throughput; the model parameters are related to the number of subscribers assigned by the cable operator to each channel, and the time of day considered; and the simulation model built is scalable and can evolve as the real cable network does.


4. Traffic modeling

There are two aspects to be considered in the model development: the physical behaviour of the cable network, and the way the cable network is being used by the subscribers. The data requests sent by the subscribers through the upstream channels of the cable network constitute the network workload. This workload must be represented by a model: the workload model or traffic model. This section is devoted to the development of the traffic model. In the next section this traffic model will be incorporated into the physical model to define the complete cable network model.

4.1. Background

There are many studies of traffic models, as they have been developed in parallel with the evolution of telecommunication systems. These traffic models can be divided into two groups: those developed before the work of Leland et al. [4], and those developed after. The authors of [4] proved that data traffic on modern networks exhibits the statistical property of self-similarity, which is represented by traffic invariance independently of the time scale considered. Subsequent studies [5–8] show that self-similarity can be considered an inherent property of data traffic in modern telecommunication systems.

In the first group, referred to as traditional traffic models, traffic models were associated mainly with Poisson or Markov processes. A complete review of this kind of model can be found in [9].

The second group began in the early 90s, when it was shown that in order to consider traffic self-similarity and produce valid results, traffic models must be able to reproduce not only first and second order moments (mean and variance), but also the autocorrelation function. Some of the most important traffic models developed are: the PT λ model, the M/Pareto model and the N-Burst model.

The PT λ model [10] represents traffic using a combination of a power tail (PT) distribution (in this case a Pareto distribution) and an exponential (λ) distribution. Using this combination of distributions, it is possible to obtain self-similar traffic, while at the same time the effect of the exponential distribution improves model performance.

The M/Pareto model, developed by Addie et al. [11], is used in the representation of variable bit rate (VBR) traffic. This model considers a superposition of bursts. The number of bursts follows a Poisson distribution with parameter λ, each burst has a constant traffic rate r, and the length of the burst follows a Pareto distribution. The main disadvantage of this model is that it uses four parameters obtained from three statistical traffic properties, so there is a degree of freedom which produces an indetermination. This indetermination makes the process of parameter adjustment very complex.

The N-Burst model [12] is another traffic modeling alternative, based on an ON/OFF process. During the ON period the system transmits and in the OFF period the system is silent. The N-Burst model considers the superposition of N sources of type ON/OFF. By selecting the distribution of the ON period, it is possible to produce self-similar traffic. This model also considers an important parameter known as the "burstiness" parameter, which defines the relationship between the length of the ON and OFF periods.

All these models have two common characteristics: firstly, they consider homogeneous traffic, that is, traffic produced by only one kind of source or application, and secondly, the traffic model parameters are obtained from real traffic analysis, but with no relationship to the characteristics of the telecommunication system evaluated. This situation results in simplified traffic models that are difficult to apply to real telecommunication systems, because real traffic is aggregated and there is no clear correspondence between system characteristics and traffic model parameters.

The traffic model developed here to represent the traffic on the HFC cable network overcomes these limitations by considering the aggregated traffic on the HFC network, and by relating the traffic model parameters directly to the HFC network characteristics. The parameters of this traffic model have been related to the number of subscribers assigned by the cable operator to each channel, and the time of day.

4.2. Traffic model construction

The objective of the traffic model is to represent the way in which the HFC cable network is used by subscribers. The traffic model distinguishes between the traffic on the upstream and downstream channels. The upstream traffic consists of the requests sent by the subscribers. The downstream traffic is produced as a response to these requests. Thus, the traffic model focuses on the upstream traffic, whereas the downstream traffic is obtained from the upstream traffic modified by a random factor. The distribution, mean value and standard deviation of this factor are calculated from the traffic analysis.
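The ON/OFF superposition idea behind the N-Burst model reviewed in Section 4.1 can be illustrated with a minimal Python sketch. It is not the model of [12]: the function name, the slotted time axis, the Pareto shape `alpha`, the exponential OFF periods and all default values are assumptions chosen only for illustration.

```python
import random

def on_off_aggregate(n_sources, n_slots, alpha=1.5, mean_off=10.0, rate=1.0, seed=0):
    """Aggregate load per time slot of n_sources independent ON/OFF sources.

    ON lengths are Pareto distributed (heavy tailed for 1 < alpha < 2) and
    OFF lengths are exponential. All parameter values are illustrative.
    """
    rng = random.Random(seed)
    total = [0.0] * n_slots
    for _ in range(n_sources):
        t = 0
        on = rng.random() < 0.5              # random initial state
        while t < n_slots:
            if on:
                # paretovariate() returns values >= 1, so length >= 1 slot
                length = int(rng.paretovariate(alpha))
                for s in range(t, min(t + length, n_slots)):
                    total[s] += rate         # the source transmits while ON
            else:
                length = int(rng.expovariate(1.0 / mean_off)) + 1
            t += length
            on = not on
    return total

aggregate = on_off_aggregate(n_sources=20, n_slots=1000)
```

Each entry of `aggregate` is the number of sources that are ON in that slot multiplied by `rate`; heavy-tailed ON periods are what produce long bursts in the aggregate.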
The traffic model is based on the analysis of the traffic measurements taken on all the channels of the cable network, the number of subscribers assigned to each channel, and the measurements of IP address assignments to subscribers by the Dynamic Host Configuration Protocol (DHCP) server. These measurements were taken over two different periods of time: January 2001 and January 2002. In the interim, the cable network evolved from 8 to 17 branches and from 8331 to 17,369 subscribers. In spite of the changes in the cable network, the basic relationships found in the analysis are consistent over time. An extended analysis of the measurements can be found in [13].

In order to reproduce the real traffic values on the upstream channels, the traffic model must consider three elements: the upstream channel characteristics, the media access control (MAC) protocol, and the user profile (the way in which subscribers demand service from the HFC network). In the following subsections the influence of each element on the development of the model is described.

4.2.1. Upstream channel

Each upstream channel is represented as a shared medium with a transmission payload of 1.5525 Mb/s. The channel is modeled as shared because the HFC operator does not apply quality of service differences. As a result, all subscribers receive the "best effort" quality of service, that is, all of them compete for the channel.


4.2.2. Cable modem and MAC protocol

Each subscriber connects to the cable network through a cable modem, and communication is controlled by the MAC protocol. The communication on the upstream channel uses a proprietary protocol, whose main characteristics are:

• Access to the channel is organized in frames, which are repeated continuously; a frame is sent across the upstream channel every 102.4 ms.
• The frame is composed of 512 ATM cells, of which only 414 are devoted to data transmission. The rest are used for communication requests, opportunities to join the channel, synchronization, conflict resolution and control.
• Of the 414 useful ATM cells, each cable modem can use a maximum of 63 cells in each frame. This maximum value can be controlled by the cable operator depending on the type of contract offered.

Using this protocol, cable modems transmit requests towards the controller. Although subscriber contracts with different bandwidths are possible, on the dates when the measurements were taken all the subscribers had the same assigned bandwidth: 128/64 kb/s, that is, 128 kb/s for the downstream channel and 64 kb/s for the upstream channel. This bandwidth in the upstream channel corresponds to a maximum transmission value of 16 ATM cells per frame for each subscriber. The effective value is reduced as the number of subscribers using the network increases, because in the "best effort" quality of service the scheduling algorithm provides a fair allocation of bandwidth between all cable modems.

The way in which the information is sent produces a burst effect on the channel. Each cable modem works as an ON/OFF process. During its assigned time in the frame, the ON period, it can send up to 16 ATM cells. Then it must wait until the next frame for a new transmission; this waiting time is the OFF period. This pattern is repeated for all cable modems, so in each frame there are sequences of used cells followed by groups of unassigned or unused cells. The pattern is similar for each frame, and is modified as the number of subscribers on the channel changes. The global effect on the channel is a sequence of ON/OFF sources, as can be seen in Fig. 2.

4.2.3. Subscriber profile

This is the most relevant aspect in the development of the traffic model, because the diversity of subscribers' behavior is represented with a reduced set of parameters. Two factors must be considered: when and how the subscribers use the network.

Fig. 2. ON/OFF effect in the upstream channel.
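The frame arithmetic of the MAC protocol described in Section 4.2.2 can be checked with a short Python sketch. The 48-byte figure is the standard ATM cell payload and is not stated in the text; with that assumption, 414 data cells per 102.4 ms frame give exactly the 1.5525 Mb/s payload of Section 4.2.1, 512 cells give about 1.9 Mb/s, and 16 cells give 60 kb/s (7500 bytes per second), close to the 64 kb/s contract. The equal-split function is a hypothetical reading of the "fair allocation" performed by the scheduler, not the operator's actual algorithm.

```python
FRAME_PERIOD_S = 0.1024        # one frame every 102.4 ms (from the text)
CELLS_PER_FRAME = 512          # ATM cells per frame (from the text)
DATA_CELLS_PER_FRAME = 414     # cells devoted to data (from the text)
ATM_PAYLOAD_BYTES = 48         # standard ATM cell payload; assumption, not in the text

def cells_to_bps(cells_per_frame):
    """Bit rate carried by a given number of cell payloads per frame."""
    return cells_per_frame * ATM_PAYLOAD_BYTES * 8 / FRAME_PERIOD_S

def fair_share_cells(active_modems, max_cells=16):
    """Hypothetical equal split of the data cells among the active modems."""
    if active_modems <= 0:
        return 0
    return min(max_cells, DATA_CELLS_PER_FRAME // active_modems)

channel_payload_bps = cells_to_bps(DATA_CELLS_PER_FRAME)   # about 1.5525e6
modem_peak_bps = cells_to_bps(16)                          # about 60e3
```

Under this equal-split assumption a modem keeps its 16 cells per frame only while at most 25 modems are active on the channel; beyond that the ON period of each modem shrinks.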


4.2.3.1. When subscribers access the cable network. The total number of subscribers connected to the cable network is known at any given moment from the measurements taken on the DHCP server. The DHCP server assigns an IP address on demand to any subscriber requesting access to the cable network. Thus, the DHCP server provides the global evolution of the number of connected subscribers, and can distinguish between IP addresses belonging to one of the cities and the group of the other two cities.

Fig. 3 represents the percentage of the total connected subscribers over time for the cities where the cable operator provides services. Both lines show approximately the same evolution, in spite of the difference in the total number of subscribers in each city. This pattern has been observed in all the sub-networks managed by the cable operator. These measurements support the assumption that the number of connected subscribers on all the upstream channels follows a similar pattern to that observed in the DHCP measurements. Thus, using the DHCP measurements, the connection pattern can be synthesized by applying Fourier analysis. The expression obtained is then used to determine the number of connected subscribers at each moment, for each upstream channel.

The evolution of the number of connected subscribers is reproduced in the traffic model following these steps:

(1) The number of connected subscribers in each sample period of the DHCP measurements is expressed as a percentage of the total number of subscribers.
(2) The values obtained are then represented by a Fourier series.
(3) In order to reduce the number of model parameters, the Fourier series used to synthesize the subscribers' evolution is truncated to 90% of the spectral power. This percentage represents a tradeoff between the number of coefficients used and the error committed. The expression obtained is:

$$x[i] = \sum_{k=0}^{p} \operatorname{Re}X[k]\,\cos(2\pi k i/N) + \sum_{k=0}^{p} \operatorname{Im}X[k]\,\sin(2\pi k i/N) \qquad (1)$$

where Re X and Im X are the real and imaginary coefficients of the Fourier analysis, N is the number of elements in the Fourier analysis, and p is the number of terms needed to synthesize the function with 90% of the spectral power.

Fig. 3. Evolution of connected subscribers to the network (connected subscribers (%) versus time of day, for City-1 and Cities 2+3).


(4) Using Eq. (1), the percentage of connected subscribers is calculated for each sample period, i.
(5) Multiplying the number of subscribers assigned to each upstream channel by the percentage obtained in the previous step gives the number of connected subscribers.
(6) An analysis of the variations of the connected subscribers in the DHCP measurements shows that there are slight variations in the number of connected subscribers at the same time on different days. These variations follow a normal distribution, and have been included in the result obtained. The final number of connected subscribers is drawn from a normal distribution, which has as its mean the number of connected subscribers obtained in the previous step, and a standard deviation of 5% of the mean value.

4.2.3.2. How subscribers demand services. The second step in the development of the subscriber profile is to determine how the subscribers demand services from the network. The subscriber profile is estimated from the traffic measurement analysis, which confirms the following assumptions. Firstly, the mean and peak traffic on the upstream channels are proportional to the number of subscribers assigned to the channel, which means that traffic can be considered nearly homogeneous among the subscribers. Secondly, traffic can be divided into two types: interactive and non-interactive traffic. Interactive traffic increases as the number of connected subscribers increases; this traffic is associated with applications which require human interaction, for example web browsing. On the other hand, non-interactive traffic remains almost constant although the number of connected subscribers varies, and is associated with more permanent applications, for example peer to peer services.

For the first assumption, Fig. 4 shows the graphs for the mean values and peak values of the upstream traffic, plotted against the number of subscribers assigned to each upstream channel.
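As an illustration of the connection-evolution procedure of Section 4.2.3.1 (steps (1) to (6)), the following Python sketch computes cosine and sine amplitudes, truncates the series and draws the number of connected subscribers. Several choices are assumptions not fixed by the text: the coefficients are scaled so that Eq. (1) reproduces the signal directly, the 90% criterion is applied to the power of the non-constant harmonics (the constant term is always kept), and the daily profile is synthetic rather than taken from the DHCP measurements. All function names are hypothetical.

```python
import cmath
import math
import random

def fourier_coefficients(x):
    """Cosine (a) and sine (b) amplitudes of x for harmonics 0..N/2."""
    n = len(x)
    a, b = [], []
    for k in range(n // 2 + 1):
        X_k = sum(x[i] * cmath.exp(-2j * math.pi * k * i / n) for i in range(n))
        # The constant and (for even N) Nyquist terms appear once, the others twice.
        scale = 1.0 / n if k == 0 or 2 * k == n else 2.0 / n
        a.append(X_k.real * scale)
        b.append(-X_k.imag * scale)
    return a, b

def terms_for_power(a, b, n, fraction=0.90):
    """Smallest p whose harmonics 1..p hold `fraction` of the non-constant power."""
    power = [0.0]
    for k in range(1, len(a)):
        power.append(a[k] ** 2 if 2 * k == n else (a[k] ** 2 + b[k] ** 2) / 2.0)
    total = sum(power)
    if total == 0.0:
        return 0
    running = 0.0
    for k in range(1, len(power)):
        running += power[k]
        if running >= fraction * total:
            return k
    return len(power) - 1

def synthesize(a, b, p, n, i):
    """Eq. (1) truncated to harmonics 0..p, evaluated at sample i."""
    return sum(a[k] * math.cos(2 * math.pi * k * i / n) +
               b[k] * math.sin(2 * math.pi * k * i / n) for k in range(p + 1))

def connected_subscribers(assigned, percentage, rng, rel_sd=0.05):
    """Steps (5)-(6): scale by the assigned subscribers, add normal variation."""
    mean = assigned * percentage / 100.0
    return max(0.0, rng.gauss(mean, rel_sd * mean))

# Synthetic daily profile (percent connected); 96 samples is an arbitrary choice.
N = 96
profile = [35 + 25 * math.cos(2 * math.pi * (i - 84) / N)
           + 5 * math.cos(4 * math.pi * i / N) for i in range(N)]
a, b = fourier_coefficients(profile)
p = terms_for_power(a, b, N)
estimate = connected_subscribers(200, synthesize(a, b, p, N, 40), random.Random(1))
```

For this synthetic profile a single harmonic already holds more than 90% of the non-constant power, so the truncated series keeps only the constant term and the daily fundamental; the dropped second harmonic bounds the reconstruction error.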
For each graph the points are fitted by a linear regression model, whose equation is also shown in the graph. In the graphs two dotted lines mark the limits of the confidence interval for the predictions of the regression model, at a 95% confidence level. This kind of relationship has also been observed on the downstream channels, and it is constant over time: the behaviour was observed in both the 2001 and the 2002 measurements.

Fig. 4. Relationship of mean and peak traffic with number of subscribers assigned to the upstream channels (channel bandwidth utilization (%) versus number of subscribers on the channel): (a) mean upstream traffic values, linear fit y = 0.1119x − 0.4076, R² = 0.5513; (b) peak upstream traffic values, linear fit y = 0.2201x + 11.811, R² = 0.654.

The second assumption is based on the traffic profile on the upstream channels, where a continuous level of traffic is observed throughout the day. The existence of two types of traffic has been confirmed by the analysis of the relationship between traffic and connected subscribers, and by the traffic measurements collected in the Internet access router.

From the DHCP measurements, the number of connected subscribers in each group of cities is known. The aggregated upstream traffic in each group is calculated by adding all the upstream traffic which belongs to each group. Dividing the traffic by the number of connected subscribers provides the traffic per subscriber metric. Fig. 5 shows the evolution of this measurement over time: in periods of high traffic and high number of connected subscribers it remains almost constant, while in periods of low activity it exhibits a peak. This behaviour confirms that during the period of human activity interactive traffic dominates, while in the periods of low activity non-interactive traffic is more significant.

Fig. 5. Evolution of traffic per subscriber (channel utilization per subscriber (%) versus time of day; curves: traffic, traffic per subscriber, connected subscribers).

The analysis of the traffic measurements taken on the Internet access router, broken down into network services, has also shown that the services can be classified into two types of patterns. The most important volume of traffic is associated with the "peer to peer" application Edonkey, which represents 53.35% of the total upstream traffic. The traffic pattern associated with this application has an almost constant profile throughout the day. On the other hand, the traffic profile of most of the services changes over time, as the number of connected subscribers does.

The conclusion is that interactive traffic occurs mainly during the human active period, it evolves with the number of subscribers, and the impact of each subscriber can be considered constant. On the other hand, traffic during the human inactive period corresponds to the non-interactive type, and a reduced number of subscribers generate a high volume of traffic.

The different percentages of traffic belonging to each type are calculated considering that during the period of human inactivity traffic corresponds to non-interactive traffic. Thus, on each upstream channel we obtain a percentage of non-interactive traffic by averaging the traffic values between 5:35 a.m. and 6:30 a.m. Later, all the values obtained were plotted against the peak to mean traffic rate. This index was chosen because the greater the peak to mean rate, the higher the number of interactive subscribers. Fig. 6 shows the graph of non-interactive traffic against the peak to mean rate; the points are fitted by a potential (power-law) model whose equation and coefficient of determination are shown on the graph.

Considering the above assumptions and relationships, the traffic model parameters are calculated as follows:

(1) The mean traffic on each upstream channel is obtained from the relationship shown in Fig. 4(a):

$$\mathrm{Mean} = 0.1119\,\mathrm{Subs} - 0.4076 \qquad (2)$$

(2) The peak traffic on each upstream channel is obtained from the relationship shown in Fig. 4(b):

$$\mathrm{Peak} = 0.2201\,\mathrm{Subs} + 11.811 \qquad (3)$$

(3) The peak to mean rate is obtained by dividing the values obtained from Eqs. (2) and (3). From this value, the percentage of non-interactive traffic is estimated, using the relationship shown in Fig. 6:

$$\mathrm{Nint} = 44.579\left(\frac{\mathrm{Peak}}{\mathrm{Mean}}\right)^{-3.312} \qquad (4)$$

Fig. 6. Potential model which adjusts the percentage of non-interactive traffic (non-interactive channel utilization (%) versus rate peak traffic/mean traffic; fit y = 44.579x^{-3.312}, R² = 0.5063).

Fig. 7. Traffic estimation process based on the number of subscribers on the channel.

(4) The interactive traffic is calculated as a percentage of interactive traffic per subscriber. Thus, the value of interactive traffic is obtained as the difference between the mean traffic, calculated with Eq. (2), and the non-interactive traffic, from Eq. (4). Then, the mean number of effective connected subscribers on the channel is calculated. Effective subscribers are the number of connected subscribers minus the non-interactive subscribers, that is, the interactive subscribers:

$$\mathrm{Int} = \frac{\mathrm{Mean} - \mathrm{Nint}}{(\mathrm{perc} - \mathrm{perc}_{\min})\,\mathrm{Subs}} \qquad (5)$$

where perc and perc_min are the mean and minimum values of the connection evolution function, and Subs is the number of subscribers on the channel.

All the traffic model parameters are obtained from a known parameter: the number of subscribers assigned to each channel. Fig. 7 summarizes the calculation process described and the relationships between the traffic measurements considered.

4.3. Traffic model implementation

The traffic model has been implemented using the modeling and simulation language QNAP2¹. This language is based on the queuing network paradigm and uses discrete event simulation. Thus, the different elements of the traffic model have been represented by queues with complex services.

¹ QNAP2 was developed by INRIA, and is a trademark of SIMULOG.

The non-interactive traffic is represented by a source. This element produces the percentage of non-interactive traffic calculated in Eq. (4). At this source, it is necessary to determine the size of the requests and the time between requests in order to generate the percentage of non-interactive traffic. Considering the type of applications which produce non-interactive traffic, and in accordance with [8], the sizes of the non-interactive requests follow a Pareto distribution. The mean value of the Pareto distribution is chosen as the maximum traffic that a cable modem can send in a second, approximately 7500 bytes. The inter-request time is calculated to reach the required traffic volume.

The interactive traffic is generated by an infinite server sending requests from the active subscribers. In this case the inter-request time is fixed at 1 s, and the size necessary to maintain the traffic rate per subscriber is obtained. The sizes of the requests follow a normal distribution, taking the obtained value as the mean and a standard deviation of 20%.

The upstream communication is represented by two servers. One of them implements the upstream channel, and the other considers the influence of the MAC protocol. Together, they constitute a load-dependent server, whose service time depends on the number of ATM cells assigned to each cable modem by the MAC protocol under the "best effort" quality of service.

This traffic model has been used to simulate the behaviour of each upstream channel of the cable network. The only parameter it requires is the number of subscribers assigned to the channel; the model parameters are calculated from it using the established relationships. As the simulation time evolves, the number of connected subscribers is recalculated following the connection evolution function. The result of the simulation is the estimated traffic profile in each upstream channel. In Section 6, these results are compared with the real values in order to validate the traffic model, concluding that the traffic model produces a traffic profile which is statistically indistinguishable from the real traffic at a 95% confidence level.
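The parameter calculation of Section 4.2.3.2 can be sketched in Python as follows. The sketch reads Eq. (2) and Eq. (3) as the linear fits of Fig. 4, Eq. (4) as the power law of Fig. 6, and Eq. (5) as the interactive traffic divided by the effective subscribers; these readings, the function names, the use of fractions for perc and perc_min, and the example values 0.35 and 0.10 are assumptions. The inter-request time is likewise an assumed interpretation: it takes utilization percentages relative to the 1.5525 Mb/s payload and divides the 7500-byte mean request size by the required byte rate.

```python
PAYLOAD_BPS = 1.5525e6        # upstream payload (Section 4.2.1)
MEAN_REQUEST_BYTES = 7500.0   # mean non-interactive request size (Section 4.3)

def traffic_parameters(subs, perc, perc_min):
    """Return (mean, peak, nint, inter) in percent of channel utilization.

    subs: subscribers assigned to the upstream channel.
    perc, perc_min: mean and minimum of the connection evolution function,
    expressed here as fractions (assumption).
    """
    mean = 0.1119 * subs - 0.4076                 # Eq. (2)
    peak = 0.2201 * subs + 11.811                 # Eq. (3)
    if mean <= 0.0:
        raise ValueError("too few subscribers for the linear fit to apply")
    nint = 44.579 * (peak / mean) ** -3.312       # Eq. (4), power-law reading
    inter = (mean - nint) / ((perc - perc_min) * subs)   # Eq. (5)
    return mean, peak, nint, inter

def non_interactive_gap(nint_percent):
    """Mean time between non-interactive requests in seconds (assumed form)."""
    bytes_per_s = nint_percent / 100.0 * PAYLOAD_BPS / 8.0
    return MEAN_REQUEST_BYTES / bytes_per_s

mean, peak, nint, inter = traffic_parameters(200, perc=0.35, perc_min=0.10)
gap = non_interactive_gap(nint)
```

For a channel with 200 assigned subscribers this gives a mean utilization of about 22%, a peak of about 56% and roughly 2% of non-interactive traffic, which under the stated assumptions corresponds to one 7500-byte request about every 1.9 s.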

5. Cable network model development

In this section the traffic model is joined to a physical description of the HFC network to produce the global model of the cable network. The proposed model represents a cable network with an architecture like that shown in Fig. 1, validated for the case study. This model represents a complex system, including the behaviour of more than 17,000 subscribers and 17 HFC branches with 17 downstream and 102 upstream channels. To deal with this complexity, a hierarchical approach is used. In the first step a model for a simple HFC branch is developed. Later, several simple HFC branch models are combined and joined to other elements to represent the whole cable network.

5.1. A simple HFC branch model

A simple HFC branch of the cable network can be represented using queuing elements as shown in Fig. 8. This HFC model includes six upstream channels, each of which is represented by an upstream traffic model (the stations enclosed in the dotted line in Fig. 8) to be applied to the cable network. Using the facilities of the QNAP2 language, the traffic model is encapsulated in a simulation object, which receives the number of subscribers assigned to the upstream channel as its parameter. In this way, the simplicity and independence of the model are improved.

Fig. 8. Queuing model for a simple HFC branch. Stations: Subscribers (x 6), Non-interactive requests, Interactive requests, Upstream, HCX, Downstream, Exterior; output: HFC branch OUT.

The downstream queue represents the downstream channel, the HCX queue represents the HFC controller and the exterior queue represents the rest of the cable network. The downstream and HCX stations work as time-shared queues whose service time depends on the channel capacity for the downstream queue, and on the HFC controller specifications for the HCX queue. The most important element in this model is the exterior queue, which calculates the size of the responses to the subscribers' requests. These responses make up the downstream traffic.

The size of the responses is obtained from the rate (that is, the ratio) between downstream and upstream traffic measurements. Thus, the size of each response is calculated by multiplying the size of each subscriber request (obtained from the traffic model) by the rate. The rate for each simple HFC branch is calculated from the downstream traffic on the branch and the aggregated traffic of all the upstream channels in the branch. Different rates are obtained for the two different types of traffic. The non-interactive rate is the rate between downstream and upstream samples belonging to periods of low activity (5:35 a.m. to 6:30 a.m.). These samples give a mean value and a standard deviation for the non-interactive rate in each HFC branch. For the interactive traffic rate, the samples from periods of high activity (between 9:00 p.m. and 11:00 p.m.) are considered. The interactive rate is calculated using the expression:

\[
r = \frac{\mathrm{Down}_{\mathrm{high}} - \mathrm{Down}_{\mathrm{low}}}{\mathrm{Up}_{\mathrm{high}} - \mathrm{Up}_{\mathrm{low}}} \qquad (6)
\]

This expression calculates the rate between the interactive traffic on both kinds of channels, downstream (Down) and upstream (Up). The interactive traffic on each channel is obtained as the difference between the maximum traffic (high) and the minimum or non-interactive traffic (low). Applying Eq. (6) to each group of samples, several values for the rate are obtained. These values produce a mean value and a standard deviation for the interactive traffic rate. In this way, the simple HFC branch model reproduces the upstream traffic using the traffic model previously developed, and the exterior queue generates the traffic on the downstream channel of each HFC branch.

5.2. The cable network model

Using the modeling language facilities, the simple HFC branch model is encapsulated in a simulation object. This object is the basic element for building the global cable network model. Thus, the global cable network model contains as many HFC branch objects as there are HFC branches in the cable network. The cable network model is completed with a queue which represents the ATM backbone, which interconnects all the HFC branches, and a special network branch which represents the cable operator head-end. This modeling strategy produces a scalable model, which adapts easily to the cable network evolution. Fig. 9 shows a scheme of the whole cable network model.

Fig. 9. Queuing model for the cable network. Elements: HFC branches 1 to N (each with Subscribers x 6, Non-interactive request, Interactive request, Upstream, HCX, Downstream and HFC branch OUT), ATM backbone, and head-end branch (ATM switch, Servers, Router 1, Router 2, Internet access 1, Internet access 2, Internet).

The most important aspect of this model is the way the traffic is distributed in the cable network. There are several kinds of traffic: internal, local, and external. Internal traffic is that established between subscribers in the same HFC branch. It is filtered by the HCX element and does not migrate to the rest of the cable network. This traffic is very limited and can be considered negligible. Local traffic is the traffic established between subscribers in different HFC branches, and external traffic is the traffic sent by the subscribers towards the head-end network branch. The external traffic in the head-end can go towards the servers or towards Internet.

The volume of each kind of traffic is estimated using the available measurements. First, the sum of all upstream traffic, that is, the traffic sent through all the upstream channels and directed towards the ATM backbone, is obtained. This traffic comprises both local and external traffic. This global upstream traffic is compared with the traffic registered in the Internet access router (Router 2 in Fig. 9). The evolution of both traffic series is shown in Fig. 10. The graph shows that the traffic profiles in both parts of the network are almost the same. This indicates that the majority of the HFC cable network traffic is sent towards Internet. The small differences between the two lines are due to the local traffic and the traffic towards the cable network server.

To evaluate the volume of server traffic, the server log files for the same date as the traffic measurements were analyzed. The average traffic value on the server was found to be 0.23 Mbps, far below the measurements registered in Fig. 10. This traffic is shared between the cable network subscribers (6.35% of requests and 4.41% of traffic volume) and external requests made from Internet. The conclusion is that the majority of the upstream traffic on the cable network is traffic towards Internet; the local traffic represents between 1% and 2%, and the server traffic between 0.5% and 3%.

The downstream traffic associated with the responses to the subscribers is calculated as in the simple HFC branch model, but in this case the rate between the incoming traffic to the cable network from Internet and the outgoing traffic from the cable network towards Internet is considered; both are measured in the Internet access router (Router 2 in Fig. 9). As in the HFC branch model, a rate for each kind of traffic, interactive and non-interactive, must be obtained.
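The rate calculation used for both the HFC branch and the whole network can be sketched in Python as follows. This is a minimal illustration rather than the authors' implementation: the function names are hypothetical, the sample values are invented, and drawing the rate from a normal distribution truncated at zero is an assumption made here so that a response size is never negative.

```python
import random
import statistics


def non_interactive_rate(down_low: float, up_low: float) -> float:
    """Downstream/upstream ratio for one sample taken in a low-activity period."""
    return down_low / up_low


def interactive_rate(down_high: float, down_low: float,
                     up_high: float, up_low: float) -> float:
    """Eq. (6): ratio of interactive downstream to interactive upstream traffic."""
    return (down_high - down_low) / (up_high - up_low)


def rate_statistics(rates):
    """Mean and sample standard deviation of a group of rate values."""
    return statistics.mean(rates), statistics.stdev(rates)


def response_size(rng: random.Random, request_bytes: float,
                  mean_rate: float, std_rate: float) -> float:
    """Response size = request size x rate, with the rate drawn per request."""
    rate = max(0.0, rng.gauss(mean_rate, std_rate))  # assumption: clip at zero
    return request_bytes * rate


# Invented (down_high, down_low, up_high, up_low) samples, one tuple per day.
samples = [(9.0, 3.0, 4.0, 1.0), (8.5, 2.5, 3.5, 1.0), (10.0, 3.0, 4.5, 1.0)]
rates = [interactive_rate(*s) for s in samples]
mean_rate, std_rate = rate_statistics(rates)
reply = response_size(random.Random(7), 1200.0, mean_rate, std_rate)
```

Keeping one mean and one standard deviation per traffic type means the exterior queue only needs four numbers per branch to turn upstream requests into downstream load.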
For non-interactive traffic, the continuous part of the traffic is considered. This kind of traffic is produced by peer-to-peer applications, so the rate between the incoming and outgoing traffic produced by peer-to-peer applications gives the effect of non-interactive traffic. This rate has a mean value of 2.44 and a standard deviation of 0.21. In the case of interactive traffic, the continuous traffic is subtracted from the total traffic, both incoming and outgoing. The rate between the resulting traffic values gives the influence of the interactive traffic. This rate has a mean value of 2.23 and a standard deviation of 0.24. From the analysis of the traffic measurements, the traffic rates have been found to follow an approximately normal distribution.

Fig. 10. Upstream and outgoing router traffic comparison. Axes: MBytes per second versus time of day; series: towards Internet, aggregated upstream.

The final cable network model generates requests in all the upstream elements, each of them represented by a traffic model, and in this way generates the upstream traffic on all the channels. This traffic is sent across the ATM backbone element mainly to the head-end branch, but also to the other HFC branches. In the head-end branch, the traffic is directed towards the server or Internet elements. In these elements the response traffic is calculated, and sent back after a delay period which represents the information access time. This traffic is sent back to the HFC branches through the downstream channels.

5.3. Model extensions

The cable network model has been developed based on the characteristics of TELECABLE INC.; however, it can be extended to other cable network architectures. The versatility of the model is a result of its modular structure and the definition of simulation objects which encapsulate some of the modules.

The first module is formed by the traffic model. This model represents a complete upstream channel: the subscribers' requests, the MAC protocol, etc. This traffic model is encapsulated in the upstream simulation object, which receives as its parameter the number of subscribers assigned to the channel. The second module is constituted by the HFC branch model. This model represents the basic elements of an HFC branch, the downstream channel and the HFC controller, and defines as many upstream objects as there are upstream channels on the HFC branch. Finally this model is encapsulated to build the HFC branch simulation object.

The modular nature of the model simplifies the changes needed to evaluate new working conditions. The following are some examples of possible working conditions that can be analyzed and their associated changes:

• If the proprietary MAC protocol were changed to the DOCSIS protocol, the change would require modifying the station associated with the MAC protocol in the traffic model to implement the characteristics of the DOCSIS protocol. Thus the changes would be included in the upstream simulation objects and incorporated into the cable network model.
• The existence of subscribers with different qualities of service can be included in the model by defining new classes of customers in the queuing model. Each class of customers would have a distinct service in the model stations.
• A similar situation occurs when the behaviour of a particular application on the cable network is to be studied. The definition of a new class with a different service will provide information about the studied application.
• Finally, the number of upstream channels on each HFC branch can be modified directly from a network description file. This file initializes the cable network model by defining the number of HFC branch objects, their associated upstream channels and the number of subscribers assigned to each upstream channel.
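The modular structure and the description-driven initialization in the last item can be pictured with a small Python sketch. It is only an analogy to the QNAP2 simulation objects: the class names, the dictionary layout of the description and the branch and channel names are all hypothetical, and no queuing behaviour is modelled.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class UpstreamObject:
    """Stand-in for the upstream simulation object; its only parameter is a subscriber count."""
    name: str
    subscribers: int


@dataclass
class HFCBranchObject:
    """Stand-in for the HFC branch object, which owns its upstream objects."""
    name: str
    upstreams: List[UpstreamObject] = field(default_factory=list)

    def subscribers(self) -> int:
        return sum(u.subscribers for u in self.upstreams)


@dataclass
class CableNetworkModel:
    """Top level: one branch object per HFC branch in the description."""
    branches: List[HFCBranchObject] = field(default_factory=list)

    def upstream_channels(self) -> int:
        return sum(len(b.upstreams) for b in self.branches)

    def subscribers(self) -> int:
        return sum(b.subscribers() for b in self.branches)


def build_network(description: Dict[str, Dict[str, int]]) -> CableNetworkModel:
    """Build the model from {branch name: {channel name: assigned subscribers}}."""
    branches = []
    for branch_name, channels in description.items():
        upstreams = [UpstreamObject(ch, n) for ch, n in channels.items()]
        branches.append(HFCBranchObject(branch_name, upstreams))
    return CableNetworkModel(branches)


# Hypothetical description with two branches and three upstream channels.
description = {
    "branch-1": {"up-1": 150, "up-2": 180},
    "branch-2": {"up-1": 120},
}
network = build_network(description)

# Growing the network only changes the description, not the model code.
description["branch-2"]["up-2"] = 90
grown = build_network(description)
```

The point of the layering is that a change such as a new MAC protocol would stay inside the upstream object, while capacity changes stay inside the description.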

Currently, the cable network model is used for tuning and capacity planning of the bandwidth allocation between channels, because the model provides information over time rather than a fixed snapshot of the network situation. Using this model, the cable operator can make decisions about network growth, such as the inclusion of a new HFC branch or new upstream channels. Other direct applications of the model are the study of the impact on the network of new services, such as interactive multimedia services, and of the performance obtained by the subscribers using these new services.

6. Results and model validation

The main result provided by the cable network model is the traffic, expressed as a percentage of channel utilization, in all the network channels: upstream and downstream channels, backbone link and Internet access channels. The model also permits the cable operator to obtain information about traffic throughput and the utilization of network devices. Some of these results can be directly compared with the values of the real cable network and so the model can be validated.

The cable network model was developed using a hierarchical approach; first the upstream traffic model was developed, then it was included in the HFC branch model, which is the basic element of the cable network model. The model obtained at each stage was validated in order to improve the quality of the results obtained. The validation consists of three parts: traffic profile comparison, comparison using confidence intervals, and a comparison of the values obtained for some statistical properties (autocorrelation function and self-similarity coefficient). The validation based on confidence interval comparison is described in [14]. This method is based on defining a difference series (n = Real - Model) and calculating its confidence interval; if the calculated confidence interval includes zero, both series are statistically indistinguishable.

The first component to be validated is the upstream traffic model. Fig. 11 shows two examples of the comparison of traffic profiles on the upstream channels. Applying the comparison method based on confidence intervals to all the upstream channels, the interval obtained for a 95% level of confidence is [-11.33, 13.55], which includes zero. The statistical properties to compare are the autocorrelation function and the coefficient of self-similarity. The comparison of the autocorrelation functions ranges from a perfect fit to a slight difference in the lower half of the graph for the worst cases; Fig. 12 shows the comparison of the autocorrelation function for the two upstream channels of Fig. 11. In the case of self-similarity, the values of the Hurst coefficient are very close. The difference between values is lower than 10% except in two cases, and the mean difference is 2.77%. In conclusion, the upstream traffic generated by the model is a statistically equivalent approximation to the real traffic on the upstream channels.

Fig. 11. Examples of traffic comparisons on upstream channels: (a) channel GI01CC02-UP7, (b) channel GI02CC01-UP8. Axes: channel utilization (%) versus time of day; series: Real, Model.

Fig. 12. Comparison of autocorrelation coefficients: (a) channel GI01CC02-UP7, (b) channel GI02CC01-UP8. Axes: autocorrelation coefficient versus lag; series: Model, Real.

In the case of the HFC branch model, the downstream traffic obtained was also validated using the same methods; all the methods confirm the validity of the simple HFC branch model. The results are not shown because the simple HFC branch model is an intermediate step towards the final cable network model.

The cable network model can be validated for the traffic on the upstream and downstream channels, and for the traffic registered through the router which controls the access to Internet. The results obtained for the upstream channels are the same as for the traffic model (Fig. 11). For the downstream traffic, Fig. 13(a) shows the comparison between the real and the simulated traffic profiles, giving an example of the level of approximation that can be obtained. The comparison based on confidence intervals provides more information about the level of agreement between the model and the real cable network. The confidence interval obtained on all the channels for a 95% level of confidence includes zero ([-6.328, 14.159]), which means that both systems are statistically indistinguishable. The comparison of the statistical properties, autocorrelation function and self-similarity, shows insignificant differences in both cases. Fig. 13(b) depicts the worst fit among the autocorrelation functions in the downstream channels. The relative error of the self-similarity coefficients has a mean of 2.59% and a worst case of 10.59%.

Fig. 13. Example of traffic comparison on a downstream channel: (a) traffic profile comparison, (b) traffic autocorrelation, worst case. Axes: (a) channel utilization (%) versus time of day, (b) autocorrelation coefficient versus lag; series: Real, Model.

Finally, as most of the traffic of the cable network is destined for the Internet, it is very important to check how well the simulation results fit this traffic. Fig. 14 compares the traffic through the router both to and from Internet. The level of agreement between the real results and the simulated results is high in both cases. The confidence intervals obtained for a 95% level of confidence include zero: they are [-0.337, 1.323] and [-1.685, 1.706] respectively. A comparison of statistical properties confirms the validity of the model: there is no difference between the autocorrelation functions in either case, and the relative differences between the self-similarity coefficients are 0.34% and 1.10% respectively.

Fig. 14. Traffic comparison on the Internet access router: (a) outgoing traffic (towards Internet), (b) incoming traffic (from Internet). Axes: Mbytes per second versus time of day; series: Real, Model.
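The confidence-interval comparison used throughout this section can be sketched in Python as follows. This is a simplified illustration, not the procedure of [14] as the authors applied it: it assumes the differences can be treated as independent observations and uses a normal quantile in place of the Student t quantile, which is reasonable only for long series. The function names and the sample data are hypothetical.

```python
import math
from statistics import NormalDist, mean, stdev


def difference_ci(real, model, confidence=0.95):
    """Confidence interval for the mean of the difference series Real - Model."""
    if len(real) != len(model) or len(real) < 2:
        raise ValueError("need two series of equal length with at least 2 samples")
    diffs = [r - m for r, m in zip(real, model)]
    # Two-sided normal quantile, e.g. about 1.96 for a 95% level (assumption: large n).
    z = NormalDist().inv_cdf(0.5 + confidence / 2.0)
    half_width = z * stdev(diffs) / math.sqrt(len(diffs))
    centre = mean(diffs)
    return centre - half_width, centre + half_width


def indistinguishable(real, model, confidence=0.95):
    """True when the interval contains zero, the acceptance criterion used in the text."""
    low, high = difference_ci(real, model, confidence)
    return low <= 0.0 <= high


# Invented channel utilization samples (%), one value per measurement period.
real_profile = [10.0, 12.0, 11.0, 13.0, 30.0, 42.0]
model_profile = [11.0, 11.0, 12.0, 12.0, 31.0, 41.0]
accepted = indistinguishable(real_profile, model_profile)
```

Traffic series are usually autocorrelated, so a careful analysis would widen the interval accordingly; the sketch ignores this.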

7. Conclusions

This paper presents the design of a simulation model for a generic cable network, and its validation procedure using the measurements of a real cable operator. The cable network model is built with a hierarchical structure which is based on simpler intermediate models; these intermediate models are independently validated, thus confirming the quality of the final model. Using the modeling language facilities, each intermediate model is represented as a simulation object, which can be directly incorporated into higher level models. Thus, the final cable network model has the same number of simulation objects as there are HFC branches in the real cable network. The cable network model is scalable, and can evolve as the real cable network does.

The parameters of the cable network model are obtained from the analysis of traffic measurements on all its channels. The traffic on the channels is related by simple regression models to the number of subscribers assigned to each channel. The result of the analysis is a simple procedure to obtain model parameters from cable network information. This procedure makes the model straightforward to use, because it relates known physical parameters to more complicated traffic parameters. The use of the final cable network model by the cable operator is very simple; the only information it requires is the number of subscribers assigned by the cable operator to each upstream channel. This allows the cable operator to use the model for continuous capacity planning, as the number of subscribers assigned to each channel evolves. This makes the model different from existing models, in which the model parameters are not always related to the network and do not have a direct meaning.

As a final conclusion, the developed model can be considered a valid tool to support performance decisions about the cable network studied. A simple procedure is established to obtain model parameters from the available measurements. The model is generic and has a hierarchical structure which makes it adaptable to the particular implementation of other cable networks.

References

[1] D.J. Houck, W.S. Lai, Traffic modeling and analysis of hybrid fiber-coax systems, Computer Networks and ISDN Systems 30 (1998) 821–834.
[2] I. Borges, F. Fontes, J. Bastos, J. Loureiro, Interactive Services over Hybrid Fibre-Coax Networks, in: Proceedings of the International Conference on ATM, ICATM99, Colmar, France, June 1999.
[3] N.K. Shankaranarayanan, Z. Jiang, P. Mishra, User-Perceived Performance of Web-browsing and Interactive Data in HFC Cable Access Networks, in: Proceedings of the IEEE International Conference on Communications, Helsinki, Finland, June 2001.
[4] W. Leland, M. Taqqu, W. Willinger, D. Wilson, On the self-similar nature of Ethernet traffic (Extended version), IEEE/ACM Transactions on Networking 2 (1) (1994) 1–15.
[5] M. Garrett, W. Willinger, Analysis, Modeling and Generation of Self-Similar VBR Video Traffic, in: Proceedings of the ACM Sigcomm, London, September 1994, pp. 269–280.
[6] V. Paxson, S. Floyd, Wide area traffic: the failure of Poisson modeling, IEEE/ACM Transactions on Networking 3 (3) (1994) 226–244.
[7] W. Willinger, M.S. Taqqu, W.E. Leland, D.V. Wilson, Self-similarity in high-speed packet traffic: analysis and modeling of Ethernet traffic measurements, Statistical Science 10 (1) (1995) 67–85.
[8] M.E. Crovella, A. Bestavros, Self-similarity in World Wide Web traffic: evidence and possible causes, IEEE/ACM Transactions on Networking 5 (6) (1997) 835–846.
[9] V.S. Frost, B. Melamed, Traffic modeling for telecommunications networks, IEEE Communications Magazine (1994) 70–81.
[10] P. Fiorini, Modeling telecommunication systems with self-similar data traffic, Ph.D. Thesis, Department of Computer Science and Engineering, University of Connecticut, 1997.
[11] R.G. Addie, M. Zukerman, T.D. Neame, Broadband traffic modeling: simple solutions to hard problems, IEEE Communications Magazine 36 (8) (1998) 88–95.
[12] L. Lipsky, H.-P. Schwefel, M. Greiner, M. Jobmann, Comparison of the Analytic N-Burst Model with Other Approximations to Self-Similar Telecommunications Traffic, Technical Report, TUM and BRC, November 2000.
[13] M. Garcia, X.G. Pañeda, D.F. Garcia, V.G. Garcia, R. Bonis, Traffic analysis of data transmission on Hybrid Fiber Coax Network, in: Proceedings of the IASTED International Conference on Communication Systems and Networks, CSN 2002, Malaga, Spain, September 2002, pp. 172–177.
[14] A.M. Law, W.D. Kelton, Simulation modeling & analysis, second ed., McGraw-Hill International, 1991.
