Validation of traffic flow models with respect to the spatiotemporal evolution of congested traffic patterns

Validation of traffic flow models with respect to the spatiotemporal evolution of congested traffic patterns

Transportation Research Part C 21 (2012) 31–41 Contents lists available at SciVerse ScienceDirect Transportation Research Part C journal homepage: w...

1MB Sizes 0 Downloads 107 Views

Transportation Research Part C 21 (2012) 31–41

Contents lists available at SciVerse ScienceDirect

Transportation Research Part C journal homepage: www.elsevier.com/locate/trc

Validation of traffic flow models with respect to the spatiotemporal evolution of congested traffic patterns Martin Treiber ⇑, Arne Kesting Technische Universität Dresden, Institute for Transport & Economics, Würzburger Str. 35, 01062 Dresden, Germany

a r t i c l e

i n f o

Article history: Received 10 August 2010 Received in revised form 1 September 2011 Accepted 1 September 2011

Keywords: Spatiotemporal dynamics Traffic instabilities Calibration Validation Traffic flow models Stop-and-go traffic

a b s t r a c t We propose a quantitative approach for calibrating and validating key features of traffic instabilities based on speed time series obtained from aggregated data of a series of neighboring stationary detectors. The approach can be used to validate models that are calibrated by other criteria with respect to their collective dynamics. We apply the proposed criteria to historic traffic databases of several freeways in Germany containing about 400 occurrences of congestions thereby providing a reference for model calibration and quality assessment with respect to the spatiotemporal dynamics. First tests with microscopic and macroscopic models indicate that the criteria are both robust and discriminative, i.e., clearly distinguishes between models of higher and lower predictive power. Ó 2011 Elsevier Ltd. All rights reserved.

1. Introduction Traffic flow models displaying instabilities and stop-and-go waves have been proposed for more than fifty years (Reuschel, 1950; Pipes, 1953) (see also the reviews by Helbing (2001) and Hoogendoorn and Bovy (2001)). Observations of instabilities date back several decades as well (Treiterer and Myers, 1974). Nevertheless, a systematic concept for quantitatively calibrating and validating model predictions of instabilities against observed data seems still to be missing. A possible reason for this are intricacies in the data interpretation and wrong expectations about what can be achieved and what not. In principle, traffic instabilities and oscillations in congested traffic can be captured by several types and representations of data: flow-density points and speed time series of aggregated stationary detector data (SDD), passage times and individual speeds from single-vehicle SDD, floating-car data captured by the vehicles themselves, and trajectory data derived from images of cameras at high viewpoints. At first sight, flow-density data seem to be useful for representing traffic instabilities since oscillations are non-stationary phenomena in which traffic is clearly out of equilibrium. This should lead to a wide scattering in the flow-density data of congested traffic (Kerner and Rehborn, 1996) as shown in Fig. 1 regardless of the theoretical question of whether traffic flow has a definite equilibrium relation (‘‘fundamental diagram’’), or not (Nishinari et al., 2003; Kerner, 2004; Treiber et al., 2010). In fact, simulations with the Intelligent Driver Model (Treiber et al., 2000), as well as with a wide variety of other models, show wide scattering even for identical drivers and vehicles (Fig. 1). Nevertheless, flow-density data cannot be used for a quantitative calibration since other effects such as inter-driver variability (heterogeneous traffic) (Banks, 1999; Ossen et al., 2007; Treiber and Helbing, 1999), intra-driver variability (drivers change their behavior over time) (Treiber and

⇑ Corresponding author. E-mail addresses: [email protected] (M. Treiber), [email protected] (A. Kesting). 0968-090X/$ - see front matter Ó 2011 Elsevier Ltd. All rights reserved. doi:10.1016/j.trc.2011.09.002

M. Treiber, A. Kesting / Transportation Research Part C 21 (2012) 31–41

Q (Vehicles/h)

(a)

3000

Data IDM, virtual detector

2500 2000 1500 1000 500

(b) 3000 Q (Vehicles/h)

32

IDM, virtual detector IDM, spatial density

2500 2000 1500 1000 500 0

0 0

30

60

90

ρ (Vehicles/km)

120

0

30

60

90

120

ρ (Vehicles/km)

Fig. 1. (a) Comparison of empirical and simulated flow-density data as obtained from real and virtual detectors; (b) Comparison of the simulated data as obtained from a virtual detector with the simulated local spatial density. Also shown is the fundamental diagram of the model (solid curve).

Helbing, 2003), and many effects related to lane changes and platoons (Nishinari et al., 2003; Treiber et al., 2006b) may lead to wide scattering as well. Moreover, since the density is not directly measurable by stationary detectors, it must be estimated, typically by making use of the hydrodynamic relation ‘‘flow divided by the average velocity’’. The available temporal average, however, systematically underestimates the true spatial average relevant for this relation which results in an extreme bias in the flow-density data of congested traffic (Leutzbach, 1988; Helbing, 2001). Fig. 1 illustrates this by means of simulations with the Intelligent Driver Model by Treiber et al. (2000)). This model has been calibrated such that the aggregated virtual detector shows a similar scattering as the empirical data (Fig. 1a). However, comparing the IDM virtual detector data with the true spatial local density (which, of course, is available in the simulation) reveals the systematic errors (Fig. 1b). Alternatively, detailed information can be extracted from single-vehicle data, floating-car data, or trajectory data. However, since oscillations are collective phenomena, the high level of detail is not necessary. Moreover, to date, the availability of these data types is still limited, or the data are not suitable for our purposes. Specifically, the NGSIM trajectory data (US Department of Transportation, 2007) widely used as a testbed for these types of data cover road sections of at most 2100 ft (640 m) in length, while stop-and-go waves develop on typical scales of 5–10 km. In contrast, speed time series obtained from aggregated stationary detector data are widely available (see Fig. 2) and thus suitable for systematic calibration and benchmarking purposes. Moreover, since temporal averages are the appropriate mean when analyzing time series, no systematic errors arise. Using this type of data, the wavelength and propagation velocities of oscillations on freeways has been investigated by Lindgren (2005), Bertini and Leal (2005), Ahn and Cassidy (2007), Schönhof and Helbing (2007), and Wilson (2008b) and compared for several countries by Zielke et al. (2008). However, the proposed

Fig. 2. Screenshot of a publicly available image database of congestion patterns on the German freeway A5. The database can be searched by criteria such as direction of travel, space interval, time interval, weekday, cause of the congestion, or the type of traffic pattern. In the example, the database has been filtered for jams occurring on Mondays in the southern driving direction. The images have been produced using a dedicated reconstruction technique (Treiber et al., 2011).

33

M. Treiber, A. Kesting / Transportation Research Part C 21 (2012) 31–41

schemes are not suitable for a systematic automated analysis. Moreover, to our knowledge, there seem to be no published work quantitatively determining the growth rate of the oscillations. In this paper, we propose a systematic approach for calibrating and validating key features of traffic instabilities based on speed time series obtained from aggregated stationary detector data. This includes the propagation velocity, the wavelength, and the growth rate of the oscillations. We give first results based on a historic congestion database of the German freeway A5, and show that the method has a high discriminative power in assessing the model quality. The proposed criteria are described in Section 2, and applied to observed data in Section 3. A first application to traffic flow models is given in Section 4 before Section 5 concludes with a discussion and an outlook.

2. The proposed quality measures The formulation of the criteria is based on qualitative empirical findings (‘‘stylized facts’’) (Treiber et al., 2010) that are persistently observed on various freeways all over the world, e.g., the USA, Great Britain, the Netherlands, and Germany (Lindgren, 2005; Bertini and Leal, 2005; Ahn and Cassidy, 2007; Schönhof and Helbing, 2007; Zielke et al., 2008; Helbing et al., 2009; Wilson, 2008a). For our purposes, the relevant stylized facts are the following: 1. Congestion patterns are typically caused by bottlenecks which may be caused, e.g., by intersections, accidents, or gradients. Analyzing about 400 congestion patterns on the German freeways A5-North and A5-South (Fig. 2) did not bring conclusive evidence of a single breakdown without a bottleneck (Schönhof and Helbing, 2007). 2. The congestion pattern can be localized or spatially extended Localized patterns, as well as the downstream boundary of extended congestions, are either pinned at the bottleneck, or moving upstream at a characteristic velocity ccong. Only extended patterns (Fig. 3) and moving localized patterns (isolated stop-and-go waves, Fig. 5) are relevant for our investigation. 3. Most extended traffic patterns exhibit internal oscillations propagating upstream approximately at the same characteristic velocity ccong = const (Daganzo, 2002; Mauch and Cassidy, 2002). Notice that, due to the upstream propagation, ccong < 0. In spatiotemporal representations such as that of Fig. 5 this corresponds to nearly parallel structures.

A5 South (May 7, 2001)

Speed (km/h) 140

480

120

120

100

100

80

478

60

476 474

40

(a)

Speed (km/h)

Location (km)

482

20

08:00

60 40 20

(b)

0 07:55

0

07:30

80

08:30

08:00

08:05

Speed (km/h) 100 482 80

480

60

D12

40

476 474 472

20

(c)

D07 7:30

8:00

Time (h)

08:1

8:30

0

Speed (km/h)

Detector position (km)

Detector position (km)

484

478

08:10

Time

Time

484 100

482

80

480

D12

478

60

476

40

474 472

(d) 8:00

20

D07

0

8:30

9:00

Time (h)

Fig. 3. Example of stationary detector data displaying the development of stop-and-go waves from stationary congested traffic near the bottleneck. (a) Reconstruction of the spatiotemporal evolution of local vehicle speeds using a dedicated reconstruction technique (Treiber et al., 2011). Also shown is a reconstructed vehicle trajectory (black curve). (b) Speed–time series of the reconstructed trajectory. (c) Speed–time series according to Eq. (1). The locations of the respective detectors are given by the black baselines, and the speed (reference value 90 km/h) is plotted relative to the baseline. The black parallelogram encloses the relevant spatiotemporal range defined in Section 2.2. (d) Representation of the same data with a skewed time axis t0 = t  (x  x1)/ccong with x1 = 473 km and ccong =  16 km/h.

(a)

0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0 -0.1 -20

(b)

D07-D08 D08-D09 D09-D10 D10-D11 D11-D12

-18

-16

-14

-12

-10

Averaged cross correlation

M. Treiber, A. Kesting / Transportation Research Part C 21 (2012) 31–41

Averaged cross correlation

34

-8

0.7

D07-D11 D08-D11 D07-D12 D09-D12

0.6 0.5 0.4 0.3 0.2 0.1 0 -0.1 -20

-18

-16

-14

-12

-10

-8

Wave propagation velocity c (km/h)

Wave propagation velocity c (km/h)

Fig. 4. Determining the propagation velocity c with the cross-correlation method. (a) Pairwise correlation; (b) multiple correlations according to Eq. (8).

Location (km)

A5 South (June 6, 2001)

Speed (km/h)

489

140

488

120

487

100

486

80 60

485

40

484 483

20

(a) 07:00

0

07:30

08:00

08:30

09:00

Time Speed (km/h)

Speed (km/h) 100

488 487

80

486

60

485 484

40

483 482 481

20

(b) 7:00

0

7:30

8:00

Time (h)

8:30

9:00

489

Detector position (km)

Detector position (km)

489

100

488 487

80

486

60

485 484

40

483 482 481

20

(c) 7:00

0

7:30

8:00

8:30

9:00

Time (h)

Fig. 5. Stationary detector data displaying the evolution of isolated stop-and-go waves caused by a weak bottleneck located at about 487.5 km. (a) The spatiotemporal speed field, (b) the time series, and (c) the time series with skewed time axis are derived from the data as in Fig. 3. The parameters of the skewed time transformation t0 = t  (x  x1)/ccong are given by x1 = 483 km and ccong =  16 km/h.

4. The amplitude of the oscillations increases while propagating upstream (Bertini and Leal, 2005; Ahn and Cassidy, 2007). We quantify this by defining a temporal growth rate r characterizing the growth process before saturation, i.e., before the oscillations become fully developed stop-and-go waves. 5. The average wavelength L of the (saturated) waves increases with decreasing bottleneck strength (Treiber et al., 2000, 2010). In formulating the calibration criteria, We use the first two facts to filter the input data for applicable spatiotemporal regions (Section 2.2) while the quantities c, r, and L quantifying the remaining stylized facts constitute the criteria itself (Sections 2.4,2.5,2.6). In general, c, r, and L depend on the bottleneck strength which, therefore, needs to be quantitatively defined as well (Section 2.3). 2.1. Data description We assume that aggregated data for the flow Q and speed V (arithmetic average) are available from several detector cross sections at locations i. The distance between two adjacent detectors should not exceed about 2.5 km which generally is

35

M. Treiber, A. Kesting / Transportation Research Part C 21 (2012) 31–41

satisfied on relevant freeway sections (i.e., sections where traffic breakdowns are observed regularly). Usually, the flow and speed data (Vilj, Qilj) of cross section i are available for each lane (index l) after each time interval (index j). With few exceptions, congested traffic is ‘‘synchronized’’ between lanes, i.e., the average speed at a given location and time is not significantly different across the lanes. We therefore use the following weighted averages as input for our calibration criteria:

V ij ¼

ni 1 X Q V ilj ; Q ij l¼1 ilj

ð1Þ

where ni is the number of mainroad lanes at location i. The total flow is given by

Q ij ¼

ni X

Q ilj :

ð2Þ

l¼1

2.2. Filtering spatiotemporal regions In order to determine the dynamic quantities c, r, and L from the input data {Vij} it is necessary to restrict the spatiotemporal range to regions of actual congestions. From the stylized facts above, we use the information that the downstream boundary of congestions is either stationary at a bottleneck, or moving with a constant velocity ccong < 0. Furthermore, internal structures move at nearly the same velocity. This leads to defining parallelogram-shaped regions for both isolated stopand-go waves, and extended congestions (cf. Figs. 3 and 5). Notice that we define the parallelogram using a quantity (the propagation velocity) whose precise value c will be determined only at a later stage. Fortunately, Stylized Fact 3 suggests that the values of c lie in a very small range which is confirmed by Fig. 6c. This makes it possible to define the spatiotemporal region using the a-priori estimate ccong without danger of circular reasoning. The parallelogram is defined as follows:  Two edges correspond to detector locations. The downstream edge is defined by the detector that is nearest to the bottleneck (in the upstream direction). The upstream edge is at the detector location where the waves begin to saturate. This should include n P 3 detector cross sections. In the following, we will count the cross sections in the direction of the flow, i.e., the upstream edge is located at x1, and the downstream edge at xn.  The other two edges are angled such that, in the spatiotemporal picture, the waves propagate parallel to these edges.     and x1 ; tend  The vertices x1 ; t beg connected by the upstream edge are defined such that the parallelogram has a max1 1 imum area subject to the condition that there is no free traffic inside:

  xi  x1 ; tbeg ¼ max ^t i  1 i ccong   xi  x1 tend :¼ t beg ¼ min ~ti  1 1 þ T: i ccong

Growth rate (h−1)

(a)

ð3Þ ð4Þ

10 5 0 −5 −10 −15

OCT/TSG MLC Free traffic

−20 −25 −30 0

20

40

60

80

100

120

Wavelength (km)

(b) 8

(c)

7 6 5 4 3 2 1

Propagation velocity c

Velocity at bottleneck (km/h) 100 OCT/TSG MLC Free traffic

80 60 40 20 0 −20

0

10

20

30

40

50

Velocity at bottleneck (km/h)

60

70

0

20

40

60

80

100

120

Velocity at bottleneck (km/h)

Fig. 6. Characteristic properties of the propagation of perturbations in congested traffic. (a) Growth rate r, (b) wavelength L, and (c) propagation velocity c, as a function of the average speed V characterizing the bottleneck strength.

36

M. Treiber, A. Kesting / Transportation Research Part C 21 (2012) 31–41

Here, T ¼ t end  tbeg denotes the considered time interval for each detector, i.e., the length of the edges of the parallelogram in 1 1 the direction of time. Furthermore, ^ti is the time where the aggregated speed time series Vij of cross section i drops below a critical speed vc, and ~ti is the time where it rises above vc. To reduce the occurrence of false signals, the time series can be smoothed over time scales of a few minutes just for this step. A value vc = 70 km/h turned out to give good, i.e., robust, results for freeways. Finally, the parallelogram is used to extract the relevant data for each of the n detector cross sections. For cross section i at positions xi, the relevant traffic records begin at the sampling interval index beg

ji

! tbeg xn  xi 1 ; þ Dt ccong Dt

¼ round

ð5Þ

where Dt denotes the sampling time interval (Dt = 1 min, in our case). The relevant data end at the sampling interval index end

ji

beg

¼ ji

þ m;

  T m ¼ round : Dt

ð6Þ

2.3. Quantifying the bottleneck strength For extended congestions, the quantities c, r, and L depend on the bottleneck strength. Therefore, this exogenous variable has to be characterized by the measured quantities as well. Theoretically, the bottleneck strength is defined by a local drop of the capacity that is available for the mainroad traffic (Schönhof and Helbing, 2007). However, this quantity is problematic to measure, particularly, if the bottleneck includes merging/diverging traffic (e.g., junctions, intersections, or lane closings). Therefore, we use as an instrument variable the average vehicle speed at the detector i = n nearest to the bottleneck, end



jn X 1 V nj : m þ 1 beg

ð7Þ

j¼jn

At this location, the oscillations inside extended jams are minimal (Stylized Fact 4) making V a well-defined quantity. Basically, V measures the bottleneck strength: The lower the average speed of the vehicles immediately upstream of the bottleneck, the stronger the bottleneck. Obviously, the maximum bottleneck strength, i.e., a complete road blockade, corresponds to a queue of standing vehicles ðV ¼ 0Þ. Of course, in the immediate neighborhood of the bottleneck, the traffic dynamics generally depends on details of the bottleneck such as merging and diverging regions. However, in the upstream region relevant for this investigation, the dynamics essentially depends only on the bottleneck strength (Treiber et al., 2010), i.e., on the instrument variable V. In order to plot the characteristic quantities of isolated stop-and-go waves into the same diagram, such as Fig. 6, we define V as the speed outside of the wave, for this case. 2.4. Propagation velocity The propagation velocity c can be calculated by maximizing the sum of cross-correlation functions of speed time series of detector pairs {i, j} in the congested region (at positions xi and xj, respectively), with respect to the velocity c (Coifman and Wang, 2005; Zielke et al., 2008):

c ¼ arg max 0 c

XX i

k>i

h  xi  xk i Corr V i ðtÞ; V k t þ ; c0

ð8Þ

where the continuous-in-time function Vi(t) is defined by a piecewise linear interpolation of the detector time series Vij. When selecting spatiotemporal regions of free traffic, Eq. (8) can be used to determine the propagation velocity of perturbations in free traffic (data points at the upper right corner of Fig. 6c). Notice that the velocity according to (8) is discrete-valued. For example, when using only two cross sections as has been done in the previous investigations (Coifman and Wang, 2005; Zielke et al., 2008), a simple calculation shows that the discretization interval is given by

Dc ¼ c 2

Dt : x2  x1

ð9Þ

For typical values (aggregation interval Dt = 1 min, distance x2  x1 = 1 km, velocity c = 5 m/s, Dc = 1.5 m/s), this corresponds to about one third of the value to be determined (Fig. 4a). Fig. 4b shows that using multiple cross sections according to Eq. (8) greatly decreases the discretisation errors and, moreover, makes the result robust with respect to choosing different upstream and downstream boundaries of the jam region. Notice that even an erroneous extension of the downstream boundary to Detector 12 (no oscillations at this position, Fig. 3c) hardly influences the result (Fig. 4b).

M. Treiber, A. Kesting / Transportation Research Part C 21 (2012) 31–41

37

2.5. Growth rate ~ of perturbations is defined in terms of the slope of the linear regression of the logarithm The average spatial growth rate r of the amplitude,

r~ ¼

P

i xi

ln jAi j  nxln jAi j P 2 ; 2 i xi  nx

ð10Þ

where the amplitude Ai is approximated by the standard deviation of the speed time series Vij inside the parallelogram. Of ~ is only a rough approximation of the true growth rate. Particularly, r ~ is influenced by non-collective fluctuations course, r caused by random noise and driver-vehicle heterogeneity as well as by saturation effects, particularly, if the spatiotemporal region is not defined carefully. Both effects systematically decrease the estimated growth rate but do not invalidate the overall picture. We tested the linear regression of the logarithm against a nonlinear (quadratic) growth model. The nonlinear coefficient was not significantly different from zero (i.e., the null hypothesis of a linear growth could not be rejected) if the maximum speed during the oscillations did not exceed 60% of the free-flow speed. This defines an applicable set of detectors (possibly a subset of the parallelogram) for determining the growth rate. Once the spatial growth rate is known, the temporal growth rate is given by

r ¼ cr~ :

ð11Þ

~ is negative, and r positive. Notice that, in general, r 2.6. Wavelength The average period s of the oscillations of extended congestions is given by the position of the first nontrivial peak of the autocorrelation functions of detector time series inside the parallelogram. In some cases, waves merge during their propagation such that the period and the wavelength increases with the distance from the bottleneck. We therefore define s at a certain development stage, namely at the limit of saturation where the wavelengths of the oscillations are of the same order such that a distinct maximum of the cross correlation function exists:

s ¼ arg max Corr½V 1 ðtÞ; V 1 ðt þ s0 Þ: 0 s >0

ð12Þ

In analogy to Eq. (11), the associated wavelength is given by

L ¼ jcjs:

ð13Þ

3. Application to freeway data The characteristic quantities defined in the previous section have been calculated for about 30 instances of traffic congestions observed on a section of the German freeway A5-South near Frankfurt for April and May 2001. Images of the corresponding spatiotemporal speed fields can be viewed at www.traffic-states.com (cf. Fig. 2 for a screenshot). Not all of the more than 60 jams observed in the southern direction in this time period are suited for the analysis: Localized jams must be moving which can be selected by the website filter criterion ‘‘MLC’’ standing for ‘‘moving localized traffic’’ (Treiber et al., 2000), and extended jams must exhibit distinct oscillating structures (website filter criteria ‘‘OCT’’ or ‘‘TSG’’ standing for ‘‘oscillatory congested traffic’’ and ‘‘triggered stop-and go waves’’, respectively). Additionally, they must be sufficiently extended such that the parallelogram includes at least three cross sections, and each cross section records at least three oscillations. While oscillating structures are observed for most of the cases except for very strong or weak bottlenecks (website filter criteria ‘‘HCT’’ (‘‘homogeneous congested traffic’’), and ‘‘HST’’ (‘‘homogeneous synchronized traffic’’), respectively), nearly 50% of the remaining candidates do not satisfy the minimum size criteria. Fig. 6 displays the main results which can be used not only for calibration purposes, but also for understanding the traffic dynamics in general. Part (a) of this figure shows the growth rate of the moving structures inside extended congestions (points for V below 70 km/h), the growth rate of isolated stop-and-go waves (MLC, top right corner), and, for comparison, the growth rate of some moving structures in free traffic. For congested traffic, the growth rate is generally positive, apart from very low average velocities V corresponding to extended congestions behind strong bottlenecks (usually accidentcaused lane closures, selected in the database by the criterion ‘‘accident’’). From the point of view of the system comoving with the drivers (or when investigating floating-car data) , this growth transforms to a negative growth rate of the observed oscillations as shown in Fig. 3b: More specifically, when approaching the bottleneck , the experienced velocity oscillations apparently decrease in amplitude with the growth rate

rFC ¼ V FC r~ :

ð14Þ

They have the apparent period

sFC ¼ s

! jcj ; jcj þ V FC

ð15Þ

38

M. Treiber, A. Kesting / Transportation Research Part C 21 (2012) 31–41

Fig. 7. Speed–time series of several detectors on the US American freeway OR 217 (Zielke et al., 2008). Shown are deviations (oscillating curves) from the average speed (horizontal baselines). The detector spacings are proportional to the spacings of the baselines. Also shown is the time period 1/r in which the amplitudes of each wave increase by a factor of e.

where V FC is the average speed of the floating car. Fig. 6b shows that the wavelength decreases with increasing bottleneck strength, but always remains above 1 km. Finally, Fig. 6c confirms that the propagation velocity is nearly constant (in the range 17 km/h to 16 km/h for extended jams and about 18 km/h for isolated moving waves). The analysis quantitatively confirms the stylized facts presented in Section 2. Particularly, there is no empirical evidence against the existence of linear instabilities in congested traffic. This is also confirmed by data from other freeways in several countries, although there are quantitative deviations (this, exactly, is the basis for calibrating the models). As an example, Fig. 7 taken from Zielke et al. (2008) shows speed time series which are suitable for the present analysis. The growth rate is comparable to the A5 data while the propagation velocity c   20km/h has a somewhat higher absolute value (Zielke et al., 2008). 4. A first validation of traffic flow models In this section, we will illustrate how the methodology described in Sections 2 and 3 can be used to calibrate and validate traffic flow models based on their dynamics. To this end we will use the quantities c, r and s calculated for the extended region of congestion depicted in Fig. 3 to determine best-fit parameters for the Intelligent-Driver Model (IDM) (Treiber et al., 2000) and the Human Driver Model (HDM) (Treiber et al., 2006a). Fig. 8 presents the results for an IDM simulation with identical vehicles of length 6 m and an IDM parameter vector ~ b with values v0 = 120 km/h, T = 1 s, s0 = 2 m, d = 4, a = 1 m/s2, and b = 1.5 m/s2. For the model equations, we refer to Treiber et al. (2000). The bottleneck is represented by an onramp of merging length 1000 m. To obtain the observed vehicle speed V ¼ 28 km=h for the instance shown in Fig. 3, a ramp inflow of 550 vehicles per hour was necessary. At t = 70 min, the bottleneck is reduced resulting in a moving downstream front of the congestion at later times (this has no influence on the calibration results). Virtual detectors are positioned every kilometer, and vehicle passage times and speeds are aggregated in one-minute intervals, in analogy to the data sampling at the real detectors. The characteristic quantities rsim ð~ b; VÞ; ssim ð~ b; VÞ, and csim ð~ b; VÞ are determined from the virtual detectors using exactly the same method (cf. Section 3) that has been used for the actual observations. A comparison with the empirical values r = 5 h1, c =  17 km/h, and s = 10 min (cf. Fig. 3) gives a first impression of the discriminative power of the proposed validation and testing method: The empirical propagation velocity c agrees with both models. However, for the IDM simulation, the period sIDM = 6 min of the waves is too short, and the growth rate rIDM  10 h1 (calculated using the detectors at 0, 1, and 2 km) is too high. To quantify the discrepancy, we propose an objective function F measuring the agreement with respect to the propagation velocity c, wavelength L (or period s), and growth rate r as a function of the model parameter vector ~ b for fixed instrument variable V,

Fð~ b; VÞ ¼

rsim ð~ b; VÞ  rðVÞ A

!2 þ

ssim ð~ b; VÞ  sðVÞ B

!2

!2 csim ð~ b; VÞ  cðVÞ þ : C

ð16Þ

39

M. Treiber, A. Kesting / Transportation Research Part C 21 (2012) 31–41

Speed (km/h)

Location (km)

Location (km)

140

0

-2

-4

-6

120

0

100 80

-2

60 40

-4

(b)

(a)

20

-6 30

60

0

30

90

Time (min)

60

90

Time (min) Speed (km/h)

2

140 120

0

Location (km)

Location (km)

2

-2 -4

0

100 80

-2

60 40

-4

(d)

(c)

20

-6

-6 30

60

Time (min)

90

0

30

60 Time (min)

90

Fig. 8. Examples of simulated virtual detector data based on simulations with the Intelligent Driver Model for a permanent bottleneck (plots a and b), and the Human Driver Model with a temporary bottleneck (c and d). The simulated data aggregation (time intervals of 1 min), and the transformations to display the data are the same as for the empirical data in Fig. 3.

The weighting factors A, B, and C can be used to define the relative importance of each criterium (e.g., A = 5 h1, B = 10 min, C = 5 km/h). First investigations indicate that this objective function cannot be significantly reduced by changing the model parameters. We defer systematic calibrations with respect to this objective function and investigations on the robustness of the resulting calibration scheme to forthcoming research. For the HDM simulation, the period sHDM = 8 min deviates less, and the growth rate rHDM  5 h1 is consistent with the data. Consequently, the value of the objective function F is significantly lower. This may indicate that explicit reaction times and multi-anticipation (the drivers take into account multiple leaders) incorporated into the HDM, but not the IDM, are essential for a quantitative modeling. However, it is too early to draw definitive conclusions.

5. Discussion and outlook In conclusion, we propose three aspects of traffic instabilities that are accessible to a systematic and quantitative test of model prediction: the wavelength, the propagation velocity, and the growth rate. Since all properties are functions of the average velocity indicating the bottleneck strength, there should be enough input for a discriminative testing of models. All quantities are defined in terms of automated schemes enabling the systematic investigation of large-scale databases. As a first step, it is planned to investigate the complete database displayed at www.traffic-states.com containing more than 400 instances of congestions. Since microscopic models typically are calibrated with respect to the car-following behavior (e.g., in Brockfeld et al. (2004)), the proposed metrics for spatiotemporal criteria open an interesting possibility for cross-validation: Models calibrated with respect to the car-following behavior, or directly to time series of stationary detectors, can be cross-validated against the spatiotemporal criteria, and vice versa. On a more fundamental level, the investigations of this paper give evidence of convective instability in congested traffic flow, i.e., perturbations grow but propagate in only one direction (Treiber and Kesting, in press). The database contains also some unusual patterns to which the scheme of Section 2 can be easily generalized and which may be used to validate the predictive power of models calibrated to the more conventional patterns discussed above. For example, there are some searchable instances of ‘‘moving bottlenecks’’ (Fig. 9), possibly caused by very slow special transports or obstructions due to maintenance of the central reservation We notice that all proposed quantities for calibrating and testing relate to existing congestions, so parameters relating to free traffic (such as the desired speed or free accelerations) cannot be calibrated or validated by our approach (Hourdakis et al., 2003; Punzo and Ciuffo, 2009). Due to the stochastic nature of the perturbations eventually leading to a breakdown,

40

M. Treiber, A. Kesting / Transportation Research Part C 21 (2012) 31–41

Fig. 9. A moving bottleneck filtered from the image database www.traffic-states.com: validating a model with data it most probably has never seen before.

predictions for an actual breakdown are necessarily restricted to a probabilistic level (Elefteriadou et al., 1995; Brilon et al., 2005). It must be emphasized that calibrations based on microscopic quantities such as the difference between measured and simulated gaps are strongly influenced by unpredictable intra- and inter-driver variations (Ossen et al., 2007; Kesting and Treiber, 2008) resulting in little discriminative power (Brockfeld et al., 2004; Ranjitkar et al., 2004; Ossen and Hoogendoorn, 2005; Punzo and Simonelli, 2005). However, when calibrating against integrated quantities such as the traveling time (Brockfeld et al., 2003), important aspects such as accelerations or the properties of oscillations are averaged away resulting in reduced discriminative power as well. In contrast, the schemes proposed here probe the traffic dynamics on the intermediate scale of individual oscillations which is of the order of one kilometer and a few minutes. Since we probe on collective phenomena, inter-driver variabilities (multiple driver-vehicle classes), and intra-driver variabilities should only play a minor role. In fact, from the research on traffic-related effects of adaptive cruise control simulated with multi-class models (see, e.g., (Kesting et al., 2010)), we learn that the collective dynamics of multi-class traffic is essentially equivalent to single-class traffic with appropriately averaged parameters. Similarly, due to speed synchronization of congested traffic across lanes, lane-specific details cannot probed by our method. Nevertheless, the scale is sufficiently microscopic to retain accelerations. We therefore expect that the proposed schemes can be applied to a better and more discriminative benchmarking of microscopic and macroscopic models with respect to traffic instabilities. This is the topic of ongoing work. Acknowledgments The authors would like to thank the Hessisches Landesamt für Straßen- und Verkehrswesen for providing the freeway data. References Ahn, S., Cassidy, M.J., 2007. Freeway traffic oscillations and vehicle lane-change maneuvers. In: Allsop, R.E., Bell, M.G.H., Heydecker, B.G. (Eds.), Transportation and Traffic Theory 2007. Elsevier, pp. 691–710 (Chapter 29). Banks, J.H., 1999. An investigation of some characteristics of congested flow. Transportation Research Record 1678, 128–134. Bertini, R.L., Leal, M.T., 2005. Empirical study of traffic features at a freeway lane drop. Journal of Transportation Engineering 131 (6), 397–407. Brilon, W., Geistefeldt, J., Regler, M., 2005. Transportation and traffic theory: flow, dynamics and human interaction. In: Proceedings of the 16th International Symposium on Transportation and Traffic Theory, Reliability of Freeway Traffic Flow: A Stochastic Concept of Capacity. Elsevier, pp. 125– 144. Brockfeld, E., Kühne, R.D., Skabardonis, A., Wagner, P., 2003. Toward benchmarking of microscopic traffic flow models. Transportation Research Record 1852, 124–129. Brockfeld, E., Kühne, R.D., Wagner, P., 2004. Calibration and validation of microscopic traffic flow models. Transportation Research Record 1876, 62–70. Coifman, B.A., Wang, Y., 2005. Average velocity of waves propagating through congested freeway traffic. In: Mahmassani, H.S. (Ed.), Transportation and Traffic Theory: Flow, Dynamics and Human Interaction. Elsevier, pp. 165–179. Daganzo, C.F., 2002. A behavioral theory of multi-lane traffic flow. Part I: Long homogeneous freeway sections. Transportation Research Part B: Methodological 36 (2), 131–158. Elefteriadou, L., Roess, R., McShane, W., 1995. Probabilistic nature of breakdown at freeway merge junctions. Transportation Research Record 1484, 80–89. Helbing, D., 2001. Traffic and related self-driven many-particle systems. Reviews of Modern Physics 73, 1067–1141. Helbing, D., Treiber, M., Kesting, A., Schönhof, M., 2009. Theoretical vs. empirical classification and prediction of congested traffic states. The European Physical Journal B 69, 583–598. Hoogendoorn, S., Bovy, P., 2001. State-of-the-art of vehicular traffic flow modelling. Proceedings of the Institution of Mechanical Engineers, Part I: Journal of Systems and Control Engineering 215 (4), 283–303. Hourdakis, J., Michalopoulos, P., Kottommannil, J., 2003. Practical procedure for calibrating microscopic traffic simulation models. Transportation Research Record 1852 (-1), 130–139. Kerner, B., Rehborn, H., 1996. Experimental properties of complexity in traffic flow. Physical Review E 53, R4275–R4278. Kerner, B.S., 2004. The Physics of Traffic: Empirical Freeway Pattern Features, Engineering Applications, and Theory. Springer. Kesting, A., Treiber, M., 2008. Calibrating car-following models by using trajectory data: methodological study. Transportation Research Record 2088, 148– 156. Kesting, A., Treiber, M., Helbing, D., 2010. Enhanced intelligent driver model to access the impact of driving strategies on traffic capacity. Philosophical Transactions of the Royal Society A 368, 4585–4605.

M. Treiber, A. Kesting / Transportation Research Part C 21 (2012) 31–41

41

Leutzbach, W., 1988. Introduction to the Theory of Traffic Flow. Springer, Berlin. Lindgren, R., 2005. Analysis of Flow Features in Queued Traffic on a German Freeway. Ph.D. thesis, Portland State University. Mauch, M., Cassidy, M.J., 2002. Freeway traffic oscillations: observations and predictions. In: Taylor, M.A. (Ed.), Transportation and Traffic Theory in the 21st Century: Proceedings of the Symposium of Traffic and Transportation Theory. Elsevier, pp. 653–672. Nishinari, K., Treiber, M., Helbing, D., 2003. Interpreting the wide scattering of synchronized traffic data by time gap statistics. Physical Review E 68, 067101. Ossen, S., Hoogendoorn, S.P., 2005. Car-following behavior analysis from microscopic trajectory data. Transportation Research Record 1934, 13–21. Ossen, S., Hoogendoorn, S.P., Gorte, B.G., 2007. Inter-driver differences in car-following: a vehicle trajectory based study. Transportation Research Record 1965, 121–129. Pipes, L., 1953. An operational analysis of traffic dynamics. Journal of Applied Physics 24, 274. Punzo, V., Ciuffo, B., 2009. How parameters of microscopic traffic flow models relate to traffic dynamics in simulation: implications for model calibration. Transportation Research Record 2124, 249–256. Punzo, V., Simonelli, F., 2005. Analysis and comparision of microscopic flow models with real traffic microscopic data. Transportation Research Record 1934, 53–63. Ranjitkar, P., Nakatsuji, T., Asano, M., 2004. Performance evaluation of microscopic flow models with test track data. Transportation Research Record 1876, 90–100. Reuschel, A., 1950. Fahrzeugbewegungen in der Kolonne. Österreichisches Ingenieur-Archiv 4 (3/4), 193–215. Schönhof, M., Helbing, D., 2007. Empirical features of congested traffic states and their implications for traffic modeling. Transportation Science 41, 1–32. Treiber, M., Helbing, D., 1999. Macroscopic simulation of widely scattered synchronized traffic states. Journal of Physics A: Mathematical and General 32 (1), L17–L23. Treiber, M., Helbing, D., 2003. Memory effects in microscopic traffic models and wide scattering in flow-density data. Physical Review E 68, 046119. Treiber, M., Hennecke, A., Helbing, D., 2000. Congested traffic states in empirical observations and microscopic simulations. Physical Review E 62, 1805– 1824. Treiber, M., Kesting, A., in press. Evidence of convective instability in congested traffic flow: a systematic empirical and theoretical investigation. Transportation Research Part B: Methodological. doi:10.1016/j.trb.2011.05.011. Treiber, M., Kesting, A., Helbing, D., 2006a. Delays, inaccuracies and anticipation in microscopic traffic models. Physica A 360, 71–88. Treiber, M., Kesting, A., Helbing, D., 2006b. Understanding widely scattered traffic flows, the capacity drop, and platoons as effects of variance-driven time gaps. Physical Review E 74, 016123. Treiber, M., Kesting, A., Helbing, D., 2010. Three-phase traffic theory and two-phase models with a fundamental diagram in the light of empirical stylized facts. Transportation Research Part B 44 (8–9), 983–1000. Treiber, M., Kesting, A., Wilson, R.E., 2011. Reconstructing the traffic state by fusion of heterogenous data. Computer-Aided Civil and Infrastructure Engineering 26, 408–419. Treiterer, J., Myers, J., 1974. The hysteresis phenomenon in traffic flow. In: Buckley, D. (Ed.), Proc. 6th Int. Symp. on Transportation and Traffic Theory. Elsevier, New York, p. 13. US Department of Transportation, 2007. NGSIM – Next Generation Simulation. . Wilson, R., 2008a. Mechanisms for spatio-temporal pattern formation in highway traffic models. Philosophical Transactions of the Royal Society A 366 (1872), 2017–2032. Wilson, R.E., 2008b. From inductance loops to vehicle trajectories. In: Symposium on the Fundamental Diagram: 75 Years (Greenshields 75 Symposium). Transportation Research Board. Zielke, B., Bertini, R., Treiber, M., 2008. Empirical measurement of freeway oscillation characteristics: an international comparison. Transportation Research Record 2088, 57–67.