Computation and application of the paired combinatorial logit stochastic user equilibrium problem

Computation and application of the paired combinatorial logit stochastic user equilibrium problem

Computers & Operations Research 43 (2014) 68–77 Contents lists available at ScienceDirect Computers & Operations Research journal homepage: www.else...

2MB Sizes 0 Downloads 58 Views

Computers & Operations Research 43 (2014) 68–77

Contents lists available at ScienceDirect

Computers & Operations Research journal homepage: www.elsevier.com/locate/caor

Computation and application of the paired combinatorial logit stochastic user equilibrium problem Anthony Chen a,n, Seungkyu Ryu a, Xiangdong Xu b, Keechoo Choi c a

Department of Civil and Environmental Engineering, Utah State University Logan, UT 84322-4110, USA Department of Civil and Environmental Engineering, Hong Kong University of Science and Technology, Clear Water Bay, Hong Kong Special Administrative Region c Department of Transportation Engineering, Ajou University Suwon, 442-749, South Korea b

art ic l e i nf o

a b s t r a c t

Available online 29 August 2013

The paired combinatorial logit (PCL) model is one of the recent extended logit models adapted to resolve the overlapping problem in the route choice problem, while keeping the analytical tractability of the logit choice probability function. However, the development of efficient algorithms for solving the PCL model under congested and realistic networks is quite challenging, since it has large-dimensional solution variables as well as a complex objective function. In this paper, we examine the computation and application of the PCL stochastic user equilibrium (SUE) problem under congested and realistic networks. Specifically, we develop an improved path-based partial linearization algorithm for solving the PCL SUE problem by incorporating recent advances in line search strategies to enhance the computational efficiency required to determine a suitable stepsize that guarantees convergence. A real network in the city of Winnipeg is applied to examine the computational efficiency of the proposed algorithm and the robustness of various line search strategies. In addition, in order to acquire the practical implications of the PCL SUE model, we investigate the effectiveness of how the PCL model handles the effects of congestion, stochasticity, and similarity in comparison with the multinomial logit stochastic traffic equilibrium problem and the deterministic traffic equilibrium problem. Published by Elsevier Ltd.

Keywords: Logit Paired combinatorial logit Stochastic user equilibrium Traffic assignment Partial linearization Line search

1. Introduction The stochastic user equilibrium (SUE) principle was suggested by Daganzo and Sheffi [21] more than 30 years ago to relax the perfect knowledge assumption of network travel times of the deterministic user equilibrium (DUE) model. It is defined as follows: “At SUE, no motorists can improve his or her perceived travel time by unilaterally changing routes” [21]. Specifically, a random error term is incorporated in the route choice decision process to simulate travelers’ imperfect perceptions of network travel times, such that they do not always end up picking the minimum travel time route. The random error term here is interpreted as the perception error of network travel times due to the travelers’ imperfect knowledge of network conditions. In this model, each traveler is assumed to have some perceptions of the mean travel times on each link of the network. Each traveler’s route choice criterion is to minimize the perceived value n

Corresponding author. Tel.: þ 1 435 797 7109; fax: þ 1 435 797 1185. E-mail address: [email protected] (A. Chen).

0305-0548/$ - see front matter Published by Elsevier Ltd. http://dx.doi.org/10.1016/j.cor.2013.08.022

of the route travel time, which can be obtained by adding up the perceived travel times on all the links belonging to the route. Route choice models proposed under this approach can have different specifications according to modeling assumptions on the random error term. The two commonly used random error terms are Gumbel [23] and Normal [21] distributions, which result in the logitand probit-based route choice models, respectively. The logit-based route choice model has a closed-form probability expression and also an equivalent mathematical programming (MP) formulation for the SUE problem under congested networks [25]. The multinomial logit (MNL) SUE MP formulation can be solved using both path enumeration techniques [9,11,46] and column (or path) generation techniques [7,8,12–13,16,18–19,22,27,30,33]. Column generation techniques for the MNL SUE model can be implemented in the link- or path-based domains. The link-based algorithms [7,18,27,33] do not require path storage and often use Dial’s STOCH algorithm [23] or Bell’s alternative [6] as the stochastic loading step, while the path-based algorithms [8,12–13,16,19,22,30,46] require explicit path storage in order to directly compute the logit route choice probabilities. The drawbacks of the logit model are: (1) inability to account for overlapping (or correlation) among routes, and (2) inability to account for perception variance with respect to trips with different lengths. These two drawbacks stem from its underlying assumptions that the random

A. Chen et al. / Computers & Operations Research 43 (2014) 68–77

69

Fig. 1. Hierarchical structure of the paired combinatorial logit model.

error terms are independently and identically distributed (IID) with the same and fixed variances [43]. On the other hand, the probitbased route choice model does not have such drawbacks, because it handles the overlapping and identical variance problems between routes by allowing covariance between the random error terms for pairs of routes. However, the probit model does not have a closedform solution and it is computationally burdensome to solve the multiple integrals when the choice set contains more than a handful of routes. Due to the lack of a closed-form probability expression, solving the probit-based SUE model will require either Monte Carlo simulation [44], Clark’s approximation method [32,34], or numerical method [42]. Recently, there are renewed interests to relax the assumptions of the MNL SUE model. Several modifications or generalizations of the logit structure have been proposed to relax the IID assumptions of the MNL SUE model (see [38,39] for a comprehensive review of these recent advances of the logit model). Among these extended logit models, the paired combinatorial logit (PCL) model is considered the most suitable for adaptation to the route choice problem to resolve the overlapping problem while keeping the analytical tractability of the logit choice probability expression. In the PCL model, each pair of alternatives can have a similarity relationship that is completely independent of the similarity relationship of other pairs of alternatives. Bekhor and Prashker [2] argued that this feature is highly desirable for route choice models, since each pair of routes may have different similarities. Further, Gliebe et al. [26] demonstrated that the PCL model could be scaled to account for perception variance with respect to different trip lengths. Both features are useful in addressing the well-known “independent of irrelevant alternatives” (IIA) property inherited in the MNL model. Pravinvongvuth and Chen [40] examined these two features using simple examples (i.e., three routes in the loop-hole network) with limited number of routes to illustrate the flexibility of PCL’s hierarchical tree structure in overcoming the drawbacks of the MNL model. The nice features of the PCL model have been theoretically explored and numerically demonstrated. However, there has been quite less development of efficient algorithms for solving the PCL model under congested and realistic networks. The computationally complexity mainly comes from the large-dimensional solution variables as well as the complex objective function when implementing in large-scale realistic networks. In this paper, we examine the computation and application of the PCL SUE problem under congested and realistic networks. Specifically, we develop an improved path-based partial linearization algorithm for solving the PCL SUE mathematical programming formulation by incorporating recent advances in line search strategies. The partial linearization method is a descent algorithm for solving convex optimization problems [37]. It iterates between the search direction and line search steps until some convergence criterion is reached. The search direction is obtained by solving a first-order approximation of an additive part of the PCL SUE

objective function (to be shown later in Section 3). For the line search step, recent advances in line search strategies (e.g., self-regulated averaging (SRA) scheme and quadratic interpolation scheme) and the traditional line search methods (e.g., method of successive averages (MSA), bisection, and Armijo) are examined in order to efficiently determine a suitable stepsize. A real-size network in the city of Winnipeg is applied to examine the efficiency and robustness of various line search schemes that minimize the computational efforts required to determine a suitable stepsize while guaranteeing convergence. In addition, the practical implications of implementing the PCL SUE model are acquired by examining the effects of congestion, stochasticity, and similarity in comparison with the multinomial logit stochastic traffic equilibrium problem and the deterministic traffic equilibrium problem. The paper is organized as follows. The PCL route choice model and the equivalent PCL SUE formulation are presented in Section 2. The path-based partial linearization algorithm with various line search schemes is described in Section 3. Computational results and practical implications are then presented in Section 4 and Section 5, respectively. Finally, some conclusions are summarized in Section 6.

2. Paired combinatorial logit stochastic user equilibrium problem 2.1. PCL route choice model The paired combinatorial logit (PCL) model was originally proposed by Chu [20], further developed by Koppleman and Wen [29] to examine the structure, properties, and estimation, and adapted to model route choice decisions by Bekhor and Prashker [2], Gliebe et al. [26], and Pravinvongvuth and Chen [40]. In contrast to the simple structure of the logit model, the PCL model has a hierarchical structure that decomposes the choice probability into two levels represented by the marginal and conditional probabilities (see Fig. 1). Thus, the PCL choice probability can be expressed as PðkÞ ¼ ∑ PðkjÞ UPðkjkjÞ;

ð1Þ

jak

where PðkjkjÞ ¼

PðkjÞ ¼

eV k =ð1skj Þ ; þ eV j =ð1skj Þ

V k =ð1skj Þ

e

ð1skj ÞðeV k =ð1skj Þ þeV j =ð1skj Þ Þ1skj ; n1 n ∑l ¼ 1 ∑m ¼ l þ 1 ð1slm ÞðeV l =ð1slm Þ þ eV m =ð1slm Þ Þ1slm

ð2Þ

ð3Þ

where V k is the observable component of the utility for alternative k (i.e., U k ¼ V k þεk ; 8 k); skj is a similarity index between alternatives k and j; and n is the number of alternatives. PðkjkjÞ is the conditional probability of choosing alternative k given that the alternative pair kj has been chosen, and PðkjÞ is the marginal (unobserved) probability

70

A. Chen et al. / Computers & Operations Research 43 (2014) 68–77

for the alternative pair kj. For a choice set of n alternatives, there are a total of n(n 1)/2 pairs of alternatives as shown in the double summation of the marginal probability in Eq. (3). Fig. 1 provides an illustration of the hierarchical tree structure of the PCL model for a simple network with three alternative routes connecting node X and node Y. If skj is equal to zero for all kj pairs, the PCL model reduces to the multinomial logit (MNL) model [29] as follows: PðkÞ ¼

eV k ∑nj¼ 1 eV j

:

ð4Þ

When the similarity index is zero for all route pairs, this implies that the routes in the choice set are uniquely distinct. 2.2. Equivalent mathematical programming formulation The PCL equivalent mathematical programming (MP) formulation presented here is due to Bekhor and Prashker [2]. They showed that the marginal and conditional probabilities of the PCL model could be represented by two entropy terms in the objective function as follows: min Z ¼ Z 1 þ Z 2 þ Z 3 ; where

Z

Z1 ¼ ∑

xa

ð5Þ

t a ðwÞ dw;

ð6Þ

! rs f kðkjÞ 1 rs rs Z 2 ¼ ∑∑ ∑ βkj f kðkjÞ ln ; θ rs k j a k βrs kj

ð7Þ

a

0

jK rs j1

1 Z3 ¼ ∑ ∑ θ rs k ¼ 1

jK rs j

rs

rs

∑ ð1βrs kj Þðf kðkjÞ þf jðkjÞ Þln

j ¼ kþ1

rs rs f kðkjÞ þ f jðkjÞ rs βkj

;

ð8Þ

8 r; s;

ð9Þ

k jak rs

3.1. Algorithmic description In this section, we provide a path-based partial linearization method for solving the PCL SUE formulation. The method is a descent algorithm for continuous optimization problems [37]. Conceptually, this method consists of two main steps: search direction and line search. A search direction is obtained from the solution of a convex auxiliary problem, defined by an approximation of the original objective function through a first-order approximation of an additive part of the PCL SUE objective function. A line search is made in the search direction obtained with respect to the original objective function, and then the resulting stepsize defines a new solution with a reduced objective value. rs Suppose that a feasible solution f kðkjÞ ðnÞ is given in iteration n, and its corresponding route travel time is crs k ðnÞ. The partial linearized subproblem is ð11Þ

where rs

Z 1;sub ¼ ∑∑ crs k ðnÞ Uhk ;

ð12Þ

! rs hkðkjÞ 1 rs rs Z 2 ¼ ∑∑ ∑ βkj hkðkjÞ ln ; θ rs k j a k βrs kj

ð13Þ

! rs rs hkðkjÞ þ hjðkjÞ 1 jK rs j1 jK rs j rs rs Þðh þ h Þln Z3 ¼ ∑ ∑ ; ∑ ð1βrs kðkjÞ jðkjÞ kj θ rs k ¼ 1 j ¼ k þ 1 βrs kj

ð14Þ

rs k

∑ ∑ f kðkjÞ ¼ qrs ; f kðkjÞ Z 0;

3. Path-based partial linearization algorithm

min Z sub ¼ Z 1;sub þ Z 2 þ Z 3 ;

!

subject to rs

speeds, route-based congestion pricing, select link analysis, sensitivity analysis, and O–D trip table estimation from traffic counts). The PCL model accounts for route overlapping problem by incorporating a similarity index into the objective function (i.e., Z 2 and Z 3 ). This enables the PCL model to treat the similarity effect separately from the congestion effect (Z 1 ) and the stochasticity effect (Z 2 in Fisk’s formulation). Hence, the PCL SUE formulation is theoretically capable of accounting for congestion, stochasticity, and similarity effects simultaneously.

8 k; ðkjÞ; r; s;

ð10Þ

where t a ð U Þ is the travel time on link a, xa is the flow on link a, θ is the dispersion coefficient, βrs kj is a measure of dissimilarity index, rs rs defined as βrs ¼ 1s ; f is the flow on route k (of route pair kj) kðkjÞ kj kj between origin r and destination s; jK rs j is the number of routes between O and D pair rs; qrs is the travel demand between O and D pair rs. The objective function in Eq. (5) consists of three terms: Z1 given in Eq. (6) is the well-known UE term reflecting the congestion effect; Z2 given in Eq. (7) is a modification of Fisk’s entropy term reflecting the stochasticity effect; and Z3 given in Eq. (8) is another entropy term reflecting the similarity effect. These two entropy terms represent the two-level tree representation of the PCL model shown in Fig. 1. Z2 represents the conditional probability of the lower level and Z3 represents the marginal probability of the upper level. Note rs rs rs that the expression βrs kj f kðkjÞ lnðf kðkjÞ =βkj Þ is defined as zero if either rs rs f kðkjÞ ¼ 0 or βkj ¼ 0 in Eq. (7). Eq. (9) ensures that the route flows add up to the travel demand between each O–D pair. To ensure meaningful solutions, non-negativity constraints are imposed in Eq. (10). For the equivalency and uniqueness of the above model, interested readers may refer to Behkor and Prasker [2]. The uniqueness means that the above MP formulation is a strictly convex program. This property is beneficial for both algorithmic development and route flow analysis. We can adopt existing efficient algorithms to solve the MP formulation. In addition, the uniqueness of route flows is helpful in several important applications (e.g., optimal routing in the route guidance system, environmental impact analysis and fuel consumption estimation based on route profiles of travel

subject to rs

∑ ∑ hkðkjÞ ¼ qrs ;

8 r; s;

ð15Þ

k jak rs

hkðkjÞ Z 0;

8 k; ðkjÞ; r; s:

ð16Þ

The search direction is obtained by solving a first-order approximation of an additive part of the original objective function (i.e., Z1 of the PCL SUE formulation). This can be done by updating the route travel times, calculating the PCL route choice probabilities, and assigning the auxiliary route flows (and then auxiliary link flows) according to the obtained PCL probabilities. The solution to the above subproblem is

P rs kj ðnÞ ¼

    βrs kj θcrs ðnÞ θcrs ðnÞ k βrs þ exp  βj rs kj exp  βrs kj

kj

h    θcrs ðnÞiβrs ; ml θcrs jK rs j rs m ðnÞ ∑jmK rs¼j1 þ exp  βl rs 1 ∑l ¼ m þ 1 βml exp  βrs ml

8 ðkjÞ; r; s;

ml

ð17Þ 



θcrs ðnÞ

exp  βkrs kj rs    ; P kðkjÞ ðnÞ ¼ θcrs ðnÞ θcrs ðnÞ exp  βkrs þ exp  βj rs kj

rs rs P rs k ðnÞ ¼ ∑ P kj ðnÞ UP kðkjÞ ðnÞ; jak

8 k; ðkjÞ; r; s;

ð18Þ

kj

8 k; r; s;

ð19Þ

A. Chen et al. / Computers & Operations Research 43 (2014) 68–77

71

1=βkj 1=βkj 1=βkj βkj 1 ∑ βkj expðθcrs ðexpðθcrs þ expðθcrs Þ j ðnÞÞ k ðnÞÞ k ðnÞÞ rs

rs

rs

rs hk ðnÞ ¼ qrs UP rs k ðnÞ ¼ q U

rs

min

0rαr1

rs

rs

rs

rs

rs

Z½f kðkjÞ ðnÞ þ αðhkðkjÞ ðnÞf kðkjÞ ðnÞÞ:

rs

rs

rs

f kðkjÞ ðn þ 1Þ ¼ f kðkjÞ ðnÞ þ αðnÞ U ðhkðkjÞ ðnÞf kðkjÞ ðnÞÞ;

ð21Þ

8 k; ðkjÞ; r; s:

ð22Þ

The partial linearization method iterates between the search direction and line search steps until some convergence criterion is satisfied. For completeness, the solution procedure is provided in Table 1. 3.2. Line search schemes Before presenting the line search schemes, we analyze the difficulty of stepsize determination when solving the above MP formulation.

 The solution variable is the route pair flows 

;

1=βkj βkj ∑jkK¼rs j1 ∑jjK¼rs kj þ 1 βkj ðexpðθcrs ðnÞÞ1=βkj þ expðθcrs Þ j ðnÞÞ k 1

We will elaborate the line search schemes in Section 3.2. The new iterative solution can then be obtained by rs

rs

jak

If the vector hðnÞfðnÞ is a non-zero, it defines a descent direction for the original objective function. A line search in this descent direction is then conducted to find a suitable stepsize αðnÞ ¼ arg

rs

rs f kðkjÞ ,

which have a large dimension for large-scale realistic networks. The objective function has one integral term (i.e., summation of integral of link cost functions) and two entropy terms (i.e., conditional probability representing the stochasticity effect in the lower-level tree structure and marginal probability representing the similarity effect in the upper-level tree structure). Thus, it is quite computationally expensive to evaluate the objective function and its derivatives.

In the literature, there are three main types of line search schemes: predetermined, exact, and inexact. The method of successive averages (MSA) is the most widely used method in the predetermined line search scheme. It predetermines a diminishing stepsize sequence that satisfies αðnÞ-0 and ∑1 n ¼ 0 αðnÞ ¼ 1, such as f1; 1=2; 1=3; …; 1=ng. This scheme is easy to implement since it does not need to evaluate the complex objective function and/or its derivative. However, it suffers from a sublinear convergence rate [36]. To overcome the sublinear convergence rate issues of the MSA scheme, McCormick [35] suggested determining the stepsize by exactly solving a one-dimensional minimization problem. Golden section and bisection are two widely used exact line search schemes (see [43]). They are simple and robust. However, both are computationally expensive because they need to evaluate the objective function or its derivatives several times in order to solve the one-dimensional minimization problem exactly. In practice, inexact line search schemes are usually more practical and efficient. Among others, the well-known Armijo-type methods

rs

rs

8 k; r; s

ð20Þ

are perhaps the most popular inexact line search scheme [1]. Note that the Armijo scheme was originally proposed for the steepest descent approach to solving unconstrained minimization problems. Bertsekas [5] then developed a generalized Armijo scheme for constrained optimization problems. In the literature, two promising line search schemes, namely the quadratic interpolation scheme by Maher [33] and the self-regulated averaging (SRA) scheme by Liu et al. [31], were recently proposed to enhance the computational performance of determining a suitable stepsize. Below we further discuss these two promising line search schemes as well as the Armijo scheme. 1. Self-regulated averaging scheme: the SRA scheme was recently developed by Liu et al. [31] for solving the MNL SUE problem in the link domain. This scheme determines a suitable stepsize as follows: αðnÞ ¼

1 ; βðnÞ (

βðnÞ ¼

ð23Þ

βðn1Þ þ λ1

ifjjhðnÞfðnÞjj Z jjhðn1Þfðn1Þjj

βðn1Þ þ λ2

otherwise

; ð24Þ

where λ1 41 and 0oλ2 o1. Similar to the original MSA scheme, the following conditions should be satisfied in order to guarantee convergence [10,31,41]: αðnÞ 4 0;

1

∑ αðnÞ ¼ 1

n¼1

and

 lim αðnÞ ¼ 0 or

n-1

 1 ∑ ðαðnÞÞ2 o 1

n¼1

ð25Þ Fig. 2 provides an illustration of the SRA scheme. From Eq. (24), we can observe that similar to the original MSA scheme, the stepsize sequence from the SRA scheme is still strictly decreasing. However, the decreasing speed is more efficient since the next stepsize is determined according to the residual error (i.e., the deviation between the current solution and its auxiliary solution) relationship of two consecutive iterations. When the current residual error is increased compared to the previous iteration (i.e., tends to diverge), parameter λ1 41 is used to make the stepsize reduction more aggressive (e.g., at iterations 19 and 31). In contrast, when the residual error is decreased (i.e., tends to converge), parameter 0oλ2 o1 is used to make the stepsize reduction more conservative. Hence, the stepsize sequences from the SRA scheme indeed satisfy the above conditions. 2. Quadratic interpolation: the quadratic interpolation scheme was first suggested by Maher [33] also for solving the MNL

Table 1 The partial linearization method.

Step 0: Initialization: set an initial feasible solution fð0Þ and let n¼ 0; Compute the similarity index βrs kj Step 1: Search direction: update the route travel times cðnÞ based on fðnÞ; solve the partial linearized subproblem and obtain the auxiliary solution hðnÞ. The resulting search direction is hðnÞfðnÞ Step 2: Line search: solve min Z½fðnÞþ αðhðnÞfðnÞÞ. The resulting stepsize is αðnÞ 0rαr1

Step 3: Update: let fðn þ 1Þ ¼ fðnÞ þ αðnÞU ðhðnÞfðnÞÞ. Let n: ¼nþ 1 Step 4: Termination criterion: if some termination criterion is satisfied, then terminate; otherwise, go to Step 1

72

A. Chen et al. / Computers & Operations Research 43 (2014) 68–77

SUE problem in the link domain. This scheme calculates an approximate stepsize as follows:  ∇α ZðαÞα ¼ 0   αðnÞ ¼ ; ð26Þ ∇α ZðαÞα ¼ 0 þ ∇α ZðαÞα ¼ 1 where ∇α ZðαÞ is the derivative of objective function with respect to stepsize. In this scheme, ∇α ZðαÞ is evaluated twice per iteration, one at α¼0 and another at α¼1, to determine an approximate stepsize. However, this scheme can only be considered as a practical method as its convergence cannot be guaranteed in general. To demonstrate the quadratic interpolation scheme, consider the following example in Fig. 3. The

objective function with respect to stepsize is ZðαÞ ¼ 2ð1αÞ2 þ 5α2 . It is easy to see that the derivative of objective  function with respect to stepsize is ∇α ZðαÞ ¼ 14α4, ∇α ZðαÞα ¼ 0 ¼ 4, and ∇α ZðαÞα ¼ 1 ¼ 10. Hence, an approximate stepsize is ð4Þ=½ð4Þ þ 10 ¼ 0:286, which is the same as the exact stepsize for this example since Z(α) is a quadratic function. 3. Armijo: for completeness, the generalized Armijo strategy is illustrated in Fig. 4. The basic idea is as follows: Given that fðnÞ is not an optimal solution, we set αðnÞ ¼ βmðnÞ s;

ð27Þ

where mðnÞ is the first nonnegative integer m such that ZðfðnÞÞZðfðnÞðβm sÞÞ Z s∇ZðfðnÞÞT ðfðnÞfðnÞðβm sÞÞ;

ð28Þ

where Zð U Þ and ∇Zð U Þ are the objective function and its gradient; s A ð0; 1Þ; β A ð0; 1Þ, and s 4 0 are fixed scalars. From Eq. (28), one can see that in order to obtain an appropriate stepsize, only some evaluations of the objective function and its gradient are required. Therefore, we can avoid solving the computationally expensive exact line search of the one-dimensional minimization problem. Remarks.

Fig. 2. Illustration of the self-regulated averaging scheme.

Fig. 3. Illustration of the quadratic interpolation scheme.

1. For the line search step, it is impractical to compute an exact line search for the PCL SUE objective function. The reasons are rs twofold: (1) the complex decision variables, f kðkjÞ , need to be stored in order to calculate Z2 and Z3, and (2) it is computationally expensive to calculate all three terms in the PCL SUE objective function. 2. Note that Chen et al. [14] proposed two practical schemes to determine a suitable stepsize: (a) optimize the stepsize just according to Z1 since it is the dominant part of the whole objective function, and (b) optimize the stepsize according to Fisk’s objective rs rs Fisk function: Z 1 þ Z Fisk ¼ 1=θ∑rs ∑k f k ln f k . These two 2 , where Z 2 practical schemes may be more computationally efficient compared to optimizing the whole objective function. However, convergence cannot be assured, as there is no guarantee that the stepsize obtained from the approximation will decrease (or at least not worsen) the objective value. 3. The line search schemes are compared in Table 2 to highlight the computation with respect to the objective function and its derivative evaluations. Since the objective function in Eq. (5) is quite complex, evaluating the objective function and its derivatives constitutes the main part of the whole computation. Compared to other schemes (e.g., bisection, quadratic interpolation, and Armijo), the SRA scheme is easy to implement. It does not need to evaluate the complex objective function or its derivatives (with respect to solution variable or stepsize).

Fig. 4. Illustration of the generalized Armijo strategy.

A. Chen et al. / Computers & Operations Research 43 (2014) 68–77

4. Computational results in a real network In this section, a set of numerical experiments using a real network in the city of Winnipeg, Canada, is conducted to examine the efficiency of the path-based partial linearization algorithm with two recent advanced line search schemes (SRA and quadratic interpolation) and three traditional line search schemes (MSA, bisection, and Armijo) for solving the PCL SUE model. The Winnipeg network, shown in Fig. 5, consists of 154 zones, 1067 nodes, 2535 links, and 4345 O–D pairs. The network structure, O–D trip table, and link performance parameters are from the Emme/2 software [28]. To set up a fair comparison of different line search schemes, we use a working route set generated by Bekhor et al. [3]. In this working route set, there are totally 174,491 routes with an average of 40.1 routes for each O–D pair, and the maximum number of routes generated for any O–D pair is 50. Note that this working route set has been adopted to examine the length-based and congestion-based C-logit SUE models in Zhou et al. [46] and Xu et al. [45], the cross-nested logit (CNL) SUE model in Bekhor

Table 2 Comparisons of the line search schemes. Line search schemes

Objective function evaluation

Derivative evaluation

MSA SRA Bisection

No No

No No Yes (with respect to stepsize) Yes (with respect to stepsize) Yes (with respect to solution)

Quadratic interpolation Armijo

No No Yes

73

et al. [4], various logit-based SUE model in Chen et al. [15], and the nonadditive traffic equilibrium model in Chen et al. [17]. The convergence criterion used in the numerical comparison is based on the mean square error (MSE) of the generalized route costs and the minimum generalized O–D cost given in Eq. (29). Note that the MSE is a direct measure of the equilibrium conditions given by the KKT conditions [2] ! ! 1 1 rs rs 2 rs MSE ¼ ∑∑ ðgcrs ln f π Þ ¼ þ ∑ ðf Þ ; gc rs k jðkjÞ k k N rs k θ jak 0 rs 11 ! ! ! βkj θcrs θcrs j k AA; ð29Þ  ln ∑ @βrs þ exp  rs kj exp  rs βkj βkj jak where gcrs k is the generalized cost on route k between O–D pair rs; π rs is the minimum O–D cost between O–D pair rs; and N is the number of routes. The tolerance error of MSE is set at 1E  10; the dispersion parameter (θ) is set at 0.1; the initial stepsize of MSA, SRA, and Armijo schemes are all set at 1.0; and the stopping error of bisection method is 1E  4. Other parameters are set as: λ1 ¼1.9, and λ2 ¼0.1 for the SRA scheme; β ¼0.6, s ¼1.0, and s ¼0.5 for the Armijo rule. The path-based partial linearization algorithm is coded in Intel Visual FORTRAN 9.1 and run on a 3.80 GHz processor and 2.00 GB of RAM. 4.1. Convergence characteristics The convergence characteristics of the solution algorithm with various line search schemes are shown in Fig. 6 and Table 3. From Fig. 6, the SRA, Armijo, quadratic interpolation, and bisection schemes can promise convergence, while the MSA scheme has difficulty in converging to the desired accuracy level within the maximum amount of CPU time (600 s) allowed. The inexact line

Fig. 5. Winnipeg network.

74

A. Chen et al. / Computers & Operations Research 43 (2014) 68–77

Fig. 6. Convergence characteristics of different line search schemes.

Table 3 Comparison of different line search schemes. Stepsize scheme

No. of iterations

No. of objective function evaluations

No. of derivative evaluations

CPU time (s)

MSA SRA Bisection Quadratic interpolation Armijo

75 15 10 10 12

– – – – 12

– – 150 20 26

Not converged 155 380 134 174

search schemes (SRA, Armijo, and quadratic interpolation) use about half of the CPU time compared to the exact line search scheme (bisection) when reaching a MSE of 1E 10. The computational efforts required by each line search scheme are shown in Table 3 in terms of the number of iterations, number of objective function evaluations, number of derivative evaluations, and CPU time. Note that from Fig. 6, we can observe that using the SRA, quadratic interpolation, and Armijo schemes have more computational benefits compared to the bisection and MSA schemes. Despite the low number of iterations required by the bisection scheme to reach convergence, majority of the computational efforts are spent on evaluating the complex derivatives of the objective function in Eq. (5). For further investigation, Fig. 7 plots the stepsize trajectories of these three inexact line search schemes. Fig. 7(a) plots the stepsize trajectory in log scale for the MSA and SRA schemes. One can see that the stepsize sequences of both schemes are strictly decreasing. In this experiment, λ2 in the SRA scheme (see Eq. (24)) is active for all iterations. This means that the flow difference jjhðnÞfðnÞjj keeps decreasing during the iteration process and the iterative solutions are getting closer to the optimal solution. For the exact and inexact line search schemes shown in Fig. 7(b), we can observe that the stepsizes are not strictly decreasing, but all three stepsizes are within an acceptable range. The stepsizes approximated by the quadratic interpolation scheme exhibit similar pattern as the exact stepsize determined by the bisection scheme, while the stepsizes generated by the Armijo scheme in Eq. (28) remains constant after the first few iterations. 4.2. Sensitivity analysis The above analysis indicates that the SRA, quadratic interpolation, and Armijo schemes outperform the bisection and MSA schemes in terms of computational efficiency. In the following analyses, we examine the robustness and sensitivity of the path-based partial linearization algorithm with these three promising line search schemes. Specifically, the effects of travel demand level and dispersion

Fig. 7. Stepsize trajectories of different line search schemes. (a) Stepsize comparison between the MSA and SRA schemes. (b) Stepsize comparison between one exact and two inexact line search schemes.

parameter on the computational performance are investigated in Fig. 8. First, we analyze the impact of different demand levels (7 40% of the base demand level) on the computational performance. From Fig. 8(a), we can observe that CPU time increases when the demand level increases. It seems that the SRA and quadratic interpolation schemes exhibit a linear trend, while the Armijo scheme exhibits a nonlinear trend for the same convergence tolerance (i.e., MSE¼ 1E10 in Eq. (29)). Regardless of the demand level, the Armijo scheme requires more computational efforts compared to the SRA and quadratic interpolation schemes. This is because the Armijo scheme requires calculating both the objective function and its derivatives to determine an acceptable stepsize with a sufficient decrease in objective value (not necessarily the optimum) as indicated in Eq. (28) and Table 3. Fig. 8(b) examines the impact of dispersion parameter value (θ) on the performance of three line search schemes. In this experiment, we vary θ from 0.01 to 10 (note that the base dispersion parameter value is 0.1 used in Section 4.1). When θ tends to be zero, the objective function in Eq. (5) is dominated by the two entropy terms (Z2 and Z3). Flow allocations tend to spread to more routes (i.e., stochasticity effect) with consideration of route overlapping (i.e., similarity effect) irrespective of the route costs (i.e., congestion effect). On the other hand, flow allocations tend to concentrate on the minimum cost routes when θ is large (i.e., the objective function in Eq. (5) is dominated by Z1). For all three line search schemes, CPU time increases as θ increases.

5. Practical implications of the PCL SUE model In this section, we examine the practical implications of implementing the PCL SUE model in the Winnipeg network. To this end, we investigate how the PCL SUE model handles the effects of

A. Chen et al. / Computers & Operations Research 43 (2014) 68–77

75

Fig. 8. Computational efforts under different demand levels and dispersion parameter values. (a) Effect of demand level and (b) Effect of dispersion parameter.

Fig. 9. Flow allocation comparison between DUE and PCL SUE models. (For interpretation of the references to color in this figure caption, the reader is referred to the web version of this article.)

congestion, stochasticity, and similarity in comparison with the multinomial logit (MNL) SUE model and the deterministic user equilibrium (DUE) model. Figs. 9 and 10 provide the flow allocation comparison between DUE and PCL SUE models and MNL SUE and PCL SUE models, respectively. Each figure consists of three sets of results: link flow allocation (left panel), link flow difference between two models (middle panel), and route flow allocation (right panel). Specifically, the link flow allocation comparison is given in terms of link volume to capacity (V/C) ratio distribution for each model. The link flow difference between two models is depicted in a GIS map to facilitate visualization. Links are coded in color and thickness to highlight the link flow differences (e.g., darker color in Fig. 10 indicates that MNL SUE link flows are higher than PCL SUE link flows in the city center). The route flow allocations are compared

based on the percentage of O D demand assigned to the used routes between the DUE and PCL SUE models in Fig. 9. In addition, Fig. 10 uses the mean absolute percentage error (MAPE) to measure route flow difference between the MNL SUE and PCL SUE models according to the average similarity index (ASI) of each route (i.e., rs ASIrs k ¼ 1=jKrs jð∑j a k skj Þ, where jKrs j is the number of routes in O D pair rs). Note that rs

MAPEm ¼

rs

jf k;PCL f k;MNL j 1 ∑ ∑  100%; rs jASIm j rs k A ASIm f k;PCL

  rs where ASI m  is the number of routes with an ASI in category m, f k;PCL rs and f k;MNL are the flows on route k of O–D pair rs assigned by the PCL and MNL SUE models, respectively.

76

A. Chen et al. / Computers & Operations Research 43 (2014) 68–77

60%

MNL

40% 30%

676 523

486

20%

183

10% 0%

50 0.20

0.60

1.00

1.40

1.80

5 2.40

V/C ratio 60%

Number of Routes

30%

Number of Routes

% of links

40%

674 525

496

20%

179

10% 0%

45 0.20

0.60

1.00

1.40

35

1,000,000

PCL

50%

40

MAPE(%)

1.80

4 2.40

113,740 49,345

100,000 10,000

7,196

30 25

3,740

1,000

20

422

15

48

100 10 1

MAPE(%)

% of links

50%

10 5

0.05

0.15

0.25

0.35

0.45

0.55

0

ASI of Route k

V/C ratio Fig. 10. Flow allocation comparison between MNL SUE and PCL SUE models.

In terms of the link V/C ratio distributions, there is a clear difference between the DUE and PCL SUE models (see left panel of Fig. 9), while the difference between the MNL SUE and PCL SUE models is not significant (see left panel of Fig. 10). The reason for the difference is that the DUE model only considers the congestion effect (i.e., Z1 in Eq. (5)), while the two SUE models consider both congestion and stochasticity effects (i.e., Z1 and Z2 in Eq. (5)). In other words, the DUE model only assigns flows to the routes that minimize congestion according to Wardrop’s first principle: all used routes have route costs equal to the minimum O–D cost, while unused routes have route costs higher than or equal to the minimum O–D cost. As for the SUE models, all routes are used. However, lower cost routes are preferred. This result is illustrated in the right panel in Fig. 9 that there is a significant difference on the percentage of demand allocated to the used routes by the DUE and PCL SUE models. In the middle panel of Fig. 9, it shows the DUE model assigns significantly less flows to the links around the city center than the PCL SUE model does as indicated by the lighter color. For this particular network, it seems that the assigned link V/C distributions in the left panel of Fig. 10 are not very different between MLN SUE and PCL SUE models. However, a closer look at the right panel reveals that the MAPE of route flows is indeed significantly different as the ASI increases (i.e., more overlapping between routes). This is attributed to the similarity effect (i.e., Z3 in Eq. (5)) of the PCL SUE model. The middle panel further illustrates the link flow difference between the two SUE models, particularly the link flows in the central areas where roads are much denser with more overlapping between routes. The MNL SUE model assigns more flows to routes passing through the central areas as indicated by the links in darker color. The PCL SUE model allocates less flows on these routes compared to the MNL SUE model, because similarity among routes is explicitly considered in the PCL SUE model.

6. Concluding remarks In this paper, we presented a path-based partial linearization algorithm for solving the paired combinatorial logit (PCL) stochastic

user equilibrium (SUE) formulation with recent advances in line search strategies to improve the computational efforts. Specifically, the self-regulated averaging (SRA) and quadratic interpolation schemes were examined along with the three traditional line search methods (i.e., the predetermined method of successive averages (MSA) scheme, the exact bisection scheme, and the inexact Armijo scheme). Numerical results using a real-size Winnipeg network revealed the SRA and quadratic interpolation schemes were more efficient than the traditional schemes. The sensitivity analysis with respect to various demand levels and dispersion parameter values also supported the findings that the SRA and quadratic interpolation schemes are indeed more robust compared to the traditional line search methods. The computational efficiency and robustness of the SRA and quadratic interpolation schemes are attributed to their smart stepsize determination mechanism. They substantially reduce or even avoid evaluating the complex PCL SUE objective function and its derivatives. In addition, numerical results were also conducted to examine how the PCL SUE model handles the effects of congestion, stochasticity, and similarity in comparison with the multinomial logit (MNL) SUE model and the deterministic user equilibrium (DUE) model. The numerical results indeed demonstrated that the PCL SUE model is capable of handling the route overlapping problem through the similarity index and the hierarchical logit structure through its extended objective function, which includes a user equilibrium term and two entropy terms. This alleviates the inability of the MNL SUE model by accounting for the correlation among routes. However, the other theoretical drawback related to the inability to account for perception variance for trips of different trip lengths also needs to address. For future research, we should explore the issue of scaling effect to account for varying perceptions across trips of different lengths, and how it can affect route choice probabilities and network flow allocations (see Chen et al. [15] for a recent study on the scaling issue and route overlapping problem in various logit-based SUE models). In addition, computational experiments on different network topologies, congestion levels, and zonal densities should be performed to assess the effectiveness and efficiency of the proposed path-based partial linearization algorithm.

A. Chen et al. / Computers & Operations Research 43 (2014) 68–77

Acknowledgment This research was partly supported by the National Science Foundation (NSF) CAREER grant by the United States, the Oriental Scholar Professorship Program sponsored by the Shanghai Ministry of Education in China, and the National Research Foundation of Korea grant funded by the Korea Government (MEST) (NRF-2010–0029443). These supports are gratefully acknowledged. The authors are also grateful to the editor, Professor Stefan Nickel, and two referees for providing useful comments and suggestions for improving the quality and clarity of the paper. References [1] Armijo L. Minimization of functions having continuous partial derivatives. Pacific Journal of Mathematics 1966;16(1):1–3. [2] Bekhor S, Prashker JN. Formulations of extended logit stochastic user equilibrium assignments. In: Proceedings of the 14th international symposium on transportation and traffic theory. Jerusalem, Israel: Elsevier; July 20–23, 1999. p. 351–72. [3] Bekhor S, Toledo T, Prashker JN. Effects of choice set size and route choice models on path-based traffic assignment. Transportmetrica 2008;4(2):117–33. [4] Bekhor S, Toledo T, Reznikova L. A path-based algorithm for the cross-nested logit stochastic user equilibrium. Computer-Aided Civil and Infrastructure Engineering 2008;24(1):15–25. [5] Bertsekas DR. On the Goldstein–Levitin–Polyak gradient projection method. IEEE Transactions on automatic control 1976;21(2):174–84. [6] Bell MGH. Alternatives to Dial’s logit assignment algorithm. Transportation Research Part B 1995;29(4):287–95. [7] Bell MGH, Lam WHK, Poss G, Inaudi D. Stochastic user equilibrium assignment and iterative balancing. In: Proceedings of the 12th international symposium on transportation and traffic theory. Berkeley, California, USA: Elsevier; July 21–23, 1993. p. 427–39. [8] Bell MGH, Shield CM, Busch F, Kruse G. A stochastic user equilibrium path flow estimator. Transportation Research Part C 1997;5(3-4):197–210. [9] Ben-Akiva M, Bergman M, Daly A, Ramaswamy R. Modeling inter-urban route choice behavior. In: Proceedings of the ninth international symposium on transportation and traffic theory. Delft, The Netherlands: VSP; July 11–13, 1984. p. 299–330. [10] Blum JR. Multidimensional stochastic approximation methods. The Annals of Mathematical Statistics 1954;25(4):737–44. [11] Cascetta E, Russo F, Vitetta A. Stochastic user equilibrium assignment with explicit path enumeration: comparison of models and algorithms. In: Proceedings of transportation systems. Chania, Greece: Elsevier; July 16–18, 1997. p. 1031–7. [12] Chen A, Chootinan P, Recker W. Examining the quality of synthetic origindestination trip table estimated by path flow estimator. Journal of Transportation Engineering 2005;131(7):506–13. [13] Chen A, Chootinan P, Recker W. Norm approximation method for handling traffic count inconsistencies in path flow estimator. Transportation Research Part B 2009;43(8):852–72. [14] Chen A, Kasikitwiwat P, Ji Z. Solving the overlapping problem in route choice with paired combinatorial logit model. Transportation Research Record 2003;1857:65–73. [15] Chen A, Pravinvongvuth S, Xu X, Ryu S, Chootinan P. Examining the scaling effect and overlapping problem in logit-based stochastic user equilibrium models. Transportation Research Part A 2012;46(8):1343–58. [16] Chen A, Ryu S, Chootinan P. L-infinity norm for handling traffic count inconsistencies in path flow estimator: formulation and solution algorithm. Journal of Transportation Engineering 2010;136(6):565–75.

77

[17] Chen A, Zhou Z, Xu X. A self-adaptive gradient projection algorithm for the nonadditive traffic equilibrium problem. Computers & Operations Research 2012;39(2):127–38. [18] Chen M, Alfa AS. Algorithms for solving Fisk’s stochastic traffic assignment model. Transportation Research Part B 1991;25(6):405–12. [19] Chootinan P, Chen A, Recker W. Improved path flow estimator for estimating origin-destination trip tables. Transportation Research Record 2005;1923:9–17. [20] Chu C. A paired combinatorial logit model for travel demand analysis. In: Proceedings of the fifth world conference on transportation research, vol. 4, Ventura, CA, USA; 1989. p. 295–309. [21] Daganzo C, Sheffi Y. On stochastic models of traffic assignment. Transportation Science 1977;11(3):253–74. [22] Damberg O, Lundgren JT, Patriksson M. An algorithm for the stochastic user equilibrium problem. Transportation Research Part B 1996;30(2):115–31. [23] Dial R. A probabilistic multipath traffic assignment model which obviates path enumeration. Transportation Research 1971;5:83–111. [25] Fisk C. Some developments in equilibrium traffic assignment. Transportation Research Part B 1980;14(3):243–55. [26] Gliebe J.P, Koppleman F.S, Ziliaskopoulos A. Route choice using a paired combinatorial logit model. Paper presented at the 78th annual meeting of the transportation research board, Washington, DC, USA; Jan 10–14, 1999. [27] Huang HJ, Bell MGH. A study on logit assignment which excludes all cycles flows. Transportation Research Part B 1998;32(6):401–12. [28] INRO Consultants. Emme/2 user’s manual: release 9.2. Montréal; 1999. [29] Koppelman FS, Wen CH. The paired combinatorial logit model: properties, estimation and application. Transportation Research Part B 2000;34(2):75–89. [30] Leurent FM. Curbing the computational difficulty of the logit equilibrium assignment model. Transportation Research Part B 1997;31(4):315–26. [31] Liu H, He X, He BS. Method of successive weighted averages (MSWA) and selfregulated averaging schemes for solving stochastic user equilibrium problem. Network and Spatial Economics 2009;9(4):485–503. [32] Maher M. SAM—a stochastic assignment model. In: Griffiths JD, editor. Mathematics in transport planning and control. Oxford: Oxford University Press; 1992. [33] Maher M. Algorithms for logit-based stochastic user equilibrium assignment. Transportation Research Part B 1998;32(8):539–49. [34] Maher M, Hughes P. A probit-based stochastic user equilibrium assignment problem. Transportation Research Part B 1997;31(4):341–55. [35] McCormick GP. Anti-zig-zagging by bending. Management Science 1969;15 (5):315–9. [36] Nagurney A. Network economics: a variational inequality approach. Kluwer Academic Publishers; 1999. [37] Patriksson M. The traffic assignment problem: models and methods. Utrecht, The Netherlands: VSP; 1994. [38] Prashker JN, Bekhor S. Route choice models used in the stochastic user equilibrium problem: a review. Transport Reviews 2004;24:437–63. [39] Prato CG. Route choice modeling: past, present and future research directions. Journal of Choice Modelling 2009;2(1):65–100. [40] Pravinvongvuth S, Chen A. Adaptation of the paired combinatorial logit model to the route choice problem. Transportmetrica 2005;1(3):223–40. [41] Robbins H, Monro S. A stochastic approximation method. The Annals of Mathematical Statistics 1951;22(3):400–7. [42] Rosa A, Maher MJ. Algorithms for solving the probit path-based stochastic user equilibrium traffic assignment problem with one or more user classes. In: Proceedings of the 15th international symposium on transportation and traffic theory. Adelaide, Australia: Elsevier; July 16–18, 2002. p. 371–92. [43] Sheffi Y. Urban transportation networks: equilibrium analysis with mathematical programming methods. Englewood Cliffs, NJ: Prentice-Hall; 1985. [44] Sheffi Y, Powell W. An algorithm for the equilibrium assignment problem with random link costs. Networks 1982;12(2):191–207. [45] Xu X, Chen A, Zhou Z, Behkor S. Path-based algorithms for solving C-logit stochastic user equilibrium assignment problem. Transportation Research Record 2012;2279:21–30. [46] Zhou Z, Chen A, Bekhor S. C-logit stochastic user equilibrium model: formulations and solution algorithm. Transportmetrica 2012;8(1):17–41.