A hybrid ant colony optimization algorithm for a multi-objective vehicle routing problem with flexible time windows

A hybrid ant colony optimization algorithm for a multi-objective vehicle routing problem with flexible time windows

Accepted Manuscript A hybrid ant colony optimization algorithm for a multi-objective vehicle routing problem with flexible time windows Huizhen Zhang...

3MB Sizes 0 Downloads 100 Views

Accepted Manuscript

A hybrid ant colony optimization algorithm for a multi-objective vehicle routing problem with flexible time windows Huizhen Zhang, Qinwan Zhang, Liang Ma, Ziying Zhang, Yun Liu PII: DOI: Reference:

S0020-0255(16)32120-X https://doi.org/10.1016/j.ins.2019.03.070 INS 14410

To appear in:

Information Sciences

Received date: Revised date: Accepted date:

20 December 2016 23 March 2019 26 March 2019

Please cite this article as: Huizhen Zhang, Qinwan Zhang, Liang Ma, Ziying Zhang, Yun Liu, A hybrid ant colony optimization algorithm for a multi-objective vehicle routing problem with flexible time windows, Information Sciences (2019), doi: https://doi.org/10.1016/j.ins.2019.03.070

This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

ACCEPTED MANUSCRIPT

A hybrid ant colony optimization algorithm for a multi-objective vehicle routing problem with flexible time windows Qinwan Zhang†

Liang Ma‡

March 22, 2019

Abstract

Ziying Zhang§

Yun Liu¶k

CR IP T

Huizhen Zhang∗

M

AN US

Abstract: In this paper, we present a multi-objective vehicle routing problem with flexible time windows (MOVRPFlexTW). In this problem, a fleet of vehicles can service a set of customers earlier and later than the required time with a given tolerance. This flexibility enables a logistics company to save distribution costs at the expense of customer satisfaction. This paper uses a direct interpretation of the vehicle routing problem with flexible time windows (VRPFlexTW) as a multi-objective problem, where the total distribution costs (including travel costs and fixed vehicle costs) are minimized and the overall customer satisfaction is maximized. We propose a solution strategy based on ant colony optimization and three mutation operators, which incorporates the concept of Pareto optimality for multi-objective optimization. The performance of the proposed approach was evaluated using the well-known benchmark Solomon’s problems. Our experimental results show that the suggested approach is effective, because it provides solutions that are comparative to the best known results in the literatures. Finally, we not only highlight the advantages of MOVRPFlexTW when compared with the single objective VRPFlexTW, but also illustrate its applicability and validation by analyzing a real case.

Introduction

PT

1

ED

Key words: Vehicle routing problem, Flexible time windows, Ant colony optimization algorithm, Multi-Objective Optimization, Mutation operator.

AC

CE

The vehicle routing problem (VRP) is a well-known NP-hard combinatorial problem arising in transportation, logistics, scheduling and supply chain management. The basic VRP determines an optimal set of routes for a set of vehicles to deliver goods and/or services from a supply depot to a set of geographically dispersed customers in a cost-effective manner. As the VRP appears in real life, it may have several classes of additional constraints, such as limits on the vehicle capacity [46, 48], time windows for serving customers [3, 4, 6, 5], route lengths, or the number of hours worked by a driver or a distribution clerk. For a recent complete review on the classification of different VRP variants, see [29, 9]. As with basic VRP, most VRP variants are known to be NP-hard. The capacitated vehicle routing problem with (hard) time windows (VRPTW) is an important variant of the VRP, and has been drawing extensive attention from researchers. It aims to find the optimal set ∗

Corresponding author: [email protected], School of Management,University of Shanghai for Science and Technology, Mail box 459, 516 Jungong Road, Shanghai 200093,China † School of Management,University of Shanghai for Science and Technology ‡ School of Management,University of Shanghai for Science and Technology § School of Materials Engineering, Shanghai University of Engineering Science ¶ School of Management,University of Shanghai for Science and Technology

1

ACCEPTED MANUSCRIPT

of routes for a set of identical vehicles with restricted capacity so that all customers, whose demands are known, are serviced exactly once by a single vehicle within each time window defined by the earliest and latest times. Moreover, each vehicle should start and end its route at the given depot. The vehicle is permitted to arrive at the customer point before the earliest time, and wait at no cost until service becomes allowable, but it is not permitted to arrive at the customer point after the latest time.

CR IP T

The VRPTW requires the time window constraints to be strictly satisfied. However, in some real-world situations( e.g. newspaper delivery, non-emergency parcel delivery, furniture delivery), the time window constraints can be violated to a certain extent. These violations of time windows are considered in the vehicle routing problem with soft time windows (VRPSTW) and the vehicle routing problem with flexible time windows (VRPFlexTW). The VRPSTW assumes that some or all customer time windows can be violated [2], and customers are available to be serviced at any point of the planning horizon [43]. The VRPFlexTW assumes that some or all customer time windows can be violated to a certain extent [43]. The VRPFlexTW allows the vehicles to serve the customers before and after the earliest and latest times with a given tolerance. Therefore, the VRPSTW can be considered as a special case of the VRPFlexTW where the violation of time windows is unbounded, i.e., infinite flexible bounds [43].

PT

ED

M

AN US

The violation of time window constraints may lead to reducing the total distance traveled and the number of vehicles used for the service at the expense of customer satisfaction [43]. Thus, when minimizing the routing costs and maximizing the customer satisfaction (or minimizing the customer inconvenience) affected by the time window violations are defined, these two different objectives frequently conflict. Customer inconvenience for being visited too late and (possibly) too early with respect to the desired time window is typically quantified and the objective functions in VRPSTW and VRPFlexTW are generally modeled as a weighted combination of routing costs and a penalty corresponding to the customer inconvenience [41]. However, how to measure the customer inconvenience and quantify the relative weights of routing and customer inconvenience are still open research questions [41]. For these reasons, adopting a multi-objective point of view can be advantageous and the optimization process needs to provide a range of solutions that represent the trade-offs between the objectives [21], rather than a single solution. Therefore, this article studies the multi-objective vehicle routing problem with flexible time windows (MOVRPFlexTW). This problem is based on the VRPFlexTW proposed in [43] which aggregates the travel costs, fixed costs for using the vehicles to delivery service, and penalty costs incurred for early and late service into a single objective function. MOVRPFlexTW aims to minimize the routing costs, which consist of travel costs and the fixed costs of the vehicles used for the service, while also maximizing the customer satisfaction.

AC

CE

To find some trade-off solutions between the routing costs and customer satisfaction, in this paper we propose a new Pareto-based multi-objective approach that combines the ant colony optimization (ACO) algorithm and mutation operators to solve MOVRPFlexTW. The motivation of using hybrid ACO (HACO) in our work is multifold. Firstly, the ACO algorithm is a swarm based meta-heuristic approach for solving combinatorial optimization problems which has produced good solutions within a reasonable computation time (e.g., multidimensional knapsack problem [39], traveling salesman problem [13] and job shop scheduling problem), especially many ACO-based approaches have been proposed to solve different variants of the VRP in the literatures (e.g., vehicle routing problem with simultaneous pickup and delivery [28], heterogeneous vehicle routing problem with mixed backhaul [49] and multi-compartment vehicle routing problem [38]). Secondly, swarm optimization algorithms maintain a pool of solutions during the search process. Therefore, swarm optimization algorithms are particularly suited for finding many trade-off solutions in a single run [31]. Thirdly, a metaheuristic is a general algorithmic framework for finding solutions to optimization problems within which local heuristics are guided and adapted to effectively explore the solution space [25]. Finally, different meta-heuristics have already been applied to other variants of VRP and provided good results [37, 46, 32, 7, 26]. All these factors have strongly motivated us to employ hybrid ACO algorithm to

2

ACCEPTED MANUSCRIPT

solve MOVRPFlexTW for the first time.

CR IP T

The main contributions of this paper are as follows. Firstly, we have introduced and modeled the MOVRPFlexTW. Secondly, to produce high-quality solutions, we have developed an ACO metaheuristic based algorithm to solve the MOVRPFlexTW problem. We have not only hybridized the ACO meta-heuristic with mutation operators which exploit the neighborhood search space, but also incorporated the concept of Pareto optimality for multi-objective optimization into this hybrid algorithm. This hybridization is an effective way to find trade-off solutions for the MOVRPFlexTW. Finally, we have conducted extensive experiments using publicly available benchmark instances to assess the operational gains of using multiple objectives and flexible time window. We have also analyzed the performance of our algorithm experimentally. As reported later, our approach can effectively generate trade-off solutions to MOVRPFlexTW.

2

AN US

The remainder of this paper is organized as follows. Section 2 contains a brief overview of the relevant literatures. Section 3 formally describes MOVRPFlexTW. Section 4 presents our Pareto-based hybrid ant colony optimization algorithm for solving MOVRPFlexTW. Section 5 describes our computational experiments that investigated the performance of the proposed algorithm and the advantageous of MOVRPFlexTW, respectively. Section 6 applies the presented MOVRPFlexTW on a real case study. Section 7 presents our concluding remarks.

Literature review

2.1

M

The aim of this section is to provide a brief but comprehensive overview of previous work related to the vehicle routing problem with time windows using multi-objective optimization techniques.

The vehicle routing problem with time windows

AC

CE

PT

ED

There have been extensive investigations into VRPTW because of its high computational complexity and usefulness in real life, especially with respect to the extremely important economic impact on all logistic systems. Some exact methods have been proposed for the VRPTW, including dynamic programming, column generation, and Lagrangian relaxation based methods [18]. However, exact methods often perform poorly when solving intermediate and large VRPTW instances. Thus, heuristic and meta-heuristic methodologies, which seek approximate solutions in polynomial time instead of exact methods with intolerable computing cost, are very suitable for solving larger VRPTW instances. Many different heuristics and meta-heuristics for solving the VRPTW have been proposed, such as genetic algorithm [22], simulated annealing [5], ACO, particle swarm optimization, and tabu search. Along with the basic features of each solution method, experimental results for benchmark test problems have also been presented and analyzed in the literatures. Br¨aysy and Gendreau surveyed route construction, local search algorithms, and meta-heuristics in [10] and [11]. Interested readers can refer to the review papers [18, 3, 11] for details of mathematical formulations, exact methods and meta-heuristic algorithms for the VRPTW. VRPSTW and VRPFlexTW are relaxations of VRPTW (in which time windows are treated as hard constraints). The time windows are unboundedly relaxed in VRPSTW and relaxed according to a fixed tolerance in VRPFlexTW. That is, the desired time windows can be violated at the cost of customer satisfaction in VRPSTW and VRPFlexTW. In particular, customers are assumed to be available for the service at any moment of the planning horizon in VRPSTW. VRPFlexTW is distinct from VRPSTW in that the former considers a bounded time windows violation. Vehicles in VRPFlexTW are allowed to provide both early and late services with a limited penalty, and to wait at no cost in the case of early 3

ACCEPTED MANUSCRIPT

arrival (until the extended earliest time) without any waiting time limitations [43]. When compared with VRPTW, VRPSTW and VRPFlexTW have the largest and larger feasible solution spaces [43], respectively.

CR IP T

Compared with the VRPTW, there have been significantly fewer investigations into VRPSTW and VRPFlexTW. The majority of literatures on VRPSTW and VRPFlexTW consider a linear penalty function for the violations of time windows. Chiang and Russell [14] and Balakrishnan [2] developed different tabu searches for the VRPSTW problem. Taillard et al. [44] also proposed a tabu search for the VRPSTW, in which a linear penalization is only applied to late visits. Calvete et al.[12] applied goals programming to VRPSTW with a linear penalization for early and late visits. Ibaraki et al. [24] measured time window violations as a nonconvex, piecewise linear, and time dependent function. Zare-Reisabadi et al. [50] developed an ant colony algorithm with local searches for the VRPSTW with a linear penalization for early and late visits. Bhusiri et al. [8] solved VRPSTW using column generation, which penalized both early and late arrivals, but the arrival time was bounded in an outer time window. Salani et al. [41] solved VRPSTW using a branch-and-cut-and-price technique. Koskosidis et al. [30] proposed a heuristic algorithm to decompose VRPSTW into an assignment component and a series of routing and scheduling components.

Multi-objective VRPTW

ED

2.2

M

AN US

Figliozzi [19] proposed an iterative route construction and improvement algorithm for the vehicle routing problem, which considered bounded early and late visits. Although this problem is called VRPSTW in [19], in fact it is a VRPFlexTW. Tas¸ et al. [43] developed a solution procedure that constructs feasible vehicle routes for VRPFlexTW via a tabu search algorithm. Qureshi et al. [35, 36] developed a column generation based exact algorithm for the vehicle routing and scheduling problem with semi soft time windows (VRPSSTW). The VRPSSTW presented in [35, 36] only considered an upper bound on the late time window violations, so this problem can be viewed as a special case of the VRPFlexTW.

AC

CE

PT

Jozefowiez et al. [27] classified the different objectives of variants of the VRP according to the component of the problem with which they are associated, i.e. the route, the resources, and the node/arc activity. The most common objective related to the route is to minimize the cost of the generated solutions with respect to distance traveled and/or time required. Other authors aimed to reduce the imbalance among the workloads of the vehicles, where workloads can be expressed as the number of customers visited, the quantity of goods delivered [6], the time required, or the route length [6]. Most studies dealing with objectives related to node/arc activity replaced the time windows with an objective that minimizes either the number of violated time windows constraints [19] or the total wait time of the customers and/or drivers due to earliness or lateness, or that maximizes customer satisfaction. The main objectives related to resources can be roughly classified into vehicle-related objectives and goods-related objectives. The majority of vehicle-related objectives focus on minimizing the number of vehicles or maximizing vehicle cost-effectiveness in terms of time or capacity. Goods-related objectives typically consider the nature of the goods. For instance, if the merchandise is perishable, we should minimize the deterioration. Many techniques have recently been proposed for solving multi-objective VRPTW. These strategies can be divided into three general categories: hierarchical techniques, aggregating approaches, and Pareto methods [6, 5]. Hierarchical techniques, which establish a hierarchy among the objectives to be optimized at the same time, intrinsically consider one objective as more important than others [5]. Aggregating approaches include all of objectives in a single objective function using mathematical transformations. As with hierarchical techniques, aggregating approaches are also unable to find all the Pareto optimal solutions. Additionally, the main disadvantage of aggregating approaches is that 4

ACCEPTED MANUSCRIPT

the weights are difficult to determine precisely [6, 5]. Pareto methods apply the notation of Pareto dominance to evaluate the quality of a solution or to compare solutions [27], so they can overcome the drawbacks of two other methods and are becoming increasingly popular. Pareto methods are mainly used in evolutionary and swarm algorithms. Evolutionary and swarm algorithms create a pool of solutions (called a population) during the search process. Therefore, these algorithms are particularly suited for obtaining a set of trade-off solutions for multi-objective problems [31]. Many pioneers have proposed Pareto-based hybrid methods for multi-objective VRPTW by combining evolutionary/swarm algorithms and local searches, heuristics, and/or exact methods [6, 5, 22].

3

AN US

CR IP T

There are few studies on multi-objective VRPFlexTW. Therefore, a major research effort is necessary in this field, including hybrid methods that take advantage of different heuristic evolutionary/swarm algorithms, exact methods, and local searches to improve the quality of the solutions and/or to reduce the computing time. In this paper, we aim to solve the multi-objective vehicle routing problem with flexible time windows using a Pareto-based hybrid ACO algorithm, which minimizes the total routing costs and maximizes the average customer satisfaction. This formulation allows us to obtain solutions with different customer satisfaction levels and costs incurred by the traveled distance and the number of vehicles used, so that the decision maker can decide which solution is the most suitable according to certain criteria.

A multi-objective optimization model for the VRPFlexTW

ED

M

A logistics company will typically make appointments with customers using order information or by telephone before distributing the goods or service. However, this often leads to distribution vehicles arriving earlier or later than the appointment time because of, for example, an unreasonable distribution scheduling or traffic jams. In real life, customers have a certain tolerance to services arriving earlier and later than the arranged time. This does not lead to significant losses to the logistics company in the short term, but an early or postponed service can decrease customer satisfaction and have a negative effect on the income and reputation of the company in the long term. Therefore, a logistics company should aim to improve customer satisfaction while minimizing the total traveled distance and number of vehicles.

PT

In this section, we consider a multi-objective VRPFlexTW formulation that aims to simultaneously maximize customer satisfaction while the vehicle routes are subject to capacity and time windows constraints and minimize the costs related to the traveled distance and number of vehicles.

AC

CE

The VRPTW can be modeled as a directed complete graph G(V, E) [18], with a vertex set V = {0, 1, · · · , n} and an edge set E = {hi, ji|i, j ∈ V and i 6= j}. Here, the central depot is denoted as 0, and customers are represented as i(i ∈ V and i 6= 0). M = {1, 2, · · · , m} is a set of vehicles. Each arc in the network describes the connection between two vertices. Each vehicle with limited capacity bk (k ∈ M ) and maximum route time rk starts from and returns back to the depot after serving a number of customers. Each customer i ∈ V (i 6= 0) has a variable demand di and must be visited only once by exactly one of the vehicles. The sum of all the demands serviced by vehicle k must be less than or equal to its capacity bk . Similarly, the total travel time of vehicle k cannot exceed its allowed route time rk . Additionally, the satisfaction µi (zi ) (zi is the time when vehicle arrives at customer i) is equal to 1 if customer i(i ∈ V ) is serviced within the appointed time interval [ei , li ]. A customer i cannot be served later than the extended latest time Li . A vehicle arriving at customer i earlier than the extended earliest time Ei incurs waiting time wi . Although the waiting time is permitted at no cost, customer satisfaction µi (zi ) decreases to 0 when wi > 0. (ei − Ei ) and (Li − li ) are customer i’s tolerances for being served earlier and later than the appointed time, respectively. Figure 1 shows the

5

ACCEPTED MANUSCRIPT

𝜇

1 complete satisfaction

𝑙𝑖

𝑒𝑖

𝐿𝑖

𝑧

CR IP T

𝐸𝑖

Figure 1: The customer satisfaction function.

customer satisfaction function. The satisfaction of customers serviced within the intervals [Ei , ei ] and [li , Li ] is assumed to vary linearly with the arrival time zi in this paper, and are calculated using zi < Ei , Ei ≤ zi < ei , ei ≤ zi ≤ li , li < zi ≤ Li , zi > Li .

AN US

  0,    z −E   eii −Eii , 1, µi (zi ) =  Li −zi    Li −li ,   0,

(1)

M

For the depot’s time window we assume that E0 and e0 are equal to 0, and l0 is equal to the extended latest time, L0 . This assumption implies that all vehicles start from the depot at time 0 and should return back to it before L0 . For simplicity, we set L0 = rk for any k ∈ M , that is, the maximum route time rk is also the depot’s time window for vehicle k.

ED

Before presenting the mathematical formulation, we define the following notations: hk is the transportation cost per unit distance of vehicle k,

PT

fk is the fixed cost incurred for using vehicle k, cij is the distance between vertex i and vertex j,

CE

si is the service time at vertex i,

wi is the waiting time at vertex i,

AC

tij is the time required for traveling from vertex i to vertex j. Decision variables xijk

( 1, if vehicle k travels from vertex i to vertex j = 0, otherwise

yik

( 1, = 0,

if vertex i is served by vehicle k otherwise

Since the VRP was proposed by Dantzig and Ramser as a generalization of the traveling salesman problem, cost has mostly been associated with the traveled distance, but there are several other types 6

ACCEPTED MANUSCRIPT

of cost involved, such as waiting time, service costs(early and late arrivals)[42], the number of vehicles and delivery time [27]. For simplicity, in this paper we only consider the main routing costs resulted by the traveled distance and the number of vehicles. The traveled distance affects time and fuel costs, and the number of vehicles affects vehicle and labor costs [22]. The objective of the VRPFlexTW is to serve all the customers such that the following objectives are met and the following constraints are satisfied. Objectives:

CR IP T

• minimize the total routing costs; and • maximize the average customer satisfaction. Constraints:

• for every vehicle route, the total demand cannot exceed the capacity of the vehicle;

AN US

• the total travel time of each vehicle cannot exceed its maximum route time; • each customer is served exactly once by exactly one vehicle;

• each vehicle route starts from the depot and ends at the depot; and • the vehicle serves every customer within the required time frame.

M

Given the above defined objectives, parameters and decision variables, the problem can be formulated as follows: n

i=1

min

m X k=1

hk

n X n X

cij xijk +

i=0 j=0

m X

fk

(2) n X

s.t.

CE

i=1 m X

di yik ≤ bk

∀k ∈ {1, 2, · · · , m}

(4)

yik = 1

∀i ∈ {1, 2, · · · , n}

(5)

∀k ∈ {1, 2, · · · , m}

(6)

AC

k=1

n X j=1

n X

i=0 n X

(3)

x0jk

j=1

k=1

PT

n X

ED

max

1X µi (zi ) n

x0jk −

n X

xi0k = 0

i=1

xijk = yjk

∀k ∈ {1, 2, · · · , m}, ∀j ∈ {1, 2, · · · , n}

(7)

xijk = yik

∀k ∈ {1, 2, · · · , m}, ∀i ∈ {1, 2, · · · , n}

(8)

j=0

n X n X i=0 j=0

xijk (tij + si + wi ) ≤ rk

∀k ∈ {1, 2, · · · , m}

(9) (10)

w0 = s 0 = 0

7

ACCEPTED MANUSCRIPT

m X n X

xijk (zi + wi + si + tij ) = zj

k=1 i=0

Ei ≤ zi + wi ≤ Li

wi = max{0, Ei − zi }

xijk ∈ {0, 1}

yik ∈ {0, 1}

∀j ∈ {1, 2, · · · , n}

(11)

∀i ∈ {1, 2, · · · , n}

(12)

∀i ∈ {1, 2, · · · , n}

∀i, j ∈ {1, 2, · · · , n}, ∀k ∈ {1, 2, · · · , m} ∀i ∈ {1, 2, · · · , n}, ∀k ∈ {1, 2, · · · , m}

zi ≥ 0

∀i ∈ {1, 2, · · · , n}

(13) (14) (15) (16)

4

AN US

CR IP T

In this model, Objective (2) is to maximize the customer satisfaction. Objective (3) is to minimize the total routing costs, which consist of travel costs and fixed vehicle costs. Constraint (4) guarantees that the vehicle capacity is not exceeded; Constraint (5) ensures that each customer is served by exactly one vehicle; and Constraint (6) ensures that each route starts and ends at the depot. Constraints (7)-(8) guarantee that each customer is served exactly once. Constraint (9) ensures that the maximum route time is not exceeded; Constraint (10) defines the waiting and service time at the depot; Constraint (11) represents the relationship between the arrival time at a vertex and the departure time from its predecessor; Constraint (12) ensures that customers are served within the required time; and Constraint (13) defines the waiting time.

Multi-objective hybrid ant colony optimization algorithm

PT

ED

M

Hybridization enhances strengths and compensates weaknesses of two or more methods, with the aim of generating better solutions by combining the key elements of competing methodologies [6]. Most hybridized methods in literatures were designed by embedding one solution method into another one as a local search technique(e.g., hill-climbing method is incorporated into genetic algorithm to search for better routing solutions [22], the saving algorithm and λ-interchange mechanism are embedded into ACO to obtain improved solution method to VRPTW [15], simulated annealing is combined with evolutionary algorithm as an acceptance criterion to generate high quality non-dominated solutions for multi-objective VRPTW [6]). Some previous studies have shown the advantages of embedding local search techniques into evolutionary approaches for solving VRP [1].

AC

CE

ACO algorithms imitate the behavior of a real ant colony when searching for food. During the search, each ant marks the trail it is moving along by laying a substance called pheromone. Ants on a short path will be quicker to return to the nest, so more pheromone will be deposited on the shorter paths. The amount of pheromone in a path lets other ants know if it is promising. An important benefit of this algorithm is parallelism: several solutions can be built simultaneously, they exchange information during the procedure, and use information from previous iterations. With the aim of obtaining high quality, non-dominated solutions for the MOVRPFlexTW presented in Section 3, we propose a new approach to improve the efficiency of a Pareto-based multi-objective ACO algorithm using mutation operators. We can take advantage of the ACO algorithm and mutation operators to obtain a search procedure with a well-balanced exploration-exploitation behavior using hybridization. This proposed HACO starts with P ants at the depot. Each ant constructs a sloution to the problem according to the mechanism of solution construction. Mutation operators will then be used to the randomly selected solutions in order to exploit their neighborhood search space, and K new solutions will be obtained. An acceptance criterion will then be used to select P prospective solutions from the P + K solutions based on their fitness computed by evaluation function. The next step will be to use a Pareto-optimization method to select non-dominated solutions from the P accepted solutions. Whereafter, the pheromone trails will be updated to explore the search space of P 8

ACCEPTED MANUSCRIPT

ants and produce better subsequent solutions. The HACO will then reiterate through this process until a predefined number of iterations has been reached. The specific features and general process of the proposed HACO are described below.

4.1

Solution construction

CR IP T

When using HACO to solve the MOVRPFlexTW, each ant starts at the depot and constructs a route by incrementally selecting customers under some constraints. When the next selection in the current route is impossible, the ant returns to the depot and constructs another route by assigning the unrouted customers. This process is repeated until all customers are routed, and a feasible solution can be attained. At each construction step, each ant located at customer i uses a probabilistic formula to select the next customer j. That is, ( arg max{(τiu )α (ηiu )β (ψiu )γ } for u ∈ / tabul , if q ≤ q0 , j= (17) S, otherwise,

AN US

where S is obtained by roulette wheel rule with probability distribution   P (τij )α (ηij )β (ψij )γ , if j ∈ / tabul , (τiu )α (ηiu )β (ψiu )γ u∈tabu / l pij = 0, otherwise.

(18)

In (17) and (18), τiu is equal to the amount of pheromone on the path between the current customer i and possible customer u; ηiu denotes the visibility on edge (i, u), which is defined as the inverse of the distance between customer i and customer u, ηiu = c1iu ; ψiu = max(1,l1 −z l ) denotes the urgency u

iu

ED

M

l is the time when ant l arrives at customer location u from customer i; and of edge (i, u), where ziu α, β, and γ are parameters that determine the relative influences of the pheromone trail, the visibility value, and the urgency value, respectively. Customers already visited by ant l are stored in the set tabul and are not considered for selection. q is a random variable uniformly distributed in [0, 1] and q0 , 0 ≤ q0 ≤ 1, is a parameter that controls the exploitation against the exploration of ants during the searching process.

PT

The solutions can be constructed by using the following procedure. Algorithm 1 ( solution construction)

CE

1. Information:

• Input: 0 ≤ q0 ≤ 1, the number of ants P , the demand of each customer di for all i ∈ {1, 2, · · · , n}, the limited capacity bk and maximum route time rk for all k ∈ {1, 2, · · · , m}.

AC

• Output: P initial solutions.

2. Initialization: • For l = 1, 2, · · · , P , start ant l at the depot, and initialize the set of visited customers tabul = ∅ and the set of candidate customers Candl = {1, 2, · · · , n}. • Set k = 1 and l = 1.

3. For the given vehicle k (k ∈ {1, 2, · · · , m}), set its available capacity bbk = bk and route time rrk = rk .

4. Update the candidate list: According to tabul , bbk , rrk , the vehicle capacity constraints (4) and route time constraints (9), update Candl for ant l. 5. Criterion of returning to the depot: If Candl = ∅, then set k = k + 1 and go to Step 3(the ant l returns to the depot and constructs another route); otherwise, go to Step 6.

9

ACCEPTED MANUSCRIPT 0 1

2

3

0

4

5

6

7

0

8

9

10

0

9

10

11

12

13

14

0

05 11

12

13

14

3 1

0 1

314 0

2

2

4

5

6

7

0

8

4

0

3 0

1

4

14

7

11 10

8

0 12

13 1

9

2

4Figure 5 2: 6 A7typical 8 9vehicle 10 routing 11 problem. 12 13

2

3

0

4

5

6

7

14

8

AN US

10

0 1

6

7

11

3

5

CR IP T

12

13

6

2

0

8

9

10

0

11

12

13

14

0

3 from figure 2. (a) Sequence codes of the3solution 9

1

2

3

1

1

4

6 27

5

2

8

9

10

11

12

13

14

4 4 from figure 3(a). 5 14(b) Sequence codes of the customers

5

M

14

0

6

Figure 3: Sequence codes.3

8. 9.

4.2

ED

PT

7.

CE

6.

0 6 12 1 13 2 Select the13 next customer: If12Candl 6= ∅,11then select the next customer to be visited j(j ∈ / tabul and 7 4 j ∈ Candl ) according to Equations (17) and (18). 7 5 11 Set tabul = tabul ∪ {j}, rrk = rrk − tij 10 − sj − wj (i is the vertex visited by vehicle k before j) and 8 bbk = bbk − dj , and go to Step 8. 1410 08 6 If tabul = {1, 2, · · · , n}, then set l = l + 1 and go to Step 9; otherwise, go to Step 4. 12otherwise, set k9 = 1 and go to Step 3. Stopping criterion: If13 l > P , then stop; 9

11

1 2Operation 3 4 5 6 7 8 9 10 11 Mutation 10 1

2

3

4

5

6

7

8

9

10

7

12 11

13

14 138 14

12

AC

In this paper, we introduce the mutation operation into the ACO algorithm to diversify the population and prevent convergence to a local optimum. The aim of the mutation operation is to alter some 9 customers and produce a new solution. There are 3three mutation operators in this algorithm: the 0 1 2 3 0 9 8 7 0 6 5 4 0 10 12 operators 0 13 14randomly 0 interchange operator, shifting operator, and inverse operator. These 11 mutation 1 2 conduct customer exchanges without considering the constraints on vehicle capacity and route time. Let q1 , q2 , and q3 be such that 0 ≤ q1 , q2 , q3 ≤ 1 and q2 < q3 . Then, two random numbers qmu1 and 4 qmu2 are uniformly generated in [0, 1]. The solution constructed by ant l is not selected for mutation 1 2 3 4 5 6 7 8 9 10 11 125 13 14 if qmu1 > q1 , otherwise the interchange operator is applied if qmu2 ≤ q2 , the shifting operator is 14inverse operator is applied if qmu2 > q3 . applied if q2 < qmu2 ≤ q3 , or 0 6 To describe the mutation operators, we present an example solution in figure 3(a), which is a sequence 0 1 2 312 0 in 9 figure 8 7 2.0 The 6 mutation 5 4 operators 0 10 were 11 implemented 12 0 13 on 14 the0 codes transformed 13 from the solution 11

7

10 10 8

9

7

112

4 ACCEPTED 10 MANUSCRIPT 0 1

2

3

0 4 14

5

6

7

0

8

9

8

10

5 11

0

12

0 93 113 2

3

4

12 1 6 7

5

92 10

8

12

13

4

7

9 12

5

10

6

8

0

14

7

4

5

8

0 14 2 3 13

14

6

11

11 Exchanging 10

14

13

6 11

12

13

1

0

1

2

14 2 3 4 0 7 10 9 5 0 3 4 5 6 7 8 9 10

CR IP T

9 Inserting 0 7 Figure 4: Sequence acquired 11by using interchange operator. 10 6 8 0 11 11 12 8 13 14

Shifting 9 1

2 3 4 5 6 2 3 9 4 5

7 6

13

1

0

8 9 10 11 12 13 14 7 8 10 11 12 13 14

AN US

1

12

Figure 5: Sequence acquired by using shift operator.

3

sequence codes shown 14 5were 0 1 in 2figure 3 3(b), 0 9 which 0 2obtained 6 7 by8 eliminating 10 0 0s 11 (the12depot) 13 from 14 the0 solution shown in figure 3(a).

M

4 e.g., 1 and 14, 5 and 7, 6 and 9, Interchange operator. Randomly select several pairs of customers, 5 8 and 10. By exchanging these pairs of customers we obtain a new sequence. The result is shown in 14 figure 4. 0

6

ED

Shift operator. Randomly select two points, e.g., 4 and 9. The last customer between these two points (including these two points) is moved12 to the first position, and the other customers are separately 13 moved to the position next to them. The new sequence obtained using the shifting operator is shown 7 11 in figure 5.

PT

Inverse operator. Randomly select two points, e.g., 10 4 and 9. A new sequence is obtained by inversing 8 6. the subsequence between these two points. The result is shown in figure

AC

CE

Three new solutions can be obtained by inserting 0s into the new sequence acquired using the muta9 route satisfies the constraints on vehicle tion operation. In the new solutions shown in figure 7, each capacity and route time.

1

2

3

4

5

6

7

8

9

10

11

12

13

14

5

4

10

11

12

13

14

Inversing

1

2

3

9

8

7

6

Figure 6: Sequence acquired by using inverse operator.

0 1

2

3

0

9

8

7

0

11

6

5

4

0

10

11

12

0

13

14

0

9

1

2

14

3

4

2 3

5

6

7

8

9

10

11

12

13

14

ACCEPTED MANUSCRIPT Shifting

4

7

9

5

10

6

8

11

12

1 2 3 4 5 6 7 8 9 10 11 12 13 1 2 3 9 Inversing 4 5 6 7 8 10 11 12 0 14 2 3 4 0 7 9 5 0 10 6 8

13

1

14 13 14 0 11 12

13

1

0

13

14

0

(a) New solution obtained by using interchange operator.

1 2 0 1

3 2

9 3

8 0

7 6 5 4 10 9 4 5 0 6

11 7

12 8

13 10

14 0 11

12

(b) New solution obtained by using shift operator.

Inserting 0

0 1

2

3

0

9

8

7

0

6

5

4

0

10

11

12

0

13

14

0

Figure 7: New solutions.

4.3

Acceptance criterion

CR IP T

(c) New solution obtained by using inverse operator.

Ratiol =

AN US

Suppose that K offspring solutions are obtained by applying the mutation operators, then the number of solutions is P + K (where P is the number of ants and K ≤ P ). The P + K solutions are ranked according to their fitness Ratiol (1 ≤ l ≤ P + K) which is evaluated using Satisl , Costl

for 1 ≤ l ≤ P + K

(19)

M

where Satisl and Costl are the objective values related to solution l, which are calculated using objectives (2) and (3), respectively. Then, the first P solutions are used to update the pheromone trails described in Subsection 4.5.

Pareto-optimization method

ED

4.4

CE

PT

For a problem with more than one objective function, given any two solutions (S1 and S2 ) there are two possibilities: one dominates the other or they are indifferent. We say that solution S1 dominates (or is preferred to) S2 (S1  S2 , where ≺ denotes worse and  denotes better) if and only if S1 is better than S2 in at least one objective, and not worse in the others. Two solutions are called indifferent if neither dominates the other.

AC

We consider the two objectives of the MOVRPFlexTW presented in Section 3 to be equally important, so we can extend the above concepts to find the set of non-dominated solutions from the P accepted solutions.

4.5

Pheromone Updating

Updating the pheromone trails is a key element to adaptive learning technique of ACO and to improve the subsequent solutions. In this paper, after each iteration the pheromone trails of traveled edge (i, j) are updated using P X new old τij = (1 − ρ) × τij + ∆τijl (20) l=1

12

ACCEPTED MANUSCRIPT

where τijnew is the pheromone on edge (i, j) after updating; τijold is the pheromone on edge (i.j) before updating; ρ is a parameter that controls the speed of pheromone evaporation; P is the number of ants; and ∆τijl is the amount of pheromone that ant l deposits on edge (i, j). ∆τijl is defined as ∆τijl =

(

Q×Satisl Costl ,

if edge (i, j) belongs to T l

0,

otherwise

(21)

CR IP T

where Q is a constant related to the amount of pheromone deposited by the ants; and T l is the tour built by ant l. Equation (21) implies that edges belonging to better tours receive more pheromone, so they are more likely to be chosen by ants in subsequent iterations. Too many or too few pheromone trails may lead to premature convergence. Therefore, we set τmin and τmax (the minimum and maximum pheromone trails) for all pheromone trails τij to satisfy τmin ≤ τ ≤ τmax , following [15].

4.6

Hybrid ant colony optimization algorithm for MOVRPFlexTW

AN US

Based on the procedures presented in Subsection 4.1-4.5, the steps of the hybrid ant colony optimization (HACO) algorithm for solving MOVRPFlexTW are shown as follows. Algorithm 2 (HACO) 1. Information:

• Input: the parameters in the formulation (2)-(16),such as m, n, hk and so on.

M

• Output: a set of non-dominated solutions for MOVRPFlexTW (2)-(16). 2. Initialization:

ED

• Initialize the parameters, such as the number of ants P , the maximum number of iterations maxiter, the pheromone trails τij (0 ≤ i, j ≤ n), the visibility ηij (0 ≤ i, j ≤ n), 0 ≤ ρ, q0 , q1 , q2 , q3 ≤ 1(q2 < q3 ) and so on.

PT

• Set the counter to nc = 1, the set of global non-dominated solutions Gnon−dom = ∅ and the set of local non-dominated solutions Lnon−dom = ∅, respectively. 3. Generate solution: If nc ≤ maxiter, produce P solutions by using the solution construction method shown as Algorithm 1; otherwise, the algorithm terminates and outputs Gnon−dom .

CE

4. Mutation:

(a) Set l = 1.

(b) Uniformly generate two random numbers qmu1 and qmu2 in [0, 1].

AC

(c) If qmu1 ≤ q1 , the mutation operation is performed on the solution generated by ant l   interchange operator if qmu2 ≤ q2 ; shifting operator if q2 < qmu2 ≤ q3 ;   inverse operator if qmu2 > q3 ; otherwise, go to Step 4.(d).

(d) Set l = l + 1 and repeat Step 4.(b) and Step 4.(c) until l > P . 5. Acceptance criterion: Arrange the P + K solutions (P solutions obtained by P ants and K new solutions obtained using the mutation operation) in the order of descending fitness computed using equation (19), and accept the first P solutions. 6. Compute non-dominated solutions:

13

ACCEPTED MANUSCRIPT

(a) Compute the routing costs and average customer satisfaction related to each solution. (b) Find a set of non-dominated solutions from the P accepted solutions, and use it to update Lnon−dom . (c) Based on the set Gnon−dom ∪ Lnon−dom , generate a new set of non-dominated solutions and use it to update Gnon−dom . 7. Update pheromone trails: According to equations (20)-(21), update the pheromone trails τij (0 ≤ i, j ≤ n), and τij is replaced by τmax if τij > τmax , or by τmin if τij < τmin . 8. Set nc = nc + 1 and go to Step 3.

Computational analysis

CR IP T

5

5.1

AN US

To assess the advantages of MOVRPFlexTW and the performance of the proposed HACO, we applied them to a set of Solomon’s VRPTW benchmark examples. The time windows of these instances are revised to adapt them to the MOVRPFlexTW formulation presented in Section 3. The algorithm was implemented in Matlab R2010a and executed on a computer with an Intel Core(TM) i7-4790 CPU 3.60GHz, with 16.0GB of RAM.

Description of experimental problems

ED

M

Solomon’s problems include 56 data sets of size n = 100 that are available from Solomon’s web site 1 . These instances are categorized into C1, C2, R1, R2, RC1 and RC2. Customers of the instances in sets C1 and C2 are clustered either geographically or according to time windows. Customers of the instances in sets R1 and R2 are randomly located, whereas those in sets RC1 and RC2 have mixed characteristics of random locations and clusters. Moreover, the instances vary in vehicle capacity, fleet size, and maximum route times. Specially, the instances in C1, R1 and RC1 have narrow depot time windows so only a few customers can be served by the same vehicle. In contrast, instances in C2, R2 and RC2 have wider time windows so many customers can be serviced by the same vehicle. Like reference [43], we set the cost coefficients (hk , fk ) to (2.0, 400).

AC

CE

PT

The customers’ details in Solomon’s instances are given in sequence of customer index, X-coordinates, Y-coordinates, demand for load di ( i ∈ N ), ready time ei ( i ∈ N ), due time li ( i ∈ N ), and service time si ( i ∈ N ). All the customers in Solomon’s instances do not have any tolerance to unpunctual service, that is, all customers must be serviced between the ready time and the due time. To obtain instances that are relevant to the MOVRPFlexTW model in (2)-(15), we generated a flexible time window [Ei , Li ] for each customer i ∈ N with respect to the length of the original time window, where Ei = ei − wbli (li − ei ) and Li = li + wbui (li − ei ) (wbli and wbui are the fractions used to set the maximum allowed violations).

5.2 Sensitivity analysis and parameter settings A crucial feature of ACO is that the quality of the solution can be controlled by the parameters. In the field of evolutionary computing, two traditional approaches are usually employed to choose parameter values [21], following the scheme offered in [17]: parameter tuning, where (good) parameter values are established before the run of a given evolutionary algorithm. In this case, parameter values are fixed in the initialization stage and do not change while the evolutionary algorithm is running; parameter control, where (good) parameter values are established during the run of a given evolutionary 1

http://w.cba.neu.edu/ msolomon/problems.htm

14

ACCEPTED MANUSCRIPT

algorithm. In this case, parameter values are given an initial value when starting the evolutionary algorithm and they undergo changes while the evolutionary algorithm is running. In this paper the first approach is used. In order to illustrate how those parameters affect the overall performance of the HACO, the analyses between the approximate optimal solution and the crucial parameters, including α, β, γ, etc., are carried out on three randomly selected problems C101, R101, and RC101. When investigating the relationship between one of the parameters and the approximate optimal solution, the other parameters are kept unchanged. This investigation is executed by using the proposed HACO to solve the MOVRPFlexTW where wbli = wbui = 0.

CR IP T

(1) Parameter α Table 1. The approximate optimal objective values obtained by using different α. α

Approximate optimal objective values

1 2 3 4 5 6

C101 5657.88 5712.94 5711.35 5712.94 5729.30 5778.75

RC101 9273.84 9276.26 9352.64 9386.85 9400.46 9397.22

AN US

R101 10901.60 10924.61 11092.94 11220.32 11154.76 11228.54

M

If α = 0, the most urgent and closest customer is likely to be selected. In this case, HACO is same as the classical greedy random search algorithm. Table 1 presents the approximate optimal objective values obtained by using different parameters α. According to the results shown in Table 1, we can know that the larger α is, the more easily HACO gets a local optimal solution. During the experiments with HACO, it has been detected that the relatively good values of the parameter α are within the interval [1,3]. As shown in Table 1, for the problems C101, R101, and RC101, the best α obtained in our experiments was found to be equal to 1. (2) Parameter β β

Approximate optimal objective values

1 2 3 4 5 6

C101 5826.77 5705.90 5657.88 5657.88 5705.90 5775.90

R101 11854.14 11271.08 10901.60 11275.20 11093.09 11477.52

RC101 9863.11 9626.78 9359.63 9273.84 9274.47 9634.22

CE

PT

ED

Table 2. The approximate optimal objective values obtained by using different β.

AC

If β = 0, the next serviced customer is selected without using the heuristic factors η. In this case, HACO tends to converge to local optimums and does not perform well. Table 2 presents the approximate optimal objective values obtained by using different parameters β. For problem C101, the optimal objective value 5657.88 is obtained when β equals to 3 and 4. For problems R101 and RC101, their optimal objective values are obtained when β equals to 3 and 4, respectively. According to the numbers shown in Table 2, we can know that the relatively good values of the parameter β are within the interval [3,5]. (3) Parameter γ

If γ = 0, the next serviced customer is selected without using the heuristic factors ψ. In this case, HACO also tends to converge to local optimums. Table 3 presents the approximate optimal objective values obtained by using different parameters γ. For problem C101, the optimal objective value 15

ACCEPTED MANUSCRIPT Table 3. The approximate optimal objective values obtained by using different γ. γ

Approximate optimal objective values

1 2 3 4 5 6

C101 5657.88 5657.88 5657.88 5657.88 5657.88 5657.88

R101 10901.60 11324.38 11668.15 11708.59 11834.21 11873.86

RC101 9299.68 9273.84 9384.50 9587.96 9600.13 9646.13

CR IP T

5657.88 is always obtained whenever the value of parameter γ changes in the prespecified interval [1,6]. For problem R101, the approximate optimal objective value gets worse with the increase of γ. In other words, the larger γ is, the more easily HACO gets a local optimal solution for problem R101. For problem RC101, the approximate optimal objective values are relatively good when parameter γ is within the interval [1,3], especially the optimal objective value 9273.84 is obtained when γ equals to 2. According to the numbers shown in Table 3, we can conclude that the relatively good values of the parameter γ are within the interval [1,3]. (4) Parameter ρ

C101 5657.88 5657.88 5657.88 5657.88 5657.88 5657.88 5657.88 5657.88 5657.88

R101 11160.75 11091.53 11110.01 11187.18 11009.76 11004.93 10901.60 10993.05 11203.36

RC101 9640.95 9540.96 9568.33 9488.56 9273.84 9366.63 9273.84 9336.70 9702.68

ED

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9

Approximate optimal objective values

M

ρ

AN US

Table 4. The approximate optimal objective values obtained by using different ρ.

AC

CE

PT

A reasonable amount of pheromone on the ants’ trails is dependent on the suitable value of parameter ρ. A smaller ρ increases the difficulty of obtaining the optimal solution, because the pheromone laid on better trails cannot distinguish the better routes. However, a larger ρ means that HACO is more likely to converge to a local optimum, because the amount of pheromone on the currently worst trails is dramatically less. Table 4 presents the approximate optimal objective values obtained by using different ρ. For problem C101, the optimal objective value 5657.88 is always obtained whenever the value of parameter ρ changes in the prespecified interval [0.1,0.9]. For problem R101, the approximate optimal objective values are relatively good when parameter ρ is within the interval [0.7,0.8]. For problem RC101, the optimal objective value 9273.84 is obtained when ρ equals to 0.5 and 0.7. From the numbers presented in Table 4, we can conclude that the HACO performs relatively well when ρ is in the interval [0.5, 0.8]. (5) Parameter q0 The edges can be classified into three families: i) edges belonging to the last best tour (BE, best edges), ii) edges that do not belong to the last best tour, but which did in one of the two preceding iterations (TE, testable edges), and iii) the remaining edges, that is, the ones that have never belonged to a best tour or have not in the last two iterations (UE, uninteresting edges) [16]. If q0 is close to 1, the HACO favors the BE and TE edges. Consequently, ants easily converge to a local optimum. If q0 is close to 0, HACO cannot effectively use the information from previous iterations. Consequently, the ants tend to routes with worse objective values. Table 5 presents the approximate optimal objective values 16

ACCEPTED MANUSCRIPT Table 5. The approximate optimal objective values obtained by using different q0 . q0

Approximate optimal objective values

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9

C101 5657.88 5657.88 5657.88 5657.88 5657.88 5657.88 5657.88 5705.90 5705.90

RC101 9669.02 9661.53 9472.84 9450.39 9430.35 9273.84 9298.96 9281.10 9537.71

CR IP T

R101 11348.45 11405.97 11410.33 11043.63 11040.30 10901.60 10941.43 10975.51 11153.58

(6) Mutation rate q1

AN US

obtained by using different q0 . For problem C101, the optimal objective value 5657.88 is always obtained whenever the value of parameter q0 changes in interval [0.1,0.7]. For problems R101 and RC101, the approximate optimal objective values are relatively good when parameter q0 is within the interval [0.6,0.8], especially their optimal objective values are obtained when q0 equals to 0.6. From the numbers presented in Table 5, we can conclude that the HACO performs relatively well when q0 is in the interval [0.6, 0.8].

Table 6. The approximate optimal objective values obtained by using different q1 . q1

Approximate optimal objective values

0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9

C101 5657.88 5657.88 5657.88 5657.88 5657.88 5657.88 5657.88 5657.88 5657.88

ED

M

R101 11405.97 11405.97 11410.33 11043.63 10975.51 10901.60 11040.30 11410.33 11405.97

RC101 9472.84 9661.53 9472.84 9450.39 9281.10 9273.84 9298.96 9430.35 9537.71

AC

CE

PT

One of the crucial things in the mutation operation is determining the proper mutation rate. Very small mutation rate might not prevent falling back into the visited local optima; however, large mutation rate might be very similar to a crude random multistart. Table 6 presents the approximate optimal objective values obtained by using different mutation rate q1 . For problem C101, the optimal objective value 5657.88 is always obtained whenever the value of parameter ρ changes in the prespecified interval [0.1,0.9]. For problems R101 and RC101, the approximate optimal objective values are relatively good when parameter q1 is within the interval [0.5,0.6], especially their optimal objective values are obtained when q1 equals to 0.6. From the numbers presented in Table 6, we can conclude that the HACO performs better when q1 is in the interval [0.5, 0.6].

Based on the above discussion and previous research, we implemented HACO with population size = 35, number of generation = 100, repetition = 10, mutation rate q1 = 0.6, α = 1, β = 4, γ = 2, q0 = 0.6, ρ = 0.7 and repetitions of interchanges = substring length for inversion = substring length for shifting = n/10, where n is the number of customers.

5.3

Analysis of results

We analyzed the experimental results from three different perspectives. Firstly, we compared the results obtained by HACO with previous results. Secondly, we investigated whether similar or better 17

ACCEPTED MANUSCRIPT

Table 7. Comparison of the original VRPTW solutions with the solutions obtained by implementing MOVRPFlexTW with the suggestive HACO for the category of type 1 (C1, R1 and RC1), where wbli = wbui = 0 (i ∈ {1, 2, · · · , n}). MOVRPFlexTW+HACO Best known

Best results #Veh.

Dis.

C101 C102 C103 C104 C105 C106 C107 C108 C109

10 10 10 10 10 10 10 10 10

828.94 828.94 828.06 824.78 828.94 828.94 828.94 828.94 828.94

10 10 10 10 10 10 10 10 10

R101 R102 R103 R104 R105 R106 R107 R108 R109 R110 R111 R112

19 17 14 10 14 12 11 10 12 11 10 10

1650.80 1434.00 1237.05 974.24 1377.11 1252.03 1100.52 960.26 1169.85 1112.21 1096.72 976.99

RC101 RC102 RC103 RC104 RC105 RC106 RC107 RC108

15 13 11 10 16 11 11 10

Avg.

11.62

Average in 10 runs Cost

#Veh.

Dis.

828.94 828.94 828.06 824.78 828.94 828.94 828.94 828.94 828.94

Gapdis (%) 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00

5657.88 5657.88 5656.12 5649.56 5657.88 5657.88 5657.88 5657.88 5657.88

10 10 10 10 10 10 10 10 10

828.94 831.42 835.46 838.53 828.94 828.94 833.78 838.15 828.94

Gapdis (%) 0.00 0.30 0.89 1.67 0.00 0.00 0.58 1.11 0.00

Time (Sec.) 351 368 385 360 954 916 1267 1223 1159

19 17 14 10 14 12 11 10 12 11 10 10

1669.34 1443.18 1238.56 974.24 1382.50 1252.03 1108.80 960.88 1222.70 1114.56 1096.73 976.99

1.12 0.64 0.12 0.00 0.39 0.00 0.75 0.06 4.52 0.21 0.00 0.00

10938.68 9686.36 8077.12 5948.48 8365.00 7304.06 6617.60 5921.76 7245.40 6629.12 6193.46 5953.98

19 17 14 10 14 12 11 10 12 11 10 10

1675.45 1447.20 1239.40 974.24 1385.67 1258.61 1110.54 961.23 1225.60 1116.79 1096.73 987.17

1.49 0.92 0.19 0.00 0.62 0.53 0.91 0.10 4.77 0.41 0.00 1.04

1277 1265 1222 1225 1205 1187 1179 1187 1164 1135 1119 1073

1636.92 1470.26 1261.67 1135.48 1590.25 1427.13 1230.48 1142.66

15 13 11 10 16 11 11 10

1645.67 1476.50 1262.01 1135.48 1589.40 1424.73 1230.48 1142.66

1127.31

11.62

1131.13

CR IP T

Dis.

0.53 0.42 0.03 0.00 -0.05 -0.17 0.00 0.00

9291.34 8153.00 6924.02 6270.96 9578.80 7249.46 6860.96 6285.32

15 13 11 10 16 11 11 10

1648.76 1479.34 1263.25 1135.50 1592.45 1427.82 1234.73 1148.96

0.72 0.62 0.13 0.00 0.14 0.05 0.35 0.55

1235 1182 1185 1120 1114 1086 1138 1127

0.30

6910.54

11.62

1134.57

0.62

1048.36

M

#Veh.

AN US

Instance

ED

results were obtained when multiple objectives were optimized simultaneously, rather than when only one objective was considered. Finally, we investigated the effects of implementing different time windows.

PT

5.3.1 Comparison with previous results

AC

CE

To assess the performance of HACO in terms of applicability and solution quality, we ran some experiments using Solomon’s original VRPTW instances. Tables 7 and 8 present the best known solutions reported in the literatures and the solutions obtained using the proposed method for MOVRPFlexTW with wbli = wbui = 0 (i ∈ {1, 2, · · · , n}). According to customer satisfaction definition presented in Section 3, every customer’s satisfaction is equal to 1 if wbli = wbui = 0 (i ∈ {1, 2, · · · , n}). In this case, MOVRPFlexTW becomes a single objective problem. In Table 7 and Table 8, the column labeled Best known gives the best known published solutions from [21, 22]; MOVRPFlexTW+HACO gives the best and average results obtained using MOVRPFlexTW with HACO over 10 runs; #Veh. and Dis. give the number of vehicles and total distance, respectively; Cost gives the routing costs; Gapdis (%) gives the difference between the distance obtained by HACO and the best known distance, by HACO−best known distance that is Gapdis = distance obtained × 100%, which can relatively measure best known distance the efficiency of the proposed HACO; Time(Sec.) gives the average computational time (in seconds) of the HACO. Table 7 and Table 8 also give the average values of every column over all type 1 instances and type 2 instances, respectively.

Bold numbers indicate that the solutions obtained by HACO are very similar to or better than the best known solutions from the literatures, considering either the number of vehicles or traveled distance. The main result is that solutions obtained using the proposed HACO method were very similar to or 18

ACCEPTED MANUSCRIPT

Table 8. Comparison of the original VRPTW solutions with the solutions obtained by implementing MOVRPFlexTW with the suggestive HACO for the category of type 2 (C2, R2 and RC2), where wbli = wbui = 0 (i ∈ {1, 2, · · · , n}). MOVRPFlexTW+HACO Best known

Best results Dis.

#Veh.

Dis.

C201 C202 C203 C204 C205 C206 C207 C208

3 3 3 3 3 3 3 3

591.56 591.56 591.17 590.60 588.16 588.49 588.29 588.32

3 3 3 3 3 3 3 3

R201 R202 R203 R204 R205 R206 R207 R208 R209 R210 R211

5 4 4 3 3 3 3 2 3 3 4

1206.42 1091.21 935.04 789.72 994.42 833.00 814.78 726.82 855.00 954.12 761.10

RC201 RC202 RC203 RC204 RC205 RC206 RC207 RC208

6 4 4 3 4 3 4 4

Avg.

3.44

Average in 10 runs Cost

#Veh.

Dis.

591.56 591.56 591.17 598.72 588.16 588.90 592.46 588.32

Gapdis (%) 0.00 0.00 0.00 1.37 0.00 0.07 0.71 0.00

2383.12 2383.12 2382.34 2397.44 2376.32 2377.80 2384.92 2376.64

3 3 3 3 3 3 3 3

591.56 594.28 602.13 615.68 596.77 593.92 595.66 590.12

4 4 4 3 3 3 3 2 3 3 4

1252.37 1091.21 937.56 794.56 994.43 906.14 825.48 726.50 855.00 939.37 761.50

3.81 0.00 0.27 0.61 0.00 8.78 1.31 -0.04 0.00 -1.55 0.05

4104.74 3782.42 3475.12 2789.12 3188.86 3012.28 2850.96 2253.00 2910.00 3078.74 3123.00

1134.91 1181.99 1026.61 798.46 1300.25 1153.93 1040.67 785.93

4 4 4 3 4 3 4 4

1423.70 1263.53 1048.00 799.52 1410.30 1146.32 1140.67 828.71

25.45 6.90 2.08 0.13 8.46 -0.66 9.61 5.44

4447.40 4127.06 3696.00 2799.04 4420.60 3492.64 3881.34 3257.42

855.65

3.33

884.29

2.70

3101.91

Gapdis (%) 0.00 0.46 1.85 4.25 1.46 0.92 1.25 0.31

Time (Sec.) 1260 1356 1401 1283 1359 1342 1377 1484

CR IP T

#Veh.

4 4 4 3 3 3 3 2 3 3 4

1257.98 1091.21 938.43 796.67 994.43 906.64 827.63 727.85 855.00 943.18 762.72

4.27 0.00 0.36 0.88 0.00 8.84 1.58 0.14 0.00 1.15 0.21

1791 1886 1818 1917 1850 1905 1925 1918 1897 1935 1964

4 4 4 3 4 3 4 4

1428.53 1267.95 1053.71 803.76 1413.49 1153.68 1143.56 835.03

25.87 7.27 2.64 0.66 8.71 0.02 9.89 6.25

1837 1813 1823 1826 1837 1810 1810 1809

3.33

888.21

3.31

1712.19

AN US

Instance

AC

CE

PT

ED

M

better than the best known solutions in 38 out of the 56 tested instances. When compared with the best known solutions, HACO achieves the minimum number of vehicles and a small average difference with respect to the distance (0.30% for the category of type 1 in Table 7 and 2.70% for the category of type 2 in Table 8). As can be seen, HACO leads to new best results with the smaller routing distance for instances RC105 and RC106 in Table 7, and R208, R210 and RC206 in Table 8. For the instances C101-109, HACO can always find the best known solution. For RC201, although the distance produced by HACO is worse than the best known distance, there are four vehicles, which is better than the best known result. For the other 42 instances, the Gapdis varies between 0.00% and 9.61%. Additionally, the average computational time of HACO for every problem in sets C1, R1, and RC1 is less than 1300 seconds. The problems in sets C2, R2 and RC2 required slightly more computational time because of their wider time windows, which allow a more flexible arrangement in the routing construction process. Besides, a vehicle with longer route also takes up more computational time during the feasibility evaluation processes [45]. In general, the results obtained by the proposed HACO method are good in terms of solution quality and computational time, when compared with the best published results. To study the consistency and reliability of the results obtained by HACO, 10 different but repeated simulations with randomly generated initial populations have been performed for the Solomon’s 56 instances. The average values of the various simulation results are also represented in Table 7 and Table 8. As shown in Table 7 and Table 8, the average number of vehicles are equal to the best number for each instance and except instance RC201, the average values of Gapdis obtained from HACO for the 10 different but repeated simulation runs are found to vary within 0.00-9.89% from the best values. Therefore, we can conclude that the results obtained by HACO are rather consistent. It is also observed that the category of type 1 (C101-C109, R101-R112 and RC101-RC108) gives smaller average values of Gapdis as compared with the category of type 2 (C201-C208, R201-R211 and RC201-RC208), since the number of customers per route is smaller for the category of type 1 [45]. 19

ACCEPTED MANUSCRIPT

Table 9. Comparisons between the HACO and state-of-the-art algorithms from literatures for the category of type 1 (C1, R1 and RC1). Instance

HMOEA

MOGA

MACS-IH

HSFLA

ILNSA

LGA

#Veh. 10 10 10 10 10 10 10 10 10

Dis. 828.94 828.94 828.06 825.65 828.94 828.94 828.94 828.94 828.94

#Veh. 10 10 10 10 10 10 10 10 10

Dis. 828.94 828.94 828.06 824.78 828.94 828.94 828.94 828.94 828.94

#Veh. 10 10 10 10 10 10 10 10 10

Dis. 828.94 828.94 828.06 824.78 828.94 828.94 828.94 828.94 828.94

#Veh. 10 10 10 10 10 10 10 10 10

Dis. 828.94 828.94 839.37 838.98 828.94 842.10 828.94 832.74 828.94

R101 R102 R103 R104 R105 R106 R107 R108 R109 R110 R111 R112

18 18 14 10 15 13 11 10 12 12 12 10

1613.59 1454.68 1235.68 974.24 1375.23 1260.2 1085.75 954.03 1157.74 1104.56 1057.8 974.73

19 17 14 10 14 13 11 10 12 11 11 10

1690.28 1513.74 1237.05 1020.87 1415.13 1254.22 1100.52 975.34 1169.85 1112.21 1084.76 976.99

19 17 13 9 14 12 10 9 11 10 10 9

1645.79 1486.12 1292.68 1007.24 1377.11 1251.98 1104.66 960.88 1194.73 1118.59 1096.72 982.14

19 17 13 9 14 12 10 9 11 10 10 9

1650.80 1486.12 1292.67 1007.31 1377.11 1252.03 1104.66 960.88 1194.73 1118.84 1096.73 982.14

18 16 12 10 14 12 11 10 12 11 11 10

1612.29 1473.41 1279.37 1025.47 1377.95 1276.48 1153.86 990.82 1179.73 1113.10 1155.39 981.46

20 18 15 11 16 13 12 10 13 12 12 10

1646.90 1474.28 1222.68 989.53 1382.78 1250.11 1083.42 952.44 1160.69 1080.69 1057.64 965.00

19 17 14 10 14 12 11 10 12 11 10 10

1669.34 1443.18 1238.56 974.24 1382.50 1252.03 1108.80 960.88 1222.70 1114.56 1096.73 976.99

RC101 RC102 RC103 RC104 RC105 RC106 RC107 RC108

16 13 11 10 14 13 11 11

1641.65 1470.26 1267.86 1145.49 1589.91 1371.69 1222.16 1133.9

15 14 12 10 14 12 12 10

1636.92 1488.36 1306.42 1140.45 1616.56 1454.61 1254.26 1141.34

14 12 11 10 13 11 11 10

1696.94 1554.75 1261.67 1135.48 1629.44 1424.73 1230.48 1139.82

14 12 11 10 13 11 11 10

1696.95 1554.75 1261.67 1135.48 1629.44 1424.73 1230.48 1139.82

15 13 11 11 13 12 11 11

1671.54 1447.14 1313.79 1163.54 1502.48 1406.25 1278.96 1172.83

16 15 12 10 16 14 12 11

1660.55 1494.92 1276.05 1151.63 1556.21 1402.25 1212.83 1133.25

15 13 11 10 16 11 11 10

1645.67 1476.50 1262.01 1135.48 1589.40 1424.73 1230.48 1142.66

AN US

341 33046.17 202492.34

325 33047.37 196094.74

325 33052.76 196105.52

#Veh. 10 10 10 10 10 10 10 10 10

Dis. 828.94 828.94 828.06 824.78 828.94 828.94 828.94 828.94 828.94

334 33073.75 199747.50

358 32609.27 208418.54

337 32802.86 200405.72

M

344 32549.78 202699.56

Dis. 828.94 828.94 828.06 824.78 828.94 828.94 828.94 828.94 828.94

CR IP T

C101 C102 C103 C104 C105 C106 C107 C108 C109

Dis. 828.93 828.19 828.06 825.54 828.9 828.17 829.34 832.28 829.22

Tol. Tol. Cost

#Veh. 10 10 10 10 10 10 10 10 10

HACO

#Veh. 10 10 10 10 10 10 10 10 10

Dis. 591.58 591.56 593.25 595.55 588.16 588.49 588.88 588.03

CE

C201 C202 C203 C204 C205 C206 C207 C208

HMOEA #Veh. 3 3 3 3 3 3 3 3

MOGA

MACS-IH

HSFLA

ILNSA

LGA

HACO

#Veh. 3 3 3 3 3 3 3 3

Dis. 591.56 591.56 596.55 590.6 588.88 588.49 588.29 588.32

#Veh. 3 3 3 3 3 3 3 3

Dis. 591.56 591.56 591.17 590.6 588.88 588.49 588.29 588.32

#Veh. 3 3 3 3 3 3 3 3

Dis. 591.56 591.56 591.17 590.60 588.88 588.49 588.29 588.32

#Veh. 3 3 3 3 3 3 3 3

Dis. 591.56 591.56 591.17 591.17 588.88 588.49 588.29 591.39

#Veh. 3 3 3 3 3 3 3 3

Dis. 591.56 591.56 591.17 590.60 588.88 588.49 588.29 588.32

#Veh. 3 3 3 3 3 3 3 3

Dis. 591.56 591.56 591.17 598.72 588.16 588.90 592.46 588.32

PT

Instance

ED

Table 10. Comparisons between the HACO and state-of-the-art algorithms from literatures for the category of type 2 (C2, R2 and RC2).

5 4 4 3 3 3 3 2 3 5 4

1206.42 1091.21 935.04 789.72 1094.65 940.12 852.62 790.6 974.88 982.31 811.59

4 4 3 3 3 3 3 2 3 3 3

1268.44 1112.59 989.11 760.82 1084.34 919.73 825.07 773.13 971.7 985.38 833.76

4 3 3 2 3 3 2 2 3 3 2

1252.37 1191.7 939.54 825.52 994.42 906.14 890.61 726.75 909.16 939.34 892.71

4 3 3 2 3 3 2 2 3 3 2

1252.37 1191.70 939.50 825.52 994.43 906.14 890.61 726.82 909.16 939.37 885.71

5 4 3 3 3 3 3 3 3 3 3

1285.90 1195.10 980.72 801.68 1051.60 941.17 843.77 744.71 938.29 966.50 855.76

9 8 6 4 6 6 4 3 5 5 5

1156.30 1042.30 877.29 736.52 960.35 894.19 800.79 706.86 860.63 948.82 762.23

4 4 4 3 3 3 3 2 3 3 4

1252.37 1091.21 937.56 794.56 994.43 906.14 825.48 726.50 855.00 939.37 761.50

RC201 RC202 RC203 RC204 RC205 RC206 RC207 RC208

6 5 4 3 5 4 4 3

1134.91 1130.53 1026.61 879.82 1295.46 1139.55 1040.67 898.49

4 4 3 3 4 4 3 3

1423.73 1183.88 1131.78 806.44 1352.39 1269.64 1140.23 881.2

4 3 3 3 4 3 3 3

1406.91 1365.65 1049.62 798.41 1297.19 1146.32 1061.14 828.14

4 3 3 3 4 3 3 3

1406.94 1365.64 1049.62 798.46 1297.65 1146.32 1061.14 828.14

5 4 3 3 5 4 3 3

1399.20 1382.40 1097.60 828.55 1269.40 1123.00 1090.20 859.13

10 8 6 4 8 7 7 5

1281.63 1103.47 942.00 796.12 1168.89 1060.52 970.97 782.70

4 4 4 3 4 3 4 4

1423.70 1263.53 1048.00 799.52 1410.30 1146.32 1140.67 828.71

AC

R201 R202 R203 R204 R205 R206 R207 R208 R209 R210 R211

Tol. Tol. Cost

97 23740.70 86281.40

86 24437.61 83275.22

80 24140.51 80281.02

80 24134.11 80268.22

20

90 24377.19 84754.38

140 22571.45 101142.90

90 23875.72 83751.44

ACCEPTED MANUSCRIPT

CR IP T

We also compare the results obtained by the proposed HACO with the results obtained by 6 stateof-the-art algorithms from the literatures, which are hybrid multi-objective evolutionary algorithm (HMOEA) [45], multi-objective genetic algorithm (MOGA) [34], multiple ant colony system algorithm hybridized with insertion heuristics (MACS-IH) [4], hybrid shuffled frog leaping algorithm (HSFLA) [32], improved large neighborhood search algorithm (ILNSA) [23] and localized genetic algorithm (LGA) [47]. These 6 algorithms that are used in the comparisons are heuristic algorithms with very effective strategies for avoiding local optimum, like insertion heuristics, mutation procedure, neighborhood search. In addition to these 6 algorithms, there is a large number of researchers that solved the vehicle routing problem with time windows with different heuristic algorithms [33]. However, if we present analytically all of the results obtained by the heuristic algorithms from the literatures, a huge table (or tables) will be produced and the most important outcomes will not be sufficiently presented.

AN US

Table 9 and Table 10 give the comparisons between the HACO and the comparing algorithms. #Veh. and Dis. have the same meanings as shown in Tables 7 and 8. Bold numbers indicate that the solutions obtained by HACO are very similar to or better than the results obtained by the comparing algorithms. In Tables 9 and 10, the second row from the bottom gives the total accumulated number of vehicles and the total traveled distance for each one of the algorithms. The total routing costs for each one of the algorithms are also given in the last rows of Tables 9 and 10.

PT

ED

M

As shown in Tables 9 and 10, the proposed HACO produces excellent routing results with 27 instances (out of the Solomon’s 56 instances) achieving a smallest number of vehicles and lowest traveled distance as compared with the best solutions obtained by other 6 algorithms. From Table 9, we can also see that HACO performs as well as MACS-IH, HSFLA and LGA on C1 (C101-C109) instances. These 4 algorithms succeed in finding the best known solutions on C1 problems which have all customers located in geographical location and tight time windows. The total costs obtained by the proposed HACO for the Somolon’s 56 instances are better than that produced by HMOEA, MOGA, ILNSA and LGA. The number of vehicles and the total traveled distance are two objectives of the VRPTW. In [4] and [32], the results have the following hierarchical objectives: (1) minimizing the number of vehicles and (2)minimizing the total distance, which result in lower number of vehicles and smaller routing costs obtained by MACS-IH and HSFLA. As shown before, the proposed HACO does not use hierarchical computational procedure to solve the MOVRPFlexTW with wbli = wbui = 0 (i ∈ {1, 2, · · · , n}). Thus, we can say that the proposed HACO still appears relatively competitive.

AC

CE

To further investigate the performance of the algorithms, statistical tests are also conducted as experimental tests in the current study. All computations related to the statistical tests were performed with the statistical software SPSS. Given that nonparametric tests do not require explicit conditions to be conducted, the nonparametric Friedman test is employed to determine whether the differences observed between the proposed HACO and those comparing algorithms are statistically significant or not. A wider description of nonparametric tests and their usage for analysis of algorithms’ behavior are presented in [20]. The Gapcost (Equation (22)) of each instance is considered for the statistical tests. Gapcost =

cost obtained by the given algorithm − best cost × 100% best known cost

(22)

where the cost means the routing costs. Best cost is calculated on the basis of the best known solution presented in Table 7 and Table 8, and the cost obtained by the given algorithm is calculated on the basis of the solution presented in Table 9 and Table 10. Significance level α = 0.05 has been assumed. Table 11 presents average ranks calculated for each algorithm, the Friedman χ2F statistic, p-value, and decision about acceptance of zero hypothesis for 21

ACCEPTED MANUSCRIPT

Table 11. Results of the Friedman test. Average rank 4.71 4.35 2.37 2.42 4.63 5.62 3.91 131.18 0.00 No

CR IP T

Algorithm HMOEA MOGA MACS-IH HSFLA ILNSA LGA HACO χ2F p-value Accept ?

Table 12. Results of pairwise comparison using the Wilcoxon signed-ranks nonparametric test. HACO vs HMOEA MOGA MACS-IH HSFLA ILNSA LGA

p-value 0.034 0.317 1.000 1.000 0.068 1.000

Accept ? No Yes Yes Yes Yes Yes

Set of instances R2

C2

HMOEA MOGA MACS-IH HSFLA ILNSA LGA

0.225 0.686 0.273 0.273 0.500 0.273

Yes Yes Yes Yes Yes Yes

RC1

R1

HMOEA MOGA MACS-IH HSFLA ILNSA LGA

0.047 0.041 0.013 0.013 0.049 0.005

No No No No No No

RC2

HACO vs HMOEA MOGA MACS-IH HSFLA ILNSA LGA

p-value 0.013 0.041 0.063 0.063 0.016 0.003

Accept ? No No Yes Yes No No

HMOEA MOGA MACS-IH HSFLA ILNSA LGA

0.033 0.040 0.043 0.043 0.026 0.025

No No No No No No

HMOEA MOGA MACS-IH HSFLA ILNSA LGA

0.047 0.031 0.018 0.018 0.067 0.012

No No No No Yes No

M

AN US

Set of instances C1

ED

the whole set of instances. Because of the fact that the value of χ2F is equal to 131.18 (critical value of χ2F with 6 degrees of freedom is equal to 12.69), zero hypothesis has to be rejected at assumed level of significance. Moreover, the p-value (0.00 < 0.05) strongly suggests the existence of significant difference among the considered algorithms.

CE

PT

In order to detect differences between particular algorithms, the Wilcoxon signed-ranks nonparametric test [40] is employed to compare two algorithms for each instance (pairwise comparison). Results of the Wilcoxon tests are presented in Table 12, where the p-values are presented for each set of instances and for each pair of algorithms. Similar as previously, the last column indicates whether zero hypothesis should be accepted or rejected for each pair of compared algorithms within each set of instances.

AC

As results in Table 12 show, pairwise comparison of the HACO with other 6 comparing algorithms does not always confirm significant differences between it and other ones. Whereas significant differences are observed for each compared pair of algorithms for R1 and RC1 set of instances. It means that HACO outperforms other 6 comparing algorithms while solving instances belonging to sets R1 and RC1. For R2 and RC2 set of instances, differences can be also observed for 4 and 5 compared pair of algorithms, respectively. In the case of C1 set of instances, difference are observed between HACO and HMOEA. An interesting observation from the analysis is that for instances belonging to C2 set, zero hypotheses have to be accepted where HACO with other 6 heuristic algorithms are compared. It means that HACO and other 6 comparing algorithms statistically perform similarly while solving instances belonging to this set.

22

ACCEPTED MANUSCRIPT

5.3.2

Solutions obtained by applying HACO to solve MOVRPFlexTW

Table 13. Comparison of the single objective VRPFlexTW solutions with the solutions obtained by implementing MOVRPFlexTW with the suggestive HACO where wbli = wbui = 0.05 (i ∈ {1, 2, · · · , n}). MOVRPFlexTW+HACO Ins.

Single obj. VRPFlexTW

Dis.

Satis.

Cost

#Veh.

Dis.

828.94 838.04 864.64 848.98 828.94 828.94 828.94 828.94 828.94

10 10 10 10 10 10 10 10 10

828.94 828.94 838.56 856.04 828.94 828.94 828.94 828.94 828.94

1.000 1.000 0.944 0.968 1.000 1.000 1.000 1.000 1.000

5657.88 5657.88 5677.12 5712.08 5657.88 5657.88 5657.88 5657.88 5657.88

10.00 10.00 10.30 10.50 10.00 10.00 10.00 10.00 10.00

828.94 828.94 849.17 862.30 828.94 828.94 828.94 835.90 828.94

Time (Sec.) 428 497 508 485 1029 1352 1469 1584 1720

R101

19

1669.34

570 726

17.00

1530.74

1484

R103

14

1232.08

1539

14.30

1244.84

1297

R104 R105

10 15

994.97 1370.79

1644 1395

10.70 15.20

1007.63 1390.75

1384 1492

R106

13

1240.71

1413

12.40

1267.35

1373

R107

11

1091.73

1079

11.30

1135.46

1436

R108

10

952.87

786

10.50

966.34

1275

R109

13

1163.73

1069

R110

11

1073.11

1456

R111

12

1077.80

460

R112

10

967.98

673

10947.16 10973.08 9839.34 9893.52 8077.12 8106.50 5988.48 8365.00 8393.86 7304.06 7373.82 6617.60 6795.00 5921.76 5947.34 7245.40 7337.04 6635.30 6779.38 6608.74 6655.66 5959.98 5989.34

1598

1509.70

0.914 0.935 0.886 0.908 0.973 0.984 0.993 0.971 0.986 0.938 0.957 0.964 0.981 0.967 0.981 0.884 0.902 0.943 0.965 0.918 0.924 0.936 0.948

1684.63

17

1673.58 1686.54 1519.67 1546.76 1238.56 1253.25 994.24 1382.50 1396.93 1252.03 1286.91 1108.80 1197.50 960.88 973.67 1222.70 1268.52 1117.65 1189.69 1104.37 1127.83 979.99 994.67

19.00

R102

19 19 17 17 14 14 10 14 14 12 12 11 11 10 10 12 12 11 11 11 11 10 10

RC101

15

1629.99

1190 1078

1506.30

1368

11

1265.89

11.50

1283.50

1365

RC104

10

1152.08

10.60

1149.56

1307

RC105

15

1538.94

16.30

1603.27

1325

RC106

12

RC107

11

RC108

10

9291.34 9568.80 8159.04 8297.26 6931.78 6979.30 6277.88 6314.44 9578.80 9623.00 7254.26 7356.34 6860.96 7133.02 6279.64 6306.94

13.40

RC103

0.951 0.962 0.963 0.982 0.974 0.985 0.968 0.980 0.982 0.991 0.915 0.926 0.904 0.933 0.941 0.965

1496

1493.60

1645.67 1784.40 1479.52 1548.63 1265.89 1289.65 1138.94 1157.22 1589.40 1611.50 1427.13 1478.17 1230.48 1366.51 1139.82 1153.47

1706.40

13

15 15 13 13 11 11 10 10 16 16 11 11 11 11 10 10

15.70

RC102

12.00

1214.18

0.958

7228.37

11.79

1428 1321

1341.22

777

1194.74

775

1113.39

1989

CE

Ave.

775

1124.14

958.62

AN US

10 10 10 10 10 10 10 10 10

CR IP T

#Veh.

M

C101 C102 C103 C104 C105 C106 C107 C108 C109

Average in 10 runs

Time (Sec.) 87 492 629 1468 637 709 593 539 503

ED

Dis.

PT

#Veh.

Non-dominated solutions

12.70

1248.53

1304

11.60

1174.50

1322

12.10

1136.70

1317

10.00

993.20

1266

11.70

1463.28

1315

11.50

1318.49

1285

10.80

1144.53

1308

12.04

1154.38

1254.74

AC

In this subsection, we evaluate the benefits gained by MOVRPFlexTW. Tables 13-15 provide the solutions for the single objective VRPFlexTW (Single obj. VRPFlexTW) reported in the literature [43], as well as the non-dominated solutions and average results produced by the proposed HACO method for MOVRPFlexTW over 10 runs. These tables present the total distance (Dis.), number of vehicles (#Veh.), total routing cost (Cost), and average customer satisfactions (Satis.) for: (i) MOVRPFlexTW with wbli = wbui = 0.05; (ii) MOVRPFlexTW with wbli = wbui = 0.15; and (iii) MOVRPFlexTW with wbli = wbui = 0.25. In Tables 13-15, the solutions produced using MOVRPFlexTW and HACO are written in boldface if they are very similar to or better than the solutions of the single objective VRPFlexTW presented in [43]. Note that the single objective VRPFlexTW was solved using the tabu search algorithm, and its objective was to minimize the total cost, which consists of traveling costs, fixed vehicle costs, and penalty costs incurred for time window violations [43]. In this paper, time window violations are reflected by customer satisfaction, which is maximized as a separate objective. 23

ACCEPTED MANUSCRIPT

Problem types C2, R2, and RC2 are omitted here because these instances require more computational time if their wider time windows are further relaxed.

CR IP T

In this proposed approach, a decision maker’s preferences among the objectives (customer satisfaction and routing costs) are taken into consideration to choose among the solutions. Table 13 gives the results for MOVRPFlexTW with wbli = wbui = 0.05. In Table 13, a single Pareto solution was obtained for problems C101-C109 and R104. For C101, C102, C103, C105, C106, C107, C108, C109, and R104, their solutions obtained using MOVRPFlexTW were the same as or better than the solutions obtained using the single objective VRPFlexTW. For C104, although the distance obtained using MOVRPFlexTW was slightly worse than the distance obtained by using single objective VRPFlexTW, the difference was only 0.83%. For the other problems, different non-dominated solutions were obtained using MOVRPFlexTW. Compared with the number of vehicles obtained using the single objective VRPFlexTW, MOVRPFlexTW implemented using HACO performed better for R105, R106, R109, R111, and RC106.

AN US

Tables 14 and 15 present the solutions obtained for the MOVRPFlexTW with wbli = wbui = 0.15 for all i ∈ {1, 2, · · · , n} and with wbli = wbui = 0.25 for all i ∈ {1, 2, · · · , n}, respectively. Compared with the solutions obtained using the single objective VRPFlexTW, the non-dominated solutions of C102, C103 and RC103 in Table 14 and the non-dominated solutions of C102, C103, R106, R107, R109, R110, RC102, RC103, RC104, RC105 and RC108 in Table 15 are much better in terms of distance and the number of vehicles, but the MOVRPFlexTW cannot always provide similar or better solutions for all instances.

Effects of implementing different time windows

PT

5.3.3

ED

M

As shown in Tables 13-15, although a part of the non-dominated solutions obtained using MOVRPFlexTW are not better than the solutions obtained using the single objective VRPFlexTW, we should note that these alternate Pareto solutions are clearly comparable and a decision maker can decide which solution is more preferable based on his/her preferences. Moreover, the proposed MOVRPFlexTW eliminates the need to experiment with weights for the time window violations (as required in a weighted-sum approach), which is useful because poorly chosen weights can result in unsatisfactory solutions. Furthermore, the solutions and computational times in Tables 13-15 confirm that the proposed HACO method can provide good solutions in a reasonable computational time.

CE

From the numbers shown in Tables 13-15, it is easy to find that the average computational time is increased with the enlarged time windows. For example, the average computational time of solving the instance C101 where wbli = wbui = 0.05 is 428 seconds over 10 runs, while the average computational time of solving it where wbli = wbui = 0.15 and wbli = wbui = 0.25 are 678 seconds and 1017 seconds, respectively. This is because the wider relaxed time windows require more computational time.

AC

Comparing the average results in Table 13 and 14 with those in Table 15, it is clear that larger flexibility fractions reduce the total distance using fewer vehicles where the customer satisfaction is lower. We can similarly compare the average results in Table 13 with those presented in Table 14. This indicates that the decision maker can increase customer satisfaction at the expense of routing costs using narrow flexible time windows; on the contrary, the decision maker can decrease the routing costs at the expense of customer satisfaction using wide flexible time windows.

24

ACCEPTED MANUSCRIPT

Table 14. Comparison of the single objective VRPFlexTW solutions with the solutions obtained by implementing MOVRPFlexTW with the suggestive HACO where wbli = wbui = 0.15 (i ∈ {1, 2, · · · , n}). MOVRPFlexTW+HACO Non-dominated solutions

C101 C102

10 10

828.94 1007.31

Time (Sec.) 315 712

C103

10

972.41

617

C104 C105 C106 C107 C108 C109

10 10 10 10 10 10

855.29 828.94 828.94 828.94 828.94 828.94

1575 681 672 573 570 636

R101

18

1623.69

1196

R102

16

1425.14

1236

R103

14

1210.14

2112

R104

10

980.65

1523

R105

14

1343.93

1938

R106

12

1186.79

1435

R107

11

1050.86

1726

9

961.70

1390

R109

12

1128.15

1938

R110

11

1033.29

1723

R111

11

1068.23

1017

R112

10

965.95

648

RC101

15

1595.05

1512

RC102

13

1428.71

2127

RC103

12

1279.81

RC104

10

1123.85

RC105

13

1457.73

RC106

11

RC107

11

RC108

10

Ave.

11.48

742

917

PT

1128

1298.27

995

1183.25

1895

1094.27

1959

1112.00

1224.41

CE

Dis.

Satis.

Cost

#Veh.

Dis.

10 10 10 10 10 10 10 10 10 10 10

828.94 857.64 878.04 864.64 937.68 848.98 828.94 828.94 828.94 828.94 828.94

1.000 0.894 0.916 0.883 0.907 0.952 1.000 1.000 1.000 1.000 1.000

5657.88 5715.28 5756.08 5729.28 5875.36 5697.96 5657.88 5657.88 5657.88 5657.88 5657.88

10.0 10.4

828.94 864.37

Time (Sec.) 678 649

10.0

907.48

1008

10.2 10.3 10.0 10.3 10.6 10.3

867.54 833.64 828.94 835.46 843.74 835.84

895 1343 1799 1775 1795 1933

18 18 18 16 16 14 14 10 10 14 14 12 12 11 11 9 9 12 12 11 11 11 11 10 10

1654.87 1661.59 1670.31 1445.23 1478.56 1227.35 1234.32 980.65 987.60 1343.93 1356.87 1189.80 1207.99 1075.80 1123.30 961.70 987.70 1130.64 1167.86 1038.67 1089.60 1064.25 1087.43 963.20 981.90

0.836 0.874 0.891 0.783 0.805 0.942 0.961 0.954 0.967 0.918 0.937 0.881 0.903 0.928 0.943 0.868 0.908 0.847 0.872 0.854 0.881 0.848 0.869 0.886 0.905

10509.74 10523.18 10540.62 9290.46 9357.12 8054.70 8068.64 5961.30 5975.20 8287.86 8313.74 7179.60 7215.98 6551.60 6646.60 5523.40 5575.40 7061.28 7135.72 6477.34 6579.20 6528.50 6574.86 5926.40 5963.80

18.6

1663.20

1944

16.6

1463.84

1866

14.3

1229.84

1755

10.5

983.56

1865

14.8

1352.74

1934

12.0

1196.58

1637

11.0

1104.67

1698

15 15 13 13 11 11 10 10 13 13 11 11 11 11 11 10 10

1595.05 1632.08 1436.89 1467.28 1154.60 1189.50 1083.90 1134.60 1484.60 1509.80 1298.27 1328.94 1209.64 1223.57 1237.60 1104.52 1124.60

0.928 0.946 0.903 0.937 0.845 0.897 0.937 0.957 0.728 0.849 0.774 0.795 0.853 0.868 0.886 0.897 0.918

9190.10 9264.16 8073.78 8134.56 6709.20 6779.00 6167.80 6269.20 8169.20 8219.60 6996.54 7057.88 6819.28 6847.14 6875.20 6209.04 6249.20

11.75

1163.91

0.901

7029.70

ED

R108

Average in 10 runs

#Veh.

CR IP T

Dis.

AN US

#Veh.

10.1

965.40

1549

12.3

1159.85

1798

11.3

1065.74

1965

11.8

1083.59

1985

10.0

970.13

1436

15.2

1608.59

1988

13.1

1449.78

1976

11.2

1176.43

1798

10.0

1095.64

1838

14.2

1498.46

1848

11.0

1308.68

1766

11.3

1230.58

1895

10.3

1116.95

1696

11.78

1116.21

1658.96

A real case application

AC

6

Single obj. VRPFlexTW

M

Instance

In this section, a real case application has been developed for Vanguard logistics which is a leading company in the vegetable and food distribution sector in China, with the purpose of validating the multi-objective VRPFlexTW model. The problem consists of 16 delivery points (16 supermarkets around Beilin district, Xi’an) served directly from a depot (supply base of vegetable and food). Figure 8 illustrates the geographical position of 16 delivery points. Owing that the depot (longitude 108.611118 and latitude 34.117327) is located in Hu county and far away from the 16 delivery points, it is not marked in Figure 8. The distance of transportation between each pairs of points are listed in Table 16 (0 is the index of depot). The fleet of vehicles consists of several same rigid trucks with capacity of 25

ACCEPTED MANUSCRIPT

Table 15. Comparison of the single objective VRPFlexTW solutions with the solutions obtained by implementing MOVRPFlexTW with the suggestive HACO where wbli = wbui = 0.25 (i ∈ {1, 2, · · · , n}). MOVRPFlexTW+HACO Single obj. VRPFlexTW

Non-dominated solutions

10 10

828.94 1017.06

C103

10

1022.88

634

C104

10

933.47

1741

C105 C106 C107 C108 C109

10 10 10 10 10

828.94 828.94 828.94 828.94 828.94

532 597 496 689 511

R101

17

1597.96

1117

R102

15

1405.93

664

R103

13

1225.99

1013

R104

10

1022.51

1013

R105

13

1294.36

1490

R106

12

1207.99

914

R107

11

1062.88

902

R108

9

946.54

1524

R109

12

1110.40

1301

R110

11

1061.10

559

R111

10

1053.24

887

R112

10

971.28

1416

RC101 RC102

14 13

1521.56 1395.25

1205 1835

RC103

10

1221.53

RC104

10

1133.50

RC105

13

1425.89

RC106

11

RC107

10

RC108

10

Ave.

11.17

1053

764

PT

1173 588

1157.88

1252

1127.99

740

1109.34

948.55

Dis.

Satis.

Cost

#Veh.

Dis.

10 10 10 10 10 10 10 10 10 10 10 10 10

828.94 903.65 946.85 918.12 974.20 1013.52 933.47 946.36 828.94 828.94 828.94 828.94 828.94

1.000 0.814 0.825 0.894 0.903 0.913 0.903 0.912 1.000 1.000 1.000 1.000 1.000

5657.88 5807.30 5893.70 5836.24 5948.39 6027.04 5866.94 5892.72 5657.88 5657.88 5657.88 5657.88 5657.88

10.0 10.0

828.94 927.65

Time (Sec.) 1017 977

10.0

946.82

1469

10.0

940.55

1136

10.3 10.0 10.2 10.0 10.0

830.24 828.94 830.76 831.25 828.94

1795 2007 2104 2201 2594

17 17 15 15 13 13 10 10 13 13 13 12 12 11 11 9 9 11 11 10 10 10 10 10 10

1568.93 1614.65 1405.93 1438.93 1211.09 1268.76 1008.54 1039.46 1304.50 1367.90 1441.20 1147.60 1164.90 1027.85 1048.39 947.20 968.70 1228.20 1285.60 1001.40 1108.30 1053.24 1063.52 976.80 981.90

0.693 0.775 0.672 0.775 0.628 0.635 0.906 0.915 0.746 0.771 0.793 0.813 0.865 0.875 0.894 0.846 0.873 0.658 0.673 0.516 0.579 0.558 0.584 0.894 0.905

9937.86 10029.30 8811.86 8877.86 7622.18 7737.52 6017.08 6078.92 7809.00 7935.80 8082.40 7095.20 7129.80 6455.70 6496.78 5494.40 5537.40 6856.40 6971.20 6002.80 6216.60 6106.48 6127.04 5953.60 5963.80

17.8

1593.28

2568

15.6

1427.35

2365

13.7

1254.65

2290

10.0

1019.93

2229

13.4

1430.86

2498

12.0

1158.43

2009

11.0

1039.82

2014

9.0

950.48

2104

11.6

1277.90

2339

10.5

1106.85

2438

13 12 13 10 10 10 10 11 13 11 11 10 10 10 10

1788.40 1593.70 1394.60 1154.60 1175.20 1061.69 1124.20 1219.70 1359.60 1280.12 1298.27 1123.00 1173.47 1070.01 1071.60

0.509 0.458 0.716 0.764 0.782 0.903 0.946 0.485 0.697 0.615 0.774 0.597 0.625 0.784 0.796

11.11

1135.31

0.782

10.3

1060.25

2655

10.0

979.68

2186

8776.80 7987.40 7989.20 6309.20 6350.40 6123.38 6248.40 6839.40 7919.20 6960.24 6996.54 6246.00 6346.94 6140.02 6143.20

13.6 12.8

1793.82 1428.90

2646 2785

10.2

1170.98

2227

10.0

1087.32

2453

12.5

1298.30

2457

11.0

1294.57

2114

10.0

1162.49

2228

10.0

1070.30

2105

6715.90

11.22

1117.25

2138.21

AC

CE

1280.12

Average in 10 runs

#Veh.

CR IP T

C101 C102

Time (Sec.) 377 521

AN US

Dis.

ED

#Veh.

M

Ins.

80 tons and average speed of 60 kilometers per hour to deliver the supermarkets’ demands. Table 17 shows the depot and supermarkets’ details in sequence of supermarket index, extended earliest time to begin the service Ei , ready time ei , due time li , extended latest time to begin the service Li , daily demands and service time. As shown in Table 17, the time windows of almost all supermarkets are between 10:00 and 12:00 which are not rush hours, and therefore the traffic jam is not considered in this study. The maximum route travel time for each vehicle is 3 hours. The transportation cost and the fixed cost of using each vehicle are 3 Yuan per kilometer and 600 Yuan, respectively. Because the unpunctual services have negative effect on the sales of corresponding supermarket, all supermarkets usually require to be served between the ready time and the due time. However, owing to the smaller flow of customers during the traditional festival holidays such as Dragon Boat Festival, 26

CR IP T

ACCEPTED MANUSCRIPT

AN US

Figure 8: Geographical positions of the 16 delivery points (supermarkets). Table 16. Distance of transportation between each pair of points (km). 2 8.000 1.650 0.000

3 13.000 4.750 4.218 0.000

4 7.000 2.042 1.773 5.507 0.000

5 16.000 0.936 5.068 3.230 6.495 0.000

6 10.000 3.111 4.859 6.842 4.133 7.873 0.000

7 14.000 7.112 5.520 8.129 4.952 8.934 9.843 0.000

8 11.000 4.956 3.364 5.973 3.388 6.778 7.687 2.382 0.000

M

1 7.000 0.000

ED

Index 0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15

9 15.000 6.524 6.012 1.855 7.439 3.321 7.598 6.402 5.001 0.000

10 15.000 6.545 6.033 1.796 7.460 3.192 7.619 6.423 5.570 0.267 0.000

11 5.000 3.339 4.383 7.070 3.125 8.101 3.474 4.837 7.137 9.064 8.796 0.000

12 14.000 7.105 6.534 4.436 6.287 7.639 9.536 5.281 2.907 4.232 3.976 9.920 0.000

13 60.000 5.144 6.730 6.295 6.775 6.862 3.431 8.765 10.000 8.289 8.021 6.388 9.887 0.000

14 15.000 8.540 8.008 3.975 9.396 3.828 10.000 8.418 7.226 2.546 2.847 11.000 6.600 9.629 0.000

15 10.000 8.590 9.730 8.410 9.816 6.026 6.447 11.000 13.000 8.653 9.224 9.954 11.000 5.135 6.106 0.000

16 80.000 6.745 7.885 7.498 7.978 5.824 4.609 9.968 11.000 9.025 9.224 8.116 11.000 2.987 8.067 2.013

CE

PT

Mid-Autumn Festival and Spring Festival, the time window constraints of supermarkets can be violated to a certain extent. The extended earliest time Ei and latest time Li of each supermarkets are presented in Table 17. Additionally, during the traditional festival holidays, the company needs to pay large extra allowance to the distribution clerks who still deliver demands. The decision maker of this company needs to design the vehicle routs with two different objectives: (1)minimizing the total routing costs, and (2) maximizing the total customer satisfaction.

AC

As stated before, the MOVRPFlexTW with wbli = wbui = 0 (i ∈ {1, 2, · · · , n}) is equivalent to the single objective vehicle routing problem with hard time window. To strictly satisfy the time window constraints of all delivery supermarkets, we firstly solve this problem by implementing MOVRPFlexTW with the proposed HACO where wbli = wbui = 0 (i ∈ {0, 1, 2, · · · , 16}), and the results are presented in Table 18.

From Table 18, we can know that the company usually needs to arrange 5 vehicles to deliver the demands within the appointed time windows [ei , li ] (i ∈ {0, 1, 2, · · · , 16}). Because of the narrow time windows, all the vehicles are not full-load when they dispatch from the depot. All the vehicles can return back to the depot before 12:00. According to the total distance 203.908km shown in the last row in Table 18 and the number of vehicles, we can obtain the daily routing cost which is around 3611.724 Yuan. 27

ACCEPTED MANUSCRIPT Table 17. Time windows and demands of each point. Ei

ei

li

Li

0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16

10:00 10:25 10:30 10:40 10:15 10:10 10:15 10:05 10:15 10:05 10:40 10:00 10:20 10:30 10:20 10:05 10:40

10:00 10:30 10:50 10:45 10:40 10:15 10:20 10:10 10:20 10:10 11:00 10:10 10:40 10:55 10:30 11:25 11:00

12:00 11:10 11:10 11:05 10:55 10:50 10:50 10:30 10:45 10:30 11:20 10:30 11:00 11:15 10:55 11:40 11:30

12:30 11:20 11:20 11:30 11:00 11:40 11:30 10:40 10:55 10:55 11:30 11:40 11:05 11:30 11:45 12:00 11:50

Demands (ton) 0 30 16 18 9 23 17 12 26 14 19 15 29 31 10 21 18

Service time (minute) 0 10 15 15 10 15 12 10 8 10 15 13 11 10 8 10 12

CR IP T

Point

Table 18. Result obtained by implementing MOVRPFlexTW to solve the proposed case where wbli = wbui = 0 (i ∈ {0, 1, 2, · · · , 16}). Route

1 2 3 4 5 Total

0-13-16-15-0 0-5-14-3-10-0 0-9-8-12-0 0-7-1-4-2-0 0-11-6-0

Distance (km) 75.000 40.599 36.908 32.927 18.474 203.908

Delivery time (hour) 1.790 1.560 1.090 1.295 0.724 6.459

Time of returning back to the depot 11:47 11:33 11:05 11:17 10:43

Sum of demands (ton) 70 70 69 67 32 308.000

AN US

Vehicle

PT

ED

M

For delivering the demands during the festival holidays, the MOVRPFlexTW provides a single Pareto solution which is presented in Table 19. The total customer satisfactions of each route are listed in the last column of Table 19. Owing to the violation of the appointed time windows [ei , li ] (i ∈ {0, 1, 2, · · · , 16}), the number of vehicles and the total distance presented in Table 19 (4 and 174.875km, respectively) are smaller than the corresponding results shown in Table 18, which result in smaller daily routing cost around 2924.625 Yuan. Furthermore, the total customer satisfaction is decreased from 16.00 to 10.65. Additionally, because of the decreased number of vehicles, the total delivery time is relatively increased and the time of returning back to the depot for each vehicle is also later in Table 19. After extending the time windows from [ei , li ] to [Ei , Li ], a part of vehicles, which is not full-load in the case of narrow time windows, can load more demands and serve more points. For example, the first vehicle in Table 18 returns to the depot after serving supermarket 15. After extending the time windows, supermarket 14 is added to this route (the first vehicle in Table 19) before returning to the depot.

CE

The above observations are very similar to the ones obtained in subsection 5.3.3. During the festival holidays, the decision maker can arrange a smaller number of vehicles to deliver the demands at the expense of supermarkets’ satisfaction using the flexible time windows.

AC

Table 19. Result obtained by implementing MOVRPFlexTW to solve the proposed case where the flexible time windows [Ei , Li ](i ∈ {0, 1, 2, · · · , 16}) are used. Vehicle

Route

1 2 3 4 Total

0-13-16-15-14-0 0-4-7-8-12-0 0-9-10-3-5-0 0-2-1-6-11-0

Distance (km) 86.106 31.241 36.293 21.235 174.875

Delivery time (hours) 2.100 1.300 1.767 1.550 6.717

Time of returning back to the depot 12:06 11:18 11:46 11:33

28

Sum of demands (ton) 80 76 74 78 308

Satisfaction 3.040 3.000 2.500 2.110 10.65

ACCEPTED MANUSCRIPT

7

Conclusion

In this paper, we considered a multi-objective VRPFlexTW, which is an extension of VRPTW, in which the customers are allowed to be serviced outside their required time interval with a certain tolerance. This problem also considers the customer satisfaction related to the route, which is represented as a convex function dependent on the vehicle arrival time. Compared with VRPTW, VRPFlexTW is more flexible and applicable to real life problems. This paper considered VRPFlexTW as a multi-objective problem, with the aim of minimizing the overall routing costs and maximizing the overall customer satisfaction.

AN US

CR IP T

The basic framework of the proposed approach for solving MOVRPFlexTW was inspired by the classical ACO meta-heuristic. However, this approach was extended using three mutation operators that enhance the local search ability and allow for random global explorations. These mutation operators prevent the ACO from getting trapped in local optimal solution. Meanwhile, using the concept of Pareto optimization, we can obtain better non-dominated solutions for MOVRPFlexTW. The proposed HACO was applied to solve the benchmark Solomon’s instances. Our results imply that the proposed HACO is quite sufficient as compared with the best published results. Because the MOVRPFlexTW uses more flexible time windows, more savings can be made for the logistic company at the expense of customer satisfaction. This trade-off must be carefully balanced by the decision makers with respect to the preferences of logistic company and the concerns of their customers.

PT

ED

M

A number of future research directions and ideas have arisen from this work. On the one hand, several new MOVRPFlexTW models can be obtained by reformulating objective functions, e.g., tripleobjective VRPFlexTW in which the total vehicles used for service and overall total traveled distance are minimized, and the overall customer satisfaction is maximized; and quadruple-objective VRPFlexTW in which the total vehicles used for service, overall total traveled distance and waiting time are minimized, and the overall customer satisfaction is maximized. On the other hand, due to the relatively high efficiency and generality of HACO, it can be easily adapted to solve some classic variants of VRP. For example, VRPTW (vehicle routing problem with hard time window), by not relaxing the time windows; HVRP ( heterogeneous fleet vehicle routing problem ), by separately checking the capacity constraint for each vehicle; and VRPSTW ( vehicle routing problem with soft time window), by limitlessly relaxing the customers’ time windows in the planning horizon. Therefore, we hope that the proposed HACO with required modification will be implemented to successfully solve these problems.

CE

Acknowledgement

AC

The authors would like to thank the anonymous referees and the editor for their constructive feedback. This work is supported by the National Natural Science Foundation of China (grant. 71401106), the Shanghai Natural Science Foundation (grant. 14ZR1418700), the Humanities and Social Sciences Foundation from the Ministry of Education of China (grants. 16YJA630037 and 19YJAZH064).

References [1] E. Alba and B. Dorronsoro. Computing nine new best-so-far solutions for capacitated vrp with a cellular genetic algorithm. Information Processing Letters, 98(6):225–230, 2006. [2] N. Balakrishnan. Simple heuristics for the vehicle routeing problem with soft time windows. Journal of the Operational Research Society, 44(3):279–287, 1993.

29

ACCEPTED MANUSCRIPT

[3] R. Baldacci, A. Mingozzi, and R. Roberti. Recent exact algorithms for solving the vehicle routing problem under capacity and time window constraints. European Journal of Operational Research, 218(1):1–6, 2012. [4] S. R. Balseiro, I. Loiseau, and J. Ramonet. An ant colony algorithm hybridized with insertion heuristics for the timedependent vehicle routing problem with time windows. Computers & Operations Research, 38(6):954–966, 2011.

CR IP T

[5] R. Ba˜ nos, J. Ortega, C. Gil, A. Fern´andez, and F. de Toro. A simulated annealing-based parallel multi-objective approach to vehicle routing problems with time windows. Expert Systems with Applications, 40(5):1696–1707, 2013. [6] R. Ba˜ nos, J. Ortega, C. Gil, A. L. M´ arquez, and F. de Toro. A hybrid meta-heuristic for multiobjective vehicle routing problems with time windows. Computers & Industrial Engineering, 65(2):286–296, 2013.

AN US

[7] A. K. Beheshti and S. R. Hejazi. A novel hybrid column generation-metaheuristic approach for the vehicle routing problem with general soft time window. Information Sciences, 316:598–615, 2015. [8] N. Bhusiri, A. G. Qureshi, and E. Taniguchi. The trade-off between fixed vehicle costs and timedependent arrival penalties in a routing problem. Transportation Research Part E: Logistics and Transportation Review, 62:1–22, 2014. [9] K. Braekers, K. Ramaekers, and I. V. Nieuwenhuyse. The vehicle routing problem: State of the art classification and review. Computers & Industrial Engineering, 99:300–313, 2016.

M

[10] O. Br¨aysy and M. Gendreau. Vehicle routing problem with time windows, part 1: Route construction and local search algorithms. Transportation Science, 39(1):104–118, 2005.

ED

[11] O. Br¨aysy and M. Gendreau. Vehicle routing problem with time windows, part 2: Metaheuristics. Transportation Science, 39(1):119–139, 2005.

PT

[12] H. I. Calvete, C. Gal´e, M. Oliveros, and B. S´anchez-Valverde. A goal programming approach to vehicle routing problems with soft time windows. European Journal of Operational Research, 177(3):1720–1733, 2007.

CE

[13] L. Chen, H.-Y. Sun, and S. Wang. A parallel ant colony algorithm on massively parallel processors and its convergence analysis for the travelling salesman problem. Information Sciences, 199:31–42, 2012.

AC

[14] W.-C. Chiang and R. A. Russell. A metaheuristic for the vehicle-routeing problem with soft time windows. Journal of the Operational Research Society, 55(12):1298–1310, 2004. [15] Q. Ding, X. Hu, L. Sun, and Y. Wang. An improved ant colony optimization and its application to vehicle routing problem with time windows. Neurocomputing, 98:101–107, 2012.

[16] M. Dorigo and L. M. Gambardella. Ant colony system: A cooperative approach to the traveling salesman problem. IEEE Transactions on Evolutionary Computation, 1(1):53–66, 1997. [17] A. E. Eiben and S. K. Smit. Parameter tuning for configuring and analyzing evolutionary algorithms. Swarm and Evolutionary Computation, 1(1):19–31, 2011. [18] N. A. El-Sherbeny. Vehicle routing with time windows: An overview of exact, heuristic and metaheuristic methods. Journal of King Saud University (Science), 22(3):123–131, 2010. 30

ACCEPTED MANUSCRIPT

[19] M. A. Figliozzi. An iterative route construction and improvement algorithm for the vehicle routing problem with soft time windows. Transportation Research Part C: Emerging Technologies, 18(5):668–679, 2010. [20] S. Garcia, D. Molina, M. Lozano, and F. Herrera. A study on the use of non-parametric tests for analyzing the evolutionary algorithms’s behaviour: a case study on the cec’2005 special session on real parameter optimization. Journal of Heuristics, 15:617–644, 2009.

CR IP T

[21] S. F. Ghannadpour, S. Noori, and R. Tavakkoli-Moghaddam. A multi-objective dynamic vehicle routing problem with fuzzy time windows: Model, solution and application. Applied soft computing, 14:504–527, 2014. [22] K. Ghoseiri and S. F. Ghannadpour. Multi-objective vehicle routing problem with time windows using goal programming and genetic algorithm. Applied Soft Computing, 10(4):1096–1107, 2010. [23] L. Hong. An improved lns algorithm for real-time vehicle routing problem with time windows. Computers & Operations Research, 39(2):151–163, 2012.

AN US

[24] T. Ibaraki, S. Imahori, M. Kubo, T. Masuda, T. Uno, and M. Yagiura. Effective local search algorithms for routing and scheduling problems with general time-window constraints. Transportation Science, 39(2):206–232, 2005. [25] S. Iqbal, M. Kaykobad, and M. S. Rahman. Solving the multi-objective vehicle routing problem with soft time windows with the help of bees. Swarm and Evolutionary Computation, 24:50–64, 2015.

M

[26] O. Jabali, R. Leus, T. Van Woensel, and T. de Kok. Self-imposed time windows in vehicle routing problems. OR Spectrum, 37(2):331–352, 2015.

ED

[27] N. Jozefowiez, F. Semet, and E. Talbi. Multi-objective vehicle routing problems. European Journal of Operational Research, 189(2):293–309, 2008.

PT

[28] C. B. Kalayci and C. Kaya. An ant colony system empowered variable neighborhood search algorithm for the vehicle routing problem with simultaneous pickup and delivery. Expert Systems with Applications, 66:163–175, 2016.

CE

[29] C. Koc¸, T. Bektas¸, O. Jabali, and G. Laporte. Thirty years of heterogeneous vehicle routing. European Journal of Operational Research, 249(1):1–21, 2016.

AC

[30] Y. A. Koskosidis, W. B. Powell, and M. M. Solomon. An optimization-based heuristic for vehicle routing and scheduling with soft time window constraints. Transportation Science, 26(2):69–85, 1992. [31] A. A. Kovacs, S. N. Parragh, and R. F. Hartl. The multi-objective generalized consistent vehicle routing problem. European Journal of Operational Research, 247(2):441–458, 2015. [32] J. Luo, X. Li, M.-R. Chen, and H. Liu. A novel hybrid shuffled frog leaping algorithm for vehicle routing problem with time windows. Information Sciences, 316:266–292, 2015. [33] Y. Marinakis, M. Marinaki, and A. Migdalas. A multi-adaptive particle swarm optimization for the vehicle routing problem with time windows. Information Sciences, 481:311–329, 2019. [34] B. Ombuki, B. J. Ross, and F. Hanshar. Multi-objective genetic algorithm for vehicle routing problem with time windows. Applied Intelligence, 24(1):17–30, 2006. 31

ACCEPTED MANUSCRIPT

[35] A. G. Qureshi, E. Taniguchi, and T. Yamada. An exact solution approach for vehicle routing and scheduling problems with soft time windows. Transportation Research Part E: Logistics and Transportation Review, 45(6):960–977, 2009. [36] A. G. Qureshi, E. Taniguchi, and T. Yamada. Exact solution for the vehicle routing problem with semi soft time windows and its application. Procedia - Social and Behavioral Sciences, 2(3):5931–5943, 2010. [37] A. Rajasekhar, N. Lynn, S. Das, and P. N. Suganthan. Computing with the collective intelligence of honey bees-a survey. Swarm and Evolutionary Computation, 32:25–48, 2017.

CR IP T

[38] M. Reed, A. Yiannakou, and R. Evering. An ant colony algorithm for the multi-compartment vehicle routing problem. Applied Soft Computing, 15:169–176, 2014. [39] Z.-G. Ren, Z.-R. Feng, and A.-M. Zhang. Fusing ant colony optimization with lagrangian relaxation for the multiple-choice multidimensional knapsack problem. Information Sciences, 182(1):15–29, 2012.

AN US

[40] S. Rostami, D. O’Reilly, A. Shenfield, and N. Bowring. A novel preference articulation operator for the evolutionary multi-objective optimisation of classifiers in concealed weapons detection. Information Sciences, 295:494–520, 2015. [41] M. Salani, M. Battarra, and L. M. Gambardella. Operations Research Proceedings 2014, chapter Exact Algorithms for the Vehicle Routing Problem with Soft Time Windows, pages 481–486. Springer International Publishing, 2016.

M

[42] D. Tas¸, N. Dellaert, T. Van Woensel, and T. de Kok. Vehicle routing problem with stochastic travel times including soft time windows and service costs. Computers & Operations Research, 40(1):214–224, 2013.

ED

[43] D. Tas¸, O. Jabali, and T. Van Woensel. A vehicle routing problem with flexible time windows. Computers & Operations Research, 52(Part A):39–54, 2014. [44] E. Taillard, P. Badeau, M. Gendreau, F. Guertin, and J.Y. Potvin. A tabu search heuristic for the vehicle routing problem with soft time windows. Transportation Science, 31(2):170–186, 1997.

PT

[45] K. C. Tan, Y. H. Chew, and L. H. Lee. A hybrid multiobjective evolutionary algorithm for solving vehicle routing problem with time windows. Computational Optimization and Applications, 34(1):115–151, 2006.

CE

[46] E. Teymourian, V. Kayvanfar, GH. M. Komaki, and M. Zandieh. Enhanced intelligent water drops and cuckoo search algorithms for solving the capacitated vehicle routing problem. Information Sciences, 334-335:354–378, 2016.

AC

[47] Z. Ursani, D. Essam, D. Cornforth, and R. Stocker. Localized genetic algorithm for vehicle routing problem with time windows. Applied Soft Computing, 11(8):5375–5390, 2011. [48] T. Vidal. Technical note: Split algorithm in o(n) for the capacitated vehicle routing problem. Computers & Operations Research, 69:40–47, 2016. [49] W. Wu, Y. Tian, and T. Jin. A label based ant colony algorithm for heterogeneous vehicle routing with mixed backhaul. Applied Soft Computing, 47:224–234, 2016. [50] E. Zare-Reisabadi and S. H. Mirmohammadi. Site dependent vehicle routing problem with soft time window: Modeling and solution approach. Computers & Industrial Engineering, 90:177– 185, 2015.

32