A computational method for free terminal time optimal control problem governed by nonlinear time delayed systems

Accepted Manuscript A computational method for free terminal time optimal control problem governed by nonlinear time delayed systems Qinqin Chai, Wu ...

Download PDF

651KB Sizes 2 Downloads 63 Views

Report

PDF Reader
Full Text

Accepted Manuscript

A computational method for free terminal time optimal control problem governed by nonlinear time delayed systems Qinqin Chai, Wu Wang PII: DOI: Reference:

S0307-904X(17)30541-3 10.1016/j.apm.2017.08.023 APM 11935

To appear in:

Applied Mathematical Modelling

Received date: Revised date: Accepted date:

12 April 2016 18 August 2017 21 August 2017

Please cite this article as: Qinqin Chai, Wu Wang, A computational method for free terminal time optimal control problem governed by nonlinear time delayed systems, Applied Mathematical Modelling (2017), doi: 10.1016/j.apm.2017.08.023

This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

ACCEPTED MANUSCRIPT

Highlights • A free final time optimal control problem for nonlinear delay systems

CR IP T

is considered.

• A solving method based on fully informed particle swarm optimization is adopted.

AC

CE

PT

ED

M

AN US

• A practical optimal fishery control problem is solved.

1

ACCEPTED MANUSCRIPT

Qinqin Chaia,∗, Wu Wanga a

CR IP T

A computational method for free terminal time optimal control problem governed by nonlinear time delayed systems College of Electrical Engineering and Automation, Fuzhou University, Fuzhou, P. R. China

AN US

Abstract

This paper considers a free terminal time optimal control problem governed by nonlinear time delayed system, where both the terminal time and the control are required to be determined such that a cost function is minimized

M

subject to continuous inequality state constraints. To solve this free terminal time optimal control problem, the control parameterization technique is

ED

applied to approximate the control function as a piecewise constant control function, where both the heights and the switching times are regarded as

PT

decision variables. In this way, the free terminal time optimal control problem is approximated as a sequence of optimal parameter selection problems

CE

governed by nonlinear time delayed systems, each of which can be viewed as a nonlinear optimization problem. Then, a fully informed particle swarm optimization method is adopted to solve the approximate problem. Finally,

AC

two free terminal time optimal control problems, including an optimal fishery control problem, are solved by using the proposed method so as to demon∗

Corresponding author Email addresses: [email protected] (Qinqin Chai), [email protected] (Wu Wang) Preprint submitted to Applied Mathematical modelling

September 7, 2017

ACCEPTED MANUSCRIPT

strate its applicability. Keywords: time delayed system, free terminal time, optimal control

CR IP T

computation, fully informed particle swarm optimization, fishery control problem 1. Introduction

In this paper, we consider a class of free terminal time optimal control

AN US

problems governed by time delayed systems, where both the terminal times and control variables are to be chosen such that an objective function is minimized subject to some constraints. Many real world practical problems can be formulated in this form, see, for example, the resource (fishery, forestry) harvesting problem [1]. For fishery, the juvenile fish cannot be harvested,

M

and they need time to reach maturity for reproduction. This phenomenon is called time delay. Let the resource population be the state, and let the

ED

feeding, harvesting efforts be the controls. Then the growth of the resource population can be modelled as a dynamic system with delayed state [2]. Dif-

PT

ferent controls will result in different growth processes. Given the delayed dynamic system, the objective is to maximize a given economical benefit

CE

over a planning horizon, where the terminal time is the time when certain sustainable population is achieved. This unknown terminal time is called

AC

free terminal time. The corresponding optimal control problem, where the controls and the unknown terminal time are to be optimized simultaneously, is usually referred to as free terminal time optimal control problem. Other examples include (i) the biological process [3], where the feeding rates and the cells’ growth time are to be chosen such that, the yield is maximized and 3

ACCEPTED MANUSCRIPT

the operation cost is minimized; (ii) the HIV infection treatment process [4], where drugs are prescribed at crucial time points (to be determined) such

CR IP T

that the time from which the patient can live a normal life is the shortest; and (iii) attitude control of micro air vehicles [5], where the flight distance before running out of fuel or battery is maximized.

The free terminal time optimal control problem can, in principle, be solved

by using Pontryagin Maximum Principle [6]. However, as the terminal time is

AN US

determined by a stopping criterion, the resulting boundary value problem is

extremely difficult to solve.Consequently, different numerical computational approaches have been proposed in the literature for solving free terminal time optimal control problems. In [1], a two-stage algorithm is proposed for solving a free terminal time optimal control problem with time delay, where

M

Guinn’s transformation is used to transform the delayed dynamic system to a higher dimensional dynamic system without delay. Then, a gradient descent

ED

algorithm is used to solve the transformed problem. In [4], an adapted iterative forward backward sweep method integrated with a gradient algorithm is

PT

applied to solve a HIV infection problem, where the transversality condition for the terminal time is used. In [7, 8], necessary conditions of optimality

CE

and the transversality condition are obtained for an optimal control problem involving fractional differential equations by using Lagrange multiplier technique. On this basis, a shooting type of algorithm is utilized to search for the

AC

optimal final time and the optimal controls of the fractional optimal control problem. In [9], a hybrid time-scaling transformation and gradient-based algorithms are proposed to search for optimal initial states and terminal time for batch fermentation process. In [10, 11], computational methods based on

4

ACCEPTED MANUSCRIPT

control parameterization method are developed to solve free terminal time optimal control problems with or without time-delayed arguments. In [12],

CR IP T

an optimal switching control problem involving multiple time-delayed systems is formulated as a free terminal time control problem, where the cost

on changing control is part of the cost function. Then, a computational

method developed based on the control parameterization technique is presented. In [13], a two-stage optimization strategy is proposed, in which the

AN US

terminal time is adjusted in the outer stage, while a sequence of fixed time problems is solved in the inner stage. In [14], a simulation-based approxi-

mate dynamic programming method is proposed to solve a free-end optimal feedback control problem of fed-batch reactors. However, for most of these methods, they are developed for optimal control problems without delays

M

in their dynamical systems, and require the gradient of the cost function. These gradient-based algorithms are local search methods. The quality of

ED

the solution obtained depends critically on the initial guess of the decision variables for the optimization process. Furthermore, for the calculation of

PT

the gradient, it requires to solve the dynamic system forward in time and the corresponding co-state system backward in time. Furthermore, it is required

CE

to calculate the integration of the gradient of the Hamiltonian function over the planning horizon. Although all these tasks are to be carried out numerically, these tasks are known to be rather time consuming. In many real

AC

industrial applications, only input-output data are available [15]. For these situations, it is impossible to derive the gradient formulas. Thus, it is required to seek derivative-free optimization algorithms for finding the controls and the terminal time.

5

ACCEPTED MANUSCRIPT

In this paper, we propose to adopt a numerical computational approach based on particle swarm optimization (PSO) algorithm. PSO [16] is a derivative-

CR IP T

free and population-based stochastic search algorithm, which is easy to implement and has been found to be robust and fast in solving many practical

problems in sciences and engineering [17]. It is known that these PSO-based optimization algorithms can get trapped in local minima during the later stage of the searching process. Therefore, extensive research has been car-

AN US

ried out by many researchers over the years for overcoming this drawback. Improvements achieved are basically in the following three aspects: topology design [18], parameter adjustment [19], and hybrid algorithm with other optimization algorithms [20]. Many improved particle swarm optimization algorithms can enhance the performance of the original method at the expense

M

of increasing the complexity of these algorithms. For the topology design, it determines the social information sharing mechanism among populations.

ED

Subsequently, it changes the effectiveness of PSO. With this motivation, in this paper, a basic PSO with fully informed topology proposed in [21] is uti-

PT

lized. It is well recognized that this topology is simple and is more robust than other topologies.

CE

The rest of the paper is organised as follows. Problem formulation is given in Section 2. The problem approximation through the application of the control parameterization technique is detailed in Section 3. In Section 4,

AC

an effective optimization method based on a fully-informed particle swarm optimization method is presented and two numerical examples are solved. Finally, some concluding remarks are made in Section 5.

6

ACCEPTED MANUSCRIPT

2. Problem statement Consider the following time delayed system t ∈ [0, T ],

t ≤ 0,

x(t) = φ(t),

(1)

CR IP T

˙ x(t) = f (t, x(t), x(t − α), u(t)),

(2)

where T > 0 is an unknown terminal time; x(t) = [x1 (t), . . . , xn (t)]> ∈ Rn

is the state vector; x(t − α) ∈ Rn is the delayed state vector; α is a given

AN US

time delay; u(t) = [u1 (t), . . . , um (t)]> ∈ Rm is an unknown control function. f = [f1 , . . . , fn ]> and φ(t) = [φ1 , . . . , φn ]> are given functions.

A control function u = [u1 , . . . , um ]> is said to be an admissible control if the following boundedness constraints are satisfied i = 1, . . . , m,

t ∈ [0, T ],

(3)

M

bi ≤ ui (t) ≤ ci ,

where 0 ≤ bi < ci , i = 1, . . . , m, and bi , ci , i = 1, . . . , m, are, respectively,

ED

the lower and upper bounds of the ith control variable ui (t). Let U be the class of all admissible controls.

PT

Furthermore, we impose the following constraint: Tmin ≤ T ≤ Tmax

(4)

CE

where Tmin and Tmax are the lower and upper bounds of the terminal time, respectively.

AC

We assume that the following conditions are satisfied.

Assumption 1. The function f is continuously differentiable with respect to x and u for each t ∈ [0, T ], and piecewise continuous with respect to t for

each (x, u) ∈ Rn × Rm . Furthermore, the function φ is twice continuously differentiable. 7

ACCEPTED MANUSCRIPT

Assumption 2. There exists a real number L1 > 0 such that ˆ , u)| ≤ L1 (1 + |x| + |ˆ |f (t, x, x x| + |u|),

ˆ , u) ∈ R × Rn × Rn × Rm , (t, x, x

CR IP T

where | · | denotes the Euclidean norm.

On the basis of Assumptions 1 and 2, the dynamic system (1)-(2) admits a unique solution corresponding to each control u satisfying (3)[22]. Let

u ∈ U.

AN US

x(·|u) denote the solution of the dynamic system (1)-(2) corresponding to

We further require that the following state constraints are satisfied. Φl (t, x(t|u)) ≥ 0,

l = 1, . . . , r,

t ∈ [0, T ],

(5)

where the functions Φl , l = 1, . . . , r, are continuously differentiable with

M

respect to x for each t ∈ [0, T ] and piecewise continuous with respect to t for each x ∈ Rn . as follows:

ED

A free terminal time optimal control problem can now be formally stated

PT

Problem (P). Given system (1)-(2), find a terminal time T and an admis-

CE

sible control u(t) such that the performance function Z T J(T, u) = Φ0 (T, x(T |u)) + L0 (t, x(t|u), u)dt,

(6)

0

AC

is minimized subject to constraints (3)-(5). 3. Problem approximation In this section, we shall apply the control parameterization technique [23] to approximate Problem (P) by a sequences of nonlinear programming 8

ACCEPTED MANUSCRIPT

problems. The control parameterization technique approximates the control by a piecewise-constant function or piecewise smooth function with possible

CR IP T

discontinuities at the N − 1 pre-assigned partition time points, where the

heights of the piecewise-constant function are regarded as decision variables. Let uN i (t)

=

N X

σiN,k χ[τk−1 ,τk ) (t),

t ∈ [τk−1 , τk ),

k=1

AN US

Here, χ[τk−1 ,τk ) (t) is the indicator function defined by   1, if t ∈ [τk−1 , τk ), χ[τk−1 ,τk ) (t) =  0, elsewhere

i = 1, . . . , m.

(7)

(8)

and τk , k = 1, . . . , N , are the control switching times satisfying 0 = τ0 < τ1 < · · · < τN −1 < τN = T . For each k = 1, . . . , N and i = 1, . . . , m,

M

σiN,k is the control height of the ith piecewise constant control on the time

interval [τk−1 , τk ). Denote σ N = [(σ N,1 )> , . . . , (σ N,N )> ]> , where σ N,k =

ED

N,k > ] , k = 1, . . . , N . [σ1N,k , . . . , σm

Let ∆τk = τk − τk−1 , where the convention of ∆τ0 = 0 is adopted. It is

CE

PT

assumed that mink=1,...,N ∆τk > 0. Clearly, τk =

k X

∆τk .

i=1

In the classical control parameterization method, ∆τk is fixed. In this

AC

case, only the control heights are decision variables. Thus, to obtain an accurate optimal control policy, the number of N is required to be very large, resulting in a large number of decision variables. Thus, a non-uniform control vector parameterization is introduced in [24, 25], where the time durations ∆τk ,k = 1, ..., N , are also taken as decision variables. Let ∆τ N = 9

ACCEPTED MANUSCRIPT

[∆τ1 , . . . , ∆τN ]> . Then ∆τk and σiN,k , k = 1, . . . , N , i = 1, . . . , m, are decision variables. Let T N denote the set of all such ∆τ N and let ΞN be the set

CR IP T

containing all such σ N . Substituting (7) into system (1)-(2) gives the following system N X k=1

f (t, x(t), x(t − α), σ N,k )χIk (t),

x(t) = φ(t),

t ∈ Ik ,

t ≤ 0,

k = 1, . . . , N

(9)

(10)

P Pk−1 ∆τi , ki=1 ∆τi ). where Ik = [ i=1

AN US

˙ x(t) =

Note that, for each pair (σ N , ∆τ N ) ∈ ΞN × T N , the dynamic system

(9)-(10) admits a unique solution, which is denoted by x(·|σ N , ∆τ N ). For the constraints (4)-(5), they become

k=1

∆τk ≤ Tmax ,

M

Tmin ≤

N X

ED

bi ≤ σik ≤ ci , i = 1, . . . , m,

Φl (t, x(t|σ N , ∆τ N ) ≥ 0,

k = 1, . . . , N,

l = 1, . . . , r,

t ∈ [0, τN ].

(11) (12) (13)

PT

The new objective function is

CE

˜ N , ∆τ N ) = Φ0 (τN , x(τN |σ N , ∆τ N )) J(σ N Z τk X L0 (t, x(t|σ N , ∆τ N ), σ N,k )dt. + k=1

(14)

τk−1

AC

We may now state the approximate optimal control problem with contin-

uous state inequality constraints as follows: Problem (PN ). Given system (9)-(10), find a time duration vector ∆τ N

and a control parameter vector σ N such that the objective function (14) is minimized subject to constraints (11)-(13). 10

ACCEPTED MANUSCRIPT

Note that unless the dynamic system (9)-(10) is linear and the objective function (14) is linear or quadratic, they are non-convex. Thus, Problem (PN )

CR IP T

is non-convex. For example, we consider the optimal harvesting problem in [27]. Let us just discuss the situation when the partition number N = 1. The ˜ 1 , ∆τ1 )) three-dimensional drawing with respect to each pairs of (σ 1 , ∆τ1 , J(σ are shown in Figure 1, where the pairs of (σ 1 , ∆τ1 ) corresponding to the white

blocks are not feasible solutions. Thus, the feasible domain of Problem (PN )

AN US

is non-convex. In the following section, we will adopt an optimization strat-

egy developed based on a particle swarm optimization algorithm to solve Problem (PN ).

−5

M

−10 −15 −20

ED

The value of the cost function J

0

−25 1

Figure

1:

PT

u(t) 2

The

3

three-dimensional

4

6

drawing

t

2

with

respect

0

to

each

pairs

of

˜ 1 , ∆τ1 )) (σ , ∆τ1 , J(σ

AC

CE

1

4. Problem solving 4.1. A fully-informed particle swarm optimization Particle swarm optimization (PSO) is inspired by the social and cooperative behaviour of flocks of birds searching for food. A potential solution 11

ACCEPTED MANUSCRIPT

is called particle. A population of potential solutions is called swarm. The swarm is initialised with randomly generated particles, and the search for an

CR IP T

optimal result is carried out by updating the particles’ velocities and positions at each iteration. Recently, more and more efforts are devoted to the

design of methods for updating velocities and positions so as to avoid getting trapped in a local optimum or losing important information. In [21],

a fully informed particle swarm optimization (FIPSO) is proposed. In a

FIPSO are updated as follows. → − − → → − − v j+1 = χ(→ v ji + ϕ( P m − X ji )), i − → − → − = X ji + → v j+1 , X j+1 i i

AN US

D-dimensional search space, the velocities and positions of the particles in

i = 1, . . . , M,

j = 1, . . . , M axiter (15) (16)

i

M

where M is the population size; M axiter is the maximum iteration number; − → → − v j ∈ RD and X j ∈ RD are the velocity and position of the ith particle in the i

ED

population at the jth iteration, respectively; χ = 0.7298 is the constriction → − coefficient for the velocity; ϕ = 4.1 is the acceleration coefficient; P m refers

AC

CE

PT

to the comprehensive information learned from the ith particle’s neighbours. → − In this deterministic model, P m is calculated as → − Pm =

P

N→ − − W(k)→ ϕk Pk P , → − k∈N W(k) ϕ k → − → − ϕ k = U 0, ϕmax /|N | , k∈N

(17) (18)

→ − where N is the best among the neighbors of the particle; P k is the best position found by individual k, k ∈ N ; W(k) is the fitness difference between

the best position of the particle and the current individual, or it is a given

12

ACCEPTED MANUSCRIPT

constant. Furthermore, the velocity and position must satisfy i = 1, . . . , M,

j = 1, . . . , M axiter

(19)

i = 1, . . . , M,

j = 1, . . . , M axiter

(20)

CR IP T

− v min ≤ → v ji ≤ v max , − → X min ≤ X ji ≤ X max ,

where v min and v max are, respectively, the lower and upper bounds of the

velocity; X min and X max are, respectively, the lower and upper bounds of the position.

AN US

In the fully informed neighborhood, all neighbors are considered as sources of influence. Thus, the size and the topology of neighbors determine how diverse the influence will be. Experimental results in [21] show that FIPSO with the square topology, where the reference to the particle’s own index is removed from the neighborhood, converges faster than FIPSO with other

M

topologies. Furthermore, it has better chance of finding the best optimum.

ED

Thus, we will use FIPSO to solve the approximate optimal control problem. 4.2. A computational method based on FIPSO

PT

It is known that PSO can only solve problems with bound constraints, such as constraints (11) and (12). For optimization problems with complex

CE

constraints, such as continuous state inequality constraints, they cannot be

AC

solved directly by PSO. Thus, we introduce the following penalty function

13

ACCEPTED MANUSCRIPT

[26] to handle these complex constraints.

+

r X l=1

ρl

Z

τN

N Z X k=1

τk−1

L0 (t, x(t|σ N , ∆τ N ), σ N,k )dt

min{Φl (t, x(t|σ N , ∆τ N )), 0}2 dt

0

τk

CR IP T

J˜ρ (τN , σ N , ∆τ N ) = Φ0 (τN , x(τN |σ N , ∆τ N )) +

+ ρr+1 min{τN − Tmax , 0}2 + min{Tmin − τN , 0}2

(21)

AN US

where ρl , l = 1, . . . , r + 1, are given large penalty factors. The last three terms in (21) represent the constraint violations. Problem (PN ) is now approximated as:

Problem (PN,ρ ). Given system (9)-(10), find a time duration vector ∆τ N and a control vector σ N such that the objective function (21) is minimized

M

subject to bound constraint (12).

ED

Clearly, Problem (PN ) is a more accurate approximation of Problem (P ) when the partition number N increases. By using arguments similar to those given in [23], it can be shown that the approximate optimal cost will converge

PT

to the true optimal cost as N → ∞. However, in practice, we cannot let N = ∞. Furthermore, when N increases the decision parameters of the

CE

approximated problem will grow by m times. From our extensive numerical experiments, it suggests that the improvement in the cost value for N larger

AC

than N = 20 appears to be highly insignificant. Thus, we start with a small N , then Problem (PN,ρ ) is solved by FIPSO to obtain the optimal control.

We then increase the value of N and re-calculate the optimal control of the corresponding Problem (PN,ρ ). We repeat this process until the reduction in

the cost value is negligible. Details are listed as follows. 14

ACCEPTED MANUSCRIPT

Step 1: Initialize N . Set the swarm size M , the neighbour particles for each particle, and the maximum iteration or terminal conditions of FIPSO.

CR IP T

Step 2: Initialize the velocity and position of each particle in the swarm and its neighbours. Let the iteration p = 1. Denote the swarm by Swarmp .

Step 3: For each particle in the swarm, execute the following step: Solve the state differential equation (9)-(10) with the initial condition forward

AN US

in time from t = 0 to t = τN . Let the solution obtained be denoted as x(·|σ N , ∆τ N ). Substitute x(·|σ N , ∆τ N ) into the objective function (21) to obtain the cost value for each particle.

Step 4: Update each particle’s personal best position and the global best of the swarm. If the terminal conditions of FIPSO are satisfied, then go to

M

Step 6, else let p := p + 1 and go to Step 5.

ED

Step 5: Update the velocities and positions of the swarm according to (15)-(16) to obtain a new swarm Swarmp+1 . Go to Step 3.

PT

Step 6: Obtain the best control parameter vector σ N ∗ , time duration vector ∆τ N ∗ and the minimum cost value J˜ρ (τ N,∗ , σ N,∗ , ∆τ N,∗ ), and the best

CE

swarm P ∗M . If J˜ρ (τ N,∗ , σ N,∗ , ∆τ N,∗ ) − J˜ρ (τ N +1,∗ , σ N +1,∗ , ∆τ N +1,∗ ) ≤ ,

where = 10−6 is a required precision, then stop. Else let N := N + s,

AC

where s is some positive integer. Go to Step 2.

15

ACCEPTED MANUSCRIPT

5. Numerical results 5.1. Example 1

CR IP T

The first example involves optimal fishery harvesting, where the growth

of the population has limited resources (food, water, space). Suppose the growth of the population is limited by a carrying capacity K. Let x be the

current fish population and r be the intrinsic growth rate. The dynamics

of the population from generation to generation is governed by a logistic x(t) ]. K

Moreover, considering that the

AN US

model in the form of x(t) ˙ = rx(t)[1 −

juvenile fish cannot be harvested, and they needs time to reach maturity for reproduction, the population growth model should contain time delay from their birth to maturity. Hence, the following delayed logistic model with

M

harvesting for fish growth is considered [27]:

x(t) ˙ = 3x(t)[1 − 0.2x(t − 0.5)] − u(t), t ≤ 0,

(22) (23)

ED

x(t) = 2,

t ∈ [0, T ]

where u(t) is the harvesting effort, r = 3, K = 5.

PT

Clearly, the fish population increases, initially. However, when it gets closer to the carrying capacity, it becomes saturated forcing the rate of growth

CE

to decrease. To maximize sustainable yield and to achieve the commercial purpose of the fishery, it is necessary to harvest the population at appropriate

AC

time points with suitable harvest effort in a long run. Hence, the following control objective function are proposed : Z T Φ0 (T, u(t)) = 0.1T 2 + e−βt (CE x(t)−1 u(t)3 − pu(t))dt 0

16

(24)

ACCEPTED MANUSCRIPT

where β = 0.05 is the discount rate of the economic benefit influenced by the market; CE = 0.2 is the harvesting cost, p = 2 is the fish price. Over all, the

CR IP T

first term of equation (24) is used to shorten the total investment time. The second term is expenditure minus income. To minimize the second term is to maximum the economic benefit. In addition, to avoid overfishing and avoid exceeding the carrying capacity limitation, the following constraints must be satisfied: t ∈ [0, T ]

AN US

2 ≤x(t) ≤ 4,

(25)

0 ≤u(t) ≤ 3.5

(26)

0.1 ≤ T ≤ 100

(27)

The optimal fishery harvesting problem may now be described as follows.

M

Given the time delayed system (22)-(23), find a terminal time T and a harvesting strategy u such that the objective functional (24) is minimized,

ED

subject to constraints (25)-(27). By the approximation method described in Section 3, the free terminal

PT

time optimal control problem is approximated by the following optimization problem. Given the dynamic system

CE

N X

˙ x(t) =

AC

k=1

x(t) = 2,

{3x(t)[1 − 0.2x(t − 0.5)] − σ k }χIk ,

t ∈ Ik ,

k = 1, . . . , N (28)

t ≤ 0,

(29)

P Pk N where Ik = [ k−1 and a control i=1 ∆τi , i=1 ∆τi ), find a positive vector ∆τ 17

ACCEPTED MANUSCRIPT

parameter vector σ N such that the cost functional N

N Z X k=1

τk

τk−1

e−0.05t [0.2x(t)−1 (σ k )3 − 2σ k ]dt

CR IP T

J˜ρ (τN , σ , ∆τ ) = N

N N nX o2 X + 0.1 ∆τk + ρ1 (min{ ∆τk − 0.1, 0})2 k=1

k=1

+ ρ1 (min{100 − τN

0

k=1

∆τk , 0})2

[(min{x(t) − 2, 0})2 + (min{4 − x(t), 0})2 ]dt

AN US

+ ρ2

Z

N X

(30)

is minimized subject to the bound constraints (11) and (12), where ρ1 and ρ2 are penalty factors.

M

We solve this problem by writing a Matlab program on a personal computer with the following configuration: Intel Core i5-4590 3.3GHz CPU, 8G

ED

RAM, 64-bit Window 7 operating system. In order to test the performance of the proposed method, we choose N = 9 as given in [27]. The penalty factors are ρ1 = 1000 and ρ2 = 105 . The performances of the FIPSO are evaluated

PT

by comparing with that of the comprehensive learning particle swarm optimizer (CLPSO) proposed in [28], genetic algorithm (GA) given in [29]. For

CE

all three optimization algorithms, the swarm size or population size are 30, and the maximum iteration is 1000. According to reference [21], for FIPSO,

AC

the square topology is used for choosing four neighbours, and W(k) is the fitness difference between the best position of the particle and the current individual. For GA, the selection rate is 0.9, cross rate is 0.8, and mutation rate is 0.1. The other parameters of CLPSO are as defined in [28]. The optimal results are summarized in Table 1 and Figure 2-Figure 4. 18

ACCEPTED MANUSCRIPT

Table 1: Comparisons results of Example 1

FIPSO

CLPSO

GA

reference [14]

optimal cost value

-26.032

-26.025

-25.620

-25.97

optimal terminal time

12.1221

12.1487

12.1448

12.1565

CR IP T

optimization algorithm

AN US

Table 1 shows that FIPSO solves the optimal control problem with a shorter

terminal time (12.12 years), and a smaller cost value. Compared with [27], the harvest time can be shorted by 10 days. Figure 2 demonstrates that the convergence speed of FIPSO is faster than that of CLPSO and GA optimization algorithm. And the optimal state and control of FIPSO have a tendency

M

of becoming stable. Hence, FIPSO is superior to the other two optimization

are satisfied. 10

x 10

4

FIPSO

cost function value

CE AC

CLPSO

GA

−24.5

PT

8

ED

methods in solving this optimal control problem. In addition, all constraints

6

−25

4

−25.5 −26 490

2

495

500

0 100

200

300

400 500 600 iteration number

700

800

900

1000

Figure 2: The convergence trajectories of FIPSO, GA, and CLPSO.

19

ACCEPTED MANUSCRIPT

CR IP T

4

x(t)

3.5 3

CLPSO FIPSO GA

2.5

0

2

4

6

8

t

10

12

AN US

2

M

Figure 3: The optimal state trajectories obtained by FIPSO, GA, and CLPSO, respectively

ED

3.5

PT

u(t)

3

2.5

FIPSO CLPSO GA

1.5

0

2

4

6

AC

CE

2

t

8

10

12

Figure 4: The optimal piecewise-constant controls obtained by FIPSO, GA, and CLPSO, respectively

20

ACCEPTED MANUSCRIPT

Table 2: Comparisons results of Example 2

CLPSO

GA

reference [1]

CR IP T

optimization algorithm FIPSO optimal cost value

5.287

5.425

8.875

5.37

optimal terminal time

1.349

1.1818

1.005

1.446

AN US

5.2. Example 2

The second example is the optimization problem given in [1]

min J = subject to the dynamics

T

0

(x2 (t) + u2 (t))dt + 0.1T

M

u,T

Z

ED

x(t) ˙ = x(t) + 0.8x(t − 0.15) + 0.9u(t),

t ∈ [0, T ]

(31)

(32)

t ≤ 0,

(33)

−10 ≤ u(t) ≤ 10,

(34)

PT

x(t) = 1,

We solve this problem with control partition numbers N = 6, using

CE

FIPSO, CLPSO, GA algorithms with the same parameter settings for solving example 1. Comparison results are given in Table 2. We can also see that

AC

FIPSO is more effectiveness than the other optimization algorithms. From the results given in Table 2, we can see that the minimum cost has been reduced by 1.6% and the minimum time has been reduced by 7% compared with that of [1].

21

ACCEPTED MANUSCRIPT

6. Conclusion In this paper, a class of free terminal time optimal control problem in-

CR IP T

volving time delayed system is investigated. To solve this problem, we apply the control parameterization method to yield an approximate optimization

problem. Then a simple effective optimization method is developed, based on a fully informed particle swarm optimization method, to solve the approx-

imate problem. Numerical results show that the proposed method is more

AN US

effective than other existing solving methods with shorter terminal time and better cost function values. Acknowledgment

M

The authors thank the anonymous reviewers and Mr. Jianhua Liu for their valuable comments and suggestions to improve this paper. This work

ED

was supported by the Natural Science Foundation of Fujian Province, China (Grant No. 2016J05154), the National Natural Science Foundation of China

PT

(Grant No. 61673116, 61503131).

CE

References

[1] A. Boccia, P. Falugi, H. Maurer, R. B. Vinter. Free time optimal control problems with time delays. IEEE Conference on Decision and Control,

AC

2013:520-525.

[2] M. Liu, C. Bai. Optimal harvesting of a stochastic logistic model with time delay. Journal of Nonlinear Science, 2015, 25(2):277-289.

22

ACCEPTED MANUSCRIPT

[3] C. Liu, Z. Gong. Modelling and optimal control of a time-delayed switched system in fed-batch process. Journal of the Franklin Institute,

CR IP T

2014, 351(2): 840-856. [4] I. Elmouki. Free terminal time optimal control problem for the treatment of HIV infection. Jawra Journal of the American Water Resources Association, 2016, 7(1):148-161.

AN US

[5] J. J. Ewoud, Q. Chu, G. C. D. Croon. Adaptive incremental nonlinear

dynamic inversion for attitude control of micro air vehicles.Journal of Guidance Control and Dynamics, 2016, 39(3): 450-461. [6] L. S. Pontryagin, V. G. Boltyanskii, R. V. Gamkrelidze, E. F.

science, 1962.

M

Mishchenko, The mathematical theory of time optimal processes, Inter-

ED

[7] R. K. Biswas, S. Sen. Free final time fractional optimal control problems. Journal of the Franklin Institute, 2014, 351(2):941-951.

PT

[8] S. Pooseh, R. Almeida, D. F. M. Torres. Fractional Order Optimal Control Problems with Free Terminal Time. Journal of Industrial and Man-

CE

agement Optimization, 2014, 10(2): 363-381. [9] J. Yuan, C. Liu, X. Zhang, J. Xie, E. Feng, H. Yin, Z. Xiu. Optimal

AC

control of a batch fermentation process with nonlinear time-delay and free terminal time and cost sensitivity constraint. Journal of Process Control, 2016, 44: 41-52.

[10] R. Li, Y. Shi, Finite-time optimal consensus control for second-order 23

ACCEPTED MANUSCRIPT

multi-agent systems. Journal of Industrial & Management Optimization, 2014, 10(3):929-943.

CR IP T

[11] C. Jiang, Q. Lin, C. Yu, K. L. Teo, G. R. Duan. An exact penalty method

for free terminal time optimal control problem with continuous inequality constraints. Journal of Optimization Theory and Applications, 2012, 154(1):30-53.

AN US

[12] Z. Gong, C. Liu, Y. Wang. Optimal control of switched systems with multiple time-delays and a cost on changing control. Journal of Industrial and Management Optimization, 2017, 13, doi:10.3934/jimo.2017042.

[13] J. M. Jeong, S. J. Son, Time optimal control of semilinear control sys-

M

tems involving time delays. Journal of Optimization Theory and Appli-

ED

cations, 2015, 165(3):793-811.

[14] C. V. Peroni, N. S. Kaisare, J. H. Lee. Optimal control of a fed-batch bioreactor using simulation-based approximate dynamic programming.

CE

790.

PT

IEEE Transactions on Control Systems Technology, 2005, 13(5): 786-

[15] T. J. J. V. D. Boom, J. B. Klaassens, R. Meiland. Real-time time-

AC

optimal control for a nonlinear container crane using a neural network. 2007:79-84.

[16] R. Eberhat, J. Kennedy, A new optimizer using particle swarm theory. Proceedings of 6th International Symposium On Mico Machine and Human Science, 1995: 39-43. 24

ACCEPTED MANUSCRIPT

[17] C. Ou, W. Lin. Comparison between PSO and GA for parameters optimization of PID controller. IEEE International Conference on Mecha-

CR IP T

tronics and Automation. 2006: 2471-2475. [18] Q. Liu, W. Wei, H. Yuan, Z. H. Zhan, Y. Li. Topology selection for particle swarm optimization. Information Sciences,2016, 363: 154-173.

[19] A. B. Hashemi, M. R. Meybodi. A note on the learning automata based

puting, 2011, 11(1):689-705.

AN US

algorithms for adaptive parameter selection in PSO. Applied Soft Com-

[20] H. Garg. A hybrid PSO-GA algorithm for constrained optimization problems. Applied Mathematics & Computation, 2016, 274(11):292-305.

M

[21] R. Mendes, J. Kennedy, O. Neves, The fully informed particle swarm: simpler, maybe better. IEEE Transactions on evolutionary computation,

ED

2004, 8(3):204-210

[22] N. U. Ahmed, Dynamic systems and control with applications, Singa-

PT

pore: World Scientific, 2006. [23] K. L. Teo, C. J. Goh, K. H. Wong, A unified computational approach

CE

to optimal control problems, Essex: Longman Scientific and Technical,

AC

1991.

[24] Y. Lei, S. Li, Q. Zhang, X. Zhang, Free final time optimal control based on non-uniform control vector parameterization. Journal of Systems Science and Mathematical Sciences, 2012, 32(3): 277-287.

25

ACCEPTED MANUSCRIPT

[25] Q. Lin , R. Loxton, K. L. Teo, The control parameterization method for nonlinear optimal control: A survey. Journal of Industrial & Manage-

CR IP T

ment Optimization, 2017, 10(1):275-309. [26] C. J. Yu, K. L. Teo, L. S. Zhang, Y. Q. Bai, On a refinement of the con-

vergence analysis for the new exact penalty function method for continuous inequality constrained optimization problem. Journal of Industrial

AN US

Management and Opti- mization, 2012, 8(2):485-491.

[27] C. Liu, R. Loxton, K. L. Teo. A computational method for solving timedelay optimal control problems with free terminal time. Systems & Control Letters, 2014, 72: 53-60.

[28] J. J. Liang, A. K. Qin, P. N. Suganthan, S. Baskar, Comprehensive learn-

M

ing particle swarm optimizer for global optimization of multimodal func-

281-295.

ED

tions. IEEE Transactions on Evolutionary Computation, 2006, 10(3):

PT

[29] Goldberg D E. Genetic algorithm in search, optimization, and machine

AC

CE

learning. 1989, xiii(7): 2104C2116.

26

A computational method for free terminal time optimal control problem governed by nonlinear time delayed systems

A computational method for free terminal time optimal control problem governed by nonlinear time delayed systems

Recommend Documents