An obstacle avoidance model predictive control scheme for mobile robots subject to nonholonomic constraints: A sum-of-squares approach

Available online at www.sciencedirect.com Journal of the Franklin Institute ] (]]]]) ]]]–]]] www.elsevier.com/locate/jfranklin An obstacle avoidance...

Download PDF

843KB Sizes 1 Downloads 66 Views

Report

PDF Reader
Full Text

Available online at www.sciencedirect.com

Journal of the Franklin Institute ] (]]]]) ]]]–]]] www.elsevier.com/locate/jfranklin

An obstacle avoidance model predictive control scheme for mobile robots subject to nonholonomic constraints: A sum-of-squares approach G. Franzèn, W. Lucia DIMES – Università della Calabria Rende (CS), 87036, Italy Received 14 October 2014; received in revised form 29 January 2015; accepted 12 March 2015

Abstract The paper addresses the obstacle avoidance motion planning problem for ground vehicles operating in uncertain environments. By resorting to set-theoretic ideas and sum of squares (SOS) decomposition techniques, a receding horizon control algorithm is proposed for robots modeled by polynomial systems subject to input, state and nonholonomic constraints. Sequences of inner ellipsoidal approximations of the exact one-step controllable sets are computed for all admissible obstacle scenarios and then on-line exploited to determine the more adequate control action to be applied in a receding horizon fashion. The results here proposed are a signiﬁcant generalization of existing algorithms which are tailored only for linear time invariant plant descriptions. The resulting framework guarantees uniformly ultimate boundedness and constraints fulﬁlment regardless of any obstacle scenario occurrence. & 2015 The Franklin Institute. Published by Elsevier Ltd. All rights reserved.

1. Introduction Motion planning and autonomous vehicle control problems in uncertain environments are of paramount importance in several human applications where guaranteed real time feasibility, safety against uncertainties and disturbances are required. Despite extensive research, this problem still represents a real challenge that becomes more complex by taking into account the limited perception abilities and computational power resources of the robot, as well as restrictions on the vehicle mobility n

Corresponding author. E-mail addresses: [email protected] (G. Franzè), [email protected] (W. Lucia).

http://dx.doi.org/10.1016/j.jfranklin.2015.03.021 0016-0032/& 2015 The Franklin Institute. Published by Elsevier Ltd. All rights reserved.

Please cite this article as: G. Franzè, W. Lucia, An obstacle avoidance model predictive control scheme for mobile robots subject to nonholonomic constraints: A sum-of-squares approach, Journal of the Franklin Institute. (2015), http://dx.doi.org/10.1016/j.jfranklin.2015.03.021

2

G. Franzè, W. Lucia / Journal of the Franklin Institute ] (]]]]) ]]]–]]]

and underactuation phenomena. Collision avoidance is a key component of safe navigation whose a typical objective is to reach a target through the obstacle-free part of the environment [1]. The existing algorithms can be generally classiﬁed into global and local path planners [2]. Global planning methods rely on a priori environment information and guarantee feasible or even optimal paths to the goal, but require huge computational resources. Local obstacle avoidance methods (also referred to as reactive methods) are mainly based on sensory data collected during operations that make them computationally efﬁcient and hence suitable for real-time applications. However, most of the algorithms proposed in the literature only address the path planning aspect by leaving out the control phase, see [3,2] and references therein. This represents a serious drawback when the vehicle is subject to dynamical constraints because any tracking capability is de facto limited. Along these lines are also the recent contributions [4] and [5]. The ﬁrst proposes a framework for path planning based on the use of homogeneous forms and linear matrix inequalities (LMIs), while in the second paper the attention has been focused on the formal deﬁnition of a navigation algorithm within dynamic cluttered environments. Moreover, a relevant aspect to be taken into account is that wheeled vehicles are typical examples of nonholonomic mechanical systems see e.g. [6,7]. These robots share strong controllability properties, but the nonholonomic kinematic constraints which characterize their motion render the associated control design problem quite challenging. In [8], this has been discussed in depth where the non-existence of pure-state feedbacks for the asymptotic stabilization of ﬁxed conﬁgurations has been proved. This difﬁculty has had the effect of focusing the research on the feedback control of nonholonomic systems on two distinct subproblems, namely (i) ﬁxed point asymptotic stabilization relying on highly nonlinear techniques, and (ii) asymptotic stabilization of feasible trajectories based on more classical linear and nonlinear techniques, see [9,10]. Of interest here are constrained Receding Horizon Control strategies which are an extremely appealing methodology for the obstacle avoidance motion planning problem due to their intrinsic capability to generate at each time instant feasible trajectories. Noticeable recent contributions are from [11,12]. In [11], the authors address the set-point regulation problem of a nonholonomic wheeled mobile robot with obstacle avoidance in a known dynamic environment populated with static and moving obstacles subject to robot kinematic and dynamic constraints by using the model predictive control (MPC) in polar coordinates. In [12], the proposed scheme is based on the combination of a model-based optimization scheme and a convergence-oriented potential ﬁeld method. This is done by recasting the two approaches in an MPC and control Lyapunov function (CLF) framework so that a tractable version of the dynamic window approach (DWA) results. More recently, predictive and adaptive active steering control schemes for nonlinear model descriptions have been presented, see [13–16]. The ﬁrst contribution essentially proposes an algorithm whose aim is to impose that the autonomous vehicle follows safe pre-deﬁned reference trajectories which are assumed to be collision-free and achievable by the vehicle. In [14] a standard MPC strategy has been developed for dealing with a tracking problem while avoiding collision with cluttered obstacle scenarios. In [15] a tracking and stabilization adaptive control of damped mobile robots with unknown parameters and subject to input torque saturations has been proposed in order to compensate exogenous bounded disturbances. In [16] a two-mode MPC scheme which ensures feasibility retention despite time-varying state constraints is presented: the standard operative mode is based on the classical RHC philosophy, while the safe mode, activated under time-varying constraints scenarios, allows to conﬁne the state trajectory within a robustly positively invariant set. An important aspect leaves out concerns with the deﬁnition of a procedure capable to determine new feasible paths when the standard mode cannot be recovered. Please cite this article as: G. Franzè, W. Lucia, An obstacle avoidance model predictive control scheme for mobile robots subject to nonholonomic constraints: A sum-of-squares approach, Journal of the Franklin Institute. (2015), http://dx.doi.org/10.1016/j.jfranklin.2015.03.021

G. Franzè, W. Lucia / Journal of the Franklin Institute ] (]]]]) ]]]–]]]

3

Moving from these considerations and thanking to the recent advances in semideﬁnite programming along with the use of semi-algebraic methods for developing efﬁcient control strategies of polynomial systems [17], in this paper the basic SOS MPC strategy proposed in [18] is extended to deal with the solution of the obstacle avoidance problem under the presence of nonholonomic constraints. It is assumed that the set of all possible obstacle locations on the planar environment are known, but it is unpredictable which is the obstacle conﬁguration within the working environment at each time instant. The latter gives rise to a certain level of uncertainty that if not properly treated can lead to collisions during the vehicle navigation. To deal with this problem the idea here proposed is to develop a set-theoretic based receding horizon approach so that the prescribed constraints are always fulﬁlled despite any obstacle scenario occurrence. The main ingredients of the proposed dual-mode predictive strategy can be summarized as follows:

a stabilizing state feedback law and a robust positively invariant ellipsoidal set centered at the goal location are ﬁrst computed; according to the given obstacle scenario conﬁgurations, a procedure to off-line compute families of one-step ahead controllable sets in order to enlarge the domain of attraction is derived; at each sample time and given the current obstacle scenario, an on-line receding horizon strategy is obtained by deriving the smallest ellipsoidal set complying with the obstacle conﬁguration. The control move is computed by minimizing a performance index such that the one-step ahead state prediction belongs to its successor.

A relevant feature of this scheme relies on its capability to move off-line most of computations and to ensure that at each time instant a feasible solution there always exists despite any admissible occurrence of the obstacle conﬁgurations. This leads to a substantial improvement in the computational efﬁciency and represents a key aspect when real-time applications are taken into consideration. Finally, theoretical results are illustrated by means of simulations on a dynamical nonholonomic mobile robot model whose the navigation within a planar environment is limited by unpredictable occurrences of two obstacle scenarios. Notations and preliminaries:

With R½x we denote the ring of multivariate scalar polynomials p A R½x in the unknown xA Rn : With Σ½x R½x ( ) q X q 2 Σ½x≔ s A R½x( qo1; ( pi i ¼ 1 ; pi A R½x; s:t: s ¼ pi i¼1

we denote the proper and closed subset of multivariate sum-of-squares (SOS) polynomials sA Σ½x in the unknown x A Rn . The polynomial pðxÞ, having degree 2d, belongs to Σ½x iff there exists a symmetric matrix Q ¼ QT Z 0 such that T pðxÞ ¼ zT ðxÞQzðxÞ; zðxÞ ¼ 1; x1 ; x2 ; …; xn ; x1 x2 ; …; xdn ; ð1Þ with z(x) containing all monomials in the variables x1 ; …; xn of degree lower than or equal to d. The matrix Q is known as the Gram matrix and Eq. (1) as the Square Matricial Representation (SMR) of p(x), see [19–21].

Please cite this article as: G. Franzè, W. Lucia, An obstacle avoidance model predictive control scheme for mobile robots subject to nonholonomic constraints: A sum-of-squares approach, Journal of the Franklin Institute. (2015), http://dx.doi.org/10.1016/j.jfranklin.2015.03.021

G. Franzè, W. Lucia / Journal of the Franklin Institute ] (]]]]) ]]]–]]]

4

n m Given a set SDX YDR R , the projection of the set S onto X is deﬁned as ProjX fSg≔ xA Xj (y A Y s:t: ðx; yÞA S .

Let us consider nonlinear discrete-time dynamical systems xðt þ 1Þ ¼ f ðxðtÞ; uðtÞÞ þ Gw wðtÞ

ð2Þ

and wðtÞ A W R , with W≔fw A where t A Zþ ≔f0; 1; …g, xðtÞA R , uðtÞA R , Gw A R Rw jcw ðwÞr 1g a bounded exogenous disturbance including unmodeled dynamics and cw ðwÞA R½w a convex and compact polynomial function. Moreover, the following set-membership state and input constraints are prescribed: n

m

nm

w

xðtÞA X ; 8 t Z 0;

X ≔fx A Rn jcx ðxÞr 1g;

ð3Þ

uðtÞA U; 8 t Z 0;

U≔fuA Rm jcu ðuÞr 1g;

ð4Þ

where cx(x) and cu(u) are convex and compact polynomial scalar functions belonging to R½x and R½u, respectively. Deﬁnition 1. A set T DRn is robustly positively invariant (RPI) for Eq. (2) if there exists a control law uðtÞ A U such that once the closed-loop solution xCL(t) enters inside that set at any given time t0, it remains in it for all future instants, i.e. xCL ðt 0 ÞA T -xCL ðtÞA T ; 8 wðtÞA W; 8t Z t 0 . Deﬁnition 2. Given two sets A and B, the subtraction A B≔faA A : a þ bA A; 8 b A Bg is said P-difference and the P-difference operator [22].

ð5Þ □

Given the system (2), it is possible to determine the sets of states k-step controllable to Ξ via the following recursion (see [23]): Ξ 0 ≔Ξ Ξ k ≔fxA X : (u A U : f ðx; uÞ þ Gw w A Ξ k 1 ; 8 w A Wg ¼ fx A X : ( uA U : f ðx; uÞ A Ξ k 1 Gw Wg ¼ fx A X : ( uA U : f ðx; uÞ A Ξ~ k 1 g

ð6Þ

where Ξ0 is known as the terminal region, Ξk is the set of states that, irrespective of any admissible disturbance occurrence w A W, can be steered into Ξk-1 using a single control move and Ξ~ k 1 ≔Ξ k 1 Gw W. Furthermore, sets Ξk and Ξ~ k can be characterized in terms of appropriate polynomial functions Ψ k ðxÞ A R½x and Ψ~ k ðxÞA R½x as Ξ k ≔ x A Rn : Ψ k ðxÞr 1 ; Ξ~ k ≔ xA Rn : Ψ~ k ðxÞr 1 An equivalent description of Ξi can be given in terms of appropriate polynomial level surface functions V k ðx; uÞA R½x; u (see [18]): V k ðx; uÞ40; 8 xA Rn of0g; u A Rm {f0g and V k ð0; 0Þ ¼ 0 x A Rn ; u A Rm jV k ðx; uÞr 1 \ fx A Rn ; uA Rm jcu ðuÞr 1g \ fxA Rn ; uA Rm jcx ðxÞr 1gD xA Rn ; u A Rm jΨ~ k 1 ðf ðx; uÞÞ r 1

ð7Þ

ð8Þ

Please cite this article as: G. Franzè, W. Lucia, An obstacle avoidance model predictive control scheme for mobile robots subject to nonholonomic constraints: A sum-of-squares approach, Journal of the Franklin Institute. (2015), http://dx.doi.org/10.1016/j.jfranklin.2015.03.021

G. Franzè, W. Lucia / Journal of the Franklin Institute ] (]]]]) ]]]–]]]

Ξ k ¼ Projx fx A X DRn ; u A UDRm j V k ðx; uÞ r 1g

5

ð9Þ

Note that Eqs. (7)–(8) are rephrasing the recursion (6) in the extended space ðx; uÞ. Proposition 1. Conditions (7)–(8) are satisﬁed if there exist V k ðx; uÞA Σ nþm ; si A Σ nþm ; i ¼ 1; …; 5; such that V k ðx; uÞ l1 ðx; uÞ A Σ nþm ð1 V k ðx; uÞÞs1 þ ð1 cu ðuÞÞs2 þ ð1 cx ðxÞÞs3 þð1 V k ðx; uÞÞð1 cu ðuÞÞs4 þð1 V k ðx; uÞÞð1 cx ðxÞÞs5 þ ðΨ~ k 1 ðf ðx; uÞÞ 1Þ A Σ nþm

ð10Þ

ð11Þ

where l1 A R½x; u is a suitable chosen positive deﬁnite polynomial. Proof. By resorting to Positivstellensatz (P-satz) and S-procedure arguments, and by following the lines indicated in [24]. □ Deﬁnition 3. Let S be a neighborhood of the origin. The closed-loop trajectory of Eq. (2) is said to be Uniformly Ultimate Bounded in S if for all μ40 there exist TðμÞ40 and uðtÞA U such that, for every J xð0ÞJ r μ, xCL ðtÞA S for all t Z TðμÞ. Deﬁnition 4. Given a set S Rn and a point x A Rn , the distance between them is deﬁned as distðx; SÞ ¼ inf Jx s J n ; sAS

where n is any relevant norm. Deﬁnition 5. Given two sets S; R Rn the distance between them is deﬁned as distðS; RÞ ¼ inff Js r J n : s A S; r A Rg Deﬁnition 6. An oriented graph is an ordered pair G ¼ ðV; EÞ such that

V is the vertex set; E is a subset of ordered pairs of V known as the edge set, i.e. E ffu; vgju; v A Vg

Deﬁnition 7. Let G ¼ ðV; EÞ an oriented graph, the reachable vertex set Vr is deﬁned as Vr ≔fv A Vj ( u A V : fu; vgA Eg Deﬁnition 8. Let f(z) and g(z) be two differentiable vector ﬁelds. The Lie bracket of f(z) and g(z) is another vector ﬁeld deﬁned as ½f ; gðzÞ≔

∂g ∂f ðzÞf ðzÞ ðzÞgðzÞ ∂z ∂z

ð12Þ

2. Nonholonomic mobile robots The aim of this section is twofold: to provide detailed model descriptions of autonomous ground vehicles and to show that such nonlinear models can be recast as polynomial systems. To this end, two well-known model descriptions of the unicycle car-like mobile robot depicted in Fig. 1 are ﬁrst presented and then a polynomial recasting procedure is outlined. Please cite this article as: G. Franzè, W. Lucia, An obstacle avoidance model predictive control scheme for mobile robots subject to nonholonomic constraints: A sum-of-squares approach, Journal of the Franklin Institute. (2015), http://dx.doi.org/10.1016/j.jfranklin.2015.03.021

6

G. Franzè, W. Lucia / Journal of the Franklin Institute ] (]]]]) ]]]–]]]

Y v

y

x

O

X

Fig. 1. Nonholonomic mobile robot.

2.1. Kinematic model A differential drive robot is a typical nonholonomic wheeled vehicle, which has two rear drive wheels and a front castor for body support. It is assumed that the motion of mobile robot cannot slip laterally so that the translational velocity is in the direction of heading, i.e. a pure rolling contact between the wheels and the ground. The velocity of the two rear wheels ðvl and vr Þ is used to impose the translation ðv ¼ ðvl þ vr Þ=2Þ and angular ðω ¼ ðvr vl Þ=BÞ speeds of the robot (B is the wheelbase). The robot pose is described by its position ðx; yÞ, the midpoint of the rear axis of the robot, and its orientation (θ). Then, the kinematics equation is 2 3 3 2 3 2 x_ ðtÞ cos ðθðtÞÞ 0 6 7 7 6 y_ ðtÞ 7 6 ð13Þ 4 5 ¼ 4 sin ðθðtÞÞ 5vðtÞ þ 4 0 5ωðtÞ _θðtÞ 0 1 where maximum linear and angular velocities are prescribed: jvðtÞj r V MAX and jωðtÞj r W MAX ; 8t. Notice that the kinematic model (13) is nonintegratable and, as a consequence, kinematics constraints cannot be converted into geometrical requirements [25]. Moreover since the number of control variables is less than the number of state variables, a nonholonomic constraint holds and a continuous time-invariant feedback control law cannot be used [10,26]. On the other hand, since the accessibility rank condition is globally satisﬁed [27]: 02 3 2 331 3 2 3 22 cos ðθðtÞÞ cos ðθðtÞÞ 0 0 B6 7 6 77C 7 6 7 66 ð14Þ Rank@4 sin ðθðtÞÞ 5; 4 0 5; 44 sin ðθðtÞÞ 5; 4 0 55A ¼ 3; 0

1

0

1

the model plant (13) is controllable by means of a nonlinear or time-varying controller. Finally for the sake of completeness, the car-like plant of Fig. 1 can be equivalently described by means of a polar coordination frame. In fact, by deﬁning the polar state components as follows: pﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃﬃ e ¼ x2 þ y2 Please cite this article as: G. Franzè, W. Lucia, An obstacle avoidance model predictive control scheme for mobile robots subject to nonholonomic constraints: A sum-of-squares approach, Journal of the Franklin Institute. (2015), http://dx.doi.org/10.1016/j.jfranklin.2015.03.021

G. Franzè, W. Lucia / Journal of the Franklin Institute ] (]]]]) ]]]–]]]

ϕ ¼ α tan 2ð y; xÞ α ¼ ϕθ we have that Eq. (13) can be converted into the following polar form: 8 cos ðαðtÞÞ > > e_ ðt Þ ¼ vðt Þ > > eðtÞ > > > < sin ðαðtÞÞ ϕ_ ðt Þ ¼ vðt Þ eðtÞ > > > > > sin ðαðtÞÞ > > : α_ ðtÞ ¼ ωðt Þ þ vðt Þ eðtÞ

7

ð15Þ

ð16Þ

2.2. Dynamical model We consider that the motion and orientation are achieved by two independent actuators (e.g. DC motors) providing the necessary torques τ1 and τ2 to the driving wheels. The robot pose in the inertial Cartesian frame fO; X; Yg is completely speciﬁed by the 3-dimensional vector of generalized coordinates qðtÞ≔ðxðtÞ; yðtÞ; θðtÞÞ. Moreover, the autonomous vehicle is subject to an independent velocity constraint: x_ ðtÞ sin ðθðtÞÞ y_ ðtÞ cos ðθðtÞÞ ¼ 0

ð17Þ

This represents a nonholonomic constraint and states that the robot can only move in the direction normal to the axis of the driving wheels: the mobile base satisﬁes the pure rolling and non-slipping conditions, see [7]. Then, the vehicle can be described by the following dynamical model: 8 cos ðθðtÞÞ > > x€ ðt Þ ¼ y_ ðt Þθ_ ðt Þ þ ðτ1 ðt Þ þ τ2 ðt ÞÞ > > mr > > < sin ðθðtÞÞ ðτ1 ðt Þ þ τ2 ðt ÞÞ y€ ðt Þ ¼ x_ ðt Þθ_ ðt Þ þ ð18Þ > mr > > > R > > : θ€ ðt Þ ¼ ðτ1 ðt Þ τ2 ðt ÞÞ Ir where m, I, R are the robot mass, moment of inertia and length, respectively, and r is the wheels radius. Finally, notice that at each time instant the robot pose qðtÞ, as well as its that it is assumed _ T are available for feedback. _ ¼ x_ ðtÞ y_ ðtÞ θðtÞ derivative, qðtÞ 2.3. Non-polynomial to polynomial systems recasting procedure Since our subsequent developments will consider polynomial models, in this section we describe an algorithmic procedure useful to convert a non-polynomial system into its correspondent polynomial description, see [28]. Speciﬁcally, we consider non-polynomial systems whose vector ﬁeld is a combination of sums and products of elementary functions of the following classes: exponential, power and trigonometric. Let us suppose that the nonlinear system is given in the following form: X ξ_ i ¼ λj ∏F ijk ðξÞ; i ¼ 1; …; s ð19Þ j

k

Please cite this article as: G. Franzè, W. Lucia, An obstacle avoidance model predictive control scheme for mobile robots subject to nonholonomic constraints: A sum-of-squares approach, Journal of the Franklin Institute. (2015), http://dx.doi.org/10.1016/j.jfranklin.2015.03.021

8

G. Franzè, W. Lucia / Journal of the Franklin Institute ] (]]]]) ]]]–]]]

where λj A R, ξ ¼ ½ξ1 …ξs T and F ijk ðξÞ are elementary functions or some combinations of them. The recasting procedure is as follows: Recasting procedure (Papachristodoulou and Prajna [28]). 1. Let zi ¼ ξi ; i ¼ 1; …; s. 2. For each F ijk ðξÞ that is not in the form F ijk ðξÞ ¼ ξpl ; p A Zþ ≔f0; 1; …g and 1r l r s, introduce a new variable zr such that zr ¼ F ijk ðξÞ. 3. Compute the differential equation describing the time evolution of zr. 4. Replace all appearances of such F ijk ðξÞ in the system equations by zr. 5. Repeat steps 2–4, until a system of polynomial differential equations is achieved.

It is worth to note that the recasting process introduces additional and ﬁctitious state variables with the unavoidable consequence of deriving a polynomial model with dimensions higher than the original plant. Moreover to guarantee the equivalence between the polynomial version and the original manifold, the following state constraints must be taken into account: zr ¼ Fðz1 ; …; zs Þ;

8r4s:

ð20Þ

Therefore the non-polynomial system (19) can be rewritten as ( z_ a ¼ f a ðza ; zb Þ z_ b ¼ f b ðza ; zb Þ

ð21Þ

where za ¼ ½z1 …zs are the state variables of the original system (19), zb ¼ ½zsþ1 …zn the additional state variables and f a ðza ; zb Þ , f b ðza ; zb Þ the polynomials forms. Moreover, besides the equality constraints (20) arising from the Recasting procedure, the following further equality/ inequality requirements: Ga ðza ; zb Þ ¼ 0

ð22Þ

Gb ðza ; zb ÞZ 0

ð23Þ

are necessary in order to obtain a polynomial form for f a ðza ; zb Þ and f b ðza ; zb Þ. Note that F; Ga and Gb are column vectors of functions with appropriate dimensions. For a detailed analysis and exhaustive examples, the interested reader can refer to [28]. Finally, it is worth to recall that different schemes can be used to recast non-polynomial systems into the polynomial ﬁeld, see e.g. [29] where the nonlinear function is described by means of a truncated Taylor expansion with the remainder conﬁned into a polytopic set. 3. Problem formulation We consider autonomous vehicles described by polynomial discrete-time systems zðt þ 1Þ ¼ f p ðzðtÞ;

uðtÞÞ þ Gw wðtÞ

ð24Þ

where f p : Rn Rm -Rn is an array of multivariate polynomials, zðtÞ ¼ zqa ðt ÞT zva ðt ÞT zb ðt ÞT T A X Rn the state with zqa A Rq and zva A Rv accounting for the robot pose and its derivative respectively, uðtÞ A U Rm the control input, wðtÞA W Rw a bounded exogenous disturbance including unmodeled dynamics and Gw A Rnw . Moreover, the vehicle is subject to Please cite this article as: G. Franzè, W. Lucia, An obstacle avoidance model predictive control scheme for mobile robots subject to nonholonomic constraints: A sum-of-squares approach, Journal of the Franklin Institute. (2015), http://dx.doi.org/10.1016/j.jfranklin.2015.03.021

G. Franzè, W. Lucia / Journal of the Franklin Institute ] (]]]]) ]]]–]]]

9

nonholonomic constraints of the following form: aTi ðzqa ðtÞÞzva ðtÞ ¼ 0;

i ¼ 1; …; na ; 8 t A Zþ ;

ð25Þ

with na being the number of nonholonomic constraints and ai : R -R ; i ¼ 1; …; na , nonlinear functions of the vehicle pose, see [6]. In the sequel, it is supposed that vehicles move within dynamic environments characterized by the following uncertain obstacle conﬁguration: q

v

each moving object can occupy pre-speciﬁed positions so that a ﬁnite and known set of obstacle scenarios comes out; time instants at which the obstacle scenario changes are unknown.

This set-up concerns with partially known working areas where a speciﬁc set of obstacle scenarios may occur, e.g. production/assembly lines [30], restricted areas (buildings or rooms) where surveillance or housekeeping are required tasks [31] and so on. In order to formally describe this framework, the following deﬁnitions will be used: Deﬁnition 9. Let Obij be an object. Then an obstacle scenario Oi is deﬁned as Oi ≔fObi1 ; …; Obini g

ð26Þ

where ni denotes the number of the involved objects. The obstacles Obij ; i ¼ 1; …; l; j ¼ 1; …; ni , have a polyhedral convex structure described as the intersection of lj half-spaces: 2 T 3 2 3 gij H ij 6 17 1 7 6 7 6 ⋮ 7p r 6 ⋮ 7 Obij : 6 ð27Þ 6 7 6 T 7 4 i 5 5 4 i gj Hj lj

lj

where p≔Bz A R2 are the planar components of the state space z A Rn and B AR2n a projection matrix. Deﬁnition 10. Let Oi be an obstacle scenario. Then the non-convex obstacle-free region pertaining to Oi is identiﬁed as follows: Oifree ≔fzA Rn : hi ðzÞ40g

ð28Þ

where hi : Rn -Rnf characterizes the admissibility state space region and nf is the number of component-wise inequalities. Then, the problem we want to solve can be stated as follows. Obstacle Avoidance Motion Planning (OAMP) Problem – Given a set of obstacle scenarios Oi ; i ¼ 1; …; l, determine a state-feedback control policy uðtÞ ¼ gðzðtÞÞ

ð29Þ

compatible with Eqs. (3), (4), (25) and (28), such that starting from an initial condition zð0Þ the robot trajectory z(t) is driven to a target position zf regardless of any obstacle scenario occurrence. Please cite this article as: G. Franzè, W. Lucia, An obstacle avoidance model predictive control scheme for mobile robots subject to nonholonomic constraints: A sum-of-squares approach, Journal of the Franklin Institute. (2015), http://dx.doi.org/10.1016/j.jfranklin.2015.03.021

G. Franzè, W. Lucia / Journal of the Franklin Institute ] (]]]]) ]]]–]]]

10

4. Set-theoretic based approach background Consider autonomous vehicles described by discrete-time LTI systems zðt þ 1Þ ¼ ΦzðtÞ þ GuðtÞ þ Gw wðtÞ

ð30Þ

In [32,33] a set-theoretic based solution to the OAMP problem has been recently obtained for models (30) by dealing with the following key aspects: (a) Given a set of obstacles (27), determine computationally tractable geometric conditions under which the admissibility region (28) is convexiﬁed. (b) Given an obstacle scenario Oi , determine a sequence of inner ellipsoidal approximations i N i N i such that there exists at least a T s s ¼ 0 of the exact one-step controllable sets Ξ is s ¼ 0 feasible path to the goal . zf Ni (c) Given the sequences T is s ¼ 0 ; i ¼ 1; …; l, ensure viability properties under time-varying scenarios. Next sections will be devoted to summarize the main technical issues that allow to face (a)–(c). 4.1. Obstacle constraints convexiﬁcation The non-convex constraints describing Eq. (28) can be rewritten as ( )

lj T ni i n i i Ofree ≔ ⋂ zA R : ⋃ H j Bz4 gj j¼1

k¼1

k

ð31Þ

k

and recast into computationally tractable requirements by means of the convexiﬁcation procedure (CP) developed in [33]

a

a

a o ni n i a i a i a c ¼ c1 \ c2 \ ⋯ \ cini ¼ ⋂ zA Rn j H ij Bz4 gij ð32Þ j¼1

a where cij are the active constraint regions pertaining to each j-th obstacle of Oi . 4.2. One-step controllable sets Point (b) relies on the estimation of the domain of attraction (DoA), i.e. all the initial conditions zð0Þ for which there exists an admissible path to zf . Note that the basic construction (recursions (6)) of T ik may give rise to “small” DoA estimates, i.e. ⋃li ¼ 1 T iN i , because a saturation effect may occur on the one-step controllable sets growth. To overcome such a drawback, here we modify the construction of these sets as follows: One-step Controllable Set Procedure (OCSP) 1: Given the goal zf and chosen the initial condition zð0Þ as follows: zð0Þ≔argmax Jz zf J 2 design the pair ðK 0 ; T 0 Þ, with T 0 a RPI region centered in zf where K0 is zAX

the stabilizing state-feedback gain complying with the constraints (4), (3), (28). Store the subscript m ¼ 0 into an index vector hereafter named IRi . Let T i0 ¼ T 0 and z0eq ¼ zf be the initial terminal region and equilibrium point, respectively;

Please cite this article as: G. Franzè, W. Lucia, An obstacle avoidance model predictive control scheme for mobile robots subject to nonholonomic constraints: A sum-of-squares approach, Journal of the Franklin Institute. (2015), http://dx.doi.org/10.1016/j.jfranklin.2015.03.021

G. Franzè, W. Lucia / Journal of the Franklin Institute ] (]]]]) ]]]–]]]

11

N m 2: Derive the sequence T ik k ¼ 1 by using recursions (6) under the additional state constrain (28). The integer Nm is the saturation level for the region growth; 3: Store the index Nm into an index vector denoted as LRi ; 4: if zð0Þ= 2 T iN m , then such that: 5: if there exists a candidate equilibrium zmþ1 eq mþ1 k ðaÞ zeq a zeq ; k ¼ 0; 1; …; m; ðbÞ zmþ1 eq ≔arg mini J zeq zð0Þ J 2 zeq A T N m

then and K mþ1 satisfying 6: Design a new pair ðK mþ1 ; T iN m þ1 Þ, with T iN m þ1 centered in zmþ1 eq i Eqs. (4), (3), (28). Store the corresponding index N m þ 1 into IR ; N m 7: Add T ik k ¼ 1 to the previous computed sequence; 8: m’m þ 1, T i0 ≔T iN m þ1 and goto Step 2; 9: else Stop; 10: end if 11: else Stop; 12: end if

4.3. Time-varying obstacle scenario occurrences The question (c) focuses on the difﬁculties arising when unpredictable obstacle scenario changes occur. Since the l controllable set sequences fT ik g; i ¼ 1; …; l, are computed under the hypothesis that a single obstacle scenario Oi takes place, the viability retention cannot be ensured 0 when Oi -Oi because a switching to a different set sequence must be imposed. Here, the idea is to design two further families of one-step controllable sets, hereafter named 0 Obstacle and Scenario Switching sequences, whose combined use allows to safely switch to fT ik g.

4.3.1. Obstacle controllable sequences Obi

0

The Obstacle sequences fT k j g; j ¼ 1; …; ni0 , have the aim to encircle the corresponding 0 obstacles Obij ; j ¼ 1; …; ni0 and they are introduced because of the following argument: 0

Let Oi be the current scenario, if at t the obstacle scenario change Oi -Oi occurs and the 0 2 T ik , for some k, then collisions could happen because the sequence fT ik g is no current state zðtÞ= longer admissible. In order to avoid such an undesired event, the Obstacle sets are designed such that when Obi

0

zðt^Þ A fT k j g; t^ Z t, the trajectory zðtÞ; 8t Z t^ remains conﬁned into them until the switching to 0 the correct sequence fT ik g is made admissible. Obi

0

The computation of the sequences fT k j g; j ¼ 1; …; ni0 , is performed by using the OCSP procedure Obi

0

where both zð0Þ and zf are substituted with the equilibria zeq j ; j ¼ 1; …; ni0 . These equilibrium points 0 are selected as those at the maximum distance from the obstacles Obij ; j ¼ 1; …; ni0 , and satisfying the Please cite this article as: G. Franzè, W. Lucia, An obstacle avoidance model predictive control scheme for mobile robots subject to nonholonomic constraints: A sum-of-squares approach, Journal of the Franklin Institute. (2015), http://dx.doi.org/10.1016/j.jfranklin.2015.03.021

G. Franzè, W. Lucia / Journal of the Franklin Institute ] (]]]]) ]]]–]]]

12

following condition: Obi

0

0

0

distðBzeq j ; Obij ÞodistðT imaxj ; Obij Þ;

j ¼ 1; …; ni0 ;

ð33Þ

where T imaxj are the sets corresponding to the greatest indices s of fT ik g such that 0

T imaxj \ Obij ¼ ∅;

j ¼ 1; …; ni0

ð34Þ 0 Obij

The latter allows to ensure that the equilibrium xeq lies in the zone bounded by the obstacle 0 and the closer element of fT ik g which does not intersect Obij , i.e. T imaxj . 4.3.2. Scenario switching controllable sequences Although the Obstacle sequences have the important merit to avoid collisions when the 0 scenario Oi occurs, their use leads to the following drawback: 0 Let Oi -Oi be a generic scenario change occurrence, once the trajectory zðÞ enters inside 0

Obi

Obij

the region deﬁned by fT k j g, it is by construction driven to the terminal region T 0 Obi zeq j

0

centered at

0

the equilibrium and it will be there conﬁned. This situation clearly compromises any chance of the vehicle to reach the target zf and for this 0 reason the trajectory zðÞ must be driven towards the controllable sequence fT ik g. To this aim, the SW ij

Scenario Switching sequences fT k

0

g; j ¼ 1; …; ni , are introduced and designed as follows: SW i

0

0

1. for any admissible scenario change Oi -Oi , choose an equilibrium zeq j belonging to some 0 T ik ; 0 0 SW i Obi 2. apply the OCSP procedure with zeq j in place of zð0Þ and zeq j in place of zf .

0

Then, the switching fT ik g-fT ik g is accomplished as follows:

Obi

0

Let zðt^ÞA fT k j g be the current state. As soon as SW ij

zðtÞA T k^

0

for some k^ and for some t Z t^;

the set-membership to fT As soon as 0

zðtÞA T ik

SW ij k

ð35Þ

0

g is considered;

for some k and for some t Z t^:

ð36Þ

0

the set-membership to fT ik g can be used.

A relevant aspect leaved out in the above developments is to guarantee that under a scenario change the robot does not bump on any obstacle during the switching phase. To this end, the SW i

sequences fT ik g and fT k j g are derived under the following further condition: JBðzþ zÞJ r ϵ;

ð37Þ

Please cite this article as: G. Franzè, W. Lucia, An obstacle avoidance model predictive control scheme for mobile robots subject to nonholonomic constraints: A sum-of-squares approach, Journal of the Franklin Institute. (2015), http://dx.doi.org/10.1016/j.jfranklin.2015.03.021

G. Franzè, W. Lucia / Journal of the Franklin Institute ] (]]]]) ]]]–]]]

13

where zþ ¼ Φz þ Gu is the disturbance-free one-step evolution. The scalar ϵ can be determined by exploiting the Obstacle sequences properties as below detailed:

Obi

for each scenario Oi and for each j-th obstacle, consider the sequence fT k j g and determine IN i

the sequence fT m j g as follows: 8 IN ij Obij Obij > > < T m ≔T LRij ðmÞ ⋂ T LRij ðmþ1Þ ;

m ¼ 1; …; dimðLRij Þ 1

ð38Þ

IN j Obj Obj > > : T dimðLRij Þ ≔T LRij ð1Þ ⋂ T LRij ðdimðLRij ÞÞ i

i

i

Obi

where the index vectors LRij store the saturation regions for each sequence fT k j g; compute the minimum dij and maximum Dij Euclidean distances between the j-th obstacle

(polyhedron) and the indexed set sequence fT m j g; compute

IN i

d≔max i ¼ 1;…;l;

dij

ð39Þ

Dij

ð40Þ

j ¼ 1;…;pi

D≔ imin ¼ 1;…;l; j ¼ 1;…;pi

ϵ≔D d

ð41Þ

4.3.3. One-step controllable families computation The consequence of all the above developments is that the set sequences must be computed by using the following scheme: Families Construction Procedure (FCP) 0. Let G ¼ ðV; EÞ be the oriented graph with V the l obstacle scenarios and E the set of ordered 0 pairs fi; i0 g; i; i0 A V, such that the switching Oi -Oi is admissible; 1. Generate the sequences fT ik g; i ¼ 1; …; l; Obi

2. Generate the Obstacle families fT k j g; 8 iA Vr ; j ¼ 1; …; ni , according to Eqs. (33)–(34); 3. Estimate the tolerance level ϵ by Eqs. (39)–(41); 4. Update the sequences fT ik g; i ¼ 1; …; l under the additional constraint (37); SW i

5. Generate the sequences fT k j g; 8 iA Vr ; j ¼ 1; …; ni under the additional constraint (37).

5. A SOS-based RHC algorithm As discussed in the Introduction, the aim of this paper is to present an MPC strategy for the class of nonlinear systems (24) subject to nonholonomic constraints (25). In this section, the settheoretic based scheme of [33] summarized in Section 4 will be extended to comply with the Please cite this article as: G. Franzè, W. Lucia, An obstacle avoidance model predictive control scheme for mobile robots subject to nonholonomic constraints: A sum-of-squares approach, Journal of the Franklin Institute. (2015), http://dx.doi.org/10.1016/j.jfranklin.2015.03.021

G. Franzè, W. Lucia / Journal of the Franklin Institute ] (]]]]) ]]]–]]]

14

polynomial plant description (24). To this end, the following points deserve an adequate consideration within the polynomial framework: (1) Computation of controllable set sequences fT ik g. (2) Sum-of-Squares characterization of the requirement (37) which limits the maximum displacement of the one-step state evolution so that obstacle constraints are satisﬁed with a tolerance level ϵ40. (3) Relaxation of the on-line optimization: uðtÞ ¼ argmin J kðtÞ ðzðtÞ; uÞ subject to

ð42Þ

s f p ðzðtÞ; uÞA Ξ~ kðtÞ 1 ; u A U

ð43Þ

5.1. One-step controllable families SOS characterization By resorting to Proposition 1 and under the further requirements imposed by the nonholonomic constraints (25), we have that the sets of states k-steps controllable to Ξ0 complying with Eqs. (3), (4) and (25) can be obtained by deﬁning a collection of polynomial functions fV k ðz; uÞg each one satisfying the following condition and set containment: ð44Þ V k ðz; uÞ40; 8 zA Rn of0g; 8 u A Rm of0g and V k ð0; 0; 0Þ ¼ 0 z A Rn ; u A Rm : V k ðz; uÞr 1 \ z A Rn ; u A Rm : cx ðzÞr 1 \ zA Rn ; u A Rm : cu ðuÞr 1 \ zA Rn ; u A Rm : Ga ðza ; zb Þ ¼ 0 \ zA Rn ; u A Rm : zsþ1 ¼ Fðz1 ; …; zs Þ \ ⋯ \ z A Rn ; uA Rm : zn ¼ Fðz1 ; …; zs Þ na

⋂

i¼1

zA Rn ; uA Rm : aTi ðzq ðtÞÞzv ðtÞ ¼ 0 D zA Rn ; u A Rm : Ψ~ k 1 ðf p ðz; uÞÞ r1 ð45Þ

The next developments summarize methodological requirements and computational tools under which one-step controllable regions can be obtained. Proposition 2. Conditions (44)–(45)Pare satisﬁed if there exist a candidate function V k ðz; uÞA P , polynomial multipliers si A nþm ; i ¼ 1; …; 5, a radially unbounded positive deﬁnite nþm function l0 , li ; i ¼ 1; …; na , and ζ i ðz; uÞ; i ¼ 1; …; n s, positive polynomials such that ð46Þ V k ðz; uÞ l0 ðz; uÞA Σ nþm 8 ðð1 V k ðz; uÞÞs1 þ ð1 cu ðuÞÞs2 þ ð1 cx ðzÞÞs3 þ ð1 V k ðz; uÞÞð1 cu ðuÞÞs4 > > > > ns X > > > < þ ð1 V k ðz; uÞÞð1 cx ðzÞÞs5 þ ðΨ~ k 1 ðf p ðz; uÞÞ 1Þ þ ζ i ðz; uÞðzsþi ¼ Fðz1 ; …; zs ÞÞ > > > > > > > :

i¼1

þ

na X i¼1

li ðz; uÞðaTi ðzq ðtÞÞzv ðtÞÞÞ A Σ nþm ð47Þ

Please cite this article as: G. Franzè, W. Lucia, An obstacle avoidance model predictive control scheme for mobile robots subject to nonholonomic constraints: A sum-of-squares approach, Journal of the Franklin Institute. (2015), http://dx.doi.org/10.1016/j.jfranklin.2015.03.021

G. Franzè, W. Lucia / Journal of the Franklin Institute ] (]]]]) ]]]–]]]

Proof. By resorting to Positivstellensatz Theorem and S-Procedure arguments.

15

□

Remark 1. An inner approximation of the one-step controllable region in the extended space (z,u) can be achieved by means of the following optimization problem with bilinear matrix inequality (BMI) constraints: n n ona o V^ k ðz; uÞ; fs^ i g5i ¼ 1 ; ^l i ≔arg minP Tr ðQk Þ i¼1

V k ðz;uÞ A

si A

P

nþm ;i ¼ 1;…;5;

nþm li A Rnþm ;i ¼ 1;…;na ; ζi ðz;uÞRnþm ;i ¼ 1;…;n s

s:t:

ð46Þ–ð47Þ

ð48Þ

where Qk is the Gram matrix of the decomposition V k ðz; uÞ ¼ φT ðz; uÞQk φðz; uÞ and Qk can be viewed as a shaping matrix of an ellipsoid in the extended space ðz; uÞ. From a computational point view, the BMI optimization (48) can be solved by means of standard SDP packages, [34,35], which make use of standard bisection and/or moment procedures as also shown in [36]. Once the set z AX ; uA U : V^ k ðz; uÞr 1 is obtained, the projected set T k along z can be easily computed by exploiting the quantiﬁer elimination procedure detailed in [37]. Remark 2. Finally, notice that the additional constraint J Bðf p ðxðtÞ; uÞ zðtÞÞ J 22 r ϵ2 straightforwardly translates into a SOS requirement and, therefore, can be used within the polynomial version of the FCP procedure. 5.2. On-line optimization Here, we shall provide a convenient formulation of the optimization (42)–(43) so that it can be rephrased into a SDP problem. As detailed in [33], the On-line phase of the RHC algorithm (namely OA-MPC) prescribes the solution of the following optimization problem: uðtÞ ¼ argmin J kðtÞ ðzðtÞ; uÞ

ð49Þ

s:t:

u

J Bðf ðzðtÞ; uÞ xðtÞÞ J 22 r ϵ2

ð50Þ

f p ðzðtÞ; uÞA Ξ~ kðtÞ 1 ;

ð51Þ

uAU

where the running cost is deﬁned as follows: J kðtÞ ðzðtÞ; uÞ≔α J f p ðzðtÞ; uÞ J 2~ ðÞ

Q kðtÞ 1

þ β J uJ 2Ru

ð52Þ

~ iðtÞ 1 the Gram matrix of Ξ~ iðtÞ 1 and α; β A ½0; 1. with Ru ¼ RTu 40, Q As it is evident the optimization (49) –(51) is computationally high demanding and, therefore, a relaxation procedure has to be considered. To this end, note ﬁrst that the sets of states and inputs satisfying Eqs. (50)–(51) belong to ( ) Ψ~ ðf ðz; uÞÞ r 1 n m kðtÞ 1 ~ S kðtÞ 1 ≔ zA R ; u A R : ð53Þ J Bðf ðzðtÞ; uÞ zðtÞÞ J 22 r ϵ2 with Ψ~ kðtÞ 1 ðf ðz; uÞÞ and J Bðf ðzðtÞ; uÞ zðtÞÞ J 22 SOS polynomials. Then, the following inner Please cite this article as: G. Franzè, W. Lucia, An obstacle avoidance model predictive control scheme for mobile robots subject to nonholonomic constraints: A sum-of-squares approach, Journal of the Franklin Institute. (2015), http://dx.doi.org/10.1016/j.jfranklin.2015.03.021

G. Franzè, W. Lucia / Journal of the Franklin Institute ] (]]]]) ]]]–]]]

16

ellipsoidal approximation of S~ iðtÞ 1 is considered in the extended space ðz; uÞ : n o T T T T 1 T~ kðtÞ 1 ≔ zT ; uT A Rnþm : ½zT ; uT PkðtÞ r 1 DS~ kðtÞ 1 1 z ; u

ð54Þ

Using Eq. (54) in place of S~ kðtÞ 1 does not ensure feasibility retention of the OA-MPC algorithm: such a property is simply recovered by computing the entire sequence fΞ k g under the satisfaction of the relaxation (54). The family of one-step controllable sets obtained by means of the following procedure is hereafter denotes with fT k g Polynomial Ellipsoidal Set (PES) procedure 1. Given the set T k 1 (T 0 Ξ 0 ), compute the inner ellipsoidal approximation T~ k 1 of S~ k 1 by solving an Eigenvalue LMI optimization problem (EVP) [38]; T 2. Solve the optimization problem (48) with½zT ; uT Pk11 zT ; uT r 1in place of Ψ~ k 1 ðf p ðz; uÞÞ r 1; 3. Compute the projection (9) (Ξ k -T k ) . Finally, the optimization (49)–(51) can be recast into a simple LMI problem. Proposition 3. The optimization (49)–(51) is recast into the following semi-deﬁnite programming problem: minαγ z þ βγ u

ð55Þ

u

" s:t: "

γu n

γz

½zðt ÞT uT

n

PkðtÞ 1

# Z0

# uT Z0 Ru 1

ð56Þ

ð57Þ

Proof. An upper-bound to the cost (52) can be derived by means of the following arguments:

J uJ 2Ru r γ u ; γ u A Rþ ; the following set n o T 1 T T T T~ kðtÞ 1 ðγ z Þ≔ zT uT A Rnþm ; zT uT PkðtÞ r γ z ; γ z A Rþ 1 x u with γ z A Rþ a slack variable, can be considered in place of T~ kðtÞ 1 .

Then, by choosing two weighting scalars α; β A ½0; 1, the optimization problem (56)–(57) becomes

s:t:

min αγ z þ β γ u u 8 T u Ru u r γ u > > > > < ½zðtÞT uT T A T~ kðtÞ 1 ðγ Þ z T u uru > > > > : γ r1 u

Please cite this article as: G. Franzè, W. Lucia, An obstacle avoidance model predictive control scheme for mobile robots subject to nonholonomic constraints: A sum-of-squares approach, Journal of the Franklin Institute. (2015), http://dx.doi.org/10.1016/j.jfranklin.2015.03.021

G. Franzè, W. Lucia / Journal of the Franklin Institute ] (]]]]) ]]]–]]]

17

that is straightforwardly converted into the LMI conditions (56)–(57) via Schur complements. □ 5.3. RHC algorithm By collecting the above developments, the Receding Horizon Control algorithm proposed in [33] is here adapted to comply with polynomial systems. In the sequel the following assumptions are made: Assumption 1. At each time instant t the robot is informed about the current obstacle scenario, i.e. scðtÞ. □ 0

Assumption 2. At each time instant t, an obstacle scenario change Oi -Oi can occur if the actual robot position zp ðtÞ ¼ BzðtÞ is such that ni0

0

0

zðtÞ= 2 ⋃ fz A Rn j H ij Bz r gij þ dg

ð58Þ

j¼1

Sum-of-Squares Obstacle Avoidance MPC (SOS-OA-MPC) Algorithm Off-line: 1 Given the obstacle scenarios Oi ; i ¼ 1; …; l, the initial condition zð0Þ and the goal zf , compute a non-empty robust invariant ellipsoidal region T 0 Rn and a stabilizing feedback gain K0 complying with the constraints (3)–(4), (25) and (32); 2 Jointly apply the FCP and PES procedures to generate the inner ellipsoidal approximations n Obi o i Obi Ξ k -fT ik g; i ¼ 1; …; l, Ξ k j -fT k j g; 8 iA Vr ; j ¼ 1; …; ni , and n SW i o SW i Ξ k j -fT k j g; 8 iA Vr ; j ¼ 1; …; ni , such that 1 0 1 ! 0 ⋃ T ik

zð0ÞA

i ¼ 1;…;l k

i

i

SW Ob [ @ ⋃ T k jA [ @ ⋃ T k jA 8 i A Vr j ¼ 1;…;ni ; k

8 i A Vr j ¼ 1;…;ni ; k

3 Store the ellipsoidal sequences. On-line: 1: if ! ObjscðtÞ scðtÞ [ ⋃ Tk [ zðt ÞA ⋃T k j ¼ 1;…;ni ; k

k

⋃ T

j ¼ 1;…;ni ; k

SW jscðtÞ k

! then curr≔scðtÞ

2: else curr≔prec; 3: end if then 4: if zðtÞA ⋃ T curr k k candidate g; 5: goto Step 10 by considering fT curr k g as the candidate sequence, i.e. fT k 6: end if SW curr j

7: if zðt ÞA ⋃ T k j ¼ 1;…;ni ; k

SW curr j

then fT candidate g≔fT k k Obcurr j

8: else fT candidate g≔fT k k 9: end if

g

g

Please cite this article as: G. Franzè, W. Lucia, An obstacle avoidance model predictive control scheme for mobile robots subject to nonholonomic constraints: A sum-of-squares approach, Journal of the Franklin Institute. (2015), http://dx.doi.org/10.1016/j.jfranklin.2015.03.021

G. Franzè, W. Lucia / Journal of the Franklin Institute ] (]]]]) ]]]–]]]

18

10 Find kðtÞ≔mink fzðtÞA T candidate g k 11: if kðtÞA IRcandidate then uðtÞ ¼ K kðtÞ zðtÞ 12: else 13: solve Eqs. (55)–(57) 14: end if 15: Apply uðtÞ; prec≔curr; t≔t þ 1; goto Step 1

The next proposition shows that the proposed SOS-OA-MPC algorithm enjoys the feasibility retention and closed-loop stability properties. Proposition 4. Let the sequences of sets

i T k ; i ¼ 1; …; l; n Obi o T k j ; 8 iA Vr ; j ¼ 1; …; ni ; n SW i o T k j ; 8 iA Vr ; j ¼ 1; …; ni ,

be non-empty and ! zð0ÞA

⋃ T ik

i ¼ 1;…;l k

0

1

0

1

Obij

SW ij

[@ ⋃ Tk A[@ ⋃ Tk A 8 i A Vr j ¼ 1;…;ni ; k

8 i A Vr j ¼ 1;…;ni ; k

Then, the SOS-OA-MPC algorithm always satisﬁes the constraints and ensures Uniformly Ultimate Boundedness for all time-varying occurrences of Oi ; i ¼ 1; …; l. Proof. Note that existence of solutions at time t implies existence of solutions at time tþ1, because the optimization problem in Step 13 is always feasible. In fact, by construction there exists an input vector u satisfying the constraints (3), (4), (25) and (32) such that the setmembership requirement in Eq. (51) holds true. Then because each element of the sequence fT~ i g has been built up under the satisfaction of the additional constraint constraints (37), see Eq. (53), at the next time instant tþ1 the existence of a solution uðt þ 1Þ for the Step 13 is ensured. Finally, Uniformly Ultimate Boundedness of the strategy follows by noting that the trajectory is in the worst case conﬁned to 1 0 1 ! 0 ⋃ T ik

i ¼ 1;…;l k

i

i

SW Ob [ @ ⋃ T k jA [ @ ⋃ T k jA 8 i A Vr j ¼ 1;…;ni ; k

8 i A Vr j ¼ 1;…;ni ; k

□

6. Simulations This section presents numerical results that illustrate the features of the proposed MPC strategy. We shall assume that the robot navigates within a planar dynamic environment where two obstacle scenarios are hypothesized: O1 (continuous lines) and O2 (dashed lines), see Fig. 3. Please cite this article as: G. Franzè, W. Lucia, An obstacle avoidance model predictive control scheme for mobile robots subject to nonholonomic constraints: A sum-of-squares approach, Journal of the Franklin Institute. (2015), http://dx.doi.org/10.1016/j.jfranklin.2015.03.021

G. Franzè, W. Lucia / Journal of the Franklin Institute ] (]]]]) ]]]–]]]

19

sc(t)

2

1 0

20

40

60

80

100

120

Time [sec]

Fig. 2. Obstacle scenario switchings. 30 t=73 sec

25

x

y[m]

20

f

2 Ob1 2 =Ob 2

15

t=69 sec

10

Ob 2 1

t=59 sec

5 0

Ob 1 1

0

5

t=56 sec

10

x(0)

15

20

25

x [m] Fig. 3. Robot path under time-varying obstacles.

The planar environment is described by 2 3 2 3 1 0 0 " # 6 1 7 6 7 z 0 7 1 6 6 25 7 r6 7 6 7 4 0 4 0 5 1 5 z2 0

1

ð59Þ

30

while the obstacles locations are below reported: 2 3 2 3 1 0 14 " # 6 1 6 15 7 0 7 6 7 z1 6 7 Ob11 : 6 r6 7 7; Ob21 4 0 4 13 5 1 5 z2 0

1 2

1 6 1 6 Ob12 ¼ Ob22 : 6 4 0 0

1

6 1 6 :6 4 0 0

14 3 2 3 5 0 " # 6 6 7 0 7 7 z1 6 7 r6 7 7 4 18 5 1 5 z2 1

2

0

3

"

#

2

14

3

6 15 7 0 7 7 z1 6 7 r6 7 7 4 11 5 1 5 z2 1

ð60Þ

12

ð61Þ

19

Please cite this article as: G. Franzè, W. Lucia, An obstacle avoidance model predictive control scheme for mobile robots subject to nonholonomic constraints: A sum-of-squares approach, Journal of the Franklin Institute. (2015), http://dx.doi.org/10.1016/j.jfranklin.2015.03.021

G. Franzè, W. Lucia / Journal of the Franklin Institute ] (]]]]) ]]]–]]]

20

The aim of the simulation is to drive the robot from the initial planar position p0 ¼ ½20 5T to the target pf ¼ ½5 22T for any obstacle scenario change (from O1 to O2 and viceversa), see Fig. 2. All the simulations have been carried out by using Matlab R2014a and the Ellipsoidal Toolbox [39] over a laptop PC equipped with a Intel Core(TM) 2 Duo CPU. For the sake of completeness, we have considered the dynamical model (18) that has been recast into a polynomial ﬁeld by resorting to the Recasting procedure of Section 2.3. This prescribes that _ z1 ðtÞ ¼ xðtÞ, z2 ðtÞ ¼ yðtÞ, z3 ðtÞ ¼ θðtÞ, z4 ðtÞ ¼ x_ ðtÞ, z5 ðtÞ ¼ y_ ðtÞ, z6 ðtÞ ¼ θðtÞ, two auxiliary variables z7 ðtÞ ¼ sin θðtÞ and z8 ðtÞ ¼ cos θðtÞ are introduced and the following additional constraint imposed: z27 ðtÞ þ z28 ðtÞ ¼ 1

ð62Þ

0.4 0.35 [m/sec]

||V(t) ||

0.3 0.25 0.15

0.2 0.1 0.05 0

0

20

40

60

80

100

120

Time [sec]

1

τ (t) [Nm]

2

τ (t) [Nm]

Fig. 4. Linear velocity. 0,5 0.25 0 −0.25 −0,5 0,5 0.25 0 −0.25 −0,5

0

20

40

60

Time [sec]

80

120

100

Fig. 5. Applied torques.

180 T T T T

160 140

curr(t)

120

T

100

T

80 60 40 20 0

0

20

40

56 60

69 73 80

100

120

Time [sec]

Fig. 6. Set-membership signal. Please cite this article as: G. Franzè, W. Lucia, An obstacle avoidance model predictive control scheme for mobile robots subject to nonholonomic constraints: A sum-of-squares approach, Journal of the Franklin Institute. (2015), http://dx.doi.org/10.1016/j.jfranklin.2015.03.021

G. Franzè, W. Lucia / Journal of the Franklin Institute ] (]]]]) ]]]–]]]

21

Moreover, the nonholonomic constraint (17) translates into z4 ðtÞz8 ðtÞ z5 ðtÞz7 ðtÞ ¼ 0

ð63Þ

Then, the extended state space description has been discretized by using Euler forward differences with the sampling time T¼ 0.5 s. Moreover, the following constraints are prescribed: jτ1 ðtÞj r 0:45 ½N m;

8t Z 0;

ð64Þ

jτ2 ðtÞj r 0:45 ½N m;

8t Z 0;

ð65Þ

J vðtÞJ 2 r 0:16 ½m=s;

8 t Z 0;

ð66Þ

where J vðtÞJ 2 ¼ z24 ðtÞ þ z25 ðtÞ Then, inner approximations of one-step controllable set sequences have been off-line derived:

167 150 T 1k k ¼ 1 and T 2k k ¼ 1 n o n o n o n o Ob1 114 Ob1 138 Ob2 137 Ob2 138 Tk 1 , Tk 2 , Tk 1 and T k 2 k¼1 k¼1 k¼1 k¼1 n o n o n o n o SW 11 47 SW 12 22 SW 21 36 SW 22 30 Tk , Tk , Tk and T k k¼1

k¼1

k¼1

k¼1

All the relevant results are summarized in Figs. 3–6. In Fig. 3 the robot trajectory under timevarying obstacles is depicted. It can be noted that the SOS-OA-MPC strategy is capable to achieve good control performance regardless of any obstacle change. Moreover as it results from Figs. 4 and 5, the prescribed constraints are always fulﬁlled. To appreciate the modus operandi of the proposed scheme, it is important to analyze the signal shown in Fig. 6, because it provides the smaller ellipsoid of the pre-computed families containing the current state zðtÞ. Note that the markers (□ and ■) and (○ and ) have been used to denote the set-membership to the same sequences fT 1k g and fT 2k g, respectively. The latter is instrumental to put in light that if the current obstacle scenario sc(t) is different from that identiﬁed by the SOS-OA-MPC algorithm (namely Steps 1–9), the robot trajectory proceeds along the sequence of the admissible scenario used at the previous time instant (Step 2: curr≔prec). Nonetheless, this does not comprise neither the constraints satisfaction nor the feasibility retention thanking to the one-step evolution bound (50). Let us consider the time interval ½56; 73 (the grey zone of Fig. 6) where the switching O1 -O2 occurs. At t ¼ 56 s the obstacle conﬁguration changes while the current state is such that 150 137 138 36 30 SW 2 SW 2 Ob2 Ob2 zð56Þ= 2 ⋃ T 2k [ ⋃ T k 1 [ ⋃ T k 2 [ ⋃ T k 1 [ ⋃ T k 2 k¼1

k¼1

k¼1

k¼1

k¼1

therefore, as prescribed (Steps 1–3 of the On-line phase of the SOS-OA-MPC algorithm), the used scenario does not change (curr¼ 1), see the marker ■ in Fig. 6 and the membership to the red dashed ellipsoidal sequence in Fig. 3. Note that at 59 s, after the obstacle has been worked around, the robot trajectory direction quickly changes toward xf. This phenomenon fully complies with the nature of the ellipsoidal sequence T 1k which has been off-line built up to reach the target xf irrespective of obstacle occurrences. Please cite this article as: G. Franzè, W. Lucia, An obstacle avoidance model predictive control scheme for mobile robots subject to nonholonomic constraints: A sum-of-squares approach, Journal of the Franklin Institute. (2015), http://dx.doi.org/10.1016/j.jfranklin.2015.03.021

G. Franzè, W. Lucia / Journal of the Franklin Institute ] (]]]]) ]]]–]]]

22

Then at t ¼ 69 s. the correct scenario is recovered when the current state belongs to the SW 2

switching sequence (the blue dotted line ellipsoidal sequence of Fig. 3): zð69ÞA T 10 1 . Finally, at t ¼ 73 s the switching is accomplished and the current state belongs to the blue continuous line ellipsoidal sequence of Fig. 3, i.e. zð73ÞA T 229 Notice that the latter is clearly shown in Fig. 3, where the robot trajectory modiﬁes its direction because a different one-step 2 controllable family is used, i.e. the following switching occurs fT SW 1 g-fT 2 g. The same reasoning applies to the other obstacle scenario occurrence. 7. Conclusions In this paper we have presented a receding horizon strategy for solving the obstacle avoidance motion planning problem for autonomous vehicles described by polynomial systems subject to nonholonomic constraints. Set-theoretic ideas and semialgebraic arguments have been used to take care of all admissible time-varying obstacle scenarios. As one of its main merits, most of the computations are moved in the off-line phase so that the on-line computational load pertaining to the control action computation is modest and the RHC control unit becomes usable in real-time frameworks.

References [1] J. Latombe, in: Robot Motion Planning, Kluwer Academic Publishers, London, 1991. [2] L. Lapierre, R. Zapata, P. Lepinay, Combined path-following and obstacle avoidance control of a wheeled robot, Int. J. Robot. Res. 26 (2007) 361–375. [3] J. Minguez, F. Lamiraux, J.-P. Laumond, Motion planning and obstacle avoidance, in: B. Siciliano, O. Khatib (Eds.), Springer Handbook of Robotics, Springer, Berlin, Heidelberg, 2008, pp. 827–852. [4] G. Chesi, Visual servoing path planning via homogeneous forms and LMI optimizations, IEEE Trans. Robot. 25 (2) (2009) 281–291. [5] A.V. Savkin, C. Wang, Seeking a path through the crowd: robot navigation in unknown dynamic environments with moving obstacles based on an integrated environment representation, Robotics and Autonomous Systems, ISSN 0921- 8890, 2014. [6] A. De Luca, G. Oriolo, Kinematics and dynamics of multi-body systems. CISM Courses and Lectures, in: J. Angeles, A. Kecskemethy (Eds.), Modelling and Control of Nonholonomic Mechanical Systems, vol. 360, Springer, Berlin, 1995, pp. 277–342. [7] R. Fierro, F.L. Lewis, Robust practical point stabilization of a nonholonomic robot using neural networks, J. Intell. Robot. Syst. 20 (1997) 295–317. [8] R.W. Brockett, Asymptotic stability and feedback stabilization, in: Differential Geometric Control Theory, Birkhäuser, Boston, 1983, pp. 181–191. [9] W. Ren, S.J. Sun, R.W. Beard, T.W. Mclain, Nonlinear tracking control for nonholonomic mobile robots with input constraints: an experimental study, in: American Control Conference, Portland, OR, USA, 2005, pp. 4923-4928. [10] D. Gu, H. Hu, Receding horizon tracking control of wheeled mobile robots, IEEE Trans. Control Syst. Technol. 14 (4) (2006) 743–749. [11] Chung-Han Hsieh, Jing-Sin Liu, Nonlinear model predictive control for wheeled mobile robot in dynamic environment, in: IEEE/ASME International Conference on Advanced Intelligent Mechatronics, Kaohsiung, Taiwan, 2012. [12] P. Ögren, N.E. Leonard, A Convergent dynamic window approach to obstacle avoidance, IEEE Trans. Robot. Autom. 21 (2) (2005) 188–195. [13] P. Falcone, F. Borrelli, J. Asgari, H.E. Tseng, D. Hrovat, Predictive active steering control for autonomous vehicle systems, IEEE Trans. Control Syst. Technol. 15 (3) (2007) 566–580. [14] Y. Yoon, J. Shin, H.J. Kim, Y. Park, S. Sastry, Model-predictive active steering and obstacle avoidance for autonomous ground vehicles, Control Eng. Pract. 17 (2009) 741–750. Please cite this article as: G. Franzè, W. Lucia, An obstacle avoidance model predictive control scheme for mobile robots subject to nonholonomic constraints: A sum-of-squares approach, Journal of the Franklin Institute. (2015), http://dx.doi.org/10.1016/j.jfranklin.2015.03.021

G. Franzè, W. Lucia / Journal of the Franklin Institute ] (]]]]) ]]]–]]]

23

[15] J. Huang, C. Wen, W. Wang, Z.-P. Jiang, Adaptive stabilization and tracking control of a nonholonomic mobile robot with input saturation and disturbance, Syst. Control Lett. 62 (3) (2013) 234–241. [16] J.M. Carson III, B. Açikmeşe, R.M. Murray, D.G. MacMartin, A robust model predictive control algorithm augmented with a reactive safety mode, Automatica 49 (2012) 1251–1260. [17] G. Chesi, LMI techniques for optimization over polynomials in control: a survey, IEEE Trans. Autom. Control 55 (11) (2010) 2500–2510. [18] G. Franzè, A nonlinear sum-of-squares model predictive control approach, IEEE Trans. Autom. Control 55 (6) (2010) 1466–1471. [19] G. Chesi, A. Garulli, A. Tesi, A. Vicino, Homogeneous Lyapunov functions for systems with structured uncertainties, Automatica 39 (6) (2003) 1027–1035. [20] M. Choi, T. Lam, B. Reznick, Sum of squares of real polynomials, in: Proceedings of Symposia in Pure Mathematics, vol. 34 (2), 1995, pp. 103–126. [21] P. Parrillo, Structured semideﬁnite programs and semialgebraic geometry methods in robustness and optimization (Ph.D. thesis), California Institute of Technology, CA, USA, 2000. [22] A. Kurzhanski, I. Valyi, in: Ellipsoidal Calculus for Estimation and Control, Birkhäuser, Berlin, 1997. [23] F. Blanchini, S. Miani, in: Set-Theoretic Methods in Control, Birkäuser, Boston, 1997. [24] W. Tan, A. Packard, Stability region analysis using polynomial and composite polynomial Lyapunov functions and sum-of-squares programming, IEEE Trans. Autom. Control 53 (2008) 565–571. [25] R. Fierro, F.L. Lewis, Control of nonholonomic mobile robot: backstepping kinematics into dynamics, in: IEEE Conference on Decision and Control, New Orleans, LA, 1995, pp. 3805–3810. [26] J.M. Eklund, J. Sprinkle, S.S. Sastry, Switched and symmetric pursuit/evasion games using online model predictive control with application to autonomous aircraft, IEEE Trans. Control Syst. Technol. 20 (2011) 604–620. [27] G. Oriolo, A. de Luca, M. Vendittelli, WMR control via dynamic feedback linearization: design, implementation, and experimental validation, IEEE Trans. Control Syst. Technol. 10 (6) (2002) 835–852. [28] A. Papachristodoulou, S. Prajna, Analysis of non-polynomial systems using the sum of squares decomposition, positive polynomials in control, Lecture Notes in Control and Information Sciences, vol. 312, Springer,, Berlin, Heidelberg 23–43. [29] G. Chesi, Estimating the domain of attraction for non-polynomial systems via LMI optimizations, Automatica 45 (2009) 1536–1541. [30] T. Brogårdh, Present and future robot control development – an industrial perspective, Annu. Rev. Control 31 (2007) 69–79. [31] T. Fraichard, H. Asama, Inevitable collision states a step towards safer robot? in: IEEE/RSJ International Conference on Intelligent Robots and Systems, Las Vegas, NV, USA, 2003, pp. 553–574. [32] G. Franzè, W. Lucìa, M. Muraca, An obstacle avoidance receding horizon control scheme for autonomous vehicles, in: IEEE American Control Conference, Washington, DC, USA, 2013. [33] G. Franzè, W. Lucia, The obstacle avoidance motion planning problem for autonomous vehicles: a low-demanding receding horizon control scheme, Syst. Control Lett. 77 (2015) 1–10. [34] J. Löfberg, A toolbox for modeling and optimization in MATLAB, in: Proceedings of the (CACSD) Conference, Taipei, Taiwan, 2004. [35] D. Henrion, J.B. Lasserre, Detecting global optimality and extracting solutions in gloptipoly, IEEE Control Syst. Mag. 24 (3) (2004) 72–83. [36] G. Franzè, A. Casavola, D. Famularo, E. Garone, An off-line MPC strategy for nonlinear systems based on SOS programming, Nonlinear Model Predictive Control (2009) 491–499. [37] C.W. Brown, QEPCAD quantiﬁer elimination by partial cylindrical algebraic decomposition, 2003, Available from: 〈http://www.cs.usna.edu/qepcad/B/QEPCAD.html〉. [38] G. Chesi, Estimating the domain of attraction via union of continuous families of Lyapunov estimates, Syst. Control Lett. 56 (2007) 326–333. [39] A.A. Kurzhanskiy, P. Varaiya, Ellipsoidal toolbox (ET), in: 45st IEEE Conference on Decision and Control, San Diego, CA, USA, 2006, pp. 1498–1503.

Please cite this article as: G. Franzè, W. Lucia, An obstacle avoidance model predictive control scheme for mobile robots subject to nonholonomic constraints: A sum-of-squares approach, Journal of the Franklin Institute. (2015), http://dx.doi.org/10.1016/j.jfranklin.2015.03.021

An obstacle avoidance model predictive control scheme for mobile robots subject to nonholonomic constraints: A sum-of-squares approach

An obstacle avoidance model predictive control scheme for mobile robots subject to nonholonomic constraints: A sum-of-squares approach

Recommend Documents