An Efficient Inexact NMPC Scheme with Stability and Feasibility Guarantees

10th IFAC Symposium on Nonlinear Control Systems 10th IFAC Symposium on Nonlinear Control Systems August 23-25, 2016. Monterey, California, USA 10th I...

Download PDF

628KB Sizes 0 Downloads 75 Views

Report

PDF Reader
Full Text

10th IFAC Symposium on Nonlinear Control Systems 10th IFAC Symposium on Nonlinear Control Systems August 23-25, 2016. Monterey, California, USA 10th IFAC Symposium on Control Systems August 23-25, 2016. Monterey, California, USA 10th IFAC Symposium on Nonlinear Nonlinear Control Systems Available August 23-25, 2016. Monterey, California, USA August 23-25, 2016. Monterey, California, USA online at www.sciencedirect.com

ScienceDirect IFAC-PapersOnLine 49-18 (2016) 053–058

An Efficient Inexact NMPC Scheme with An Efficient Inexact NMPC Scheme with An Efficient Inexact NMPC Scheme AnStability Efficientand Inexact NMPCGuarantees Scheme with with Feasibility Stability and Feasibility Guarantees Stability and Feasibility Guarantees Stability and Feasibility∗∗Guarantees ∗ ∗

Andrea Moritz Diehl ∗ Andrea Zanelli Zanelli ∗∗ Rien Rien Quirynen Quirynen ∗∗ ∗∗ Moritz Diehl ∗ Andrea Andrea Zanelli Zanelli ∗ Rien Rien Quirynen Quirynen ∗∗ Moritz Moritz Diehl Diehl ∗ ∗ ∗ University of Freiburg, Freiburg, Germany (e-mail: ∗ University of Freiburg, Freiburg, Germany (e-mail: ∗ University of [email protected], University of Freiburg, Freiburg, Freiburg, Freiburg, Germany Germany (e-mail: (e-mail: [email protected], [email protected], [email protected]) [email protected], [email protected]) ∗∗ [email protected]) ESAT-STADIUS, KU Leuven University, Leuven, [email protected]) ∗∗ Department ∗∗ Department ESAT-STADIUS, KU Leuven University, Leuven, ∗∗ Department ESAT-STADIUS, KU Belgium (e-mail: [email protected]). Department ESAT-STADIUS, KU Leuven Leuven University, University, Leuven, Leuven, Belgium (e-mail: [email protected]). Belgium (e-mail: [email protected]). Belgium (e-mail: [email protected]). Abstract: Abstract: In In this this paper, paper, an an inexact inexact nonlinear nonlinear model model predictive predictive control control scheme scheme with with reduced reduced Abstract: In this paper, an inexact nonlinear model predictive control scheme with reduced computational complexity is proposed. The presented approach exploits fixed sensitivity Abstract: In this paper, an inexact nonlinear model predictive control scheme with reduced computational complexity is proposed. The presented approach exploits fixed sensitivity computational complexity is The approach fixed sensitivity information precomputed at value. allows one online computational complexity offline is proposed. proposed. The presented presented approach exploits fixed the sensitivity information precomputed offline at aa reference reference value. This This allows exploits one to to avoid avoid the online information precomputed offline at a reference value. This allows one to avoid the computational effort resulting from the propagation of sensitivities and possibly the correspondinformation precomputed offline a propagation reference value. This allows to avoid the online online computational effort resulting fromatthe of sensitivities and one possibly the correspondcomputational resulting from propagation of the ing routine when optimal problem with aa sequential quadratic computational effort resulting from the thethe propagation of sensitivities sensitivities and possibly the correspondcorresponding condensing condensing effort routine when solving solving the optimal control control problem and withpossibly sequential quadratic ing condensingmethod. routine By when solving the the optimal control control problem with sequential quadratic programming performing aa numerical simulation of nonlinear dynamics online, ing condensing routine when solving optimal problem aa sequential quadratic programming method. By performing numerical simulation of the thewith nonlinear dynamics online, programming method. By aa numerical simulation of online, feasibility closed-loop trajectories can in to model programming method. By performing performing numerical simulation of the the nonlinear nonlinear dynamics online, feasibility of of the the closed-loop trajectories can be be preserved preserved in contrast contrast to linear linear dynamics model predictive predictive feasibility of the closed-loop trajectories can be preserved in contrast to linear model predictive control schemes. Nominal stability guarantees of the approach are derived and the effectiveness feasibility of the closed-loop trajectories can beofpreserved in contrast to linear predictive control schemes. Nominal stability guarantees the approach are derived and model the effectiveness control schemes. Nominal the of is on non-trivial control schemes. Nominal stability stability guarantees ofexample. the approach approach are are derived derived and and the the effectiveness effectiveness of the the scheme scheme is demonstrated demonstrated on aaguarantees non-trivialof example. of the scheme is demonstrated on a non-trivial example. of the scheme is demonstrated on a non-trivial example. © 2016, IFAC (International Federation of Automatic Control) Hosting by Elsevier Ltd. All rights reserved. Keywords: Keywords: nonlinear nonlinear model model predictive predictive control, control, numerical numerical methods, methods, Lyapunov Lyapunov stability, stability, Keywords: nonlinear model predictive control, numerical methods, Lyapunov embedded optimization. Keywords: nonlinear model predictive control, numerical methods, Lyapunov stability, stability, embedded optimization. embedded embedded optimization. optimization. 1. solved 1. INTRODUCTION INTRODUCTION solved with with the the direct direct multiple multiple shooting shooting method method using using a a 1. INTRODUCTION solved with the direct multiple shooting method Sequential Quadratic Programming (SQP) scheme (Bock 1. INTRODUCTION solved withQuadratic the directProgramming multiple shooting method using a Sequential (SQP) schemeusing (Bocka Quadratic Programming (SQP) scheme (Bock and In with schemes, fixed senSchemes Sequential Quadratic Programming (SQP) scheme (Bock and Plitt, Plitt, 1984). 1984). In contrast contrast with exact exact schemes, fixed senSchemes based based on on Nonlinear Nonlinear Model Model Predictive Predictive Control Control Sequential and Plitt, 1984). with exact fixed senused in the Problems Schemes Nonlinear Predictive (NMPC) can aa useful tool to aa large class and Plitt,are 1984). In contrast withQuadratic exact schemes, schemes, fixed(QP) senSchemes based on Nonlinear Model Predictive Control sitivities are usedIn incontrast the convex convex Quadratic Problems (QP) (NMPC) based can be beon useful tool Model to control control large Control class of of sitivities sitivities are used in the convex Quadratic Problems (QP) arising in each iteration. In this way, the subproblems (NMPC) can be a useful tool to control a large class of systems (Qin and Badgwell, 2000). Due to their inherent usediteration. in the convex Quadratic Problems (QP) (NMPC) can and be aBadgwell, useful tool to control large inherent class of sitivities arising inare each In this way, the subproblems systems (Qin 2000). Due toa their arising in this way, subproblems offline online update systems (Qin and 2000). Due their inherent flexibility, nonlinear dynamics, constraints arising in each each iteration. iteration. Inand thisaa cheap way, the the subproblems systems (Qin and Badgwell, Badgwell, 2000). Due to to and theirobjectives inherent can can be be pre-condensed pre-condensed offlineIn and cheap online update of of flexibility, nonlinear dynamics, constraints and objectives can be pre-condensed offline and a cheap online update of the nonlinear constraints is performed in order to recover flexibility, nonlinear dynamics, constraints and objectives can be tackled directly allowing one to naturally translate can be pre-condensed offline and a cheap online update of flexibility, nonlinear dynamics, constraints and objectives recover can be tackled directly allowing one to naturally translate the nonlinear constraints is performed in order to the nonlinear constraints is performed in order to recover a feasible, but suboptimal solution as proposed in (Bock can be tackled directly allowing one to naturally translate design requirements into mathematical statements withnonlinear performed order toinrecover can be tackled directly allowing one to naturally translate a feasible, butconstraints suboptimalissolution as in proposed (Bock design requirements into mathematical statements with- the aaet feasible, but solution proposed al., this the pre-condensed QP design into mathematical without need Moreover, feasible, butIn suboptimal solution as proposed in in (Bock (Bock design requirements intoreformulations. mathematical statements statements with- et al., 2007). 2007). Insuboptimal this last last work, work, the as pre-condensed QP is is out the therequirements need for for ad-hoc ad-hoc reformulations. Moreover, extenextenet al., 2007). In this last work, the pre-condensed QP is used as a building block of a more articulated scheme, out the need for ad-hoc reformulations. Moreover, extensive results exist in the literature that guarantee nominal al.,as2007). In this lastofwork, pre-condensed QPthe is out needexist for ad-hoc reformulations. Moreover,nominal exten- et used a building block a morethe articulated scheme, the sive the results in the literature that guarantee used as a building block of a more articulated scheme, the so called multilevel real-time iteration algorithm. sive results exist in the literature that guarantee nominal and robust stability under reasonable assumptions (Mayne used as a building block of a more articulated scheme, the sive results exist in the literature that guarantee nominal and robust stability under reasonable assumptions (Mayne so called multilevel real-time iteration algorithm. and robust stability reasonable assumptions et 2000). Due to high burden so called called multilevel multilevel real-time real-time iteration iteration algorithm. algorithm. and robust stability under reasonable assumptions (Mayne (Mayne et al., al., 2000). Dueunder to the the high computational computational burden so et al., 2000). Due to the high computational burden associated with the solution of the nonlinear and in general et al., 2000). Duesolution to theofhigh computational burden 1.1 Contribution and Outline associated with the the nonlinear and in general 1.1 Contribution and Outline associated the the and in nonconvex Optimal Control Problems (OCP), has associated with the solution solution of the nonlinear nonlinear andNMPC in general general nonconvex with Optimal Control of Problems (OCP), NMPC has 1.1 1.1 Contribution Contribution and and Outline Outline nonconvex Optimal Control Problems (OCP), NMPC has been employed in chemical indusnonconvex Optimal Controlmainly Problems (OCP), NMPC has In this paper, the inexact NMPC scheme introduced in been historically historically employed mainly in the the chemical indusbeen historically employed mainly in the chemical indusIn this paper, the inexact NMPC scheme introduced in try, where the sampling times are generally long enough been historically employed mainly the chemical indus- (Bock try, where the sampling times are in generally long enough In this paper, the inexact NMPC introduced in et al., 2007) is regarded as standalone algorithm In this paper, the inexact NMPC scheme introduced in try, where the sampling times are generally long enough (Bock et al., 2007) is regarded as aa scheme standalone algorithm (Garc´ ıa et al., 1989; Qin and Badgwell, 2003). try, where the 1989; sampling are generally (Garc´ ıa et al., Qin times and Badgwell, 2003).long enough and (Bock et al., 2007) is regarded as a standalone algorithm its stability properties are investigated. In particular, (Bock et al., 2007) is regarded as a standalone algorithm (Garc´ ıa et al., 1989; Qin and Badgwell, 2003). and its stability properties are investigated. In particular, (Garc´ ıa et al.,more 1989; Qin andnumerical Badgwell, 2003). are being aand As its properties investigated. In analysis of suboptimal solution is its stability stability properties are investigated. In particular, particular, As more more and and more efficient efficient numerical methods methods are being and a sensitivity sensitivity analysis of the the are suboptimal solution is derived derived As more and more efficient numerical methods are being developed and as the computational power available on a sensitivity analysis of the suboptimal solution is for the inexact scheme. It is shown that the cost of As more and efficient numerical power methods are being a sensitivity analysis of the solution is derived derived developed andmore as the computational available on for the inexact scheme. It suboptimal is shown that the cost of the the developed and as the computational power available on embedded systems increases, NMPC is becoming a viable for the inexact scheme. It is shown that the cost of optimization problem can be used as a Lyapunov function developed and as the computational available on for the inexact scheme. shown the cost of the the embedded systems increases, NMPC ispower becoming a viable optimization problem canItbeisused as athat Lyapunov function embedded systems increases, is viable way systems. Promising optimization problem can be used used as asequilibrium, Lyapunovsuch function in of the that embedded systemsspectrum increases,of NMPC is becoming becoming viable optimization problem be aa Lyapunov function way for for aa broader broader spectrum ofNMPC systems. Promisingaa results results in aa neighbourhood neighbourhood of can the reference reference equilibrium, such that way aa broader of Promising results have been reported for in in aa neighbourhood of equilibrium, such that can With to inexact way for broader spectrum of systems. systems. Promising(Frasch results stability in neighbourhood of the the reference reference equilibrium, such that have for been reportedspectrum for applications applications in automotive automotive (Frasch stability can be be guaranteed. guaranteed. With respect respect to other other inexact have been reported for applications in automotive (Frasch et al., 2013), renewable energy (Ferreau et al., 2011) and stability can be guaranteed. With respect to other inexact approaches based on a fixed linearization of nonlinear have reported for applications in automotive (Frasch can based be guaranteed. With respect to other inexact et al.,been 2013), renewable energy (Ferreau et al., 2011) and stability approaches on a fixed linearization of nonlinear et al., renewable energy (Ferreau et 2011) robotics (Diehl et where sampling times in the approaches based on linearization of Nicolao al., the scheme et al., 2013), 2013), renewable energy (Ferreau et al., al., 2011) and approaches based on aaet fixed linearization of nonlinear nonlinear robotics (Diehl et al., al., 2006), 2006), where sampling times in and the constraints constraints (De (De Nicolao etfixed al., 2000), 2000), the proposed proposed scheme robotics (Diehl et sampling millisecond time-scale need be constraints (De Nicolao et al., 2000), the proposed scheme has the advantage of guaranteeing recursive feasibility of robotics (Diehl et al., al., 2006), 2006), where sampling times times in in the the constraints (De Nicolao et al., 2000), the proposed scheme millisecond time-scale need to towhere be met. met. has the advantage of guaranteeing recursive feasibility of millisecond time-scale need to be met. has the advantage of guaranteeing recursive feasibility of the suboptimal solution. millisecond time-scale need to be met. has the advantage of guaranteeing recursive feasibility of the suboptimal solution. In this work, motivated by these considerations, an apIn this work, motivated by these considerations, an ap- the suboptimal solution. the suboptimal solution. In this work, motivated by these considerations, an approximate scheme with reduced computational complexity In this work, motivated by these considerations, an ap- The proximate scheme with reduced computational complexity The theoretical theoretical statements statements are are then then illustrated illustrated on on a a nonnonproximate with reduced computational complexity is and stability properties are The illustrated nonnumerical example. The inexact scheme proximate scheme with reduced computational complexity trivial The theoretical theoretical statements are then illustrated onis a comnonis analyzed analyzedscheme and its its stability properties are investigated. investigated. trivial numericalstatements example. are Thethen inexact schemeon isa comis analyzed and its stability properties are investigated. In the setup considered, the optimal control problem is trivial numerical example. The inexact scheme is compared with a second approach that uses instead exact is its stability are investigated. numerical example. The that inexact comIn analyzed the setupand considered, the properties optimal control problem is trivial pared with a second approach usesscheme insteadis exact In the setup the control problem pared with second approach that uses instead potentially InThis the research setup considered, considered, theby optimal optimal control problem is is derivatives. pared with aaDespite second the approach that large uses suboptimality instead exact exact was supported the EU via ERC-HIGHWIND derivatives. Despite the potentially large suboptimality research was supported by the EU via ERC-HIGHWIND derivatives. Despite the potentially large suboptimality of the inexact scheme, it is shown that the This (259 166), FP7-ITN-TEMPO (607 957), and via H2020-ITN-AWESCO This research was supported by the EU ERC-HIGHWIND derivatives. Despite the potentially large suboptimality of the inexact scheme, it is shown that the condition condition (259 166), FP7-ITN-TEMPO (607 and via H2020-ITN-AWESCO This research was supported by957), the EU ERC-HIGHWIND of the scheme, it shown the condition necessary to stability a (642 682). FP7-ITN-TEMPO R. Quirynen holds a PhD fellowship of the Research (259 (607 957), and of the inexact inexact scheme, it is is guarantees shown that thathold the in condition necessary to obtain obtain stability guarantees hold in a nonnon(642 166), 682). FP7-ITN-TEMPO R. Quirynen holds a PhD of the Research (259 166), (607 957), fellowship and H2020-ITN-AWESCO H2020-ITN-AWESCO necessary to obtain stability guarantees hold in Foundation –R.Flanders (FWO). negligible neighbourhood of the reference. (642 682). Quirynen holds a PhD fellowship of the Research necessary to obtain stability a nonnonFoundation (FWO). negligible neighbourhood of theguarantees reference. hold in a (642 682). –R.Flanders Quirynen holds a PhD fellowship of the Research Foundation negligible Foundation – – Flanders Flanders (FWO). (FWO). negligible neighbourhood neighbourhood of of the the reference. reference. Copyright © 2016, 2016 IFAC 53 Hosting by Elsevier Ltd. All rights reserved. 2405-8963 © IFAC (International Federation of Automatic Control) Copyright © 2016 IFAC 53 Copyright © 2016 IFAC 53 Peer review under responsibility of International Federation of Automatic Copyright © 2016 IFAC 53 Control. 10.1016/j.ifacol.2016.10.139

IFAC NOLCOS 2016 54 Andrea Zanelli et al. / IFAC-PapersOnLine 49-18 (2016) 053–058 August 23-25, 2016. Monterey, California, USA

The paper is organized as follows: Section 2 briefly summarizes the main ideas and concepts that will be used to derive the stability proof. The inexact SQP scheme is introduced and a standard nominal stability argument for NMPC is recalled. In Section 3 the main results are presented. The sensitivity of the cost associated with the suboptimal trajectories with respect to the initial condition is derived and used to build a Lyapunov function. Finally, Section 4 contains an illustrative example in which the inexact scheme is applied to a nonlinear dynamical system in order to validate the theoretical results.

2.2 The Inexact Scheme If the Nonlinear Program (NLP) in (1) is solved with a Gauss-Newton SQP scheme (Bock, 1983), the resulting subproblem at iteration k takes the form min ,··· ,x

x0 N u0 ,··· ,uN −1

xi+1 = Aki xi + Bik ui + cki , xN = 0,

cki = f (xki , uki ) − Aki xki − Bik uki , with dynamics linearized around the current estimate of the solution xk := (xk0 , . . . , xkN ) and uk := (uk0 , . . . , ukN −1 ):

2.1 Optimal Control Problem Statement

Aki =

Consider the following finite-horizon discrete-time optimal control problem:

x0 N u0 ,··· ,uN −1

s.t.

N −1 1 T xi Qxi + uTi Rui 2 i=0

x0 − x ¯0 = 0 xi+1 = f (xi , ui ), xN = 0,

i∈I

where

In the following, the optimal control problem formulation and the inexact scheme are introduced. Moreover, a standard nominal stability argument for NMPC is recalled as the stability proof for the inexact scheme will be built on it.

min ,··· ,x

(2)

x0 − x ¯0 = 0

s.t.

2. PRELIMINARIES

V ∗ (¯ x0 ) =

N −1 1 T x Qxi + uTi Rui 2 i=0 i

∂f k k (x , u ), ∂x i i

Bik =

∂f k k (x , u ). ∂u i i

If the solution of the above QP is taken as next solution guess xk+1 , uk+1 , it can be shown that such a scheme, together with an appropriate globalization strategy (Han, 1977), is guaranteed to recover a local minimum of (1). A possible way of solving the subproblems in (2) is by eliminating the state variables x resulting in a smaller and dense problem that can be efficiently solved by stateof-the-art QP solvers (Ferreau et al., 2014). However, the computational burden introduced by the so called condensing (Bock, 1983) can be rather high, especially for problems with a long prediction horizon N or a large number of states (Vukov et al., 2013). A second source of computational effort, regardless of the QP formulation used, is the fact that the matrices Aki and Bik need to be computed at every iteration. Numerical results that take into account the recent advances in both integration and condensing routines show that the latter can be rather computationally expensive (Quirynen et al., 2013).

(1)

i∈I

with optimization variables xi ∈ Rnx and ui ∈ Rnu , x u positive definite cost matrices Q ∈ Sn++ and R ∈ Sn++ and I := {0, · · · , N − 1}. Without loss of generality, the origin is considered as the reference equilibrium, i.e f (0, 0) = 0. Such a formulation can be obtained from a continuoustime problem by applying the direct multiple shooting method (Bock and Plitt, 1984). In this work, an OCP formulation with a zero terminal constraint and without inequality constraints is considered in order to simplify the derivation of stability guarantees for the inexact scheme. Similar arguments could be used to extend the proof to a more general OCP formulation without zero terminal constraints and with inequality constraints.

The main motivation behind an inexact scheme that exploits fixed derivatives is to avoid the computational burden associated with condensing and sensitivity generation by using the Jacobians evaluated at the reference:

Problem (1) can be reliably and efficiently solved online in an embedded setting with state-of-the-art solvers for NMPC (Diehl et al., 2009). While other approaches exist, a possible way of doing so is by exploiting an SQP scheme that solves a series of convex Quadratic Programs (QP) that locally approximate the original problem. In the next section, the main idea behind such a method is briefly recalled and the modifications that lead to the proposed inexact approach are introduced.

Aki = A =

∂f (0, 0), ∂x

Bik = B =

∂f (0, 0). ∂u

A convergence proof of such an SQP scheme to a feasible, but suboptimal solution can be found in (Bock et al., 2007). Using fixed derivatives will allow one to avoid sensitivity generation and condensing routines while solving the optimal control problem online, hence reducing the cost of the overall scheme to the one associated with a pre-condensed QP solve and a numerical simulation of the dynamics needed to update the terms cki in (2).

Throughout this paper the following assumption will be made. Assumption 1. There exists a local minimizer that satisfies Linear Independence Constraint Qualification (LICQ) and Second Order Sufficient Conditions (SOSC) of optimality (Nocedal and Wright, 2006) for the OCP in (1) for any x ¯0 in a non-empty neighbourhood Ω of the origin.

2.3 Standard Nominal Stability Argument Classical nominal and robust stability results hold for the system controlled using the optimal solution of the NLP in (1) in a receding horizon fashion (Scokaert et al., 1997). In 54

IFAC NOLCOS 2016 August 23-25, 2016. Monterey, California, USA Andrea Zanelli et al. / IFAC-PapersOnLine 49-18 (2016) 053–058

the following, a common argument used to derive stability guarantees based on Lyapunov functions is reported.

x0 − x ¯0

Ru0 + B λ1

f (xk0 , uk0 )

x1 − .. . xN − λ N − λt

2

U (F (xk )) − U (xk ) ≤ −c xk .

V (xk ) ≥

uk0 )

=0

(3)

.. . =0 = 0.

=0 T˜

x ˜1 − f (˜ x0 , u ˜0 ) .. . x ˜N ˜t ˜N − λ −λ

=0 =0 =0

(4)

.. . =0 = 0.

The solution recovered by the inexact scheme is feasible, as the equality constraints appear unchanged in the nonlinear root-finding problem (4). The approximation introduced by fixing the sensitivities affects instead the optimality of the solution. For this reason, when analyzing the stability of the closed-loop scheme, the question naturally arises of how this approximation degrades the properties of the optimal cost which is used as a Lyapunov function in exact schemes.

Consider the control profile u∗ (xk ) obtained by solving problem (1) with x ¯0 = xk . As the optimal solution satisfies x∗N = 0, a feasible solution to problem (1) formulated with x ¯0 = xk+1 = f (xk , u∗0 (xk )) can be obtained by replaying the controls computed at time k: u ˆ := u∗1 (xk ), u∗2 (xk ), · · · , u∗N −1 (xk ), 0 ,

The nominal stability proof will be based on a sensitivity analysis of the solution to the nonlinear root-finding problem (4) with respect to the initial value x ¯0 at the origin. These considerations will be then exploited to prove that, in a neighbourhood of the origin, the standard Lyapunov stability arguments must hold for the inexact scheme as well.

which would give rise to a feasible and suboptimal solution of (1) with cost V ∗ (xk ) − xTk Qxk − u∗0 (xk )T Ru∗0 (xk ). The optimal cost V ∗ (xk+1 ) can only be better than this, thus:

3.1 Sensitivity of the Suboptimal Solution

V ∗ (xk+1 ) − V ∗ (xk ) ≤ −xTk Qxk − u∗0 (xk )T Ru∗0 (xk ),

First, a result on the sensitivity of both optimal and suboptimal solutions with respect to x ¯0 will be derived. Let w∗ = (x∗ , u∗ , λ∗ ) be the optimal solution associated with the original NLP (1), hence satisfying the following optimality conditions:

or, equivalently, there exists a constant c such that 2

− B(u0 −

˜0 Q˜ x 0 + A λ1 − λ ˜1 R˜ u0 + B T λ

.

V ∗ (xk+1 ) − V ∗ (xk ) ≤ −c xk 2 ,

− A(x0 −

¯0 x ˜0 − x

It is possible to show that the optimal cost in (1) is a valid Lyapunov function for the system controlled by applying the optimal input in a receding horizon fashion. To this end, let V ∗ (xk ) denote the optimal cost associated with problem (1) for x ¯0 = xk . Then, due to positive definiteness of Q and R, it immediately follows that there exists a constant a ≥ 0 such that 2 a xk 2

=0 xk0 )

where λ0 , · · · , λN , λt are the Lagrange multipliers associated with the equality constraints. When convergence is achieved xk+1 = xki := x = uki := u ˜i , uk+1 ˜i and i i k+1 k := ˜ λi = λi λi and the following holds:

2

∗

=0

T

Theorem 2. Let F : Rnx → Rnx be the function that describes the dynamics of the closed-loop system and let F satisfy a Lipschitz condition in an open neighbourhood of the origin, with F (0) = 0. The origin is a locally exponentially stable fixed point of xk+1 = F (xk ), if there exist a function U (xk ) and strictly positive constants a, b, c and ρ such that, for all xk ∈ Bρ , with Bρ := {x ∈ Rn : x ≤ ρ}, the following holds: 2

=0 T

Qx0 + A λ1 − λ0

Consider the following result adapted from (Scokaert et al., 1997):

a xk ≤ U (xk ) ≤ b xk

55

∀xk ∈ Bρ .

Finally, Assumption 1 can be used to derive an appropriate constant b > 0. The proof is not reported here for brevity and the interested reader is referred to (Scokaert et al., 1997) for a detailed discussion.

x∗0 − x ¯0 ∗ Qx0 + ∇x f (x∗0 , u∗0 )λ∗1 − λ∗0 Ru∗0 + ∇u f (x∗0 , u∗0 )λ∗1 x∗1 − f (¯ x0 , u∗0 ) .. . x∗N − λ∗N − λ∗t

3. STABILITY OF THE INEXACT NMPC SCHEME In the following, a sensitivity analysis of the solution recovered by the inexact scheme will be derived. In order to do so, regard the first order optimality conditions (Nocedal and Wright, 2006) for the QP subproblem (2) at iteration k, with fixed derivatives A and B:

=0 =0 =0 =0 .. .

(5)

=0 = 0,

where ∇x f and ∇u f denote the transpose of the Jacobian of f with respect to x and u respectively. Equations (4) 55

IFAC NOLCOS 2016 56 Andrea Zanelli et al. / IFAC-PapersOnLine 49-18 (2016) 053–058 August 23-25, 2016. Monterey, California, USA

and (5) will be respectively referred to in the compact forms q˜(¯ x0 , w) ˜ =0

and

x ¯0

q ∗ (¯ x0 , w∗ ) = 0,

˜ is the vector containing the concatewhere w ˜ = (˜ x, u ˜, λ) nated suboptimal primal and dual solution. The following result holds: Lemma 3. The magnitude of the deviation of the suboptimal solution w ˜ from the optimal one w∗ is of second order in the magnitude of the norm of the initial condition x ¯0 : 2 w(¯ ˜ x0 ) − w∗ (¯ x0 ) = O ¯ x0 .

z˜1 z∗1

0

and

∂w ˜ ∂ q˜ −1 ∂ q˜ =− , ∂x ¯0 ∂w ˜ ∂x ¯0

∂ q˜ ∂q ∗ (0, 0) = (0, 0) ∂x ¯0 ∂x ¯0

and

B=

Using the fact that z ∗ is an optimal solution, i.e. ∇z L(z ∗ , λ∗ ) = 0, the following is obtained for z = z˜: 2 z − z ∗ ) J(˜ z ) = L(˜ z , λ∗ ) = L(z ∗ , λ∗ ) + O (˜ 2 = J(z ∗ ) + O (˜ z − z ∗ ) .

due to the fact that A=

∂f (0, 0) ∂x

∂f (0, 0). ∂u

2 x0 ) = O ¯ x0 , Together with the fact that w(¯ ˜ x0 )− w∗ (¯ this implies that the suboptimality grows with the fourth power of the norm of the initial condition: 4 J (˜ z (¯ x0 )) − J (z ∗ (¯ x0 )) = O ¯ x0 .

This implies that both the constant and linear term of the Taylor series of w(¯ ˜ x0 ) and w∗ (¯ x0 ) coincide at the origin x ¯0 = 0. 3.2 Suboptimality of the Inexact Scheme In the following the OCP in (1) will be regarded in the compact form min

J(z)

s.t.

G(¯ x0 , z) = 0,

z

N +1

The Taylor series of the Lagrangian around the optimal solution (z ∗ , λ∗ ) reads 2 L(z, λ∗ ) = L(z ∗ , λ∗ )+∇z L(z ∗ , λ∗ )T (z−z ∗ )+O z − z ∗ .

∗

and

N

L(z, λ) := J(z) − λT G(z).

∂q and ∂∂wq˜˜ is guaranteed in a where invertibility of ∂w ∗ neighbourhood of the origin due to Assumption 1. It is possible to see that the derivative matrices coincide at the origin, i.e

∂ q˜ ∂q ∗ (0, 0) = (0, 0) ∂w ˜ ∂w∗

1

Fig. 1. The sketch describes the procedure followed in the proof of Theorem 5. The cost associated with the suboptimal trajectory starting at x ¯0 is first compared with the one associated with the optimal trajectory starting at x ˜1 - in red. Then, due to Lemma 4, the cost increases with the fourth power of ˜ x1 going from z1∗ to z˜1 .

Proof. By exploiting the implicit function theorem, it is possible to express the sensitivity of optimal and suboptimal solutions with respect to the initial condition x ¯0 around the origin respectively as ∂w∗ ∂q ∗ −1 ∂q ∗ =− ∗ ∂x ¯0 ∂w ∂x ¯0

x ˜1

3.3 A Lyapunov Function for the Inexact Scheme The observations made in the previous lemmata and theorems will be now exploited in order to build a Lyapunov function for the system controlled by applying the suboptimal solution in a receding horizon fashion. The following theorem states the main stability result for the inexact scheme:

(6)

where z := (x, u) has been introduced to refer to the primal solution in compact form. The result from Lemma 3 can be used to quantify the suboptimality of w(¯ ˜ x0 ) as a function of ¯ x0 . In particular, the following holds:

Theorem 5. The origin is a locally exponentially stable equilibrium for the closed-loop system obtained by applying the suboptimal control input u ˜0 (¯ x0 ).

Lemma 4. The deviation of the cost J(˜ z ) associated with the suboptimal solution from the optimal one J(z ∗ ) is of fourth order in ¯ x0 : 4 J (˜ z (¯ x0 )) − J (z ∗ (¯ x0 )) = O ¯ x0 .

Proof. In order to prove the above result, it is possible to rely on Theorem 2. It will be shown that the cost V˜ (¯ x0 ) := J (˜ z (¯ x0 ))

Proof. The proof exploits the fact that z˜ is a feasible solution, hence G(¯ x0 , z˜) = 0. Thus, objective and Lagrangian coincide J(˜ z ) = L(˜ z , λ), where:

is a valid Lyapunov function. For a given initial condition x ¯0 , the following holds: 56

IFAC NOLCOS 2016 August 23-25, 2016. Monterey, California, USA Andrea Zanelli et al. / IFAC-PapersOnLine 49-18 (2016) 053–058

1 T ¯ Q¯ x0 + V ∗ (˜ x1 ) V˜ (¯ x0 ) ≥ x 2 0

∆V < 0 ∆V ≥ 0

and, due to Lemma 4,

failure

x2

4 ¯T0 Q¯ x0 + V˜ (˜ x1 ) + O ˜ x1 . V˜ (¯ x0 ) ≥ x

This last inequality shows that V˜ (·) decreases for ˜ x1 small enough. The trajectories taken into account to build the proof are illustrated in Figure 1. As the primal solution grows linearly with x ¯0 , i.e. x ˜1 = O (¯ x0 ), the following holds: 4 V˜ (¯ x0 ) ≥ x ¯T0 Q¯ x0 + V˜ (˜ x1 ) + O ¯ x0 .

1

x2

1

0

−1

0

−1 −1

0

−1

1

x1

0

1

x1

Fig. 2. ∆V for exact (left) and inexact (right) scheme. Decreasing cost ∆V < 0 in green, non-decreasing cost ∆V ≥ 0 in red, maximum number of iterations reached or infeasible SQP step in blue. For the exact scheme, the cost is guaranteed to decrease by construction. For the inexact one, the cost can be nondecreasing due to the approximation introduced. However, ∆V < 0 holds in a non-negligible region around the origin.

Hence, there exist strictly positive constants ρ and c such that 2 V˜ (˜ x1 ) − V˜ (¯ x0 ) ≤ −c ¯ x0 ,

57

∀¯ x 0 ∈ Bρ .

Moreover, noting that J(˜ z (¯ x0 )) = z˜(¯ x0 )T H z˜(¯ x0 ),

300

where H is the Hessian of the cost J, together with the well-posedness of the problem introduced in Assumption 1, makes it possible to define a constant b > 0 such that

%

200 100 0

2 x0 . V˜ (¯ x0 ) ≤ b ¯

2

2 x0 Finally, a positive constant a such that V˜ (¯ x0 ) ≥ a ¯ can be derived as previously done for Theorem 2.

2 0 x2

0 −2

−2

Fig. 3. Relative suboptimality % =

x1 ˜ −V ∗ V V∗

· 100. The inexact scheme gives rise to largely suboptimal policies in certain regions of the state space, however, as shown in Figure 2 stability is guaranteed in a non-negligible region of the state space.

4. ILLUSTRATIVE EXAMPLE In this section, the inexact scheme will be applied to a non-trivial example and it will be shown that the conditions necessary for stability hold in a non-negligible neighbourhood of the origin, as shown in Theorem 5.

the OCP is solved with both methods. For any initial condition x ¯0 and input computed u0 , let ∆V be the cost difference defined as

An OCP of the form in (1) is considered, where f (·) represents discretized dynamics obtained by applying the explicit Runge-Kutta scheme of order 4 with fixed step-size h = 0.1 to the following ordinary differential equation: 3 x + (1 + x2 )u1 x˙ = fc (x, u) := 1 3 . x2 + x1 + u2

∆V := V (f (¯ x0 , u0 )) − V (¯ x0 ). Figure 2 shows the regions where ∆V < 0, ∆V ≥ 0 or a failure is encountered, comparing the results obtained with the two schemes. In particular, it is shown that stability guarantees can be obtained for the inexact scheme in a rather large neighbourhood of the origin.

A control horizon T = 1 is used and the trajectories are discretized using N = 10 shooting nodes. The cost matrices have been chosen to be equal to the identity matrix Q = R = I2 .

˜

∗

The relative suboptimality % = V V−V∗ · 100 of the trajectories obtained with the inexact scheme is plotted in Figure 3 for different initial conditions. The scheme can become largely suboptimal for points sufficiently distant from the origin, where the fixed sensitivities might not be good approximations of the exact ones. However, closed-loop feasibility is always guaranteed and stability can be guaranteed in a non-negligible region as illustrated in Figure 2.

The full-step Gauss-Newton algorithm will be used for the comparison and both the exact and inexact SQPtype algorithms will be iterated until either convergence or failure. Two possible causes of failure are taken into account: either the algorithm has not converged after a maximum number of iterations τmax = 100 or an infeasible QP has arisen.

Finally, in Figure 4, points on a line that passes through the origin parametrized with the scalar coordinate xd are considered [x1 x2 ] = xd [1 1.71] and the absolute

The state space region X = {−1.2 ≤ x1 ≤ 1.2, −1.2 ≤ x2 ≤ 1.2} is discretized with an equally spaced grid and 57

IFAC NOLCOS 2016 58 Andrea Zanelli et al. / IFAC-PapersOnLine 49-18 (2016) 053–058 August 23-25, 2016. Monterey, California, USA

suboptimality V˜ − V ∗ is compared with 12 x ¯T0 Q¯ x0 . Due to Lemma 4, the suboptimality is of fourth order in ¯ x0 , hence the inequality V˜ − V ∗ < 12 x ¯T0 Q¯ x0 holds in a neighbourhood of the origin. 0.6

Predictive Control, volume 26 of Progress in Systems Theory, 3–23. Birkh¨auser, Basel Boston Berlin. Diehl, M., Ferreau, H.J., and Haverbeke, N. (2009). Nonlinear model predictive control, volume 384 of Lecture Notes in Control and Information Sciences, chapter Efficient Numerical Methods for Nonlinear MPC and Moving Horizon Estimation, 391–417. Springer. Diehl, M., Findeisen, R., and Allg¨ower, F. (2007). A Stabilizing Real-time Implementation of Nonlinear Model Predictive Control. In L. Biegler, O. Ghattas, M. Heinkenschloss, D. Keyes, and B. van Bloemen Waanders (eds.), Real-Time and Online PDEConstrained Optimization, 23–52. SIAM. Diehl, M., Bock, H.G., Diedam, H., and Wieber, P.B. (2006). Fast Motions in Biomechanics and Robotics, volume 340, chapter Fast Direct Multiple Shooting Algorithms for Optimal Robot Control, 65–93. Springer. Ferreau, H.J., Kirches, C., Potschka, A., Bock, H.G., and Diehl, M. (2014). qpOASES: A parametric active-set algorithm for quadratic programming. Mathematical Programming Computation, 6(4), 327–363. Ferreau, H.J., Houska, B., Geebelen, K., and Diehl, M. (2011). Real-time control of a kite-model using an autogenerated nonlinear mpc algorithm. In Proceedings of the 18th IFAC World Congress. Frasch, J.V., Gray, A.J., Zanon, M., Ferreau, H.J., Sager, S., Borrelli, F., and Diehl, M. (2013). An Autogenerated Nonlinear MPC Algorithm for Real-Time Obstacle Avoidance of Ground Vehicles. In Proceedings of the European Control Conference, 4136–4141. Garc´ıa, C., Prett, D., and Morari, M. (1989). Model Predictive Control: Theory and Practice – a Survey. Automatica, 25, 335–348. Han, S.P. (1977). A Globally Convergent Method for Nonlinear Programming. JOTA, 22, 297–310. Mayne, D., Rawlings, J., Rao, C., and Scokaert, P. (2000). Constrained model predictive control: stability and optimality. Automatica, 26(6), 789–814. Nocedal, J. and Wright, S. (2006). Numerical Optimization. Springer Series in Operations Research and Financial Engineering. Springer, 2 edition. Qin, S. and Badgwell, T. (2000). An overview of nonlinear model predictive control applications. In F. Allg¨ ower and A. Zheng (eds.), Nonlinear Predictive Control, volume 26 of Progress in Systems Theory, 370–392. Birkh¨auser, Basel Boston Berlin. Qin, S. and Badgwell, T. (2003). A survey of industrial model predictive control technology. Control Engineering Practice, 11, 733–764. Quirynen, R., Gros, S., and Diehl, M. (2013). Efficient NMPC for nonlinear models with linear subsystems. In Proceedings of the 52nd IEEE Conference on Decision and Control, 5101–5106. Scokaert, P., Rawlings, J., and Meadows, E. (1997). Discrete-time Stability with Perturbations: Application to Model Predictive Control. Automatica, 33(3), 463– 470. Vukov, M., Domahidi, A., Ferreau, H.J., Morari, M., and Diehl, M. (2013). Auto-generated Algorithms for Nonlinear Model Predictive Control on Long and on Short Horizons. In Proceedings of the 52nd Conference on Decision and Control (CDC).

V˜ − V ∗

1 T ¯0 Q¯ x0 2x

0.4

0.2

0 −0.2

−0.1

0

0.1

0.2

xd Fig. 4. Absolute suboptimality as a function of the directional coordinate xd . Lemma 4, the cost associated

V˜ − V ∗ is compared with 12 x ¯T x0 . Due to 0 Q¯ suboptimality is dominated by the quadratic with stage 0 of the OCP.

5. CONCLUSIONS AND OUTLOOK An efficient inexact scheme based on the ideas presented in (Bock et al., 2007) has been introduced and nominal stability guarantees have been derived. The scheme exploits fixed sensitivity information obtained by evaluating the derivatives at the reference and allows to avoid computationally intensive condensing and sensitivity generation routines. It has been shown that the cost function in problem (1), commonly used for the stability proof of exact schemes, can be used as a Lyapunov function for the inexact approach as well. An illustrative example is given in which it is shown how the necessary conditions for the cost to be usable as a Lyapunov function hold in a rather large region around the origin. Future work will include an efficient implementation of the algorithm and extensive benchmarking. Moreover the stability results could be extended in order to take into account an approximate scheme where a single QP is solved per iteration as proposed in (Diehl et al., 2007). REFERENCES Bock, H. (1983). Recent advances in parameter identification techniques for ODE. In P. Deuflhard and E. Hairer (eds.), Numerical Treatment of Inverse Problems in Differential and Integral Equations, 95–121. Birkh¨auser, Boston. Bock, H., Diehl, M., Kostina, E., and Schl¨ oder, J. (2007). Constrained Optimal Feedback Control of Systems Governed by Large Differential Algebraic Equations. In L. Biegler, O. Ghattas, M. Heinkenschloss, D. Keyes, and B. van Bloemen Waanders (eds.), Real-Time and Online PDE-Constrained Optimization, 3–22. SIAM. Bock, H. and Plitt, K. (1984). A multiple shooting algorithm for direct solution of optimal control problems. In Proceedings 9th IFAC World Congress Budapest, 242– 247. Pergamon Press. De Nicolao, G., Magni, L., and Scattolini, R. (2000). Stability and Robustness of Nonlinear Receding Horizon Control. In F. Allg¨ ower and A. Zheng (eds.), Nonlinear 58

An Efficient Inexact NMPC Scheme with Stability and Feasibility Guarantees

An Efficient Inexact NMPC Scheme with Stability and Feasibility Guarantees

Recommend Documents