Linear quadratic differential games with cheap control

Linear quadratic differential games with cheap control

Systems St Control North-Holland Letters 8 (1986) 181-188 181 Linear auadratk differential with cheap control Ian R. PETERSEN Depurment of Elect...

571KB Sizes 26 Downloads 160 Views

Systems St Control North-Holland

Letters

8 (1986)

181-188

181

Linear auadratk differential with cheap control Ian R. PETERSEN Depurment

of Electricul

and

Defence

Force

Academy,

Received Revised

24 March 1986 11 July 1986

Electronic

Campbell

ACT

This paper considers a linear game in which the weighting on the allowed to approach zero. It is shown mum phase condition is satisfied then will approach zero as the weighting on approaches zero.

Abstract

Keywords:

control,

Differential Cheap control.

games,

Riccati

Engineering, 2600,

Australiun

Aus~alia

quadratic differential minimizing control is that if a certain minithe value of the game the minimizing control

games

U

trol weighting on the minimizing player approaches zero. As in the case of references [l-3], this result has an important application in the stabilization of uncertain systems using high gain observer based feedback. Indeed, in [7] the uncertain systems under consideration are required to satisfy a certain ‘matching’ condition. The results of this paper will enable that matching condition to be significantly weakened.

2. System assumptions and definitions equations,

High

gain

We consider a linear quadratic differential game in which the cost functional is

1. Introduction

In recent years, a number of papers have dealt with the linear quadratic regulator problem with cheap control; see [l-3]. In these papers, it is shown that if a certain minimum phase condition is satisfied, then the optimal cost for the regulator problem approaches zero as the control weighting approaches zero. This result is not only useful in the theory of high gain linear regulators but it also forms the basis of the ‘asymptotic loop recovery’ approach to robust controller design; e.g. see [4-71. In particular, [4] uses the results of [l] to show that the robustness properties of a linear quadratic regulator using full state feedback can be recovered using a high gain observer. Furthermore, reference, [7] uses the results of [2] to show that a high gain observer can often be used in place of state feedback when stabilizing an uncertain linear system. In this paper, we consider the use of cheap control in a linear quadratic differential game rather than in the linear quadratic regulator problem; e.g., see [8]. In fact our main result generalizes the main result of [2] by showing that if a certain minimum phase condition is satisfied then the value of the game will approach zero as the con0167-6911/86/$3.50

6 1986, Elsevier

Science

Publishers

B.V.

-u(t)‘u(t)

i

and the system under consideration by the state equations z(t) =/ix(t)

+l?u(t)

+DJ(t),

y(r) = Cx(t).

dt

(2.1)

is described

x(0) =x0, (a

In this differential game, x(t) E R” is the state, u(t) E R”’ is the minimizing control, u(t) E RJ’ is the maximizing control, y(t j E Rq is the output and q >.O is the control weighting parameter. 2.1 (see also [8]). Given any x0 E R”, the differential game defined by cost functional Jq( -) and system (2) is said to have uafue Jg* if given any E > 0, there exist control laws u&x) and u,(x) such that the following conditions hold: Definition

6) Jqbe7u)GJ~*+E for all control strategies u(e); (ii)

J,(u, Ue)2Jq*-E

for all control strategies u(a). (North-Holland)

182

I. R. Petersetr

/ Lineur

qrtadrutic

Associated with the differential game described above is the algebraic Riccati equation A’P + PA - q2PBB’P + PDD’P + C’C = 0.

(2.2)

It is shown in [8] that if this Riccati equation has a minimal positive-definite symmetric solution P then given any x0 E R”, the value of the game is given by J; =x&&.

Furthermore, if the game has a finite value for all x0 E R” then this Riccati equation will have a positive-definite symmetric solution. Throughout this paper, we will assume that there exists a positive-definite symmetric solution to Riccati equation (2.2). However, it may be of interest to utilize some results concerning the existence of solutions to this Riccati equation. Indeed, Riccati equation (2.2) has a positive definite solution P if and only if the Riccati equation AS + SA’ - q2BB’ + DD’ + SC’CS = 0

has a positive-definite solution S = P-‘. Reference [9] gives conditions under which this Riccati equation has a positive-definite symmetric solution. These results are stated in terms of the eigenvalues and generalized eigenvectors of the associated Hamiltonian matrix A MADD’-q2BB’

C’C -A’

games

with

cheup corltrol

corresponding differential

game is given by

J;;” = x;Bx, where F is the minimal positive-definite ric solution to Riccati equation (2.4).

symmet-

Assumptions. Throughout the sequel, it will be assumed that the following assumptions are satisfied: Al. [A, B] is stabilizable. A2. [C, A] is detectable. A3. B is full rank. A4. C is full rank. A5. The transfer function matrix

G(s) = C(sI-A)-lB is right invertible and strictly minimum phase, i.e.,

1

=n+q,

Re(s)>O.

Remark. It is straightforward to verify that for any x0 E R’l, the value of the differential game satisfies Jq* > 0 and furthermore, J* is monotone non-increasing as q + 00. We w& show that under the above assumptions the value of the game will have the following property:

lim J4* = 0 9-00

1

for all x0 E R”.



In the sequel, it will be convenient to consider an augmented cost functional rather than the cost functional J,(u, u). Thus, we define the cost functional

where Q > 0 is a given positive-definite weighting matrix. Associated with this cost functional is the Riccati equation

3. The main result

The following theorem is the main result of this paper. Theorem 3.1. Suppose that the system (X) satisfies assumptions Al-A5 and that Riccari equarion (2.4) has a posiriue-definite symmetric solution for all q sufficiently large. Then giuen any x0 E R”,

lim J;;“=O. 9-m

Before proving this theorem, we first present some additional lemmas and notation.

AtP,i- PA - q2PBB’P + PDD’P -I- C’C +Q/q2=0.

dif/erettrial

(2.4

As above, given any x0 E R” the value of the

Notation. Let 2% R” be the maximal A, B-invariant subspace contained in ker C; e.g. see [lo].

I. R. Petersen

/ Linear

qurrdrntic

Furthermore, let F’(9) denote the set of all matrices F such that 9’ is (A + BF)-invariant. Given any FE .9( 9) let A, g A + BF. It follows from Section 0.7 of [lo] that there exists a state space transformation on (2) such that we can write (3.1) Furthermore, since YC ker C, it follows that C will be of the form C = [0 C,]. Lemma 3.1. Suppose that the system (X) satisfies

assumptions Al-A5. Then there exists a matrix F, E %( 9’) with the following property: Suppose A,, = A + BF, has been transformed into the form (3.1). Then the block A, is a stability matrix, i.e., all of its eigenvalues satisfy Re( h) < 0.

diJererhd

183

games with cheap control

solution to (2.4) with q = q. Using the definition of A,, it follows that A;jj

+ I’AFO - F,B+ - PBF, - ~~~~~~~

+i;DD’~+C’C+Q/~2=0. Let x E R” be given such that B’l?x = 0, x # 0. It follows that

= -x’Qx/q2

< 0.

Hence, Finsler’s Theorem (see [12]) implies that there exists a constant u > 0 such that the matrix -Q=A$j+ij~~~-

apBB’F + PDD’F + C’C

is negative-definite. Let S = I?-’ and 0 = SOS. It follows that S satisfies the Riccati equation A,S + SA;, - aBB’ + DD’ + SC’CS + 0 = 0.

Proof. It follows

from assumption zeros of the system matrix A-SI [ c

B 0

A5 that the

(3.2) We now partition (3.1) as follows ‘:

1

satisfy Re(s) -Z 0. However, using Lemma 3 of [ll] and Sections 5.4 and 5.5 of [lo], we conclude that the eigenvalues of A, are either zeros of the system matrix or freely assignable by choice of FE .9(Y), Hence, there exists an F, Ed such that A, is a stability matrix. In the next lemma, we will assume that the matrix F, has been constructed according to Lemma 3.1 and that the system (2) has been transformed so that A, is of the form (3.1).

the matrix S to conform with

Sl s3 s= [ S’ I s,>o, 3

s,>o.

S2’

Similarly, we also partition the matrices B, C and Q as follows: B= [;:I,

D= [;;I,

6, o;8, o2 1,

&O,

C=[O

C,],

Q220.

Lemma 3.2. Suppose that Riccati equation (2.4) has a positive-definite symmetric solution for all q z q. Then there exists a matrix Fl = [0 F] (partition conforming with (3.1)) such that the following condition holds: ‘Let

Thus, the substituting into (3.2) and taking the (2,2) block, we obtain the Riccati equation

F=F,+F,

Then the Riccati equation

We now let F= - $aBiSF1, A, = A, - B,P and a3 = A, + B,l? It follows that S, satisfies the Riccati equation

A’,P + PA, + PDD’P + C’C + F’F/q2

A,S2+S2A;,+D2Di+S2C;C2S2+~,=0.

and A,=A+BF.

A,& + $A;-

oB,Bi + D2Di + S2C;C2S2

+Q2=o.

(3.3)

+Q/q2=0 has a positive-definite sufficiently large. Proof.

symmetric solution for all q

Let P be a positive-definite

symmetric

1 Throughout the sequel. the notation N > M refers to the fact that the matrix N - M is positive-definite. Similarly, the notation N > M refers to the fact that the matrix N - M is positive-semidefinite.

I, R. Petersen

184

Furthermore

/ Lineur

quadruiic

gumes

with cheup cowol

the Ricatti equation

with

F=F,+F,=F,+ we get

d(ffererltiol

[O F]

1 A,=A+B(F,+ [OF])= [“d: 1.

A;P+PA,+PDD’P+C’C+E+g=O q2 as required.

q2

0

(3.4)

2

Lemma

3.3. Suppose that the Riccati equation

We now recall from Lemma 3.1 that the matrix A, is a stability matrix. Hence, the Lyapunov equation

A+ + PA - q2PBB’P + PDD~ i- C’C i- 2 = 0 4’ (3.7)

AlS; + SlA; + DIDi + I = 0

has a minimal positive-definite symmetric solution P. Furthermore, let A, be defined as in Lemma 3.2 and suppose that the Riccati equation

(3.5)

has a positive-definite symmetric solution Sr. Let y > 0 be given and define the positive-definite matrix S, by

A;P + PA, + PDD’P + C’C + F’F/q2 +Q/q2=0

(3.8)

It follows from (3.3)-(3.5),

has a positive-definite symmetric solu?ion. Then, there exists a minimal positive-definite solution P+ to (3.8) such that

A&

P
+ S,A; + DD’ + SYC’CS,. =-

yI+(y-i)~;~, [ -S,a; - D,D;

1

-A,s,-D,D;

=A

62

-Qy-

A,+ DD’P+

.Proof. In order to establish the existence of P+, let

We now choose y sufficiently large so that the matrix Q, is positive-definite. Letting $ = Sy ‘Q,S;‘, it follows that A&+S,A;.+DD’+S,{C’C+&}S,=O. (3.6)

Since Ricatti equation (3.6) has a positive-definite symmetric solution, it now follows from Theorem 5 and Remark 15 of [13] that given any q > 4” the Riccati equation Clc+E+9

S=O q2

cj = C’C+ Q/q2 + F’F/q2 and let S = P-’ where P is the positive-definite solution to (3.8). It follows that S isa solution to the Riccati equation - A,S - SA;. - S$S - DD’ = 0.

(3.9)

Now since the pair [-A,, Q112] is controllable, it follows from Theorem 5 of [13] that (3.9) has a unique maximal solution S+ > S > 0 such that all the eigenvalues of -A> - QS’ satisfy Re(h) Q 0. Let P+ = S+-’ G P. It follows from (3.9) that

Let the constant q > 0 be chosen so that

A,S”+SA;.+DD’+S

and all eigenvalues of the matrix satisfy Re( A) Q 0.

q2

will also have a positive-definite symmetric solution S> S,.. Letting P = S-l, we can see that P will be a positive-definite symmetric solution to

S+( -A’,-

$S+) = A$+

+ DD’

and hence S+( -A’,--

$S+)S+-’

= A,+ DD’P+.

That is, the matrix A, + DD’P+ is similar to -A, - $S+. Therefore, all of the eigenvalues of A, + DD’P+ satisfy Re(h) G 0. Furthermore, since S+ is the maximal solution to (3.9), it follows that P+ = ,S+-l is the minimal positive-definite solution to (3.8).

I. R. Peterseu

/ Linear

quudrutic

In order to complete the proof, we must now show that P+ > P. Indeed, let K” = -P+ Q 0. It follows from (3.8) that K+ satisfies the equation A;K+ + K+A,-

K+DD’K+

- & = 0.

(3.10)

Now since A, is a stability matrix and all of the eigenvalues of A,- DD’K+ satisfy Re(h) G 0, it follows from Theorem 2 of [14] that K is in fact the maximal solution of (3.10). We now consider the linear quadratic differential game described in Section 2 with performance index fq(u,u) given in (2.3). It follows from [8] that given any x,, E R” the value of this game is given by x;pxc. Hence, given any E > 0, there exists a control u,( -) such that given any control strategy ii(.), J( ii, u,) > xpxo

- E.

In particular, if we let ii(x) = Fx, it follows that co x$x - u:u,} dt z x;lpxc - E (3.11) I{0 where i(t)

+ Due(t),

-x’Qx

w=jom(

x(0) =x0.

+ u’u) dt

subject to Z(t) =AFx(f)

+ Du(t),

with cheup control

q(u) >, x&K+xo. Proof of Claim. Let x0 E R” and u( -) : R + R* be given. It follows from Lemma 6 of [13] that given any T>O, -x/ox

+ u’u)dt

=xhK+xo-x(T)‘K+x(T)

This holds for all T > 0 and hence 03 --Y&x + u’u)dt > x;K+x, I(0 as required. Applying this claim to inequality conclude that co -x/ox + u&) dt -x($x0 + E > I( 0

(3.11), we

2 x;K+xo.

That is,

for all xc E R” and E > 0. Hence, P+ 2 13. 0 Observations. Using the above lemmas, we can now investigate the behaviour of P, the minimal positive-definite solution to (3.7) as q + 00. Indeed, if we let s’= P-l, it follows that ,!? is the maximal solution to the Riccati equation

+s”(C’C+

I0

TIIu-tD’K+x112

2 x;K+x,.

Q/q2)$=0.

Furthermore, it follows from the results of [14] that s’ is monotonically increasing as q --, 00. Hence P = s-i is monotonically decreasing as q --, cc. Therefore, there exists a matrix P 2 0 such that

In order to establish the main result of this paper, we must show that P = 0. Towards this end, we also investigate the behaviour of P+, the minimal positive-definite solution to (3.8) as q + cc. Indeed, assuming that (3.7) has a positive-definite solution for all q sufficiently large, it follows from Lemma 3.2 that (3.8) will also have a positive-definite solution for all q sufficiently large, Let such a sufficiently large q be given. If P+ is the corresponding minimal positive-definite solution to (3.8), let K+ = -P+. It follows that K+ < 0 will be the maximal solution to the Riccati equation A’,K+ + K+AF-

+

185

x(0) =x0.

Claim. Given any x0 E R” and any control law u( *) : R + R*, the corresponding performance index satisfies

T

games

A??+ s;Q’ - q2BB’ + DD’

=AFx(t)

We now consider an optimal control problem in which we minimize the performance index

I(0

di//ererttial

dt -E+Q=o. q2

q2

K+DD’K:

- C’C

I. R. Petersen

186

/ Linear

quadratic

Moreover, it follows from the results of [14] that K+ is monotonically increasing as we let q + co. Hence, there exists a matrix KG 0 such that lim K+=K. if we write P= -K

gcmes

then

lim P+=P>,O.

We will show that fi = 0 by first considering the matrix F. Indeed, since P+ > ? for all q sufficiently large, it follows that

q+i=Im

(3.12)

Lemma 3.4.

ker i?

9~

control

(3.15)

SPcker3.

Ya=Im

Thus, we are now motivated space of the matrix p.

cheap

Observation. Using the above lemma and equation (3.12), it now follows that

4-03

P.P20.

with

Notation (see [16]). The minimal controllable output injection subspace ..TcR” is defined as follows:

q-m

Furthermore,

differential

to look at the null



Y=

B,

B+A(kerCnq);

limq. i+m

Lemma 3.5.

.7c ker

3.

Proof. Let q > 0 be a sufficiently large constant so that Riccati equation (3.7) has a minimal positive-definite symmetric solution B. Furthermore, let

Proof. Using the construction

L p qjB.

given in elements [AF, C]. struction

It follows from (3.7) that

of the matrix F Lemma 3.2, it follows that all of the of 9’ are unobservable states for the pair Furthermore, it follows from the conof p that it satisfies the Riccati equation (3.13)

A’,&FA,+FDD’hC’C=O.

Hence, K= -p < 0 satisfies the Riccati equation A$+

&I,-

kDD’i+?-

C’C = 0.

(3.14)

Moreover, if P+ is defined as in Lemma 3.3 and K+ = -P+, it follows that the eigenvalues of A, - DD’K+ satisfy Re(X) < 0 (for all q sufficiently large). Hence, taking the limit as q + co, it follows that the eigenvalues of A, - DD’T also satisfy Re(h) G 0. Therefore, using Theorem 2 of [14], we conclude that K is the maximal solution to (3.14). We will now show that p is the minimal positive-semidefinite symmetric solution to (3.13). Indeed, let P > 0 be a symmetric solution to (3.13). It follows that K= -P is a solution to (3.14). Therefore p = -Kg -K = P and hence p must solution to be the minimal positive-semidefinite (3.13). In order to complete the proof, we invoke Theorem 1 of [15] (with B = 0). Indeed, using this result, together with the fact that P is the minimal positive-semidefinite solution to (3.13) and the fact that all of the elements of Y are unobservable states for [AF, C], it follows that 9% ker i?

0

L’L=A++~A+~DD++CC’C+Q/q2.

We now take the limit as q + 00. lim L’L = A+ + PA + POD+ + C’C.

q-00

It now follows that the limit as q + co of L exists and we denote it z. Thus, we can write A+ + @A + POD+

+ 02 = ET.

(3.16)

Furthermore, if we again fix q’> 0 to be a sufficiently large constant, it follows from (3.7) that PBB+ = $ { A+ + FA + PDD+

+

02

+ Q/q2 }

Taking the limit as q + cc leads to the result (3.17)

ijB=O.

To complete the proof, we will now show that each subspace q satisfies q c ker @. The proof proceeds by induction. It follows from (3.17) that To = Im B

c ker

@.

We now suppose that @q = 0. Hence jq+,=s

Im B+>A(ker =pA(ker

Cnq

Cnq

).

)

I. R. Petersen

/ Lineur

quudrutic

To show that pA(ker C n 9J = 0, let x E ker C n q given. It follows that Cx = 0 and i% = 0. Therefore, using equation (3.16), we conclude that x’pzx = 0 and hence Lx = 0. Again using (3.16), . it follows that Affix + &IX + @D&X -!- c’cX = i?zX and hence @Ax = 0. However, since x E ker C nq was arbitrary, it follows that FA(ker Cnq

) =0

and hence @q.+, = 0. Thus, we can now use induction to conclude that $q= 0 for all i and hence @Y= 0. 0 We are now in a position to prove Theorem 3.1. 3.1. Using the fact that the transfer function G(s) is right invertible together with the results of [17], it follows that 9’+ Y= R”. However, Lemmas 3.4 and 3.5 imply that 9+Fc Ker 8. Therefore ker $ = R” and hence P = 0. That is,

di//eretltial

games

with cheap

control

187

using the above theorem, it follows that lim Jg*=O

q-00

for all x,, E R”. (ii) The above theorem shows that the strict minimum phase condition, assumption A5 (together with Al-A4) forms a sufficient condition for lim Jq*-+O.

4-00

However, using the results of [2], we can also obtain a necessary condition. Indeed{ consider the optimal control problem of minimizing the performance index

Proof of Theorem

lim B=O. q-00 However, as pointed out in Section 2, given any x0 E R”, the value of the differential game under consideration is given by

q =xp-x,.

subject to k(t)

=Ax(t)

y(t)

= Cx(t).

x(O) =x0,

Given any x0 E R”, it is straightforward to verify that the value of this optimal control problem jq* satisfies jq* 6 Jq*. Furthermore, it is shown in [2] that the non-strict minimum phase condition G(s) = C(sl-

A)-‘B

right invertible

and

1

Hence lime=0 q-* for all x0 E R”.

+Bu(t),

=n+q,

Re(s)>O,

(3.18)

is a necessary condition for 0

Observations. (i) The result of the above theorem

applies to the linear quadratic differential game with augmented performance index Ji( u, u). However, it straightforward to extend this result to the differential game with original performance index J,(u,u). Indeed, it is straightforward to verify that given any x0 E R” and q > 0, the values of these two differential games are related by Jq* < JI;” .

Hence, if assumptions AI-A5 are satisfied and if Riccati equation (2.2) has a positive-definite symmetric solution for all q sufficiently large, then

lim Q=O. ‘I-* It now follows that condition necessary condition for

(3.18) is also a

lim J;=O. q-00

At this point, the reader should note that there is a gap between the necessary condition (3.18) and our sufficient condition A.5. Further research could be directed towards closing this gap. Indeed, it might be conjectured that the non-strict minimum phase condition is both necessary and sufficient for lim J;=O.

4-m

188

I. R. Petersen

/ Linear

quadratic

References PI H. Kwakemaak and R. Sivan, The maximally achievable

accuracy of linear optimal regulators and linear optimal filters,qEEE Trans. Automat. Control 17 (1972) 79-86. PI B.A. Francis, The optimal linear-quadratic time-invariant regulator with cheap control, IEEE Trans. Auromar. Control 24 (1979) 616-621. 131 T. Fuji, On the perfect regulation of optimal regulators, Systems Control Lett. 1 (1982) 356-359. [41 J.C. Doyle and G. Stein, Robustness with observers, IEEE Trans. Automat. Control 24 (1979) 607-611. [51 J.C. Doyle and G. Stein, Multivariable feedback design: Concepts for a classical/modem synthesis, IEEE Trans. Automat. Conrrol 26 (1981) 4-16. Fl C.V. Hollot and A.R. Galimidi, Stabilizing uncertain systems: Recovering full state feedback performance via an observer, Proceedings of the 1986 American Control Conference, to appear. [7] I.R. Petersen and C.V. Hollot, Using observers in the stabilization of uncertain linear systems and in disturbance rejection problems, 1986 IEEE Conference on Decision and Control, submitted. [8] E.F. Mageirou, Values and strategies for infinite time linear quadratic games, IEEE Trans. Automat. Control 21 (1976) 547-550.

di//erenrial

games

,vil cheap control

P. Lancaster and L. Rodman, Existence and uniqueness theorems for, the algebraic Riccati equation, Inrernar. J. Control 32 (1980) 285-309. Control: A GeometWI W.M. Wonham, Linear Multivariable ric Approach (Springer, New York, 1979). 1111J.P. Corfmat and AS. Morse, Control of linear systems through specified input channels, SIAM J. Control 14 (1976) 163-175. Control, PI D.H. Jacobson, Extension of Linear-Quadratic Optimization and Matrix Theory (Academic Press. London, 1977). P31 J.C. Willems, Least squares stationary optimal control and the algebraic Riccati equation, IEEE Trans. Automat. Conrrol 16 (1971) 621-634. of maximal solutions of 1141 H.K. Wimmer, Monotonicity algebraic Riccati equations, $wtems Control Left. 5 (1985) 317-319. P51 M. Pachter, Some properties of the value matrix in infinite-time linear-quadratic differential games, IEEE Trans. Automat. Control 23 (1978) 746-748. WI A.S. Morse, Structural invariants of linear multivariable systems, SJAM J. Control 11 (1973) 446-465. 1171 A.S. Morse and W.M. Wonham, Status of noninteracting control, IEEE Trans. Automar. Control 16 (1971) 568-580. 191