ROBUST ADAPTIVE TRACKING FOR NONLINEAR SYSTEMS US!. ..
14th World Congress of IFAC
Copyright ((.; 1999 IFAC 14th TrielJuiaJ W-orld Congress, Beijing~ P.R. China
1-3b-03-2
ROBUST ADAPTIVE TRACKING FOR NONLINEAR SYSTEMS USING DYNAMIC NEURAL NETWORKS
Dai Qionghai*, Wu Hongwei*, Sun Fuxin*,· Li Yanda* and Wang Wei**, Chai T. y.*.*
* Automation Department. Tsinghua University, P. R. China *"" The Center ofAutomation, Northeast University, P. R. China Corresponding Address: Dr. Dai Qionghai, Automation Department, Tsinghua University, Beijing, Email:
J00084~
P R. China
qhdai@}jerry~au.tsinghua.edu.cn
Abstract: A dynamic-neural-network-based control is proposed for a class of affine systems. The approximation for the unknown affine system is studied. The robust learning Jaws are derived with respect to the modeling errors. The 0 - protection and hysteresis technique are used to guarantee the stability of the resulting controller. The features of this approach lie in that neither off-line learning nor sufficiently small initial
parameter errors are required. It is proven that the proposed algorithm guarantees signals bounded in the adaptive loop under any bounded initial
conditions~
an
Simulation
results are given to verify the effectiveness of the proposed adaptive control algorithm. Copyright© ]999 IFAC
Keyword: dynamic neural network, affine nonlinear system robust adaptive tracking oa
I~
connected
INTRODUCTION
the
static
back
propagation
neural
networks with linear dynamic systems in series or in
Adaptive control of nonlinear dynamic systems by
parallel. Levin and Narendra (1993) indicated the viability of the neuraI-network-based controllers on
using neural networks has recently been an active area~
Narendra et aJ. (1990; 1994)
the basis of the results in nonlinear control theories.
originally formulated the problem. They proposed dynamic back propagation schemes, which
In (Liu and Chen, 1993) and (Jin et aI., 1993) the
and challenging
convergence
analysis
of the
multi-layer-neural-
4476
Copyright 1999 IFAC
ISBN: 0 08 043248 4
14th World Congress of IFAC
ROBUST ADAPTIVE TRACKING FOR NONLINEAR SYSTEMS US!. ..
network-based direct adaptive control was provided.
architecture capable of modeJing continuous time
A weight learning algorithm was presented by using the gradient descent nlethod with a dead-zone
dynamic systems as below: i = f(x) + g(x)u
function, and the errors were proven exponentially
where x ERn is the state variables,
convergent. But their conclusion required the initial
(1) U E
R m is the
input, f(x) ERn and g(~ E R nxm are the function
parameter errors sufficiently small. In (Dai et aL,
vector and matrix respectively.
1995) the robust stable analysis of indirect adaptive
control with modeling errors was presented. A novel neural-network-based controller was explored with
When some mild assumptions are imposed on the
the robust stability to modeling errors. And the
system (1) and the neural network topologies, the
learning law was given by using Lyapunov theory,
following theorem is derived.
which guaranteed the stability of the resulting
Theorem 1 Suppose i(O):= x(O) and
controller and avoided iterative training procedures.
U E
U c Rm
where U is a compact set. The two-layer dynamic neural network shown in Fig. 1 satisfies
This paper is organized as follows. Section 2 develops the problem of approximating the affine nonlinear systems by using two-layer DNNs. In
Section 3, a class of bound functions for modeling
.where
errors is employed to investigate the robustness
network, W
E
R
properties of the DNN-based controller with respect
weight) V
R
Nxm
to modeling errors. Section 3 also proposes a law for
weight~
is the state vector of the neural
XE RN
E
h'xN
is a N X N matrix of synaptic
is a N X m matrix of synaptic
A and Bare N X N diagonal matrices with elements the scalars Qj, b i for i=l, 2~ , N,
robustly tuning the weights, in which a sort of hysteresis is used to guarantee the stability and
S(x) is a N-dimensional vector and S'(x) is a NXN
robustness of the controller. In section 4, some
diagonal matrix with elements 8(XI) and S'(,'Yi)
simulation effectiveness
results
are
given
to
verifY
respective Iy.
the
monotone
of the proposed adaptive control
s(.)
s'()
and
increasing
are
functions
both
This
smooth two.. layer
algorithm. Finally the conclusion of the paper is
recurrent DNN with dynamical elements in its
dro\vn.
neurons and N
~
n, can approximate the nonlinear
system (1) (proof omitted). 2. APPROXIMATION OF AFFINE SYSTEMS BY
USING DYNAMJC NEURAL NETWORKS
In order to approximate the behavior of a dynamic system, it is clear that any proposed neural network configuration must have some feedback connections.
Such networks are named as dynamic networks. The
neural networks is determined by weights. Therefore the representation input-output
response
of
Fig.. 1. The architecture of the two-layer dynamic
capability of a given network depends on whether
neural network
there exists such a set of weight values that the neural nern:ork configuration can approximate the behavior
In this paper the network with N
of the given dynamic system. In this
section the problem
is considered
=n
and without
any restrictions on the weight matrices W and V
of
is considered. Such a neural network is known as the
constructing a two-layer dynamic neural network
4477
Copyright 1999 IFAC
ISBN: 0 08 043248 4
ROBUST ADAPTIVE TRACKING FOR NONLINEAR SYSTEMS US!. ..
generalized
Hopfie Id
network.
The
differential
equation of this network is described by
14th World Congress of IFAC
In fact a very reasonable assumption (Craig, 1988) is that
i = Ax+ BWS(x) + BS'(x)Vu (3)
{m (x) =
ao (:td) Xd' xd) + a (e)
(8)
The computer simulation results have shown that the
with ad (Xd' Xd' Xd) and a(e) both continuous polynomial functions. The coefficients of aCe)
model represented by equation (3) is more effective
depend on x,,-, id ~ Xd •
and the learning is also faster, which means that the network
has
strong
approximation
ability
for
To solve this problem it is required to add a robust
nonlinear system (1). In this paper the robust tracking
term
control of nonJinear systems is carried out using the
yER.
UR
(t) , based on the function defined as (9) for
DNN model as (3).
l-exp(-Y/) sat(y) - {
-
with
3. ROBUST ADAPTIVE TRACKING CONTROL BASED ON DNN
r
/y
-(l-exp( ~)
ify~
(9)
ify
a small positive (design) parameter. When
Y ERn, sat(y) is a vector with entries sat(Yi)'
In this section a robust adaptive tracking control of affine nonlinear dynamic systems is studied. When
Take the control input as
DNN is considered as a model of the plant (I),
(10)
modeling errors are defined as following: with auxiliary signal
t(x):::= Ax+ BW*S(~+BS'V"'u -f(x}-g(x)u (4)
u R =- ~m(x)sat(e) (11)
with W . . and V'" denoting the unknown optimal weight values. The modeling errors arise from the
The closed-loop error dynamics become
e = Ae+BWS+BS'Vu+ t(x)+u R
inadequacy of the approximation. Thus the dynamic
equation is expressed as
x== Ax+BW"'S(x)+BS'(x)VAu- ~(~
(5)
with
W=W-W·
and
The
architecture
of the
(12)
V=V-V·. proposed
DNN-based
adaptive tra~king controJ is shovm in Fig .2. In the tracking
~roblem
the control objective is to
force the pJant's state vector x to follow a specified desired traj ectory x d. The tracking error vector between the plant's states and the desired states is defined as fo lIowing: e=xd-x
(6)
Differentiating (6) and considering (5) the following Fig. 2 the architecture of the DNN-based adaptive
expression is obtained:
e = Ae -
BW·S(x) - BS'(x)V· u + t(x) + r
with r ==- Xd -
AXd.
tracking control
(7)
The weight learning Jaws are given in (Dai et al.,
DNN is here used to make the
1995) as (16) when not considering modeling errors:
nonlinear system track the desired trajectory, thus the
problem is to design a control law.
W= -BPS"(x)E; V== -BPS'(x)eu T
(13)
It is developed that the approximation error bound ~m (x). is not a constant, but depends on x d and x.
4478
Copyright 1999 IFAC
ISBN: 0 08 043248 4
ROBUST ADAPTIVE TRACKING FOR NONLINEAR SYSTEMS US!. ..
14th World Congress of IFAC
hysteresis plus 0 -protection is considered, shown SI
with
E=diag{el,p~p,en} andS"=
as below:
: [ SI
Due to the presence of modeling errors, the learning laws about W and V given by (13) should be modified. Ioannou and Aniruddha (1991) proposed an alternative leaning scheme which required neither
IIV"II. and replaced
priori knowledge of ~W·II nor the constant tenn equations
for
G'.
laws
are
modified
6~v
L=
-I 6v~V
(15)
0"']
0
0
0"2
>(0',-,[
0
dJ!:. < f. The
L£
0
0
::=
crm _]
o
drifting to infinity.
Gm
0
o
+(m-N-l)om
The hysteresis idea (.MatheIin and Badson, 1995) is
singular value, and
a
b are
a
O"min (.)
+Om
0
Em
:5: (Cm +(m-N~l)om)I
and {tit.} are defined as following
expressions, in which
o Bm
o
applied to guarantee the existence of u. The
Vc
+!3N)"2(CN +!3N)I
UN
O"N+I
modified laws will prevent the weight values from
variables of
diag{o-,}, ,
0
0
and 6 v diagonal matrices, do and d 1
positive constants chosen to satisfy
:=
(14)
6v =-d, 5v -doE with
~l= 1:",
J
a;::; aj "'ii <
:-d1 5w -doE
-BPS'(x)eu T
V is considered
to make E and I:s satisfy the following inequities:
W= -BPS"(x)E -18w I'"
v=
Specifically, the partitioned singular value
[M MslV[N Ns ]] =- [ :
as
followings;
6w
G~.
decomposition of
with an adjusting term. The
learning
where M T 1:""" 1 N is a lower rank approximation of
The constant
denotes the smallest
Bm
and Om are chosen to satisfy the
foIIowing conditions:
is a designed positive constant, parameters
chosen
to
satisfy
l
wIth am > 1 and 0 < 8 m
G; The
am
can be guaranteed.
following
lemma
is
needed
for
further
proceeding.
(min (~(Tk));=O"./c, 1; is the time that
& = -....!!!-. The existence of
Lemma I For weight matrices W and V ~ the following equalities are established (proof omitted).
O'min (V(t))>% \Ilk
tr{WTW}
Furthermore, to relax the restriction that the pseudoinverse rVTV]~l must be bounded, the selective
tr{VTV}
=~II\\f+~llwlr -~~w·t, =~llvf +tllif -tllvof
4479
Copyright 1999 IFAC
ISBN: 0 08 043248 4
ROBUST ADAPTIVE TRACKING FOR NONLINEAR SYSTEMS US!. ..
The Lyapunov synthesis method will be used to
14th World Congress ofIFAC
eTp(
derive the stable adaptive Jaws. Therefore we
~x) +UR) ~';m (x)lef Pvec(exp{Jj,{})
consider the Lyapunov function candidate as below:
with vec(y;) the vector with components Y .And by
I T 1 -... T -.. 1 -- T ~ I T v=-e Pe+-tr{W W}+-tr{V V}+-tr{owow}
employing Lemma 1, the following equality is
2
1
2
2
j
2
obtained:
T
+-tr{ Qv Dv} 2
V ::::
(16)
where
P >0
is chosen to satisfy the Lyapunov
equation PA + ATp == -I. The following theorem provides a bound for the adaptive scheme when
-~(l- 4~o )lIe 11 2 +';m (x)leIT pvec(exp{-';!tn
+8~1 (~w·r +llv·r) From (8) it is yielded that
applied to the true system.
Theorem 2 Let :x d , id' Xd be bounded. Cons ider the plant given by (5), and Jet the modeling error bound be of the form (8). The controller is taken as
of DNN are adjusted according to (14) and (15). For any
WeO)
and
YeO)
satisfying
(V(O))~ Gm'
,O'min
Since the exponential term dominates any polynomial it may be argued that for a suitably small y
one~
(10), the auxiliary signal is as (11), and the weights
-;(1- 4~o )1~112 2
vs
+a{)n ~XP{l} +at n J{XP{l}
+8~1 (1Iw·r +/Iv·r) with
uJ
a
continuous
function
x d , id' Xd
of
Proof Differentiate (16) along the solution of (12),
depending on the coefficients of a(e).
the followings are obtained
Since the last terms are independent of e and the desired trajectory is bounded,
I ., T ......, T ,..... T v=--II~I- +e P(BWS)+e P(BS'Vu)+e P ~(x) 2
+ eT Pu R +tr{W
T
an Y /
bounded,
W} + tr{VTV} + tr{ 8"{;r DW } + tr{ 5~ 0v } _1 8d)
(17)
o
/exp{l}'
(lIw ·r +llv.r)
are
a o and a J are a,n ;{xP{J} and respectively_
bound,
Therefore for
Select (8) to guarantee the last term
to satisfy the following inequality:
e;<\g;(x)\ + U Ri )
~ ej(~m exp{-j{ }-8
j)
(18)
et 20
v< 0, implying that
-eiClC;i(x){ -URi)
e, W, V, 6w , 6v
E
Lcn
•
s-e,(.;'mexp{J:}-8;) e, <0 with 0, == ';m(x) -lq(x)1 ~ 0, 2; (';;(x) + URi)
S
4. SIMULATION RESULTS
so that
In order to evaluate the quality and feasibility of the
4m(:l(~eil exp{- j{}-
proposed DNN controller, it is applied to control the speed of a DC motor. The dynamics of a separately excited DC motor are mode led as follows:
Thus
4480
Copyright 1999 IFAC
ISBN: 0 08 043248 4
ROBUST ADAPTIVE TRACKING FOR NONLINEAR SYSTEMS US!. ..
14th World Congress of IFAC
to be robust stable. The learning law is generated using Lyapunov theory. The cS -protection and the hysteresis technique are used to make the adaptive
f(x) ==
system robust and avoid iterative training procedure. Simulation results indicate that the proposed DNNbased controller have good practical use for the It should be illustrated that the robust DNN adaptive
adaptive tracking control of nonlinear systems with
controller requires no knowledge of the dynamics or
modeling errors.
the structures. In the example x 3 can not be measured, which is considered as an unmodeled XJ~ X2
dynamic, and only
REFERENCES
are measured.
Qionghai, Dai, T.Y., Chai, Zhang, Tao and Yumei, ~
The reference trajectory is a periodic signaJ with
Zhang (1995). Robust adaptive control of
period=30s. On adequately choosing initial values
class of unknown plant using dynamjc neural
networks. Proc. IEEE ICNNSP, 568-573.
and assigning modification parameters, the tracking 3~
performance is shown in Fig.
K.S.,
Narendra
and
Parthasarathy (1990).
K.,
Identification and control of dynamic systems
using neural networks. IEEE Trans. Neural Networks, Vol. 1, No. I, 4-27-. Narendra and S.,
K.S.,
Mukhopadhyay (1994).
Adaptive control of nonlinear multivariable system
using
neural
networks.
Neural
networks, VoI. 7, No. 5, 737-752.
A.D., Levin and K.S., Narendra (1993). Control of Fig. 3 (a) the trajectories of .
T"
Xl
nonlinear
and x ml
networks~ 11 .. : -
"
:- ••_
-~
:
:
~ -
'"
~.- • •-
: .. -
-
:
IEEE Trans. Neural Networks, VoL 4, No. 2,
: i: :; ;:!; ~:; ;: ;: ;!; : ;l; ; ; ~ml
~t5
:
:
:
~
n
:
10
1
:
:
'
:
1~
14
J6
13
Jl
Fig. 3 (b) the trajectories
Xl
and
dynamic systems using neural controllability and stabilization.
207-220.
Chen-chung, Liu and Fu-chuang, Chen (1993).
Adaptive control of nonlinear continuous time system using neural networks-general relative degree and MIMO cases. 1nl. J. Contr; Va!. 58, No. 2, 317-335.
X m2
L~
Fig. 3 tracking performance of the proposed DNN-
Jin)
P.N.~
Nikiforuk and M.M., Gupta (1993).
Direct adaptive out tracking control using
based adaptive controller
multilayered neural networks. lEE Proc. D, VoJ. 140, No~ 6" 393.. 398~ P.A., Ioannou and Aniruddha, D. (1991). Robust
5. CONCLUSION
adaptive control: a unified approach. Proc.
The DNN-based robust adaptive tracking control of continuous-time
nonlinear
dynamic
systems
IEEE, Vo!. 79, No. 12, 1736-1767.
is
Michel, de, Mathelin and Marc, Bodson (1995).
studied. It is shown that the two-layer ONN can
approximate affine dynamic systems. A neural-
Multivariabl model reference adaptive control
network-based
without on the high-frequency gain matrix.
controlIer
is
explored.
And
the
Automatica, Vol. 31, No. 4, 597-604.
analysis of the function bound about modeling errors J.1.,
is employed to investigate the robustness properties
Craig
(1988).
Adaptive
control
of robot
manipulators. Reading, MA: Addison-Wesley.
ofDNN controlJer. The closed-loop system is proven
4481
Copyright 1999 IFAC
ISBN: 0 08 043248 4