Design of Closed Loop Optimal Guidance Law Using Neural Networks

V ol . 15　 N o . 2 　　　　　　　　　　　　 CHIN ESE JO U RN A L O F A ER O N A U T IC S　　　　　　　　　　　　　　M ay 2002 Design of Closed Loop Optimal Guidance Law Using ...

Download PDF

167KB Sizes 0 Downloads 64 Views

Report

PDF Reader
Full Text

V ol . 15　 N o . 2 　　　　　　　　　　　　 CHIN ESE JO U RN A L O F A ER O N A U T IC S　　　　　　　　　　　　　　M ay 2002

Design of Closed Loop Optimal Guidance Law Using Neural Networks ZHOU Rui ( Dep artment of A utomat ic Control, Beij ing Uni versity of A er onaut ics and A stronauti cs, Beij i ng 100083, China ) Abstract: 　It is gener ally impo ssible to obtain the analytic o pt imal g uidance law fo r complex no nlinear g uidance sy st ems o f hom ing missiles, and the o pen lo op o ptimal guidance law is oft en o btained by numer ical methods , w hich can not be used dir ectly in pr actice. T he neural netw or ks ar e t rained off-line using the optimal tr ajecto ry of the missile pro duced by the num erical open loo p o ptimal guidance law , and t hen, the conver g ed neur al netw or ks ar e used on-line as t he feedback optimal guidance law in r eal -time . T he r esear ch sho ws that differ ent selectio ns o f the neur al netw or ks inputs , such as the system stat e var iables o r the rat e of L O S ( line of sight ) , may hav e g reat effect o n the per for mances o f the guidance systems for homing missiles. T he ro bustness fo r sever al g uidance la ws is inv estig ated by simulatio ns, and the mo dular neural netw o rks ar chitectur es ar e used to incr ease the appro ximating and gener alizing abilit ies in the larg e state space. Some useful conclusio ns ar e obtained by simulatio n result s . Key words: 　neural netw or ks; missile guidance; o ptimal guidance law ; pr opor tional nav ig atio n g uidance 神经网络最优闭环制导律设计. 周　锐. 中国航空学报( 英文版 ) , 2002, 15( 2) : 98- 102. 摘　要: 对于复杂的非线性导弹制导系统, 很难求得其解析的最优制导律, 只能求得开环的数字解 , 不能适用于具有时变不确定性的导弹制导系统。利用神经网络的学习和推广能力, 对开环的数字最优制导律进行离线的学习, 作为闭环的神经最优制导律在线应用。研究分别选择系统状态变量和视线角速率等不同的神经网络输入对制导系统性能的影响, 以及各种制导律的鲁棒性问题, 并采用模块化神经网络结构提高神经网络的学习和推广能力, 仿真结果得到一些有益的结论。关键词: 神经网络; 导弹制导; 最优制导律; 比例导引文章编号: 1000-9361( 2002) 02-0098-05　　　中图分类号: V 249. 1　　　文献标识码: A

　　T he proport ional navig at ion guidance ( PNG)

diff icult t o solv e and o ft en obtained of f -line by nu-

is used w idely in practice because of it s simple

m erical methods, thus can be only used as open

st ruct ure and easily implement ing, but its per for -

l oop OGL . Mo reo ver , the OGL is sensit iv e t o t he

mances ar e limited . Opt imal guidance law s ( OGL )

initial condit ions and noises in t he missile guidance system , w hich lim it s it s applicat io ns in pract ice .

hav e theoret ically solv ed t he problem ex ist ing in PNG , and pro ved that the P NG is t he o pt imal so -

Neural net w orks ar e w idely used in synt hesis

lut ion under t he co nditions of zero miss dist ance

o f vario us feedback OGL using t he opt imal open

and minim izing t he sum o f t he squares o f t he missile accel erat ion , assuming no t arg et m aneuver ing

l oop t raject ory of m issiles

[ 1]

and no delay in missile dy namics . U nf ort unat ely , solv ing t he OGL w il l result in a complex tw o po int boundary value pro blem ( T PBVP) , w hich is Received date: 2001-10-08; Revision recei ved dat e: 2002-03-10 Foundati on it em: N at ional S cience Found at ion of China ( 69904002) A rt icl e U R L: ht t p: / / ww w . hk xb. net. cn/ cja/ 2002/ 02/ 0098/

[ 2, 3]

, and provide an ef-

f ect ive approach t o desig n t he opt imal co nt rol ler s f or co mpl ex opt im al problem s, because o f it s abilit ies t o com put ing par al lell y , learning , generalizing , and f aul t t olerance . But how t o select t he

M ay 2002

・ 99・

Design of Cl osed Loop O pt imal G uidance Law U s ing N eural N et w orks

st ruct ures and tr aining sam ples of t he neural net w orks guidance ( NNG ) may hav e an impo rt ant ef fect on t he per for mances of m issile guidance systems. T his paper discusses t he select io n of neur al net w orks st ruct ures in NNG , and anal yzes t he per fo rmances and ro bust ness t o noises o f NNG w it h dif ferent input s.

1　Generation of Opt imal Guidance L aw T he t w o dimensional eng agement bet w een t he missile and t arget is show n in Fig . 1. T he relat ive range o f t he m issile and t arget in t he direct io n o f X and Y is denot ed as x and y . Assuming no t ar get maneuvering, t he relative mot ion equat ions are

2　Select ion of N NG St ruct ures F or t he com plex nonlinear opt imal guidance f or hom ing m issiles, o nly o pen lo op numerical OGL can be obt ained of f -l ine, w hich can not be used in t he time varying and uncert ain guidance systems. But the opt imal open loo p t raject ory of t he missile based o n T PBVP can be used t o t rain t he neur al net wo rks of f -l ine , and then , the converg ed neural net wo rks are used as an o pt imal f eedback guidance o n-line. F or long range int ercept ion w it h larg e st at es space , a neural netw o rk m ay hav e g ood learning and generalizing abilit ies fo r so me st at es subspace, but may hav e po or o nes fo r anot her st at es subspace. So a m odular connect io nist archit ect ure show n in Fig . 2 can be used to all ocat e diff erent

F ig . 1　M issile and tar g et inter ceptio n geo metr y

given by

[ 2]

x = v t cos - v m cos , x ( 0) = x 0 y = v t sin - v m sin , y ( 0) = y 0 an = , ( 0) = vm

( 1)

0

Fig. 2　 M o dular neura l netw or ks a rchitectur e

w here an is t he normal acceler at ion of t he missil e, and assuming t he tr ansfer funct ion o f t he missile aut opilot is w it hout del ay . A simple f orm of OGL is defined as

netw o rks t o learn t raining pat t erns f rom diff erent [ 4] r eg io ns o f t he input space . Each netw or k can it self be a local or global appro ximat or for a part icuth

t

1 a2n dt ( 2) 2 0 and t he t erminal condit ions, x ( t f ) = 0, y ( t f ) = 0 are sat isf ied, where tf is t he fr ee t erminal t ime. J = min a n

∫ f

It is w ell know n that solv ing Eq. ( 2) under Eq. ( 1) w ill r esul t in a complex nonlinear T PBVP, w hich can be so lved num erically of f -line as o pen loop OGL . T he P NG of t he abo ve problem is given as an = N ′ v cq ( 3) w here N ′is t he navig ation rat io; v c and q are separat ely relativ e v elo cit y alo ng t he l ine of sight ( L OS) and ang le of L OS.

lar r eg io n of t he space. T he k o ut put unit of t he g ating net w ork , deno ted by g k , is esk gk = ( 4) m ∑i= 1e si w here sk is t he unit k′ s out put s o f t he gat ing net w ork and m is t he number of exper t netw o rks. T he o ut put of t he ent ire archit ect ure is act ually t he m issile guidance co mmand, and g iven as m

an =

∑g

k

( 5)

ank

k= 1

th

w here a nk denot es t he out put of the k ex pert net w ork. During t raining , t he weight s of t he ex pert and gat ing net w orks are adjusted sim ult aneously

・ 100・

ZHO U Rui

using t he backpropagat ion alg orit hm so as t o max imize t he f unct ion m

lnL = ln ∑g k e

-

k= 1

1 ‖a* - a ‖2 n nk 22

( 6)

k

* n

w here a denot es t he ideal missile guidance com mand pro duced by t he o pen loop opt im al t raject ory based on T PBVP under Eq. ( 1) and Eq. ( 2) ; 2i deth

no tes a scaling param et er associat ed w ith t he i ex pert netw o rk.

3　Desig n of T raining Sam ples for NN G Out put s of t he t raining sam ples for NNG are

CJA

3. 2　Rate of LOS as NNG inputs Similar t o P NG , q , t he rat e o f L OS , and v c , t he closing velocit y betw een t he m issile and t arget , are select ed as input s o f NNG . Based o n t he m issile and t arget intercept io n geomet ry in Fig . 1, o ne can obt ain q=

x ( v t sin - v m sin ) - y ( v t cos - v mcos ) x2 + y2

vc =

x ( v t cos - v m cos ) + y ( v t sin - v m sin ) x 2 + y2

q can be used direct ly as input , and v c is norm alized as v c = v c/ ( v m + v t)

* n

a show n in Eq. ( 6) . T o assure t he st abilit y and co nverg ence of neural net wo rks, a *n can be norm al ized as [ 2] a*n x 0 a n = 0. 3 + 3v 2m

4　Simulation Result s of Feedback NN G ( 7)

T he init ial condit io ns of t he syst em are select [ 2]

　　Input s of t he t raining samples f or NNG can be dif ferent according to the desig ner' s pref erence,

ed as

t hus t he perf orm ances of t he guidance syst em may

t ain an open loo p OGL and opt imal t raject ory ,

be diff er ent . Considering t he P NG and OGL have

based on which, tw o diff erent inputs in t raining samples, ( x, y , -) and ( v c , q ) can be developed,

distinct ly diff erent inputs, t w o cases are invest igat ed in Fig . 3, in w hich the inputs of NNG are se-

x 0 = 7500m, v m = 500m/ s, v t = 300m / s,

110° , 0 = 80° . Chebyshev t echnique

[ 5]

=

is used t o ob-

lect ed respectively as x , y and , o r q and v c ; t he

and the co rresponding out put s ar e all t he same, a-n. T he num ber of ex pert net w orks is f our. De-

st ruct ure of NNG is show n as F ig. 2. In al l cases,

pending on t he dif ferent input select ion, t he st ruc-

it is assumed t hat there is neit her targ et m aneuver ing nor delay in t he missile aut opilot .

t ure of each expert net w or k is select ed as 3-5-1 or 2-5-1, and t he st ruct ure of t he gat ing net w or k is select ed as 3-5-4 o r 2-5-4, respect iv ely . T he mo dular net w orks can be used as f eedback g uidance on l ine af ter they are co nverg ed . T he simulatio n r esult s under no minal condit ions are show n in Figs. 4～6, and compared w it h t he ones of PNG in the sam e coor dinat e. T o invest igat e t he robust ness of each missil e guidance law , t he t raject ories of t he missile and t ar get under t he

ig . 3　Schem e of N N G system w it h two different inputs

3. 1　System states variables as NNG i nputs Since t he f ull st at es variables rev eal t he int erior behavior s of t he g uidance syst em , x , y , and in Eq. ( 1) are first investig at ed as NN G input s . In order t o assure the st abil it y and co nverg ence o f t he neural net wo rks, input stat es can be norm alized as x = x / x 0, y = y / x 0, - = 2 / ! ( 8)

condit ions of measuring noises in missil e pat h ang le are sho w n in F ig. 7. T he st reng t h of measuring noises in , deno ted by n , is g iv en by n = 0. 02sin5t ( r ad) 　　 T he simulation r esult s show that t he perf orm ances of N NG using x , y , and as its input s are consist ent w it h t hose of nominal OGL , and have t he best r obustness in all cases. But t he m issile w it h NNG using q and v c as it s input s f ails t o kill

M ay 2002

Design of Cl osed Loop O pt imal G uidance Law U s ing N eural N et w orks

・ 101・

t he t arget under measuring noises in . T he o pen loop OGL has been of f it s optim alit y , and is very sensit ive t o t he measuring noise. T he missile w it h PNG has paid m uch mor e cont rol ef for ts w it h a large curvat ur e, but it s miss dist ance f or P NG is sm all in all cases .

Fig. 7　T r aject or ies of missile and t arg et under no ises

5　Conclus ions U nder nom inal condit ions, OGL is superior

F ig . 4　N ominal tr ajecto ries o f missile and tar g et

g reatly t o PNG on all sides. But f or nonlinear g uidance syst em s, onl y open l oop numerical OGL can be obt ained, which is very sensitive t o no ises and uncert aint ies in t he g uidance syst em , and t hus, can not be used direct ly in pract ice . But t he o pt imal t raject ory obtained fr om t he open OGL can be used t o t rain a f eed f orw ard m ult ilayer neural netw o rk off -line. T he netw o rk can be t hen used o n-line as feedback NNG. T he feedback NNG using f ul l sy st em st at es as it input s present s goo d perfo rmances and st rong r obustness w ith respect t o noises and uncert aint ies. Mo reo ver, t he NNG only needs t o st ore a set

Fig. 5　N om inal ang ular rat e of L O S

o f w eig ht s w it h small com put at ion and sto rage, t hus, it is v ery suit able t o use on -line in real t ime . Since only t he rate of L OS and closing v elo cit y betw een t he missile and targ et can not uniquely det ermine t he sy st em st at es , t hey can not r eveal t he int erio r dynamic behav io r in t he guidance syst em . U nder no minal condit ions, the feedback NNG using t he rate of L OS and clo sing velocity as it s input s has similar perfo rmances t o the OGL , but t he m issile f ails t o kill t he t arget under measuring noises. It is show n t hat the OGL may be a more complex f unctio n of t he rat e o f L OS and closing ve-

F ig . 6　M issile no minal nor mal acceleration

lo cit y bet ween t he m issile and t arget . It should be po int ed out t hat a sing le neural netw o rk ar chitecture can be select ed f or t hose

・ 102・

ZHO U Rui

shor t rang e engag em ent bet w een t he missil e and t arget , and maintains goo d guidance per for mances, so as t o simplif y t he NNG. References [ 1] 　 Bryson A E , Ho Y C. A ppl ied opt imal cont rol [ M ] . W ashington D C: Hals ted Pres s, 1975. [ 2] 　 Rahbar N , Bahr ami M . S ynt hes is of opt imal f eedback guidance law f or hom ing mis sil es us ing neural net w orks [ J ] . O ptimal Con trol A ppl icat ions & M et hods, 2000, 21: 137- 142. [ 3] 　 Cot tr el R G , V incent T L. M inimiz ing in tercept or si ze using neu ral net work s f or t er minal guid ance l aw s yst em [ J] . Journ al of G uidance , Cont rol and D ynam ics, 1996, 19( 3) : 557- 562. [ 4] 　 Jacobs R A , J or dan M I . Learning piecew ise cont rol s tr at egies in a modular neur al netw ork archit ect ure [ J ] . IEEE Tr ans acti on s on S yst ems , M an and Cybernet ics, 1993, 23

CJA

( 2) : 337- 345. [ 5] 　 V l as senbroeck J. A Chebyshev t echnique f or s ol ving nonl inear opt imal cont rol probl ems [ J] . IEEE T rans on A ut omat ic Cont rol , 1988, 33( 4) : 333- 340.

Biography: ZHOU Rui 　 He wa s bo rn in 1968, and r eceived his docto ral deg ree fro m Ha rbin Institut e o f T echno lo gy in 1997, and then became a teacher in Depart ment of A uto matic Contr ol, Beijing U niver sity of A er onautics and A stro nautics, w here he became an asso ciate pro fesso r in 1999. His resear ch inter ests are flig ht co ntro l, missile guidance, and intelligent co ntr ol. T el: ( 010) 82317318, E-mail: zhr @ . dept 3. buaa. edu. cn

Design of Closed Loop Optimal Guidance Law Using Neural Networks

Design of Closed Loop Optimal Guidance Law Using Neural Networks

Recommend Documents