Optimisation in stochastic models

Optimisation in stochastic models

OPTIMISATION IN STOCHASTIC MODELS* L. S. GURIN (MOSCOW) (Received 20 February This paper is a direct continuation of in referring numbering of t...

267KB Sizes 6 Downloads 167 Views

OPTIMISATION IN STOCHASTIC MODELS* L.

S.

GURIN

(MOSCOW) (Received

20

February

This paper is a direct continuation of in referring numbering of the formulae; the in [ll.

1964)

[I].

This also involves the to formulae (1) - (15). we mean

If we take condition (4) as the end of the computation. then. as already indicated in (1). the accuracy will depend not only on E but also on ?I and other parameters. In order to guarantee a definite degree of accuracy in this case it is necessary to make E a function of these parameters. Moreover a reasonable value should be chosen for a in formula (4). In order shall start With the

to obtain definite again from heuristic symbols

recommendations considerations.

on these

cluestions

we

adopted

f (20. YO)- f @l* YI) = fn @o, Yo)- fN (Q, t/l) + 5 ($)

= Af + 4 (y)

,

whence M if 6% yo) -

f (21, yl)la = (Af)2 + g,

Comparing with formula (4). we obtain the resuit mean value of the square of the difference will On the other

hand,

for

the

function

that for be 39.

(9)

[f(so, YO)--f(x~. Y,)]* = 16iiz(M2z,* + m*y$). Taking

l

the means,

we obtain

Zh. Vych. Mat. 4. No. 6,

1134 - 1137, 228

1964.

(16) a = 2/3

the

Optinisation

in

stochastic

229

nodels

Hence it is possibleto obtain the dependenceof E on A, and that of M on In,and with these we may expect the computationto end with a mean error M(x2+ Y")< e'.

($8)

In fact, taking the mean of xo2 + yo2 in accordancewith condition (171, i.e. assuming

and taking

CJJ to

be uniformlydistributedin (0, WV

we obtain

Comparing(19) and (18). +e obtain for the given E' E=

64Pe" 3L-i12‘ .W / nz~I i:

(3-O)

In Table I the results are given of a calculationfor the function fk

Y) = 5xz+4xy+&2,

consideredin [l], for E' = 0.01, x0 = 100, y. = 0,

k

=

0,

n

=

4;

results

TABLE I

64(l) 213 5.4 8.2 4.9 7.6 4.2 5.0 3.9 4.0 4 5 4 4 _I__.._I____J__I__/---L-J0.320.400.220.300.220.300.320.380.440.410.75~.64

L.S.

230

are given corresponding position of the results

Gurin

to various different values of A and - h. The in each cell of the table is as follows: 10-M (ZN)

I

10-b (ZN)

1~~ (12+ ya) loos The number of cases

(out of

(cl9 + y”) .

100) of divergence

is given

in brackets.

Analysis of Table 1 shows that it is indeed true that RIfz2 + y*) scarcely depends on h at all and. as a rule, &RX* + y*f < E*. Further, it is evident that hopt and A may be chosen in accordance with the formulae (11) and (15) obtained earlier, using E‘ in place of E, i.e. to take

These formulae were obtained in the following way: with A given in accordance with (21). formula (21) for A 1s obtained from formula (15). Substituting A and h into formula (111, we find that the appropriate inequality is amply satisfied. In fact,

both the latter

expressions

exceed

Mu and (11)

gives

The increase of A by a factor J-6 is brought about in order to to possible guarautee greater stability for the process in relation vergence. Thus we may regard on its parameters for clarified.

di-

the dependence of the convergence of the process given coefficients of quadratic form as fully

However in t&e solution of practical problems these coefficients are unknown. This creates a new problem which may be solved by two methods: 1) in searching for an extremum for the values ating the second derivatives of the function f(x.

of

f~(x,

yf,

by evalu-

y) and in accordance

Optimisslfion

them calculating

with 2)

selecting

of the point We shall

A

(x,,

in

stochastic

M, UI, and then also

and A directly yn).

consider

231

models

h, A;

by analysing

the nature

of the motion

method (1).

We shall introduce heuristic considerations at the start in the choice of the tiral step for evaluating the second derivatives. We have

Hence h* must be of the same order

as the mean square deviation

{,

i.e.

(22)

where the constant

A

has the

vnlue aefined

above.

In order not to return to this question, we note that the calculations described below were carried out for various values of P and it turned out that it is best to take u = 1. The algorithm for method (1) takes the following form: we are given arbitrary A and A; we then carry out a calculation according to the algorithm described above for the given A and A; on each occasion when it is necessary to increase N a times we carry out a re-computation of A and A. evaluating first of all the second derivatives through the second differences (in this we make the trial steps in conformity with (22)). and then determining M, II and, finally, h and A in accordance with (21). TABLE 2

lU-BM(Zh’) 27.2 io-%((ZN) 82 HJOM(zw+y*) 0.65

2.54 2.0 0.31

uJOs(z~+y~)

0.28,

0.92

3.31 4.0 0.23 0.26

6.02 9.8 0.29 0.41

Moreover, two further refinements have been introduced into the algorithm in order to avoid overloading the discharge network during

L.S.

232

operations

on the electronic

Curin

computer:

a) if it turns out that the step for the antigradient obtained in accordance with the formulae is greater than a certain constant L, it should be taken as equal to L (we took L = 100); b) if it turns out that III< 8. where 6 > 0 is a sufficiently small number (we took 6 = 0.011, then we should proceed to the usual multiplication of N by TVand recompute h and A. It is easy to see that with a positive-definite quadratic successful choice of 6 no cyclic process will occur.

form and a

A calculation carried out under the same conditions as in Table 1 gave results which are set out in Table 2. We shall compare the last column of Table 2 with Table 1, where n was equal to 4. In Table 1 for a successful choice of h and A lesser values of M( EN) were obtained. but for many A and A larger values were obtained. This means that in the finishing process the approximation to optimal h and A proceeds slowly; this is due to the difficulty of evaluating the second deriva(a difficulty demanding, furthertives of f(~, y) by random realisations more, supplementary realisations). However even the better results from Table 1 are only 1% times better than the corresponding results of Table 2. Therefore we may regard method (1) of evaluating the quadratic form as satisfactory. It may be expected that method (2) will turn out to be better only for the less accurate optimisation of f(x, y). But in this case we should bring in for comparison other methods of optimisation also, for example, the method of random search. We note further that Table 2 confirms the conclusion drawn in (1) that an optimal n exists near to n = 1.4. Without embarking on a more accurate determination of the value of n, we shall merely emphasise the following essential fact: uopt > 1.

Translated

by

R.F.B.

Qreig

REFERENCE 1.

in stochastic models (Optimizatsiya v GURIN, L. S., Optimisation stokhasticheskikh modelyakh), Zh. Vych. Mat. 4, No. 2. 367-370.1964.