The method of projections for finding the common point of convex sets

The method of projections for finding the common point of convex sets

THE METHOD OF PROJECTIONS FOR FINDING TNE POINT OF CONVEX SETS* COMMON L. G. GUBIN . B. T. POLYAK and E.V. RAIK MOscow (Aeceived 29 November 1...

1MB Sizes 23 Downloads 78 Views

THE METHOD OF PROJECTIONS FOR FINDING TNE POINT OF CONVEX SETS* COMMON L. G.

GUBIN .

B. T.

POLYAK and E.V.

RAIK

MOscow (Aeceived

29

November

1966)

MANYmathematical and applied problems can be reduced to finding some common point of a system (finite or infinite) of convex sets. Usually each of the sets is such that it is not difficult to find the projection of any point on to this set. In this paper we shall consider various methods of finding points from the intersection of sets, using projection on to a separate set as an elementary operation. The strong convergence of the sequences obtained in this way is proved. Applications are given to various problems, including the problem of best approximation and problems of optimal control. Particular attention is paid in the latter case to problems with restrictions on the phase coordinates.

1. Methods of successive projection A prqject point .+(x)

ion

E

of the point x on to the set ;; of a normed space I’? is a .:, such that (see Fig. 1) 112--R(z)ll=

Theproperties of projections

infib-yll= p(z,R). tlER

of projections will be given in Section 2, and examples for actual sets will be given in Section 4.

Hereafter we shall assume that f? is a Hilbert space, although some of the results given below are also valid for a wider class of spaces. Suppose we are given

l

Zh.

vychisl.

Mat.

a system of sets I:)~ E E, a E

mat.

Fiz.

7,

6,

A, where :1 is a

1211 - 1226, 196’7. 1

2

L.G.

Gubin

et

finite or infinite set of subscripts. point x of this system of sets, XEQ=

al.

The problem n Qa.

is to find

some common successive

The methods of

GA

FIG.

1.

projection consists in constructing a sequence x0, x1, . . . , xn, . . . , where the point x o is arbitrary, and in order to obtain the point znt 1 from x* a set Qacn, is chosen and a step is made in the direction of projecting the point xn on to this set, i. e. Pfl Hereafter

=

39 + h, (PC+)(P)

Pacn)(z)

=

Po,(,)

-

zn) (

In particular,

(x). zn+l =

o< if

h, <

2.

(1)

A, 3 1, we have

Pa(n) (rn) 7

U’)

i.e. for xntl we take the projection xn on to gc(,,). The order in which the sets are selected (i.e. the method of choosing a(n)), can vary. If A is a finite set, 3 = {Ccl, . . ., a,), a cyclic order of projection can be used CL(n)

=

h(mod

(2)

m)+iv

where n(mod m) represents the remainder obtained from dividing n by m. In the general case the method of projecting on to the most remote set can be used, i e. p(xn,

Qacn,)= sup

p(xn, Qa)=

@(zn).

(3’)

CUSA

Since, however, such a set may not exist or it may be difficult to find, it is sufficient to solve the problem of finding the remotest set approximately, namely to choose an arbitrary a(n) satisfying the following condition:

The

fromP (xn,QCC ( R,) --t 0 it

method

of

follows

projections

thata

3

(rn) =

The convergence of these methods follows which will be proved in Section 2.

sup p (P, Qa)-+ 0. aEA

from Theorem 1 given

(3)

below,

Theorem 1 Let all

sets

Cj, be closed

and convex,

Q =

fl Qa

be non-empty and

CLEA 0

<

&i <

Let any of the following (a)

Q, n

( aO_

h,

<

2 -

conditions

Q,)" is

E2 >

E2,

0.

(4)

be satisfied:

non-empty

(here ?o

is a fixed

set

from the

a&

system f&t if0 denotes

the interior

of ‘I) ;

(b) all Oar, with the possible

exception of one (f&J, vex* with the common function S(T) ;

(c) B is

finite

(d) A = {a,, (2, = {X1

con-

dimensional;

. . . . a,,,) is finite, (Ci,

X><

Then in methods (l), strongly

are uniformly

convergent

and all

&

are half-spaces,

i.e.

/3i).

(2)

and (11,

to sane point

(3) for x0 E Q =

any x0 the semence xn is fl Qz.. In the cases (a) WEA

and (d)

in method (11,

(3’)

the rate

of convergence

is geometrical.

The methods (l), (2) and (11, (3) are suggested in [2, 31 for the finite dimensional case. Their convergence is proved in the same place. Some generalizations of the results of [2, 31 for the finite dimensional case are given in [41. In [51 the weak convergence of the methods in the general case is proved, and no conditions of the type (a) - (d) are required. The assertion of Theorem 1 concerning the strong convergence of the method and an estimate of the rate of convergence for the case (a), are obviously new.

l

A set R is called uniformly convex if there is a function E(T), S(T) > 0 when -r > 0, such that from z ER, y E 9 it follows that z E R for all z, for which lb - (X + Y) / 211< 6(11X- yll) [ll .

4

L.G.

Gubin

et

al.

Let us now consider the case when the sets ‘So do not necessarily have a common point. Here we.shall confine ourselves to finite A and to the method Cl’), (2). i.e. to the case of cyclic projection. Theorem

2

Let all them (for

Oai,

i = 1, . ..,

explicitness,

points

2; E

m - li

;j_,G,)

$oi,

m be closed

o,,)

and convex and at least

be finite.

Then it

is possible

i = 1, . . . , m such that I)ci,,Cx while

= x1,

in the method (l’),

(2)

) = Xi+l,

one of

to find i = 1, . . . ,

we have xkmtitl

-

.km+l

ana xkmti weakly converges to Xi as k -+ co. If in -* %i+l - Xi, the convergence addition any of the conditions (b) - (d) is satisfied, will be strong. If all sets C&.., apart from possibly one, are also convex*,

and

;Q,

the rate

of a geometrical

i=i

is empty, the sequence xkmti converges

1

to Xi at

progression.

Note 1. For the case of two sets such that

llZ_i - ZAl =

m = 2 we obtain

the points

x1, xz

inf IIII:- YII= P (Qan Qd.

=Qa, VEQ, .Note 2. For the strong convergence require that at least one of the sets be uniformly marks also

of the method it

is sufficient

to

(Joj) should Qai (for explicitness, convex and that Qa. n Qa, should be empty (similar re-

apply to the geometric

The method (1’).

J-1 rate of convergence).

(2) with m = 2, go,

n Q,,

= 9 has been suggested

[sI only when one of the in [sI . However its convergence is proved in sets is compact or finite dimensional. In addition, for m = 2 the result of Theorem 2 has been obtained by different methods in IIT]. The case of an arbitrary m (but assuming the compactness of one of the sets) has been considered in Ltd. The proof of Theorem 2 is given below in Section 2. Let us now consider *

yet

another version

A set H is called strongly convex, the function S(T) = y-r2, y > 0.

if

of the method, using a it

is uniformly

convex with

The

different

way of choosing zn+* =

P,(5n D (zn)

method

of

the length

of the step,

+ yn (0 (In) )--i (Pa(*) (2”) =

IIPcLc,,, (xn)

-

5

projections

Al

=

-

namely sn)),

a,;q+a

yn 2

P (xn,

(1”)

0,

Qa) .

(3”)

In other words, during each iteration a step of length yn is made in the direction of projecting on to the remotest set, and the point obtained is projected on to some selected set Q,. Theorem

3

If all and vn+O, finite

Qa are closed

sets

condition

g yn = 00, the method (1”).

number of steps,

Indeed

and convex,

Q =

zn E Q =

i.e.

Qa fl (x : g(x)

<

g(s) = The supporting

functional

nn has the form

En =

0}, sup

n Qa c&s4

gives for

is satisfied,

a solution

within

a

some 72.

where

ac54, a&

to the set

(3”)

(a)

P (2, Qa),.. {r : g(x)

(@(z~~))--~(P~(~)(s~)

-C).

<

g(z”)}

After

at the point this

it

is clear

that Theorem 3 follows from the general results of the paper [91, since the method (1’3, (3”) is a special case of the method considered in [91, and all conditions of Theorem 1 in [91 are satisfied.

2.

Proof of Theorems 1 and 2

1. The properties of projections. We shall nw describe some general properties of the operation of projection, which will be needed subsequently. It will always be understood that R is a convex closed set, and the projection of x on to R is denoted by P(x).

First of all P(x) exists [lo], p. 286). and therefore

and is unique for every x (see method (1) is meaningful.

for

example

Lemma 1 The vector

z - P(X) is supporting

to .9 at the point

P(x),

i.e.

L.G.

Gubin

et

so,

@-J%),Y--P4) If R is strongly

Proof.

we have

---P(x)) (0.

(x-P(X),Y

convex with the function

For 0 < h < 1 we have

convex nature

(5)

y E R.

x@R,~#P(x),~ER,

convex,

If R is uniformly

al.

hy +

(5')

E(T),

(1 -

we have

E R

h)i(z)

from the definition

of R. Therefore,

in view of the

of projection

lb - P(x) II2 < II5 - hy - (1 - h)P(x) 112= = lb - P(x) II2+ AW(z) - YIP + h(z - P(x), P(x) - y). Hence (x-P(x), (5).

P(x))

y-

< hljP(x) - ~112, and taking h + 0, we obtain

In the case of a strongly

that

1/2(P(x)+y)+z~R

llx -

p (4 II,

(5),

we obtain

then

convex set there when

exists

We takea=s(s-p(z))/

tk.llS:.

l/z(P(x) + y) + z E R.

w =

Replacing

0 2 (x -P(x), w -4(4)=‘/2(~--(4,Y + i.e.

(3 -

P(x),

Y -

,y by zu in

-P(x))+

,lp(x;_x,,

P(x))

such an E > 0,

(~-p(49x--(xh

-2ellx - P(x)11 c 0.

<

Finally, for a uniformly convex set, equality, since we can take e = ,S(lly -

(5”)

follows

from the last

in-

P(&)ll).

Lemma2 The projection

operator

is non-stretching,

i.e.

IIP(x) - P(y)II < 112- Yll. If R is strongly

convex,

x # y and x e

IIP(x) -P(y) If R is strongly

convex,

we have

(6)

R (or y 6?? RI, we have

II c 115- Yll.

03’)

The nethod

of projections

P(s) - P(y)!1 < dlz - yll, de Q =

7

1 ~w~~+llv--~iv)ll)

1+2Y(lls-

(6”) Roof.

Applying

(5)

twice,

(5 - W), JYY)-Jw) Adding these

inequalities

we obtain

G 0,

b--P(Y),+)

-WY))

we have

0 > (x - $4,P(x) - P(y) ) + I!P (x) - WY) II2 2 2 -lb - yll IP(z) -P(Y) II + IIW) Hence

IIP(z) -P(y)

II <

< 0.

11x-- yll.

Similarly,

using,

formula (5’) for a strictly convex set, and (5”) set (with 5(-r) = y-r?), we obtain (6’) and (6”).

for

---p(Y) IF.

instead

of

a strongly

(5),

the

convex

2. scheme of the proof for Theorem 1. It is easy to see that the statement of Theorem 1 concerning the convergence of xn follows from the following chain of lemmas.

In method (1) ZEQ=

n&

for

any choice

of a(n)

for

every point

on to

andforalln

%A

IIZn+i- 2116 IIP - zll.

(7)

Lemma 4

In the method Cl),

(2) and (l),

lim 0 (zn) =

R-kCS

0,

(3) with the condition

(4) on h,

CD(x)= SUPP(Gu).

(8)

%SA

Lemma5 If any of the conditions (a) - (d) are satisfied, (8). the following condition sequence xn, satisfying lim p (xn, Q) = 0. Tl-rco

for any bounded is satisfied (9)

8

L.G.

Gubin

et

al.

Lemma 6

If for a convex closed set Q end sequence a? the conditions (9) axe satisfied, xn converges to some point x* E Q.

(7) and

The proofs of Lemmas3 to 6 will be given below. 3. Proof of Lemma .3. Using formula (5) for the sets condition O< h, ,12, we obtain for x EZ Q

4. hoof

of I!&URU4,

p(xn, 0) = 11~ - P,(P) THUS,

p(S’,

Using Lemma3 with n = k$(xn),

II >

Q) monotonically

Ilc+i - P&P) = p (P+‘, Q) . decreases,

Qatn f and the

we obtain

ll 2 ll~+i - pc, (xn+i) ll =

and therefore

there exists

PRO*

P =Iimp(x*,Q), n-+“W ~~he~ore,

Q) - p2(xn+l, Q) > 11x*- PQ(x”) 112 - IIxn+i - PQ(xn) 112 = = llxfl - PQ(xn) II2- IW + hn (Pct(72,(2”) - Zn) - PQ (@) II2 =

p2(x”,

-- llxn - P&y12 - b" -

-

Z?w(z=

+

%I(~cZ,?~)(~

-~Q(2n),&(?&)(z") -

z",PQ(Zn)

Using Lemma1 and condition p2W

pQ(z")!i2 -

xn)

=

k2ik(n)(xn) %&

-

-

d12

-

~~)llP~~~~(~n)

-

d2

+

-PC&~*)).

(4). we find

Q) - @(x~+*, Q) a ~32tl~~n,(~~)

But, as has been shown, p (~9, Q) --f p, that

therefore

-

d12.

(W

from (10) it follows

P(@, Qacn,)= IF - Pa(n)(sn) 11 + o. For the method (I), For the method (l),

(3) in accordance with (3) it follows that @(a?) 4 0. (2) and a certain E > 0 we choose N such that

The method

p (xn, Qw+,) <

e / 2m for

all

II > I’. Then for n >N

= hnp(zn, Qacn)) II e / m.

IIZn+* - znll = 3LnIIPcqn)(zn) - Al For every l< for for n>N

one can find

i
9

of projections

k < m

that a(n + k) = ai.

such

There-

p (P, Qai) < 11~ - ~~+~ll + [IP+~ - Pa(n+k)(P+k) II < 112”- zn+‘II + . * * . . . + ll~+k--i - ~n+k ll + ll~n+k - Pacn+k)(znfk) II < ke I m + e / 2m -C e. Therefore # (xn) = max p (xn, Qai) < e, i

which proves

Lemma 4 for

the method (I),

5. Proof of Lemma.5. Let condition

=Q,fl U-IQ~)~ end a&

llz -

xl} <

such that

(E + 6) E

0,

for

all

of all

that

a # z.

z E Qol, P,(y)

We take .y = P--o(P)

We choose

all

a # a and all

is 811 arbitrary

w =

ex/

point

(e + 6) +

6,~/

Indeed,

But lb - 41 = (6 / e) Ily - P,(Y) II < 6,

z E Qa. Thus UI is an internal

end points

if ,y

e for all a # a, then

where z = x + @/E) (.y - pa(y)). therefore

(a) be satisfied.

6 > 0 such that t E Q, for

6. We remark first p(y, QOL) <

(2).

E Qa,

point

therefore

and put E = 20(x”).

P (Y, Qa) G lb -&W)

il G IIu - Al

= P (xn, Q$+ Therefore if we form w as above, since x E &, y E Qa, therefore Therefore

+++11xn-

of sn interval

having

w E Q,. Then for a # z

+ 11~~- Pa(xn) II =

P (xn, Q,) <%

+ G = e.

we have w E Qa for all a # or. But also w E Qa. Consequently, w E 3).

XII< e(-$+f)

=$ce.

the

L.C. Cubin et al.

10

Here we have used inequality condition

that

(6)

the sequence

(whence

rn(IIrn -

IIy -

XII <

R)

xl1 <

llz’l -

211 ) and the

is bounded.

Thus,

(11) which in view of

(8) proves

Lemma 5.

Let condition

(b) be satisfied.

that there

is a subsequence

i.e.

We assume that Lemma 5 is invalid, xnk such that

kll_P(Z”k, We choose Q, (r”)

E=

min {p/4,6

E, p (rN, Q) >,

<

Let us consider

that

6.(ljPa($N)

PQ(zN) II 2

i.e.

y E

Q, for

YII =

lb--“‘ -

dicts

all

‘12(Po(rN)

hand,

with its

centre

ipa

at

E/2

+

*/2(PQ(~N)

+

1/2p(~N, Q) f

&(xN)).

It is obvious

E,

Pa(zN)

i-e.

to Q,.

(p/2)

- E a

‘/21lp,-

and

But’ Il’Pa(~N) --

6(P/4) >, 5

II s

con-

p/2

-

ami Ily(xN)

--~W

+

i. e. y E Qa. Since y E Q- and

N E

Pu (z”,)

2

+ PQ(x”))

*/2(Pa(sN)

- ZNII 3

c/2 =

I/zE,

f

entirely

P&T)ll>

-

a f a, therefore

the assumption

= N such that

nk

for a # iii, in view of the uniform

II = ‘/2II&(sN)

zNII <

and find

0,

PQ(s~) II) belongs

G([IP,(xN)

+PQp)

1/211Pol(~N)-

f

y =

hN - PQ (x”) 11-

P/4 = P/4, 1/2(Pa(zN)

-

e >

o > 0.

P/Z.

the point

of Qo, a sphere

radius

p,(s)jl

(p/4)},

N E QG. ~1 the other

vexity

Q) =

0. Hence

II d pbNI

p(rN,

Q)

\(

llxN -

lt211~N - p&N) Ii + Y211~NQ,s E


that pCxN, Q) ap/2.

When condition (c) is satisfied Lemma 5 is easily proved by contradiction. Let there be a subsequence xnk, for which p(x*k, Q) > p > 0. In view of the boundedness of the sequence {x”k), another subsequence can be separated out from it (which for simplicity will also be denoted converging to some point x*. Since p(xnk, Q,) -+ 0 when k + co by ~“~1, for every a, and Q, is closed, therefore x* E Qo for all a. Therefore z* E Q. which contradicts the assumption that p(xnk, Q)a p > 0. Finally, if condition (d) is satisfied, we denote by ,!, a finite dimensional subspace going through x0 and spanned by the vectors c r, . . . ,c,,,, and we denote by in the projection of xn on L. It is easy to prove that

The method

P WYQi)

P (zn,Qi),

=

P

w, Q) =

to show Lemma 5 over the finite the following estimate is true

from which the statement

p (Sn, Q),

for

all

s m+1 =

Let xf E IF

-

and in view of

7.

Estimating

satisfied definition

; S,,. n=o

X*1/ < (9),

closed

Q) for

But a sequence

%ll.

section.

are convex, p(z”,

all

pQ(xcn)

rate

p) a sphere with its

11 +

i.e.

(since

in view of

xm E S,)

x* ES(P~(Z”), ilpQ(xn)

(12))

-

and the estimate

P (r”, Q) Q cm (Xn) =

cp (Xn.pQqn)) < c

p(xn, Q)),

x*11

<

2p

<

(lo),

p2(xna Q)- p2(xnfi,

(I -e$$)‘h P(XnY Q).

But from (13) 2p(xn,

Q),

we obtain

Therefore

<

we have

(13)

(a) or (d) is Using the

whence &

(xn,

&(xn).

8182

~‘11

(71,

and in addition

when condition

p(sn,Q)

of method (3’)

llxn -

at

0.

of convergence.

(11) or

P(p+i,Q)

centre

P(cQ)).

n
x.11=

n-

the

follows.

of such sets must have a non-empty inter-

lim (lxn -

we have (see

case

(12)

and non-empty

Then since

km -

is sufficient

cases.

&n =n~OS(P~(~n),

Ii <

it

space L. But in this

i

6. Proof of Lemma 6. We denote by S(x, the point x and of radius p and consider

11~m-Po(zn)

therefore

dimensional [21

of the lemma immediately

Lemma 5 is proved

Then the sets S,

11

Q)< cmaxp(m,Qi),

~(5~~

Thus

of projections

Q) < 4

I-

E$)n’2

-

Q)

L.G.

12

Gubin

et

al.

We remark that the $81118 proof can be extended also to the more general method (l), (3) with some refinement of the condition (3). At the expense of some complication it is also possible to prove the geometric rate of convergence for the method (l), (2). of Theorem 2. For explicitness,

8. Proof

let Qol be bounded. Consider

the operator

pa,(Pad. *-(PmPa,(x) ) 1.- .) ),

P(x) =

W

which successively projects on to all m sets. Using inequality (6) m times, we obtain that the operator p(x) is non-stretching, i.e. IIP(s) - P(y) II
YII.

In addition it takes a closed convex bounded set Qa into itself.

Therefore [ll,

putting 5%= Pa,(E),

of a Hilbert space

121 it is a stationary ioint x1 E QaJ.

. . . , F,, =

f’,,

from the condition PG,)

(Zm-~),

=

x1 we obtain Pa,(&) =%I, i.e. the points xl, ._., FIRsatisfy the conditions of the theorem. Furthermore, for the method cl’), (2) we have ~lxitkm

-_ fill2

-

p+km+i

_

112=

his,,

-

xi+km+lII

_ x:itkm+i,

-

_

IJZ~+~ _

+

2(x’+km+’

X

IIZi+l - iT;;_ilI - (Xi+km+i - Xifkm, Ei+i - Zi) ] >

>

zjl(2) _

2 (xi+km

(llxi+k”

+

zifi,

fi+i

_ zi)

(llxi+km - xi+km+iIj -

+

xi+kmfi

2[ IIxi+km

z&+~)

+

_ xi+km+iII

x

lIZi+* .- ,ill)“.

Hence ll~i+~m- %ll > 112 i+kn+i - Zi+iII. Consequently, the quantity llri+knr_ ziII monotonically decreases and as k -, OI it approaches a limit which is independent of i. Therefore in the inequality obtained above the left-hand part approaches zero as k -. co, the thereforellzi+km Zi+km+iII+ IIz~+~- iill, and also llri+km- ri+km+lIIIl~i+i- ~?=uill -

(2’r+km+i _

xi+km

,

Zi+l

Xi+kmlla ~~;~k:zp._

Xi+kmll

(xi+km+i-

Xi+km,

-

+ -

Zi)

--t

IIs~+~ IIfi-i

fi+i

-

-Zi)

0. But II (Xi+km+i ~~112

-

Z(x”+km+i

-

Xi+km)

-

_

Xi+km,

zi+i

(fi+i _

zi)

Z;;ui) 11’ = =

+2[ll&+km+i -- xi+kml\(\5i+i - ~~(10, therefore xi+km+i- xi*km--t Ei+l - Zi.

Zi11)2

] 4

Furthermore. the sequence xltknt, k = 0. 1, . . . , is bounded; hence there exists a subsequence which is weakly convergent to some point X,. Since Qal is convex and closed, and therefore it is weakly closed, and xltkm E

Qa,, therefore

also i, E Q,,. Using the convergence $+-km-

xi+km-t& - fi, proved above it is easy to show that the corresponding

The

nethod

of projections

subsequence x Ilfkn weakly converges to

13

A = Pa, (21) = 5~4+ i!g - fi.

weak convergence of the subsemence x i+km to 4i = P,,(&_t) “i

-

;;-I

is proved similarly.

Pa, (%=J = %Fz+ zi -

The

= f+r +

Hence it follows that

zm = 5Ti+ “r: pi+* -

Z$) + 2%-

z,

= ff.

i=i

Thus, the points x”i satigfy . . . , m. -

1, Pa,(&)

=

&,

the conditions

Pa i+i$Zi) = i?i+t, i =z 1,

snd W@ can put Xui= Xi.

IFheweak convergence

-55;;uill Of all sequencesXi+km t0 Xi follows from the fact that 1/$9+km monotonically

decreases with increasing k

(which has been shown earlier).

Now let all sets Qai be uniformly convex. If all points xl, . . . , x,,, coincide, the strong convergence of the method follows from Theorem 1. Otherwise there is a uuifonnlv convex set C&., for which Xj f Gj-1 (this happens in the case remarked on in note 2 ofjthe theorem). Then Xj is a boundary point of the set Qo. (since fj = Paj (Sj-i), Sj-i@ Qaj). But if the sequence of points .xi’km’ 1s ’ weakly convergent to a boundary point of the uniformly convex set, it is also strongly convergent ([131, Theorem 3). Therefore .j+k= * _j_ But in view of the continuity of the projection operator (6)) x itkm + xi for all i. Finally if all sets Qoi, except possibly

one, are strongly convex,

among these sets there is one such that xj # xj-l* close to “j-1, x, Y s Qaj I, sufficiently

Then for

ofrc, Qocj) > p > 0, p(y, Qaj) 2 p > 0; llPaj (x) -Pa,

(y) II <

qlla: -

yll, q <

therefore

1. As a result

i.e.

%,j-l

@

‘~,i

*

we shall have from (6”) we obtain operator (14) is com-

pressing in the neighbourhood of the point Zj_1, But all xj-ltka are near “j-1 for fairly large k. Therefore the principle of compressed mappings is applicable, from which it follows that .jtkm -( Xi at the rate of a geometric progression. But then (in view of (6)) for every i xitkm will convergeto Xi at the sme rate,

3. Rate of convergence As we shall see from the examples given below, each step of the successive projection method can be realized very simply: the amount of calculation associated with one iteration is usually small. Apart from this the method is stable with respect to error. However, a serious

14

L.C. Cubin et at.

drawback of the method may appear to be its slow convergence. As can be seen from the simplest examples the rate of convergence is sometimes actually small. For example, for set over the two-dimensional plane

FIG.

2.

{(xi, x2) : 22 < 0). ft can be al 01, Q2= Qi = { (51, x2) :x2 a ai??, proved that for any initial approximation the method converges to the point (0, 0) at a rate slower than that of sny geometrical progression (namely. at the rate of the order of l/n). This case, however, is fairly typical. Indeed, if the original problem is to minimize the functional

f(x)

over the set Q1, while f =

in1 f(rf

is known, the problem is re-

duoed to finding any point of the set Q = Qt f-j Qg, where Q2 = {x: j(3) < j”} . Here normally Q consists of a unique point, so that condi tio;i fa) of Theorem I is not applicable and the geometric progression rate of convergence cannot be asserted. However, even when the geometric progression rate of convergence follows from Theorem.1, the denominator

FIG. 3. of the progression may be nearly unity= This is Fig. 2 (two hyperplanes intersecting at a small is the minimization problem considered earlier, from 2, taking Q2 = (X : f(x) f f” + E, E > O>.

the case for example in angle). Another example when it is solved apart In this case QI fl Qza is usually non-empty and the rate of convergence is geometrical, However*

The

method

of

15

projections

as follows from the estimates obtained in proving Theorem 1, the denominator of the progression is nearer unity, the smaller 6, i.e. the smaller E. The situation is similar with method (1”)) (3’): if Theorem 3 is applicable, the number of sets is finite, but may be very large. Because of this the problem of accelerating the rate of convergence is important. We shall consider one possible way of achieving this. We shall only deal with the case of two sets. Thus, we want to find x* E Q = Q1 fl Q2, We form, Lc3=

beginning

from x0,

the sequence

ri = P;(9),

22 =

p2(r1),

Pl(X"), Lz+ = xi+ a(s3-xi), 11x*-

A=

(51 -

x2112

x3, xi

-

(151

x2) *

Here Pi is a projection on to ?i, the first three points coincide with those obtained in method (l’), (2) and the value of A is so chosen that of the straight line (x1, x3) with the point x4 lies at the intersection a hyperplane which supports Q2 at the x2 (see Fig. 3). Further points are obtained in a similar manner: x5 =

x’ = Pi(X6),

x6 = PP (x5),

Pi(b),

11x5-

h=------

x8=x5+h(x’-x5),

x6112

(x5-27,x5-56)

etc.

Let us now prove

Theorem

the convergence

of this

VI n Q20 (01Q10 n

(b) Q2 (or (c)

method.

4

Let Q1 and Q2 be convex and closed, Q = Q1 any of the following conditions be satisfied: (a)

,’

Q2)

Q1) is uniformly

E is finite

flQ2 be non-empty and let

is non-empty; convex;

dimensional.

Then in the method described

above xn + X* E 3.

Proof. We shall use the ssme scheme as in proving Theorem 1. At first we shall prove that I[xR+~-xl! < llxn -xz(I~ for all n and for any x E (2. For n = 0. 1,

2, 4, 5, 6. 3,

. . . this

follows

from Lemma 3. It remSinS

L.G.

16

to prove peatedly,

NI inequality we obtain

Gubin

of the type

et

(Iti -

(Xi-

x3, xi

-

=

x2) ,

=

llxi-

x2112-

(x2

511<

119 -211.

Using (6)

re-

llxi - x2112

11x*- X2112

a=

al.

-

(x’

-

x2, xi

IIS’

-

x2112

x3lx2

-

x3)

-

x2)+(x2

-

x3,

xi

-

x2)

=

51 +

(x2

-

~3,

XI

-

~3)

>‘I ’

IIS’

z2112

-

52112 =



+ h(x3 -x’)) = lld - lc2ll2+ h(z’ 2’) = 0; -9, ti (xi-9, 3+-z) = (39-a+, x4 - 2) + (x2 - x3, x4 - 5) = (2’ - 9, 33 Sk-2) = (9” - x2) + (2’ -x2, 9 - cz) + ( d-3.3, 33-3) > (1-x3, - Lz+_+--9) + (33-33, 9-z) > (h-l)(P9,s - 2’) B 0; 1133 - zp = llti - 3jp + lb.+- 54112 + 2(x4 - 5, 33 - 9) = Ilti - 412 + + (Iti - xqp + 2(a - 1) (x’ - ti, 33 - 3) > lIti - xl12. (x’

-

9,

&

-

x2)

Furthermore

=

(x’

-

x2, .xf -x2

we show that

0(x”)

In the same way as in the proof pw(xn, Q) Q2) <

-

P~(P+~,

=

msx(p(sn,

Qt), p(P,

of Lemma4 we obtain,

Q) = L2, that 6, + 0 and p (x0,

Q2)) + 0.

putting QI)

<

61,

p(z’,

P (x2, Vi) < 63, p(ti, Q) < 64, P (x5, Q) < 6?, etc., therefore into account that p (~3, @) = ~(“3, Qi) = p(x5, Qi) = . . . = 0 and

62,

taking

p (x2, Q2) = p (x6, Q2) =

@(x4) < p (ti,

. . . = 0, we obtain

(D(9)

< 62, CD(S) < da,

Qt) + p (x5, Qz) < 64 + (65,@(x5) < 65, i. e. in fact

uJ(x”) -a 0. With this we have proved (7) and (8) for this method. Lemmas 5 and 6 are applicable, which proves the theorem.

Therefore

We shall not obtain an estimate of the rate of convergence for this method but will merely show with two examples that it does in fact accelerate the convergence. The first example is a case of linear constraints. It is easy to see that if Q1 and !& are hyperplsnes (or halfspaces), the point x4 is already a solution (as we have seen the original method may converge very slowly for this case, see Fig. 2). In the case of sets over a two-dimensional plane Qi = { (xi,x2) : ~2 > aq2, a > 0), : x2 < 0}, considered earlier, it is not difficult to obQz = {(aix2) tain

the estimate

the convergence Thus it

Il~+ll <

(l/2n)

of the original

lls”ll,

for the accelerated

method is extremely

is to be hoped that this

simple method of

method, whilst

slow. accelerating

of projections

The nethod

17

convergence is fairly effective. We also remark that the idea of this method can be carried over to the case when the intersection of m (and not two) sets is required. In method (l’), (2) (or (l’), (3’)) for this purpose so-called accelerating steps of the type (15) have to be made for various pairs of sets. The order of choosing these pairs may be of various kinds. In particular they can be chosen such that for the case of linear constraints an accurate solution is obtained in a finite number of steps.

4. Examples and applications We shall consider the various forms of the method of successive j ections through several problems. 1.

Solution

are given

where co E is a set of The problem ing a point a half-space. h,(x)=

of

a set

of

linear

. In a Hilbert

inequalities

linear

scalar product, aa: is a number, and A infinite). We shall assume that c&O. inequalities is equivalent to finawhere each

It is easy to verify

ha,

li, = ((h,

Quc=

5) -

if if

k)llhl12,

The method (l’), (3’) - the method of projecting restriction - takes the form xn+i =

xn-( (Cqn),xn) -

{CT: (ca, Z) <

a,}

is

that

x, x -

space ,F suppose we

inequalities

E, (c, x) denotes the subscripts (finite or of solving this set of of the set Q =,?p,

(

pro-

aqn,)Cqn)/llCqn)ll?

xn,

(ca, z) < aa, (C,,Z)>

tZa;

on to the furthest

if

(Cw, x*1 > aq4t

if

(h(n),

Xn)

<

%nb

(17) where a(n)

is determined

from the condition

(3’):

(cmq P(x~,Q~~))=@(X~)=W& (here

we assume that the supremum is reached).

II4l

- act

(18)

18

L.C.

Gubin

et

al.

Let us now consider another variant, that of projecting on to the restriction which is least satisfied, where xntl is computed according to (17), and a(n) is chosen from the condition ( Cqn),

P) -

sup 1(Cat zn) - 4.

Uqn) =

(j9)

USA We show that choosing

if

n(n)

0 <

condition

$c,ll <

(3) will

<

BEA

(cqn),

zn) -

a%

m Applying Theorem

with such a method of

M, a ~-4,

be satisfied.

Indeed,

(cav ;I;,,aa SUP (Ca,P)m -

a-J(z”) = sup =

m <

<

=

WEA

M -

@w,

\rn

Theorem 1 we obtain

f&c

4

-

acr(,)

IIcqn,ll

the following

=$4w2atn,).

theorem.

5

Let the system (16) have a solution and let any of the following conditions be satisfied: (a) Q” is non-empty; (b) E is finite dimensional; (c) A is finite. Then the method (17), (18) converges to a solution, and if

m <

llcall <

(a) and (c)

the method (17))

M,

the rate of convergence

(19) also

converges.

In the cases

is geometrical.

The problem considered in this section has numerous applications. Two of these will be given below in examples 2 and 4. In addition, linear programming and the solution of sets of linear equations can be reduced to this problem. We shall not however discuss these here since these applications are well known (see for example [2 - 5, 141). 2. The problem of the best uniform approximation. Over the interval [a, bl let a continuous function p(t) be given. We are required to find its best approximation (in the Chebyshev sense) by a set of known functions cpl(t), . . . . q+,,(t), continuous over [a, bl. In other words, we seek the coefficients x1, . . . , x,, minimizing

f(z)= We specify equalities

$a&

/rp(~~&~i(~) i=i

some number A > 0, and we shall

--
&i’pi(%h, i=i

1.

seek the solution

a
of the in-

(20)

The

method

of

19

projections

If the solution of (20) exists, A> f * = inf f(x); if it does not exist, case h can be reduced, and in the second it can be A < f*. In the first increased. In this wsy we can obtain the solution of the original problem with any accuracy. Thus the problem has been reduced to the solution of an auxiliary problem of the kind (20). But such a problem is a specia case of problem (16) considered above, if for A we take the double interval de[a, bl, a = t, ca.= (vi(t) ,..., (pm(t)) or ca.= --(vi(t), . . . . q,(t)), pending on whether the leftor right-hand part of the inequality in Therefore the method (17) takes the form (20) is under consideration.

I Cl Xin

Xi

nil

xsian(q(L)--i

where t,

is

xin(Pi(tn)

I]

!

5

($Ji2(t))-L

xi”rFi(t7z)),

if

1 q(L)-~

found from a condition

$j

Xi”C@(tn)

I( $

Xi”(&(tn)

1 >h,

of the type

(18)

~iZ(t*))p”*=

i=i

i=l

= or of the type

X

iI,

i=i

] Cp(tn)-

cpi(tn)

i=i

i=i

I

=

Ii

TCtn)-

lb-

max

a
[I q(t)-iXi”qi(t) / ( $j

q$(t))+]

i=i

i=i

(19)

m

If

2

q$(L) >

0,

a S t<

b, the convergence

of the methods follows

i=i

from Theorem 5 (E is finite dimensional), and if h 7 f*, the methods converge at a geometric rate (since in this case Q” is non-empty). This method of solution is extremely simple: within each step it requires only that the maximum of a numerical function over sn interval should be found.

L.C.

20

3.

The problem

function strictions 1)

o,f optimal

u(t) ELz’[O, T],

Gubin

control.

u(t) =

et

al.

It is required to find a vector

(ui(t), . . . ,u,(t)),

satisfying

re-

of the following kind:

Gi = {u : IlUll Q C}, where IlUll = j ri; n?(t)dt, 0

c > 0;

i=i

2) Qz = {U : u(t) EM for almost all t. O< 2
where M is a closed

d E Em 31 Q3= {zz : s(T) = d}, where z(t) = (s,(t), . . .,zm(t)), x(t) and u(t) are related through the differential equation

dx

Ax+Bu,

-= dt

x (0) = x0,

while

(21)

where A is an m x m matrix, and B is an r x m matrix. Certain problems of optimal control csn be reduced to this problem. For example, in a problem concerning operating speed with restrictions Qr, QS, QJ we can fix the value of T and seek a control function u(t), satisfying the restrictions. If such a u(t) is found, T is not less than the optimum, while if it is not found, it is greater than the optimum. In the other problem - that of minimizing fi urz(t)dt 0 +=i

under the restrictions Qz, Q3 - it is possible to specify sn approximate value of the functional, equal to c, and to reduce the problem to finding the intersections Q1 n QZ n Q3. We write down the projections Qz, QJ in a Hilber space Lzr(O, are lengthy, although simple) :

Pi(U) = p2(u) = bfd(U(t)), where set M at each instant t.

. . . . r, CF< t
of an arbitrary u(t) on to the set Q1, (we shall omit the proofs, since they

T)

U,

if

C~/IMI,

if

PM is a projection

For example, if

we have P*(U) = u(t),

Ilull G C, II4 > c, on to a finite

dimensional

Q2 = {u : \ur(t) 1 < 1, i = i, where

The method

1, pi(t)

=

W(t)9

i

Pa(u) = z>(t) = u-B**,

of

21

projections

if if

@i(t) > 1, (uj@)

-4, if

I< 1,

w(t) < -1,

where i&I&=

--A’*,

and v(t)

condition a(T) = d, where dx/dt = Ax t &I, x(O) = c is possible to solve the linear boundary problem

d9

dx

-A*$,

t=

In other words, it

x(0) = c,

z=Ax+Bu-BB'\p,

is chosen from

x(T)

=

d.

Thus the projection on to each of the sets Q1,Qz,QS is found without difficulty. Therefore, it is possible to use the method of successive project ion (l), (2) or (1). (3), each step of which can be realized simply. Let us now give a result concerning the convergence of the method for various sets of restrictions (we shall denote by Q the set to which u must belong). Theoren

6

Let any of the following and there exists u(t),

conditions be satisfied:

for which

/lUll
(1) C?= Qr n QJ

u Z(T) = d;(3) Q = Qz

n Q3,

and the system (21) is not degenerate [131, A is bounded and there exists u(t), for which v(t) E M for some 6 > 0 for almost all t, if Ilc(t) -

E(t)11 6 6, Z(T) = d; (3) Q =

to condition

Q1 ~Q~~Q~

and, in addition

(2), IlUll< c.

Then in the method of successive projection lim /IV - ball = 0, where u* EZQ. ?%In

fact,

Q.1,Qz, and Q3 are convex closed sets in L,‘.

In case (1)

QIO n QJ is non-empty, i.e.

condition (a) of Theorem 1 is satisfied. In cases (2) and (3) condition (a) is not satisfied, but it can be shown that Lemma5 is true. Since conditions (a) - (d) of Theorem 1 are used only in proving Lemma5, Theorem 1 is valid also in cases (2) and (3). In this way Theorem 6 is proved. 4. The pigpen

of optimaI

control

with

restrictions

over

the

phase

22

L.G.

Gubin

et

al.

coordinates. Let it be required, apart from the restrictions that the control u(t) should belong to the set Q;=

where given

{u: (q(t),

z(t))

q(t) = {e(t), . . . , q,,‘(t)} numerical

<

b(t),

is a given

0
<

Q1, Q2, Q3,

V,

function,

b(t)

is

a

function, m

(W)~s(t))= 2

Qi(l)X&),

while x(t) and u(t) are related as before through equation (21). The projection on to Q4 is difficult to find, but a different approach is possible. The set Q4 can be represented as the intersection of a continuum of sets Qt of the kind

Qt={u:(q(t), x(t)) Each set Qt is a half-space.

x(t)= where F(t) e-At,

b(t)},

4

ct =

Ct (4

at =

matrix),

of the system dx/dt

=xt (2)8’ (4 z) q(z)

b(t)-(q(t),

=

Therefore

)

{,‘I ;,“;;

xtw=

F(qxo);

uw, u(z)=

(F(t)

mere

Thus Q4 has the form (KS), where a = t, on to Qt has the form

=

= As

@(t, 2) =B’F(t)F-l(t).

0

pQ t (u)

Q~. Tl

i s(t,T)U(t)dt 0

+

is the fundamental matrix

QI = {u: (ct, U)L, < at},

s[O:

Indeed,

F(t)xo

if A is a constant

n

Q=

u(~)-[(q(t),x(t))--b(t)l[

x xt(+w+7(~),

i=l

A = LO, ~1. The projection

if

(q(t),X(t))

j ll~‘(t,r)q(t)l12d~]-i 0

if

(q(%W)>

G b(t),

x

W).

The

Therefore

cal

of

confine

22

projections

the method of successive

Qz 0 n Qt (we shall since

method

for Q = Qz

projection

ourselves

to these

cases

for

simplicity,

the presence of the sets Q1 and Q3 does not introduce cifficulties) takes the form

p+i

P2(4

=

{ where t,

PC+

is determined

9 n

tun19

if

p(un,Q2)>

p(umQtn)

if

p(u”,

p(@,

from a condition

[(q(ln),zn(tn))-b(ln)l

Q2) <

of the type

fl Q4 =

any theoreti-

Qt,),

(18):

( (jlie*(ln,r)q(ln),,2dr)-1~= 0

w%

end the problem is (if q(t) and b(t) are continuous, (q(O),x0) -K More simply the instant non-degenerate [131, this maximum is achieved). on to the least satisfied t, is determined in the method of projection restriction (see (19)):

Finally,

b(t),

if O,C

1 follows

E(t) EM,

0 <

t <

T,

exists,

such that

(q(t),

x(t))

<

t ,(T, it follows that Q2 n Q4” is non-empty and from Theorem the strong convergence of the methods described.

projection method given in [151 The version of the successive _ _ is more complicated for this problem. Apart fromthis, the paper 1151 proves only the weak convergence of that method. Tram

lated

by

H.F.

Cleaves

REFERENCES 1.

POLYAK, B.T. Existence theorems and convergence of minimizing sequences in extremum problems with restrictions. Dokl. Akad Nauk SSSR, 166, 8, 287 - 290. 1966.

2.

AGMON.S. The relaxation 6. 3, 392 - 393, 1954.

method for

linear

inequalities.

Cnn.J.~~nth.

24

3.

4.

L.G.

MOTZKIN. T.S. equalities. EREMIN, Usp.

al.

and SBOENBERG, I. The relaxation J. Math. 6, 3, 393 - 404,

Can.

I. I. mat.

et

Gubin

Generalization 20. 2,

of the Motskin-Agmon 183 - 188, 1965.

Nauk,

5.

BREGMAN, L;M. Finding the of successive projection. 490. 1965.

6.

CHENN, 1. and GOLDSTEIN, A.A. Proximity Am. Math. Sot. 10, 3, 448 - 450, 1959.

7.

LEVITIN, E.S. strictions.

linear

relaxation

maps for

convex

sets.

and PGLYAK, B.T. Methods of minimizing under vychisl. .Mat. mat. Fiz. 6, 5, 78’7 - 823,

Closest-point 3, 212 - 225.

9.

POLYAK, B.T. A general method of solving Akad. Nauk SSSR, 114, 33 - 36, 1967.

10.

BOURBAKI, N. Topological 1959.

11. BROWDER, F.E. Ftoc. natn.

maps and their 1965.

vector

spaces,

Nonexpansive nonlinear Acad. Sci. U.S.A. 54,

GDBDE, D. Zum Prinzip - 4, 251

der

- 258,

in-

method,

products.

Nieuw.

extremum

problems.

Izd.-vo

in.

Arch.

Do& 1.

Moscow,

lit.

operators in a Banach 4, 1041 - 1044, 1965.

kontraktiven

Abbildung.

Proc.

re1966.

Zh.

STILES, W.J. Wisk., 13,

3

for

common point of convex sets by the method Dokl. A&ad. Nauk SSSR, 162, 3, 487 -

8.

12.

method 1954.

space.

Nachr.

lath.

30,

1965.

13.

LEVITIN, E.S. and POLYAK, B.T. On the convergence of minimizing sequences in conditional extremum problems. Dokl. Akad. Nauk SSSR, 168, 5, 997 - 1000, 1966.

14.

TOMPKINS, C. Projection linear

15.

Progr.,

2,

methods 425

VAISBORD, E.M. The method mate solution mat.

Fiz.

6,

of 6,

-

448,

in calculation. Proc. Washington, 1955.

of successive

an optimal

971 - 980,

control

1966.

projection problem.

Zh.

for

II

Symp.

the

vychis

approxi-

1.

!vlat .