Mixed strategies in two-person positional games
11. ROZENMULLER, I., Cooperative games and markets (Russian
45
translation),Mir, Moscow, 1974.
12. PELEG, B., An inductive method for constructing minimal balanced collections of finite sets, Naval Res Logist. Quart., 12, No. 2, 155-162, 1965. 13. OWEN, G., Game theory, Saunders, New York, 1968. 14. SHAPLEY, L. S., Cores of convex games, Internat. J. Game Theory, 1, No. 1, 1 l-26,
197 1.
MIXED STRATEGIES IN TWO-PERSON POSITIONAL GAMES WITH INFORMATION EXCHANGE*
T.N.DANIL'CHENKO Moscow (Received 20February 1975) THE USE of mixed strategies is considered in a class of two-person positional games in which the first player immediately tells the second of his strategy function at all n moves. The first player’s optimal guaranteeing strategy and optimal guaranteed results are found. Kuhn’s definition [l] of a positional N-person game embraces a wide class of games. Very general assumptions can be made about the mutual amounts of information of the players at the instant of deciding any intermediate move; and accordingly, the concept of a player’s strategy is wide, including randomization of pure strategies on the whole or successive randomization of moves (a probability distribution in alternatives for each information set of the players). However, a solution of the game is understood in quite a narrow sense, namely, as a search for equilibrium strategies. At the present time, however, especially in two-person games (N= 2) the principle of the maximum guaranteed result is being widely used. Study is evolving in two directions from the point of view of strategies: the use of mixed strategies in two-person games, where the first player expects information to reach him just once about the second player’s strategies vector [2] , and the discussion of dynamic two-person games in pure strategies with a sequential step-by-step inflow of information about the opponent’s strategies vector [3,4] . In the present paper an attempt is made to combine these two trends. Dynamic two-person games with perfect information (the players know the prehistory) are considered, under different assumptions about the knowledge of concrete realizations of the players’ choices. The following method of randomization is used: at the j-th move each player chooses a probability distribution on alternatives of the j-th move, j = 1,2, . . . , n. The first player’s strategy may be communicated to the second in different ways: a priori communication of the strategies immediately at all n steps of the game; step-by-step communication of the strategies only at each j-th move in turn; and various combinations of the a priori and step-by-step methods of strategy communication at the n interaction steps. We shah confine ourselves to finding the optimal guaranteed result and the fust player’s optimal strategies, in the case when the first player tells the second of his strategy-functions immediately at all steps of the game, and the second player is cautious, and confines himself to the principle of the maximum guaranteed result.
*Zh. vychisl. Mat. mat. Fiz., 16,5,
1136-1145,
1976.
T. N. Danil’chenko
46
1. Formulation of the problem Let the sets of player’s choices at the j-th step be specified by the metric compacta X, Yj, j=l, 2,. . . , n; ff (x, y) , i=l, 2, are the continuous pay-off functions, specified on XX Y. Here,
x=(t,, . . . 5*)=X,
Y=(Yi,
)
n
X=
YlEY,9
* * *, YJEY,
V-5,
=II ,=i II
Xj, IIj=i
Y
Yj.
Let Pi, & denote the sets of all probability measures in Xi, Yi respectively (or what amounts to the same thing, the first and second player’s sets of mixed strategies at the j-th move). The respective mixed strategies at the j-th move are PI, EP, and Q”,EQj . For simplicity it will be assumed throughout that j$ =p), Qqj =Qj. We introduce x,= (51, . . . ,x3, J’j’ ($I?, . - - , Yj>, IGjGn. Here, pi, qj are the distribution laws for the Pj’(Pi, ***, Pj)9 Qj=(Qi,..*tQj)9 quantities xi, vj respectively. We assume that each player agrees to take as his criterion the mathematical expectation of his pay-off. We consider the strategy in which each player’s choice is made successively in n steps, at each of which pj=Pi, qj=Qj, j=l, 2,. . . , n. Both players try to obtain the maximum guaranteed result, and the fust player knows that this is the second player’s principle of behaviour. The result of the game depends in an essential way on whether or not the players have at their disposal information about the concrete realizations of their opponent’s choices. We consider the case when the first player knows at the j-th move both the mixed strategy qj?Qj, and the concrete realization of the second player’s choice YjEYj, and also the over-all prehistory pj-1, Yj-1, l
Q*),
P”j=Pj(Pj-r, Yj, qj),
2GjGn.
(Here, p”j=pj‘(pj-1, Yj, qj) is the distribution law of the quantity xi.) We shall consider the problem when the first player tells the second immediately at all n moves his strategy functions
P(Y,S>={Fi=Pl
(Yi, qi), p”3=Pj(Pj-1, Yjl CIJ7 2sj+
(here and below; p= (pi,. . . ,P”), q= (q+, . . . , qn> ), after which the second player maximizes his own pay-off, which now depends solely on his choice, i.e., on the choice q= (q,, . . . , q,J . If the first player decided to use the functions p=p(Y,
q)={~l=Pi(Y,,
41); p”j’Pj(Pj-it
Yj, QJ, 2Gj+,
and the second chose the mixed strategy q= (Q~,. . . , q,,), where qjEQ1, lGj
then the
47
Mixed strategiesin two-personpositionalgames
(The assumed type of strategies is such that the order of integration can be changed.)
2. Lemma about the second player’s guaranteed result Let us find the result Lo which can be independently achieved by the second player by using mixed strategies, while confining himself to the principle of the guaranteed result. Let Fi be the set of all strategies of the type ~j=p~(pj-~, Jrj, ~j), and 5’ the set of all strategies of the type p”,‘=pj’(p+1, yj), IfjGn, where j&, p”~’are any functions of the arguments indicated, with values in Pi. Notice that the set @ is only pad of the set P,, l
PO,
qo,
YO
symbolize the absence of an argument.
1
If both players agree to use mixed strategies and during the game, the second has no information about the concrete realizations of the choices xi of the first, while the first, in addition to information about the mixed strategies of the second Q,EQ, counts on having information about the concrete realizations yjezYj, then Lo = max min.. YlEYl
PIEPI
. mas U,,EY
min 12P”EPn
I x,
...
J
fz(x,y)dp,..
.dp,.
Y,
Proof: Under the conditions described, the second player can guarantee himself the following result, whatever the actions of the first player: Lo’ = rn: max F, (t, q). P
Let us show that Lo’=Lo.
In
fact,
On the other hand, since the set of second player’s pure strategies is part of his set of mixec strategies, we have
T. N. Danil’chenko
Lo* =
min $jE
max I... 1 [s... \fP(x,
l
Pj,
Y,
q
Y,
min
>
Xl
max y i..;s
?;jzPj[Pj-l, YjgB(Y~)I, i
min
= Pj’E
max y I..;!
l
Pj’,
Mx,Y)d~,..&
*-?a
fz (x, Y) &’
x
r)d”p,...d~~]dq,...dq,
x,
. . . &I’.
72
Here and below, IQ) denotes the choice of y with probability 1. Hence,
= max min . . . max min s . . . s fz(x, y)dp,‘...d& YGYl 5,’ %IEY?I Pn’ Xl x, = max min...max
I/EYl PGPI U&Yn
min PnEPn
1...1 f2 (x, y) dp,
&
. . . dp, = Lo.
x,
Notice that Lo can be realized by the second player on pure strategies. The lemma is proved.
3. Auxiliary constructions We introduce the quantities
L(P,
q) =Fz(P, q),
Lj (Pi, qj) =
min max qjtlEQj+l PjtlE Pjtl
Lj+l
(Pi,
Pjtls qjtl)s
O\< j\
(3.1)
Let L,’ (p, y) =j Xl ~5’ (ph Yj) =
. . . s fi (x, y) dpn . . . dpi, Xl& max uj+i EYjti
min Li+i b+i, PHleP1ti
Yhi 1 ,
OGjGn-4.
(3.2)
By the same method as was used for proving Lemma 1, we can show that Lj(pj, Q)=J . . .J Lj’(pj, yj)dqj.. . dqs, Y’1 YJ
lGj
Lo=Lo’.
(3 *3)
Further, let Pj’(Pj-l, Yj) = {piEPjl
min Lj’(Pj7 Yj)‘L,‘(Pj-i7 PjEPj
Pi7 Yj)}9
IGjGn,
(3 -4)
49
Mixed strategies in two-person positional games
(3.5)
Ei’(Pj-1~Yj-1)={Zj~EYjl~~‘(pj-l,P~,Yf~,~j)=~j-*(Pj-*~Yj-l)}t
l
while on the other hand, from (3.3), (3.4) we have
i.e.,
minLj (Pj_1, pj~~
~jl
Sj)> Lj
(Pj_1,
Pj’7qj)7
l
j
The strategies pl* (p,-,, y,), j+l
L, (PI, qj) * We define the sets Ej(P+*, Yj-r)={qFB(YJ
IYFEjl(Pj-l,
Yj-l)}, lGj
(3 -6)
(the Ei’ are given by (3.5)). Assume that the first player has used a strategyp (Y, q) = {pj =Pj (Pj-1, Yj, qj), lGj
(Otherwise, the apparatus of e-strategies has to be introduced.) Given the first player’s strategy p(y, q), the second chooses any strategy qd the choice of strategies p (y, q) and qd?(p) we have
L,‘a
(p) . Given
max L(pi, sd,
(3.7)
0<,<7t--i
since it follows from (3.1)-(3.3) and (3.6) that,
only
by choosing Q&‘j(Pj-i, qi); O
can the second player guarantee himself the pay-off L”>Li(pi,
Yj-i), i+l
50
T. N. Danil’chenko
We put
(3.8)
Let M,/(P,Y)=
Mj'
J... JMGYML..~P,, xi XII
(Pj, Yj) =
inf , “j+lcZEj+l
(Pj* yj)
We can show, by the same method as was used in the proof of Lemma 1, that
Mj (pjpa) =
J .. JM~’ (pj,yj) dqj a
r1
s.
.
IG~G~,
dq,,
M~=M,‘.
(3.10)
rI
Let Py (Pj-*, YJ) realize
max Jf,’ (P,, YJ =M,‘(P,-i,
iqez,
PP, Yj),
(3.11)
P,EPj
where
maxMj (Pi_1'Pi’ qj) =
pj~~j
inf QjEEj(Pj-1,
Yj-1)
Mj(Pj_19
py,
qj), py -from(3.11); inf
MI (Pj-I9 Pj”, Qj) =
4jEgj’tPj-1s
J!fj(Pj_lr Pj”CSj), Yj-1)
i
We introduce the sets and quantities
D, =
(PA Qr)
SUP
Kj=
(3.12)
PiEPi, q,EQ<, l
M,(P,, qd,
(p, q>)=‘J) -m7
if
Dj+@, (3.13)
if
DI=O,
IGjGn.
We have: Lemma 2.1
If D,+ 0, for some j, then MoGKj. 2. If. for some i and j such that i < j, we have D,P0
and DjZ0,
then KSKj.
51
Mixed strategiesin two-personpositionalgames
Proo$ Let us prove assertion 2 (the proof of assertion 1 is similar). We have the inclusions t, (p,_-i,
(pm_,, y,+)
gm’
>L&_l(p,_,,
Yma?I’(Pm-~~
={pd=P,,
Jr,_,)
Y-1))
{pm=Pm, Y?J+Y~I~~ (pmym)
=
ym--i)}Et,“(Pm-~,Ym-l)=
Since D,+0, DjZ@, we obtain from (3.9), (3.10), (3.12), and (3.13), in the light of these inclusions, the chain of inequalities
inf
... “JeEj’
“P
(Pj-1, Yj-1) PjGPj
Mi’ (Pj, Yj)l ‘qi ...
aup aup (Pi*$)Eni &+l, qi+$)Qq+l(pi,
=G
.
sup
Pi)
' ' (Pj.QjE"j (Pj-1,Sj_1) I11 y ' *. 1
* ’ ’ dql
Mj’(pj, Yj) dqj
s
. . . de]
yi
aup
<
LPj,qj)EEj'i(Pjs Crj)lLj(Pj, Pi)>/ max L,
Mi(Pj7 Sj)
(P,. 9,))
O
=
(Pj, qj) = Kj*
cP,SyTEDMj
3’1
j
If, for arbitrary j, 1
4. Optimal strategy
=p;, ifqi I
We define the strategies Pi
.* = Pi=P:, P
The
E
i
El (pi_,,
Y&
I\< i < n,
Pia,
if if
q1 e El, qi’=% (pi_,, y&,
Pi’, r -I- 1
if
Q,,, e E,, (p,, y,).
point (P,r”, qi?) =(pljo, . . . , p,:, qi?, . . . , qd), Lj (pjj’, Q”)>
max
lGj
I
is such that
Lt (piio, Psfo) and Mj(pti”, qjjo) >Kj-8
oct
(here, pIjo, qij” are the distributions of the quantities xi,yi respectively);
(4.1)
52
T. N. Danit’chenko
i
pa
=
i
if
,“:;
if
a ’ P,=Pa*71
p,=
if
p& ( Pa’, m+f\(a
if if if
(4.2)
if if
(Ptl
We have: Theorem
Under the game conditions stated, the first player’s ~-optimal gu~anteed result is: max i, the strategy pt.8 of (4.2), (4.3) guarantees, possibly with r,
1) y-K, where j= e-accuracy, the result Kj; 2) y=Mo, if, for any MO is obtained.
i,
1 Qi
we
have D,=%; the strategy p** of (4.1) guar~tees that
we haveD,=0. Then, given any strategy Proo$ 1. Assume that, for any i, I-.M,. Assertion 2 of the theorem is proved. 2. Let j=rCi,ra; ~a ,i.e, D,+0, given any strategy’&y’, q), we have
and let all the sets D,===0 for all i, j
t
1G
i Q
n.
Then,
(4.4)
Mixed strategiesin two-personpositionalgames
53
If j = n, then L,‘3
mas
L(P,, qj).
O
If the strategy p(y, q) is such that L, * = LO, then nothing prevents the second player from choosing any strategy crlEB, (p-f, yi-,), 1-Gi
(4.5)
It will next be assumed that the strategy p(y, q) is such that L,* > Lo. If, given the strategy P_(Y,_91=(pI,=Pj(P,-I, P?, qtv.--t
- ] dl), 41
yj. qj>, lGiGn)~ then, by (3.13),
%Cp>= {it, 4~~~fi)E;:{q(I>,E.R(p)
ye-cl.
[ [pi, 1.. 3
(4.6)
If
then, by (3.7) and (4.41, for i < n, Lnf=Ln--1(p,,-,. (I,‘_-1)= . . . =L, (Pjs SJ =
max
L, (Pi, $1.
Z
(4.8)
Ifi = n, then L,’ = max L, h ,
qd.
Ln this case, any strategy &,I of (4.7) is equivalent from the point of view of the second player and realizes, in the light of (4.81, L“==&JPty,
&I,>,
. * . = Lm, (pm,, qm~-f.
4d=L,r i~‘ncm,c
(Pm, %) ===.L~(Prnll%nJ =. -1 . . .
1.
(4.9)
Assume that the second player has chosen any strategy (r,,, (W from the class of strategies such that the least number in (4.9) is m. If qi,m,’==(qi’n,. . . , (I,,*=, Q g+l. . . . , Y,~“‘), then, by (4.91, L,,‘=L,,Ip(y.q:~~),Y:~‘l=L!,,(p,,”.cf,n~a~>L.(pl~,qi”),
l~iaw-4,
(4. IO)
which implies, in view of (3.17), (p,,,“‘. q,,!“‘) dl,,.
(4.11)
The strategy q’={y,“‘, $GKGD?; ~Fk,fp,-,~ yt-,jt nz-!-i<‘tGn] equivalent to the strategy qtp)(‘n). For, from the form of q’, the definition of Ez(pI-~, y.._*) and (3.3j. we have L,“ZL,[p(y, q’), q’f3L,(p,“‘. (I,,:“‘) =L,.‘,i.e.. L,,‘== L,[p(y, q’), q’]~l~,~, (p,:,“!, qS,,“‘), the assumption that L,, ip(y, q’), q’] =L,(p,. (I, ! for r
54
T. N. Danil%henko
In short, we can carry out the estimation of 7 for q’. From (3.8)-(3.10) it follows that the use of the strategy q’ by the second player does not promise the fust more than M,[P(y, q') , $1 =&I (P, m, qmm), in which case, by (4.11) and Lemma 2,
sup
1G
J&n (p?n,qnr)=Km~&*
(P&x,)ED,
Thus,
given any strategy p(y, 4) and optimal second player’s behaviour, the estimate (4.6)
holds for 7. On the other hand, we can show, using a similar method to that in (31, that the strategy ~01 of (4.2), (4.3) guarantees the first player the result Kj - E. The theorem is proved.
5. Examples and notes Let USgive some examples to illustrate our theorem (n = 2, discrete case). ExampIe I. fi (xi,
x2,
pi,
~2.) =--xi
2+~2z-zA2+iQ,
fZ(YJ,
112)=5drr2--2Y22,
zt,
Y*=Q,
3
(i=l,
2).
In this example, D,= 0, i&=0, +&=O.
1
fi=1,
Example 2. ff (G .Q, Yi, 2). Here, D2=%, D,f0 Example
x,,
3. fi ($1,
yl=O, 1 (i=t,
2).
32,
yi,
~zf=-Xi2-X22f~i2+y~2,
(M,=ft~,-~/~),
Y2) =;5Si2-Xz2+51/iz+Yz2,
Here,D2#%, D,*0
f&r,
Yi, yzf =-Xi2-~~~2+~~2,
Xi,
~0,
y=K.j=6/3.
fz(Zi,
22,
yi,
~2)
=5~~~-~~~-10yi~-Yz~;
(.~,=~cK,=~~I~~K,=~~/~),
p=K~=f7/2,
Notes. 1. In the case ji (x, y) +f2(x, y) =O (which implies that the player’s interests are opposed), we find, on following formally the procedure described above, that the optimal guaranteed result for the .first player is y= min max.., !/I= l-1
PIEPI
min max vnE.
Yn
which is in general greater than the corre~ond~
PnPPn
Jx,. . .X” J
fj(x,y)dp,...dpj,
pay-off in pure strategies.
2. In the case when the first player knows only the concrete realizations of the second’s choices, the first player can, as in [S] , disregard the possibility of the second using mixed strategies while he fmds the guaranteed and optimal guaranteed results and the corresponding strategy. 3. If the first player knows only the second’s mixed strategies, the problem is obtained [3] with the pay-off functions i=s112; wp,q)=J J . ..JJfl(x.~)dp,d~,...dpidpl, Y, x,
Y,K”
p=(Pl, *a*, Pnf, q==(qJ, -**, q,,): pp==P, respectively at the j-th move, 1
q,=Q,
are the first and second player’s strategies
Problems of the transport type
55
The author thanks Yu. B. Germeier, K. K. Mosevich, and A. F. Kononenko for valuable discussions and comments. Translated by
D. E. Brown
REFERENCES 1.
KUHN, G. U., Positional games and the information problem, in: Positionalgames (Russian translations), Nauka, 13-40, Moscow, 1967.
2.
KONONENICO, A. F., and KUKUSHKIN, N. S., Mixed strategies in games with a fixed sequence of moves, Dokl. Akad. Nauk SSSR, 209, No. 6,1274-1277.1973.
3.
DANIL’CHENKO, T. N., and MOSEVICH, K. K., Multistep two-person games with a fixed sequence of moves, Zh. uychisl.Mat. mat. Fiz., 14, No. 4,1047-1052, 1974.
4.
DANIL’CHENKO, T. N., and MOSEVICH, K. K., Multistep two-person games with unopposed interests, Dokl. Akad. Nauk SSSR, 211, No. 6,1273-12751973.
5.
GERMEIER, Yu. B., Games with unopposed interests (Igry s neprotivopolozhnymi Mosk. un-ta, Moscow, 1972.
interesami), Izd-vo
ALGORITHMS OF FEASIBLE DIRECTIONS FOR SOLVING CERTAIN PROBLEMS OF THE TRANSPORT TYPE* N. D. ASTAKHOV Moscow (Received
3 1 March 1975)
ALGORITHMS of feasible directions, suitable for solving certain linear programming problems of the transport type, are described. By taking the example of the allocation problem, we show that the algorithms are applicable to a wide range of basically similar transport problems.
1. The transport problem with bounded sets of production points (closed model)
We consider the problem: to find
(1) under the conditions
c
x,j= b,,
j=J,
1EI
Here and throughout,
c
Xtj=ai,
i=I,
X,,>c),
id,
j=J.
JEJ
I= {I, 2, . . . , m}, J= {I, . . . , n}.
We shall assume that the condition for solvability [l] of this problem is satisfied, i.e.,
*Zh. v_?chisl.Mat. mat. Fir., 16,5, 1146-1154,
1976.