Counterexamples to the conjecture on stationary probability vectors of the second-order Markov chains

Linear Algebra and its Applications 507 (2016) 153–157 Contents lists available at ScienceDirect Linear Algebra and its Applications www.elsevier.co...

Download PDF

239KB Sizes 0 Downloads 54 Views

Report

PDF Reader
Full Text

Linear Algebra and its Applications 507 (2016) 153–157

Contents lists available at ScienceDirect

Linear Algebra and its Applications www.elsevier.com/locate/laa

Counterexamples to the conjecture on stationary probability vectors of the second-order Markov chains Mansoor Saburov ∗ , Nur Atikah Yusof Faculty of Science, International Islamic University Malaysia, P.O. Box, 25200, Kuantan, Pahang, Malaysia

a r t i c l e

i n f o

Article history: Received 28 November 2015 Accepted 6 June 2016 Available online xxxx Submitted by P. Semrl MSC: 15A69 15A18

a b s t r a c t It was conjectured in the paper “Stationary probability vectors of higher-order Markov chains” (Li and Zhang, 2015 [7]) that if the set of stationary vectors of the second-order Markov chain contains k-interior points of the (k −1)-dimensional face of the simplex Ωn then every vector in the (k − 1)-dimensional face is a stationary vector. In this paper, we provide counterexamples to this conjecture. © 2016 Elsevier Inc. All rights reserved.

Keywords: Transition hypermatrix The second-order Markov chain Stationary probability vector

1. The m-order Markov chains A discrete-time m-order Markov chain is a stochastic process with a sequence of random variables {Xt , t = 0, 1, 2 . . .} , * Corresponding author. E-mail address: [email protected] (M. Saburov). http://dx.doi.org/10.1016/j.laa.2016.06.012 0024-3795/© 2016 Elsevier Inc. All rights reserved.

154

M. Saburov, N.A. Yusof / Linear Algebra and its Applications 507 (2016) 153–157

which takes on values in a discrete ﬁnite state space [n] = {1, . . . , n} for a positive integer n, such that pi,i1 ···im = Pr (Xt+1 = i|Xt = i1 , Xt−1 = i2 , . . . , X1 = it , X0 = it+1 ) = Pr (Xt+1 = i|Xt = i1 , · · · , Xt−m+1 = im ) , where i, i1 , · · · , im , · · · , it+1 ∈ [n] and n

pi,i1 ···im ≥ 0,

pi,i1 ···im = 1,

1 ≤ i, i1 , · · · , im ≤ n.

i=1

Namely, the current state of the chain depends on m past states (see [1,2]). Here n P = (pi,i1 ···im )i,i1 ,...,im =1 is called the transition hypermatrix of an m-order Markov chain {Xt , t = 0, 1, 2 . . .}. Let x(t) = (x1 (t), · · · , xn (t))T be the tth-distribution of the discrete-time m-order Markov chain {Xt , t = 0, 1, 2 . . .}. Note that P is an (m +1)-order stochastic hypermatrix governing the transition of states in the m-order Markov chain according to the following rule

xi (t + 1) =

pi,i1 ···im xi1 (t) · · · xim (t),

i = 1, · · · , n.

1≤i1 ,··· ,im ≤n

Denote by Ωn =

x = (x1 , · · · , xn )T : x1 , · · · , xn ≥ 0,

n

xi = 1

(1)

i=1

the standard simplex consisting of probability vectors in Rn . A polynomial operator P : Ωn → Ωn associated with the (m + 1)-order stochastic n hypermatrix P = (pi,i1 ···im )i,i1 ,...,im =1 (P(x))i =

pi,i1 ···im xi1 · · · xim ,

i = 1, · · · , n,

(2)

1≤i1 ,··· ,im ≤n

is called a nonlinear Markov operator (see [3]). It is clear that a set of all stationary distributions of the m-order Markov chain is nothing but a set of all ﬁxed points of the nonlinear Markov operator (2). Due to Brouwer’s ﬁxed-point theorem, the set Fix(P) = {x ∈ Ωn : P(x) = x} is nonempty. If m = 2 then P : Ωn → Ωn is a quadratic stochastic operator which has an incredible application in population genetics (see [4–6]).

M. Saburov, N.A. Yusof / Linear Algebra and its Applications 507 (2016) 153–157

155

In the paper [7], it was conjectured that if the set of stationary vectors of 2-order Markov chain contains k-interior points of the (k − 1)-dimensional face of the simplex Ωn then every vector in the (k − 1)-dimensional face is a stationary vector. In this paper, we provide counterexamples to this conjecture. 2. Counterexamples 3

Let P = (pi,jk )i,j,k=1 be a cubic stochastic hypermatrix, i.e., pi,jk ≥ 0 and p1,jk + p2,jk + p3,jk = 1 for any 1 ≤ i, j, k ≤ 3. Without loss of generality, we may assume that pi,jk = pi,kj for any 1 ≤ i, j, k ≤ 3. Otherwise, we may deﬁne a cubic stochastic 3 pi,jk + pi,kj such that psi,jk = psi,kj for hypermatrix P s = psi,jk where psi,jk := 2 i,j,k=1 any 1 ≤ i, j, k ≤ 3 and the associated quadratic stochastic operators QP and QP s are the same, i.e., QP (x) = QP s (x) for any x ∈ Ω2 . Let QP : Ω2 → Ω2 be a quadratic stochastic operator associated with P = 3 (pi,jk )i,j,k=1 ⎧ 2 2 2 ⎪ ⎪ ⎨(QP (x))1 = p1,11 x1 + p1,22 x2 + p1,33 x3 + 2p1,12 x1 x2 + 2p1,13 x1 x3 + 2p1,23 x2 x3 QP : (QP (x))2 = p2,11 x21 + p2,22 x22 + p2,33 x23 + 2p2,12 x1 x2 + 2p2,13 x1 x3 + 2p2,23 x2 x3 ⎪ ⎪ ⎩(Q (x)) = p x2 + p x2 + p x2 + 2p x x + 2p x x + 2p x x P 3,11 1 3,22 2 3,33 3 3,12 1 2 3,13 1 3 3,23 2 3 3 We say that P > 0 if pi,jk > 0 for any 1 ≤ i, j, k ≤ 3. In this case, Q : Ω2 → Ω2 is called the positive quadratic stochastic operator. Our aim is to provide an example for the positive quadratic stochastic operator Q : Ω2 → Ω2 which has exactly three ﬁxed points in the interior set intΩ2 = {x ∈ Ω2 : x1 x2 x3 > 0} of the simplex Ω2 , i.e., |Fix(Q) ∩ intΩ2 | = 3. In order to accomplish our aim, we use the following strategy: instead of ﬁnding ﬁxed points of the given positive quadratic stochastic operator, we try to ﬁnd a positive quadratic stochastic operator for which the given points are its ﬁxed points. Let A = (0.1, 0.2, 0.7)T , B = (0.4, 0.3, 0.3)T and C = (0.59, 0.31, 0.1)T be interior points of the simplex Ω2 . We deﬁne a positive quadratic stochastic operator Q0 : Ω2 → Ω2 as follows ⎧ ⎪ ⎪(Q0 (x))1 = ⎪ ⎪ ⎨(Q (x)) = 0 2 Q0 : ⎪ (Q (x)) ⎪ 0 3 = ⎪ ⎪ ⎩

232873 2 4717 2 207 7 3 1 2 319300 x1 + 10300 x2 + 63860 x3 + 5 x1 x2 + 5 x1 x3 + 50 x2 x3 27 2 1 2 3 2 470171 378421 158157 100 x1 + 2 x2 + 20 x3 + 814300 x1 x2 + 407150 x1 x3 + 814300 x2 x3 54 433 27037 2 18409 191589 2 2 79825 x1 + 10300 x2 + 31930 x3 + 814300 x1 x2 + 407150 x1 x3 + 1454157 814300 x2 x3

A straightforward calculation shows that A, B, C are ﬁxed points of the positive quadratic stochastic operator Q0 : Ω2 → Ω2 . It is clear that if each point of the set intΩ2 is a ﬁxed point of Q0 : Ω2 → Ω2 then due to continuity of Q0 and intΩ2 = Ω2 , each point of the simplex Ω2 must be a ﬁxed point.

156

M. Saburov, N.A. Yusof / Linear Algebra and its Applications 507 (2016) 153–157

This is impossible because of Q0 (∂Ω2 ) ⊂ intΩ2 where ∂Ω2 = {x ∈ Ω2 : x1 x2 x3 = 0} is the boundary of the simplex Ω2 . We can deﬁne another positive quadratic stochastic operator Q1 : Ω2 → Ω2 as follows

Q1 :

⎧ ⎪ (Q1 (x))1 = ⎪ ⎪ ⎪ ⎪ ⎪ ⎪ ⎨(Q1 (x))2 = ⎪ ⎪ ⎪ ⎪ (Q1 (x))3 = ⎪ ⎪ ⎪ ⎩

17322871 2 990257 2 1559 13 16 2 22351000 x1 + 2163000 x2 + 13410600 x3 + 10 x1 x2 + 25 x1 x3 224 2 488 2 125 2 703327 19461451 1000 x1 + 1000 x2 + 1000 x3 + 1017875 x1 x2 + 24429000 x1 x3 8271787 + 24429000 x2 x3 4301 117199 2 2 4470200 x1 + 2163000 x2 + 1601951 977160 x2 x3

+

2933179 2 3352650 x3

+

18371 2035750 x1 x2

+

+

11 500 x2 x3

13761989 24429000 x1 x3

A straightforward calculation shows that A, B, C are also ﬁxed points of the positive quadratic stochastic operator Q1 : Ω2 → Ω2 . Now, we can deﬁne a family of positive quadratic stochastic operators Qε : Ω2 → Ω2 as Qε (x) = (1 − ε)Q0 (x) + εQ1 (x) for any x ∈ Ω2 and 0 ≤ ε ≤ 1. It is clear that A, B, C are also ﬁxed points of the family of positive quadratic stochastic operators Qε : Ω2 → Ω2 . It is worth mentioning that an example for positive quadratic stochastic operator having three ﬁxed points inside the simplex was also given by Yu. Lyubich in his book [4]. However, Lyubich’s example is wrong. Actually the positive quadratic stochastic operator in Lyubich’s example has a unique ﬁxed point. This issue was addressed in the paper [8]. Here, we provide a simple proof of the following result which was proven in the general setting in [4]. Proposition 1. Any positive quadratic stochastic operator Q : Ω2 → Ω2 has either unique or three ﬁxed points in the simplex, i.e., |Fix(Q) ∩ Ω2 | = 1 or 3. Proof. We shall ﬁrst prove that |Fix(Q) ∩ Ω2 | ≤ 4. Since Q : Ω2 → Ω2 is a positive quadratic stochastic operator, we have that Q(∂Ω2 ) ⊂ intΩ2 . Therefore, we get that Fix(Q) intΩ2 . Since x3 = 1 − x1 − x2 , it is clear that Fix(Q) is deﬁned by an intersection of two second order curves f1 (x1 , x2 ) = 0 and f2 (x1 , x2 ) = 0 inside the triangle deﬁned by 0 ≤ x1 + x2 ≤ 1 and 0 ≤ x1 , x2 ≤ 1. Due to Bezout’s theorem, the set Fix(Q) is either ﬁnite (with |Fix(Q)| ≤ 4) or inﬁnite. Suppose the set Fix(Q) is inﬁnite. Then an intersection of two second order curves must be a compact inﬁnite set, i.e., an ellipse Ef1 ,f2 . Let e1 = (1, 0, 0)T and c be the center of the ellipse Ef1 ,f2 . Let us consider a straight line L connecting two points e1 and c. It is clear that |L ∩ Ef1 ,f2 | = 2 and L ∩ Ef1 ,f2 ⊂ Fix(Q). Let L ∩ ∂Ω2 = {e1 , q}, where q = (0, q2 , q3 )T with q2 + q3 = 1 and q2 , q3 > 0. Since the straight line L contains two ﬁxed points, the following quadratic equation with respect to t (Q((1 − t)e1 + tq))1 = ((1 − t)e1 + tq)1 must have two roots in the segment [0, 1]. It is clear that

(3)

M. Saburov, N.A. Yusof / Linear Algebra and its Applications 507 (2016) 153–157

157

Q((1 − t)e1 + tq) = (1 − t)2 Q(e1 ) + 2t(1 − t)B(e1 , q) + t2 Q(q) where B : Ω2 × Ω2 → Ω2 is a symmetric bilinear operator associated with Q. Let a = (Q(e1 ))1 , b = (B(e1 , q))1 , and c = (Q(q))1 such that 0 < a, b, c < 1. It follows from (3) that the following quadratic function g(t) = (1 − t)2 a + 2t(1 − t)b + t2 c + t − 1 must have two roots in the segment [0, 1]. Therefore, g(0)g(1) > 0. This contradicts to the fact g(0)g(1) = (a − 1)c < 0. Consequently, Fix(Q) is a ﬁnite set such that |Fix(Q)| ≤ 4. Let us consider the mapping F : Ω2 → R3 , F(x) = Q(x) − x (it is called a vector ﬁeld, see [9]). Let Θ = (0, 0, 0) and F−1 (Θ) = {x : Ω2 : F(x) = Θ} be zeros (critical or singular points) of the vector ﬁeld F. Since Q(∂Ω2 ) ⊂ intΩ2 , the vector ﬁeld F(x) = Q(x) − x on the boundary ∂Ω2 of the simplex Ω2 is directed its inside. Hence, the index of the vector ﬁeld F over the boundary ∂Ω2 of the simplex Ω2 is 1. On the other hand, the index of the critical point of the vector ﬁeld F is either 1 or −1. Due to Hopf’s index theorem (see [9]), the index of the vector ﬁeld F on the boundary ∂Ω2 is equal to the sum of indices of the critical points of the vector ﬁeld F inside the simplex Ω2 . Consequently, the total number of the critical points of the vector ﬁeld F must be odd. This means that |Fix(Q)| is odd, i.e., |Fix(Q)| = 1 or 3. This completes the proof. 2 Acknowledgements This work has been partially supported by the MOHE grant FRGS14-141-0382. The ﬁrst author (M.S.) also thanks to the Junior Associate Scheme, Abdus Salam International Centre for Theoretical Physics (ICTP), Trieste, Italy, where this paper was written, for the invitation and hospitality. The authors are also greatly indebted to the anonymous reviewer for several useful comments which improved the presentation of the paper. References [1] A. Raftery, A model of high-order Markov chains, J. R. Statist. Soc. 47 (1985) 528–539. [2] A. Berchtold, A. Raftery, The mixture transition distribution model for high-order Markov chains and non-Gaussian time series, Statist. Sci. 7 (2002) 328–356. [3] V.N. Kolokoltsov, Nonlinear Markov Processes and Kinetic Equations, Cambridge University Press, New York, 2010. [4] Y. Lyubich, Mathematical Structures in Population Genetics, Springer-Verlag Berlin Heidelberg, Berlin, 1992. [5] R. Ganikhodzhaev, F. Mukhamedov, U. Rozikov, Quadratic stochastic operators and processes: results and open problems, Inﬁn. Dimens. Anal. Quantum Probab. Relat. Top. 14 (02) (2011) 279–335. [6] F. Mukhamedov, N. Ganikhodjaev, Quantum Quadratic Operators and Processes, Springer International Publishing, Switzerland, 2015. [7] C.-K. Li, S. Zhang, Stationary probability vectors of higher-order Markov chains, Linear Algebra Appl. 473 (2015) 114–125. [8] M. Saburov, N.A. Yusof, On quadratic stochastic operators having three ﬁxed points, J. Phys. Conf. Ser. 697 (2016) 012012. [9] E. Outerelo, J.M. Ruiz, Mapping Degree Theory, American Mathematical Society, Providence, RI, 2009.

Counterexamples to the conjecture on stationary probability vectors of the second-order Markov chains

Counterexamples to the conjecture on stationary probability vectors of the second-order Markov chains

Recommend Documents