Journal
of Statistical
Planning
and Inference
119
24 (1990) 119-133
North-Holland
GENERALIZED RANDOM WALK AND DISTRIBUTIONS SOME RANK ORDER STATISTICS Jagdish
SARAN
and Sarita
OF
RAN1
Department of Mathemafical Statistics, University of Delhi, Delhi 110007, India
Received
7 December
Recommended
1987; revised
manuscript
Abstract: This paper deals with the derivation of some rank order statistics the extended journs number
above
received
30 September
1988
by I. Vincze
Dwass technique. height r, number
of sojourns
at height
of the joint and marginal
related to the generalized The rank order statistics of crossings r from above
random
probability
distributions
walk with steps 1 and -,LJ by using
considered
of height r, number
include total length of all soof sojourns
at height r and the
(r>O).
AIMS Subject Classification: 62630. Key words andphrases: Extended Dwass technique; generalized random walk; sojourns r; sojourns at height r from above; total length of all sojourns above height r; crossings r - at lattice and non-lattice points.
at height at height
1. Introduction Mohanty and Handa (1970) have extended Dwass’ technique (1967) to the case when one sample size is an integer multiple of the other (i.e. m=yn) and derived the distributions of some rank order statistics. Later Saran and Sen (1979) and Sen and Saran (1983) derived the joint distributions of some rank order statistics related to the maxima of the generalized random walk with steps 1 and -p and the distribution of ND:,(r), the number of crossings of height r (rr 0), respectively, using the extended Dwass technique. Further Sen and Kaul (1985) have derived joint and marginal distributions of LP,n, the length of positive sojourns, NP,n, the total number of sojourns, NV:,,, the number of positive sojourns, and N&, the number of crossings of the origin. They have also derived the joint and marginal distributions of L,,,, R,,., Rln and NPTn, where R,,. and R& denote, respectively, the number of reflections and the number of positive reflections. In this paper we consider the above mentioned generalized random walk with steps 1 and -p and derive, for r> 0, the joint and marginal distributions of the following rank order statistics: (i) L,,,(r), the total length of all sojourns above height r = total number of steps above height r, 0378-3758/90/$3.50 0 1990, Elsevier Science Publishers
B.V. (North-Holland)
120
J. Saran. S. Rani / Generalized random walk
(ii) N,,.(r), the total number of sojourns at height r, (iii) NfiTn(r), the number of sojourns at height r from above, and (iv) NpTn(r), the number of crossings of height r, by using the extended Dwass technique, thus generalizing and extending of Aneja
(1975), Sen and Kaul (1985) and Sen and Saran
the work
(1983).
2. The method In order to derive the distributions of rank order statistics we shall use the extended Dwass technique given by Mohanty and Handa (1970). They considered the generalized random walk (Sj: Sj = Cf= i Wi, S, = W0 = 0} generated by a sequence { Wi} of independent random variables with common probability distribution P(Wi=+l)=p,
p being a positive integer. The main theorem used for finding quoted
from Mohanty
15i
P(Wi=-~)=q=l-p,
and Handa
the distributions
of rank
order
statistics
is
(1970):
1. Suppose Vp,n is a rank order statistic for every n and VP is the corresponding function defined on the random walk which is completely determined by W,, W,, . . . . W,and does not depend on WT+,, Wr+2, . . . . whenever T>O (where T is the time for the last return to zero in the random walk). Define
Theorem
h(p)=E(V&
(1)
a<@(~++).
Then we have the following power series (in powers of ppq) expansion
h(p) l-(p+l)PPqYh
(2)
(PP’4)“, n=O
where for y see (ii) and (iii) in Section 3 below.
3. Some auxiliary
results
The basic results needed in the sequel (1970) and Sen and Kaul (1985). (i) For any (Y and p,
5 &(a,
k=O
where Ak@98)=$j$j
a)@ = xa
are quoted
from
Mohanty
and Handa
(3)
121
J. Saran, S. Rani / Generalized random walk
the last inequality assuring the convergence (ii) The probability generating function
of the series. (PGF) for the first return
to the origin
is F(t) = (,Lf+ l)ppqxV+’
(4)
where PV4f
/I+1 _ x-1 -x”+1
(iii) The probability
and
(t(~‘+‘p~q<~~/(LL+l)~+l.
of never returning
6 = l-F(l)
to the origin
is
= 1-(p+l)pPqyP
(5)
where y is the value of x when t = 1. (iv) The PGF for the time to reach k is G(t,k) = (px#,
(v) The probability
k=
1,2 ,....
of ever reaching
G(l,k)=(~y)~,
k=
(6) k is
1,2 ,....
(7)
(vi) The contribution a to the PGF of Lp by a positive return at a lattice point or a positive sojourn which is such that Sk, = 0, Sj> 0 and Sk, = 0 for all k,
(8)
(vii) The contribution /3 to the PGF of Lp by a negative return which is always at a lattice point or a negative sojourn such that in the path segment defining it Sk, = 0, Sj< 0 and Sk, = 0 for all kl < j < k, is given by P = PV’4YP’.
(9)
(viii) The contribution to the PGF of Lp by the pair of positive and negative journs involved in a crossing at the lattice or non-lattice point is given by
so-
where y=l+-
t p-1
xt
aj=i
Y
j
cC-F
(ix) The segment of the path {L$} between the two consecutive indices j for which Sj_, = -1, Sj=O and Sj+l = +l will be called a section. Path segments between the origin and the first such index j where St = +l and also between the last such indexj and the last return to the origin from the negative side will also be con-
122
J. Saran, S. Rani / Generalized random walk
sidered to be sections. Let the i-th section in the path {A”] comprise mi positive sojourns (returns) preceding a pair of positive and negative sojourns (not necessarily returns) involved in a crossing followed by ni negative sojourns (returns). Then the contribution to the PGF of 15~ by the i-th section is
(11) (x) The following
power
series expansions
are also used in the sequel: (12)
where (z) is the smallest
integer
greater
than
or equal to z. Further
(5)(i)(‘c:-‘)(~,~)j(*~,~)j+i+~(~-l), j=O
whence
k=O
(13)
i=O
when t = 1, x= y and a =p and we have
IfIt=, = wPYY~o
(-l)'(;)(PY)'-'
= (l@YY(PY - 1Y.
4. Joint distribution
of L,(r),
(14)
Np(r), Np+(r) and N,*(r), r > 0
Let E(tL”(“; N,(r) = a, N,+(r) = b, N,*(r) = 2c) denote the PGF of the length L,(r) of all sojourns above height r when the number N,(r) of total sojourns at height r is a, the number Np+(r) of positive sojourns at height r from above is b and the number N,*(r) of crossings of height r is 2c in the generalized random walk {A”, 0 ~j< m}. Then for r>O, E(t’fl@); N,(r) = a, NV+(r)= b, N,*(r) = 2c) = ; P&(r) = (py)’
= g, N,(r) = a, N;(r)
= 6, N:(r)
= 2c)tg
Aabpa-b-‘yc-lq(l-(py)~‘)+Babpa~b-IyCq(l-(py)~) +L4&‘pa-b-‘yC-‘qt
M-1 ,;, (pxt)S(l-
(py)fl”-“)
1
(1%
where
To establish (15), there arise the following four mutually exclusive cases: (i) the first and the last points of contact of height rare return points (Figure I), (ii) the first point of contact of height r is a crossing and the last point of contact of height r is a return point (Figure 2),
123
J. Saran, S. Rani / Generalized random walk
(iii) the first point of contact contact of height r is a crossing
of height r is a return (Figures 3 and 4),
point
and the last point
of
(iv) the first and the last points of contact of height Y are crossings. If there is a crossing at the last point of contact of height r (as in (iii) and (iv) above) then again there are two cases: (a) when the crossing at the last point of contact takes place at a lattice point, (b) when the crossing at the last point of contact takes place at a non-lattice point. Let M and D be the first and last return points of height r, respectively. Then the path {S,} comprises three segments, viz. from 0 to M, M to D and D to 03. Of these, the first segment from 0 to M occurs with probability (py)‘, by (7). The probability of the last segment from D to 03 equals q(l - (p~)~) provided the last point of contact of height r is a lattice point.
\
Fig. 1.
In case (i), taking M as a new origin, the path segment NU = a - 1, Nti’ = b, Nfl*= 2c (see Figure 1) and the contribution MD to the requisite PGF is
MD entails Si =-p, of the path segment
by (13b) of Sen and Kaul (1985). Thus in case (i), the PGF of L,(r),
when N,(r)
= a,
N,i(r) = b, N,*(r) = 2c is given
by
Similarly, in case (ii), on transferring the origin to M, the path segment MD entails S, = +l, Nfi =a- 1, N,’ =6, N,*=2c1 (see Figure 2). Then on using (14) of
-
\
v Fig. 2.
J. Saran, S. Rani / Generalized random walk
124
Sen and Kaul (1985), the PGF of L,(r) when N,(r) = a, Np+(r) = b, N:(r) = 2c, in case (ii), is given by (17) In case (iii)(a), the last point of contact of height r is a crossing occurring at a lattice point (see Figure 3). Arguing in a similar manner as in case (i), the PGF of L,(r) when N,(r) = a, NU+(r)= b, N,*(r) = 2c, in case (iii)(a), is given by
(18)
\
Fig. 3.
In case (iii)(b), there is a crossing at the last point of contact of height r occurring at a non-lattice point (see Figure 4). Taking M as a new origin, the path segment MD has S, = -p, Nti = a - 2, N,’ = b - 1, N;” = 2c- 1. Again, the path segment MD comprises a sub-segment MA, which contains, say n > 0 negative returns, c - 1 sections A,A2, A,A3, . . . , A,_,A, and a sub-segment A,D which contains, say m 20 positive returns. Excluding c - 1 pairs of positive and negative sojourns which may or may not be returns involved in c- 1 downward crossings (one appearing in each of the c - 1 sections), there are b - 1 - c + 1 = b - c positive and II - b - c negative sojourns which are returns also. The number of arrangements of b-c positive soSimilarly, the number journs into c sub-segments A,A,,A2A3, . . . , A,D is (:I:). of arrangements of a - b - c negative sojourns into c sub-segments MA,, A, AZ,. . . , A,_,A, with the sub-segment MA, comprising at least one sojourn is (“J!;‘). Therefore, the total number of arrangements of b - c positive and a - b - c negative sojourns is (~I{)(“,“~“). Let the i-th section contain mi positive sojourns preceding
I
‘5
Fig. 4.
.I. Saran, S. Ram’ / Generalized random walk
125
the pair of positive and negative sojourns constituting a crossing followed by nj negative sojourns. The path segment from D to 00 (Figure 4) is such that it starts from height r with a positive step, then crosses height r only once at a non-lattice point and thereafter it does not reach height r. Suppose it crosses height r at a non-lattice point from the point P such that the ordinate of the point P is s (O
by using a similar argument as used by Sen and Saran (1983, Lemma 3). Thus in case (iii)(b), the PGF of L,(r) when N,(r) = a, $(r) = b, N,*(r) = 2c is given by
using
(8), (9) and (ll),
and this equals
b-rpa-b~‘yC-rqt~~r (PYY(~I:)(a;~;2)a
P-1
(JJxf)s(l-(py)@-s)
since Cfl: mifm = b-c and J$cr: ni+n=a-b-c. Likewise, it can be shown that the PGF of L,(r) when N,(r) =a, NT(r) = 2c in cases (iv)(a) and (iv)(b), respectively, are given by
(19)
N,f(r) = b,
and P-1
b-‘pa-b-lyc-lqt s;l (pxt)S(l(PYY( ;I:)( “r”,‘)cf The result
(15) follows
on adding
(p_JJ)~‘-“).
(21)
(16) to (21).
5. Deductions (i) Summing I$&(‘);
(1.5) over a, we get Np+(r) = 6, Np*(r) = 2c)
+ab-lpc-lycyc-l
V-1 El
mwq~(l-
(PY)p-“)
. (22)
J. Saran, S. Rani / Generalized random walk
126
(ii) Summing
(22) over b, we get
E(&@); =
N,*(r) = 2c)
(PY)’
t
@x)cPc-lYcrc-lq(l - (PYYW+-Pvv> M-1 +,c-l,cpc-lycvc-1
(iii) Summing
El
wosq~u
- h?wS)
I
(23)
1
(15) over c, we get
E(W); N,(r) = a, Np+(r) = b) abD”pb-lq(l-(PY)“)
(24) (iv) Summing E(t”“‘;
(24) over a, we get abyq(l
N,+(r) = b) = (py)’
+j3yy)b(l
- (PY)~)
i
P-1 SC] wf)v
+CPYU +PYy)b-kt (v) Summing E(W);
N,(r)
- (~~1fi-9
1
.
(25)
(24) over 6, we get = a)
.bpOpb-‘yCq(l
- (pyp) v-1
c
.
s=l (26)
(vi) Summing
E(t Lq
(25) over b, we get
= (py)‘(l-a-&J’)-’
i\
q(l-(py)“) +
where y’= 1+ t c j=l
(xt/y)j.
w
P-1
c (PxtY(l-
s=l
(PYY")
I
(27)
J. Saran,
When t=l, a=/$ x=Y reduce, respectively, to (vii)
P(N,(r)
and
S. Rani / Generalized
random
walk
127
y’=yy,
the results
(15) and (22) to (26)
say, then
= a, NO+(r) = 6, N,*(r) = 2c)
+(“91)
P
+
YF4(1-
pII-2Y;-1q
(PY)P)
P-1 c (PYRl - (PYF”)
* (28) 1
s=l
(viii)
P(NP+(r) = 6, N,*(r) = 2c) b+cP lYlcPIYC(l +PwM1-
h?w~
P-2
+P b+c-2Y;-1Ycq El (PY>“U-(PYY”> (ix)
P(N,*(r)
= 2c) O-2 EO (PY)P-s-P(PY)P+l
= (PY)‘(PY-l)c-lPcm’Yc+lq [ (equivalent
60
(xi>
to (26) in Sen and Saran P&(r)
I
.
(29)
1
(30)
(1983)).
= a, Np+(r) = b)
Wfl+W= 6) = (MPbrq (1+PYIY)~(~ - (PYY') P-1
+(l+DYIY)b-l (xii)
P&(r)
1
sgl (PY)W (PYY”) .
(32)
= a)
When p= 1, a=+[1-(1-4pqt2)1’2],
/?=p and y=y’=l,
then the results
(1.5) and
J. Saran, S. Rani / Generalized random walk
128
(22) to (33) reduce,
respectively,
to the corresponding
results
of Aneja
(1975,
Ch. II).
6. Probability
distributions
The following distributions can be derived by using Theorem PGF’s coefficients of tg(ppq)“((p+ lb ) from the corresponding previous section and usingnresults (12), (13) and (14).
(cl+l>n
(
>p(LP,n(r)= g>N,,.(r) = q&((r) = (EJjo f, ,jo ~=~,~)(-l)*(i)(‘+i-l)~~(~~+l) a,
n
=
1 and collecting derived in the
b, A&(r)
= 2c)
.[(“;“r’)(c;1)+(“-y’>(5)] (p+l)m-r-l
*I(
A,,(r-H+~(a-j-l),~+l)
m-l
>
(~+l)m-r
-
A,#-H+M-j),fi+l) >
(
1
+(:::),;,r’)x
1,
iO ;;:(-l)x
. (‘l’)(~)~+i-l)a~,(H+s-~,~+~~ (~+l)rq-r-s-1 ii *L m1=((r+Wp) (
ml-l
A,,(r-H+~(a-j-l),~+l) >
(p+l)m,-r-p-1
-
E m2=((r+Wll)
mz-1
(
A,,(r-s-H+~(a-j),~+l) >
1 ,
(34)
where H=j+i-k+p(k+b-j), H+b+(~+l)~=H+b+(~+1)~2,+s-~=g, l+v,+a-j-l+m=A+v*+a-j+m=A,+v,+m,+a-j-2 = A,+v,+m2+a-j-2
(P + lb
(
> =(91;>i n
W,,
Ar)
= g, N,In(r)
= 6, Iv&(~)
= n.
= 24
..
i E .i o*(/k)(/+:_I)A*(H,p+l)
j=O k=O i=O
m=(r/p)
J. Saran, S. Rani / Generalized random walk
4
(,u+l)m-r-l
129
C-l
m-l
A&-H+c+p(b+c-j-l),p+l)
j
I(
>
+0 C
A&-H+c+l+,n(b-j+c),p+l)
j
_
(jY+l)m-r
(
c-l
m
A,,(r-H+c+p(b-j+c),p+l)
j
>I( +
I
>
0
‘: A,,@-H+c+l+p((b--j+c+l),p+l) 1
c-l ~cr’i~~~~+~-ll~*,~~+~-~,~+lI + jFo
,go
;go
1::
(-lJk
(p+l)m,-r-s-l f
-L
ml=(v+WP) _
A,,(r+c-H+p((b-j+c-l),p+l)
m,-1
(
ii m2=((r+LWjl) (
1
(p+l)m,-r-p-l ,
A,,(r+c-H-s+,u(b-j+c),p+l)
m,-1
>
(35)
I
where H+b+(/f+l)A=H+b+(p+l)A,+s-p=g, L+v,+b+c-j+m-1
=A+v,+b+c-j+m=A+v,+b-j+c+m =A+v,+b-j+c+l+m =b+c-j-2+A,+v,+m, =I+v,+b+c-j-2+m,=n.
= f: i f j=O k=O i=O
i m=(r/b)
(_l)k
-I (~)(i+:I)A;(l,~-ti)
(,u+l)m,-r-s-l
(p+l)m,-r-p-l (36)
130
J. Saran, S. Rani / Generalized random walk
where
c+j+i-k+p(k+c-j),
I=
I+(p+l)i
J= c+r-j-i+k+p(c-l-k),
=I+s-,u+(,B+l)A,
=g,
A,+v,+2c-j-1+m=A+vz+2c-j+m=A+vg+2c-jfm =A+v,+2c-j+l+m=A,+v,+2c-j-2++ =A,+v,+2c-j-2+m,=n. P(-$, n(r) = g, &Ft, (r) = b)
(p+l)m,-r-s-l
(p+l)m,-r-p-l 4,(M+p
-.%P + 1)
where
r+I+l-j-i+k+p(I-k) H+b+(p+l)A
=M,
=H+s+l+(p+l)A,
=g,
A+v,+b-j+I+m=A+v2+b-j+l+l+m=A,+vv,+b-j+l+m, = ~i,+v3+b-j+I+m2
(P + lb
(
n
=
= n.
>
PC&,n (4 = 8)
E i ; h E (-l)k u=O I=0
j=O
k=O
;=O
i
m=(r/fl>
’ (J(;>c)(“:-‘> (p+l)m-r-l
K
m-l
>
4,(MP
+ 1)
(37)
J. Saran, S. Rani / Generalized random walk
131
-((~+~-r)A”*(M+lr,a+l)] p-1
c
+
(p+l)m,-r-s-l
A,,(k+s+pu,p+l)
S=l
I
Ii _
m,-1
(
ml = ((r+s)/p)
>
A”,(MP+
1)
(p+l)m,-r-p-l
(
ii mz=((r+JtV,u)
.A”,w+P--s,P++l)
m,-1
)
II
(38)
7
where k=j+i-k+p(t!-j+k), k-tu+(p+l)A
M=r+/+l-j-i+k+p((I-k), =k+u+s+2+pu+(p+l)A,
=g,
A+v,+v+I+m-j=A+v,+v-j+I+l+m=A,+v,+v+l-j+f+m, =A,+v,+v+I+l-j+m,=n.
(P + lb
(
>
Wy,, n (4 = a, y,(r)
= 6, N&(r)
= 2c)
_;:‘I:l(““r’~~~~(-l)~~c~l~~~~l~;
. [,n=i,pl
[ (
(p+l)m-r-l m_l (p+l)m-r
_
m
( P-1 +c s=l
L m, _
)4(r++~~~-c).~+1)
E = ((r+s)/fl)
)
A>.,@-j+&-c+l),p+l)
I
(p+l)m,-r-s-1 C
m,-1
(,n+l)m,-r-p-l E m,-1 W=((r+Pc)//c) (
A&-j+s+,u(a-c-l),p+l) )
)
Ai@-j+&-c),p+
1) 11
(p+l)m-r-1
-K _
m-l
(p+l)m-r m
>
Aiz(r-j+P(@-c-
A2(r-j+,da-c),p+l)
lX,L+ 1)
I
,
where A+a-c+m=A,+a-c+l+m=A,+a-c-l+m =A,+a-c-l+m,
=A+a-c-l+m,=n.
(39)
132
J. Saran, S. Rani / Generalized random watk
. L1;;mJ,p) ((~+l)m-r-rl)Ai(c-j+r+s+B(bl),ptl) m-l m -Lc
c mI=(v+P)/P) (
(p+l)m,-r-p-l >
m,-1
where A+&l+m=At+mt+b-1
A,,(c+r-j+l+pb,/J+1)
1 )
(40)
=n.
(p+l)n n
(
>
P($,(r)
c-l = i;O (-1)’
= 2c)
C;l
$-l-i
( -
L
1:;
> ((iu~1)~-~+s)A,~x_,((~+2)c+r-i-s~+l)
J_
-,n,,~~:,,, (equivalent
.
i
5;
(‘P+1’:-r-1)n,~Ir~((~+2)c+r-i+l.p+l)]
to (25) in Sen and Saran
m_,zsj,p,
(p+l)mt-r-l A,_,-,,(r+f+2_j+~ub,~+l)
m,-1
mr =cCr.//l)K
>
(y+l)m,-r
_ ( a,
(1983)).
((e+l)~I:s-l~A,,_h~~(~+~+~+l-~+~~,~+l)
cc +Ll
(41)
ml
A.~,,_,(r+[+2-j+~((b+l),~+l) >
(p+l)m,-r-p-1
1
. (42) 3 Probability distributions corresponding to (24), (26), (31) and (33) can be obtained from (34) and (39) by summing over appropriate variables. -fi
c
m2=
((r+pO/p)
m,-1
A,_,_,,(r+p+l+l-j+@,p+l)
>
J. Saran,
S. Rani / Generalized
random
walk
133
References Aneja,
K.G. (1975). Random
Statistics, Dwass,
walk and rank order statistics.
of Delhi,
M. (1967). Simple random
Mohanty, Studia Saran,
University
S.G. and B.R. Sci. Math.
Handa
Hungar.
Delhi 110007,
walk and rank order (1970).
of Mathematical
order
statistics.
statistics
Ann.
related
Math.
Statist.
to a generalised
38, 1042-1053. random
walk.
of rank order statistics
related
to a generalized
random
walk.
31, 121-135.
Sen, K. and J. Saran Plann.
Thesis, Department
5, 267-276.
J. and K. Sen (1979). Distributions
Metron
Rank
Ph.D.
India.
Inference
(1983). On the distribution
Sen, K. and C.L. Kaul(1985). J. Statist.
Plann.
of crossings
in a generalized
random
walk. J. Statist.
7, 371-385. Inference
Distributions 12, 177-191.
of rank order statistics
related to a generalised
random
walk.