Boltzmann-like kinetic models and Boltzmann maps

1MB Sizes 1 Downloads 117 Views

Report

PDF Reader
Full Text

Vol. 25, No. 2, pp. 53-67, 1997 Copyright@1997 Elsevier Science Ltd Printed in Great Britain. All rights reserved 0895-7177/97 $17.00 + 0.00

Mathl. Comput.

Modelling

PII:SO8957177(97)00006-X

Boltzmann-Like Kinetic Models and Boltzmann Maps L.

RONDONI

Dipartimento di Matematica, Politecnico di Torino Corso Duca degli Abruzzi, 24, 10129 Torino, Italy rondoniQpolito.it (Received May 1996; accepted

July

1996)

Abstract-This

work shows how Boltzmann maps can be used in the study of a class of kinetic models formally similar to those recently proposed in population dynamics. Boltzmann maps are nonlinear, discrete, doubly-stochastic processes which have been introduced in statistical physics for the description of nonequilibrium phenomena. The key feature of Boltzmann maps is that they increase the entropy, when applied to nonstationary states, if their interaction operators have a spectral gap. This fact can be used to analyse the asymptotic behaviour of the relevant dynamics. In particular, in some circumstances we can prove that stationary states exist, are unique, and stable. Moreover, we can supplement these results with theorems on the global convergence of the dynamics to such stationary states. Keywords-Nonlinear

stochastic

processes, Entropy,

Stability.

1. INTRODUCTION Recently,

a model equation

for the evolution

of dominance

in populations

with anonymous

or-

In that work, dominance is assumed to be a scalar quantity, taking values in a finite interval, by which individuals of the same population can be discriminated. Technical generalizations and qualitative analysis of the initial value problem have been considered in [2,3]. In particular, [2] generalizes the original model to systems of equations, and

ganisms

has been proposed

in [l].

provides existence and uniqueness results for the initial value problem, under quite general assumptions on the initial data. The paper [3], on the other hand, deals with models with multiple collisions, and with the existence of equilibrium solutions, while leaving open the problem of their stability. Further developments of the models proposed in [2,3] make possible the modelling of cellular interactions in biological systems, as reported in [4], but we do not deal with such systems in this work. A common feature of the relevant equations is that their mathematical structure is reminiscent of classical kinetic theory, especially of the structure of the Boltzmann equation. In particular, the model of [l] was obtained under the assumption that interactions only involve pairs of individuals (binary interactions), thus producing quadratic nonlinearities in the interaction terms. In this paper, we consider a class of models, obtained from those mentioned above, with some restriction on the functions describing the interactions. Moreover, we introduce suitable approximations for such models, which can be viewed as Boltzmann maps [5,6]. The models of our interest may be derived under the following assumptions. This work has been supported in part by GNFM-CNR

(Italy),

and by the Australian

Ftesearch Council

Typ-t 53

by AA&-W

54

L.

F~OND~NI

ASSUMPTION 1.1. The system under investigation is constituted by the interacting individuals of p E N populations. The physical state (dominance) of the individuals within each population is described by a variable u E [-l,l].

ASSUMPTION 1.2. The probability distribution over the sample space [-1, l] at each instant of time t E [0, T], and for each population, is defined by the probability density fi :

[W] x

[-1, l] + ]w+,

i = 1,. . . ,p.

(1.1)

Then,

represents the probability of finding an individual of the ith population with state in [ul, uz] at time t.

ASSUMPTION 1.3. The number of interactions per unit time (encounter rate) involving one individual from population i (called individual i), with state v, and one individual k, with state 20, isgiven byqik(v,w), where~~~maps[-1,1]2into+IR+U{O}foralli,k=1,...,p. we mean the Cartesian product of I copies of [-1, 11.

Here, by [-l,l]“,

ASSUMPTION 1.4. The probability density for an individual i with state v, to change it to u in an encounter with an individual k with state w is denoted by $ik(v, w, u), where ?+!&maps [-1, 113 into R+ U (0) for all i, k = 1,. . . ,p.

ASSUMPTION 1.5. The evolution of the distribution of states is determined exclusively by the effects of the interactions. These only involve pairs of individuals (binary interactions). the total derivative of fi does not depend on u.

Moreover,

Using these hypotheses, a balance of losses and gains of individuals yields

(1.3) -,,t&J

k=l

%k(U,

[-1,11

w)fk(t,

W> dw,

i=l,...,p,

where the conditions

1

tiik(V,

i--1,11

w, u> du = 1,

forallv,wE[-l,l]

andall

i,k=l,...,

p

(1.4)

must be satisfied, so that (formally) the following holds:

J

I 1 1l fi(t,U)du - ,

= 1,

foralli=l,...,p.

Models of this form are called conservative.

REMARK 1.1. The gain term-i.e., the first term on the right-hand side of equation (1.3)implements the idea that one individual i may only result from interactions where at least one individual i is involved. If we relax the condition expressed by Remark 1.1, the relevant equations should be correspondingly modified. For instance, interactions could be allowed to also change the species of

Boltzmann-Like KineticModels individuals,

55

i.e., the populations to which they belong, similar to what happens in chemical re-

actions. In that case, equation (1.3) must be replaced by a different set of equations, propose the following:

and we

(1.5)

where &&(t&W, u) du = 1,

for all a, b = 1, . . . ,p

and all

w,w E [-l,l].

(1.6)

Here, the functions q,b have same meaning as for the models of equation (1.3), while $Ab(v, w, U) represents the probability density for an individual a with state TJ, and an individual b with state w, to produce one individual i with state u. We refer to the models of form equation (1.5), and satisfying equation (1.6) as quasi-conservative. Then, a quasi-conservative system is conservative if $& = &a?+!&,with 6ij the Kronecker symbol, for all i, a, b = 1, . . . ,p. To use this formalism in the framework of chemical kinetics, the language of population dynamics must be interpreted accordingly. In this case, individuals represent single molecules, populations identify chemical species, while the state of one individual may refer to difierent properties of the molecules. For instance, the state of a molecule could represent its degree of ionization, or its excitation level, and so on, according to the situation at hand. Section 2 of this paper describes the class of models which constitute the starting point of our investigation, and introduces for them suitable discrete approximations. In given circumstances, such approximations constitute examples of Boltzmann Maps (BM), which are introduced in Section 3. Section 4 contains our original results, which are proved in Section 5. In particular, we obtain existence, uniqueness, and stability of the stationary solutions both for the discrete, and for the continuous (with respect to time) models of our interest. Furthermore, we prove that such stationary states are global attractors for the relevant discrete dynamical systems, so that all initial probability distributions converge to them, in the long time limit. In order to get these results, Theorem

4.5 is proved first, which is also relevant to the theory of Boltzmann

maps.

Section 6 concludes the paper with some examples.

2. A CLASS

OF MODELS

AND

THEIR

APPROXIMATIONS

In this section, we consider a special class of models of the form given above, identified by certain conditions on the functions q and ?(I. We then introduce suitable approximations for them, whose qualitative analysis is reported in Section 4, and we observe that part of the relevant results holds for the time-continuous models as well. Note that our approximations are not very efficient from a computational point of view; their merit is that they can be thoroughly analyzed, so that global convergence theorems, based on the existence of strict Lyapunov functionals, can be obtained for them. Similar functionals are not easily (or at all) found in other systems, hence, the difficulty in more general settings to go beyond stability results. Let us introduce a finite partition {I,.}~!1 of the sample space I = [-l,l], such that 1,. = [u,., u,+l) for f = 1,. . . ,m - 1, and 1, = [um,um+l], where u, = -1 + 2(7- - 1)/m. We impose the following restriction. RESTRICTION 1. We restrict our attention to the cases where the functions rlab = q > 0 are constant for all a, b, while the functions $J:,(+, ., u) : [-1, 112 --+ lR+ U (0) are simple for every u E I: (2.1)

L.R,~ND~NI

56 and $&, E L1([-1,113) interval Ik. Furthermore,

for all i,a,b = l,... p. Here, xk is the characteristic we require that equation (1.6) be satisfied.

function

of the

The condition that $ib be simple is not a serious limitation, given the density of simple functions in L1, while the restriction implied by equation (1.6) has been adopted in the literature in various circumstances. restriction.

Therefore,

the new technical

RESTRICTION 2. For the functions

assumption

$&, obeying

we are going to make is the following

Restriction

1, we further

require

that

&,(v, w, .) dv dw = 2~.

Restrictions

1 and

consider.

Under

Equation

(2.3) suggests

it with respect

dfi x

A

dt

&$

-&2

m2 = 4

Equation associated

I xl xl k 1 I

approximations

we are going

to

(1.5) takes the form

of the original

model, formally

obtained

(2.3) by integrating

u over any Is:

?&,(v, w, u) a=1 k=l

J

whose

1, equation

k ~/&&k(t) - &z(t)

fb,l(t)

I=1

, I

for i = 1,. . . ,p, z = 1,. . . ,m.

(2.4)

dv dw du,

(2.4) constitutes a nonlinear system of ordinary differential equations (ODE) which, with initial conditions f” E Wmp, has a unique solution in a given time interval [0, T): f

Moreover,

the class of problems of Restriction

an approximation

to the variable

= 77

b=l

2 identify

the conditions

(2.2)

because

(t, f”) = {f&r (c (f&H 17

of equation

(1.6), the following

&rb(u) du = 1,

As shown in Sections

4 and 5, equation

holds:

$1 F i=l

fi,Z(t)

i=l

= const,

for all t E [0, T).

(2.5)

z=l

(2.5) implies

Ml = f:

fort E [O,T)*

that the L’-norm

of f,

2 ksl

z=l

is always 1,if f” is normalized and taken from the positive cone of lP”P. The model equation (2.4) may be further approximated, and replaced by a given mapping of IPP, by discretizing them with respect to time. Because we are interested in the qualitative analysis of the solutions of the initial value problems, it suffices to adopt Euler’s scheme, which is the simplest of all, and which can be shown to be a Boltzmann map (BM) in some instances. Assuming the normalization of f, Euler’s scheme for equation (2.4) reads

Boltzmann-Like KineticModels

57

where

and fi,= = f+ xbI , fk,r. In case p = 1, the superscripts (i, ab) may be removed, and we write

fx=

cpx,kl

fl,x, f@+‘) 5

=

fl:

fq;:;,

h,kl

=

&,::, (2.8)

t&k&)fi(n).

k,l=l

3. BOLTZMANN

MAPS

AND

NUMERICAL

SCHEMES

In this section, we introduce Boltzmann maps (BM) [5,6], and we show that they may be viewed as Euler’s schemes like equation (2.8). Therefore, given approximations of continuous models described in Section 2 happen to be BM, if some conditions are satisfied. To verify this fact, consider a population dynamics model with m species, and binary interactions. The species can be different populations, different states within a population, or both. We postulate that the dynamics of the system satisfies the following assumptions. ASSUMPTION 3.1.

Interactions only involve two individuals and preserve their number, i.e., the

encounter between the individuals i and j always results in another pair of individuals, k and 1 say The system can be represented by elements of the simplex Q c IV” of probability measures on the sample space Cl = (1, . . . , m}. One element f = { fi}& E Q represents a state of the system, and its component fi, the probability of finding one individual i in it.

ASSUMPTION 3.2.

The probability of finding individuals i is independent of the probability finding individuals j. Thus, the joint probability of finding both is the product of the two.

ASSUMPTION 3.3.

of

Assumption 3.3 is equivalent to Boltzmann’s hypothesis of molecular chaos. Let Q2 c RmP be the simplex of probability measures on R x Sz. A doubly stochastic matrix acting on Qz is an m2 x m2 nonnegative matrix, whose column sums and row

DEFINITION 3.1.

sums are 1. A doubly stochastic matrix transforms Q2 into itself, because it is stochastic [6]. The effect of interactions on the product of probability measures representing the state of the system is expressed by linear, doubly stochastic matrices.

ASSUMPTION 3.4.

DEFINITION 3.2. Given a probability measure f = {fl, . . . , fm} on fi = (1,. . . , m}, the map d : Q + Q2, defined by g(f) = f @ f, is called sampling map. The map IE : Qz -+ Q, acting expectation as lE(F) = {C3m,i Fij}g”=,, for all F = {{F&}~C”=l}~l E Q 2, is called conditional onto the first factor.

Assume the state of the system the state on Q is given by E(F).

ASSUMPTION 3.5.

{{F~~}~l}~l,

is known on Q2, and it takes the form F =

The system evolves exclusively in consequence of interactions of individuals. Such an evolution is observed at discrete times n E M.

ASSUMPTION 3.6.

With these assumptions, the evolution determined by a BM involves three steps. (I) From a probability measure for single individuals, produce a product probability. (II) Transform the product probability to account for interactions. (III) Reduce the resulting joint probability to a probability for single individuals.

L. RONDONI

58

Such steps constitute the foundations of the arguments which led Boltzmann to his famous equation, after limits to vanishing volumes and time intervals had been taken. Differently, BM evolve probability measures without resorting to limiting procedures, and they can be formally defined as follows. DEFINITION 3.3. Assume that T : &2 +

Q2 is a doubly stochastic matrix. The map r : & + Q

given by f A

f@f

Z+

for every f E &, is called a Boltzmann

T(f@f)

+% f’=Q),

(3.1)

map of order 2.

To express in explicit form the dynamics of the BM (3.1), let us say that the entries of T can be written as Tij,kl, where the indices i, j = 1, . . . , m label the rows of T, while k, 1 = 1,. . . , m label its columns. Then, we have for all i = 1,. . . ,m,

(3.2)

for the ith component of 7(f). Thus, equation (3.2) is formally identical to equation (2.8) if we assume that their coefficients obey the following relation: m 9,r,kl = c

Tij,kl,

for all i, k, 1 = 1,. . . , m.

j=l

(3.3)

Similarly, given equation (2.8), we can put them in the form of equation (3.2), by letting, for instance, % kl foralli,j,k,1=1,..., m. Tij,kl = A (3.4) m’ The basic properties of BM which we need in our study are that they preserve probability measures, and that, in given circumstances, they make the entropy of the system increase by finite amounts [6]. Given a discrete probability measure F = {FI, . . . , F,.} (e.g., for F E Qz, we have T = m2), its entropy is defined as

S(F) = -

2 Fi log Fi. i=l

(3.5)

For any function g : [0, co) 3 W, such that g(0) = 0, we define

with ai in an interval I c W, for every i. Then, the following theorem holds [7, pp. 27,281. THEOREM 3.1. Let a, b be in P, for I an interval in JR,and P the Cartesian product of m copies of I. Then a = Tb, with T a doubly stochastic matrix, if and only if C, (a) L C,(b) for all concave functions g : I -+ W.

with g(z) = 2 logz, and g is concave, the action of a doubly Because S(F) = -C,(F), stochastic operator on F does not decrease its entropy. Moreover, given f E &, we have that S(f @ f) = 2S(f), and that S(E(f @ f)) = S(f @ f)/2. Thus, the entropy of the system has not decreased if the action of the sampling map is followed immediately by the action of the conditional expectation. This is the case when f @ f is a fixed point of the matrix T, in the definition of a BM. If f @If is not fixed for T, T(f @ f) is not necessarily a product probability, and we have S(T(f@f)) 2 2S(f), b ut we cannot say that S(E(T(f @ f))) 2 S(f). For that to be the case, we need stronger properties, as in the following result [5].

Boltzmann-Like KineticModels PROPOSITION 3.2.

Consider a doubly stochastic

59

matrix T, whose elements obey

Tzj,kl = Tji,kr.

(3.6)

The associated BM of Definition 3.3 does not decrease the entropy This result can be further strengthened,

in some circumstances,

by the following [S, p. 5111.

Let T be a doubly stochastic matrix on Q2, such that equation (3.6) holds. Let T” be the adjoint of T, and assume that 1 is a simple eigenvalue of TT*, separated by a gap A > 0

LEMMA 3.3.

from the second largest eigenvalue. Then.

If F E Qg, define G E &-J through Gij = CT,_,

S(G) 2 S(F) +

Tij,klFkr.

+llF- e/l:,

where 11. 112is the L2 norm on Rm2, defined by the scalar product (F, G) = Ci,j FijGij, e = (1,. . . , 1}/m2 is the uniform distribution Therefore, S is a strict

(3.7) and

in Qz.

Lyapunov function for the cases satisfying the hypothesis of Lemmas 3.2

and 3.3.

4. RESULTS Here, we give the list of our original results. Their proofs are to be found in Section 5. Consider any approximate l-population model of form equation (2.81, on llP. be defined by equation (2.7), and &;E be defined by equation (2.4), where $J:, obeys

PROPOSITION 4.1.

Let @i$

Restrictions

1 and 2. The associated m x m2 matrix, M = { @i,kl} say, has the following properties.

(a) M is nonnegative. (b) cz1 (C) c&

@i,kl = 1. @Pi,kl= m.

Conversely, for every m x m2 matrix M satisfying conditions (a), (b), and (c), there is a function $J obeying Restrictions 1 and 2, which yields a model of form equation (2.8) on Rm, through relations

(2.4) and (2.7).

PROPOSITION 4.2. Any approximate p-population model of form equation (2.6), with m subdivisions of the interval I, is a given transformation of Iwmp which can be mapped into an equivalent l-population model of form equation (2.8), with mp subdivisions of I. The converse is also true.

It appears that an approximate,

conservative, p-population model can be viewed as a special

case of equation (2.8), with mp subdivisions of I, where all P,.,~~ vanish except (possibly) for m2p of them. The opposite is not true. Indeed, the matrix M = {@i,kl} of a conservative ppopulation model splits into a direct sum of p blocks of size m x m2p, while there are lpopulation models, with mp subdivisions of I with no vanishing coefficients. This is a consequence of the conservative condition for many populations, i.e., that production of individuals i occurs only through interactions involving other individuals i. This result does not depend on our technical assumptions, but only on the form of the approximate models equation (2.6) and equation (2.8). Consider an approximate l-population model defined as in Proposition 4.1, and denote by r such a mapping of W”. For 7 to be a BM, it is necessary and sufficient that 0 5 At I Ate = mini,r l/(n(l - pLi,ir)). PROPOSITION 4.3.

Clearly, Ate is not smaller than l/r], which is a positive quantity. Let Q c Wm be the simplex of probability measures on the sample space s1 = , m}. Consider the ODE system equation (2.4) with p = 1, where n > 0, and Qi, satisfies Restriction 1. The initial value problem equation (2.41, with initial data Ui(O>) = Vi01 E Q, *w25:z-c

PROPOSITION 4.4. {l,...

60

L. F&JND~NI

has a unique solution in Q for all times t L 0, f : [O,oo) + Q say Also, f is nonnegative and normalized, and Euler’s scheme equation (2.8) converge to it as At is made smaller and smaller. THEOREM

4.5.

Consider

(2.4) with p = 1, and assume that $i,

equation

obeys Restrictions

1

and 2. Denote by r the associated BM, equation (2.8), with time step At, E (0, Ate), and Ato defined in Proposition 4.3. T may be replaced by another BM, r’ say, which is an Euler’s scheme for the same T’ = {T&r}

ODE problem,

but has smaller or equal time step,

T&,kr = Tjli,kr = Tj+ Moreover,

and its interaction

operator

obeys

property

(4.1) is common

9

foralli,j,k,l=l,...,

to all maps obtained

m.

from r’ by reducing

(4.1) its time step.

Thus, the results of Lemma 3.3 can be used, when the conditions of Theorem 4.5 are satisfied. This is not always the case. Indeed, not all BM can be replaced by others with same dynamics and interaction operator obeying equation (4.1) (see Example 6.2). The BM we consider here, however, satisfies those conditions, their special feature being that they are derived as approximations of evolution equations. We now exploit this fact. COROLLARY 4.6. Consider an approximate l-population model of form equation (2.8), defined as in Proposition 4.1, and denote by r the relevant map of Rm. Assume that all terms @i,kr are positive, and that the dynamics of r can be obtained from a doubly stochastic matrix T say, with Tij,kl = Tji,kr for all i, j, k, 1 = 1,. . . , m. r is a BM with a unique fixed point in the simplex f = (l/m, l/m,. . . , l/m). Such a of probability measures & C IP, the uniform distribution stationary state is stable, and a global attractor in & under the dynamics of the map r. If equation (2.4) is considered independently of its connection with equation (1.3)) Restriction 1 may be relaxed to produce other models, as in the following. COROLLARY 4.7. Consider the ODE system equation (2.4) with p = 1. Assume that $J!, is a positive, integrable function $ : [-1, 113 -+ R+, such that equation (1.6), and Restriction 2 are satisfied. This model has a unique stationary state in &: the uniform distribution 1. Also, 3 is stable for the associated

dynamics.

However, Restriction 1 may still be retained because of the following results. PROPOSITION 4.8. The class of functions

of form (2.1) is dense in the set

{Ic, E Co (r-1,11”) according

: 1c,2 0)

to the uniform norm.

COROLLARY 4.9. Consider q > 0 for all a, b, and that initial value problem, with W+ u {0},for some T > 0, Let {Izc}y be the partition

a quasi-conservative model of form equation (1.3) such that nab = the functions $66 are continuous in [-1, 113. Assume the associated fi(O,u) = f!(u), h as a solution f = {fi); say, _f’i: [O,T) x [-1, l] 4 such that f(t, .) and $$ are integrable over [-1, l] for all t E [O,T). of [-1, l] introduced in Section 2, and

Fe(t) =

s I

2

f&7 u> d%

for all t E [O,T).

Take any E > 0. There is an m = m(T, E) E N such that II{F~(t)}

- {fG(t)}\I1

I &,

for allt E [O,T),

(4.2)

where {f$$} is the solution of the initial value problem equation (2.41, with fi,=(O) = F+,,(O), obtained from suitable functions $ib satisfying Restriction 1. Thus, if the functions $ib fall within the limits of Restriction 2, the corresponding ODE problem can be obtained as the limit for At 3 0 of a given BM. In this way, we have enslaved the evolution of our model, with continuous functions @ia, to the dynamics of a BM, for the time interval [0, T) .

Boltzmann-Like KineticModels

61

5. PROOFS Here we give the proofs of the results

announced

in Section

4.

PROOF OF PROPOSITION 4.1. Conditions (a) and (b) are satisfied by the coefficient matrix of equation (2.8), as seen from its definition equation (2.7), if equation (1.6) is satisfied. This is the case if Restriction 1 holds. Condition (c) holds as well. Indeed, for every u E [-1, 11,we have

s

$(v, w, u) dv dw =

IkXIl

and Restriction

2 implies

m

c

m b,kl

c

!m(u)du = m,

=

(5.2)

k&l

from which we get m

/m

c

@i,kl

=m+dt

\

(,,Pi,,,

-_!

(5.3)

.

k,l=l

It remains

to show that

given

M = {@i,kl}, for which conditions

a matrix

l-population model of form equation (2.8), satisfying Restrictions of it. To do that, choose 11= l/At (hence, @i&l = pi&l), let akZ(‘ZLi)

=

for all i,k,l

;@i.tl,

(a),(b),(c)

hold,

1 and 2 can be constructed

,m,

= 1,.

a

out

(5.4)

and define pkI(u)

=

(5.5)

&kl(ui)%(u). i=l

Now, the question

is whether

the function

ti(v,%u)

=

2

~kl(u)xk(v)xl(w)

(5.6)

k,l=l

satisfies integral

the necessary prerequisites. with respect to u yields

J:

$(v, w, u)

Indeed,

du =

2

it does. In the first place, II, is nonnegative,

xk(‘u)xl(w)

2

Et,=,

Xk(V)xr(w)

is identically

s, (-1

lla ti(v,w,u)dvdw

which is the condition

=

=

1, and property

(b) holds.

Furthermore,

eXi(U)y

@i,kl 1-1

i=l

required

(5.7)

($+,kL)

by Restriction

1,

i=l

k,l=l

because

and its

2

lea dvdw

=

we have

2,

115.8)

k&l

2. This completes

the proof.

I

PROOF OF PROPOSITION 4.2. The pairs of indices (i,z), (a, k), and (b, 1) in equation (2.6), which constitutes the rectangular array {{(h, q)}ph,l}!&, may be organized into a linear array, (1,. . . , mp}, with the substitutions (i, z) -+ r = (i - 1)m + 2, (a, k) --+ y = (u - 1)m + k, and (b, 1) -+ z = (b - l)m + 1. Then, equation (2.6) can be written as mp

w

(5.9)

L.

62

ROND~NI

where @r,yz = a:$. Thus, formally all our approximate models are equivalent to a l-population model with mp subdivisions of I. It remains to check that the corresponding matrices M = {ai,kl} have the correct properties. These are nonnegativity, the fact that columns sum up to 1, and that rows sum up to the number of subdivisions of I times the number of populations. Nonnegativity is obviously satisfied. The fact that columns sum up to 1 is a consequence of Restriction 1 and equations (2.5),(2.7), while Restrictions 1 and 2 combined with equation (2.7) ensure that the rows of M add up to mp. Hence, from Proposition 4.1, we conclude that a quasiconservative model with p populations and m subdivisions of I can be mapped into a l-population model, with mp subdivisions of I, and vice versa. I For the special case of conservative ppopulation as fjn+i)

am c

= f,‘“’ + g z=l

models, equation (5.9) can be explicitly written

,1

%r/tf$V)

[ y=(a-l)m+l

r=l,...,mp,

(5.10)

where the value of a is linked to that of T by a = [r/m]*, with r r * E m

if ?Z- E N, m

rn’

” {

integer part of

(

;

>

+l,

(5.11)

ifkeN.

PROOF OF PROPOSITION 4.3. Because of equation (2.4), and of the first identity in equation (2.5), we have for all i, 1 = 1, . . . , m,

hence,Ato=$n((1_~i,i1,9>

Zl>O.

Thus, for 0 I At I Ate, we have @i&l 2 0 and EE, @i,kl = 1. This means that the matrix T = (Tii,kl) defined by relation (3.4) is (column) stochastic. Furthermore, the same relations imply

Tij,ki = 1,

for all i,j = l,...,

m,

k,l=l

which is the row-stochasticity condition for T. Thus, T is doubly stochastic, and equation (2.8) defines a BM of order 2. However, taking At < 0 or At > Ate makes some coefficient @i&l < 0, i.e., some element of T necessarily negative. In that case, T is not doubly stochastic, and equation (2.8) is not a BM. I PROOF OF PROPOSITION4.4. The vector field of the ODE (2.4) is smooth, thus a local solution, f : [O,T) ---t Wm say, for some T > 0, exists. Moreover, such a solution can never cross the boundaries of a compact subset of Wm. Indeed, fi(t) > 0, for all i = 1,. . . , m, and CE”=,( g) = 0. Thus, existence and uniqueness of solutions extend to all times. Convergence of the Euler’s scheme is also guaranteed by the smoothness of the vector field. That is to say, that for every E > 0, every T E (0, oo), and all initial conditions in Q, there is a At’ > 0 such that Ilf(nAt) - f(n))(i < c, for all At 5 At’, and all n such that 0 5 nht 2 T. I PROOF OF THEOREM 4.5. Given a map of form (2.8), derived from equation (2.4), where $J:, satisfies Restrictions 1 and 2, we introduce a doubly stochastic matrix T(At) whose entries are defined by Tdi,kl (At) = @i&r/m. In this way, the dynamics of r can be cast into the form of equation (3.2). Then, we introduce a set of parameters {Kij,kl} through Tij,ii(At) = 1 + Kij,ijAt, Tij,kl(At) = Kii,klAtr

for all i,j = l,...,

m,

for all i, j, Ic,1 = 1,. . . , m,

such that (i, j) # (Ic, 1).

(5.12)

Boltzmann-Like

We have Kij,ij

< 0, for all i,j

The rows and columns of T(At)

Kinetic Models

63

and Kij,kl 2 0 whenever the pairs (i,j)

and (k,l)

are different.

all sum up to 1, which means that

m

c Kij,kl = c Kij,kl =

for all Ic,l = l,...,

0,

m,

&j-l

(5.13)

m

foralli,j=l,...,

0,

m.

k,l=l

If we fix the constants Kij,kl, and let At vary, we obtain a doubly stochastic matrix T(At) At 5 Ato. Furthermore, if At is small compared to 1, the diagonal terms of T(At) form Tij,ij) are of order o(l), while all the others are of order o(At/m). The contribution

to f!“+”

given by the product (fp’fi(n)),

for all

(i.e., those of

in equation (3.2), can be cumulated

with that given by (fr(nifrln)) 1and the two may be symmetrized by replacing Tij,kl and Tij,lk with
=
=

Tij,/cl+ Tij,Zk 2

.

Thus, the dynamics of equation (3.2) is identical to that of (5.14)

whose coefficients are symmetric under the exchange k H 1. We express this by saying that such is symmetric by columns. In particular, we have a matrix {
for all i = 1,. . . ,m,

Eij,ij = Jij,ji =

k(1 + (Kij,ij

-

=

=

1

2

(Kij,kl

+ Kij,ji)At)

+ Kij,lk)

At,

,

for all i # j, for all pairs (i,j)

# (k,E).

Thus, we assume that the constants Kij,kl have been fixed, and that T(At) satisfies equation (5.15). We show that T(At) can be replaced by a doubly stochastic T’(At) whose entries obey T’ij,kl = T’ji,kl (i.e., are symmetric by rows), and preserve the sums (3.3), provided that At is small enough. One way to proceed is to take one entry of T, Trs,yZ say, such that Trs,yz 5 Tij,kl for all i,j, k, I = 1,. . . , m. For At small enough, T,.S,yz can only be an off-diagonal element of T, such that (T, s) # (z, 7~). TO see this, take any At’ 5 Ato, fix the values of all terms Kij,kl, and observe whether the above statement is satisfied or not. If it is not, keep the same values Kij,kl,, and reduce At, until the largest term of form Ktj,klAt is smaller than the the smallest term of form l/2 + Kij,ijAt. This happens for some finite value of At. Moreover, the resulting interaction operator determines a map which is Euler’s scheme for the original ODE problem, where the time step has been reduced. Given TrS,yz, if T = s, there is no need to change the matrix T, because Trs,yZ = Tsr,yZ. Thus, we move to another element of T. Let this one not be greater than any other, except (possibly) the entry previously considered. If T # s, we have T,.s,yZ 5 Tsr,yz by definition. We replace Tssr,yz with TA.,yz = Trs,yZ, and we add the difference lrs,yz = Tsr,yZ - Trs,yZ to the entry Tss,yZ, i.e., we The resulting matrix T is nonnegative, and its column sums are 1, define TglSlyz= Tss,yz + &,z. because the sum (3.3) defining @s,yz has been preserved (which implies that the dynamics of r has not changed either). However, the sum of the elements of row (ST) is now 1 - IrS,yz, while the sum of the elements of row (ss) is 1 + Zrs,yz. In other words, the resulting T’ is not doubly stochastic,

unless Irs,yz = 0. To fix this, subtract Irs,yz from T,,,,,,

i.e., let T,‘, ss = T,,,,, - ‘!rs,yz,

64

L. RONDONI

and add the same amount to T,,,,,

i.e., let TLr,ss = Tsr,ss + lrs,yz. At this point, we have a

matrix whose entries Trs,yz and Tsr,yt are equal, and which is doubly stochastic if Tss,+ - Irs,yz is nonnegative. Now, Tss+ has form 1 + K,,,,,At, and lrs,yZ is proportional to At. Therefore, there is a finite value of At below which the operation above yields a doubly stochastic matrix. After this, we move to the next smallest element Tij,klwhich has not already been symmetrized, and we repeat the process above, taking care that nothing is done to the terms that have already been symmetrized. For instance, assume that we change Tji,kl in order to match it with Tij,kl. Then, to compensate in the sum (3.3), we add E.’ y,kl to Tjj,kl, and to compensate the row sums, we change Tjj,jj and Tji,jj accordingly. However, Tji,jj could have already been symmetrized, and we do not want to change it again. Thus, we adjust our scheme by adding r!ij,kl to a different Tjs,kl> instead of Tjj,kl. In turn, this requires US to change different terms, i.e., Tj,,j, and Tji,j,. At a certain point, we may have to consider one element of T such that there are no terms where changes could be made without involving other elements that have already been symmetrized. This means that Tij,kl is already equal to Tji,kl. In fact, columns and rows sum up to 1, thus fixing one of their elements, when all the others have been fixed. At every step, the range of values of At which allows us to perform all the symmetrization operations may be reduced. However, there are only a finite number of steps to be performed, thus, the range remains finite at the end of the process. The result of this is that the original map r has been replaced by another one, r’ say, defined by (5.16)

which is the Euler’s scheme for the same ODE problem, but has smaller time step. Moreover, the elements of the interaction operator of 7’ have the important symmetry property T’ij,kl = T’ji,kl = T’ji,lk.

(5.17)

Such a property remains if we further reduce the value of the time step. Indeed, the symmetrization process has merely added or subtracted from each Tij,kl a quantity proportional to At. Thus, the elements of T’ can be written as the tij,kl in equation (5.15) (with different Kij,kl), whose symmetry is not altered by changes in At. In particular, reducing At preserves the doubly stochssticity of the matrix, while increasing it above a certain value does not. I PROOF OF COROLLARY 4.6. That the uniform distribution f^ is a fixed point follows from the fact that f@f = (l/m2,. , . , l/m2) is a uniform distribution on Wm2. Thus, I@ f^ is a fixed point of T. Then, the associated BM acts on f as follows: (5.18) That the fixed point f^ is unique in Q follows from the the uniqueness of p@ f^ as a fixed point of T in Qs, and from the fact that T increases the entropy of any other input probability. Indeed, we note that @i&l > 0, for all i, k, 2 = 1,...,m, implies that all entries of T, constructed as in the previous proof, are positive. Then, TT* is a positive matrix, and Perron-Frobenius theory implies that TT* has a spectral gap A > 0. Thus, take f # f^. Then, f @ f is not fixed for T, and the entropies go like

Wf) = W @f) < S(T(f @f)) I WWW @f)>) = 2W’h

(5.19)

where the last inequality is a consequence of the property Tij,kl = Tji,kl (see Proposition 3.2 and Lemma 3.3). Because the entropy of f’ is different from that of f, we have f # f’, i.e., f is not fixed for the BM.

Boltzmann-Like Kinetic Models Stability

of the uniform

of the uniform neighbourhood

distribution,

under

65

the BM, is explained

as follows.

The

entropy

distribution _/ is the maximum value that can be attained in Q. Consider the of f^ made of all points f with S(f) > S(f) -,E and E > 0. By varying E, we can

make this set contained in any other fixed neighbourhood of f. Because entropy cannot decrease under the action of our BM, for every neighbourhood of f^, Uf say, there is another one, centered in f^, such that the iterations of its points cannot leave Uf. Thus, f^ is stable. To get the convergence part of our result, we summarise the proof of Theorem that the increase in entropy during In fact, the entropy is a bounded

uniqueness of f^ as a fixed point of our BM, the previous fact implies that The fixed point f is then unique, stable, and a global attractor in Q. 4.7. If $J > 0, all coefficients

PROOF OF COROLLARY

@+l are positive.

stationary state follows from the mere fact that a fixed point vector field of equation (2.4), and vice versa. Indeed, if

f

=

fn+l)

G(f),

:=

f(“)

-+ f^ as n -+ co. I

Then,

uniqueness

of the

of the BM (2.8) is a zero of the

f’“’ + G (f’.“‘)At

is an ODE system with associated Euler’s scheme, both the continuous are stationary for G = 0, while they are not if G # 0. Stability

3 in [5]. Observe

the evolution of any f E Q, f # f^ must go to zero as n -+ co. quantity in Q. Because of Lemma 3.3, and because of the

of the fixed point follows from the fact that the evolution

and the discrete

systems

of the ODE problem

decrease the entropy. In fact, if there is a decrease of entropy in the with given initial condition, there are two instants of time tz > tl > 0 and c > 0. But in the interval of time [tr, t2], the solution of equation as accurately as desired, by choosing a sufficiently small time step At

solution

cannot

of equation

(2.4),

such that S(t2) = S(tl) -c (2.4) can be approximated (Proposition 4.4). Thus, a

decrease of entropy would also be observed in the evolution of all the discrete maps with a time step smaller than a given At, > 0. This cannot be the case, as the discrete schemes are BM. 1 PROOF OF PROPOSITION 4.8. Given described in Section 2, and define

a $J E Ce([-1,

s

113), introduce

the same partition

$(w, w, u) dwdw.

of [- 1, l]

(5.20)

Ikxh

Then,

we can write

for all 21,W E Ik x Il,

(5.21)

for some ?& E Ik and 61 E 11. But the value E, = maXi=i,...,m(Ui+l - Ui) converges to zero as m gets larger and larger. Therefore, given the continuity of $J in [-1, 113, for every E > 0, we can find one m E N such that for all (w, w, U) E [-1, 113. This completes PROOF

(5.22)

the proof.

OF COROLLARY

I 4.9. Because

of our assumptions

w, w,

I

27p2%n, m

u) -

and of Proposition

q!&@)(w, w, u))fa(t,

4.8, we can write

'U)fb(t, w) dw dw du

(5.23)

L.F~~NDoNI

66

where gab i’(m) is defined as in the previous proof, through equations (5.20),(5.21), for all i,a,b, and e, can be taken smaller and smaller as m grows. Then IFz(t)

- fg(t)j

I $aB,t,

for all i,5,

(5.24)

and I 2P3??GlJ.

(5.25)

Thus, for given T and E, we only need to take m sufficiently large.

I

IIVGHt)

- U$Ht)(ll

6. EXAMPLES The following examples address, in the simplest possible way, the following three questions. Question 1: Are there chemical systems expressible in terms of quasi-conservative models? Answer 1: Yes. Question 2: Can all doubly stochastic matrices be symmetrized by rows? Answer 2: No. Question 3: What sort of functions $ obey Restrictions 1 and 2? Answer 3: Among the others, all functions which are probability densities in all their variables v, w, 21.

EXAMPLE 6.1.(A simple BM) Consider a model with two species only. The relevant sample space is R = {1,2},

and a probability measure on it has the form f = {fi, fi},

with f2 = 1 - fi.

The

probabilities of picking two independent individuals from the system are (fi)2, if the individuals are both of species 1, fif2 = Fiji, if they belong to different species, and (f~)~, if they are of species 2. In particular, let T be of the form

1-a

P

P

T=

0

0

1-P

2 1-P 2 0

2 1-P

0

it

1

--

P--

P

1-P

0

(6.1)

The associated BM is given by

fin+11=fi’“’- p (fp2 - fi’“‘fq

fp+l) =fp+p(fp)a_

,

(6.2)

fpfp).

If we let p = KAt, equation (6.2) is Euler’s scheme for the evolution of the concentrations fi and f2 of the chemicals A and B, in the reaction 2A = A+B, with reaction rate K. equation (6.2) is a BM if p I l/2, and they can be derived from a quasi-conservative model of form (1.15), if we have p = 2, m = 1, constant encounter rate v, and

l-u ti,111(vJ,u) = 2 ' 1+u ~:2(vw)

=

-

4

’

ti:1(wv) = f, 1cIP2(w+)

=

1-V 4'

-

with Y E (0,l). In this case, we have p= VVAt. More complex models of chemical reactions can be thought of and produced with this technique. In particular, any convex combination of maps like these yields another such BM.

Boltzmann-Like EXAMPLE

there

6.2.

(On the symmetry

is no symmetry

Kinetic Models

67

a case where T is doubly

by rows) Consider

stochastic,

T=

If f # (l/2,1/2),

but

by rows, e.g.,

the entropy

(6.4)

of the relevant

2W) = w

@f)

probabilities

obeys

< WYf 8 f)) <

s (f’) + s (9’)

9

where f’ and g’ are the marginals of T(f 8 f). Nonetheless, 7(f) = f, and S(T(~)) = S(f). Moreover, we find that not all doubly stochastic matrices can be symmetrized by rows, in such a way that

they remain

doubly

stochastic,

the only way to get the same dynamics, is to fill such rows with zeroes.

and that the associated and equality

BM are not changed.

of the second with the third

Hence, we see the importance

of Theorem

Indeed,

row of our T,

4.5 for the subsequent

results. EXAMPLE 6.3. Consider a l-population model whose function $ : [-1, 113 --) R+ U (0) is a probability distribution in all of its three arguments, i.e., &I(V) = $(., w, .) dw and dVz(w) = measures on [-1, 11. Then, if $ is simple, it satisfies Restrictions 1 $J(.>.? w) dw are probability

and 2, as $(w, W, U) du, = 1 The corresponding are probability

l(6.5)

s [-1,112

quasi-conservative

distributions

$(v, w, u) dv dw = 2.

and

s [-1311

models are those for which both Cz=,

&,

and CL-,

T&,

in all their variables.

REFERENCES 1. E. Jager and L. Segel, On the distribution of dominance in a population of interacting anonymous organisms, SIAM J. Appl. Math. 52, 1442-1468 (1992). 2. L. Arlotti and N. Bellomo, On a new model of population dynamics with stochastic interaction, nansp. Theory Stat. Phys. 24, 431-443 (1995). 3. L. Arlotti and N. Bellomo, Solution of a new class of nonlinear kinetic models of population dynamics, Appl. Math. Lett. 9 (2), 65-70 (1996). 4. L. Preziosi, From population dynamics to modelling the competition between tumors and immune system, Mathl. Comput. Modelling 23 (6), 135-152 (1996). 5. R.F. Streater, Convergence of the iterated Boltzmann map, Publ. Research Inst. Math. Sciences, Kyoto University 20, 913-927 (1984). 6. RF. Streater, Statistical Dynamics, Imperial College Press, London, (1995). 7. P.M. Alberti and A. Uhlmann, Stochasticity and Partial Order, D. Reidel, Dordrecht, (1982).

Boltzmann-like kinetic models and Boltzmann maps

Boltzmann-like kinetic models and Boltzmann maps

Recommend Documents