CHARLESJ.LUMSDEN Department of Medicine, University of Toronto, Toronto, Canada
We consider a class of differentiable dynamical systems that fulfill a criterion of adaptation, or being fit. The adaptation criterion is found to be sufficient for the existence of hierarchical behavior in these systems, once they become complex. Commutative diagram methods are used to show that under reasonable conditions the hierarchical behavior takes the form of a closed dynamics of aggregate variables. This dynamics characterizes the system functions as a whole. Existence conditions for the aggregate dynamics are obtained, together with their differential form and their connections to the original system description. The results indicate that evolutionary processes are likely to produce systems that are hierarchically organized in terms of their function as well as their structure.
Introduction. Darwinian theory (Darwin, 1857; Mayr, 1970) observes that an adaptive fit often exists between organisms and the environments they occupy. Natural selection is viewed as the prime mover shaping organic diversity and phenotypic design. The fit is created as natural selection acts on individuals through their differential survival and reproduction. The winners of the ensuing evolutionary competitions are postulated to be those whose fitness is greatest. In mathematical versions of the theory, fitness is typically a function that integrates reproductive success with survival likelihood over an individual’s lifetime. The concept can be extended to cover genes that related individuals share by common descent (Hamilton, 1964; Maynard Smith. 1964: Kurland, 1980), and to selection processes that act on groups, societies, and even entire populations (Levins, 1970: Eshel, 1972: Boorman and Levitt, 1973; Wilson, 1980). Substitute measures of quantitative fitness (such as physiological efficiency) more amenable to observation than net reproductive success are often used in modeling (Rosen, 1967; Oster and Wilson, 1978). Fit organisms correspond to the phenotypes or gene variants at which the fitness functional takes on its maximal values. In this report we are interested in relating adaptive design to hierarchical organization. Rather than structure, the focus will be on the dynamical order characterizing the phenotype. A class of differentiable dynamical systems will be introduced for this purpose. Its members are naturally described as 1.
591
592
C. J. LUMSDEN
fit in the sense that they obey a principle of adaptive design. An approach to hierarchical organization is then presented that scales among levels of dynamical order. Commutative diagram methods are used to show that such levels of order exist in fit systems once they attain a certain degree of complexity. Complexity is measured merely by the number of independent state variables (equivalently, the number of differential equations) required to describe the phenotype. Thus the fact that a complex system is the product of adaptive design suffices to establish a degree of hierarchical order within its dynamical behavior. The notion that biological systems have a hierarchical design, with many interconnected levels of structure and function, has a long and productive history (Novikoff, 1945; Wilson, 1969; Pattee, 1973). There is an extensive literature deaiing with biological complexity and hierarchical order in a qualitative way (e.g. Wilson, 1969; Whyte et al. 1969; Dresden, 1974; Cottrell, 1977; Allen and Starr, 1982). In addition, there has recently been interest in exact mathematical models of biological processes that have multiple levels of organization. Some investigations have concentrated on making rigorous the meaning of multiple alternative descriptions (Rosen, 1968; 1976; 1977a, b; Primas, 1977). Others have analyzed the types of hierarchical order present in specific models (Simon and Ando, 1961; Simon, 1962; Wilson, 1968; Mesarovic, Madco and Takabarz, 1970; Weidlich, 1972; Demetrius, 1977; Haken, 1977; Vemuri, 1978; Lumsden and Trainor, 1979; Lumsden and Wilson, 1981; Lumsden, 1982; Auger, 1983). The increasing number of studies of the second kind raises an interesting question: what can be said in formal terms about the hierarchical organization of dynamics as an adaptive trait in itself? The purpose of this study is to establish the existence of a connection between fit design and this form of organization. The models to be used in the investigation are defined and their dynamics specified in Section 2. Section 3 introduces a scaling procedure and uses it to relate the system dynamics to the organization of its aggregate variables. Theorem 2 establishes the connection linking adaptation to multilevel ordering of the fit dynamics. The mathematical findings are then discussed in the paper’s concluding section. We consider a system 9 with state variables xk E w, k = 2. Fit Systems. I N characterizing its morphology, behavior, and consequences of this * * 7 behavior (e.g., proportion of body fat, number of matings per unit time, number of offspring produced per unit time). Depending on the traits of relevance, the same state variables might be used to compare members of different biological species, or their application might be restricted to deal
FITDYNAMICALSYSTEMS
593
with intraspecific variation. Activity of Ywithin its environment is characterized by temporal changes in xk. The dynamics of the state vector x=(X1,. . . , xN) follow the first-order differential equations dx, -ZX dt The the For tion
.
k=Xk(x,
t),
k= I,...
.N.
form of the Xk for systems of biological interest ultimately expresses interaction of genotype and environment in shaping the phenotype. the systems relevant to the present investigation the principal restricon the .& is smoothness of a minimal sort, modeled by @“-continuity on
RN+‘. x:lRN+‘+lR~:(x,
t) p
x
where t E W denotes the time variable and we adopt the mapping conventions of Arbib and Manes (1975). The equations (1) induce the flow 4 for Ythrough the configuration phase spaceRN: $:RN+r+RN:
(x,, to) I+ x(t 1x0, to).
For every # on R N+’ induced by equations (1) we define the reward function or fitness as a scalar functional J that integrates over the life history [to, tf ] of9 J=
tfG(x, x, t)dt, I to
(2)
where G(O) is the reward rate on [to, tf I. The reward rate of particular interest in population biology is the net reproductive success of SPper unit time, or (more commonly) a phenotype variable believed to determine a significant proportion of the variance in this fitness measure. The choice of G(*> is influenced by the type of phenotypic designs to be compared and the envisaged nature of their variation under genetic change (Rosen, 1967; Oster and Wilson, 1978). For example, biological organisms have a finite reproductive success, and generally G( ) is constructed such that J < UJfor each3 Similarly, among the possible G(O) are only those with support in the subsets of pertinent initial states. Conversely, not all formally conceivable initial states x(t,) will enter into the determination of a G( l ) [e.g., ant societies based on claustral founding always start from x(to) = 1 individual, the fertilized queen, irrespective of other possible modes of variation (Oster and Wilson, 1978; Lumsden, 1982)]. In view of these considerations we introduce the following definition. Definition. The differentiable system 9 is a fit system F iff for each x(t 1x0, to) solving (1) the fitness J is maximal on the Life history [to, tf ] : l
594
C.J.LUMSDEN
= max.
J[x(*)]
(3)
The set of all such fit systems will be denoted byP. Remark. This definition is clearly over-inclusive in the sense that all systems (1) for which (3) is true are admissible fit systems, and this set may include elements that are not biological in any reasonable sense. However, because processes of competition coupled with differential (re)production influence a broad range of natural and man-made systems, in a manner often resembling classical natural selection, it is useful to relax the notion somewhat and for the time being include systems of the latter types within 9. Equally intriguing is the question of whether 9 includes any biological systems at all, or whether near-optimally designed phenotypes rarely appear during biological evolution. Our definition would then be too restrictive for biological purposes. Although much work remains to be done on this issue, the available evidence from studies of physiological (Rosen, 1967; Alexander, 1982) ethological (McFarland and Houston, 1981; Krebs and McCleery, 1984), and societal (Wilson, 1980a, b) design indicates that Scontains some (but by no means necessarily all) real biological systems, and that nearoptimal phenotypic traits may be rather more common than previously supposed. The flows $ in which equation (3) is true follow from the variational principle rfG(x, i, t)dt = 0. s to
6J=S
(4)
In differential form this condition is expressible as the control equations -aG ---=d aG dt a.& aXk
0,
k=
l,...,N
(Bryson and Ho, 1969; Vagners, 1974). It is useful to transform symmetric notation in which costate variables h&6x/8X&,
k=
i.,...,
N
(5) a more (6)
replace the x&s. Using the generating function Q=G-5
h&x& k=l
(7)
in the fitness principle (3), one obtains from the control equations (5) the costate dynamics dXk/dt
=-aQl&Yk,
k = 1,. . . ,N
(84
FITDYNAMICALSYSTEMS
595
as well as the state variable equations dx-,/dt = aQ/ah,,
li = I,. . . >N.
(8b)
The natural canonical or Hamiltonian structure of equations (8) follows from the variational structure of the fitness problem (3) and the use of costate variables to express the control equations as a first-order differential dynamics. Unlike physical systems, whose microscopic dynamics are also essentially canonical, fit systems are not restricted to reward functions G( l) that have the form of an energy function. This flexibility opens the possibility of new and unexpected behaviors from a physical point of view. Moreover, treatments of physical systems far from thermodynamic equilibrium have demonstrated that even rather conventional energy-Hamiltonian systems can exhibit entirely novel models of spatio-temporal organization relative to the characteristics of near-equilibrium (Prigogine, 1980; Prigogine and Stengers. 1984). The present development will not restrict the fitsystem dynamics to orders analogous to ‘at’ or ‘near’ some condition of aggregate equilibrium. For some applications it is advantageous to incorporate constraints among the state variables into the explicit statement (3) of the fitness problem. The resulting control equations preserve the basic canonical structure of equations (6)-(g), with modifications introduced by the presence of the constraining conditions (Bryson and Ho, 1968; Oster and Wilson, 1978). Methods treating the collective dynamics of constrained systems have recently become a subject of analysis and application (Lumsden and Trainor, 1979? 1985). In this study we will assume that any constraining conditions among the state variables have already been incorporated by eliminating variables (1) and appropriately rewriting equations (l)-(3). When N %=1 in equations (l)-(8) it becomes useful to introduce variables that characterize the system, or large parts of it, as a whole. The original description in terms of (x, x) or (x, h) contains an abundance of detail. Dejhitiorz. The %’ map
A,:lRZN+ lR:(x, A) PA,(x,
h) ER,
(9)
is an aggregate variable of F iff dim [dom (A,)] /2N % 1. In other words, A, depends on most or all of the state variables xk or state and costate variables Sk> hk: k = l> . . . ) N. Remark. A system may possess a number of aggregate variables A,, 111= 1, . . , > M relevant to a particular set of properties, so we introduce the aggregate descriptiolz d s {A, I nz = 1, . . . , M}, along with the following definition. Dejkitiorl. An aggregate description dis interesting iff M Q N.
596
C. J. LUMSDEN
Interesting aggregate descriptions characterise F by means of relatively few relevant variables A,. They are economical relative to state-variable descriptions in terms of the xk and h k. In biological applications the A, represent such important concepts as the local concentration of a biochemical species (sum over many molecules), the number of individuals in a population (sum over many individuals), and net matter-energy fluxes though trophic networks (sum over activities of many producers and consumers). The best-understood aggregate variables are in fact in the form of simple sums over the state and costate variables. The set J$ is a subset of 9, the set of all observables f:lRvv + lR defined on (1). A weak criterion for %?, sufficient for the present study, is that it comprises the set of all %‘I complex functions on lRzN which vanish at infinity. The behavior at infmity models the finite capacities of biological systems; formally, it allows meaningful expectation values to be constructed. Since lR2N is locally compact, relative to the sup norm llfll ~,~;2JfoI
(10)
and the involution f t+ f*, * G complex conjugation,@ is an Abelian B*algebra over the system phase space lRZN (Ruelle, 1969). Through d the flow 4 in lRN (equivalently, IR2N) connects the aggregate variables at t,, with their values at any t E [to, tfl . 9
I XI), to>
a$----
x0
d
54 I A@ I At,, to>
I AO,)
(Dl)
If there exists a t-autonomous where A(t) G (A r(t), . . . , AM(t)). differentiable flow GAon RM such that # x(to)------+
x(t I x0, to>
Nto)
A@IAo, to)
-
V1-
@A
with d* $A = 9 at we say that &f?constitutes a closed aggregate description of F. The corresponding differential equations can be written as l
k,=Y,(A,
)...)
A,),
m=
I)...)
M.
(11)
FITDYNAMICALSYSTEMS
591
For fit systems that admit closed aggregate descriptions the system
+ aA?3a9 --=
ahk axk
Ym(A1,.
.
. ,A,),
m = l,..
. ,M(12)
is solvable. On the basis of present knowledge it appears that pairs (Y,4qd) for which (D2) is true are either rare or quite difficult to find (e.g., Coxson, 1984, p. 435). Commonly there is some leakage from the x-level to the A-level that prevents closure in the sense of(D2) (see below). All fit systems are, however, A-closed in the following sense. THEOREM 1.Let p(x, h) be an element of Xw, the set of integrable, unitnormaIizable (‘probability ‘) functions in R 2N.Xw is a convex subset of 4V’. the dual of the algebra of system observabIes S (Ruelle, 1969). Let P(a) EX”, where ,XM is defined similarly to X2N. Then there exist Gp and q$ such that the diagram (03) commutes.
(D3)
P(a, toI
d?P
b
P(a, t).
Remark. The operation sdenotes construction of the probability distribution P [system ensemble (Lumsden and Trainor, 1979)] on lRM for a given specification of aggregate variables. Notation. As context warrants we shall write equivalently p(x, X, t) G p(x, X) = p(t) for the value of p at time t. We prove Theorem 1 in several steps.
LEMMA
1.In the (x, h) variables the differential form of & is apiat = -i.C$p
(13)
where
(14)
598
C. J. LUMSDEN
Proof. Since
s
dx dX p(x, X, t) = 1
RZN
vt
1,151
p(t) has a local balance equation
(16) (Lumsden and Trainor, 1979). Using equation (8),
+I:
N --------_oo. ap39 ap w kzl
axk ahk
ahk axk
(17)
The generating function Q(x, X) is X-linear and the formal difference a2QjaXk a& - a2Q]&aXk
(18)
n vanishes. When .Y is explicitly independent of time, the propagator T(tl to) that takes p(x, h, to) into p(x, A, t) for t > to is by operator integration
T(t
1 to) = e-t(r-rO)~
(1%
where @p: p(to) 5 T(t I to) : p(to) /-+p(t) = e-‘(r-ro)9p(to).
(20)
Explicit time dependence may occur in Ywhen the fit system F is coupled to an external environment or in the presence of certain forms of dissipation (Lumsden and Trainor, 1979). This inclusion requires the use of timeordering operators (Fetter and Walecka, 1971) and makes the development more cumbersome, but entails essentially no new formal results. To keep the presentation streamlined we treat the case of explicit time-independence, and without loss of generality set co = 0. LEMMA 2. The constructive procedure 5 exists for diagram (03). Proof. She A,,, = A&x, X), m = 1, . . . , M, an A, contour with value a, is a hypersurface in lR2N. Intersection of the hypersurfaces A,(x,X)=a,(t),
m = 1,. . . ,M
(21)
FITDYNAMICALSYSTEMS
599
generates at any time t a 2N -M-dimensional subset IYa,. . . . , uM) - I’(a) C R2” consistent with A having value a(t) - @r(t), . . . I a,~&t>). Define P(a,t)da~Prob(a,
that a,,