Modular genetic regulatory networks increase organization during pattern formation

Modular genetic regulatory networks increase organization during pattern formation

Accepted Manuscript Title: Modular Genetic Regulatory Networks Increase Organization During Pattern Formation Author: Hamid Mohamadlou Gregory J. Podg...

3MB Sizes 2 Downloads 92 Views

Accepted Manuscript Title: Modular Genetic Regulatory Networks Increase Organization During Pattern Formation Author: Hamid Mohamadlou Gregory J. Podgorski Nicholas S. Flann PII: DOI: Reference:

S0303-2647(16)30037-5 http://dx.doi.org/doi:10.1016/j.biosystems.2016.04.004 BIO 3652

To appear in:

BioSystems

Received date: Accepted date:

15-2-2016 4-4-2016

Please cite this article as: Hamid Mohamadlou, Gregory J. Podgorski, Nicholas S. Flann, Modular Genetic Regulatory Networks Increase Organization During Pattern Formation, (2016), http://dx.doi.org/10.1016/j.biosystems.2016.04.004 This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

ip t

Modular Genetic Regulatory Networks Increase Organization During Pattern Formation Hamid Mohamadloua , Gregory J. Podgorskic,d , Nicholas S. Flanna,b a

us

cr

Department of Computer Science, Utah State University b Institute for Systems Biology, Seattle c Department of Biology, Utah State University d Center for Integrated BioSystems, Utah State University

an

Abstract

Ac ce p

te

d

M

Studies have shown that genetic regulatory networks (GRNs) consist of modules that are densely connected subnetworks that function quasiautonomously. Modules may be recognized motifs that comprise of two or three genes with particular regulatory functions and connectivity or be purely structural and identified through connection density. It is unclear what evolutionary and developmental advantages modular structure and in particular motifs provide that have led to this enrichment. This study seeks to understand how modules within developmental GRNs influence the complexity of multicellular patterns that emerge from the dynamics of the regulatory networks. We apply an algorithmic complexity to measure the organization of the patterns. A computational study was performed by creating Boolean intracellular networks within a simulated epithelial field of embryonic cells, where each cell contains the same network and communicates with adjacent cells using contact-mediated signaling. Intracellular networks with random connectivity were compared to those with modular connectivity and with motifs. Results show that modularity effects network dynamics and pattern organization significantly. In particular: (1) Modular connectivity alone increases complexity in network dynamics and patterns; (2) Bistable switch motifs simplify both the pattern and network dynamics; (3) All other motifs with feedback loops increase multicellular pattern complexity while simplifyEmail addresses: [email protected] ( Hamid Mohamadlou), [email protected] ( Gregory J. Podgorski), [email protected] ( Nicholas S. Flann)

Preprint submitted to BioSystems

May 5, 2016

Page 1 of 19

ing the network dynamics. (4) Negative feedback loops affect the dynamics complexity more significantly than positive feedback loops.

6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30

us

an

5

M

4

d

3

Multicellular organisms contain a large variety of cellular patterns. For instance, Figure 1 illustrates a Drosophila melanogaster embryo in which muscle and nervous system structures interconnect through sensory and activation signaling. These patterns are formed during development and are a consequence of genetic regulatory networks (GRNs) that operate within cells and that respond to communication between cells [1] [2], [3]. GRNs are networks of interacting genes where the expression or non-expression of genes determines the expression state of other genes. The dynamics of GRNs determine the gene expression profile for each cell leading to spatial patterns of cellular differentiation. This process is repeated to implement an organism’s body plan, and subsequent morphology [4]. GRNs contain subnetworks of genes referred to as modules. Smaller modules, usually consisting two or three genes with specific functioning, are known as motifs [5] [6]. Motifs are detected at a higher frequency than would be expected in random networks. Computational biologists have hypothesized that motifs play a determinative role in cell function [7] [8]. However, their influence on pattern formation during development is poorly understood. This work presents a computational study aimed at understanding how the presence of motifs within intracellular networks changes the GRN dynamics and the emergent multicellular patterns. In particular, this study measures the complexity of the organization of the dynamics and patterns. As can be seen in Figure 1, patterns can involve complex arrangements of specialized regions and interconnections that develop later, or earlier patterns such as simple segmentation [9] or mosaic arrangements [10] [11]. To quantify the level of organization we employ Kolmogorov complexity, also known as algorithmic complexity [12]. Such methods measure the information contained within an object, such as a cellular pattern, by considering the size of the algorithm needed to generate the object, hence the term algorithmic complexity. The smaller the algorithm, the

te

2

1. Introduction

Ac ce p

1

cr

ip t

Keywords: Network motifs, Kolmogorov complexity, Pattern formation, Genetic regulatory networks

2

Page 2 of 19

33 34 35 36 37

M

an

us

38

ip t

32

simpler the pattern. The calculations of Kolmogorov complexity is detailed in the methods section following. To understand how GRNs regulate biological events, scientists have developed mathematical and computational models to generate predictions and explain experimental observations. Among these modeling approaches is to represent a GRN as a Boolean network in which the activity of a gene is either on or off, determined by a set of logical functions over the activity of other genes [13]. It is this modeling framework that we apply in this study.

cr

31

40 41 42 43 44 45 46 47 48

49

50 51 52

To evaluate the influence of the modular structure of networks and in particular motifs on network dynamics and patterns, we design GRNs that are embedded into cells arranged in a 2D grid, simulating an epithelium. Such abstractions of the epithelium have been employed successfully in many developmental systems, such as the cellularized Drosophila embryo [9], and the sensory epithelia of the developing vertebrate retina [14] and the inner ear [10]. Each cell contains the same Boolean network, referred to as an intracellular network. We explore the impact on network dynamics and multicellular patterns of modules with random connectivity and regulatory functions, and when the modules are explicitly recognized motifs.

Ac ce p

39

te

d

Figure 1: Ventral view of stage 16 Drosophila melanogaster embryo immunostained for tropomyosin (green; a protein expressed in muscle), Pax 3/7 (blue; a regulatory protein expressed in central nervous system nuclei and ectoderm), and HRP (red; neurons). All nuclei shown in gray (DAPI). Courtesy of Julieta Mara Acevedo and Lucas Leclere, Marine Biological Laboratory, Woods Hole, www.mbl.edu/ dev.biologists.org/

2. Modular structure and motifs within gene regulatory networks Networks are structurally modular if they contain highly connected clusters of genes that are linked by sparser connections than those within the modules. Figure 2(a-b) shows a small network with a modular structure 3

Page 3 of 19

56 57 58 59 60 61

an

62

ip t

55

cr

54

compared to randomly connected networks. Evidence suggests that modular network structure increases the functional modularity of gene regulatory networks by performing relatively independent tasks [15], [16] such as decision making, signal processing, and communication. The modular organization of biological structure is supported by experimental studies from pathogen structure, gene networks, and protein-protein interaction networks [17]. For example, Kim et al. [15] considered the connected subset of protein networks in protein-protein interaction data for budding yeast. Their analysis suggests that the yeast protein network is significantly modular, and it contains various motifs.

us

53

(b)

(d)

(e)

(g)

(f)

(h)

d

(a)

M

(c)

63 64 65 66 67 68 69 70 71 72 73

Ac ce p

te

Figure 2: This figure illustrates networks with modular structures and example of variety of motifs. (a) A network with modular structure where intra-modular connectivity is higher than inter-module connectivity. (b) A randomly connected network. (c) A positive feedback loop (a double inhibitory loop with two positive autoregulatory loops). (d) A positive feedback loop (a double excitatory loop with two positive autoregulatory loops).(e) A negative feedback loop [18] with two positive autoregulatory loops. (f) Coupled positivepositive feedback loops. (g) Coupled positive-negative feedback loops. (h) The type-1 coherent feed-forward loop [19].

We study two kinds of modularity in this work: structural modularity when the modules within the intracellular network are connected randomly and functional modularity when the subnetworks are recognized motifs. Motifs frequently occur and consist of few interacting genes [8] and were first noted in Escherichia coli, where they were detected at a higher frequency than would be expected in random networks. Since then multiple motifs have been identified in bacteria and yeast [20], the immune system [21], and Drosophila [15]. This finding suggests that motifs are building blocks of transcription networks and that they may have evolved to achieve specific regulatory behaviors in cellular transcription networks [18]. Regulatory motifs have been found in regulatory networks that perform two distinct functions: 4

Page 4 of 19

107

3. Multicellular model and implementation

82 83 84 85 86 87 88 89 90 91 92 93 94 95 96 97 98 99 100 101 102 103 104 105

108 109

cr

81

us

80

an

79

M

78

d

77

te

76

Ac ce p

75

ip t

106

(1) Developmental networks that guide differentiation and cell fate determination by transducing signals into irreversible cell-fate decisions [22] [23]; and (2) Sensory networks that respond to signals such as stresses and nutrients rapidly and make reversible decisions [5]. The motifs that are associated with developmental networks are commonly comprised of feedback loops. Positive-feedback loops are most common and are made up of two transcription factors that regulate each other. There are two kinds of positive-feedback loops, a double excitatory loop (Figure 2(b)) and a double-inhibitory loop (Figure 2(a)). The regulatory dynamics of these gene pairs coupled by positive feedback loops often results in two or more steady states and is referred to as multistability [18]. Positive feedback loops amplify signals and elongate the time required to reach a steady state, referred to as a network attractor [20]. This slowed response can be helpful when a cell makes significant decisions such as irreversible cell specification and apoptosis. Unlike positive feedback loops, negative feedback loops (Figure 2(c)) often enhance attractor stability [8]. They also function as noise filters and make cells more robust to signal noises. Additionally, positive and negative feedback loops are coupled into structures containing two feedback loops, such as positive-positive, positive-negative and negative-negative feedback loops (Figure 2(d,e)). Coupled feedback loops perform functions that single feedback loops cannot. In particular, Kim et al. [18] found that a positive-positive feedback loop enhances signal amplification and bistability, and a positive-negative feedback loop increases reliable decision-making by modulating signal responses and effectively dealing with noise. Feed-forward loops (FFL) are another family of motifs that do not contain feedback and are associated with sensory networks. FFL are found in a variety of organisms such as Saccharomyces cerevisiae, Bacillus subtilis, Caenorhabditis Elegans and humans [19]. FFLs consists of three genes (Figure 2(f)). The first regulatory gene controls the second and the third genes. The third gene is also regulated by the second gene. Logical gates such AND or OR could be applied to the three regulatory interactions in the FFL. The best known FFL which frequently occurs in (E. coli and yeast), is the coherent type-1 FFL [24] with all AND gates.

74

The size of gene networks for multicellular pattern formation drove the decision to use Boolean networks as the framework for this computational 5

Page 5 of 19

113 114 115 116 117

ip t

112

cr

111

study. A Boolean network is a simplified model of a genetic regulatory network. In this application, each gene is represented as a network node that takes binary values (1 for expressed and 0 for not expressed). The state of a gene (0 or 1) is determined by its Boolean function defined as the expressions over AND, OR, NOT on the inputs from other genes. These inputs are represented as directed edges in the network graph. Boolean networks provide a qualitative description of gene states and their interactions, first introduced by Kauffman [13], [25]. (b)

(a) Epithelium field

us

110

Epithelium field

Signaling module

Random network

te

d

M

an

Cell membrance

Double positive loop

118 119 120 121 122 123 124 125 126

Ac ce p

Figure 3: (a) Structural modularity. Cell-cell signaling is implemented through specific modules referred to as signaling modules (depicted in green). In this example, cell-cell signaling is orthogonal such that two adjacent cells signal directionally [north-south and east-west]. These directions can be thought of as corresponding to the anterior-posterior and dorsal-ventral embryonic axis. (b) Motif insertion. Example motif insertion into a random GRN. Dashed arrows represent random outgoing signals from the motif. Outgoing and incoming signals from and to the random GRN are randomly connected to genes within the random network.

For the first part of this study, to evaluate the effect of modularity in network dynamics and pattern complexity, structurally modular networks were created in the critical or chaotic domain, see Figure 3(a). For the latter part of this study, motifs were inserted in randomly created networks, see Figure 3(b). For instance, the insertion of a double positive loop is produced by adding two autoregulatory genes that regulate each other and also regulate two random genes from the rest of the network. This work extends our previous study [3] that investigated the role of different dynamical regimes (ordered, critical or chaotic) on the complexity 6

Page 6 of 19

134 135 136 137 138 139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164

ip t

cr

133

us

132

an

131

M

130

d

129

te

128

of multicellular pattern formation. The simulation model was unchanged along with the epithelial model as a lattice of cells, with each cell holding an intracellular Boolean network. Cell-cell signaling was implemented in the model as an edge connecting the state of one gene in a cell to an input of a Boolean function of one or more of its neighbors. Such genes are called communicating genes. The number of communicating genes is referred to as the signaling bandwidth. Signaling bandwidth was set to half of the total number of genes in a cell as our previous study showed that this configuration established the most efficient cell-cell signaling. Experiments were run for two different intercellular connection configurations, orthogonal and symmetric signaling (see Figure 4). When connected symmetrically, each cell receives signals from its four neighbors, inputs them to a logical OR function creating a single input to the intracellular network. This architecture biologically implements contact-mediated signalling [3] [26]. When connected orthogonally, each cell connects with its four neighbors, but the signals are independent inputs to the intracellular GRN [27]. Orthogonal signals correspond to intercellular communication along the anteriorposterior and dorsal-ventral embryonic axes essential to generate complex patterns illustrated in Figure 1. Networks in two complexity classes were considered: critical and chaotic. To produce networks of different complexity domains, the number of inputs to each Boolean function (node in the network) is set according to s = 2kp(1−p) where s is the sensitivity of the network to perturbations in gene values, p is the probability of the output of each Boolean function being 1, and k is the count of inputs to each Boolean function [28]. When s = 1 a single bit change is on average propagated to one other node, and the network is in the critical domain. In a chaotic network, s > 1 and perturbations tend to grow. In this work, p was fixed at 0.5 and k was changed to create networks of different domains: k = 2 for critical and k = 3 for chaotic. The state of each cellular GRN is initialized randomly by setting the state of each gene to 0 or 1. Randomly generated logic functions are assigned to networks as the transition rules used to determine the state of genes [3]. The state of the system during simulation is clocked synchronously until a steady or cyclic state (in up to 300 updates) is reached for all individual cells. When the state of genes change in a repetitive cycle or reach a fixed state, then the cell is in attractor state [13]. Since cells are connected through directional signaling, neighboring cells may converge to different attractors, forming a regular pattern, where cells in the same attractor have differentiated into

Ac ce p

127

7

Page 7 of 19

174

4. Methodology

175 176 177 178 179 180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198

cr

172

us

171

an

170

4.1. Information Complexity Set Complexity [29] is an information complexity metric derived from Kolmogorov’s original work on algorithmic complexity. In this seminal paper by Galis et al., the method was developed explicitly to measure the information contained in sets of biological data at different scales: the molecular, sequence and network. Recently, Set Complexity has been applied to study modularity in biological networks [30] and large-scale genetic interaction networks [31]. In this work, we extend the approach to the analysis of multicellular patterns and the network dynamics that create them. Kolmogorov complexity of an object is the size of the shortest program that can produce that object. The exact calculation is undecidable and, therefore, cannot be computed as defined. As an approximation, the compression size of an object is used. Kolmogorov analysis describes the similarity of two objects as the size of the shortest program that can translate one object to the other. This measure is approximated by the Normalized Compression Distance (NCD) [32], where the NCD of two objects is 0.0 if they are identical and 1.0 if they are random. Set Complexity combines the analysis of single objects and pairwise objects to arrive at a complexity measure of a set of objects, given in Equation 1. By employing NCD as a metric to evaluate the similarity between all pairs of objects in a set, set complexity discounts the influence of the pairs of objects that are randomly related or redundant. As long as any object can be encoded as a string, Set Complexity can compute the information content that resides in the set. Set Complexity of a set of n strings S = {s1 , . . . , sn } is defined:

M

169

d

168

te

167

Ac ce p

166

ip t

173

the same cell types. The state of all the genes as the networks along with multicellular patterns are recorded for analysis of information content. After running the randomly-generated GRNs, single and coupled feedback and feedforward loops are inserted into the randomly generated intracellular GRN (see Figure 3). The network is run again with the inserted motifs to identify the attractors and visualize the multicellular patterns that are formed. The Kolmogorov complexity of both the gene network dynamics and multicellular patterns is computed by an information theoretic measure called Set Complexity described below.

165

8

Page 8 of 19

X X 1 C(si ) NCD(si , sj )(1 − NCD(si , sj )) n(n − 1) s ∈S s 6=s i

205 206 207 208 209 210 211 212 213 214 215

216 217 218 219

220 221 222 223 224 225 226

cr

us

204

an

203

M

202

d

201

te

200

i

Where C(si ) is the compression size of string si . The term NCD(si , sj )(1− NCD(si , sj )) is maximized when NCD(si , sj ) = 0.5, which occurs when C(si + sj ) ' C(si )/2 − C(sj ), assuming C(si ) > C(sj ). The influence of similar and dissimilar strings is discounted by NCD(si , sj )(1 − NCD(si , sj )) because NCD(si , sj ) will be near 1.0 or 0.0. In this implementation, bZip is used as a compression algorithm. A one-to-one mapping is required to encode an object into a string so that no information is lost. The temporal dynamics are converted to a string by concatenating in column-order the state of the network at each time period. The spatial pattern is likewise converted by traversing the cells in row order and concatenating the type ID of each cell. To convert a GRN to a string, a more complex procedure is needed. Here, a unique ID is assigned to each logic gate and edge; then the whole epithelial network is traversed, and substrings created for each logic gate that includes its truth table and its input and output edge IDs. The string also includes a row order traversal of the connection matrix. See [33] for the exact algorithm for encoding Boolean networks. 4.2. Structural modularity score measurement The most common method used in the literature to score structural modularity is a method by Newman[34]. Newman’s method to compute the modularity score Q for a given network is as follows:

Ac ce p

199

j

(1)

ip t

Ψ(S) =

c X Q= (eii − a2i )

(2)

i=1

where c is the total number of modules, eii is fraction of edges in module i, eij is the fraction of edges that connect module i to module j and ai is the fraction P of edges that connect module i to other modules and is as follows: ai = j eij In our experiments, the nodes in the modules are known since each module is generated with higher intra-connection than the sparse connections between the modules. 9

Page 9 of 19

234 235 236 237 238 239 240 241

ip t

cr

233

us

232

an

231

M

230

d

229

5.1. Structural modularity To analyse how the structural modularity of a network influences the dynamics and pattern complexity, we partition a population of random networks by percentile of their modularity score distribution. Networks that lie in lower third are defined as non-modular, and networks that lie in upper third are defined as highly modular. Those networks considered non-modular exhibit the same low modularity score as random networks. Networks that lie in the middle third were not considered in this study. Figure 4 illustrates the average dynamics complexity and pattern complexity for 600 non-modular and 600 highly-modular networks. Results in Figure 4(a) show how highly modular networks produce higher dynamics and pattern complexity when cell-cell signaling is orthogonal. A statistical analysis confirms the distribution of the two classes of networks are significantly different Table 1 and Table 2.

te

228

5. Results

Ac ce p

227

Figure 4: Non modular vs. modular. Average dynamics and pattern complexity for 60 non-modular (arrow tails) and 60 highly-modular (arrow heads) networks for orthogonal(orange) and symmetric signaling(dark red).

10

Page 10 of 19

Critical modular

Chaotic modular

90.059

112.25

20.921

26.565

15.255

1315.4

1680.0

18.769

19.127

df 59 t Stat 11.480 P(T¡=t)

5.6E-17

t Criti1.671 cal

59 3.265

59 7.993

9E-5

2.8E-11

1.671

1.671

Dynamics complexity Non Highly Modular Modular 107.85

134.96

1212.8

2035.9

59 3.714

us

11.282

23.328

2E-4

an

Mean 15.593 Variance

Pattern complexity Non Highly Modular Modular

cr

Pattern complexity Dynamics complexity Non Non Highly Highly Modular Modular Modular Modular

ip t

Table 1: Orthogonal signaling. Dynamics and pattern complexity change when non modular networks become highly modular networks.

1.671

Critical modular

M

Table 2: Symmetric signaling. Dynamics and pattern complexity change when non modular networks become highly modular networks.

12.603

1.1E-09

t Criti1.671 cal

242 243 244 245 246 247 248 249 250 251

Dynamics complexity Non Highly Modular Modular

31.017

31.844

19.193

17.479

74.169

39.422

11.919

175.18

138.65

18.212

20.827

690.37

273.23

59 0.382

59 2.526

59 9.192

0.3517

7E-04

2E-13

1.671

1.671

1.671

Ac ce p

df 59 t Stat 7.046 P(T¡=t)

Chaotic modular

Pattern complexity Non Highly Modular Modular

14.940

te

Mean 11.499 Variance

d

Pattern complexity Dynamics complexity Non Non Highly Highly Modular Modular Modular Modular

The observation for symmetric signaling with a critical configuration for each module in Figure 4(b), suggest that the networks with modular structure can produce multicellular patterns with higher information content without an increase in the network dynamics complexity. Another interesting observation was that for symmetric signaling with chaotic configuration (Figure 4(b)), both network dynamics and pattern complexity decrease. As shown in the previous study [3] symmetric signaling produces only simple information patterns and dynamics because information transfer among adjacent cells combined using disjunction, which loses directional information. Now when the configuration of each module is chaotic, structural modularity of network 11

Page 11 of 19

258 259 260 261 262 263 264 265 266 267 268 269 270

ip t

Ac ce p

271

cr

257

us

256

an

255

M

254

5.2. Functional Modularity In this section, we explore the effect of insertion of the best-known motifs illustrated in Figure 2(b) into randomly generated intracellular GRNs in the critical domain. For each motif, 600 random networks were generated and then each network was modified by the insertion of that regulatory motif. To obtain the data for analysis, each network was executed from random initial conditions and the dynamics and resulting pattern recorded. Following this step, the network data is grouped into sets of 10 and the set complexity computed for the networks, the dynamics, and the pattern. Figure 5 shows the influence of insertion of a positive feedback loop (Figure 2(b)) into a random GRN. The vertical axis of both graphs is the pattern complexity; on the left shows the relationship with dynamics complexity, on the right shows the relationship with network complexity. Each point on the graph corresponds to a set of 10 runs described above. Insertion of a double positive feedback loop into random GRNs clearly modifies the behavior of the network. The addition of the motif increases pattern complexity significantly while slightly reducing the complexity of the dynamics (Figure 5 left). Little effect on network complexity is observed (Figure 5 right).

d

253

helps to produce a more simplified pattern and dynamics.

te

252

Figure 5: Effect of insertion of a double positive feedback loop on network, network dynamics and pattern complexity for GRNs in the critical domain. 272

Figure 6 illustrates the average network, network dynamics and pattern 12

Page 12 of 19

280 281 282 283 284 285 286 287 288

ip t

cr

279

us

278

an

277

M

276

d

275

te

274

complexity for 600 network sets for each of the motifs. The influence of each motif is represented as an arrow. The start of the arrow is the set of random networks and is the same for all motifs. The arrowhead is the average complexity of the set of all networks with the indicated motif inserted. The insertion of a negative feedback loop, a double negative feedback loop or a double positive feedback loop all have the same qualitative effect of decreasing the complexity of network dynamics and increasing pattern complexity. However, a double positive loop increases pattern complexity much more than either of the negative feedback loops (Figure 6(a)). Insertion of feed-forward motifs decreases dynamics with almost no effect on the pattern complexity. Only the double negative motif leads to a reduction in pattern complexity. Statistical analysis confirms that insertion of these motifs makes a significant change in the average network dynamics and patterns complexity. ANOVA analysis presented in Table.3 and Table 4 shows that insertion of these motifs makes a significant change in the average network dynamics and patterns complexity.

Ac ce p

273

Figure 6: (a) Effects on network dynamics and pattern complexity of inserting regulatory motifs into random GRNs. Average dynamics and pattern complexity for 60 random sets of GRNs (arrow tails) and 60 sets of GRNs with the indicated inserted motifs (arrow heads). (b) Effects on network and pattern complexity of inserting regulatory motifs into random GRNs.

13

Page 13 of 19

Table 3: ANOVA analysis of dynamics complexity illustrates differences for various groups of motifs over networks with orthogonal and symmetric signaling

ip t

cr

us

MS 3419.93 266.27

F 12.84

P-value 1.1E-11

F crit 2.234

F 39.74

P-value 9.5E-34

F crit 2.234

an

df 5 444 449

Symmetric signaling Sum Average Variance 1560.25 20.80 15.96 1423.40 18.97 8.18 1828.02 24.37 7.64 1658.00 22.10 8.15 1466.11 19.54 4.17 1718.50 22.91 4.02

df 5 444 449

MS 318.94 8.02

290 291 292 293 294 295 296 297 298 299 300 301 302 303 304

6. Discussion

In this study, we explored how network dynamics and pattern complexity are affected by structural modularity in general and regulatory motifs in particular. The results show that networks with a modular structure, in general, tend to produce more complex multicellular patterns without a significant increase in gene expression dynamics. The results indicate that network motifs that contain feedback loops increase the information complexity of the multicellular patterns. Another important observation was negative feedback loops do not affect the dynamics complexity significantly as positive feedback loops do. We hypothesize that variation in dynamics complexity associated with adding different types of motifs to random GRNs originates from the time it takes for the GRN to reach a steady state and the proportion of single state versus cyclic attractors produced by the GRN. Since the motifs studied in this work act as multistable switches, they simplify the complex cyclic attractors to attractors with a few states. Gene expression in feedback loops

Ac ce p

289

te

d

Orthogonal signaling ANOVA Source of Variation SS Between Groups 17099.6 Within Groups 118226.6 Total 135326.3 Symmetric signaling ANOVA Source of Variation SS Between Groups 1594.73 Within Groups 3563.42 Total 5158.16

M

Groups Without Motifs Double-negative Double-positive Negative-Feedback Coupled P-P Type-1 feed Forward

Orthogonal signaling Sum Average Variance 3911.45 52.15 474.35 3017.06 40.22 284.05 2633.55 35.11 101.26 3322.99 44.30 181.91 3355.92 44.74 306.40 2560.45 34.13 249.67

14

Page 14 of 19

Table 4: ANOVA analysis of pattern complexity illustrates differences for various groups of motifs over networks with orthogonal and symmetric signaling.

ip t

cr

us

MS 473.35 19.06

F 24.82

P-value 4.4E-22

F crit 2.234

F 67.26

P-value 3.0E-52

F crit 2.234

an

df 5 444 449

Symmetric signaling Sum Average Variance 592.51 7.90 3.55 456.57 6.08 1.97 805.59 10.74 5.08 687.03 9.16 2.66 530.80 7.07 2.35 692.85 9.23 3.22

df 5 444 449

MS 211.53 3.14

306 307 308 309 310 311 312 313 314 315 316 317 318 319 320

reach a steady state quickly, and so reduce the length and complexity of cyclic attractors. We hypothesize that this is why in all cases dynamics complexity decreases from that of the original random networks. The rate of dynamics complexity reduction associated with the addition of the motifs with negative feedback loops is significantly lower than for positive loops. Unlike positive feedback loops, negative feedback loops do not increase the time to reach steady states [18]. Therefore, they don’t affect the dynamics complexity noticeably. As results show, negative feedback loop motifs (such as single negative feedback loop and coupled positive-negative loops) have the lowest reduction in their dynamics complexity. All the motifs with feedback loops impact the pattern complexity. In fact, the only motifs in this study that has no effect on pattern complexity are feed-forward loops. This observation confirms the association of feedback loop motifs with developmental networks that mediate important cell fate decision. The increase in information complexity of the multicellular patterns shows more certain cell-fate decisions have been taken. Only double nega-

Ac ce p

305

te

d

Orthogonal signaling ANOVA Source of Variation SS Between Groups 2366.77 Within Groups 8466.32 Total 10833.10 Symmetric signaling ANOVA Source of Variation SS Between Groups 1057.66 Within Groups 1396.30 Total 2453.97

M

Groups Without Motifs Double-negative Double-positive Negative-Feedback Coupled P-P Type-1 feed Forward

Orthogonal signaling Sum Average Variance 1233.65 16.44 25.15 1026.61 13.68 22.74 1559.75 20.79 13.10 1457.84 19.43 19.91 1372.30 18.29 24.27 1250.56 16.67 9.22

15

Page 15 of 19

332

Acknowledgements

325 326 327 328 329 330

cr

324

us

323

an

322

ip t

331

tive motifs decrease the complexity of the patterns formed. We hypothesize that this is due to the bistable behavior of these motifs, where they rapidly converge to one state or another producing few distinct non-cyclic attractors. Overall, we find that all motifs with feedback decrease the complexity of the dynamics. By decreasing the complexity of the dynamics, these motifs could increase the robustness in pattern formation in the presence of noise as shown in [8]. Feedback motifs tend to increase pattern complexity while decreasing dynamics complexity. This implies that these motifs increase the“efficiency” of pattern formation where simpler dynamics leads to more complex patterns. Such an increase in efficiency may be one of the principle reasons that feedback motifs are enriched within developmental networks.

321

338

References

339 340

341

342 343 344

345 346

347 348 349

d

336

te

335

[1] A. D. Lander, Morpheus unbound: reimagining the morphogen gradient., Cell 128 (2007) 245–256.

Ac ce p

334

M

337

Research reported in this publication was supported by the National Institute Of General Medical Sciences of the National Institutes of Health under Award Number P50GM076547, Thanks to Ilya Shmulevich for helpful discussions. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.

333

[2] A. D. Lander, Pattern, Growth, and Control, Cell 144 (2011) 955–969. [3] N. S. Flann, H. Mohamadlou, G. J. Podgorski, Kolmogorov complexity of epithelial pattern formation: The role of regulatory network configuration, Biosystems 112 (2013) 131–138. [4] E. H. Davidson, Emerging properties of animal gene regulatory networks, Nature 468 (2010) 911–920. [5] S. S. Shen-Orr, R. Milo, S. Mangan, U. Alon, Network motifs in the transcriptional regulation network of Escherichia coli, Nature Genetics 31 (2002) 64–68.

16

Page 16 of 19

357

358 359 360

361 362 363

364 365 366 367

368 369

370 371

372 373 374

375 376 377 378

379 380 381

ip t

cr

356

[8] A. Ghaffarizadeh, N. Flann, G. Podgorski, Multistable switches and their role in cellular differentiation networks, BMC Bioinformatics 15 (2014) S7+.

us

355

[9] A. Mazumdar, M. Mazumdar, How one becomes many: blastoderm cellularization in Drosophila melanogaster., BioEssays : news and reviews in molecular, cellular and developmental biology 24 (2002) 1012–1022.

an

354

[7] A.-L. Barabasi, Z. N. Oltvai, Network biology: understanding the cell’s functional organization, Nat Rev Genet 5 (2004) 101–113.

[10] R. Goodyear, G. Richardson, Pattern formation in the basilar papilla: evidence for cell rearrangement., The Journal of neuroscience : the official journal of the Society for Neuroscience 17 (1997) 6289–6301.

M

353

[11] G. J. Podgorski, M. Bansal, N. S. Flann, Regular mosaic pattern development: a study of the interplay between lateral inhibition, apoptosis and differential adhesion., Theoretical biology & medical modelling 4 (2007) 43+.

d

352

te

351

[6] R. Milo, S. Shen-Orr, S. Itzkovitz, N. Kashtan, D. Chklovskii, U. Alon, Network motifs: simple building blocks of complex networks., Science (New York, N.Y.) 298 (2002) 824–827.

[12] A. N. Kolmogorov, Three approaches to the quantitative definition of information., Problems in Information Transmission 1 (1965) 1–7.

Ac ce p

350

[13] S. A. Kauffman, Metabolic stability and epigenesis in randomly constructed genetic nets., Journal of theoretical biology 22 (1969) 437–467. [14] S. J. Eglen, D. J. Willshaw, Influence of cell fate mechanisms upon retinal mosaic formation: a modelling study., Development (Cambridge, England) 129 (2002) 5399–5408. [15] M. S. Kim, J. R. Kim, D. Kim, A. Lander, K. H. Cho, Spatiotemporal network motif reveals the biological traits of developmental gene regulatory networks in Drosophila melanogaster, BMC Systems Biology 6 (2012) 31+. [16] J. Clune, J.-B. Mouret, H. Lipson, The evolutionary origins of modularity, Proceedings of the Royal Society B: Biological Sciences 280 (2013) 20122863. 17

Page 17 of 19

389

390 391

392 393

394 395 396

397 398 399

400 401 402 403

404 405

406 407 408 409 410

411 412

ip t

cr

388

[19] S. Kalir, S. Mangan, U. Alon, A coherent feed-forward loop with a SUM input function prolongs flagella expression in Escherichia coli., Molecular systems biology 1 (2005) msb4100010–E1–msb4100010–E6.

us

387

[20] U. Alon, Network motifs: theory and experimental approaches, Nat Rev Genet 8 (2007) 450–461.

an

386

[18] J.-R. R. Kim, Y. Yoon, K.-H. H. Cho, Coupled feedback loops form dynamic motifs of cellular networks., Biophysical journal 94 (2008) 359– 365.

[21] H. Singh, A. A. Khan, A. R. Dinner, Gene regulatory networks in the immune system, Trends in Immunology 35 (2015) 211–218.

M

385

[22] M. Levine, E. H. Davidson, Gene regulatory networks for development, Proceedings of the National Academy of Sciences of the United States of America 102 (2005) 4936–4942.

d

384

[23] G. Swiers, R. Patient, M. Loose, Genetic regulatory networks programming hematopoietic stem cells and erythroid lineage specification, Developmental Biology 294 (2006) 525–540.

te

383

[17] D. M. Lorenz, A. Jeng, M. W. Deem, The Emergence of Modularity in Biological Systems, 2012.

[24] H.-W. Ma, B. Kumar, U. Ditges, F. Gunzer, J. Buer, A.-P. Zeng, An extended transcriptional regulatory network of Escherichia coli and analysis of its hierarchical structure and network motifs, Nucleic Acids Research 32 (2004) 6643–6649.

Ac ce p

382

[25] S. Kauffman, The Origins of Order: Self-Organization and Selection in Evolution, Oxford University Press,USA, ???? [26] M. Villani, R. Serra, P. Ingrami, S. A. Kauffman, Coupled Random Boolean Network Forming an Artificial Tissue, in: S. Yacoubi, B. Chopard, S. Bandini (Eds.), Cellular Automata, volume 4173 of Lecture Notes in Computer Science, Springer Berlin Heidelberg, 2006, pp. 548–556.

[27] J. F. Knabe, C. L. Nehaniv, M. J. Schilstra, Evolution and morphogenesis of differentiated multicellular organisms: autonomously generated 18

Page 18 of 19

420

421 422 423

424 425 426 427

428 429 430

431 432 433

434 435

ip t

cr

419

[29] D. J. Galas, M. Nykter, G. W. Carter, N. D. Price, I. Shmulevich, Biological information as set-based complexity, IEEE Transactions on Information Theory 56 (2010) 667–677.

us

418

[30] T. Ignac, N. Sakhanenko, D. Galas, Relations between the setcomplexity and the structure of graphs and their sub-graphs 2012 (2012) 1–10.

an

417

[28] B. Derrida, Y. Pomeau, Random networks of automata: A simple annealed approximation, EPL (Europhysics Letters) 1 (1986) 45–49.

[31] G. W. Carter, C. G. Rush, F. Uygun, N. A. Sakhanenko, D. J. Galas, T. Galitski, A systems-biology approach to modular genetic complexity, Chaos: An Interdisciplinary Journal of Nonlinear Science 20 (2010) 026102+.

M

416

[32] X. Chen, B. Francia, M. Li, B. McKinnon, A. Seker, Shared information and program plagiarism detection, Information Theory, IEEE Transactions on 50 (2004) 1545–1551.

d

415

te

414

diffusion gradients for positional information, in: Artificial Life XI: Proceedings of the Eleventh International Conference on the Simulation and Synthesis of Living Systems, MIT Press, 2008, pp. 321–328.

[33] E. Balleza, E. R. Alvarez-Buylla, A. Chaos, S. Kauffman, I. Shmulevich, M. Aldana, Critical Dynamics in Genetic Regulatory Networks: Examples from Four Kingdoms, PLoS ONE 3 (2008) e2456+.

Ac ce p

413

[34] M. E. J. Newman, Finding community structure in networks using the eigenvectors of matrices. Phys, in: Rev. E.

19

Page 19 of 19