The population genetics of Trypanosoma cruzi revisited in the light of the predominant clonal evolution model

The population genetics of Trypanosoma cruzi revisited in the light of the predominant clonal evolution model

Accepted Manuscript Title: The population genetics of Trypanosoma cruzi revisited in the light of the predominant clonal evolution model Author: Miche...

461KB Sizes 1 Downloads 62 Views

Accepted Manuscript Title: The population genetics of Trypanosoma cruzi revisited in the light of the predominant clonal evolution model Author: Michel Tibayrenc Francisco J. Ayala PII: DOI: Reference:

S0001-706X(15)00140-0 http://dx.doi.org/doi:10.1016/j.actatropica.2015.05.006 ACTROP 3618

To appear in:

Acta Tropica

Received date: Revised date: Accepted date:

9-3-2015 2-5-2015 6-5-2015

Please cite this article as: Tibayrenc, M., Ayala, F.J.,The population genetics of Trypanosoma cruzi revisited in the light of the predominant clonal evolution model, Acta Tropica (2015), http://dx.doi.org/10.1016/j.actatropica.2015.05.006 This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

1 1

The population genetics of Trypanosoma cruzi revisited in the light of the

2

predominant clonal evolution model.

3 4

Michel Tibayrenc1 and Francisco J. Ayala2

6

1

7

Montpellier Cedex 5, France

8

2

9

92697, USA

ip t

5

cr

MIVEGEC (IRD 224-CNRS 5290-UM1-UM2), IRD Center, BP 64501, 34394

us

Department Ecology and Evolutionary Biology, University of California, Irvine, CA

11

an

10

Corresponding author: Tibayrenc, M. ([email protected]).

M

12

Comparing the population structure of Trypanosoma cruzi with that of other

14

pathogens, including parasitic protozoa, fungi, bacteria and viruses, shows that the

15

agent of Chagas disease shares typical traits with many other species, related to a

16

predominant

17

disequilibrium,

18

subdivisions somewhat blurred by occasional genetic exchange/hybridization) and

19

―Russian doll‖ patterns (PCE is observed, not only at the level of the whole species,

20

but also, within the near-clades). Moreover, T. cruzi population structure exhibits

21

linkage with the diversity of several strongly selected genes, with gene expression

22

profiles, and with some major phenotypic traits. We discuss the evolutionary

23

significance of these results, and their implications in terms of applied research

24

(molecular epidemiology/strain typing, analysis of genes of interest, vaccine and drug

evolution

(PCE)

ce pt

clonal

ed

13

multilocus

statistically significant

genotypes,

near-clades

linkage (genetic

Ac

overrepresented

pattern:

Page 1 of 52

2 25

design, immunological diagnosis) and of experimental evolution. Lastly, we revisit the

26

long-term debate of describing new species within the T. cruzi taxon.

27 Key-words: predominant clonal evolution, near-clade, discrete typing unit,

29

recombinational load, drug resistance

ip t

28

30 * see glossary of specialized terms.

cr

31

Ac

ce pt

ed

M

an

us

32

Page 2 of 52

3 33 34

Although much progress has been made in vector control, Chagas disease

35

remains a major health problem in Latin America, and is now threatening industrial

36

countries, in particular Spain and the USA. With the hope of developing radical control means for Chagas disease, extensive

38

studies have been conducted on Trypanosoma cruzi population genetics, and have

39

made this parasite probably one of the micropathogens whose intraspecific diversity

40

is best known.

us

cr

ip t

37

A comparative population genetic analysis of T. cruzi and of many other

42

pathogens, including parasitic protozoa, fungi, bacteria and viruses, has broaden our

43

view of the agent of Chagas disease’ population structure and has made it possible

44

to reveal the evolutionary strategies used by this pathogen. Moreover, such a

45

comparative population genetics has brought relevant considerations about the

46

possibility to describe new species within the T. cruzi taxon.

ed

M

an

41

47

50 51

ce pt

49

The predominant clonal evolution (PCE) model of pathogens

The model will be recalled only briefly here, since it has been exposed at length in several recents articles (Tibayrenc & Ayala, 2012, 2013, 2014 a and b).

Ac

48

52

PCE is not defined by any precise mating system or cytological mechanism, but

53

only by strongly restrained genetic recombination* on an evolutionary scale. It

54

therefore deals with population structure of the whole species in the long run only.

55

This definition is accepted by most authors working on pathogen population genetics,

56

including parasitic protozoa, fungi and bacteria (Tibayrenc & Ayala, 2012) and is well

57

accepted in the T. cruzi literature (Barnabé et al., 2013; Flores-López & Machado,

Page 3 of 52

4 2011; Minning et al., 2011). According to our definition, PCE includes, not only mitotic

59

reproduction, but also selfing*/strong homogamy*, several cases of parthenogenesis*

60

(Tibayrenc & Ayala, 1991, 2002), and all cases of ―unisex‖ (Feretzaki & Heitman,

61

2013). PCE would group all means used by micropathogens to escape the

62

―recombinational load‖ (disrupting favorable multilocus genotypes by genetic

63

recombination) (Agrawal, 2006).

ip t

58

Equating selfing/strong homogamy and many cases of parthenogenesis to

65

clonality is not confined to microbiologists and extends to evolutionists working on

66

higher organisms too (Avise, 2008). Some authors, working on Leishmania

67

(Rougeron et al., 2009) or on T. cruzi (Llewellyn et al., 2009a) recommend to

68

distinguish selfing/homogamy from ―strict‖ clonality (= mitotic propagation). This is a

69

matter of definition. The problem deals with the means used for distinguishing

70

selfing/homogamy from ―strict‖ clonality.

M

an

us

cr

64

The main consequences of PCE are: (i) statistically significant linkage

72

disequilibrium, or nonrandom association of genotypes occuring at diffferent loci

73

(LD); (ii) overrepresented multilocus genotypes (MLGs), which can be lacking if the

74

mutation rate of the marker is high (all strains have different genotypes); (iii) the

75

presence of ―near-clades‖ (genetic subdivisions somewhat blurred by occasional

76

genetic exchange/hybridization); ―Russian doll‖ patterns (RDPs- PCE is observed,

77

not only at the level of the whole species, but also, within the near-clades) (Tibayrenc

78

& Ayala, 2013).

Ac

ce pt

ed

71

79

We have coined the term ―near-clade‖ (Tibayrenc & Ayala, 2012) because it can

80

be suspected that virtually all micropathogen species undergo some bouts of

81

recombination, including among different species. This makes the demands of a strict

82

cladistic* analysis impossible. We have recommended to evidence the presence of

Page 4 of 52

5 near-clades with the tools of population genetics analysis (LD), then by a flexible

84

phylogenetic approach relying on the ―congruence principle‖ (Avise, 2004). The use

85

of more reliable information supports more and more the working hypothesis (here:

86

the presence of near-clades). For example, in the case of MLST*, the phylogenies of

87

individual genes may exhibit some discrepancies. However, the use of the whole set

88

of genes evidences a strong phylogenetic signal (Lauther et al., 2012). Or different

89

clustering/phylogenetic approaches, relying on different working hypotheses, give

90

convergent results, which confirm the robustness of the branchings (fig.1). Or the

91

phylogenies designed from different molecular markers corroborate each other

92

(Brisse, Barnabé & Tibayrenc, 2000; Tibayrenc et al., 1993). Identification of PCE

93

relies on the concept of a ―clonality threshold‖, beyond which the impact of clonal

94

evolution overcomes that of recombination and leads to the inescapable evolutionary

95

divergence of the near-clades. This concept of a ―PCE border‖ makes it possible to

96

replace vague and subjectives assertions (―substantial‖ recombination; de Paula

97

Baptista et al., 2014; "Gross incongruences", "multiple introgressions‖: Messenger et

98

al., 2012) with a clear-cut criterion. The alternative hypothesis to PCE is that

99

recombination is abundant enough to erase the effects of clonal evolution in the long

100

run. This corresponds to the ―epidemic clonality‖ model (Maynard Smith et al., 1993)

101

or ―semiclonal model‖ (Maiden, 2006)).

103 104

cr

us

an

M

ed

ce pt

Ac

102

ip t

83

Some important points of the model

105 106

- As recalled many times, the PCE model does not state that recombination is

107

absent, but rather, that it is not frequent enough to break the prevalent PCE pattern.

Page 5 of 52

6 108

The model considers that recombination/hybridization could play a major role in

109

pathogens’ evolution, but only on an evolutionary scale. It does not state by any

110

means that recombination is ―inconsequential‖ (Ramírez & Llewellyn, 2014) or ―of

111

little consequence‖ (Miles et al., 2009). - In a first step, PCE should be explored at the level of the entire species. The

113

species concept, as we will see further, is a matter of debate in the case of

114

micropathogens. However, the unit of analysis when PCE is explored should be the

115

presently described species. It has been argued that ―it makes little sense to address

116

each parasite species as a whole‖, because these ―genetic subdivisions (i.e.: the

117

near-clades) act as reproductive barriers‖ (Ramírez & Llewellyn, 2014). However,

118

only the PCE approach is able to evidence the existence of reproductive barriers

119

among the near-clades. Claiming that such reproductive barriers are self-evident

120

(Ramírez & Llewellyn, 2014) is quite untrue. Evidencing them is the very goal of the

121

PCE approach.

122

- To explore PCE, sampling should be twofold: (i) in a classical population genetics

123

strategy, it should be conducted in close sympatry* on very limited spans of time, in

124

order to ascertain that the organism under study has actual opportunity for mating

125

(de Meeûs et al., 2007). However, it is extremely difficult to ascertain a strict

126

sympatry when micropathogens are concerned. Even sampling at the level of

127

individual hosts is not a guarantee for it. If different genotypes of the species

128

concerned do not have the same tropism for different tissues of the host, they could

129

have little opportunity for mating. (ii) Since the PCE pattern is definitely a ―bird’s eye

130

view‖ of the overall genetic variability of the whole species in the long term, we

131

recommend a global sampling of the entire species on its complete ecogeographical

132

range, including all known hosts, and relying on retrospective studies to evaluate the

Ac

ce pt

ed

M

an

us

cr

ip t

112

Page 6 of 52

7 stability of the PCE pattern. Retrospective studies can concern the analysis of

134

ancient culture collections, or of ancient publications, or both. Such a sampling

135

strategy makes it possible to ascertain that ubiquitous MLGs, near-clades and RDPs

136

are stable in space and time, are specific properties of the species under study, and

137

cannot be explained by a Wahlund effect* (Rougeron et al., 2014). If inhibition of

138

recombination were due to trivial physical obstacles (Wahlund), overrepresented

139

MLGs, near-clades and RDPs would be strongly linked to space and/or time.

cr

ip t

133

- The PCE model predicts that different clonal genotypes, due to inhibition of

141

recombination among them, will tend to accumulate divergent mutations, which could

142

concern genes governing relevant phenotypes (virulence, resistance to drugs). This

143

should be especially true when the evolutionary divergence among clones is high.

144

However, the model does not predict that all phenotypes should be strictly linked to

145

the clonal population structure. This should be only a general tendency, and could be

146

not verified at microevolutionary levels, and when strongly selected phenotypes (drug

147

resistance) are concerned. Such selected phenotypes are bound to emerge

148

independently in distinct lineages. This is not an evidence for recombination in itself

149

(Rougeron, de Meeûs & Bañuls, 2015), although it can be consistent with

150

recombination events. Chloroquine resistance is observed in different Plasmodium

151

species. This is not the result of genetic exchange between these different species,

152

but only of a common selective pressure.

Ac

ce pt

ed

M

an

us

140

153

- The clonet concept: Identical MLG is a relative notion, and is highly dependent

154

on the resolution power/mutation rate/molecular clock* of the marker employed. A set

155

of strains that appears as identical for a given marker could reveal genetic

156

heterogeneity when a more discriminative marker is used. If PCE obtains, this set

157

should be considered as a family of closely related clones, rather than a true clone.

Page 7 of 52

8 The term ―clonet‖ has been coined by us to refer to those sets of strains that appear

159

to be identical with a given set of genetic markers in a species that undergoes PCE

160

(Tibayrenc, Kjellberg & Ayala, 1991). A typical example of clonet is the zymodeme*

161

MON1 of Leishmania infantum. This MLEE* MLG is widespread in the ancient and

162

new world. The use of a more discriminating marker (microsatellites) shows that

163

MON1 is genetically heterogeneous (Amro et al., 2009; Chargui et al., 2009). One

164

important point with the clonet concept is that the population genetic tests and

165

phylogenetic analyses made on the basis of a given marker remain entirely valid

166

when more discriminating markers evidence additional genetic variability by

167

comparison with this marker. A given marker addresses a given level of resolution

168

and evolutionary divergence and is better adapted for given studies.

an

us

cr

ip t

158

170

PCE and Trypanosoma cruzi:

ed

171

M

169

T. cruzi is a paradigmatic case of PCE. It is actually the parasitic model used by

173

us for long to design the PCE approach (Tibayrenc et al., 1981, 1986), and to extend

174

it later to various other pathogens (Tibayrenc, Kjellberg & Ayala, 1990; Tibayrenc et

175

al., 1991, 2012).

ce pt

172

Apart from our own studies, many data are available in the literature, back to the

177

70s (the pioneering studies by Miles et al., 1977, 1978, 1981), relying on various

178

markers, which makes comparative and retrospective studies easy (Zingales et al.,

179

2012). Retrospective studies (ancient strain collections and publications) are highly

180

relevant to evaluate the persistence of T. cruzi PCE features (widespread MLGs,

181

near-clades, RDPs). Many of the ancient studies dealing with the population genetics

182

and strain typing of T. cruzi and various other micropathogens rely on MLEE. It is

Ac

176

Page 8 of 52

9 remarkable that MLEE results as a rule have been fully confirmed by modern DNA

184

markers. So ancient MLEE data, when properly interpreted, give a reliable basis for

185

retrospective studies. Lastly, the analysis of paleo DNA (Araujo, Reinhard & Ferreira,

186

2008) constitutes a promising approach to evaluate the antiquity of T. cruzi

187

population structure.

ip t

183

188

cr

190

The PCE indices rely on many raw data sets that can be easily verified by anybody. They are as follows:

us

189

191

- LD: We do not consider LD and PCE as ―synonymous‖ (Rougeron, de Meeûs &

193

Bañuls, 2015). However, we and many authors working on micropathogen population

194

genetics (Tibayrenc & Ayala, 2012) consider LD as a valid circumstantial evidence

195

for clonal evolution, as long as the bias due to Wahlund effect is excluded. LD in T.

196

cruzi is observed: (i) among MLEE loci (Tibayrenc et al., 1981, 1986); (ii) among

197

different markers (―test g‖ of LD; Tibayrenc, Kjellberg & Ayala, 1990): between MLEE

198

and RAPD* (Tibayrenc et al., 1993; Brisse, Barnabé & Tibayrenc, 2000); between

199

microsatellites and ribosomal DNA restriction fragment length polymorphism (Oliveira

200

et al., 1998); between MLST*, MLEE and RAPD (Lauthier et al., 2012); between

201

microsatellites and the sequence of the Gpi gene (Lewis et al., 2011). LD in T. cruzi

202

is not limited to classical markers, and is observed between genetic markers and

203

copy number variants (Minning et al., 2011); microsatellites and DNA content (Lewis

204

et al., 2009); between MLEE and RAPD on one hand, 12 antigen genes (which are

205

under strong selection) on the other hand (Rozas et al., 2007); between classical

206

markers ad the cruzipain antigen gene (which is under strong selection too) (Lima et

207

al., 2012). Lastly, a statistical association is observed between the variability

Ac

ce pt

ed

M

an

192

Page 9 of 52

10 208

revealed by molecular markers and the polymorphism of expressed genes (Telleria

209

et al., 2004, 2010, 2013).

210 - Overrepresented, widespread genotypes: we have called them: ―major clones‖

212

(Tibayrenc & Brenière, 1988). This is the case for example of the MLEE MLGs n° 19,

213

20, 32 and 39 in Tibayrenc et al. (1986), which have been observed in diverse

214

countries, years apart. For example; MLG 19 has been sampled in Brazil, Venezuela,

215

Colombia, Bolivia, in Chagasic patients, as well as in various mammalian hosts and

216

triatomine bug species, from 1977 to 1983. With more discriminating MLEE typing,

217

ubiquitous MLGs still are observed. One MLG repeated 31 times has been observed

218

in four different countries (Barnabé, Brisse & Tibayrenc, 2000). The remarks made

219

about the clonet concept (see above) should be recalled here. MLEE reveals

220

ubiquitous MLGs. However, when more discriminating markers (microsatellites) are

221

used, each strain may have a different genotype, due to the fast mutation rate of this

222

marker (Oliveira et al., 1998).

cr

us

an

M

ed

ce pt

223

ip t

211

- Near-clades. The near-clade concept (Tibayrenc & Ayala, 2012) gives a clear

225

evolutionary definition to the descriptive term DTU* (discrete typing unit) coined by us

226

(Tibayrenc, 1998). On the basis of both MLEE and RAPD typing, it has been

227

proposed that T. cruzi is subdivided into 6 discrete genetic lineages, or DTUs

228

(Barnabé, Brisse and Tibayrenc, 2000; Brisse, Barnabé & Tibayrenc, 2000). T. cruzi

229

DTUs perfectly fit the definition of near-clades: well-individualized genetic lineages

230

that are somewhat blurred up by occasional genetic exchange and/or hybridization.

231

Hybridization events have been described since long in T. cruzi (Machado & Ayala,

232

2001; Brisse et al., 2003), although their precise evolutionary history still is a matter

Ac

224

Page 10 of 52

11 of debate (de Freitas et al., 2006; Ferreira & Briones, 2012; Subileau et al., 2009;

234

Westenberger et al., 2005, 2006). The classification into 6 DTU/near-clades has

235

been recently validated by a panel of international experts (Zingales et al., 2012). To

236

a certain extent, T. cruzi near-clades correspond to the ―principal zymodemes‖ early

237

described by Miles et al. (1977, 1978). This illustrates their permanency in time. T.

238

cruzi near-clades have been corroborated by: (i) MLEE, RAPD (Brisse, Barnabé &

239

Tibayrenc, 2000); (ii) MLST (Lauthier et al., 2012; see fig. 1); (iii) fluorescent

240

fragment length barcoding (Hamilton et al., 2011); (iv) microsatellites, 3 mitochondrial

241

genes and the 24Sα rRNA gene (de Freitas et al., 2006); (v) the SSU rDNA,

242

Cytochrome B, Histone 2B genes, ITS1rDNA genotyping, RAPD (5 primers) and

243

PFGE* (Marcili et al., 2009); (vi) the sequences of 32 genes (Flores-López &

244

Machado, 2011). All T. cruzi near-clades are stable in time and widespread, although

245

their geographical distribution is not the same (Barnabé, Brisse & Tibayrenc, 2000;

246

Zingales et al., 2012). Although the precise history of hybridization still is questioned

247

(see above), it is considered that near-clades V and VI have a hybrid origin.

ed

M

an

us

cr

ip t

233

- An additional near-clade, named ―TC-Bat‖ (for it has been isolated from bats only

249

until now) has been recently described. It has been recorded in Brazil and Panama

250

years apart, in different species of bats (Marcili et al. 2009; Pinto et al. 2012). This is

251

a striking illustration of the stability in space and time of the near-clades.

Ac

252

ce pt

248

253

- Russian doll patterns (RDPs). It is frequently inferred that genetic exchange is

254

inhibited among the genetic clusters (= near-clades) that subdivide a species, but not

255

within each of them (Campbell et al., 2005). When exploring this hypothesis, it is

256

important to be conscious that a lower evolutionary scale is under stidy. The

257

molecular markers used at the level of the whole species may lack resolution at such

Page 11 of 52

12 lesser levels. This, together with the fact that the sample size is generally much

259

lowered when considering the near-clades separately, increases by far the risk of

260

statistical type II error (impossibility to reject the null hypothesis of random genetic

261

exchange, not because it is verified, but because the test lacks power). This bias

262

being discarded, contrary to what has been recently stated (Ramírez & Llewellyn,

263

2014), RDP evidence is abundant in T. cruzi, as well as in other parasites

264

(Leishmania, Giardia). The near-clade ―TC I‖ (Zingalez et al., 2012) has never been

265

considered ―homogeneous‖ (Guhl & Ramírez, 2011). On the contrary, MLEE as well

266

as RAPD studies have always shown that it exhibits a high variability (Tibayrenc et

267

al., 1993; Barnabé, Brisse & Tibayrenc, 2000; Brisse, Barnabé & Tibayrenc, 2000).

268

However, until recently, it had been not possible to evidence LD and a clear

269

structuring within it (RDP). This is now done, thanks to the availability of more

270

discriminating markers and ampler samplings. Geographical distance has an obvious

271

role in the distribution of TCI genetic structuring (Llewellyn et al., 2009b). However, it

272

cannot account for the data cited thereafter. Within TCI selvatic strains, strong LD

273

evidences ―widespread clonality, infrequent recombination‖ (Llewellyn et al., 2011).

274

Contrary to what Ramírez et al. (2013b) state, within Colombian TCI strains, there is

275

LD too. As a matter of fact, the p values for the LD test and the Ia index of

276

association (another LD test) are 4 X 10

277

2013). As noted by Tomasini et al. (2014), this is evidence for LD, and not for the

278

opposite, as wrongly concluded by Ramírez et al. (2013). Moreover, absence of LD

279

would be inconsistent with the presence of 2 genetic subdivisions, clearly

280

corroborated by bootstrap analysis, within this sample. This last result is evidence for

281

RDP. As a matter of fact, not only TCI does exhibit LD, but it is also subdivided into

282

lesser near-clades that are widespread throughout different countries and exhibit

Ac

ce pt

ed

M

an

us

cr

ip t

258

-4

and 0.037, respectively (Ramírez et al.,

Page 12 of 52

13 host specificities (fig. 2; Guhl & Ramírez, 2011; Herrera et al., 2007). TCI lesser near-

284

clades have been corroborated by various markers, including the Cytochrome b gene

285

sequence (a mitochondrial gene) and the mini-exon gene (Ramírez et al., 2011),

286

PCR-RFLP of 5 coding regions (Ramírez et al., 2012), mini-exon and MLST

287

(Tomasini et al., 2011, 2014).

ip t

283

Population genetics within other T. cruzi near-clades has been less explored.

289

However, in the near-clade TCIII, a strongly significant (p<0.001) LD (measured

290

using the Index of Association (IA)) was detected in all populations except one,

291

where only marginal significance was observed (p=0.032) (Llewellyn et al., 2009a).

292

This supports the hypothesis that TCIII exhibits also a RDP.

an

us

cr

288

- Indices of frequent recombination within near-clades?: A lack of congruence

294

between mitochondrial and nuclear phylogenies (Ramírez et al., 2012) indicates at

295

best limited introgression, as it can be observed even between different biological

296

species. The null hypothesis of the RDP model is not occasional bouts of genetic

297

exchange, which is consistent with PCE. It is random or quasi-random

298

recombination, as it can be observed in human populations for example. In humans,

299

the only obstacles to genetic exchange are geographical or cultural ones (quasi-

300

random recombination). TCI populations isolated from selvatic triatomine bugs in

301

Bolivia show data that are compatible with such frequent genetic exchange.

302

However, according to the authors themselves, these data are prone to type II error

303

and should be corroborated by stronger sampling (Barnabé et al., 2013). A survey in

304

Ecuador (Ocaña et al., 2010) is consistent with recombination in a small TCI

305

subpopulation. However, as we have shown (Tibayrenc & Ayala, 2013), (i) the

306

―domestic‖ subpopulation is poorly defined (33% of selvatic strains are included in it,

307

while the ―selvatic‖ clade comprises 12% of domestic strains); (ii) the risk of type II

Ac

ce pt

ed

M

293

Page 13 of 52

14 error is high, due to very limited sampling (only 12 truly ―domestic‖ strains).

309

Additionally, the ―selvatic‖ subpopulation exhibits strong LD and clonality, which fits a

310

RDP. De Paula Baptista et al. (2014) have proposed that Brazilian strains of the TCII

311

near-clade undergo substantial recombination. However, this study is based also on

312

limited sampling and is subject to type II error. Moreover, there is a strong

313

contradiction between the hypothesis of substantial recombination (with lack of LD)

314

and the evidence for clear structuration of these populations, even in sympatry, as

315

shown by STRUCTURE analysis. In conclusion: it is quite possible that some

316

populations within T. cruzi near-clades undergo frequent recombination. However,

317

the limited evidence presently available obviously has to be completed by far more

318

extensive studies. On the contrary, the evidence above exposed for RDPs is robust.

an

us

cr

ip t

308

M

319

From the results above exposed, it is hardly questionable that T. cruzi at the level

321

of the whole species undergoes PCE. Moreover, when enough data are available, it

322

is apparent that PCE operates also within the near-clades (RDP). Genetic

323

recombination/hybridization in the agent of Chagas disease has probably an

324

important impact at an evolutionary scale. However, it is unable to counter the effects

325

of clonal evolution in the long run (―clonality threshold‖) and to prevent the

326

evolutionary divergence of the near-clades. When looking for recombination in a

327

given species amounts to ―searching for a needle in a haystack‖ (Ramírez &

328

Llewellyn, 2014), it does show that recombination is strongly restrained, in other

329

words, that the PCE model is corroborated. When it is not, like in the highly

330

recombining bacterium Helicobacter pylori, one finds easily clear indications for

331

recombination without searching a haystack.

Ac

ce pt

ed

320

Page 14 of 52

15 In the framework of PCE, it is quite possible that local populations of genetically

333

related strains undergo more genetic exchange than clonal propagation (Barnabé et

334

al., 2013; Ocaña et al., 2010; de Paula Baptista et al., 2014). However, this has to be

335

better explored on more extensive samples. It is possible that in T. cruzi,

336

recombination is less restrained among identical genotypes (= selfing) or similar

337

genotypes (tending to selfing) than among more distantly related genotypes. This

338

happens in bacteria, where recombination is more frequent among ―sister cells‖ than

339

among cells that have radically different genotypes (Caugant & Maiden, 2009). This

340

leads to ―invisible sex‖ (Balloux, 2010) and is explained by the fact that the higher the

341

genetic divergence, the more the restriction systems are different (Caugant &

342

Maiden, 2009). Moreover, local introgressions of mitochondrial genomes are quite

343

possible in T. cruzi (Messenger et al., 2012; Ramírez et al., 2012). All this does not

344

challenge the PCE model. As exposed earlier, the alternative hypothesis to PCE is

345

that recombination is abundant enough to overcome the effects of clonal evolution in

346

the long run.

ce pt

347

ed

M

an

us

cr

ip t

332

Lastly, restrained recombination in T. cruzi cannot be attributed to a lack of

349

opportunity for mating. As a matter of fact, multiclonal infections in the same host,

350

either vectors (Tibayrenc et al., 1985) or chagasic patients (Brenière et al., 1985) are

351

very frequent.

352

Ac

348

353 354

Aneuploidy and the selfing/homogamy debate

355

Page 15 of 52

16 We recall here that selfing/homogamy is included in our PCE concept, a view

357

shared by many, if not most, specialists of pathogen population genetics (Tibayrenc

358

& Ayala, 2012) as well as by evolutionist working on metazoa undergoing uniparental

359

reproduction (Avise, 2008). However, other scientists recommend distinguishing

360

selfing/homogamy from ―strict‖ (= mitotic) clonality (Rougeron et al., 2009; Rougeron,

361

de Meeûs & Bañuls, 2015). When the macroevolution scale recommended by the

362

PCE approach is considered, the distinction in terms of population structure has

363

limited relevance, since both ―strict‖ clonality and selfing/homogamy lead to

364

restrained recombination and LD. Now selfing as a mode of restrained recombination

365

has the evolutionary advantage of permitting DNA repair, which is not the case of a

366

strictly mitotic mode of propagation. If one wants to distinguish selfing from ―strict‖

367

clonality, the problem lies on the way to do it. When bacteria are considered, since

368

they are haploid, selfing, which is certainly very frequent in these pathogens (Maiden

369

2008; Caugant & Maiden 2009), is impossible to distinguish from mitotic propagation

370

(―invisible sex‖; Balloux, 2010). When protozoa are concerned, the evidence

371

proposed for selfing relies on heterozygote deficit (Rougeron et al., 2009). However,

372

apart from selfing, many phenomena could cause heterozygote deficit, such as null

373

alleles, allelic drop-out, homoplasy (a strong concern in the case of microsatellites)

374

(Alam et al., 2009; Lehmann et al., 2004, Pearson et al., 2009), and genome-wide

375

mitotic gene conversion (Llewellyn et al., 2009a, 211). Experiments in the strictly

376

asexual species Daphnia pulex have shown that a loss of heterozygosity by mitotic

377

recombination is 1,000 times more frequent than an accumulation of divergent

378

mutations (Omilian et al. 2006). The probability of the possible causes of

379

heterozygote deficit should not be evaluated separately (Rougeron et al., 2009),

380

since they are not mutually exclusive. Moreover, heterozygote deficit inferred from

Ac

ce pt

ed

M

an

us

cr

ip t

356

Page 16 of 52

17 microsatellite analysis clashes with SNP* data for Leishmania mexicana and L.

382

braziliensis (Rogers et al., 2011) and T. cruzi (Yeo et al., 2011), where an excess of

383

heterozygotes is recorded. However, selfing should cause a heterozygote deficit for

384

all loci, including the ones involved in SNPs. The main problem with the way to

385

explore heterozygote deficit is that all tests proposed for it (de Meeûs et al., 2007)

386

are based on the hypothesis that the organism under survey is diploid*. Now strong

387

convergent results suggest that Leishmania undergoes widespread aneuploidy

388

(Downing et al. 2011; Lachaud et al. 2014; Mannaert et al. 2012; Rogers et al. 2011;

389

Sterkers et al. 2011, 2012, 2014), which renders the tests invalid, and at the same

390

time, is an explanation for apparent heterozygote deficit. As a matter of fact,

391

widespread aneuploidy should cause a rapid elimination of heterozygosity by

392

frequent passage through a haploid state (Sterkers et al. 2012). This should happen

393

even if aneuploidy is transitory in Leishmania, as proposed by Rougeron, de Meeûs

394

& Bañuls (2015). The evidence for aneuploidy in T. cruzi still is more limited than for

395

Leishmania, although progress is rapid (Minning et al. 2011; Souza et al. 2011).

396

When population genetic data are concerned, contrary to what has been claimed

397

(Ramírez et al., 2013a), in a survey dealing with more than 200 cloned stocks from

398

the same host, in mammals from Venezuela and Bolivia, ―multiple aneuploid clones

399

were isolated across those infrapopulations studied‖ (Llewellyn et al., 2011). It has

400

been proposed that microsatellite data do not suggest aneuploidy in Leishmania

401

(Rougeron, de Meeûs & Bañuls, 2015). However, Lachaud et al. (2014) disagree with

402

this statement. The conclusion of all this is that selfing in Leishmania and T. cruzi is a

403

reasonable hypothesis. However, the tests proposed to explore it may not be reliable.

404

As stated by Ramírez et al. (2012): ―In general, we urge caution in the interpretation

405

of heterozygosity statistics at STR loci in the context of parasite sexuality, especially

Ac

ce pt

ed

M

an

us

cr

ip t

381

Page 17 of 52

18 406

given that strong evidence for linkage disequilibrium often accompanies both

407

negative and positive values for FIS‖. Such difficulties in evidencing selfing in

408

parasitic protozoa gives additional ground to our proposal of including it in the PCE

409

model. It should be noted that LD tests do not suffer from this problem. As a matter of

411

fact, they can be practiced whatever be the ploidy of the organism under study, and

412

even, when this ploidy level is not known (Tibayrenc, Kjellberg & Ayala, 1990).

cr

ip t

410

414

us

413

Evolutionary features that could mimic recombination

an

415

Recombination can be confounded with several evolutionary features. They

417

include: (i) homoplasy; (ii) lack of phylogenetic signal; (iii) natural selection; (iv)

418

different molecular clocks; (v) long branch attraction; (vi) different modes of

419

inheritance.

ed

M

416

Homoplasy is a concern for many, if not most, molecular markers. Microsatellites

421

are considered as especially prone to it (Alam et al., 2009; Lehmann et al., 2004,

422

Pearson et al., 2009). This could lead to overestimate both recombination and

423

heterozygote deficit.

ce pt

420

Lack of phylogenetic signal: if the population is close to genetic monomorphism,

425

or if the molecular marker has too low a resolution power, it is impossible to evidence

426

LD and the presence of near-clades. This should not be taken as evidence for

427

recombination. A lack of phylogenetic signal could be due also to a saturation of the

428

molecular marker. As an example, microsatellites are not adapted to survey higher

429

levels of evolutionary divergence, since their molecular clock is too fast. MLSTs and

430

MLEE could reveal a strong phylogenetic signal where microsatellites cannot.

Ac

424

Page 18 of 52

19 Natural selection is difficult to distinguish from recombination (Didelot & Maiden,

432

2010). Strongly selected characters, such as drug resistance, should be especially

433

prone to this bias. Resistance to the same antibiotics can be present in different

434

bacterial lines, and even in different species. This can be the result of genetic

435

exchange, but it is also easily explained by an identical selective pressure. Such

436

convergent selected characters should be therefore considered as evidence for

437

recombination only carefully (Rougeron, de Meeûs & Bañuls, 2015).

cr

ip t

431

Differences in molecular clocks could lead to discrepancies between trees

439

designed from different genes. In Giardia duodenalis, Wielinga & Thompson (2007)

440

have attributed the discrepancies observed among trees built from different genes to

441

both homoplasy and molecular clock differences, not to recombination. Similarly, in

442

the case of the SARS coronavirus, phylogenetic incongruence among trees has been

443

attributed to different molecular clocks (Moya et al., 2004).

M

an

us

438

Long branch attraction: analyzing the phylogeny of the SARS coronavirus, Yip et

445

al. (2009) have recommended to not confound recombination with both homoplasy

446

and long branch attraction.

ce pt

ed

444

Markers that have a different mode of inheritance will exhibit different evolutionary

448

patterns. For example, in humans, the population structures inferred from autosomal

449

genes, mitochondrial genes and Y chromosome genes, respectively, are quite

450

different (Zoossmann-Diskin, 2010). In T. cruzi, experimental mating has shown that

451

nuclear genes are inherited from both parental cells, while the kinetoplast is inherited

452

from only one parental cell (Gaunt et al., 2003). In the framework of the PCE model,

453

in case of occasional bouts of genetic exchange, this phenomenon is bound to

454

maximize the discrepancies between mitochondrial and nuclear phylogenies.

Ac

447

Page 19 of 52

20 Lastly, these different phenomena are not exclusive of each other and could

456

combine their effects. In the case of T. cruzi, microsatellites are subject to

457

homoplasy. They have a fast molecular clock, they are explored at the level of the

458

nucleus, whose inheritance is biparental, and they are considered as selectively

459

neutral. Mitochondrial maxicircle genes correspond to coding sequences, which are

460

subject to natural selection. They have a slower molecular clock than microsatellites,

461

and their mode of inheritance is uniparental. So, although the discrepancies between

462

microsatellite and mitochondrial gene phylogenies are consistent with recombination

463

(Ramírez et al., 2012), alternative explanations are possible.

us

cr

ip t

455

an

464 465

A brief comparison with other micropathogens

468

We have performed a compared population genetic approach dealing with viruses

469

(22 species and categories), bacteria (43 species), parasitic protozoa (30 species)

470

and fungi (13 species) (Tibayrenc & Ayala, 2012 and in preparation). This shows that

471

T. cruzi population structure is not unique and, on the contrary, is very similar to that

472

of many major pathogen species. Each species exhibits specific traits. However, the

473

common denominator among many of them is strong, and suggests the existence of

474

common evolutionary strategies, which could be either ancestral or the result of

475

convergence.

Ac

ce pt

ed

467

M

466

476

Well-established T. cruzi ―siblings‖ in terms of population structure are the species

477

for which the PCE features (LD, widespread MLGs, near-clades and RDPs) have

478

been recorded. It is assumed that these species have crossed the ―clonality

479

threshold‖ beyond which the impact of clonal evolution will overcome the effects of

Page 20 of 52

21 recombination. Such species include: (i) parasitic protozoa: the Leishmania infantum

481

complex (Tibayrenc & Ayala, 2013), Giardia duodenalis, (Tibayrenc & Ayala, 2014b),

482

Toxoplasma gondii (Tibayrenc & Ayala, 2014a); (ii) fungi: Candida albicans, the

483

Cryptococcus neoformans complex; (iii) bacteria: Escherichia coli, Mycobacterium

484

tuberculosis, Neisseria meningitidis, Staphylococcus aureus. Many other species

485

exhibit clonality traits, but the PCE pattern is not complete, either because these

486

species undergo more recombination, or because the studies are less advanced than

487

for the species above listed, or both. When parasitic protozoa are concerned, such

488

examples include: Leishmania braziliensis/peruviana, the Trypanosoma brucei

489

complex, T. congolense, Cryptosporidium spp.

an

us

cr

ip t

480

Plasmodium falciparum and P. vivax constitute distinctive cases. Since P.

491

falciparum undergoes meiosis in the vector, it has been long assumed that the agent

492

of the most malignant form of malaria was a panmictic* organism. Our cautious

493

proposal that ―uniparental and biparental lineages could coexist within this species‖,

494

based on the observation of departures from panmixia in some populations

495

(Tibayrenc, Kjellberg & Ayala, 1990), had been rejected but it was subsequently

496

confirmed (Anderson et al., 2000). Many P. falciparum and P. vivax populations

497

exhibit clonality traits. Selfing is probably at the origin of them. However, the

498

hypothesis that selfing is strictly correlated to the level of transmission is at odds with

499

many data (Tibayrenc & Ayala, 2014a). This is epidemiologically relevant, since LD

500

and the degree of clonality in P. falciparum cannot be taken as an indication for the

501

transmission rate (Tibayrenc & Ayala, 2014a), as it had been proposed (Volkman et

502

al., 2012). Now P. falciparum and P. vivax definitely do not fit the PCE model.

503

Clonality traits (LD, widespread MLGs and near-clades) that are observed in these

504

species are labile and change over time, due to frequent genetic recombination. Still

Ac

ce pt

ed

M

490

Page 21 of 52

22 505

the fact remains that clonal propagation introduces a major stratification effect in P.

506

falciparum and P. vivax natural populations, that should be taken into account in

507

studies dealing with malaria epidemiology, drug resistance and the analysis of genes

508

of interest. When considering the various species that fit the PCE model, two facts should be

510

underlined: (i) for several of them, PCE was not expected. Neisseria meningitidis was

511

considered a ―highly recombining‖ bacterium (Maiden, 2008). However, widespread

512

MLGs and near-clades in this species are extremely stable, as verified by

513

retrospective studies. Moreover, whole genome sequencing has revealed the

514

existence of deep phylogenies (Budroni et al., 2011), which is incompatible with the

515

―epidemic clonality model‖ (occasional bouts of clonal propagation in an otherwise

516

recombining species; Maynard Smith et al., 1993). Toxoplasma gondii constitutes

517

another unexpected case of PCE, since this parasite undergoes meiosis in the

518

mammalian host. We have proposed that T. gondii undergoes frequent, and possibly

519

predominant clonal evolution (Tibayrenc et al., 1991). This has been fully confirmed

520

by further studies (Sibley & Boothroyd, 1992). High resolution typing with many

521

different markers (Su et al., 2012) has shown the existence of deep phylogenies, with

522

6 major ―clades‖ (= near-clades), which definitely includes T. gondii in the PCE model

523

(Tibayrenc & Ayala, 2014a). (ii) the species listed pertain to phylogenetically highly

524

diverse categories (bacteria, fungi, and parasitic protozoa), and exhibit various host

525

ranges, modes of transmission (air-, vector-, food- and water-borne) and tissue

526

tropisms. This strongly suggests that PCE represents a common evolutionary

527

strategy of many pathogens, probably linked to parasitism.

Ac

ce pt

ed

M

an

us

cr

ip t

509

528 529

Experimental mating, meiosis genes and PCE

Page 22 of 52

23 530 The seminal work by Gaunt et al. (2003) has demonstrated that the potential for

532

recombination is present in T. cruzi. However, these experiments dealt only with

533

related strains pertaining to the near-clade TCI. It remains to be established whether

534

mating can also be obtained between strains pertaining to different near-clades.

535

More important: these experiments provide no indication about the frequency of

536

genetic exchange in natural populations, which matters when population structure is

537

concerned. Successful mating experiments cannot therefore be used to challenge

538

the PCE model.

us

cr

ip t

531

Genes homologous to meiosis genes are observed in many micropathogen

540

species, including T. cruzi (El-Sayed et al., 2005). However, this does not imply that

541

T. cruzi undergoes meiosis (Ramírez & Llewellyn, 2014), and if it did, the presence of

542

these genes does not give indications about the frequency of it. Moreover, genes

543

analogous to meiosis genes may have acquired totally different functions through

544

evolution. ―Evolution is constantly re-using old genes for new purposes‖ (Birky 2009).

545

We have suggested that, in micropathogens undergoing PCE, the meiosis kit could

546

actually

547

micropathogens to use either one or the other mode of propagation when facing

548

different evolutionary challenges (Tibayrenc & Ayala, 2012). In conclusion: like for

549

mating experiments, the presence of genes homologous to meiosis genes is

550

therefore not evidence against the PE model.

ce pt

ed

M

an

539

to

a

―sexuality/clonality

machinery

that

could

allow

Ac

correspond

551 552

The PCE model and the problem of pathogen species definition

553

Page 23 of 52

24 Defining species in micropathogens has been always a challenge, since the

555

mixiological species concept (Dobzhansky, 1937), which addresses only organisms

556

with obligatory mating, cannot be applied to them. Moreover, the phylogenetic

557

species concept (Cracraft, 1983) is difficult to apply, since it is most probable that

558

some genetic exchange always occurs in micropathogens natural populations, even

559

between different species. The near-clade concept, which relaxes the cladistic

560

demands, makes it possible to revisit the phylogenetic species concept for

561

micropathogens (Tibayrenc and Ayala, 2014b). Two situations occur. The first case is

562

when presently described species can be equated to near-clades in an evolutionary

563

point of view. Many, if not most, Leishmania species are in this case. As a matter of

564

fact, in general, they are clearly discriminated by molecular markers. However,

565

hybridizations are frequent among them. Actually, many Leishmania species are

566

perfectly equivalent to T. cruzi near-clades from an evolutionary point of view. The

567

second case is when presently observed near-clades have not been described as

568

Linnean species yet. This is the case for the Giardia duodenalis ―assemblages‖ (=

569

near-clades) (Tibayrenc & Ayala, 2014b). T. cruzi near-clades could constitute the

570

basis for splitting the agent of Chagas disease into several species. We do not

571

recommend it for several reasons. First, the inventory of T. cruzi genetic diversity

572

could still evidence new features and unknown near-clades. Second, the advent of

573

whole genome sequencing could lead to reconsider the presently accepted near-

574

clade classification (Zingales et al., 2012). Third, the epidemiological and

575

pathological specificity of T. cruzi near-clades is not clear-cut. The Chagas disease

576

scientific community will decide whether the description of these new species is

577

desirable. The near-clade concept may constitute a necessary, but not sufficient,

578

criterion for it. Since near-clading is widespread among micropathogens, it would be

Ac

ce pt

ed

M

an

us

cr

ip t

554

Page 24 of 52

25 579

misleading to equate all of them to Linnean species. This should be done only when

580

a given near-clade exhibits strong phenotypic specificities dealing with relevant

581

characters (host range, drug resistance, pathogenicity for example).

582

584

ip t

583 The PCE model and applied research.

cr

585

The PCE model provides applied research with relevant units of analysis that are

587

based on simple concepts, and are sharply defined. Their stability in space and time

588

has been fully ascertained by retrospective studies, as well as by population genetics

589

and phylogenetic analysis. Clonal MLGs (clonets) can be used for epidemiological

590

tracking. To characterize them, markers with different mutation rates and level of

591

resolution can be selected according to the study concerned: microsatellites for

592

microepidemiology, MLST and whole genome sequencing plus SNPs (when these

593

two last tools become more accessible) for epidemiological surveys at broader

594

scales. Near-clades and their subdivisions (RDPs) can, and should be used as units

595

of analysis for epidemiological tracking, drug design (Zingales et al., 2014), clinical

596

studies and experimental evolution. Taking the near-clades as units of analysis, we

597

have conducted long-term experimental evolution studies, which have shown that

598

many phenotypic characters (including drug sensitivity, virulence in mice and cell

599

infectivity), as well as gene expression patterns, were linked to the near-clades

600

(Revollo et al., 1998; Telleria et al., 2004, 2010, 2013).

Ac

ce pt

ed

M

an

us

586

601

A major application of the PCE model for strain typing/molecular epidemiology is

602

indirect typing, which is a remarkable property of LD. Since all genes are linked

603

together (―block of genes‖), the near-clades can be identified by only one or two

Page 25 of 52

26 markers (―tags‖; Tibayrenc, 1998). Several RAPD and isoenzyme tags have been

605

identified for the 6 near-clades, including near-clade TCI, (Brisse et al., 2000). By the

606

amplification of a unique gene (TcSC5D), Cosentino & Agüero (2012) were able to

607

identify the T. cruzi near-clades I, II, II, IV, V+VI, TC-Bat, and the subspecies T. cruzi

608

marinkellei. We recommend this ―phylogenetic character mapping‖ (PCM) approach

609

(Avise, 2004). A convenient population genetics framework is first designed through

610

the PCE approach. Once the relevant units of analysis (near-clades) are clearly

611

identified, phenotypes of interest (drug resistance, virulence) are mapped onto this

612

framework. This makes it possible to see the links between the overall evolution of

613

the organism under study and its phenotypes, and at the same time, to show the

614

specific phenotypic properties of the near-clades.

an

us

cr

ip t

604

616

M

615 Conclusion

ed

617

Due to the dynamism of many research groups throughout the world, with a major

619

contribution from Latin American teams, the intraspecific genetic variability of T. cruzi

620

is probably one of the best known among micropathogens. The agent of Chagas

621

disease could be considered a star model for evolution, like Drosophila

622

melanogaster, Mus musculus, Caenorabditis elegans and Escherichia coli.

Ac

623

ce pt

618

The PCE model, which has been

elucidated and enriched by wide surveys

624

dealing with many pathogens (Tibayrenc & Ayala, 2012), sets in perspective the

625

particular case of T. cruzi population genetics. It shows that T. cruzi shares many

626

features with other major pathogens, including parasitic protozoa, fungi and bacteria.

627

Promising avenues of evolutionary research are represented by experimental

628

mating (less advanced for T. cruzi than for T. brucei), within near-clade population

Page 26 of 52

27 genetics, experimental evolution, and the development of more powerful tools of

630

analysis (whole genome sequencing, SNP analysis). These tools constitute a

631

challenge for analysis, since the computing power required for treating these data is

632

huge. However, the use of these tools becomes routine for viruses and bacteria, and

633

should be more and more accessible when parasitic protozoa are concerned, making

634

it possible to develop a population genomics of these pathogens.

Ac

ce pt

ed

M

an

us

cr

635

ip t

629

Page 27 of 52

28 636 637

Legends of the figures

638 Figure 1: The six Trypanosoma cruzi near-clades evidenced by MLST analysis (after

640

Lauthier et al., 2012). Left: neighbor-joining tree; right: split decomposition analysis.

641

The two modes of analysis, which rely on different computing methods and different

642

working hypotheses, show the same near-clading pattern. This is an indication that

643

the clustering of the near-clades is robust, according to the congruence principle

644

(Avise, 2004).

us

cr

ip t

639

an

645

Figure 2: Geographic distribution of the five lesser near-clades evidenced within the

647

Trypanosoma cruzi near-clade TCI (see fig. 1) (after Gull & Ramírez, 2011). The

648

lesser near-clades are widespread and occur sympatrically, which is an evidence of a

649

―Russian doll Pattern‖ (Tibayrenc & Ayala, 2013).

ed

M

646

Ac

ce pt

650

Page 28 of 52

29 651 652

Glossary of specialized terms:

653 Allopatry: a situation where different populations occur at different locations; cf

655

sympatry*

ip t

654

656

Aneuploidy: a situation in which the chromosome number of an individual is not an

658

exact multiple of the haploid set of the species. Copy numbers may vary among

659

chromosomes.

us

cr

657

an

660

Clade: evolutionary line defined by cladistic analysis. A clade is monophyletic* and is

662

genetically differentiated (i.e., evolves independently) from other clades.

M

661

663

Cladistic analysis: phylogenetic analysis based on the distribution of characters that

665

are

666

characteristics. Only apomorphic characters shared by all members of a given clade*

667

(synapomorphic character) convey relevant phylogenetic information.

670

ancestral

(plesiomorphic)

and

derived

(apomorphic)

ce pt

669

between

Diploid: possessing 2 copies of each chromosome.

Ac

668

shared

ed

664

671

DTU = Discrete Typing Unit: sets of strains that are genetically closer to each other

672

than to any other stock and are identifiable by common genetic, molecular or

673

immunological markers called tags (Tibayrenc, 1998).

674 675

Haploid: possessing only one copy of each chromosome.

Page 29 of 52

30 676 677

Homogamy: preferential mating between individuals that are genetically identical or

678

extremely similar to each other.

679 Homoplasy: distinct evolutionary units possess similar characters that do not

681

originate from a common ancestor.

ip t

680

cr

682

MLEE: = Multilocus Enzyme Electrophoresis. Analysis by electrophoresis of protein

684

extracts from biological samples. MLEE banding reflects the aminoacid sequence of

685

the proteins, coded by the DNA sequences. It is an efficient genetic typing method.

686

However, identical MLEE bands may correspond to different DNA sequences

687

(homoplasy*).

M

an

us

683

688

MLST = Multilocus sequence typing : a highly standardized typing technique

690

based on the sequencing of 450-bp segments of a set of housekeeping genes

691

(usually seven). It has been widely used for a high number of bacterial species and

692

some eukaryotic pathogens as well.

693

Molecular clock : in its strict, original sense (more correctly named: ―DNA clock

694

hypothesis‖), the concept that the rate of nucleotide substitutions in DNA remains

695

constant. In a broader sense, simply the rate of evolution of the genomic part that

696

codes for the variability of a given marker. This fastness is driven by the rate of

697

substitution/mutation. It may be regular or irregular.

Ac

ce pt

ed

689

698 699

Monophyletic: the case where a given phylogenetic lineage has only one ancestor

700

(see clade*).

Page 30 of 52

31 701 702

Panmixia, panmictic: a situation where genetic exchange occurs at random.

703 Parthenogenesis: Reproduction by the development of a single gamete without

705

fertilization by a gamete of the opposite sex.

ip t

704

706

PFGE = Pulse Field Gel Electrophoresis: separation of large DNA fragments by a

708

particular electrophoresis technique using alternately pulsed, perpendicularly

709

oriented electrical fields. Strains that share the same PFGE are referred to as pulse

710

types. In the case of bacteria, the large DNA fragments result from the action of a

711

―low-frequency cutter‖ (a bacterial endonuclease which restriction action has a low

712

frequency) on the bacterial chromosome. . In the case of parasitic protozoa

713

(Trypanosoma and Leishmania) and of yeasts, the large DNA fragments correspond

714

to entire chromosomes (―molecular karyotype‖).

ed

M

an

us

cr

707

715

RAPD = Random Amplified Polymorphic DNA: In the classical PCR method, the

717

primers used are known DNA sequences, whereas the RAPD technique relies on

718

primers whose sequence is arbitrarily determined.

719

Ac

ce pt

716

720

Recombination, recombining : Reassortment of genotypes occurring at different

721

loci.

722 723

Selfing : self-fertilization of an organism. If the organism is haploid (bacteria), or if all

724

loci are homozygous, selfing is undistiguishable from mitotic clonality. In diploid

Page 31 of 52

32 725

organisms, selfing leads to a deficit of heterozygotes, and to a lack of genetic

726

recombination

727 SNP = Single Nucleotide Polymorphism : a DNA sequence variation occurring

729

commonly within a population, in which a single nucleotide — A, T, C or G differs

730

between individuals.

ip t

728

cr

731

Sympatry: a situation where different populations are present in the same place; cf

733

allopatry*.

us

732

an

734 735

Wahlund

736

subpopulations (isolated in time or space or both) that have different genotype

737

compositions, population genetic tests performed at the level of the whole population

738

show departures from panmictic expectations, even if each subpopulation is

739

panmictic*.

742

sampled

population is composed

of

several

ed

M

the

ce pt

741

when

Zymodeme : MLEE* MLG.

Ac

740

effect:

Page 32 of 52

33 743 744

References

745 Agrawal, A.F. 2006. Evolution of Sex: Why Do Organisms Shuffle Their Genotypes?

747

Curr. Biol. 16: R696- R704.

748

ip t

746

749

Alam,

750

Shamsuzzaman, A.K.M., Raju, B. V.S., Salotra, P., Dujardin, J.C. & Schönian, G.

751

2009. Multilocus microsatellite typing (MLMT) reveals genetic homogeneity of

752

Leishmania donovani strains in the Indian subcontinent. Infection, Genetics and

753

Evolution 9: 24-31.

Kuhls,

K.,

Schweynoch,

C.,

Shyam

Sundar,

S.,

Rijal,

S.,

an

us

cr

M.Z.,

M

754

Amro, A., Schönian, G., Barakat Al-Sharabati, M.B., Azmi, K., Nasereddin, A.,

756

Abdeen, Z., Schnur, L.F., Baneth, G., Jaffe, C.L. & Kuhls, K. 2009. Population

757

genetics of Leishmania infantum in Israel and the Palestinian Authority through

758

microsatellite analysis. Microbes and Infect. 11 : 484-492.

ce pt

759

ed

755

Anderson, T.J., Haubold, B., Williams, J.T., Estrada-Franco, J.G., Richardson, L.,

761

Mollinedo, R., Bockarie, M., Mokili, J., Mharakurwa, S., French, N., Whitworth, J.,

762

Velez, I.D., Brockman, A.H., Nosten, F., Ferreira, M.U. & Day, K. 2000. Microsatellite

763

markers reveal a spectrum of population structures in the malaria parasite

764

Plasmodium falciparum. Mol. Biol. Evol. 17 (10): 1467-1482.

Ac

760

765 766

Araujo, A., Reinhard, K. & Ferreira, L.F. 2008. Parasite findings in archeological

767

remains: Diagnosis and interprétation. Quaternary Internat.180 : 17–21

Page 33 of 52

34 768 769

Avise, J.C. 2004. Molecular markers, Natural History and Evolution. 2 nd Ed.

770

Chapman & Hall, New York. London.

771 Avise, J. 2008. Clonality. The genetics, ecology, and evolution of sexual abstinence

773

in vertebrate animals. Oxford University Press.

ip t

772

cr

774

Balloux, F. 2010. Demographic Influences on Bacterial Population Structure. In:

776

Robinson DA, Falush, D., Feil, E.J. (Eds.), Bacterial Population Genetics in Infectious

777

Disease. Wiley-Blackwell, Hoboken, pp. 103-120.

an

us

775

778

Barnabé, C., Brisse, S. & Tibayrenc, M. 2000. Population structure and genetic

780

typing of Trypanosoma cruzi, the agent of Chagas’ disease: a multilocus enzyme

781

electrophoresis approach. Parasitol. 150: 513-526.

ed

M

779

782

Barnabé, C., De Meeûs, T., Noireau, F., Bosseno, M.F., Monje, E.M., Renaud, F. &

784

Brenière, S.F. 2011. Trypanosoma cruzi discrete typing units (DTUs): Microsatellite

785

loci and population genetics of DTUs TcV and TcI in Bolivia and Peru. Infect. Genet.

786

Evol. 11 : 1752-1760.

Ac

787

ce pt

783

788

Barnabé, C., Buitrago, R., Brémond, P., Aliaga, C., Salas, R., Vidaurre, P., Herrera,

789

C., Cerqueira, F., Bosseno, M.F., Etienne Waleckx, E. & Brenière, S.F. 2013.

790

Putative Panmixia in Restricted Populations of Trypanosoma cruzi Isolated from Wild

791

Triatoma

792

doi:10.1371/journal.pone.0082269

infestans

in

Bolivia.

Plos

One

8

(11):

e82269.

Page 34 of 52

35 793 794

Birky, C.W. 2009. Giardia Sex? Yes, but how and how much? Trends Parasitol. 26:

795

70-74.

796 Brenière, S.F., Tibayrenc, M., Antezana, G., Pabon, J., Carrasco, R., Selaès, H. &

798

Desjeux, P. 1985. Résultats préliminaires en faveur d’une relation faible ou

799

inexistante entre les formes cliniques de la maladie de Chagas et les souches

800

isoenzymatiques de Trypanosoma cruzi. C. R. Acad. Sci. Paris., 300 (15): 555-558.

us

cr

ip t

797

801

Brisse, S., Barnabé, C. & Tibayrenc, M. 2000. Identification of six Trypanosoma cruzi

803

phylogenetic lineages by random amplified polymorphic DNA and multilocus enzyme

804

electrophoresis. Int. J. parasitol. 30: 35-44.

M

an

802

805

Brisse, S., Henriksson, J., Barnabé, C., Douzery, E.J.P., Berkvens, D., Serrano, M.,

807

De Carvalho, M.R.C., Buck, G.A., Dujardin, J.C. & Tibayrenc, M. 2003. Evidence for

808

genetic exchange and hybridization in Trypanosoma cruzi based on nucleotide

809

sequences and molecular karyotype. Infection, Genetics and Evolution: 2 (3): 173-

810

183.

ce pt

Ac

811

ed

806

812

Budroni, S., Siena, E., Dunning Hotopp, J.C., Seib, K.L., Serruto, D., Nofroni, C.,

813

Comanducci, M., Riley, D.R., Daugherty, S.C., Angiuoli, S.V., Covacci, A., Pizza, M.,

814

Rappuoli, R., E. Moxon, E.R., Tettelin, H. & Medini, D. 2011. Neisseria meningitidis is

815

structured in clades associated with restriction modification systems that modulate

816

homologous recombination. Proc. Nat. Acad. Sci. USA 108: 4494-4499.

817

Page 35 of 52

36 818

Campbell, L.T., Currie, B.J., Krockenberger, M., Malik, R., Meyer, W., Heitman, J. &

819

Carter, D. 2005. Clonality and Recombination in Genetically Differentiated Subgroups

820

of Cryptococcus gattii. Eukaryotic Cell 4: 1403-1409.

821 Caugant, D. & Maiden, M.C.J. 2009. Meningococcal carriage and disease—

823

Population biology and evolution. Vaccine 27S: B64-B70.

cr

824

ip t

822

Chargui, N., Amro, A., Haouas, N., Schönian, G., Babba, H., Schmidt, S., Ravel, C.,

826

Lefebvre, M., Bastien, P., Chaker, E., Aoun, K., Zribi, M. & Kuhls, K. 2009.

827

Population structure of Tunisian Leishmania infantum and evidence for the existence

828

of hybrids and gene flow between genetically different populations. Int. J. Parasitol.

829

39: 801-811.

M

an

us

825

830

Cosentino, R.O. & Agüero, F. 2012. A Simple Strain Typing Assay for Trypanosoma

832

cruzi: Discrimination of Major Evolutionary Lineages from a Single Amplification

833

Product. Plos Neglect. Trop. Dis. 6: 1-11.

ce pt

834

ed

831

Cracraft, J. 1983. Species concept and speciation analysis. Pp. 159-187 in: Current

836

ornithology, R.F. Johnson (ed.), Plenum Press, New York.

837

Ac

835

838

de Freitas, J.M., Augusto-Pinto, L., Pimenta, J.R., Bastos-Rodrigues, L., Gonçalves,

839

V.F., Teixeira, S.M.R., Chiari, E., Junqueira, A.C.V., Fernandes, O., Macedo, A.M.,

840

Machado, C.R. & Pena, S.D.J. 2006. Ancestral Genomes, Sex, and the Population

841

Structure of Trypanosoma cruzi. Plos Pathogens 2 (3): e24.

842

Page 36 of 52

37 843

De Meeûs, T., McCoy, K., Prugnolle, F., Chevillon, C., Durand, P., Hurtrez-Boussès,

844

S. & Renaud, F. 2007. Population genetics and molecular epidemiology or how to

845

"débusquer la bête". Infect., Genet. Evol. 7: 308-332.

846 de Paula Baptista, R., Alchaar D’Ávila, D., Segatto, M., Faria do Valle, I., Regina

848

Franco, G., Silva Valadares, H.M., Dias Gontijo, E., da Cunha Galvão, L.M., Pena,

849

S.D.J., Chiari, E., Machado, C.R. & Macedo, A.M. 2014. Evidence of substantial

850

recombination among Trypanosoma cruzi II strains from Minas Gerais. Infect. Genet.

851

Evol. 22: 183-191.

us

cr

ip t

847

an

852

Didelot, X. & Maiden, M.C.J. 2010. Impact of recombination on bacterial evolution.

854

Trends Microbiol. 182: 315-322.

M

853

855

Dobzhansky, T. 1937. Genetics and the origin of species. Columbia University Press,

857

New York

ce pt

858

ed

856

Downing,T. Imamura, H., Decuypere, S. Clark, T.G., Coombs, G.H., Cotton, J.A.,

860

Hilley, J.D., de Doncker, S., Maes, L., Mottram, J.C., Quail, M.A., Rijal, S., Sanders,

861

M., Schönian, G., Stark, O., Sundar, S., Vanaerschot, M., Hertz-Fowler, C., Dujardin,

862

J.C.& Berriman, M. 2011. Whole genome sequencing of multiple Leishmania

863

donovani clinical isolates provides insights into population structure and mechanisms

864

of drug resistance. Genome Res. 21: 2143-2156.

Ac

859

865 866

El-Sayed, N.M., Myler, P.J., Bartholomeu, D.C., Nilsson, D., Aggarwal, G., Tran,

867

A.N., Ghedin, E., Worthey, E.A., Delcher, A.L., Blandin, G., Westenberger, S.J.,

Page 37 of 52

38 Caler, E., Cerqueira, G.C., Branche, C., Haas, B., Anupama, A., Arner, E., Aslund,

869

L., Attipoe, P., Bontempi, E., Bringaud, F., Burton, P., Cadag, E., Campbell, D.A.,

870

Carrington, M., Crabtree, J., Darban, H., da Silveira, J.F., de Jong, P., Edwards, K.,

871

Englund, P.T., Fazelina, G., Feldblyum, T., Ferella, M., Frasch, A.C., Gull, K., Horn,

872

D., Hou, L., Huang, Y., Kindlund, E., Klingbeil, M., Kluge, S., Koo, H., Lacerda, D.,

873

Levin, M.J., Lorenzi, H., Louie, T., Machado, C.R., McCulloch, R., McKenna, A.,

874

Mizuno, Y., Mottram, J.C., Nelson, S., Ochaya, S., Osoegawa, K., Pai, G., Parsons,

875

M., Pentony, M., Pettersson, U., Pop, M., Ramirez, J.L., Rinta, J., Robertson, L.,

876

Salzberg, S.L., Sanchez, D.O., Seyler, A., Sharma, R., Shetty, J., Simpson, A.J.,

877

Sisk, E., Tammi, M.T., Tarleton, R., Teixeira, S., Van Aken, S., Vogt, C., Ward, P.N.,

878

Wickstead, B., Wortman, J., White, O., Fraser, C.M., Stuart, K.D. & Andersson, B.

879

2005. The genome sequence of Trypanosoma cruzi, etiologic agent of Chagas

880

disease. Science 309: 409-415.

ed

881

M

an

us

cr

ip t

868

Feretzaki, F. & Heitman, J. 2013. Unisexual Reproduction Drives Evolution of

883

Eukaryotic Microbial Pathogens. PLoS Pathogens 9(10): e1003674. doi:10.1371/

884

journal.ppat.1003674

885

ce pt

882

Ferreira, R.C. & Briones, M.R.S. 2012. Phylogenetic evidence based on

887

Trypanosoma cruzi nuclear gene sequences and information entropy suggest that

888

inter-strain intragenic recombination is a basic mechanism underlying the allele

889

diversity of hybrid strains. Infect. Genet. Evol. 12: 1064-1071.

Ac

886

890 891

Flores-López, C.A. & Machado, C.A. 2011. Analyses of 32 Loci Clarify Phylogenetic

892

Relationships among

Trypanosoma cruzi

Lineages and

Support a

Single

Page 38 of 52

39 893

Hybridization prior to Human Contact. Plos Neglect. Trop. Dis. 5(8): e1272.

894

doi:10.1371/journal.pntd.0001272

895 Gaunt, M.W., M. Yeo, I.A. Frame, J.R. Tothard, H.J. Carrasco, M.C. Taylor, S.S.

897

Mena, P. Veazey, G.A. Miles, N. Acosta, A. Rojas de Arias, and M.A. Miles. 2003.

898

Mechanism of genetic exchange in American trypanosomes. Nature 421: 936-939.

ip t

896

cr

899

Guhl, F.& Ramírez, J.D. 2011. Trypanosoma cruzi I diversity: Towards the need of

901

genetic subdivision? Acta Tropica 119: 1-4.

us

900

an

902

Hamilton, P.B., Lewis, M.D., Cruickshank, C., Gaunt, M.W., Yeo, M., Llewellyn, M.S.,

904

Valente, S.A., Maia da Silva, F., Stevens, J.R., Miles, M.A. & Teixeira, M.M.G. 2011.

905

Identification and lineage genotyping of South American trypanosomes using

906

fluorescent fragment length barcoding. Infect. Genet. Evol. 11: 44-51.

ed

M

903

907

Herrera, C., Bargues, M.D., Fajardo, A., Montilla, M., Triana, O., Vallejo, G.A. & Guhl,

909

F. 2007. Identifying four Trypanosoma cruzi I isolate haplotypes from different

910

geographic regions in Colombia. Infect. Genet. Evol. 7: 535-539.

Ac

911

ce pt

908

912

Lachaud, L., Bourgeois, N., Kuk, N., Morelle, C., Crobu, L., Merlin, G., Bastien, P.,

913

Pagès, M. & Sterkers, Y. 2014. Constitutive mosaic aneuploidy is a unique genetic

914

feature widespread in the Leishmania genus. Microbes Infect. 16: 61-66.

915 916

Lauthier, J.J., Tomasini, N., Barnabé, C., Monje Rumi, M.M., Alberti D’Amato, A.M.,

917

Ragone, P.G., Yeo, M., Lewis, M.D., Llewellyn, M.S., Basombrío, M.A., Miles, M.A.,

Page 39 of 52

40 918

Tibayrenc, M. & Diosque, P. 2012. Candidate targets for Multilocus Sequence Typing

919

of Trypanosoma cruzi: Validation using parasite stocks from the Chaco Region and a

920

set of reference strains. Infect. Genet. Evol. 12: 350-358.

921 Lehmann, T., Graham, D.H., Dahl, E.R., Bahia-Oliveira, L.M.G., Gennari, S.M. &

923

Dubey, J.P. 2004. Variation in the structure of Toxoplasma gondii and the roles of

924

selfing, drift, and epistatic selection in maintaining linkage disequilibria. Infect. Genet.

925

Evol. 4: 107-114.

us

cr

ip t

922

926

Lewis, M.D., Llewellyn, M.S., Gaunt, M.W., Yeo, M., Carrasco, H.J. & Miles, M.A.

928

2009. Flow cytometric analysis and microsatellite genotyping reveal extensive DNA

929

content variation in Trypanosoma cruzi populations and expose contrasts between

930

natural and experimental hybrids. Int. J. Parasitol. 39: 1305-1317.

M

an

927

ed

931

Lewis, M.D., Llewellyn, M.S., Yeo, M., Acosta, N., Gaunt, M.W. & Miles, M.A. 2011.

933

Recent, Independent and Anthropogenic Origins of Trypanosoma cruzi Hybrids.

934

PLos Neglect. Trop. Dis. 5: 1-13.

935

ce pt

932

Lima, L., Ortiz, P.A., da Silva, F.M., Alves, J.M.P., Serrano, M.G., Cortez, A.P.,

937

Alfieri, S.C., Buck, G.A. & Teixeira, M.M.G. 2012. Repertoire, Genealogy and

938

Genomic Organization of Cruzipain and Homologous Genes in Trypanosoma cruzi,

939

T. cruzi-Like and Other Trypanosome Species.

940

doi:10.1371/journal.pone.0038385.

Ac

936

Plos One

7(6): e38385.

941 942

Llewellyn, M.S., Lewis, M.D., Acosta, N., Yeo, M., Carrasco, H.J., Segovia, M.,

Page 40 of 52

41 943

Vargas, J., Torrico, F., Miles, M.A., & Gaunt, M.W. 2009a. Trypanosoma cruzi IIc:

944

phylogenetic and phylogeographic insights from sequence and microsatellite analysis

945

and potential impact on emergent Chagas disease. PLoS Negl. Trop. Dis. 3 (9):

946

e510. doi:10.1371/journal.pntd.0000510

ip t

947 Llewellyn, M.S., Miles, M.A., Carrasco, H.J., Lewis, M.D., Yeo, M., Vargas, J.,

949

Torrico, F., Diosque, P., Valente, V., Valente, S.A. & Gaunt, M.A. 2009b. Genome-

950

Scale Multilocus Microsatellite Typing of Trypanosoma cruzi Discrete Typing Unit I

951

Reveals Phylogeographic Structure and Specific Genotypes Linked to Human

952

Infection. Plos Pathog. 5(5): e1000410. doi:10.1371/journal.ppat.1000410.

an

us

cr

948

953

Llewellyn, M.S., Rivett-Carnac, J.B., Fitzpatrick, S., Lewis, M.D., Yeo, M., Gaunt,

955

M.W. & Miles, M.A. 2011. Extraordinary Trypanosoma cruzi diversity within single

956

mammalian reservoir hosts implies a mechanism of diversifying selection. Int. J.

957

Parasitol. 41: 609-614.

ed

ce pt

958

M

954

Machado, C.A. & Ayala, F.J. 2001. Nucleotide sequences provide evidence of

960

genetic exchange among distantly related lineages of Trypanosoma cruzi. Proc. Nat.

961

Acad. Sci. U.S.A. 98: 7396–7401.

962

Ac

959

963 964 965

Maiden, M.C.J. 2006. Multilocus Sequence Typing of Bacteria. Ann. Rev. Microbiol. 60 : 561-588.

966

Maiden, M.C.J. 2008. Population genomics: diversity and virulence in the Neisseria.

967

Curr. Opinion in Microbiol. 11 : 467-471.

968

Page 41 of 52

42 969

Mannaert, A., Downing, T., Imamura, H. & Dujardin, J.C. 2012. Adaptive

970

mechanisms in pathogens: universal aneuploidy in Leishmania. Trends Parasitol.

971

28 : 370-376.

972 Marcili, A., Lima, L., Cavazzsana, M., Junqueira, A.C.V., Veludo, H.H., Maia da Silva,

974

F., Campaner, M., Paiva, F., Nunes, V.L.B. & Teixeira, M.M.G. 2009. A new

975

genotype of Trypanosoma cruzi associated with bats evidenced by phylogenetic

976

analyses using SSU rDNA, cytochrome b and Histone H2B genes and genotyping

977

based on ITS1 rDNA. Parasitology 136 : 641-655.

an

978

us

cr

ip t

973

Maynard Smith, J., Smith, N.H., O’Rourke, M. & Spratt, B.G. 1993. How clonal are

980

bacteria? Proc. Natl. Acad. Sci. USA, 90: 4384-4388.

M

979

981

Messenger, L.A., Llewellyn, M.S., Bhattacharyya, T., Franzén, O., Lewis, M.D.,

983

Ramírez, J.D., Carrasco, H.J., Andersson, B. & Miles, M.A. 2012. Multiple

984

Mitochondrial Introgression Events and Heteroplasmy in Trypanosoma cruzi

985

Revealed by Maxicircle MLST and Next Generation Sequencing. Plos Neglect. Trop.

986

Dis. 6: e1584. doi:10.1371/journal.pntd.0001584.

ce pt

Ac

987

ed

982

988

Miles, M.A., Llewellyn, M.S., Lewis, M.D., Yeo, M., Baleela, R., Fitzpatrick, S., Gaunt,

989

M.W. & Mauricio,, I.L. 2009. The molecular epidemiology and phylogeography of

990

Trypanosoma cruzi and parallel research on Leishmania: looking back and to the

991

future. Parasitology 136: 1509-1528.

992

Page 42 of 52

43 993

Miles, M.A., Souza, A., Povoa, M., Shaw, J.J., Lainson, R. & Toyé, P.J. 1978.

994

Isozymic heterogeneity of Trypanosoma cruzi in the first autochtonous patients with

995

Chagas’ disease in Amazonian Brazil. Nature, 272: 819-821.

996 Miles, M.A., Povoa, M., Prata, A., Cedillos, R.A., De Souza, A.A. & Macedo, V. 1981.

998

Do radically dissimilar Trypanosoma cruzi strains (zymodemes) cause Venezuelan

999

and Brazilian forms of Chagas’ disease ? Lancet, 8234: 1336-1340.

cr

ip t

997

us

1000

Miles, M.A., Toyé, P.J., Oswald, S.C. & Godfrey, D.G. 1977. The identification by

1002

isoenzyme patterns of two distinct strain-groups of Trypanosoma cruzi, circulating

1003

independently in a rural area of Brazil. Trans. R. Soc. Trop. Med. Hyg., 71 (3): 217-

1004

225.

M

an

1001

1005

Minning, T.A., Weatherly, D.B., Flibotte, S., Tarleton, R.L. 2011. Widespread, focal

1007

copy number variations (CNV) and whole chromosome aneuploidies in Trypanosoma

1008

cruzi strains revealed by array comparative genomic hybridization. BMC Genomics

1009

12: 139.

ce pt

1010

ed

1006

Moya, A., Holmes, E.C. & González-Candelas, F. 2004. The population genetics and

1012

evolutionary epidemiology of RNA viruses. Nature Rev. Microbiol. 2: 279-288.

1013

Ac

1011

1014

Ocaña-Mayorga, S., Llewellyn, M.S., Costales, J.A., Miles, M.A. & Grijalva, M.J.

1015

2010. Sex, Subdivision, and Domestic Dispersal of Trypanosoma cruzi Lineage I in

1016

Southern Ecuador. Plos Neglected Trop. Dis. 4: 1-8.

1017

Page 43 of 52

44 1018

Oliveira, R.P., Broude, N.E., Macedo, A.M., Cantor, C.R., Smith, C.L. & Pena, S.D.J.

1019

1998. Probing the genetic population structure of Trypanosoma cruzi with

1020

polymorphic microsatellites. Proc. Nat. Acad. Sci. USA 95: 3776-3780.

1021 Omilian, A.R., Cristescu, M.E.A., Dudycha, J.L. & Lynch, M. 2006. Ameiotic

1023

recombination in asexual lineages of Daphnia. Proc. Nat. Acad. Sci. USA 103 :

1024

18638-18643.

cr

ip t

1022

us

1025

Pearson, T., Okinaka, R.T., Foster, J.T. & Keim, P. 2009. Phylogenetic

1027

understanding of clonal populations in an era of whole genome sequencing. Infect.

1028

Genet. Evol. 9: 1010-1019.

an

1026

M

1029

Pinto, C.M., Kalko, E.K.V., Cottontail, I., Wellinghausen, N. & Cottontail, V.M. 2012.

1031

TcBat a bat-exclusive lineage of Trypanosoma cruzi in the Panama Canal Zone, with

1032

comments on its classification and the use of the 18S rRNA gene for lineage

1033

identification. Infect. Genet. Evol. 12: 1328-1332.

ce pt

1034

ed

1030

Ramírez, J.D., Duque, M.C. & Guhl, F. 2011. Phylogenetic reconstruction based on

1036

Cytochrome b (Cytb) gene sequences reveals distinct genotypes within Colombian

1037

Trypanosoma cruzi I populations. Acta Tropica119: 61-65.

1038

Ac

1035

1039

Ramírez, J.D., Guhl, F., Messager, L.A., Lewis, M.D., Montilla, M., Cucunubá, Z.,

1040

Miles, M.A. & Llewellyn, M.S. 2012. Contemporary cryptic sexuality in Trypanosoma

1041

cruzi. Molec. Ecol. 17: 4216-4226.

1042

Page 44 of 52

45 1043

Ramírez, J.D., Herrera, C., Bogotá, Y., Duque, M.C., Suárez-Rivillas, A. & Guhl, F.

1044

2013a. Validation of a Poisson-distributed limiting dilution assay (LDA) for a rapid

1045

and accurate resolution of multiclonal infections in natural Trypanosoma cruzi

1046

populations. J. Microbiol. Methods 92: 220-225.

ip t

1047 Ramírez, J.D., Tapia-Calle G.& Guhl, F. 2013b. Genetic structure of Trypanosoma

1049

cruzi in Colombia revealed by a High-throughput Nuclear Multilocus Sequence

1050

Typing (nMLST) approach. BMC Genet. 14: 96.

us

cr

1048

1051

Ramírez, J.D. & Llewellyn, M.S. 2014. Reproductive clonality in protozoan

1053

pathogens—truth or artefact? Molec. Ecol. 23, 4195-4202.

an

1052

M

1054

Revollo, S., Oury, B., Laurent, J.P., Barnabé, C., Quesney, V., Carrière, V., Noël, S.

1056

& Tibayrenc, M. 1998. Trypanosoma cruzi: impact of clonal evolution of the parasite

1057

on its biological and medical properties. Exp. Parasitol. 89: 30-39.

ce pt

1058

ed

1055

Rogers, M.B., Hilley, J.D., Dickens, N.J., Wilkes, J., Bates, P.A., Depledge, D.P.,

1060

Harris, D., Her, Y., Herzyk, P., Imamura, H., Otto, T.D., Sanders, M., Seeger, K.,

1061

Dujardin, J.C., Berriman, B., Smith, D.F., Hertz-Fowler, C. & Mottram, J.C. 2011.

1062

Chromosome and gene copy number variation allow major structural change

1063

between species and strains of Leishmania. Genome Res. 21: 2129-2142.

Ac

1059

1064 1065

Rougeron, V., De Meeûs, T., Hide, M., Waleckx, E., Bermudez, H., Arevalo, J.,

1066

Llanos-Cuentas, A., Dujardin, J.C., De Doncker, S., Le Ray, D., Ayala, F.J., &

1067

Bañuls, A.L. 2009. Extreme inbreeding in Leishmania braziliensis. Proc. Nat. Acad.

Page 45 of 52

46 1068

Sci. USA 106: 10224-10229.

1069 1070

Rougeron, V., De Meeûs, T. & Bañuls, A.L. 2015. A primer for Leishmania population

1071

genetic studies. Trends Parasitol. 31: 52-59.

ip t

1072 Rozas, M. De Doncker, S., Adaui, V., Coronado, X. Barnabé, C. ,Tibayrenc, M.,

1074

Solari, A. & Dujardin, J.C. 2007. Multilocus Polymerase Chain Reaction Restriction

1075

Fragment–Length Polymorphism Genotyping of Trypanosoma cruzi (Chagas

1076

Disease): Taxonomic and Clinical Applications. J. Infect. Dis. 195: 1381-1388.

us

cr

1073

an

1077

Sibley, L.D., Boothroyd, J.C., 1992. Virulent strains of Toxoplasma gondii comprise a

1079

single clonal lineage. Nature 359, 82–85.

M

1078

1080

Souza, R.T., Lima, F.M., Moraes Barros, R., Cortez, D.R., Santos, M.F., Cordero,

1082

E.M., Conceiçao Ruiz, J., Goldenberg, S., Teixeira, M.M.G. & Franco da Silveira, J.

1083

2011. Genome Size, Karyotype Polymorphism and Chromosomal Evolution in

1084

Trypanosoma cruzi. Plos One 6 (8): e23042. doi:10.1371/journal.pone.0023042.

ce pt

ed

1081

1085

Sterkers, Y., Crobu, L., Lachaud, L., Pagès, M. & Bastien, P. 2014. Parasexuality

1087

and mosaic aneuploidy in Leishmania: alternative genetics. Trends Parasitol. 30 :

1088

429-435.

Ac

1086

1089 1090

Sterkers, Y., Lachaud, L. Bourgeois, N., Crobu, L. Bastien, P. & Pagès, M. 2012.

1091

Novel insights into genome plasticity in Eukaryotes: mosaic aneuploidy in

1092

Leishmania. Molec. Microbiol. 86 : 15-23.

Page 46 of 52

47 1093 1094

Sterkers, Y., Lachaud, L., Crobu, L., Bastien, P. & Pagès, M. 2011. FISH analysis

1095

reveals aneuploidy and continual generation of chromosomal mosaicism in

1096

Leishmania major. Cellular Microbiol. 139 : 274-283.

ip t

1097 Su, C., A. Khan, P. Zhou, D. Majumdara, D. Ajzenberg, M.L. Dardé, X.Q. Zhu, J.W.

1099

Ajioka, B.M. Rosenthal, J.P. Dubey, and D. Sibley. 2012. Globally diverse

1100

Toxoplasma gondii isolates comprise six major clades originating from a small

1101

number of distinct ancestral lineages. Proc. Nat. Acad. Sci. USA 109 : 5844-5849.

us

cr

1098

an

1102

Subileau, M., Barnabé, C., Douzery, E., Diosque, P. & Tibayrenc, M. 2009.

1104

Trypanosoma cruzi: New insights on ecophylogeny and hybridization by Multigene

1105

sequencing of three nuclear and one maxicircle genes. Exp. Parasitol.122: 328-337..

M

1103

ed

1106

Telleria, J., Barnabé, C., Hide, M., Bañuls, A.L. & Tibayrenc, M. 2004. Predominant

1108

clonal evolution leads to a close parity between gene expression profiles and

1109

subspecific phylogeny in Trypanosoma cruzi. Mol. Biochem. Parasitol. 137 (1): 133-

1110

141.

Ac

1111

ce pt

1107

1112

Telleria J., Biron, D.G., Brizard, J.P., Demettre, E., Seveno, M., Barnabé, C., Ayala,

1113

F.J. & Tibayrenc M. 2010. Phylogenetic Character Mapping of proteomic diversity

1114

shows high correlation with subspecific phylogenetic diversity in Trypanosoma cruzi,

1115

the agent of Chagas disease. Proc. Nat. Acad. Sci. USA 107: 20411-20416.

1116

Page 47 of 52

48 1117

Telleria, J., Barnabé, C., Ayala, F.J. & Tibayrenc, M. 2013. Phylogenetic character

1118

mapping of RADES Probing, a new marker for exploring the clonal evolution of

1119

expressed coding sequences in Trypanosoma cruzi, the agent of Chagas disease.

1120

Infect. Genet. Evol. 19: 287-291.

ip t

1121 Tibayrenc, M. 1998. Genetic epidemiology of parasitic protozoa and other infectious

1123

agents: the need for an integrated approach. Int. J. Parasitol. 28, 1: 85-104.

cr

1122

us

1124 1125

Tibayrenc,

1126

microorganisms: the clonal theory of parasitic protozoa. Parasitol. Today: 7 (9): 228-

1127

232.

&

Ayala,

F.J.

1991a.

Towards

a

population

genetics

of

an

M.

M

1128

Tibayrenc, M. & Ayala, F.J. 2002. The clonal theory of parasitic protozoa: 12 years

1130

on. Trends Parasitol. 18 (9): 405-410.

ed

1129

1131

Tibayrenc, M. & Ayala, F.J. 2012. Reproductive clonality of pathogens: A perspective

1133

on pathogenic viruses, bacteria, fungi, and parasitic protozoa. Proc. Nat. Acad. Sci.

1134

USA 109 (48): E3305-E3313.

Ac

1135

ce pt

1132

1136

Tibayrenc, M. & Ayala, F.J. 2013. How clonal are Trypanosoma and Leishmania?

1137

Trends Parasitol. 29 : 264-269

1138 1139

Tibayrenc, M. & Ayala, F.J. 2014a. New insights into Clonality and Panmixia in

1140

Plasmodium and Toxoplasma. Adv. Parasitol. 84: 253-268.

1141

Page 48 of 52

49 1142

Tibayrenc, M. & Ayala, F.J. 2014b. Cryptosporidium, Giardia, Cryptococcus,

1143

Pneumocystis genetic variability: cryptic biological species or clonal near-clades?

1144

Plos Pathogens 10(4): e1003908. doi:10.1371/journal.ppat. 1003908

1145 Tibayrenc, M. & Brenière, S.F. 1988. Trypanosoma cruzi: major clones rather than

1147

principal zymodemes. Mem. Inst. Oswaldo Cruz, Rio De Janeiro, Suppl.1 83: 249-

1148

255.

cr

ip t

1146

us

1149

Tibayrenc, M., Cariou, M.L., Solignac, M. & Carlier, Y. 1981. Arguments génétiques

1151

contre l’existence d’une sexualité actuelle chez Trypanosoma cruzi; implications

1152

taxinomiques. C. R. Acad. Sci. Paris, 293: 207-209.

an

1150

M

1153

Tibayrenc, M., Cariou, M.L., Solignac, M., Dédet, J.P., Poch, O. & Desjeux, P. 1985.

1155

New electrophoretic evidence of genetic variation and diploidy in Trypanosoma cruzi,

1156

the causative agent of Chagas’ disease. Genetica. 67: 223-230.

ed

1154

ce pt

1157

Tibayrenc, M., Kjellberg, F. & Ayala, F.J. 1990. A clonal theory of parasitic protozoa:

1159

the population structure of Entamoeba, Giardia, Leishmania, Naegleria, Plasmodium,

1160

Trichomonas and Trypanosoma , and its medical and taxonomical consequences.

1161

Proc. Nat. Acad. Sci. USA : 87: 2414-2418.

1162

Ac

1158

1163

Tibayrenc, M., Kjellberg, F. & Ayala, F.J. 1991. The clonal theory of parasitic

1164

protozoa: a taxonomic proposal applicable to other clonal organisms. Bioscience: 41

1165

(11): 767-774.

1166

Page 49 of 52

50 1167

Tibayrenc, M., Kjellberg, F., Arnaud, J., Oury, B., Brenière, S.F., Dardé, M.L. &

1168

Ayala, F.J. 1991. Are eukaryotic microorganisms clonal or sexual? A population

1169

genetics vantage. Proc. Natl. Acad. Sci. USA: 88: 5129-5133.

1170 Tibayrenc, M., Neubauer, K., Barnabé, C., Guerrini, F., Skarecki, D. & Ayala, F.J.

1172

1993. Genetic characterization of six parasitic protozoa: parity of random-primer

1173

DNA typing and multilocus isoenzyme electrophoresis. Proc. Natl. Acad. Sci. USA,

1174

90: 1335-1339.

us

cr

ip t

1171

1175

Tibayrenc, M., Ward, P., Moya, A. & Ayala, F.J. 1986. Natural populations of

1177

Trypanosoma cruzi, the agent of Chagas’ disease, have a complex multiclonal

1178

structure. Proc. Nat. Acad. Sci. USA. 83: 115-119.

M

an

1176

1179

Tomasini, N., Lauthier, J.J., Monje Rumi, M.M., Ragone, P.G., Alberti D’Amato, A.A.,

1181

Pérez Brandan, C., Cura, C.I., Schijman, A.G., Barnabé, C., Tibayrenc, M.,

1182

Basombrıó, M.A., Falla, A., Herrera, C., Guhl, F. & Diosque, P. 2011. Interest and

1183

limitations

1184

Trypanosoma cruzi I phylogenetic diversity in the Argentinean Chaco. Infect. Genet.

1185

Evol. 11: 300-307.

ce pt

of

Spliced

Leader

Intergenic

Region

sequences

for

analyzing

Ac

1186

ed

1180

1187

Tomasini, N., Lauthier, J.J., Monje Rumi, M.M., Ragone, P.G., Alberti D’Amato, A.M.,

1188

Pérez Brandán, C., Basombrío, M.A. & Diosque, P. 2014. Preponderant clonal

1189

evolution of Trypanosoma cruzi I from Argentinean Chaco revealed by Multilocus

1190

Sequence Typing (MLST). Infect. Genet. Evol. 27: 348-354.

1191

Page 50 of 52

51 1192

Volkman, S.K., Neafsey, D.E., Schaffner, S.F., Park, D.J. & Wirth, D.F. 2012.

1193

Harnessing genomics and genome biology to understand malaria biology. Nature

1194

Rev. Genet. 13: 315-328.

1195 Westenberger, S.J., Barnabé, C., Campbell, D.A. & Sturm, N.R. 2005. Two

1197

Hybridization Events Define the Population Structure of Trypanosoma cruzi. Genetics

1198

171: 527-543.

cr

ip t

1196

us

1199

Westenberger, S.J., Sturm, N.R. & Campbell, D.A. 2006. Trypanosoma cruzi 5S

1201

rRNA arrays define five groups and indicate the geographic origins of an ancestor of

1202

the heterozygous hybrids. Int. J. Parasitol. 36: 337-346.

an

1200

M

1203

Wielinga, C.M. & Thompson, R.C.A. 2007. Comparative evaluation of Giardia

1205

duodenalis sequence data. Parasitol. 134: 1795-1821.

ed

1204

1206

Yeo, M., Mauricio, I.L., Messenger, L.A., Lewis, M.D., Llewellyn, M.S., Acosta, N.,

1208

Bhattacharyya, T., Diosque, P., Carrasco, H.J. & Miles, M.A. 2011. Multilocus

1209

Sequence Typing (MLST) for Lineage Assignment and High Resolution Diversity

1210

Studies in

1211

doi:10.1371/journal.pntd.0001049.

Trypanosoma

Ac

1212

ce pt

1207

cruzi.

Plos

Neglect.

Trop.

Dis.

5

(6):

e1049.

1213

Yip, C.W., Hon, C.C., Shi, M., Lam, T.T., Chow, K.Y., Zeng, F. & Leung, F.C. 2009.

1214

Phylogenetic perspectives on the epidemiology and origins of SARS and SARS-like

1215

coronaviruses. Infect. Genet. Evol. 9: 1185-1196.

1216

Page 51 of 52

52 Zingales, B., Miles, M.A., Campbell, D., Tibayrenc, M., Macedo, A.M., Teixeira, M.M.,

1218

Schijman, A., Llewellyn, M.S., Lages-Silva, E., Machado, C.R., Andrade, S.G. &

1219

Sturm, N.R.. 2012. The revised Trypanosoma cruzi subspecific nomenclature:

1220

rationale, epidemiological relevance and research applications. Infect. Genet.

1221

Evol. 12: 240-253.

ip t

1217

1222

Zingales, B., Miles, M.A., Moraes, C.B., Luquetti, A., Guhl, F., Schijman, A.G.,

1224

Ribeiro. 2014. Drug discovery for Chagas disease should consider Trypanosoma

1225

cruzi strain diversity. Mem. Inst. Oswaldo Cruz, Rio de Janeiro, 109: 828-833.

us

cr

1223

an

1226

Zoossmann-Diskin, A. 2010. The origin of Eastern European Jews revealed by

1228

autosomal, sex chromosomal and mtDNA polymorphisms. Biol Direct 5: 57.

M

1227

1229

ed

1230

1234 1235 1236 1237 1238 1239 1240 1241 1242 1243

Highlight

 We expose our view on Trypanosoma cruzi population genetics

Ac

1232 1233

ce pt

1231

 We show it is enlightened by our model of pathogen predominant clonal evolution.  We draw comparisons with other pathogens, in particular Leishmania.  We underline the implications of the model for applied research.

1244

Page 52 of 52