T4-type viruses: Important impacts on shaping bacterial community along a chronosequence of 2000-year old paddy soils

T4-type viruses: Important impacts on shaping bacterial community along a chronosequence of 2000-year old paddy soils

Accepted Manuscript T4-type viruses: Important impacts on shaping bacterial community along a chronosequence of 2000-year old paddy soils Yong Li, Hai...

926KB Sizes 0 Downloads 53 Views

Accepted Manuscript T4-type viruses: Important impacts on shaping bacterial community along a chronosequence of 2000-year old paddy soils Yong Li, Haiyang Liu, Hong Pan, Xinyu Zhu, Chen Liu, Qichun Zhang, Yu Luo, Hongjie Di, Jianming Xu PII:

S0038-0717(18)30355-9

DOI:

10.1016/j.soilbio.2018.10.007

Reference:

SBB 7311

To appear in:

Soil Biology and Biochemistry

Received Date: 21 July 2018 Revised Date:

26 September 2018

Accepted Date: 15 October 2018

Please cite this article as: Li, Y., Liu, H., Pan, H., Zhu, X., Liu, C., Zhang, Q., Luo, Y., Di, H., Xu, J., T4type viruses: Important impacts on shaping bacterial community along a chronosequence of 2000-year old paddy soils, Soil Biology and Biochemistry (2018), doi: https://doi.org/10.1016/j.soilbio.2018.10.007. This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

ACCEPTED MANUSCRIPT Title

2

T4-type viruses: important impacts on shaping bacterial community along a

3

chronosequence of 2000-year old paddy soils

4

Authors

5

Yong Li1, Haiyang Liu1, Hong Pan1, Xinyu Zhu2, Chen Liu3, Qichun Zhang1, Yu Luo1,

6

Hongjie Di1, Jianming Xu1*

7

1

8

Institute of Soil and Water Resources and Environmental Science, College of

9

Environmental and Resource Sciences, Zhejiang University, Hangzhou 310058, China;

SC

RI PT

1

M AN U

Zhejiang Provincial Key Laboratory of Agricultural Resources and Environment,

10

2

11

310007, China;

12

3

13

Agricultural Sciences, Hangzhou 310021, China.

14

Correspondence: Jianming Xu, College of Environmental and Resource Sciences,

15

Zhejiang University, 866 Yuhangtang Road, Hangzhou 310058, Zhejiang, China. Tel.:

16

+86 88982069; Fax: 86-571-88982069; E-mail: [email protected].

TE D

Environmental Science Research & Design Institute of Zhejiang Province, Hangzhou

AC C

EP

Institute of Environment, Resource, Soil and Fertilizer, Zhejiang Academy of

1

ACCEPTED MANUSCRIPT Abstract

18

There is increasing evidence to suggest that viruses may influence the succession of

19

individual populations of microorganisms, biogeochemical cycles and, ultimately,

20

microbial community structure. However, it is still not well understood if T4-type

21

viruses can affect the bacterial communities of terrestrial ecosystems. Here, we

22

report an investigation of the impact of T4-type phage and bottom-up

23

(environmental factors) controls on bacterial community structures along a

24

2000-year paddy soil chronosequence. T4-type myoviral and bacterial communities

25

were evaluated by clone sequencing and high-throughput sequencing of the gene

26

encoding the major capsid protein (g23) and 16S ribosomal DNA, respectively.

27

Long-term (centurial/millennial) anthropogenic managements of paddy soils resulted

28

in an accumulation of nutrients and soil acidification. Significant shifts in soil

29

bacterial and phage communities were detected during the development of paddy

30

soils at millennial time scales. The Mantel test and variation partitioning analysis

31

(VPA) suggested that the profile of bacterial community composition was strongly

32

affected by both T4-type phage and environmental variables. Network analysis

33

between phage and bacterial taxa indicated that six bacterial families were

34

implicated as potential hosts of T4-type phages. These results suggest that phage

35

lysis is important in shaping bacterial communities in the soil environment.

36

Key Words: T4-type phage, Bottom-up controls, Bacteria, g23, Paddy soil

37

chronosequence.

AC C

EP

TE D

M AN U

SC

RI PT

17

2

ACCEPTED MANUSCRIPT 1. Introduction

39

As the most abundant biological entities, viruses are increasingly recognized as a

40

major driving force of global biogeochemical nutrient cycles and have received

41

considerable attention over the past two decades (Fuhrman, 1999; Suttle, 2005;

42

Weitz and Wilhelm, 2012). Through direct mortality of nearly all forms of cellular life,

43

including bacteria, archaea and microeukaryotes, and the release of cellular

44

nutrients, viruses influence the succession of

45

microorganisms, biogeochemical cycles and, ultimately, microbial community

46

structure (Suttle, 2007; Weinbauer, 2004; Wilhelm and Matteson, 2008; Zhong and

47

Jacquet, 2014). In contrast to this top-down control (Viral lysis), bacterial activity is

48

also mediated by bottom-up controls (e.g., resource availability and competition)

49

(Chow et al., 2014). It is generally accepted that the dominant top-down controls, or

50

sources of bacterial mortality, in the open ocean are thought to be viral lysis and

51

protistan grazing (Chow et al., 2014). For example, viruses are important agents of

52

bacterial removal and are believed to be responsible for 10–50% of the total

53

bacterial mortality in surface waters (Fuhrman, 1999; Suttle, 2007). Many studies

54

have sought to quantify grazing and viral lysis to determine the impact of top-down

55

controls on structuring microbial communities in aquatic ecosystems (Baudoux et al.,

56

2008; Fuhrman and Noble, 1995; Longnecker et al., 2010; Staniewski et al., 2012;

57

Weinbauer et al., 2003; Weinbauer et al., 2007). Chow et al (2014) found that virus–

58

bacteria relationships were more cross-linked than protist–bacteria relationships,

59

and their association networks supported the paradigm that microbes were

RI PT

38

AC C

EP

TE D

M AN U

SC

individual populations of

3

ACCEPTED MANUSCRIPT 60

regulated by both bottom-up and top-down controls. Despite the significant effects of viruses on shaping microbial communities, all

62

those studies focused on virus–bacteria relationships either in marine water (Cram et

63

al., 2016; Fuhrman and Noble, 1995; Suttle, 1994; Weinbauer et al., 2003; Weitz et

64

al., 2015; Wilhelm and Matteson, 2008) or in sediments (Corinaldesi et al., 2010;

65

Engelhardt et al., 2015; Middelboe and Glud, 2006). There is only limited information

66

on the impact of viruses on microbial communities in terrestrial ecosystems (Allen et

67

al., 2010; Helsley et al., 2014). Nevertheless, direct counts have shown that phages

68

are extremely abundant in terrestrial ecosystems. Moreover, Srinivasiah et al (2015)

69

recently reported that phage abundance responded quickly to changes in substrate

70

availability and host growth in soils, implying that phages were active and dynamic

71

members of the community. T4-type bacteriophages, an important component of

72

the Myoviridae family, have been widely studied after Filée et al (2005) first

73

investigated uncultured bacteriophage from diverse marine environments.

74

Subsequent studies revealed that T4-type bacteriophages in soil environments were

75

different from those in aquatic environments (Fujii et al., 2008; Jia et al., 2007; Liu et

76

al., 2011; Wang et al., 2009a; Wang et al., 2009b; Zheng et al., 2013). Furthermore,

77

the T4-type phage communities in paddy fields were found to be more

78

phylogenetically diverse than those in marine environments, and the soil-specific

79

phages retrieved from Japanese and Chinese paddy field soils were phylogenetic

80

clustered into nine paddy groups (Wang et al., 2009a; Wang et al., 2009c; Li et al.,

81

2013). The extremely abundant and higher phylogenetic diversity of bacteriophages

AC C

EP

TE D

M AN U

SC

RI PT

61

4

ACCEPTED MANUSCRIPT point to potential impacts of viruses on microbial communities in soil environments,

83

even though the roles that viruses may play in soil are different from those in aquatic

84

environments (Kimura et al., 2008). Allen et al (2010) investigated top-down controls

85

by phages on soil microbes using both field and laboratory experiments and

86

suggested that top-down controls, such as phage lysis, are critical to the regulation of

87

microbial activities in Arctic soils. Nevertheless, Helsley et al (2014) found little

88

evidence that phages exerted significant top-down controls on bacterial abundance,

89

respiration, or growth in temperate soils and suggested that the role phages play in

90

soil ecosystems varied dramatically with biome type. However, information on the

91

effect of viral lysis on soil microbes is scarce (Ghosh et al., 2008; Nakayama et al.,

92

2007; Williamson et al., 2003). Most studies on soil bacterial communities focus on

93

bottom-up controls (e.g., pH, soil organic carbon, N availability), while not enough

94

attention has been paid to the impact of viruses (Liu et al., 2014; Luo et al., 2017; Sul

95

et al., 2013; Zeng et al., 2016).

TE D

M AN U

SC

RI PT

82

Here, we present evidence of T4-type phages, which has an important impact

97

on shaping bacterial communities in a paddy soil chronosequence. We investigated

98

the structure and diversity of both bacterial and bacteriophage communities on a soil

99

chronosequence of up to 2000 years of rice cultivation after reclamation from a

100

mudflat in the Yangtze River Delta by means of deep MiSeq sequencing of the 16S

101

rRNA gene amplicons for bacterial communities, and sequencing major capsid gene

102

(g23) amplicons for T4-type phages, respectively. We focused specifically on the

103

T4-type myovirus family because T4-type viruses are diverse, abundant, and

AC C

EP

96

5

ACCEPTED MANUSCRIPT detectable through cultivation-independent methods and because their major capsid

105

gene (g23) has been shown to serve as a reasonable proxy for variation in globally

106

ubiquitous myovirus genomes (Chow et al., 2014; Comeau and Krisch, 2008; Filee et

107

al., 2005; Needham et al., 2013). This investigation was designed to answer two

108

questions: (i) How are the soil bacterial and phage communities associated with

109

paddy soil chronosequence? (ii) Would T4-type phage affect the distribution of the

110

bacterial community?

111

2. Materials and Methods

112

2.1 Soil sampling and analysis of basic soil properties

113

The soil samples used in this study were collected from paddy fields located in Cixi,

114

Zhejiang Province, China (121°12’-121°42’N, 30°21’-30°24’E). The sampling area is a

115

marine deposit plain. The deposited materials originated from the nearby Yangtze

116

River as evidenced by geochemical studies (Cheng et al., 2009). Chronosequences

117

were established on dikes constructed in different periods. Rice cultivation occurring

118

in the sampling area was identified according to the Cixi County Annals. A paddy

119

chronosequence with 50-, 100-, 300-, 700-, 1000-, and 2,000-year (S50, S100, S300,

120

S700, S1000 and S2000) durations of rice cultivation was sampled in March 2011,

121

before rice was sown (Fig. S1). We sampled mudflats in sites near the Yangtze River

122

as a reference for initial soil development characteristics, and this was defined as age

123

0 (S0). Soil sampling was carried out in triplicate fields using soil auger, and 0- to

124

10-cm surface soils were collected by mixing five random soil cores. The soil samples

125

were transported to the laboratory at 4°C. The soil samples were passed through a

AC C

EP

TE D

M AN U

SC

RI PT

104

6

ACCEPTED MANUSCRIPT 126

2-mm mesh sieve and stored at -20°C until use. All analyses were conducted according to the protocols of the Handbook of Soil

128

Analysis (Pansu and Gautheyrou, 2007). In brief, soil pH and electric conductivity (EC)

129

were measured at a soil:water ratio of 1:2.5. Total C was determined by oxidation

130

with dichromate, total N was measured by the Kjeldahl method, and total P was

131

determined following a wet acid digestion procedure with perchloric and sulfuric acid

132

and then measured by the molybdenum blue method. All measurements are the

133

mean of three replicate analyses.

134

2.2 Soil sample DNA extraction and PCR amplification

135

DNA was extracted from 0.5 g of soil using a Fast DNA SPIN kit for soil (MP

136

Biomedicals; Solon, OH, USA) according to the manufacturer’s instructions.

M AN U

SC

RI PT

127

The major capsid gene, g23 of T4-type phages was amplified with the

138

degenerate g23 primers MZIA1bis (5′-GAT ATT TGI GGI GTT CAG CCI ATG A-3′) and

139

MZIA6 (5′-CGC GGT TGA TTT CCA GCA TGA TTT C-3′) (Filee et al., 2005) according to

140

the method of Fujii et al (2008). PCRs were performed in a total volume of 50 µL in

141

200-µL microtubes; each reaction contained 0.4 µL of each primer (50 pmol each), 5

142

µL of 2.5 mM dNTP mixture, 5 µL of 10× Ex Taq DNA buffer (20 mM Mg2+; TaKaRa,

143

Tokyo, Japan), 0.1 µL of 0.1% BSA (TaKaRa Bio Inc.), 0.5 µL of Ex Taq DNA polymerase

144

(TaKaRa, Bio), 1 µL of DNA template and 31.75 µL of milli-Q water. The cycling

145

conditions for PCR amplification were as follows: initial denaturation at 94°C for 5

146

min, 35 cycles of denaturation at 95°C for 1 min, annealing at 55°C for 1 min and

147

extension at 72°C for 1 min, and a final extension at 72°C for 5 min using a TaKaRa

AC C

EP

TE D

137

7

ACCEPTED MANUSCRIPT PCR Thermal Cycler Dice (TaKaRa) (Fujii et al., 2008). The PCR products were

149

visualized on 2% (w/v) agarose gels made with TAE buffer (40 mM Tris-acetate and 2

150

mM EDTA) and ethidium bromide (10 mg mL-1) staining. Electrophoresis of agarose

151

gels was performed with 1× TAE buffer in a Mini Gel Electrophoretic System (Advance,

152

Tokyo, Japan) at 100 V for 30 min.

153

2.3 16S rRNA gene and g23 sequencing

154

The V4-V5 regions of the 16S rRNA gene were analyzed by the MiSeq sequencing

155

platform to investigate changes in the bacterial community structure with a universal

156

515F-907R primer assay (Stubner, 2002). MiSeq sequencing was performed by

157

Majorbio Bio-pharm Technology Co., Ltd. (Shanghai, China), on an Illumina® MiSeq

158

sequencer (Illumina, San Diego, CA, USA). The high-throughput sequencing data were

159

processed using Quantitative Insights into Microbial Ecology (QIIME) software

160

(Caporaso et al., 2010), and only sequences >200 bp in length, with an average

161

quality score >20, and without ambiguous base reads were included for the

162

downstream analyses.

EP

TE D

M AN U

SC

RI PT

148

PCR products with target g23 genes were purified using a QIAquick Gel

164

Extraction kit (Qiagen, Tokyo, Japan). The purified DNA was cloned into the pEASY-T1

165

Simple Cloning Vector (TransGen Biotech). Approximately 50 clones from each

166

transformation were chosen from white colonies and were then PCR-amplified with

167

the primers MZIA1bis and MZIA6. The PCR program was the same as described

168

above, except for the reduction of the cycle number to 26. Plasmid DNA was

169

extracted

AC C

163

from

different

clones

after

overnight

cultivation

using 8

ACCEPTED MANUSCRIPT EZNA™ Plasmid Mini kit (Omega Bio-Tek, USA), and approximately 300 ng of DNA was

171

subjected to cycle sequencing reactions. Nucleotide sequences were determined

172

with an ABI PRISM® 3730 Genetic Analyzer (Applied Biosystems, Foster City, CA, USA)

173

using a BigDye® Terminator v3.1 Cycle Sequencing Kit (Applied Biosystems).

174

2.4 Community analysis

175

The nucleotide sequences of g23 were translated into deduced amino acid

176

sequences using the EMBOSS Transeq program at the European Bioinformatics

177

Institute website (https://www.ebi.ac.uk/). The identities of deduced amino acid

178

sequences were analyzed using the ClustalW program through the DNA Data Bank of

179

Japan (DDBJ) website. The closest relatives of all sequences at the amino acid level

180

were identified using a Basic Local Alignment Search Tool (BLAST) search on the

181

National

182

(http://www.ncbi.nlm.nih.gov).

for

Biotechnology

Information

(NCBI)

website

TE D

Center

M AN U

SC

RI PT

170

To evaluate whether the distributions of the g23 clone were related to soil

184

chronosequence and their assemblages in different environments on the global scale,

185

the UNIFRAC statistical analysis tool available at http://unifrac.colorado.edu/ was

186

utilized (Lozupone and Knight, 2005). Briefly, all g23 clones of amino acid sequences

187

with soil chronosequences in this study and those obtained from different

188

environments were analyzed together using a P-test and principal coordinate analysis

189

(PCoA). The alignments were first compared with T-evens, PseudoT-evens,

190

SchizoT-evens, ExoT-evens, marine clones (Filee et al., 2005) and lake clones (Butina

191

et al., 2010; Lopez-Bueno et al., 2009) and then with the g23 sequences obtained

AC C

EP

183

9

ACCEPTED MANUSCRIPT from paddy field soils (Fujii et al., 2008; Liu et al., 2012; Wang et al., 2009a). The

193

alignment was conducted with Clustal X 1.81 (Thompson et al., 1997), and an

194

unrooted phylogenetic tree was drawn using the interactive Tree of Life online

195

program (Letunic and Bork, 2007) based on the distance matrix data calculated by

196

Molecular Evolutionary Genetic Analysis software (MEGA 4.1), and a rooted

197

neighbor-joining tree was constructed by MEGA 4.1 with 1000-fold bootstrap

198

support (Kumar et al., 2004).

SC

RI PT

192

To compare community diversities of bacteria and phages between samples

200

with different cultivation time, principal coordinate analyses based on pairwise

201

weighted UniFrac (Lozupone and Knight, 2005) distances were calculated in the Ape

202

package of R v.3.2.1 (https://www.r-project.org/). BIO-ENV based on the Speraman's

203

rank correlation coefficient (ρ) was used to show the association between the

204

microbial

205

correspondence analysis (CCA) to identify the dominate factors including T4-type

206

phage and soil properties (i.e., pH, EC, Tot C, and Tot N) on the bacterial community

207

composition along the 2000-year paddy soil chronosequence. The environmental

208

variables and diversity indices of phages were then used to construct a factors matrix

209

for variation partitioning analysis (VPA) in R within the vegan package. Contributions

210

of the selected T4-type phage and environmental variables to the bacterial

211

community variation were estimated by VPA.

TE D

M AN U

199

and

environmental

variables.

We

used

canonical

AC C

EP

communities

212

Nonrandom cooccurrence analyses were performed using SparCC, a tool capable

213

of estimating correlation values from compositional data (Friedman and Alm, 2012). 10

ACCEPTED MANUSCRIPT In brief, quality reads were clustered at 97% sequence identity (Edgar, 2010), and

215

the 10 most abundant families of OTUs per samples were retained for analysis. For

216

each network analysis, P-values were obtained by 99 permutations of random

217

selections of the data table and were subjected to the same analytical pipeline.

218

SparCC correlations with statistically significant (P<0.01) were incorporated into

219

network analyses. The nodes in the reconstructed networks represent the OTU

220

families at 97% identity, whereas the edges (that is, connections) correspond to a

221

strong and significant (positive denoted by red, and negative denoted by black)

222

correlation between the nodes. Networks were visualized using the interactive

223

platform Cytoscape (v3.2.1) (Shannon et al., 2003).

224

2.5 Nucleotide sequence accession numbers

225

The MiSeq sequencing reads of the 16S rRNA genes of the soil chronosequences

226

were deposited in the National Center for Biotechnology Information (NCBI)

227

database under accession number SRP095970. The DNA sequences of the g23 clones

228

were deposited in DNA Data Bank of Japan (DDBJ) under accession numbers

229

LC210641 to LC210722.

230

3. Results

231

3.1 Soil physicochemical properties

232

The general physicochemical characteristics for the soil samples are summarized in

233

Table 1. The soil pH dropped significantly from 8.65 in the S0 soil down to 5.62 in the

234

S2000 soil, and soil total C and total N in the cultivated soils were significantly higher

AC C

EP

TE D

M AN U

SC

RI PT

214

11

ACCEPTED MANUSCRIPT than those of the reference soil, varying from 26.13 to 45.60 g C kg-1 and from 0.82 to

236

2.74 g N kg-1, respectively. In contrast, the EC of the paddy soils was significantly

237

lower than that of the S0 soil. It was notable that the concentration of total

238

phosphorus (tot P) in the S0 soil was significantly lower than that in the S100, S300

239

and S700 soils and was significantly higher than that in the S2000 soil.

240

3.2 Taxonomic assemblages of the bacterial community

241

In total, we obtained 944,069 quality sequences from seven samples (mean,

242

134,867), and they were clustered into 46,748 OTUs after trimming and filtration. To

243

compare the soil bacterial community diversity among all the soils, the same survey

244

effort level of 104,000 sequences was randomly selected from each sample in the

245

sequencing library. The dominant groups (relative abundance > 1%) across all soil

246

samples were Proteobacteria, Acidobacteria, Actinobacteria, Planctomycetes,

247

Verrucomicrobia, Chloroflexi, Bacteroidetes, Nitrospirae, Gemmatimonadetes and

248

Firmicutes, and these groups accounted for more than 90% of the bacterial

249

sequences (Fig. 1a). Soil exploitation history was closely correlated with the

250

abundance of some dominant bacterial groups. Proteobacteria decreased in relative

251

abundance in the soils along the chronosequence (r=-0.798, p<0.05), while

252

Verrucomicrobia increased (r=0.755, p<0.05). Actinobacteria and Chloroflexi showed

253

the lowest and highest relative abundances in the control and 50-year duration of

254

cultivation soils, respectively, and then decreased along the chronosequence.

AC C

EP

TE D

M AN U

SC

RI PT

235

255

We analyzed twenty families with a relative abundance of >1.0% in at least one

256

sample (Fig. 1b). Soil exploitation history had significant effects on the proportions of 12

ACCEPTED MANUSCRIPT Chthoniobacteraceae (r=0.839, p<0.05) and Chitinophagaceae (r=0.823, p<0.05) (Fig.

258

S2). Soil pH was significantly corrected with the proportions of Chthoniobacteraceae

259

(r=-0.994, p<0.01), Chitinophagaceae (r=-0.958, p<0.01) and Brucellaceae (r=-0.884,

260

p<0.01). Total C were associated with the proportions of Syntrophobacteraceae

261

(r=0.782, p<0.05), , Chthoniobacteraceae (r=-0.781, p<0.05) and Desulfobulbaceae

262

(r=-0.895, p<0.01), wherever total K with that of Chthoniobacteraceae (r=-0.858,

263

p<0.05) and Chitinophagaceae (r=-0.884, p<0.01) across the soil chronosequence (Fig.

264

S2).

265

3.3 Phylogeny of g23 clones

266

The neighbor-joining tree showed that 202 of the g23 clones (92.6% of clones)

267

obtained in this study belonged to paddy (68.8%), marine (11.9%) and lake groups

268

(11.9%), with the remaining 15 clones ungrouped (7.4%) (Fig. 2; Table 2). Clones

269

belonging to the paddy group were exclusively from the chronosequence soils with

270

cultivation periods of more than 50 years, wherever clones from the control

271

contributed most to the marine and lake groups, with 50% and 27.3% of the clones

272

belonging to those two groups, respectively. Paddy groups VIII, IV and IX were three

273

dominant groups with proportions of 54.7%, 12.9% and 12.9% among the paddy

274

groups, respectively (Table 2).

SC

M AN U

TE D

EP

AC C

275

RI PT

257

Figure S3 shows the phylogenetic comparison of amino acid sequences obtained

276

from the paddy soil chronosequence in this study with amino acid sequences (i) from

277

marine waters (Filee et al., 2005) and lake freshwaters (Butina et al., 2010;

278

Lopez-Bueno et al., 2009) (Fig. S3a) and (ii) from paddy field soils from Japan (Fujii et 13

ACCEPTED MANUSCRIPT al., 2008; Jia et al., 2007; Wang et al., 2009a) and Northeast China (Wang et al.,

280

2009b) (Fig. S3b). Most of the g23 clones obtained in this study formed several

281

clusters separate from the clones of marine and lake origins, except for some clones

282

from soils with 0 and 50 years of cultivation (Fig. S3a). Compared to the paddy field

283

soils from Northeast China, these clones were closer to those from Japanese paddy

284

field soils (Fig. S3b).

RI PT

279

The g23 clone assemblages from the chronosequence soils in this study were

286

further compared with clone assemblages from other environments by UniFrac

287

analysis (Fig. 3). PCoA plot based on PC1/PC2 showed that the g23 phages obtained

288

from the reference soil (S0) were similar to those from Baikal lake, wherever the g23

289

obtained from the soils with 50- and 300-year duration of rice cultivation were

290

similar with those from the paddy soils in Japan and Northeast China. In contrast, the

291

soils with longer cultivation periods were located in a cluster separated from those

292

from marine waters, lake freshwaters and paddy field soils (Fig. 3).

293

3.4 Bacterial and phage diversity

294

We observed that both bacterial and T4-type phage community diversities were

295

highly variable with respect to Faith’s phylogenetic diversity (PD) (ranging from 274

296

to 386), phylotype richness (ranging from 3681 to 5985) for bacteria and phylotype

297

richness (ranging from 5 to 21), shannon index (ranging from 2.19 to 3.45) for phage

298

in the soil chronosequence, respectively (Table 1). Of the soil characteristics that

299

were considered, we found that the soil EC was significantly negatively correlated

300

with both the bacterial phylotype richness (r =-0.833, P <0.05) and the phylogenetic

AC C

EP

TE D

M AN U

SC

285

14

ACCEPTED MANUSCRIPT diversity (r=- 0.804, P <0.05), while TC was positively correlated with bacterial

302

phylotype richness (r =0.785, P <0.05) and negatively correlated with T4-type phage

303

phylotype richness(r =-0.784, P <0.05). All other soil parameters were not related to

304

microbial diversity.

RI PT

301

Principal coordinate analyses (PCoAs) of weighted UniFrac distances were

306

performed to evaluate the β-diversity of bacterial and T4-type phage communities in

307

the soil chronosequences (Fig. 4). Bacterial microbiomes in the chronosequence soils

308

and the control were separated along PCo1 (explaining 51.69% of the total variance),

309

and this separation was mainly associated with the bacterial taxa of

310

Planctomycetaceae, Piscirickettsiaceae, Desulfobulbaceae and Hyphomicrobiaceae

311

(Fig. 4a). The PCo2 was indicative of Nocardioidaceae, Chitinophagaceae and

312

Brucellaceae, separating the bacteria of S50 and S700 with other cultivated soils,

313

which explained 18.17% of the total variance. T4-type phage communities also

314

showed obvious separation among the control, short soil exploitation history (50 and

315

300 y) and others. This separation was mainly associated with paddy groups II, VIII,

316

and IX, the marine group, and the ungrouped phages along the first two axes

317

(explaining 29.95% and 22.73% of total variance, respectively) (Fig. 4b).

318

3.5 Impact of T4-type phage and bottom-up controls on shaping bacterial community

319

The BIO-ENV analysis showed the best correlations of both bottom-up controls (pH

320

and EC) and T4-type phage community (CCA index and Shannon index of T4-type

321

phage) with the bacterial population (Table 3), while only bottom-up controls (total C,

322

EC and Cultivation time) correlated with the phage community in soil

AC C

EP

TE D

M AN U

SC

305

15

ACCEPTED MANUSCRIPT chronosequence (Table S1). CCA analysis further showed that combination of these

324

factors explained the highest percentage of variance of bacterial communities (Fig.

325

5a), and 63.82% of the variance of bacteria could be explained by the selected

326

variables with the first two axes. The CCA index and the Shannon index of T4-type

327

phage and soil EC were significantly correlated with the first axis (explaining 45.26%

328

of the total variance). The soil pH was significantly correlated with the second axis

329

(explaining 18.56% of the total variance). VPA analysis was constructed by

330

designating the explanatory variables and covariates. A total of 80.5% of the variance

331

of bacteria could be explained by T4-type phage community and soil properties. It

332

showed that T4-type phage community explained 17.5% of the total variation and

333

that the soil properties explained 16.8% (Fig. 5b). These two parameters co-explained

334

46.2% of the total variation. A Mantel test showed that the important parameters

335

that significantly determined bacterial community structure were the CCA index of

336

phage (r=0.999, P<0.001), soil EC (r=-0.995, P<0.001) and pH (r=-0.776, P<0.05) (Fig.

337

5b).

EP

TE D

M AN U

SC

RI PT

323

To further explore the relationship between the bacterial community and

339

phages, we constructed a network between the phage clusters and the bacterial taxa

340

(at the family level). Previous study showed that both negative and time lagged

341

positive correlations may suggest the presence of viral lysis (Chow et al., 2014). We

342

hypothesized that the non-random, co-occurrence patterns of phage and bacterial

343

taxa could be used to provide new insights into phages and their potential hosts if

344

the phages and the co-existing bacterial taxa possessed a strong and significantly

AC C

338

16

ACCEPTED MANUSCRIPT correlation. As shown, we found that six bacterial families could be possible T4-type

346

phage hosts of the cooccurrence phages. For instance, Brucellaceae and

347

Xanthomonadaceae were possible hosts of T4-type phages belonging to paddy

348

groups IV, while Hyphomicrobiaceae for paddy groups VIII and Planctomycetaceae

349

and Pseudomonadaceae for paddy groups IX, respectively. Interestingly, phage clones

350

of paddy group II, with the lowest proportion (1%) in paddy fields, showed a strong

351

positive correlation with Nocardioidaceae (Fig. 6).

352

4. Discussion

353

Literature analysis suggests that the contemporary environment has a prominent role

354

in shaping microbial biogeographic patterns (Hanson et al., 2012). There have been

355

many previous studies demonstrating that environmental factors such as pH (Ding et

356

al., 2017; Kaiser et al., 2016; Liu et al., 2014; Luo et al., 2018), soil organic carbon

357

(Luo et al., 2013; Li et al., 2011; Li et al., 2013; Sul et al., 2013; Li et al., 2017), N

358

availability (Ai et al., 2013; Wang et al., 2018) and phosphorus contents (Wei et al.,

359

2017; Wu et al., 2017; Ai et al., 2018) are prevailing environmental factors in shaping

360

the soil bacterial community compositions in various environments (Santini et al.,

361

2015; Zeng et al., 2016). However, phage lysis, which exerts top-down controls on

362

bacterial communities, cannot be reasonably assessed and still poorly elucidate,

363

since methods for directly assessing top-down control and trophic impacts of phages

364

in soils have not been developed yet. Here, we found that bacterial communities

365

were also regulated at least in part by T4-type bacteriophages in addition to the

AC C

EP

TE D

M AN U

SC

RI PT

345

17

ACCEPTED MANUSCRIPT bottom-up controls of environmental factors along the 2000-year paddy soil

367

chronosequence.

368

4.1 Soil physicochemical properties along the soil chronosequence

369

The observed significant accumulation of total C and total N and the clear decrease

370

in pH in the tested soil chronosequence (Table 1) were consistent with the

371

observations of previous studies, which attributed the accumulation of total C and

372

total N in the soil to long-term rice cultivation (Ding et al., 2017; Liu et al., 2016). The

373

accumulation of P on the decadal time scale and a decrease in P on the millennial

374

time scale were unexpected (Table 1). Many agronomic field studies on decadal time

375

scales have shown that P addition by paddy cultivation results in expected increases

376

in P in surface soils (Lee et al., 2004; Zhang et al., 2006). However, in contrast to this

377

tendency, long-term paddy cultivation significantly degraded phosphorus sorption

378

capacity in surface paddy soils due to the loss of phosphorus sorbents (CaCO3, Fe and

379

Al oxides, and clays) (Huang et al., 2014), which is responsible for the rapid decline in

380

P in the later paddy soil chronosequence in this study (Table 1). P becomes gradually

381

depleted and less biologically available during natural soil development (Selmants

382

and Hart, 2010; Wardle et al., 2004). These results suggest that the anthropogenic

383

management of paddy soils not only resulted in the accumulation of various

384

nutrients (e.g., organic carbon, nitrogen) over a considerably long (centurial) time

385

period but also caused negative effects (e.g., a decline in the pH and total P) on

386

millennial time scales. These findings are important for understanding the soil

387

nutrient status at different stages of pedogenesis and for providing theoretical and

AC C

EP

TE D

M AN U

SC

RI PT

366

18

ACCEPTED MANUSCRIPT scientific bases for the rational fertilization and sustainable management of paddy

389

soils during crop production.

390

4.2 Bacterial community composition during paddy soil development

391

Significant shifts in soil bacterial diversity and community composition were detected

392

during the development of paddy soils derived from mudflat (Table 1 and Fig. 4). The

393

greatly increased bacterial community diversity observed from phylogenetic diversity

394

and phylotype richness analyses suggested that the longer cultivation and primary

395

productivity inputs led to more diverse microbial communities under rice cropping

396

(Zhu et al., 2016). This finding could be partially ascribed to the improved nutrient

397

availability (e.g., C and N) resulting from large amounts of fertilization applications

398

and continuous inputs of nutrients by root exudates (Ding et al., 2017; Su et al., 2017)

399

following the conversion of mudflat into paddy soils.

TE D

M AN U

SC

RI PT

388

The bacterial community composition differed significantly between the mudflat

401

and the paddy soil with 50 years of ongoing rice cultivation (Fig. 1 and Fig. 4a). The

402

shift in the soil bacterial community structure was mainly attributed to remarkable

403

enrichments

404

(Gammaproteobacteria),

405

Desulfobulbaceae

406

(Actinobacteria) and Chitinophagaceae (Bacteroidetes) in the paddy soil from areas

407

with 50- and 700-year durations of rice cultivation. In contrast, pronounced declines

408

were observed in Hyphomicrobiaceae (Alphaproteobacteria) and Brucellaceae

409

(Alphaproteobacteria) in mudflats and paddy soils of S50 and S700. The families

EP

400

the

AC C

in

relative

abundances

Planctomycetaceae

(Deltaproteobacteria)

in

of

Piscirickettsiaceae

(Planctomycetes)

mudflats

and

and

Nocardioidaceae

19

ACCEPTED MANUSCRIPT Piscirickettsiaceae, Planctomycetaceae and Desulfobulbaceae in the mudflats are

411

typical marine taxa and are widely distributed in marine environments, such as

412

marine sands, mud flats and shelf seafloors (Nguyen and Landfald, 2015; Staley and

413

Sadowsky, 2016). Most species from these families are halotolerant microbes and

414

even halophilic (Kuever, 2014). In contrast, the families of Hyphomicrobiaceae and

415

Brucellaceae (Fig. 1 and Fig.4a), affiliated with Alphaproteobacteria, can benefit from

416

plants and thrive in the presence of some suitable carbon sources in the soil

417

rhizosphere, thus they were enriched in the cultivated soils (Kyselkova et al., 2014;

418

Oren and Xu, 2014). The enrichment of Nocardioidaceae in the paddy soils of S50

419

and S700 could be attributed to the relatively low organic matter but significantly

420

higher total K in the S50 and S700 soils, as Nocardioidaceae was reported to

421

significantly correlate with soil available elements (K) and dominate in soils with

422

relatively low organic matter and/or in oligotrophic waters (Bell et al., 2013; Huang et

423

al., 2013; Stevens et al., 2007). The members of Chitinophagaceae seemed to be

424

cosmopolitan taxa for both native and agricultural plants and function as

425

carbohydrate degraders adapting to diverse environments (Joseph et al., 2007;

426

Shiratori-Takano et al., 2016). Considering that rice cultivation brought in straw and

427

residue with high contents of cellulose and/or lignin, the increase in the relative

428

abundance of Chitinophagaceae with the cultivation in paddy soils is reasonable.

429

4.3 Phage community during paddy soil development

430

T4-type phage communities were reported to be different among different soil

431

environments (Zheng et al., 2013). The Bio-ENV analysis indicated association of

AC C

EP

TE D

M AN U

SC

RI PT

410

20

ACCEPTED MANUSCRIPT phage communities with the soil properties, like total C and EC (Table S1). The

433

positive correction of total carbon with bacterial phylotype richness, and the

434

negative correlation between total carbon and phage phylotype richness was

435

unexpected. Forest soils with higher organic matter were reported to harbor more

436

diverse assemblages of viruses than agricultural soils in terms of morphological

437

distribution (Williamson et al., 2005).

RI PT

432

Phylogenetic analysis showed that most of the g23 clones obtained in this study

439

formed several clusters separate from the clones of marine and lake origins and

440

closer to those from Japanese paddy field soils relative to those from paddy soils in

441

Northeast China (Fig. S3). UniFrac analysis further showed that g23 clones from the

442

mudflat were similar to those from the aquatic environment, even though they were

443

separated from g23 by relatively short-term cultivation (50-300 years) of the Chinese

444

and Japanese paddy soils; they were also separated from the long-term cultivation

445

(700-2000 years) paddy soils (Fig. 3). It should be noted that the cultivation time of

446

these paddy soils was on the order of several decades in China and Japan (Fujii et al.,

447

2008; Wang et al., 2009b). The T4-type phage community composition was

448

speculated to be similar if the soil experienced the same ecological processes (Liu et

449

al., 2011). However, the phylogenetic and UniFrac analysis of g23 genes in the tested

450

soil chronosequence indicated that the T4-type phage community composition of

451

cultivated soil on centurial and/or millennial time scales was phylogenetically

452

separated from those on decadal time scales (Fig. 3 and Fig. 4b). Previous studies

453

have demonstrated that the distribution of g23 genes varied with the environment

AC C

EP

TE D

M AN U

SC

438

21

ACCEPTED MANUSCRIPT (Filee et al., 2005; Fujii et al., 2008; Jia et al., 2007; Liu et al., 2011; Nakayama et al.,

455

2009; Wang et al., 2011; Zheng et al., 2013). In this study, obvious shifts were

456

observed in the representation of nucleotide sequences of the g23 major capsid

457

protein, emblematic of the T4-type phages, across the chronosequence except for

458

S100 grouping with paddy soils with long term cultivation(S700, S1000, and S2000)

459

(Table 2; Fig. 4b). This shift in the soil phage community was mainly ascribed to the

460

phylogenetic distribution of the g23 clone in paddy groups VIII and IX and the marine

461

group (Fig. 4b). It is the g23 clone in paddy groups VIII and IX that resulted in the

462

phylogenetic separation of the T4-type phages in the paddy soils of S50 and S300

463

from those in the paddy field under longer cultivation times (Fig. 4b). These two g23

464

groups were widely distributed in paddy soils in China and Japan (Liu et al., 2012;

465

Wang et al., 2009c). It is interesting to note that relatively high contents of lake

466

group g23 were detected in the paddy soils of S700 and S2000 (Table 2; Fig. 2).

467

T4-type phages in wetland water and paddy field floodwater were phylogenetically

468

clustered close to those of lake waters but not to those of marine waters (Zheng et

469

al., 2013). Thus, we speculate that long-term (centurial and/or millennial) flooding

470

would result in shifts of T4-type phages towards the lake group.

471

4.4 Impact of T4-type phage and bottom-up controls on shaping bacterial community

472

Environmental factors (e.g. pH, soil organic carbon, N availability) were prevailing

473

factors in shaping soil bacterial community composition in various environments

474

(Kaiser et al., 2016; Liu et al., 2014; Santini et al., 2015; Sul et al., 2013; Zeng et al.,

475

2016). More recently, Ding et al (2017) reported that shifts in the bacterial

AC C

EP

TE D

M AN U

SC

RI PT

454

22

ACCEPTED MANUSCRIPT community structure were mainly driven by changes in the soil physicochemical

477

properties during paddy soil development from the same site, as in this study, which

478

explained 62% of the variation in the bacterial community structure. However,

479

despite the increasing recognition that phages play a significant role in shaping

480

bacterial population dynamics and altering both intra- and inter-specific competition

481

among bacterial hosts, their effect on the bacterial community was ignored in these

482

previous studies. In this study, the results of the VPA analysis showed that over 80%

483

of the variation in the bacterial community structure could be explained by the

484

combined T4-type phage and soil parameters, which were closely associated with

485

continuous rice cultivation (Fig. 5b). This indicated that the shifts in bacterial

486

communities were driven not only by soil properties during the long-term paddy soil

487

development but also by T4-type phage lysis. The Mantel test further confirmed the

488

important role of T4-type phages in shaping the bacterial community (Fig. 5b). These

489

results implied that both the button-up (soil properties) and top-down controls

490

(T4-type phage) shaped the bacterial communities along the 2000-year paddy soil

491

chronosequence.

SC

M AN U

TE D

EP

AC C

492

RI PT

476

Previous studies have often investigated the host range by describing a

493

bacteriophage that can infect multiple strains of the same species of bacteria

494

(Barrangou et al., 2002; Holmfeldt et al., 2007). Recently, some studies have

495

confirmed that network analysis can be used to provide new insights into

496

interactions between bacteriophages and their possible hosts in complex

497

environmental scenarios (Comeau et al., 2010; Flores et al., 2011; Koskella and 23

ACCEPTED MANUSCRIPT Brockhurst, 2014; Weitz et al., 2013). In the present study, we explored the

499

relationship between T4-type phages and bacterial taxa using network analysis, and

500

we were able to visualize the complicated associations between phage subtypes and

501

the potential hosts. The cooccurrence patterns between T4-type phages and

502

microbial taxa suggested that six bacterial families were implicated as being possible

503

T4-type phage hosts (Fig. 6). Nevertheless, we focused specifically on the T4-type

504

myovirus family in lieu of the entire viral community and ignored the function of

505

protist predation on shaping the bacterial communities. The investigated

506

protistan-bacterial associations were far fewer than virus-bacteria associations in

507

previous studies (Chow et al., 2014). Looking forward, there is a need to extend novel

508

culture-independent methods and tools to the study of currently underrepresented

509

viruses. There is also a need to use these methods to assess the functions of protists

510

in shaping microbial communities in diverse terrestrial environments.

511

5. Conclusions

EP

TE D

M AN U

SC

RI PT

498

In conclusion, our study provides evidence for the important role of T4-type

513

phages in shaping the bacterial community with respect to soil development during

514

2000 years of rice cultivation after reclamation from mudflats. The conversion of

515

mudflats into paddy soils significantly increased soil nutrients (e.g., organic carbon

516

and nitrogen) and resulted in the acidification of the soil. Soil bacterial and T4-type

517

phage communities were significantly shifted in the development of the paddy soils

518

over millennial time scales from those at centurial time scales and the mudflat

AC C

512

24

ACCEPTED MANUSCRIPT control. VPA analysis indicated that shifts in the bacterial communities were driven

520

not only by soil properties but also by T4-type phage lysis during the long-term

521

paddy soil development. The Mantel test provided further evidence for the

522

important role of T4-type phages in shaping the bacterial community. This study

523

provides insights into the impact of phage lysis on the profiles of bacterial

524

compositions in the soil environment.

525

Acknowledgements

526

This research was financially supported by the National Natural Science Foundation

527

of China (41671249 and 41721001). We gratefully acknowledge the support of

528

Analysis Center of Agrobiology and Environmental Sciences, Zhejiang University for

529

the network analysis. Yong Li extends his thanks to the Pao Yu-kong and Pao

530

Zhao-Long Scholarship for their financial support.

531

References

532

Ai, C., Zhang, S., Zhang, X., Guo, D., Zhou, W., & Huang, S., 2018. Distinct responses of

533

soil bacterial and fungal communities to changes in fertilization regime and crop

SC

M AN U

TE D

EP

AC C

534

RI PT

519

rotation. Geoderma, 319, 156-166.

535

Ai, C., Liang, G., Sun, J., Wang, X., He, P., & Zhou, W., 2013. Different roles of

536

rhizosphere effect and long-term fertilization in the activity and community

537

structure of ammonia oxidizers in a calcareous fluvo-aquic soil. Soil Biology and

538

Biochemistry, 57, 30-42.

539

Allen, B., Willner, D., Oechel, W.C., Lipson, D., 2010. Top-down control of microbial 25

ACCEPTED MANUSCRIPT 540

activity and biomass in an Arctic soil ecosystem. Environmental Microbiology 12,

541

642-648. Barrangou, R., Yoon, S.S., Breidt, F., Fleming, H.P., Klaenhammer, T.R., 2002.

543

Characterization of Six Leuconostoc fallax Bacteriophages Isolated from an

544

Industrial Sauerkraut Fermentation. Applied and Environmental Microbiology 68,

545

5452-5458.

RI PT

542

Baudoux, A.C., Veldhuis, M.J.W., Noordeloos, A.A.M., van Noort, G., Brussaard, C.P.D.,

547

2008. Estimates of virus- vs. grazing induced mortality of picophytoplankton in

548

the North Sea during summer. Aquatic Microbial Ecology 52, 69-82.

M AN U

SC

546

Bell, T.H., Yergeau, E., Maynard, C., Juck, D., Whyte, L.G., Greer, C.W., 2013.

550

Predictable bacterial composition and hydrocarbon degradation in Arctic soils

551

following diesel and nutrient disturbance.The ISME Journal 7, 1200-1210.

TE D

549

Butina, T.V., Belykh, O.I., Maksimenko, S.Y., Belikov, S.I., 2010. Phylogenetic diversity

553

of T4-like bacteriophages in Lake Baikal, East Siberia. FEMS Microbiology Letters

554

309, 122-129.

556

Campbell, J.I., Albrechtsen, M., Sørensen, J., 1995. Large Pseudomonas phages

AC C

555

EP

552

isolated from barley rhizosphere. FEMS Microbiology Ecology 18, 63-74.

557

Caporaso, J.G., Kuczynski, J., Stombaugh, J., Bittinger, K., Bushman, F.D., Costello, E.K.,

558

Huttley, G.A., Knight, R., 2010. QIIME allows analysis of high-throughput

559

community sequencing data. Nature Methods 7, 335-336.

560

Cheng, Y.-Q., Yang, L.-Z., Cao, Z.-H., Ci, E., Yin, S., 2009. Chronosequential changes of

561

selected pedogenic properties in paddy soils as compared with non-paddy soils. 26

ACCEPTED MANUSCRIPT 562

Geoderma 151, 31-41. Chow, C.E., Kim, D.Y., Sachdeva, R., Caron, D.A., Fuhrman, J.A., 2014. Top-down

564

controls on bacterial community structure: microbial network analysis of

565

bacteria, T4-like viruses and protists.The ISME Journal 8, 816-829.

RI PT

563

Comeau, A.M., Arbiol, C., Krisch, H.M., 2010. Gene network visualization and

567

quantitative synteny analysis of more than 300 marine T4-like phage scaffolds

568

from the GOS metagenome. Molecular Biology and Evolution 27, 1935-1944.

569

Comeau, A.M., Krisch, H.M., 2008. The capsid of the T4 phage superfamily: the

570

evolution, diversity, and structure of some of the most prevalent proteins in the

571

biosphere. Molecular Biology and Evolution 25, 1321-1332.

M AN U

SC

566

Corinaldesi, C., Dell'Anno, A., Magagnini, M., Danovaro, R., 2010. Viral decay and viral

573

production rates in continental-shelf and deep-sea sediments of the

574

Mediterranean Sea. FEMS Microbiology Ecology 72, 208-218.

TE D

572

Cram, J.A., Parada, A.E., Fuhrman, J.A., 2016. Dilution reveals how viral lysis and

576

grazing shape microbial communities. Limnology and Oceanography 61,

577

889-905.

AC C

EP

575

578

Ding, L.-J., Su, J.-Q., Li, H., Zhu, Y.-G., Cao, Z.-H., 2017. Bacterial succession along a

579

long-term chronosequence of paddy soil in the Yangtze River Delta, China. Soil

580 581 582 583

Biology and Biochemistry 104, 59-67.

Edgar, R.C., 2010. Search and clustering orders of magnitude faster than BLAST. Bioinformatics 26, 2460-2461. Engelhardt, T., Orsi, W.D., Jorgensen, B.B., 2015. Viral activities and life cycles in deep 27

ACCEPTED MANUSCRIPT 584

subseafloor sediments. Environmental Microbiology Reports 7, 868-873. Filee, J., Tetart, F., Suttle, C.A., Krisch, H.M., 2005. Marine T4-type bacteriophages, a

586

ubiquitous component of the dark matter of the biosphere. Proceedings of

587

the National Academy of Sciences 102, 12471-12476.

RI PT

585

Flores, C.O., Meyer, J.R., Valverde, S., Farr, L., Weitz, J.S., 2011. Statistical structure of

589

host–phage interactions. Proceedings of the National Academy of Sciences 108,

590

288-297.

593 594 595 596

M AN U

592

Friedman, J., Alm, E.J., 2012. Inferring correlation networks from genomic survey data. PLoS computational biology 8, e1002687.

Fuhrman, J.A., 1999. Marine viruses and their biogeochemical and ecological effects. Nature 399, 541-548.

Fuhrman, J.A., Noble, R.T., 1995. Viruses and protists cause similar bacterial mortality

TE D

591

SC

588

in coastal seawater. Limnology and Oceanography 40, 1236-1242. Fujii, T., Nakayama, N., Nishida, M., Sekiya, H., Kato, N., Asakawa, S., Kimura, M.,

598

2008. Novel capsid genes (g23) of T4-type bacteriophages in a Japanese paddy

599

field. Soil Biology and Biochemistry 40, 1049-1058.

AC C

EP

597

600

Ghosh, D., Roy, K., Williamson, K.E., White, D.C., Wommack, K.E., Sublette, K.L.,

601

Radosevich, M., 2008. Prevalence of lysogeny among soil bacteria and presence

602 603

of 16S rRNA and trzN genes in viral-community DNA. Applied and Environmental Microbiology 74, 495-502.

604

Hanson, C.A., Fuhrman, J.A., Horner-Devine, M.C., Martiny, J.B., 2012. Beyond

605

biogeographic patterns: processes shaping the microbial landscape. Nature 28

ACCEPTED MANUSCRIPT 606

Reviews Microbiology 10, 497-506. Helsley, K. R., Brown, T. M., Furlong, K., & Williamson, K. E., 2014. Applications and

608

limitations of tea extract as a virucidal agent to assess the role of phage

609

predation in soils. Biology and Fertility of Soils 50(2), 263-274.

RI PT

607

Holmfeldt, K., Middelboe, M., Nybroe, O., Riemann, L., 2007. Large variabilities in

611

host strain susceptibility and phage host range govern interactions between lytic

612

marine phages and their Flavobacterium hosts. Applied and Environmental

613

Microbiology 73, 6730-6739.

M AN U

SC

610

614

Huang, J., Sheng, X., He, L., Huang, Z., Wang, Q., Zhang, Z., 2013. Characterization of

615

depth-related changes in bacterial community compositions and functions of a

616

paddy soil profile. FEMS Microbiology Letters 347, 33-42. Huang, L.-M., Thompson, A., Zhang, G.-L., 2014. Long-term paddy cultivation

618

significantly alters topsoil phosphorus transformation and degrades phosphorus

619

sorption capacity. Soil and Tillage Research 142, 32-41.

TE D

617

Jia, Z., Ishihara, R., Nakajima, Y., Asakawa, S., Kimura, M., 2007. Molecular

621

characterization of T4-type bacteriophages in a rice field. Environmental

AC C

622

EP

620

Microbiology 9, 1091-1096.

623

Joseph, B., Patra, R.R., Lawrence, R., 2007. Characterization of plant growth

624

promoting rhizobacteria associated with chickpea (Cicer arietinum L.).

625

International Journal Of Plant Production 1, 141-151.

626

Kaiser, K., Wemheuer, B., Korolkow, V., Wemheuer, F., Nacke, H., Schoning, I.,

627

Schrumpf, M., Daniel, R., 2016. Driving forces of soil bacterial community 29

ACCEPTED MANUSCRIPT 628

structure, diversity, and function in temperate grasslands and forests. Scientific

629

Reports 6, 33696.

631

Kimura, M., Jia, Z.-J., Nakayama, N., Asakawa, S., 2008. Ecology of viruses in soils: Past, present and future perspectives. Soil Science and Plant Nutrition 54, 1-32.

RI PT

630

Koskella, B., Brockhurst, M.A., 2014. Bacteria-phage coevolution as a driver of

633

ecological and evolutionary processes in microbial communities. FEMS

634

Microbiology Reviews 38, 916-931.

SC

632

Kuever, J., 2014. The Family Desulfobulbaceae. 75-86.

636

Kumar, S., Tamura, K., Nei, M., 2004. MEGA3 Integrated software for Molecular

637

Evolutionary Genetics Analysis and sequence alignment. Briefings in

638

Bioinformatics 5, 150-163.

M AN U

635

Kyselkova, M., Almario, J., Kopecky, J., Sagova-Mareckova, M., Haurat, J., Muller, D.,

640

Grundmann, G.L., Moenne-Loccoz, Y., 2014. Evaluation of rhizobacterial

641

indicators of tobacco black root rot suppressiveness in farmers' fields.

642

Environmental Microbiology Reports 6, 346-353.

644 645 646 647

EP

Lee, C.H., Park, C.Y., Park, K.D., Jeon, W.T., Kim, P.J., 2004. Long-term effects of

AC C

643

TE D

639

fertilization on the forms and availability of soil phosphorus in rice paddy. Chemosphere 56, 299-304.

Letunic, I., Bork, P., 2007. Interactive Tree Of Life (iTOL): an online tool for phylogenetic tree display and annotation. Bioinformatics 23, 127-128.

648

Liu, C., Ding, N., Fu, Q., Brookes, P.C., Xu, J., Guo, B., Lin, Y., Li, H., Li, N., 2016. The

649

influence of soil properties on the size and structure of bacterial and fungal 30

ACCEPTED MANUSCRIPT 650

communities along a paddy soil chronosequence. European Journal of Soil

651

Biology 76, 9-18. Liu, J., Sui, Y., Yu, Z., Shi, Y., Chu, H., Jin, J., Liu, X., Wang, G., 2014. High throughput

653

sequencing analysis of biogeographical distribution of bacterial communities in

654

the black soils of northeast China. Soil Biology and Biochemistry 70, 113-122.

RI PT

652

Liu, J., Wang, G., Wang, Q., Liu, J., Jin, J., Liu, X., 2012. Phylogenetic diversity and

656

assemblage of major capsid genes (g23) of T4-type bacteriophages in paddy

657

field soils during rice growth season in Northeast China. Soil Science and Plant

658

Nutrition 58, 435-444.

M AN U

SC

655

Liu, J., Wang, G., Zheng, C., Yuan, X., Jin, J., Liu, X., 2011. Specific assemblages of

660

major capsid genes (g23) of T4-type bacteriophages isolated from upland black

661

soils in Northeast China. Soil Biology and Biochemistry 43, 1980-1984.

TE D

659

Li, Y.C., Li, Y.F., Chang, S.X., Liang, X., Qin, H., Chen, J.H., Xu, Q.F., 2017. Linking soil

663

fungal community structure and function to soil organic carbon chemical

664

composition in intensively managed subtropical bamboo forests. Soil Biology &

665

Biochemistry 107, 19-31.

AC C

EP

662

666

Li, Y.F., Zhang, J.J., Chang, S.X., Jiang, P.K., Zhou, G.M., Fu, S.L., Yan, E.R., Wu, J.S., Lin,

667

L., 2013. Long-term intensive management effects on soil organic carbon pools

668 669

and chemical composition in Moso bamboo (Phyllostachys pubescens) forests in subtropical China. Forest Ecology and Management 303, 121-130.

670

Li, Y., Watanabe, T., Murase, J., Asakawa, S., Kimura, M., 2013. Identification of major

671

capsid gene (g23) of T4-type bacteriophages that assimilate substrate from root 31

ACCEPTED MANUSCRIPT 672

cap cells in aerobic and anaerobic soil conditions using a DNA-SIP approach. Soil

673

Biology and Biochemistry 63, 97-105. Li, Y., Lee, C.G., Watanabe, T., Murase, J., Asakawa, S., Kimura, M., 2011.

675

Identification of microbial communities that assimilate substrate from root cap

676

cells in an aerobic soil using a DNA-SIP approach. Soil Biology and Biochemistry

677

43, 1928-1935.

RI PT

674

Longnecker, K., Wilson, M.J., Sherr, E.B., Sherr, B.F., 2010. Effect of top-down control

679

on cell-specific activity and diversity of active marine bacterioplankton. Aquatic

680

Microbial Ecology 58, 153-165.

M AN U

SC

678

Lopez-Bueno, A., Tamames, J., Velazquez, D., Moya, A., Quesada, A., Alcami, A., 2009.

682

High diversity of the viral community from an Antarctic lake. Science 326,

683

858-861.

684

TE D

681

Lozupone, C., Knight, R., 2005. UniFrac: a new phylogenetic method for comparing microbial

686

8228-8235.

688 689 690 691

Applied

and

Environmental

Microbiology

71,

Luo, Y., Durenkamp, M., De Nobili, M., Lin, Q., Devonshire, B.J., Brookes, P.C., 2013.

AC C

687

communities.

EP

685

Microbial biomass growth, following incorporation of biochars produced at 350 °C or 700 °C, in a silty-clay loam soil of high and low pH. Soil Biology and Biochemistry 57, 513-523.

Luo, Y., Zang, H., Yu, Z., Chen, Z., Gunina, A., Kuzyakov, Y., Xu, J., Zhang, K., Brookes,

692

P.C.,

2017.

Priming

effects

in

biochar

enriched

soils

using

a

693

three-source-partitioning approach: 14 C labelling and 13 C natural abundance. 32

ACCEPTED MANUSCRIPT 694

Soil Biology and Biochemistry 106, 28-35. Luo, G., Rensing, C., Chen, H., Liu, M., Wang, M., Guo, S., Ling, N., Shen, Q., 2018.

696

Deciphering the associations between soil microbial diversity and ecosystem

697

multifunctionality driven by long‐term fertilization management. Functional

698

Ecology, 32(4), 1103-1116.

700

Middelboe, M., Glud, R.N., 2006. Viral activity along a trophic gradient in continental margin sediments off central Chile. Marine Biology Research 2, 41-51.

SC

699

RI PT

695

Nakayama, N., Asakawa, S., Kimura, M., 2009. Comparison of g23 gene sequence

702

diversity between Novosphingobium and Sphingomonas phages and phage

703

communities in the floodwater of a Japanese paddy field. Soil Biology and

704

Biochemistry 41, 179-185.

M AN U

701

Nakayama, N., Okumura, M., Inoue, K., Asakawa, S., Kimura, M., 2007. Seasonal

706

variations in the abundance of virus-like particles and bacteria in the floodwater

707

of a Japanese paddy field. Soil Science and Plant Nutrition 53, 420-429.

TE D

705

Needham, D.M., Chow, C.E., Cram, J.A., Sachdeva, R., Parada, A., Fuhrman, J.A., 2013.

709

Short-term observations of marine bacterial and viral communities: patterns,

AC C

710

EP

708

connections and resilience.The ISME Journal 7, 1274-1285.

711

Nguyen, T.T., Landfald, B., 2015. Polar front associated variation in prokaryotic

712

community structure in Arctic shelf seafloor. Frontiers in Microbiology 6, 17.

713

Oren, A., Xu, X.-W., 2014. The Family Hyphomicrobiaceae. 247-281.

714

Pansu, M., Gautheyrou, J., 2007. Handbook of soil analysis: mineralogical, organic

715

and inorganic methods. Springer Science & Business Media, pp551-833. 33

ACCEPTED MANUSCRIPT Santini, T.C., Warren, L.A., Kendra, K.E., 2015. Microbial Diversity in Engineered

717

Haloalkaline Environments Shaped by Shared Geochemical Drivers Observed in

718

Natural Analogues. Applied and Environmental Microbiology 81, 5026-5036.

719

Selmants, P.C., Hart, S.C., 2010. Phosphorus and soil development_ does the Walker

720

RI PT

716

and Syers model apply to semiarid ecosystems. Ecology 91, 474-484.

Shannon, P., Markiel, A., Ozier, O., Baliga, N.S., Wang, J.T., Ramage, D., Amin, N.,

722

Schwikowski, B., Ideker, T., 2003. Cytoscape: A software environment for

723

integrated models of biomolecular interaction networks. Genome Research 13,

724

2498-2504.

M AN U

SC

721

Shiratori-Takano, H., Takano, H., Ueda, K., 2016. Whole-genome sequence of

726

Filimonas lacunae, a bacterium of the family Chitinophagaceae characterized by

727

marked colony growth under a high-CO2 atmosphere. Genome Announcements

728

4, e00667-00616.

TE D

725

Srinivasiah, S., Lovett, J., Ghosh, D., Roy, K., Fuhrmann, J. J., Radosevich, M., &

730

Wommack, K. E., 2015. Dynamics of autochthonous soil viral communities

731

parallels dynamics of host communities under nutrient stimulation. FEMS

AC C

732

EP

729

Microbiology Ecology 91(7).

733

Staley, C., Sadowsky, M.J., 2016. Regional Similarities and Consistent Patterns of Local

734

Variation in Beach Sand Bacterial Communities throughout the Northern

735

Hemisphere. Applied and Environmental Microbiology 82, 2751-2762.

736

Staniewski, M.A., Short, C.M., Short, S.M., 2012. Contrasting community versus

737

population-based estimates of grazing and virus-induced mortality of 34

ACCEPTED MANUSCRIPT 738

phytoplankton. Microbial Ecology 64, 25-38. Stevens, H., Brinkhoff, T., Rink, B., Vollmers, J., Simon, M., 2007. Diversity and

740

abundance of Gram positive bacteria in a tidal flat ecosystem. Environmental

741

Microbiology 9, 1810-1822.

RI PT

739

Stubner, S., 2002. Enumeration of 16S rDNA of Desulfotomaculum lineage 1 in rice

743

field soil by real-time PCR with SybrGreen(TM) detection. Journal of

744

Microbiological Methods 50, 155-164.

SC

742

Su, J.-Q., Xia, Y., Yao, H.-Y., Li, Y.-Y., An, X.-L., Singh, B.K., Zhang, T., Zhu, Y.-G., 2017.

746

Metagenomic assembly unravel microbial response to redox fluctuation in acid

747

sulfate soil. Soil Biology and Biochemistry 105, 244-252.

M AN U

745

Sul, W.J., Asuming-Brempong, S., Wang, Q., Tourlousse, D.M., Penton, C.R., Deng, Y.,

749

Rodrigues, J.L.M., Adiku, S.G.K., Jones, J.W., Zhou, J., Cole, J.R., Tiedje, J.M., 2013.

750

Tropical agricultural land management influences on soil microbial communities

751

through its effect on soil organic carbon. Soil Biology and Biochemistry 65,

752

33-38.

754

EP

Suttle, C.A., 1994. The significance of viruses to mortality in aquatic microbial

AC C

753

TE D

748

communities. Microbial Ecology 28, 237-243.

755

Suttle, C.A., 2005. Viruses in the sea. Nature 437, 356-361.

756

Suttle, C.A., 2007. Marine viruses--major players in the global ecosystem. Nature

757

Reviews Microbiology 5, 801-812.

758

Thompson, J.D., Gibson, T.J., Plewniak, F., Jeanmougin, F., Higgins, D.G., 1997. The

759

CLUSTAL_X Windows Interface Flexible Strategies for Multiple Sequence 35

ACCEPTED MANUSCRIPT 760

Alignment Aided by Quality Analysis Tools. Nucleic acids research 25,

761

4876-4882. Wang, G., Hayashi, M., Saito, M., Tsuchiya, K., Asakawa, S., Kimura, M., 2009a. Survey

763

of major capsid genes (g23) of T4-type bacteriophages in Japanese paddy field

764

soils. Soil Biology and Biochemistry 41, 13-20.

RI PT

762

Wang, G., Jin, J., Asakawa, S., Kimura, M., 2009b. Survey of major capsid genes (g23)

766

of T4-type bacteriophages in rice fields in Northeast China. Soil Biology and

767

Biochemistry 41, 423-427.

M AN U

SC

765

Wang, G., Murase, J., Taki, K., Ohashi, Y., Yoshikawa, N., Asakawa, S., Kimura, M.,

769

2009c. Changes in major capsid genes (g23) of T4-type bacteriophages with soil

770

depth in two Japanese rice fields. Biology and Fertility of Soils 45, 521-529.

771

Wang, G., Yu, Z., Liu, J., Jin, J., Liu, X., Kimura, M., 2011. Molecular analysis of the

772

major capsid genes (g23) of T4-type bacteriophages in an upland black soil in

773

Northeast China. Biology and Fertility of Soils 47, 273-282.

TE D

768

Wang, Q., Wang, C., Yu, W., Turak, A., Chen, D., Huang, Y.,Ao, J., Jiang, Y., Huang, Z.,

775

2018. Effects of Nitrogen and Phosphorus Inputs on Soil Bacterial Abundance,

777 778 779

AC C

776

EP

774

Diversity, and Community Composition in Chinese Fir Plantations. Frontiers in microbiology, 9.

Wardle, D.A., Walker, L.R., Bardgett, R.D., 2004. Ecosystem properties and forest decline in contrasting long-term chronosequences. Science 305, 509-513.

780

Wei, X., Hu, Y., Peng, P., Zhu, Z., Atere, C. T., O’Donnell, A. G., Wu, J., Ge, T., 2017.

781

Effect of P stoichiometry on the abundance of nitrogen-cycle genes in 36

ACCEPTED MANUSCRIPT 782

phosphorus-limited paddy soil. Biology and Fertility of Soils, 53(7), 767-776.

783

Weinbauer, M.G., 2004. Ecology of prokaryotic viruses. FEMS Microbiology Reviews

784

28, 127-181. Weinbauer, M.G., Christaki, U., Nedoma, J., Simek, K., 2003. Comparing the effects of

786

resource enrichment and grazing on viral production in a meso-eutrophic

787

reservoir. Aquatic Microbial Ecology 31, 137-144.

RI PT

785

Weinbauer, M.G., Hornak, K., Jezbera, J., Nedoma, J., Dolan, J.R., Simek, K., 2007.

789

Synergistic and antagonistic effects of viral lysis and protistan grazing on

790

bacterial biomass, production and diversity. Environmental Microbiology 9,

791

777-788.

M AN U

SC

788

Weitz, J.S., Poisot, T., Meyer, J.R., Flores, C.O., Valverde, S., Sullivan, M.B., Hochberg,

793

M.E., 2013. Phage-bacteria infection networks. Trends in Microbiology 21,

794

82-91.

TE D

792

Weitz, J.S., Stock, C.A., Wilhelm, S.W., Bourouiba, L., Coleman, M.L., Buchan, A.,

796

Follows, M.J., Fuhrman, J.A., Jover, L.F., Lennon, J.T., Middelboe, M.,

797

Sonderegger, D.L., Suttle, C.A., Taylor, B.P., Frede Thingstad, T., Wilson, W.H., Eric

799 800 801 802 803

AC C

798

EP

795

Wommack, K., 2015. A multitrophic model to quantify the effects of marine viruses on microbial food webs and ecosystem processes.The ISME Journal 9, 1352-1364.

Weitz, J.S., Wilhelm, S.W., 2012. Ocean viruses and their effects on microbial communities and biogeochemical cycles. F1000 Biology Reports 4, 17. Wilhelm, S.W., Matteson, A.R., 2008. Freshwater and marine virioplankton: a brief 37

ACCEPTED MANUSCRIPT 804

overview of commonalities and differences. Freshwater Biology 53, 1076-1089. Williamson, K.E., Wommack, K.E., Radosevich, M., 2003. Sampling Natural Viral

806

Communities from Soil for Culture-Independent Analyses. Applied and

807

Environmental Microbiology 69, 6628-6633.

RI PT

805

Williamson, K. E., Radosevich, M., & Wommack, K. E., 2005. Abundance and diversity

809

of viruses in six Delaware soils. Applied and Environmental Microbiology 71(6),

810

3119-3125.

SC

808

Wu, X., Ge, T., Yan, W., Zhou, J., Wei, X., Chen, L., Chen, X., Nannipieri, P., Wu, J., 2017.

812

Irrigation management and phosphorus addition alter the abundance of carbon

813

dioxide-fixing autotrophs in phosphorus-limited paddy soil. FEMS microbiology

814

ecology 93(12), fix154.

M AN U

811

Zeng, J., Liu, X., Song, L., Lin, X., Zhang, H., Shen, C., Chu, H., 2016. Nitrogen

816

fertilization directly affects soil bacterial diversity and indirectly affects bacterial

817

community composition. Soil Biology and Biochemistry 92, 41-49.

TE D

815

Zhang, Q., Wang, G.H., Feng, Y.K., Sun, Q.Z., Witt, C., Dobermann, A., 2006. Changes

819

in soil phosphorus fractions in a calcareous paddy soil under intensive rice

AC C

820

EP

818

cropping. Plant and Soil 288, 141-154.

821

Zheng, C., Wang, G., Liu, J., Song, C., Gao, H., Liu, X., 2013. Characterization of the

822

major capsid genes (g23) of T4-type bacteriophages in the wetlands of

823

northeast China. Microbial Ecology 65, 616-625.

824

Zhong, X., Jacquet, S., 2014. Differing assemblage composition and dynamics in

825

T4-like myophages of two neighbouring sub-alpine lakes. Freshwater Biology 59, 38

ACCEPTED MANUSCRIPT 826

1577-1595. Zhu, Y.-G., Su, J.-Q., Cao, Z., Xue, K., Quensen, J., Guo, G.-X., Yang, Y.-F., Zhou, J., Chu,

828

H.-Y., Tiedje, J.M., 2016. A buried Neolithic paddy soil reveals loss of microbial

829

functional diversity after modern rice cultivation. Science Bulletin 61,

830

1052-1060.

AC C

EP

TE D

M AN U

SC

RI PT

827

39

ACCEPTED MANUSCRIPT Figure Legends

832

Fig. 1. Relative abundance of the soil bacterial community composition at the (a)

833

phylum and (b) family levels. The relative abundance is expressed as the

834

average percentage of the targeted sequences to the total high-quality bacterial

835

sequences of each soil (S0, S50, S100, S300, S700, S1000 and S2000).

836

RI PT

831

Fig. 2. Neighbor-joining phylogenetic tree of g23 sequences obtained in this study.

The black and gray circles indicate internal nodes with at least 90% and 50%

838

bootstrap support, respectively. The black and white squares and triangle denote

839

the reference g23 clones obtained from the paddy, upland soils and marine and

840

lake waters, respectively.

M AN U

841

SC

837

Fig. 3. Principal coordinate analysis of the g23 assemblages obtained in this study with those obtained from marine waters (Filee et al., 2005), lake waters (Butina

843

et al., 2010; Lopez-Bueno et al., 2009), paddy field soils in Japan (Fujii et al.,

844

2008; Jia et al., 2007; Wang et al., 2009a) and NE China (Wang et al., 2009b) by

845

the UniFrac method. Ellipses indicate g23 clusters obtained from different

846

habitats.

AC C

EP

TE D

842

847

Fig. 4. Ordination of soil (a) bacterial and (b) phage communities via principal

848

coordinate analysis (PCoA) based on weighted Unifrac distances. Arrows

849

represent bacteria at the family level and phage clusters whose relative

850

abundances were significantly (P < 0.05) correlated with the first two axes for

851

bacteria and phages, respectively.

40

ACCEPTED MANUSCRIPT Fig. 5. (a) Canonical correspondence analysis (CCA) compares soil bacterial

853

community structure and top-down parameters, including the CCA index and

854

Shannon index of the phages, and bottom-up parameters, including pH and

855

electrical conductivity (EC). (b) Variation partition analysis (VPA) and Mantel

856

test of the relationships between top-down controls (i.e., diversity index of

857

phage), bottom-up controls (i.e., pH and EC) and the microbial community.

858

Fig. 6. Network analysis revealing the cooccurrence patterns between phage

859

subtypes and bacterial taxa (at the family level). The nodes represent the top

860

ten OTUs of bacteria and clusters of phages, whereas the edges (that is,

861

connections) correspond to a strong and significant (positive denoted by red,

862

and negative denoted by black) correlation between nodes.

AC C

EP

TE D

M AN U

SC

RI PT

852

41

ACCEPTED MANUSCRIPT

Soil type

pH

Total C

Total N

Total P

Total K

g kg-1

g kg-1

g kg-1

g kg-1 12.99c

EC

Number of

Phylogenetic

Number of

Shannon

dS m-1

phylotype*

diversity*

phylotype#

index#

3681

274

21

3.45

SC

Soils

RI PT

Table 1. Basic characteristics and number of sequences and OTUs for each soil chronosequence.

S0

Mudflat

8.65a

4.83e

0.56f

0.55c

S50

Paddy field

8.22b

26.13d

0.82e

0.57c

18.27a

0.34b

5407

366

19

3.40

S100

Paddy field

6.93d

30.11c

1.34d

0.72b

12.07c

0.41b

5985

386

9

2.70

S300

Paddy field

6.77d

37.94b

1.63c

0.76b

10.24d

0.15c

5337

346

7

2.49

S700

Paddy field

7.72c

37.58b

1.81b

1.07a

15.72b

0.34b

5397

358

11

2.94

S1000

Paddy field

6.44e

32.56c

1.21d

0.5c

7.28e

0.23bc

5356

353

5

2.19

S2000

Paddy field

5.62f

45.6a

0.39d

5.97e

0.13c

4796

326

10

2.84

AC C

2.74a

2.75a

M AN U

TE D

EP

863

864

Note: The symbol * and # indicates diversities of bacteria and phages, respectively. EC: electric conductivity. Significant differences between the

865

soil chronosequence were determined using one-way ANOVA followed by Duncan's multiple range test at P < 0.05, in which the conditions

866

of normality and homogeneity of variance were met.

42

ACCEPTED MANUSCRIPT Table 2. Phylogenetic groups of g23 amino acids for each soil chronosequence. Phylogenetic S0

S50

S100

S300

S700

S1000

S2000

g23 amino acids*

44

37

33

17

27

25

19

Marine Groups#

0.5

-

-

-

0.07

-

-

Lake Groups

0.27

0.05

-

-

0.15

0.04

0.26

Paddy groups

-

0.9

0.91

1

0.78

0.96

0.74

I

-

-

0.12

-

0.04

-

-

II

-

0.03

-

-

-

-

0.05

IV

-

0.22

0.15

0.18

-

-

0.11

V

-

-

-

0.29

-

-

0.11

VI

-

0.16

-

-

-

-

-

Paddy Un

M AN U

TE D 0.08

0.64

-

0.7

0.96

0.47

-

0.22

-

0.53

0.04

-

-

-

0.19

-

-

-

-

-

0.23

0.05

0.09

-

-

-

-

AC C

Ungrouped

-

EP

IX

RI PT

group

VIII

868

Soils

SC

867

Note: The symbol * and # indicates number and percent of g23, respectively.

43

ACCEPTED MANUSCRIPT 869

Table 3. BIO-ENV analysis based on the Speraman's rank correlation coefficient (ρ), showing the

870

association between bacterial composition, based on the relative abundance of the 16S rRNA

871

gene detected, and environmental variables. Spearman's coefficient (ρ)

RI PT

Combined variables

0.7454 0.9013 0.9247 0.9351 0.9143 0.8909 Note: P (Phage), Shannon, Phylotype and CCA1 indicate diversity indexes of phage.

M AN U TE D EP AC C

872

SC

EC pH + EC pH + EC + Shannon P pH + EC + CCA1 P + Shannon P pH + Tot C + EC + CCA1 P + Shannon P pH + Tot N + EC + CCA1 P + Phylotype P

44

ACCEPTED MANUSCRIPT

Fig. 2

S0-7 S0-10 S0-18 S0-19 S0-25 S0-33 S0-40

Lake Groups S0-34

S50-35 S50-10 S50-15 S300-3 S300-6 S300-7 S300-15 S300-17 S2000-12 S2000-4 S50-19 S50-20 S0-6

Paddy clones Paddy Group V

RI PT

Paddy Group IX Paddy clones

S50-9 S50-26 S50-28 S50-33 S0-4 S2000-15

Paddy Group II

S50-30

S700-12

SC

S0-13

Lake Groups

S1000-16 S2000-2 S2000-16 S0-14

M AN U

S100-19 S100-31 S100-32 S100-6 S100-23 S100-24 S100-28 S700-2

TE D

S700-21 S0-3 S0-15 S0-24 S0-29 S0-37 S0-43 S0-8 S0-17 S0-26 S0-30 S0-39 S0-44 S0-11 S0-20 S0-27 S0-32 S0-41 S0-12 S0-21 S0-28 S0-35 S0-42 S50-8, -21,-34 S100-1,-3~-5,-7,-9~-11 S100-13~-18,-20,22 S100-25~-27,-29,-30 S700-1,-3~-4,-6~-11 S700-13, -15~-18 S700-22,-24,-26~-27

S1000-1~-15,-17~-25 S2000-3,-5~-7,-10 S2000-13~-14,-17~-18 S50-1 S50-24 S100-8 S300-4 S2000-19 S50-7 S50-29 S100-12 S300-5 S50-11 S50-37 S100-21 S300-10 S50-23 S100-2 S100-33 S2000-11 S50-25 S0-23 S50-5 S50-17 S300-8 S300-13 S50-6 S50-18 S300-9 S300-14 S50-13 S300-1 S300-11 S300-16 S50-16 S300-2 S300-12 S700-5 S0-1 S0-2 S700-19 S700-23 S2000-8 S700-20 S2000-1

Marine Groups

Paddy Group VIII

Paddy Group IV

EP AC C

Paddy Group I

Lake Groups Paddy Group IX

Lake Groups

S50-31

Paddy Group VI

S50-12 S50-27 S50-3 S50-32 S50-36 S50-14

S0-22 S50-22

Lake Groups

S0-31 S50-2 S2000-9 S0-5,-9,-16,-36,-38 S50-4 S700-25

0.05 Upland

Paddy

Marine

Lake

T-+PseudoT-evens

Fig. 1b

100

others Flavobacteriaceae Xanthomonadaceae Planctomycetaceae Desulfobulbaceae Brucellaceae Ellin515 Solibacteraceae Pseudomonadaceae Piscirickettsiaceae Comamonadaceae Koribacteraceae Hyphomicrobiaceae Gaiellaceae Pirellulaceae Chitinophagaceae Chthoniobacteraceae Rhodospirillaceae Thermodesulfovibrionaceae Syntrophobacteraceae Nocardioidaceae

M AN U

99 96

60

TE D

40

30

Relative abundance(%)

others Euryarchaeota Spirochaetes Chlamydiae Cyanobacteria Chlorobi Firmicutes Gemmatimonadetes Nitrospirae Bacteroidetes Chloroflexi Verrucomicrobia Planctomycetes Actinobacteria Acidobacteria Proteobacteria

80

0 S0

S50

S100

S300

S700

S1000

EP

20

S2000

AC C

Relative abundance (%)

SC

Fig. 1a

RI PT

ACCEPTED MANUSCRIPT

20

10

0 S0

S50

S100

S300

S700

S1000

S2000

RI PT

ACCEPTED MANUSCRIPT

Fig. 3

SC

0.4 0.3

S100

S0

0.1 0 -0.1

S700

chronosequence soil Paddy soil of China Paddy soil of Japan Marine water Lake freshwater

TE D

S50

-0.2 -0.3 -0.4

-0.4

AC C

-0.6

-0.2

EP

PCo2: 14.93%

0.2

M AN U

S1000S2000

0

S300

0.2

PCo1: 21.98%

0.4

0.6

Fig. 4b

2

0.15 S2000 Chitinophagaceae Brucellaceae Hyphomicrobiaceae

0

Planctomycetaceae

S300

Piscirickettsiaceae Desulfobulbaceae

TE D

S100

M AN U

S0

-1

S0

S700 Nocardioidaceae

0.1

PCo2: 22.73%

S1000

-3

-2

-1

EP

-2

0

PCo1: 51.69%

1

Ungrouped Marine Groups

0.05

S2000 S1000

Paddy group II

0

S100 S700

-0.05

Paddy group IX S50

S50

AC C

PCo2: 18.17%

1

SC

Fig. 4a

RI PT

ACCEPTED MANUSCRIPT

-0.1 -0.15

-0.1

Paddy group VIII

S300

-0.05

0

PCo1: 29.95%

0.05

0.1

Fig. 5b

Fig. 5a

SC

2 S2000

T-like phage Environmental 46.2% 17.5% Variables 16.8%

M AN U

1

S1000 S300

S0

S100

0 EC

Phage CCA1

Phage Shannon index pH

-1

TE D

CCA2: 18.56%

RI PT

ACCEPTED MANUSCRIPT

S700

Mantel test: : CCA: (r=0.999, P<0.001) EC: (r=-0.995, P<0.001) pH: (r=-0.776, P<0.05)

S50

-2 -0.8

0.2

EP

-1.8

CCA1: 45.26%

AC C

-2.8

Bacterial community

1.2

ACCEPTED MANUSCRIPT

AC C

EP

TE D

M AN U

SC

RI PT

Fig. 6

-0.5

0

0.5

ACCEPTED MANUSCRIPT Highlights

AC C

EP

TE D

M AN U

SC

RI PT

Long-term rice cultivation resulted in an accumulation of paddy soil nutrients. Bacterial and T4-type phage communities distinctly shifted along the soil chronosequence. Bacterial community was strongly affected by both T4-type phage and soil properties. Network analysis indicated six bacterial taxa were potential hosts of T4-type phage.