Erwinia amylovora phage vB_EamM_Y3 represents another lineage of hairy Myoviridae

Erwinia amylovora phage vB_EamM_Y3 represents another lineage of hairy Myoviridae

Accepted Manuscript Erwinia amylovora phage vB_EamM_Y3 represents another lineage of hairy Myoviridae Colin Buttimer, Yannick Born, Alan Lucid, Martin...

5MB Sizes 2 Downloads 54 Views

Accepted Manuscript Erwinia amylovora phage vB_EamM_Y3 represents another lineage of hairy Myoviridae Colin Buttimer, Yannick Born, Alan Lucid, Martin J. Loessner, Lars Fieseler, Aidan Coffey PII:

S0923-2508(18)30062-7

DOI:

10.1016/j.resmic.2018.04.006

Reference:

RESMIC 3652

To appear in:

Research in Microbiology

Received Date: 20 December 2017 Accepted Date: 23 April 2018

Please cite this article as: C. Buttimer, Y. Born, A. Lucid, M.J. Loessner, L. Fieseler, A. Coffey, Erwinia amylovora phage vB_EamM_Y3 represents another lineage of hairy Myoviridae, Research in Microbiologoy (2018), doi: 10.1016/j.resmic.2018.04.006. This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

Page 1 of 36

ACCEPTED MANUSCRIPT

Erwinia amylovora phage vB_EamM_Y3 represents another lineage

2

of hairy Myoviridae

3

Colin Buttimera, Yannick Bornb,c‡, Alan Lucida†, Martin J. Loessnerb, Lars Fieselerb‡, Aidan

4

Coffeya*

5

a

Department of Biological Sciences, Cork Institute of Technology, Cork, Ireland.

6

b

Institute of Food, Nutrition, and Health, ETH Zurich, Zürich, Switzerland.

7

c

Agroscope, Research Division Plant Protection, Wädenswil, Switzerland

8



9

Sciences, Wädenswil, Switzerland.

M AN U

SC

RI PT

1

Present address: Institute of Food and Beverage Innovation, Zurich University of Applied

TE D

10

Email addresses

12

Colin Buttimer - [email protected]

13

Yannick Born - [email protected]

14

Martin J. Loessner - [email protected]

15

Lars Fieseler - [email protected]

16

Aidan Coffey - [email protected] *Correspondence and reprints

AC C

EP

11

17

18

1

Page 2 of 36

ACCEPTED MANUSCRIPT Abstract

20

To date, a small number of jumbo myoviruses have been reported to possess atypical

21

whisker-like structures along the surface of their contractile tails. Erwinia amylovora phage

22

vB_EamM_Y3 is another example. It possesses a genome of 261,365 kbp with 333 predicted

23

ORFs. Using a combination of BLASTP, Interproscan and HHpred, about 21% of its putative

24

proteins could be assigned functions involved in nucleotide metabolism, DNA replication,

25

virion structure and cell wall degradation. The phage was found to have a signal-arrest-

26

release (SAR) endolysin (Y3_301) possessing a soluble lytic transglycosylase domain. Like

27

other SAR endolysins, inducible expression of Y3_301 caused Escherichia coli lysis, which is

28

dependent on the presence of an N-terminal signal sequence. Phylogenetic analysis showed

29

that its closest relatives are other jumbo phages including Pseudomonas aeruginosa phage

30

PaBG and P. putida phage Lu11, sharing 105 and 87 homologous proteins respectively. Like

31

these phages, Y3 also shares a distant relationship to Ralstonia solanacearum phage ΦRSL1

32

(sharing 55 homologous proteins). As these phages are unrelated to the Rak2-like group of

33

hairy phages, Y3 along with Lu11 represent a second lineage of hairy myoviruses.

34

Introduction

35

The order of Caudovirales (the tailed phages) consists of three families of bacteriophage

36

(phage), with about 25% consisting of members of the family Myoviridae [1]. These

37

members possess virions with contractile tails, and include the phages with the largest

38

genomes sizes [2]. Those with genomes greater than 200 kbp are typically referred to as

39

“jumbo phages”; and currently, there are over ninety in the databases with most having

40

Gram negative hosts [3,4]. Those with the largest genomes identified to date are Bacillus

41

phage G (498 kbp) and those belonging to the Rak2-like group such Cronobacter sakazakii

AC C

EP

TE D

M AN U

SC

RI PT

19

2

Page 3 of 36

ACCEPTED MANUSCRIPT phage vB_CsaM_GAP32 (358 kbp), Serratia marcescens phage BF (357 kbp) and

43

Pectobacterium phage vB_PcaM_CBB (355 kbp) [5–7]. Erwinia phages are of relevance in the

44

context of alternative methods for bacterial crop disease control. Indeed, recent years have

45

seen intensive investigation into the use of these phages for biocontrol of contagious fire

46

blight of apples and pears. Such research is motivated by increasing reports of resistance to

47

other antibacterial agents [8–10]. The search for new phages for biocontrol applications has

48

resulted in the identification of a number of jumbo phages such as Erwinia amylovora jumbo

49

phages Ea35-70 (accession no. NC_023557.1; [11]), φEaH1 [12], φEaH2 [13], including

50

another sixteen jumbo phages which have had their genome sequences only recently

51

published [14].

52

Another important bacterial pathogen is Ralstonia solanacearum, which colonises the xylem

53

and causes wilt in a wide range of economically important crops. Its relevance has resulted

54

in the discovery of Ralstonia solanacearum phage ΦRSL1 (231 kbp, accession no.

55

NC_010811.2), described in by Yamada et al (2010) and found to possess very little

56

homology to known myoviruses. In recent times, a number of other jumbo phages have

57

been identified with low but significant homology to phage ΦRSL1 at the protein level.

58

These are Pseudomonas phage Lu11 (280 kbp, accession no. NC_017972.1) and PaBG (258

59

kbp, accession no. NC_022096.1) as well as phage NTCB (258 kbp, accession no.

60

LT598654.1), whose genome sequence was identified in a metagenomic study of a decaying

61

Trichodesmium bloom, and whose host range remains unknown [16–18]. As well as the

62

recently described Erwinia amylovora phage vB_EamM_Yoloswag (259 kbp, accession no.

63

KY448244), which can also be assigned to this group [14].

AC C

EP

TE D

M AN U

SC

RI PT

42

3

Page 4 of 36

ACCEPTED MANUSCRIPT 64

Here we report a newly isolated jumbo phage, vB_Eam_Y3 of E. amylovora, which possesses

65

atypical whisker-like structures along the surface of its contractile tail; and has been found

66

to share homology (at the protein level) to Lu11 and PaBG with distant homology to ΦRSL1.

RI PT

67

Materials and Methods

69

Bacterial Strains, Phage, and Cultivation Techniques

70

All Erwinia and Pantoea strains were grown in LB medium at 28°C. Phage Y3 was routinely

71

cultivated at 28°C with the soft agar overlay method [19] using LB plates and LC soft agar (LB

72

supplemented with 10 mM CaCl2 and 2 mM MgSO4; 0.4% agar), and E. amylovora 1/79 as

73

host strain. The phage was originally isolated from soil as described previously [20]. Briefly,

74

10 g of soil was enriched in 200 mL SM-buffered (50 mM Tris, 100 mM NaCl, 8 mM MgSO4,

75

pH 7.4) LB broth, which had been inoculated 1:20 with and overnight culture of E. amylovora

76

1/79. After overnight incubation at 28°C, filtered supernatants were spotted onto a freshly

77

growing lawn of E. amylovora 1/79 and plaques were picked with a Pasteur pipette. After

78

isolation, Y3 was propagated to high titers with the soft agar overlay method. Purification for

79

subsequent electron microscopy and DNA isolation was achieved by CsCl density

80

centrifugation [21]. Host range analysis involved the use of the soft agar overlay method

81

with the use of different bacterial strains and the spotting of phage diluents, a positive result

82

was recorded with the detection of plaques after an overnight incubation.

AC C

EP

TE D

M AN U

SC

68

83

84

85 4

Page 5 of 36

ACCEPTED MANUSCRIPT Transmission Electron Microscopy

87

For transmission electron microscopy, CsCl-purified phages were negatively stained with 2%

88

ammonium molybdate and then analysed as previously as described [22], with 10 particles

89

being used to determine virion dimensions.

RI PT

86

90

DNA Isolation and Sequencing

92

For the extraction of phage DNA, CsCl-purified phage particles were dialyzed against a 1,000-

93

fold excess of SM buffer (6 h, RT). Free nucleic acids were degraded 15 min. at 37°C by

94

DNase I (final concentration: 1 U/ml) and RNase A (50 µg/ml). Nucleases were inactivated

95

and phage capsids degraded 1 h at 56°C with EDTA (20 mM, pH 8.0), proteinase K (50 μg/ml),

96

and SDS (0.5% [wt/vol]). DNA was then purified using a phenol-chloroform extraction

97

followed by ethanol-precipitation [23]. DNA sequencing was outsourced to GATC Biotech

98

(Konstanz, Germany). To conduct sequencing DNA libraries were first created by DNA

99

fragmentation, adapter ligation followed by a size selection and amplification. DNA libraries

100

were then measured and quantified on a fragment analyser before sequencing with 2x300

101

bp paired end reads using Illumine Hiseq sequencing technology. The de novo assembly was

102

performed using default parameters with CLC Genomics Workbench v8.0 (Aarhus,

103

Denmark).

104

Bioinformatics Analysis

105

The RAST server (http://rast.nmpdr.org/; [24]) was employed with GLIMMER [25] to predict

106

open reading frames (ORFs) of Y3. With further analysis of ORFs gene product conducted

107

with BLASTP (http://blast.ncbi.nlm.nih.gov/Blast.cgi?PAGE=Proteins), Pfam

AC C

EP

TE D

M AN U

SC

91

5

Page 6 of 36

ACCEPTED MANUSCRIPT (http://pfam.xfam.org/search#tabview=tab1; [24]), Interproscan

109

(https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3998142/; [25]) and HHpred

110

(https://toolkit.tuebingen.mpg.de/#/tools/hhpred; [26]). With the detection of ORFs with

111

transmembrane domains, signal peptide sequence and lipoprotein cleavage signal being

112

identified with the use of TMHMM v.2 (http://www.cbs.dtu.dk/services/TMHMM/; [27]),

113

SignalPv.4.1 (http://www.cbs.dtu.dk/services/SignalP/; [28]) and LipoP v.1

114

(http://www.cbs.dtu.dk/services/LipoP/; [29]), respectively. The molecular weights of the

115

predicted ORFs were estimated using the batch protein molecular weight determination of

116

the sequence manipulation suite (http://www.bioinformatics.org/sms2/protein_mw.html).

117

The presence of transfer RNA genes was investigated with the use of tRNAscan-SE

118

(http://lowelab.ucsc.edu/tRNAscan-SE/; [30]) and ARAGORN

119

(http://130.235.46.10/ARAGORN/; [31]). The circular genomic map of the phage was drawn

120

using GCview [34]. Potential Rho-independent terminators in were identified using ARNold

121

(http://rna.igmors.u-psud.fr/toolbox/arnold; [33]) with Mfold QuikFold

122

(http://unafold.rna.albany.edu/?q=DINAMelt/Quickfold; [34]) using RNA energy rules 3.0 to

123

verify predictions. Potential promoters were identified by submitting up streams sequences

124

(100bp) of predicted genes to MEME (Multiple Em for Motif Elicitation) (http://meme-

125

suite.org/tools/meme; [35]).

126

Comparative analysis of phage genomes

127

Genome comparisons between phages were performed using TBLASTX and visualised using

128

Easyfig [38]. Multiple sequence alignments were created with MEGA7 [39] using MUSCLE.

129

Phylograms were constructed using the portal vertex (Y3_003), large terminase (Y3_004),

130

DNA polymerase (Y3_173) and ATP dependent DNA helicase (Y3_176) among phages

AC C

EP

TE D

M AN U

SC

RI PT

108

6

Page 7 of 36

ACCEPTED MANUSCRIPT

possessing homology to ΦRSL1, with the large terminase and portal vertex protein being

132

used among jumbo myoviruses of E. amylovora. To estimate the robustness of the

133

phylograms, the maximum-likelihood algorithm [40] based on the Whelan and Goldman

134

substitution model [41], was conducted with bootstrap support (1000). A total proteome

135

comparison between phages was conducted using Coregenes 3.5 with the BLASTP threshold

136

set at 75% (http://gateway.binf.gmu.edu:8080/CoreGenes3.5/custdata.html; [39]).

137

SAR endolysin expression

138

To assess activity of protein Y3_301, its gene (231,516 – 232,190 bp) was amplified using

139

forward primer Y3_301-Pci: 5’-CATACatgttagttatcatcaacaatcg -3’ and reverse primer Y3_301-

140

Xho1: 5’-AAGCTTctcgagggaagaaggagtatgct-3’. The truncated version of Y3_301, namely rTM-

141

Y3_301 without the transmembrane domain (231,614 - 232,190 bp) was amplified using

142

forward primer rTM-Y3_301-Pci1: 5’- CATACATgttacgcatttcgccgtc-3’ and the above reverse

143

primer Y3_301-Xho1. All amplifications were conducted with High Fidelity PCR Master mix

144

(Roche, Basel, Switzerland) using genomic DNA as a template. The resulting PCR product was

145

digested and ligated in-frame into plasmid vector pET28a, with the resulting construct

146

transformed in E. coli Lemo21 (DE3) [43], following the manufacturer’s standard protocol

147

(New England Biolabs, Ipswich, MA, USA). DNA sequencing was performed on purified

148

plasmid to ensure integrity of the construct. SAR endolysin activity in E coli was determined

149

using 40mL liquid cultures (triplicate) in LB both using selective markers kanamycin (40

150

µg/mL) and chloramphenicol (40 μg/mL) with 1% inoculum of an overnight culture. Cultures

151

were grown at 30ᵒC with agitation at 100 rpm with the optical density (600nm) being

152

measured at 1-hour intervals. At hour 4 (OD600~400), cultures were induced with IPTG (400

153

μM) and readings were continued up to hour 8. Control cultures were left un-induced. The

AC C

EP

TE D

M AN U

SC

RI PT

131

7

Page 8 of 36

ACCEPTED MANUSCRIPT

experiment was conducted in triplicate and averages and standard deviations were

155

determined using Microsoft Excel.

156

Accession Number

157

The genome sequence of phage Y3 was submitted to Genbank under accession number

158

KY984068

RI PT

154

SC

159

Results

161

Growth parameters, host range and morphology

162

Erwinia amylovora phage vB_EamM_Y3 was isolated from soil, which had been collected at

163

an apple orchard with trees displaying fire blight symptoms in Suree (canton of Lucerne,

164

Switzerland) in 2008. The bacterial host strain used for its isolation was Erwinia amylovora

165

stain 1/79, which had originally been isolated from an apple tree in Germany in 1979 [44];

166

and its use as a host strain resulted in tiny clear pin-sized plaques. The phage was found to

167

infect a number of strains of E. amylovora as well as strains of Pantoea agglomerans and

168

Pantoea vagans (Table 1). Essentially, its host range was similar to other previously

169

identified E. amylovora phages [20].

170

Examination of the morphology of Y3 by transmission electron microscopy showed the

171

phage belonged to the family of Myoviridae possessing an A1 morphotype [45], with an

172

icosahedral head of 129 nm (±4 nm) in diameter and with a contractile tail with a length of

173

192 nm (±12 nm) (Fig 1). The phage was named in accordance with the bacterial virus

174

nomenclature set out by Kropinski et al (2009). The most unusual feature of Y3 was the hair-

175

like structures along the surface of its contractile tail, a characteristic that has only been

AC C

EP

TE D

M AN U

160

8

Page 9 of 36

ACCEPTED MANUSCRIPT

observed on a small number of phages to date: these being the jumbo Pseudomonas phage

177

Lu11 [16] which shares protein homology with Y3, and those belonging to jumbo phage

178

Rak2-like group such as Pectobacterium phage vB_PccM_CBB [7] and Escherichia phage

179

121Q (accession no. NC_025447.1; [47]), and phages of unknown genome sizes and lineages

180

such as X particle and Escherichia phage PhAPEC6 [48,49].

181

RI PT

176

General genome features

183

The genome size of Y3 (261,365 bp) places it among the double stranded DNA jumbo

184

phages. The assembly pattern of the reads obtained from sequencing indicated that its

185

genome is likely to be circularly permuted with terminal repeats like that of coliphage T4

186

[50]. Due to this observation, the start position of its Genbank file was set to match that of

187

Erwinia phage Yoloswag, its closest relative available on public databases. The average G+C

188

content of Y3 was found to be 47%, which is lower than that which would typically be found

189

for its host chromosomal DNA at c.53% [51,52].

190

A total of 333 ORFs were identified with putative gene products ranging in size of 39 to 1951

191

amino acids. These ORFs were largely divided into two major gene clusters (Fig 2). The

192

largest region 1 was situated between Y3_75 to 221, with ORFs mostly in the anticlockwise

193

direction and region 2 situated from Y3_001 to 74 and Y3_222 to 333 with ORFs mostly in

194

the clockwise direction. The GC skew correlates well with the ORF orientation of the

195

proposed clusters [53]. The ORFs were found to be tightly organised having little intergenic

196

space, with a number of examples of neighbouring ORFs’ start and end points overlapping.

197

Most ORFs were found to start with AUG, while twenty and three ORFs were predicted to

198

start with GUG and UUG, respectively. Of the gene products of the identified ORFs, seventy

AC C

EP

TE D

M AN U

SC

182

9

Page 10 of 36

ACCEPTED MANUSCRIPT

could be allocated a possible function (supplementary information 1, Table S1) with others

200

identified as putative transmembrane proteins (fourteen) putative lipoproteins (two) and

201

hypothetical proteins (247). No tRNA genes were identified on the genome.

202

In terms of lysogeny, no integrase, excisionase or repressor genes were identified. However,

203

one gene product (Y3_031) was found to possess the AntA/AntB antirepressor domain

204

(IPR013557), which has been associated with anti-repressor proteins such as that encoded

205

by LO142 of temperate phage 993W [54]. These proteins are believed to be involved in the

206

lysis/lysogenic decision making process of the phage, possibly suggesting a temperate

207

ancestry [55], with a homolog of this protein also evident in Yoloswag.

208

A potential promoter with a consensus sequence of CTGTAAATA was identified at thirty two

209

locations in the genome of Y3 (supplementary information 2, Fig S1, Table S2), with a highly

210

similar promoter (ACTGTAAATA[N7]A) also having been predicted in the genome of

211

Pseudomonas phage Lu11 [16]; and like that of Lu11, this promotor is mostly situated before

212

ORFs with products predicted to play roles in nucleotide metabolism and DNA replication. In

213

addition, forty-five potential rho factor independent terminators were located throughout

214

the genome of Y3 (supplementary information 2 Table S3).

SC

M AN U

TE D

EP

AC C

215

RI PT

199

216

Phylogenetic position of phage Y3 within the family Myoviridae

217

At the DNA level, Y3 was found to have little or no homology to phage genomes currently

218

available on public databases, with its closest match being Yoloswag (coverage 6%, identity

219

75%) using BLASTN analysis. However, during BLASTP analysis, a small number of other

220

jumbo shared a number of homologs (large terminase, major capsid protein, etc). These

10

Page 11 of 36

ACCEPTED MANUSCRIPT

were phage NTCB, Pseudomonas phages Lu11 and PaBG and Ralstonia phage ΦRSL1, which

222

hinted at a possible evolutionary relationship (Fig 3). Like Y3 at the DNA level, these five

223

phages (Yoloswag, NCTB, Lu11, PaBG and ΦRSL1) also exhibited no homology to each other

224

indicating that they are at most only distantly related.

225

To determine the relatedness to each other among above six phages, a phylogenetic analysis

226

was conducted using four shared proteins (portal vertex, DNA polymerase, large terminase

227

and ATP dependent DNA helicase) to generate phylogenetic trees with confident bootstrap

228

values (Fig 4). Similar phylogenetic markers have been used in studies of other jumbo phages

229

[4,7,56]. These trees showed that among the six, two separate clades were formed: one

230

included Y3 and Yoloswag; and the second was comprised of those phages infecting the

231

genus Pseudomonas (Lu11 and PaBG). Phages NCTB and ΦRSL1 were outliers. The NCTB

232

position on the tree is somewhere between the Erwinia and Pseudomonas phages. Phage

233

ΦRSL1 appeared on a branch that sits outside of all the other five phages implying a far

234

more distant relatedness. This interpretation was further supported by the analysis of the

235

phages proteomes using Coregene (Table 2). The E. amylovora phages were found to share

236

225 proteins with each other; the Pseudomonas phages shared 136; NTCB shared between

237

88 and 112 proteins with both the Erwinia clade and the Pseudomonas clade respectively.

238

ΦRSL1 was still the most distantly related, sharing only 55 to 65 homologues with the other

239

five phages. This distant evolutionary relationship could possibly be explained in the context

240

of host range with the genera Erwinia and Pseudomonas belonging to the class of

241

Gammaproteobacteria while Ralstonia belongs to the class Betaproteobacteria [57].

242

Finally, phylogenetic analysis was preformed using the large terminase and portal protein of

243

all jumbo myoviruses of E. amylovora that have had their genome sequences submitted to

AC C

EP

TE D

M AN U

SC

RI PT

221

11

Page 12 of 36

ACCEPTED MANUSCRIPT

Genbank to date. Both these proteins have been used as phylogenetic markers in such

245

studies of distantly related phages of Enterobacteriaceae [58]. The resulting phylogram

246

shows that there is a minimum of four clades of these phages infecting this bacterium. With

247

Y3, along with phage Yoloswag, forming a novel clade that is highly distinct from the other

248

three clades identified, as the branch that these phages reside on in phylograms sits outside

249

of those representing the other clades (supplementary information 2, Table S4, Fig 5).

250

Morphogenesis proteins of phage Y3

251

The majority of the putative structural proteins of Y3 were situated in a cluster with a length

252

of 65 kbp (Y3_104 -158) within region 1, with majority of these shared with phages

253

Yolosway, NTCB, PaBG and Lu11. These putative proteins were the tail tube (Y3_132, 134),

254

the tail sheath (Y3_135, 158), the baseplate (113, 114, 124), the tail fibres (Y3_104, 108, 109,

255

110), the tail-collar fibre protein (Y3_110) and the major capsid protein (Y3_139). A putative

256

portal protein (Y3_006) was located outside of this region, being adjacent to the large

257

terminase (Y3_004). Interestingly, the homology of the portal protein to that of Bacillus

258

phage G (gp14, 5E-8, 23%) was found to be high when using BLASTP. Proteins Y3_106 and

259

Y3_143 also shared strong homology with gp431 (2e-9, 28%) and gp19 (3e-09, 33%)

260

respectively of Bacillus phage G. Y3_106 is situated between ORFs predicted to form the tail

261

fibres that possess collagen domains [59]. This gene product was predicted to be a tail fibre

262

hinge protein due a presence of an ILEI domain (PF15711.4, 1.7E-7). This domain belongs to

263

the FAM3 superfamily to which the T4-like phage tail hinge gp35 protein is distantly related

264

[60]. Three of the structural proteins that have been identified in phage ΦRSL1 (ΦRSL1_135,

265

136, 140) as confirmed by N-terminal sequencing [15], were found to have homologues in

AC C

EP

TE D

M AN U

SC

RI PT

244

12

Page 13 of 36

ACCEPTED MANUSCRIPT

phage Y3 (namely Y3_135, 139, 140, 158) providing further evidence that they play a role in

267

the virion structure of Y3.

268

Nucleotide metabolism, DNA replication and DNA modification

269

Typical of large myoviruses, Y3 has a number of proteins with functions predicted to

270

influence the nucleotide pool within its host [3]. These include a CMP deaminase (Y3_035)

271

allowing the hydrolysis of dCMP to dUMP and a putative dUTPase (Y3_300) allowing the

272

conversion of dUTP to dUMP. The dUMP can then be acted upon by a thymidylate synthase

273

(Y3_045), which can catalyse the methylation of dUMP to dTMP, which a dTMP kinase

274

(Y3_118) can in turn convert to dTDP. Another feature typical of large phages is the

275

possession of a number of proteins involved in DNA replication. It is worth mentioning that

276

smaller phages typically use a lesser number of proteins to recruit the host replication

277

machinery to carry out this role [61]. Phage Y3 was identified to possess helicases (Y3_001,

278

168, 176), primases (Y3_024, 227), a DNA ligase (Y3_148), topoisomerase subunits (Y3_327,

279

329), as well as class 1 polymerases (Y3_011, 173) and the delta component (PF13177.5,

280

5.7E-26) of a class III polymerase (Y3_209). Y3 also has proteins involved in DNA repair and

281

maintenance including RecA (Y3_243), UV damage endonuclease UvsE (Y3_287) and MmcB-

282

like DNA repair protein (Y3_119) as well as those involved in DNA recombination (Y3_186)

283

and DNA degradation (Y3_229). DNA methylation is a well-known strategy used by some

284

phages to protect against host restriction endonucleases [62] and accordingly, Y3 appears to

285

be able to methylate both cytosine and adenine (Y3_29, 50).

AC C

EP

TE D

M AN U

SC

RI PT

266

286

287

13

Page 14 of 36

ACCEPTED MANUSCRIPT Other metabolic functions of Y3

289

While the genome of phage Y3 does not possess tRNA genes, it does possess proteins

290

potentially involved in their maturation such as RNA 2'-phosphotransferase (Y3_272) and a

291

carboxy-S-adenosyl-L-methionine synthase (Y3_017). Curiously, the phage was found to

292

possess an asparagine synthase (Y3_020), which would allow the conversion of aspartate to

293

asparagine. Such auxiliary phage genes that produce proteins that compliment a host’s

294

metabolism have been identified in a number phages of marine environments; and the role

295

of their encoded proteins is understood to be the removal of potential bottle-necks in their

296

host metabolism that could potentially hinder the lytic cycle [63].

297

Selfish genetic elements

298

Inteins are selfish genetic elements that self-cleave and remove themselves from a

299

translated protein [64]. Proteins identified with these elements in phage Y3 are the large

300

terminase (Y3_004), C-5 cytosine methyltransferase (Y3_050) and one of the four predicted

301

tail fibre proteins (Y3_110). Also identified was a homing endonuclease of the HNH family

302

(Y3_179), situated between two hypothetical proteins. These homing endonuclease are

303

mobile genetic elements with endonuclease activity that only promote the spread of their

304

own encoding gene [65].

305

Cell wall degrading enzymes

306

Analysis of the protein Y3_301 showed that it possesses an N-terminal transmembrane

307

domain (12-34 aa) with a high alanine content (23%), with one basic residue (arginine), and a

308

C-terminal soluble lytic transglycosylase (SLT) domain (IPR008258; 108-218 aa). The latter

309

domain degrades peptidoglycan by cleavage of the β-1,4 glysosidic bond that results in the

AC C

EP

TE D

M AN U

SC

RI PT

288

14

Page 15 of 36

ACCEPTED MANUSCRIPT

formation of a 1,6-anhydrobond in the muramic acid residue. A homolog of this protein was

311

also found in phage Yoloswag (YOLOSWAG_297). Due the composition of the domains in this

312

protein, it was suspected to be an endolysin that followed the signal arrest (SAR) system like

313

gp45 of φKMV and lyz of phage P1 [66,67]. Analysis of the protein using SignalP 3.0, showed

314

that it possessed an N-terminal signal sequence with a probability of 0.729. However, the

315

more recent version of SignalP did not identify this domain suggesting that the N-terminal

316

secretory signal is not removed after export, but rather remains tethered to the inner

317

membrane after translocation by the host sec translocon system.

318

For number of these types of endolysin, an inducible system with E. coli permits

319

demonstration of SAR endolysin activity using a detectable reduction in culture turbidity

320

upon induction [67,68]. A similar experiment was conducted here for protein Y3_301.

321

Accordingly, gene Y3_301 was cloned into pET28a and transformed into E coli Lemo21 (DE3).

322

Expression of the cloned gene was lethal to cells, as addition of IPTG after 4 hours of growth

323

caused the OD600 to drop from ~0.480 to ~0.320 at 8 hours (Fig 6). No lytic activity was

324

detected after the removal of the signal sequence as seen with the induction of the

325

truncated transmembrane domain Y3_301 (rTM-Y3_301). As the toxic activity of this protein

326

was found to be dependent on the presence of its transmembrane domain, it could be

327

concluded that this protein behaves as a typical SAR endolysin, requiring the

328

transmembrane domain to act as a signal sequence for its translocation enabling the

329

catalytic domain of the protein to reach the periplasm where it acts on the peptidoglycan of

330

the bacterial cell resulting in lysis.

331

An important feature of SAR endolysins is the regulation of their enzymatic activity while in

332

their membrane-tethered form. The two main mechanisms that have been identified for

AC C

EP

TE D

M AN U

SC

RI PT

310

15

Page 16 of 36

ACCEPTED MANUSCRIPT

those with the lysozyme domain are (a) those which utilise steric hindrance of catalytic

334

residues due to the proximity of the bilayer to which the enzyme is embedded (as seen with

335

R21 of lambdoid phage 21) and (b) those which regulate activity by the formation of

336

disulphide bridges as seen with LyzP1 of coliphage P1 [69]. Little is understood of how SAR

337

endolysins with transglycosylase domains regulate their activity. However, the formation of

338

disulphide bridges is a possible explanation of how Y3_301 might regulate itself. Five

339

cysteine resides were identified in its amino acid sequence, one situated within the

340

transmembrane domain (Cys23) and the four-remaining located within its catalytic domain

341

(Cys127, 147, 182 & 188).

342

A second protein, namely Y3_197 was found to possess a soluble lytic transglycosylase (SLT)

343

domain (PF05838), with a peptidoglycan binding domain (IPR018537). A homolog of this

344

protein was also found in phage Yoloswag (YOLOSWAG_189). To date, we have not

345

succeeded in demonstrating its predicted enzymatic activity after cloning in E. coli (data not

346

shown).

347

Conclusion

348

As mentioned above, a number of hairy Myoviridae jumbo phages to date have been placed

349

in the Rak2-like group. The possession of whisker-like structures is shared with the unrelated

350

E. amylovora phage Y3, suggesting that these may be more widespread in the Myoviridae

351

family than previously thought. For the Rak2-like Escherichia phage 121Q, this feature has

352

been linked to a protein which resembles a phage tail fibre [47]. The exact gene products

353

responsible for the atypical whisker-like structures of Y3 is yet unknown. However, a few

354

genes have been predicted to encode tail fibre structures (Y3_104, 106, 108, 109, 110), and

355

any of these could potentially be responsible. The role of these atypical hairs of these phages

AC C

EP

TE D

M AN U

SC

RI PT

333

16

Page 17 of 36

ACCEPTED MANUSCRIPT

is still unknown, but is has been speculated to play a role in enhancing phage ability to

357

detect and infect its host by facilitating the phage adsorption processes [7,47].

358

Phylogenetic studies show that phages Y3 and Yoloswag establish a novel clade of jumbo

359

phages infecting E. amylovora (Fig 5). In the context of shared homologous proteins, Lavigne

360

et al (2009) defined phage subfamily and genus boundaries at ≥20% and ≥40%, respectively.

361

Based on these cut-off values, it would place phages Y3 and Yoloswag within a genus within

362

a subfamily to which NTCB, PaBG and Lu11 are also members and which is peripherally

363

related to ΦRSL1 (Table 2).

364

Little has been described about SAR endolysins in jumbo phages to date. However, a small

365

number of jumbo phages have been predicted to possess them. These include members of

366

the Rak2-like group such as Klebsiella phage vB_KleM-RaK2 (gp506), Pectobacterium phage

367

CBB (CBB_187), Cronobacter phage GAP32 (gp180) and coliphage Q121 (gp532) [5,70]; and

368

upon examination within our study, Erwinia phages Ea35-70 (Ea357_029), Deimos-Minion

369

(accession no. ANH52127.1; DM_29) and Simmy50 (accession no. KU886223.1; gp_30).

370

Phages Y3 and Yoloswag can be included in this group. These endolysins are known to be

371

highly prevalent in phages that infect Enterobacteriaceae [71]; and this may also be the case

372

for their jumbo phages, as a number of such endolysins have already been identified among

373

them.

374

Conflict of interest

375

There is no conflict of interest.

AC C

EP

TE D

M AN U

SC

RI PT

356

376

377

17

Page 18 of 36

ACCEPTED MANUSCRIPT Acknowledgements

379

†Alan Lucid passed away on 31 August 2017 during the preparation of the manuscript and

380

this paper is dedicated to his memory.

381

TEM imagines of phage Y3 were prepared by Rudi Lurz at the Max Planck Institue for

382

Molecular Genetic, Berlin.

383

This work was supported by Science Foundation Ireland Project Ref:12/R1/2335,

384

References

385

[1]

SC

Hatfull GF. Bacteriophage genomics. Curr Opin Microbiol 2008;11:447–53.

[3]

TE D

doi:10.1016/j.mib.2008.09.004.

Hendrix RW. Jumbo bacteriophages. Curr Top Microbiol Immunol 2009;328:229–40.

[4]

EP

doi:10.1007/978-3-540-68618-7-7.

390

391

M AN U

[2]

388

389

Ackermann H-W, Prangishvili D. Prokaryote viruses studied by electron microscopy. Arch Virol 2012;157:1843–9. doi:10.1007/s00705-012-1383-y.

386

387

RI PT

378

Yuan Y, Gao M. Jumbo Bacteriophages: An Overview. Front Microbiol 2017;8:403. doi:10.3389/fmicb.2017.00403.

394

AC C

392

395

46. doi:10.1016/j.virol.2014.05.003.

393

396 397

[5]

Abbasifar R, Griffiths MW, Sabour PM, Ackermann HW, Vandersteegen K, Lavigne R,

et al. Supersize me: Cronobacter sakazakii phage GAP32. Virology 2014;460–461:138–

[6]

Casey E, Fitzgerald B, Mahony J, Lugli GA, Ventura M, van Sinderen D. Genome Sequence of Serratia marcescens Phage BF. Genome Announc 2017;5:e00211-17.

18

Page 19 of 36

ACCEPTED MANUSCRIPT doi:10.1128/genomeA.00211-17.

398

399

[7]

Buttimer C, Hendrix H, Oliveira H, Casey A, Neve H, McAuliffe O, et al. Things Are Getting Hairy: Enterobacteria Bacteriophage vB_PcaM_CBB. Front Microbiol

401

2017;8:44. doi:10.3389/fmicb.2017.00044. [8]

McManus PS, Stockwell VO, Sundin GW, Jones AL. Antibiotic use in plant agriculture.

403

Annu Rev Phytopathol 2002;40:443–65.

404

doi:10.1146/annurev.phyto.40.120301.093927. [9]

Nischwitz C, Dhiman C. Streptomycin Resistance of Erwinia amylovora Isolated from

M AN U

405

SC

402

RI PT

400

406

Apple (Malus domesticus) in Utah. Plant Heal Prog 2013. doi:10.1094/PHP-2013-1025-

407

01-RS.

408

[10]

Tancos KA, Villani S, Kuehne S, Borejsza-Wysocka E, Breth D, Carol J, et al. Prevalence of streptomycin-resistant Erwinia amylovora in New York apple orchards. Plant Dis

410

2016;100:802–9. [11]

Yagubi AI, Castle AJ, Kropinski AM, Banks TW, Svircev AM. Complete Genome

EP

411

TE D

409

Sequence of Erwinia amylovora Bacteriophage vB_EamM_Ea35-70. Genome Announc

413

2014;2:e00413-14. doi:10.1128/genomeA.00413-14.

415

AC C

412

416

applicability of phages for the treatment of fire blight. FEMS Microbiol Lett

417

2014;350:25–7.

414

418 419

[12]

Meczker K, Dömötör D, Vass J, Rákhely G, Schneider G, Kovács T. The genome of the

Erwinia amylovora phage PhiEaH1 reveals greater diversity and broadens the

[13]

Dömötör D, Becságh P, Rákhely G, Schneider G, Kovács T. Complete genomic sequence of Erwinia amylovora phage PhiEaH2. J Virol 2012;86:10899. 19

Page 20 of 36

ACCEPTED MANUSCRIPT doi:10.1128/JVI.01870-12.

420

421

[14]

Esplin IND, Berg JA, Sharma R, Allen RC, Arens DK, Ashcroft CR, et al. Genome Sequences of 19 Novel Erwinia amylovora Bacteriophages. Genome Announc

423

2017;5:1–3.

424

[15]

RI PT

422

Yamada T, Satoh S, Ishikawa H, Fujiwara A, Kawasaki T, Fujie M, et al. A jumbo phage infecting the phytopathogen Ralstonia solanacearum defines a new lineage of the

426

Myoviridae family. Virology 2010;398:135–47. doi:10.1016/j.virol.2009.11.043. [16]

Adriaenssens EM, Mattheus W, Cornelissen A, Shaburova O, Krylov VN, Kropinski AM,

M AN U

427

SC

425

428

et al. Complete Genome Sequence of the Giant Pseudomonas Phage Lu11. J Virol

429

2012;86:6369–70. doi:10.1128/JVI.00641-12.

430

[17]

Pfreundt U, Spungin D, Hou S, Voß B, Berman-Frank I, Hess WR. Genome of a giant bacteriophage from a decaying Trichodesmium bloom. Mar Genomics 2016.

432

doi:10.1016/j.margen.2017.02.001. [18]

Sykilinda NN, Bondar AA, Gorshkova AS, Kurochkina LP, Kulikov EE, Shneider MM, et

EP

433

TE D

431

al. Complete Genome Sequence of the Novel Giant Pseudomonas Phage PaBG.

435

Genome Announc 2014;2:e00929-13-e00929-13. doi:10.1128/genomeA.00929-13.

436

[19]

437

[20]

AC C

434

Adams MH. Bacteriophages. Bacteriophages 1959. Born Y, Fieseler L, Marazzi J, Lurz R, Duffy B, Loessner MJ. Novel virulent and broad-

438

host-range Erwinia amylovora bacteriophages reveal a high degree of mosaicism and

439

a relationship to Enterobacteriaceae phages. Appl Environ Microbiol 2011;77:5945–

440

54. doi:10.1128/AEM.03022-10.

20

Page 21 of 36

ACCEPTED MANUSCRIPT 441

[21]

Sambrook J, Russell DW. Purification of Bacteriophage Lamda Particles by Isopycnic

442

Centrifugation through CsCl Gradients. Mol. Cloning A Lab. Manual, Vol. 1, Cold Spring

443

Harbor Laboratory Press; 2001, p. 2.47. [22]

Klumpp J, Dorscht J, Lurz R, Bielmann R, Wieland M, Zimmer M, et al. The terminally

RI PT

444

redundant, nonpermuted genome of Listeria bacteriophage A511: a model for the

446

SPO1-like myoviruses of gram-positive bacteria. J Bacteriol 2008;190:5753–65.

447

doi:10.1128/JB.00461-08. [23]

Laboratory Press; 2001.

449

450

Sambrook J, Russell DW. Molecular cloning : a laboratory manual. Cold Spring Harbor

[24]

M AN U

448

SC

445

Becker SA, Palsson B?, Best AA, DeJongh M, Disz T, Edwards RA, et al. The RAST Server: Rapid Annotations using Subsystems Technology. BMC Microbiol 2005;5:8.

452

doi:10.1186/1471-2180-5-8. [25]

[26]

Finn RD, Coggill P, Eberhardt RY, Eddy SR, Mistry J, Mitchell AL, et al. The Pfam protein families database: towards a more sustainable future. Nucleic Acids Res

456

2015;44:D279-85. doi:10.1093/nar/gkv1344.

457

458

EP

identification with GLIMMER. Nucleic Acids Res 1999;27:4636–41.

454

455

Delcher AL, Harmon D, Kasif S, White O, Salzberg SL. Improved microbial gene

AC C

453

TE D

451

[27]

Mitchell A, Chang H-Y, Daugherty L, Fraser M, Hunter S, Lopez R, et al. The InterPro

459

protein families database: the classification resource after 15 years. Nucleic Acids Res

460

2014;43:D213-21. doi:10.1093/nar/gku1243.

461 462

[28]

Söding J, Biegert A, Lupas AN. The HHpred interactive server for protein homology detection and structure prediction. Nucleic Acids Res 2005;33:W244-8. 21

Page 22 of 36

ACCEPTED MANUSCRIPT doi:10.1093/nar/gki408.

463

464

[29]

Käll L, Krogh A, Sonnhammer ELL. A combined transmembrane topology and signal peptide prediction method. J Mol Biol 2004;338:1027–36.

466

doi:10.1016/j.jmb.2004.03.016. [30]

peptides from transmembrane regions. Nat Meth 2011;8:785–6.

468

469

Petersen TN, Brunak S, von Heijne G, Nielsen H. SignalP 4.0: discriminating signal

[31]

Juncker AS, Willenbrock H, von Heijne G, Brunak S, Nielsen H, Krogh A. Prediction of

SC

467

RI PT

465

lipoprotein signal peptides in Gram-negative bacteria. Protein Sci 2003;12:1652–62.

471

doi:10.1110/ps.0303703. [32]

genes in genomic sequence. Nucleic Acids Res 1997;25:955–64. [33]

nucleotide sequences. Nucleic Acids Res 2004;32:11–6. doi:10.1093/nar/gkh152. [34]

Bioinformatics 2005;21:537–9. doi:10.1093/bioinformatics/bti054.

477

478

[35]

[36]

483

Zuker M. Mfold web server for nucleic acid folding and hybridization prediction. Nucleic Acids Res 2003;31:3406–15.

481

482

Naville M, Ghuillot-Gaudeffroy A, Marchais A, Gautheret D. ARNold: a web tool for the

prediction of Rho-independent transcription terminators. RNA Biol 2011;8:11–3.

479

480

Stothard P, Wishart DS. Circular genome visualization and exploration using CGView.

EP

475

476

Laslett D, Canback B. ARAGORN, a program to detect tRNA genes and tmRNA genes in

TE D

473

474

Lowe TM, Eddy SR. tRNAscan-SE: a program for improved detection of transfer RNA

AC C

472

M AN U

470

[37]

Bailey TL, Boden M, Buske FA, Frith M, Grant CE, Clementi L, et al. MEME SUITE: tools for motif discovery and searching. Nucleic Acids Res 2009;37:W202-8.

22

Page 23 of 36

ACCEPTED MANUSCRIPT doi:10.1093/nar/gkp335.

484

[38]

Bioinformatics 2011;27:1009–10. doi:10.1093/bioinformatics/btr039.

486

487

Sullivan MJ, Petty NK, Beatson SA. Easyfig: a genome comparison visualizer.

[39]

Kumar S, Stecher G, Tamura K. MEGA7: Molecular Evolutionary Genetics Analysis

RI PT

485

488

Version 7.0 for Bigger Datasets. Mol Biol Evol 2016;33:1870–4.

489

doi:10.1093/molbev/msw054. [40]

Jones DT, Taylor WR, Thornton JM. The rapid generation of mutation data matrices

SC

490

from protein sequences. Bioinformatics 1992;8:275–82.

492

doi:10.1093/bioinformatics/8.3.275.

493

[41]

M AN U

491

Whelan S, Goldman N. A general empirical model of protein evolution derived from multiple protein families using a maximum-likelihood approach. Mol Biol Evol

495

2001;18:691–9.

496

[42]

TE D

494

Turner D, Reynolds D, Seto D, Mahadevan P. CoreGenes3.5: a webserver for the determination of core genes from sets of viral and small bacterial genomes. BMC Res

498

Notes 2013;6:140. doi:10.1186/1756-0500-6-140. [43]

Schlegel S, Löfblom J, Lee C, Hjelm A, Klepsch M, Strous M, et al. Optimizing

Membrane Protein Overexpression in the Escherichia coli strain Lemo21(DE3). J Mol

500

Biol 2012;423:648–59. doi:10.1016/J.JMB.2012.07.019.

501

502

AC C

499

EP

497

[44]

Falkenstein H, Bellemann P, Walter S, Zeller W, Geider K. Identification of Erwinia

503

amylovora, the Fireblight Pathogen, by Colony Hybridization with DNA from Plasmid

504

pEA29. Appl Environ Microbiol 1988;54:2798–802.

23

Page 24 of 36

ACCEPTED MANUSCRIPT 505

[45]

2007;152:227–43. doi:10.1007/s00705-006-0849-1.

506

507

Ackermann H-W. 5500 Phages examined in the electron microscope. Arch Virol

[46]

Kropinski AM, Prangishvili D, Lavigne R. Position paper: The creation of a rational scheme for the nomenclature of viruses of Bacteria and Archaea. Environ Microbiol

509

2009;11:2775–7. doi:10.1111/j.1462-2920.2009.01970.x. [47]

[48]

Ackermann HW, Auclair P, Basavarajappa S, Konjin HP, Savanurmath C. Bacteriophages from Bombyx mori. Arch Virol 1994;137:185–90.

513

514

SC

Pittsburgh, 2016.

511

512

Hua J. Capsid structure and DNA packing in jumbo bacteriophage. University of

M AN U

510

RI PT

508

[49]

Tsones J. Use of Bacteriophages as Antimicrobials Against Avian Pathogenic Escherichia coli Infections in Poultry : from Isolation to Therapeutic. KU Leuven and

516

Vrije Universiteit Brussel, 2014.

517

[50]

TE D

515

Casjens SR, Gilcrease EB. Determining DNA Packaging Strategy by Analysis of the Termini of the Chromosomes in Tailed-Bacteriophage Virions. In: Clokie MRJ,

519

Kropinski AM, editors. Bacteriophages Methods Protoc. Vol. 2 Mol. Appl. Asp.,

520

Totowa, NJ: Humana Press; 2009, p. 91–111. doi:10.1007/978-1-60327-565-1_7.

522

AC C

EP

518

523

Conserved Core. PLoS One 2013;8:e55644. doi:10.1371/journal.pone.0055644.

521

524

[51]

Mann RA, Smits THM, Bühlmann A, Blom J, Goesmann A, Frey JE, et al. Comparative

Genomics of 12 Strains of Erwinia amylovora Identifies a Pan-Genome with a Large

[52]

Powney R, Smits THM, Sawbridge T, Frey B, Blom J, Frey JE, et al. Genome sequence of

525

an Erwinia amylovora strain with pathogenicity restricted to Rubus plants. J Bacteriol

526

2011;193:785–6. doi:10.1128/JB.01352-10. 24

Page 25 of 36

ACCEPTED MANUSCRIPT 527

[53]

Marín A, Xia X. GC skew in protein-coding genes between the leading and lagging

528

strands in bacterial genomes: New substitution models incorporating strand bias. J

529

Theor Biol 2008;253:508–13. doi:10.1016/j.jtbi.2008.04.004. [54]

Sandt CH, Hopper JE, Hill CW. Activation of Prophage eib Genes for Immunoglobulin-

RI PT

530 531

Binding Proteins by Genes from the IbrAB Genetic Island of Escherichia coli ECOR-9. J

532

Bacteriol 2002;184:3640–8. doi:10.1128/JB.184.13.3640-3648.2002. [55]

Trotter M, McAuliffe O, Callanan M, Edwards R, Fitzgerald GF, Coffey A, et al. Genome

SC

533

analysis of the obligately lytic bacteriophage 4268 of Lactococcus lactis provides

535

insight into its adaptable nature provides insight into its adaptable nature provides

536

insight into its adaptable nature. Gene 2006;366:189–99.

537

doi:10.1016/j.gene.2005.09.022. [56]

Šimoliūnas E, Kaliniene L, Truncaite L, Zajančkauskaite A, Staniulis J, Kaupinis A, et al.

TE D

538

M AN U

534

539

Klebsiella Phage vB_KleM-RaK2 — A Giant Singleton Virus of the Family Myoviridae.

540

PLoS One 2013;8. doi:10.1371/journal.pone.0060717.

Springer; 2004.

542

543

[58]

Grose JH, Casjens SR. Understanding the enormous diversity of bacteriophages: the

tailed phages that infect the bacterial family Enterobacteriaceae. Virology 2014;468–

544

470:421–43. doi:10.1016/j.virol.2014.08.024.

545

546

Garrity GM. Bergey’s manual of systematic bacteriology. Vol. 2, The proteobacteria.

EP

[57]

AC C

541

[59]

Ghosh N, McKillop TJ, Jowitt TA, Howard M, Davies H, Holmes DF, et al. Collagen-Like

547

Proteins in Pathogenic E. coli Strains. PLoS One 2012;7:e37872.

548

doi:10.1371/journal.pone.0037872.

25

Page 26 of 36

ACCEPTED MANUSCRIPT 549

[60]

Guo J, Cheng H, Zhao S, Yu L. GG: A domain involved in phage LTF apparatus and

550

implicated in human MEB and non-syndromic hearing loss diseases. FEBS Lett

551

2006;580:581–4. doi:10.1016/j.febslet.2005.12.076.

2002;61:471–80. doi:10.1006/tpbi.2002.1590.

553

[62]

Opin Microbiol 2016;31:161–8. doi:10.1016/j.mib.2016.04.002.

557

558

[64]

[65]

Edgell DR, Gibb EA, Belfort M. Mobile DNA elements in T4 and related phages. Virol J 2010;7:290. doi:10.1186/1743-422X-7-290.

561

[66]

Xu M, Struck DK, Deaton J, Wang I-N, Young R. A signal-arrest-release sequence

563

mediates export and control of the phage P1 endolysin. Proc Natl Acad Sci

564

566

AC C

562

Tori K, Perler FB. Expanding the Definition of Class 3 Inteins and Their Proposed Phage Origin. J Bacteriol 2011;193:2035–41. doi:10.1128/JB.01407-10.

559

560

Hurwitz BL, U’Ren JM. Viral metabolic reprogramming in marine ecosystems. Curr

M AN U

[63]

SC

bacterial defences. Nat Rev Microbiol 2013;11:675–87. doi:10.1038/nrmicro3096.

555

556

Samson JE, Magadán AH, Sabri M, Moineau S. Revenge of the phages: defeating

TE D

554

Hendrix RW. Bacteriophages: Evolution of the Majority. Theor Popul Biol

RI PT

[61]

EP

552

567

30. doi:10.4161/bact.1.1.14868.

565

568 569

2004;101:6415–20. doi:10.1073/pnas.0400957101.

[67]

Briers Y, Peeters LM, Volckaert G, Lavigne R. The lysis cassette of bacteriophage ϕKMV

encodes a signal-arrest-release endolysin and a pinholin. Bacteriophage 2011;1:25–

[68]

Lynch KH, Abdu AH, Schobert M, Dennis JJ. Genomic characterization of JG068, a novel virulent podovirus active against Burkholderia cenocepacia. BMC Genomics

26

Page 27 of 36

ACCEPTED MANUSCRIPT 2013;14:574. doi:10.1186/1471-2164-14-574.

570

571

[69]

disulfide caging. J Bacteriol 2010;192:5682–7. doi:10.1128/JB.00674-10.

572

[70]

Buttimer C, O’Sullivan L, Elbreki M, Neve H, McAuliffe O, Ross RP, et al. Genome

RI PT

573

Kuty GF, Xu M, Struck DK, Summer EJ, Young R. Regulation of a phage endolysin by

574

Sequence of Jumbo Phage vB_AbaM_ME3 of Acinetobacter baumanni. Genome

575

Announc 2016;4:e00431-16. doi:10.1128/genomeA.00431-16. [71]

Oliveira H, Melo LDR, Santos SB, Nóbrega FL, Ferreira EC, Cerca N, et al. Molecular

SC

576

aspects and comparative genomics of bacteriophage endolysins. J Virol

578

2013;87:4558–70. doi:10.1128/JVI.03277-12.

M AN U

577

579

583

584

585

EP

582

AC C

581

TE D

580

586

587

588

27

Page 28 of 36

ACCEPTED MANUSCRIPT 589

Table 1. Host range of bacteriophage vB_EamM_Y3 tested on strains of Erwinia amylovora

590

and closely related genera and species, as determined by spot testing with dilution.

RI PT

Sensitivity + + + + + + + + + + + + + + + + + -

SC

Strain CFBP 1430 Erwinia amylovora CFBP 1232 Ea153 1/74 1/79 ACW 38899 ACW 56400 ACW 44274 Ea Rac 3075 ACW 55500 ACW 55835 ACW 55955 IPV 1077/7 01SFR-BO LA 469 LA 071 LA 411 LA 468 LA 477 ACW 40943 Erwinia persicina ACW 40560x ACW 41072 Erwinia billingiae Pagg B90 Eh 42 Pantoea agglomerans Em 406 Em 283 Pantoea vagans C9-1 Pantoea ananatis 351 Lys Results recorded as +, sensitive; −, no infecqon.

591

592

AC C

EP

TE D

M AN U

Species

593

28

Page 29 of 36

ACCEPTED MANUSCRIPT 594

Table 2. Homologous proteins shared between the phages related to Erwinia amylovora

595

phage Y3 (Yoloswag, NTCB, Pseudomonas phages PaBG and Lu11 and Ralstonia phage

596

ΦRSL1).

597

Yoloswag NCTB PaBG Lu11 ΦRSL1 225 112 105 87 55 Y3 (333) (67.57%) (33.63%) (31.53%) (26.13%) (16.52%) Yoloswag 225 113 102 87 53 (334) (67.37%) (26.05%) (15.87%) (33.83%) (30.54%) 112 113 95 88 53 NCTB (301) (37.21%) (37.54%) (31.56%) (29.24%) (17.61) 105 102 95 136 65 PaBG (308) (34.09%) (33.12%) (30.84%) (44.16%) (21.1%) 87 87 88 136 62 Lu11 (391) (22.25%) (22.25%) (22.51) (34.78%) (15.86%) 55 53 53 65 62 ΦRSL1 (343) (16.03%) (15.45%) (15.45) (18.95%) (18.08%) Number in bracket next phage name is total predict proteins with percentages representing

598

the portion of shared proteins within the proteome of the phage in question. Analyses were

599

conducted using Coregenes with the BLASTP threshold set to 75%.

AC C

EP

TE D

M AN U

SC

RI PT

Y3

29

Page 30 of 36

600

TE D

M AN U

SC

RI PT

ACCEPTED MANUSCRIPT

Fig 1. Electron micrograph of Erwinia amylovora phage Y3. Phage particles were negatively

602

stained with 2% ammonium molybdate. Hair-like structures are evident around the tails.

603

Scale bar represents 100 nm

AC C

EP

601

30

Page 31 of 36

AC C

604

EP

TE D

M AN U

SC

RI PT

ACCEPTED MANUSCRIPT

605

Fig 2. Summary of the genomic organisation of the 261,365 bp genome of Erwinia amylovora

606

phage Y3. On the outer ring, putative ORFs are represented by purple arrowheads labelled

607

with predicted function where possible. On the middle ring is shown GC content relative to

608

the mean GC content of the genome. On the inner ring, GC skew is illustrated where green

609

represents positive skew and purple a negative skew.

31

Page 32 of 36

M AN U

SC

RI PT

ACCEPTED MANUSCRIPT

610

Fig 3. Comparison of the genomes of phage Y3 with jumbo phages Erwinia amylovora phage

612

Yoloswag, phage NCTB, Pseudomonas phages Lu11 and PaBG and Ralstonia phage ΦRSL1

613

using currently available annotations employing TBLASTX and visualised with Easyfig. The

614

blue/red bar chart (top) shows the G+C skew of the Y3 genome. The genome maps comprise

615

of arrows indicating locations and orientation of ORFs. Lines between genome maps indicate

616

level of homology. To assist in the comparison, the largest gene of Y3 was set to the first

617

position of its map with the corresponding homologous gene being set to the first position

618

for the other phage genomes. Level of amino acid identity is shown via the gradient scale.

AC C

EP

TE D

611

32

Page 33 of 36

RI PT

ACCEPTED MANUSCRIPT

M AN U

SC

619

621

AC C

EP

TE D

620

33

Page 34 of 36

RI PT

ACCEPTED MANUSCRIPT

622

Fig 4. Phylogenetic analysis using the portal vertex protein (phylogram A), DNA polymerase

624

(B), large terminase (C) and ATP dependent DNA helicase (D) using the six phages Y3,

625

Yoloswag, NCTB, Lu11, PaBG and ΦRSL1. The amino acid sequences were compared using

626

MUSCLE. The tree was constructed using the maximum likelihood algorithm. The

627

percentages of replicate trees were assessed with the bootstrap test (1000).

AC C

EP

TE D

M AN U

SC

623

628

34

Page 35 of 36

M AN U

SC

RI PT

ACCEPTED MANUSCRIPT

629

Fig 5. Phylogenetic analysis using the large terminase (phylogram A) and portal protein (B)

631

of jumbo Erwinia amylovora myoviruses with genome sequences submitted to Genbank to

632

date. The amino acid sequences were compared using MUSCLE. The tree was constructed

633

using the maximum likelihood algorithm. The percentages of replicate trees were assessed

634

with the bootstrap test (1000)

EP

AC C

635

TE D

630

35

Page 36 of 36

SC

RI PT

ACCEPTED MANUSCRIPT

M AN U

636

Fig 6. Growth curve of E. coli Lemo21 (DE3) carrying pET28a with the SAR endolysin Y3_301

638

and its truncated derivative without the transmembrane domain (rTM-Y3_301). The culture

639

was induced at OD600 ~0.450 with 400 µM IPTG (indicated with arrow). All OD measurements

640

were taken in triplicate and the averages are represented.

641

Supplementary material

642

Table S1, Table S2 and Table S3, Table S4

AC C

EP

TE D

637

36