Accepted Manuscript Erwinia amylovora phage vB_EamM_Y3 represents another lineage of hairy Myoviridae Colin Buttimer, Yannick Born, Alan Lucid, Martin J. Loessner, Lars Fieseler, Aidan Coffey PII:
S0923-2508(18)30062-7
DOI:
10.1016/j.resmic.2018.04.006
Reference:
RESMIC 3652
To appear in:
Research in Microbiology
Received Date: 20 December 2017 Accepted Date: 23 April 2018
Please cite this article as: C. Buttimer, Y. Born, A. Lucid, M.J. Loessner, L. Fieseler, A. Coffey, Erwinia amylovora phage vB_EamM_Y3 represents another lineage of hairy Myoviridae, Research in Microbiologoy (2018), doi: 10.1016/j.resmic.2018.04.006. This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
Page 1 of 36
ACCEPTED MANUSCRIPT
Erwinia amylovora phage vB_EamM_Y3 represents another lineage
2
of hairy Myoviridae
3
Colin Buttimera, Yannick Bornb,c‡, Alan Lucida†, Martin J. Loessnerb, Lars Fieselerb‡, Aidan
4
Coffeya*
5
a
Department of Biological Sciences, Cork Institute of Technology, Cork, Ireland.
6
b
Institute of Food, Nutrition, and Health, ETH Zurich, Zürich, Switzerland.
7
c
Agroscope, Research Division Plant Protection, Wädenswil, Switzerland
8
‡
9
Sciences, Wädenswil, Switzerland.
M AN U
SC
RI PT
1
Present address: Institute of Food and Beverage Innovation, Zurich University of Applied
TE D
10
Email addresses
12
Colin Buttimer -
[email protected]
13
Yannick Born -
[email protected]
14
Martin J. Loessner -
[email protected]
15
Lars Fieseler -
[email protected]
16
Aidan Coffey -
[email protected] *Correspondence and reprints
AC C
EP
11
17
18
1
Page 2 of 36
ACCEPTED MANUSCRIPT Abstract
20
To date, a small number of jumbo myoviruses have been reported to possess atypical
21
whisker-like structures along the surface of their contractile tails. Erwinia amylovora phage
22
vB_EamM_Y3 is another example. It possesses a genome of 261,365 kbp with 333 predicted
23
ORFs. Using a combination of BLASTP, Interproscan and HHpred, about 21% of its putative
24
proteins could be assigned functions involved in nucleotide metabolism, DNA replication,
25
virion structure and cell wall degradation. The phage was found to have a signal-arrest-
26
release (SAR) endolysin (Y3_301) possessing a soluble lytic transglycosylase domain. Like
27
other SAR endolysins, inducible expression of Y3_301 caused Escherichia coli lysis, which is
28
dependent on the presence of an N-terminal signal sequence. Phylogenetic analysis showed
29
that its closest relatives are other jumbo phages including Pseudomonas aeruginosa phage
30
PaBG and P. putida phage Lu11, sharing 105 and 87 homologous proteins respectively. Like
31
these phages, Y3 also shares a distant relationship to Ralstonia solanacearum phage ΦRSL1
32
(sharing 55 homologous proteins). As these phages are unrelated to the Rak2-like group of
33
hairy phages, Y3 along with Lu11 represent a second lineage of hairy myoviruses.
34
Introduction
35
The order of Caudovirales (the tailed phages) consists of three families of bacteriophage
36
(phage), with about 25% consisting of members of the family Myoviridae [1]. These
37
members possess virions with contractile tails, and include the phages with the largest
38
genomes sizes [2]. Those with genomes greater than 200 kbp are typically referred to as
39
“jumbo phages”; and currently, there are over ninety in the databases with most having
40
Gram negative hosts [3,4]. Those with the largest genomes identified to date are Bacillus
41
phage G (498 kbp) and those belonging to the Rak2-like group such Cronobacter sakazakii
AC C
EP
TE D
M AN U
SC
RI PT
19
2
Page 3 of 36
ACCEPTED MANUSCRIPT phage vB_CsaM_GAP32 (358 kbp), Serratia marcescens phage BF (357 kbp) and
43
Pectobacterium phage vB_PcaM_CBB (355 kbp) [5–7]. Erwinia phages are of relevance in the
44
context of alternative methods for bacterial crop disease control. Indeed, recent years have
45
seen intensive investigation into the use of these phages for biocontrol of contagious fire
46
blight of apples and pears. Such research is motivated by increasing reports of resistance to
47
other antibacterial agents [8–10]. The search for new phages for biocontrol applications has
48
resulted in the identification of a number of jumbo phages such as Erwinia amylovora jumbo
49
phages Ea35-70 (accession no. NC_023557.1; [11]), φEaH1 [12], φEaH2 [13], including
50
another sixteen jumbo phages which have had their genome sequences only recently
51
published [14].
52
Another important bacterial pathogen is Ralstonia solanacearum, which colonises the xylem
53
and causes wilt in a wide range of economically important crops. Its relevance has resulted
54
in the discovery of Ralstonia solanacearum phage ΦRSL1 (231 kbp, accession no.
55
NC_010811.2), described in by Yamada et al (2010) and found to possess very little
56
homology to known myoviruses. In recent times, a number of other jumbo phages have
57
been identified with low but significant homology to phage ΦRSL1 at the protein level.
58
These are Pseudomonas phage Lu11 (280 kbp, accession no. NC_017972.1) and PaBG (258
59
kbp, accession no. NC_022096.1) as well as phage NTCB (258 kbp, accession no.
60
LT598654.1), whose genome sequence was identified in a metagenomic study of a decaying
61
Trichodesmium bloom, and whose host range remains unknown [16–18]. As well as the
62
recently described Erwinia amylovora phage vB_EamM_Yoloswag (259 kbp, accession no.
63
KY448244), which can also be assigned to this group [14].
AC C
EP
TE D
M AN U
SC
RI PT
42
3
Page 4 of 36
ACCEPTED MANUSCRIPT 64
Here we report a newly isolated jumbo phage, vB_Eam_Y3 of E. amylovora, which possesses
65
atypical whisker-like structures along the surface of its contractile tail; and has been found
66
to share homology (at the protein level) to Lu11 and PaBG with distant homology to ΦRSL1.
RI PT
67
Materials and Methods
69
Bacterial Strains, Phage, and Cultivation Techniques
70
All Erwinia and Pantoea strains were grown in LB medium at 28°C. Phage Y3 was routinely
71
cultivated at 28°C with the soft agar overlay method [19] using LB plates and LC soft agar (LB
72
supplemented with 10 mM CaCl2 and 2 mM MgSO4; 0.4% agar), and E. amylovora 1/79 as
73
host strain. The phage was originally isolated from soil as described previously [20]. Briefly,
74
10 g of soil was enriched in 200 mL SM-buffered (50 mM Tris, 100 mM NaCl, 8 mM MgSO4,
75
pH 7.4) LB broth, which had been inoculated 1:20 with and overnight culture of E. amylovora
76
1/79. After overnight incubation at 28°C, filtered supernatants were spotted onto a freshly
77
growing lawn of E. amylovora 1/79 and plaques were picked with a Pasteur pipette. After
78
isolation, Y3 was propagated to high titers with the soft agar overlay method. Purification for
79
subsequent electron microscopy and DNA isolation was achieved by CsCl density
80
centrifugation [21]. Host range analysis involved the use of the soft agar overlay method
81
with the use of different bacterial strains and the spotting of phage diluents, a positive result
82
was recorded with the detection of plaques after an overnight incubation.
AC C
EP
TE D
M AN U
SC
68
83
84
85 4
Page 5 of 36
ACCEPTED MANUSCRIPT Transmission Electron Microscopy
87
For transmission electron microscopy, CsCl-purified phages were negatively stained with 2%
88
ammonium molybdate and then analysed as previously as described [22], with 10 particles
89
being used to determine virion dimensions.
RI PT
86
90
DNA Isolation and Sequencing
92
For the extraction of phage DNA, CsCl-purified phage particles were dialyzed against a 1,000-
93
fold excess of SM buffer (6 h, RT). Free nucleic acids were degraded 15 min. at 37°C by
94
DNase I (final concentration: 1 U/ml) and RNase A (50 µg/ml). Nucleases were inactivated
95
and phage capsids degraded 1 h at 56°C with EDTA (20 mM, pH 8.0), proteinase K (50 μg/ml),
96
and SDS (0.5% [wt/vol]). DNA was then purified using a phenol-chloroform extraction
97
followed by ethanol-precipitation [23]. DNA sequencing was outsourced to GATC Biotech
98
(Konstanz, Germany). To conduct sequencing DNA libraries were first created by DNA
99
fragmentation, adapter ligation followed by a size selection and amplification. DNA libraries
100
were then measured and quantified on a fragment analyser before sequencing with 2x300
101
bp paired end reads using Illumine Hiseq sequencing technology. The de novo assembly was
102
performed using default parameters with CLC Genomics Workbench v8.0 (Aarhus,
103
Denmark).
104
Bioinformatics Analysis
105
The RAST server (http://rast.nmpdr.org/; [24]) was employed with GLIMMER [25] to predict
106
open reading frames (ORFs) of Y3. With further analysis of ORFs gene product conducted
107
with BLASTP (http://blast.ncbi.nlm.nih.gov/Blast.cgi?PAGE=Proteins), Pfam
AC C
EP
TE D
M AN U
SC
91
5
Page 6 of 36
ACCEPTED MANUSCRIPT (http://pfam.xfam.org/search#tabview=tab1; [24]), Interproscan
109
(https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3998142/; [25]) and HHpred
110
(https://toolkit.tuebingen.mpg.de/#/tools/hhpred; [26]). With the detection of ORFs with
111
transmembrane domains, signal peptide sequence and lipoprotein cleavage signal being
112
identified with the use of TMHMM v.2 (http://www.cbs.dtu.dk/services/TMHMM/; [27]),
113
SignalPv.4.1 (http://www.cbs.dtu.dk/services/SignalP/; [28]) and LipoP v.1
114
(http://www.cbs.dtu.dk/services/LipoP/; [29]), respectively. The molecular weights of the
115
predicted ORFs were estimated using the batch protein molecular weight determination of
116
the sequence manipulation suite (http://www.bioinformatics.org/sms2/protein_mw.html).
117
The presence of transfer RNA genes was investigated with the use of tRNAscan-SE
118
(http://lowelab.ucsc.edu/tRNAscan-SE/; [30]) and ARAGORN
119
(http://130.235.46.10/ARAGORN/; [31]). The circular genomic map of the phage was drawn
120
using GCview [34]. Potential Rho-independent terminators in were identified using ARNold
121
(http://rna.igmors.u-psud.fr/toolbox/arnold; [33]) with Mfold QuikFold
122
(http://unafold.rna.albany.edu/?q=DINAMelt/Quickfold; [34]) using RNA energy rules 3.0 to
123
verify predictions. Potential promoters were identified by submitting up streams sequences
124
(100bp) of predicted genes to MEME (Multiple Em for Motif Elicitation) (http://meme-
125
suite.org/tools/meme; [35]).
126
Comparative analysis of phage genomes
127
Genome comparisons between phages were performed using TBLASTX and visualised using
128
Easyfig [38]. Multiple sequence alignments were created with MEGA7 [39] using MUSCLE.
129
Phylograms were constructed using the portal vertex (Y3_003), large terminase (Y3_004),
130
DNA polymerase (Y3_173) and ATP dependent DNA helicase (Y3_176) among phages
AC C
EP
TE D
M AN U
SC
RI PT
108
6
Page 7 of 36
ACCEPTED MANUSCRIPT
possessing homology to ΦRSL1, with the large terminase and portal vertex protein being
132
used among jumbo myoviruses of E. amylovora. To estimate the robustness of the
133
phylograms, the maximum-likelihood algorithm [40] based on the Whelan and Goldman
134
substitution model [41], was conducted with bootstrap support (1000). A total proteome
135
comparison between phages was conducted using Coregenes 3.5 with the BLASTP threshold
136
set at 75% (http://gateway.binf.gmu.edu:8080/CoreGenes3.5/custdata.html; [39]).
137
SAR endolysin expression
138
To assess activity of protein Y3_301, its gene (231,516 – 232,190 bp) was amplified using
139
forward primer Y3_301-Pci: 5’-CATACatgttagttatcatcaacaatcg -3’ and reverse primer Y3_301-
140
Xho1: 5’-AAGCTTctcgagggaagaaggagtatgct-3’. The truncated version of Y3_301, namely rTM-
141
Y3_301 without the transmembrane domain (231,614 - 232,190 bp) was amplified using
142
forward primer rTM-Y3_301-Pci1: 5’- CATACATgttacgcatttcgccgtc-3’ and the above reverse
143
primer Y3_301-Xho1. All amplifications were conducted with High Fidelity PCR Master mix
144
(Roche, Basel, Switzerland) using genomic DNA as a template. The resulting PCR product was
145
digested and ligated in-frame into plasmid vector pET28a, with the resulting construct
146
transformed in E. coli Lemo21 (DE3) [43], following the manufacturer’s standard protocol
147
(New England Biolabs, Ipswich, MA, USA). DNA sequencing was performed on purified
148
plasmid to ensure integrity of the construct. SAR endolysin activity in E coli was determined
149
using 40mL liquid cultures (triplicate) in LB both using selective markers kanamycin (40
150
µg/mL) and chloramphenicol (40 μg/mL) with 1% inoculum of an overnight culture. Cultures
151
were grown at 30ᵒC with agitation at 100 rpm with the optical density (600nm) being
152
measured at 1-hour intervals. At hour 4 (OD600~400), cultures were induced with IPTG (400
153
μM) and readings were continued up to hour 8. Control cultures were left un-induced. The
AC C
EP
TE D
M AN U
SC
RI PT
131
7
Page 8 of 36
ACCEPTED MANUSCRIPT
experiment was conducted in triplicate and averages and standard deviations were
155
determined using Microsoft Excel.
156
Accession Number
157
The genome sequence of phage Y3 was submitted to Genbank under accession number
158
KY984068
RI PT
154
SC
159
Results
161
Growth parameters, host range and morphology
162
Erwinia amylovora phage vB_EamM_Y3 was isolated from soil, which had been collected at
163
an apple orchard with trees displaying fire blight symptoms in Suree (canton of Lucerne,
164
Switzerland) in 2008. The bacterial host strain used for its isolation was Erwinia amylovora
165
stain 1/79, which had originally been isolated from an apple tree in Germany in 1979 [44];
166
and its use as a host strain resulted in tiny clear pin-sized plaques. The phage was found to
167
infect a number of strains of E. amylovora as well as strains of Pantoea agglomerans and
168
Pantoea vagans (Table 1). Essentially, its host range was similar to other previously
169
identified E. amylovora phages [20].
170
Examination of the morphology of Y3 by transmission electron microscopy showed the
171
phage belonged to the family of Myoviridae possessing an A1 morphotype [45], with an
172
icosahedral head of 129 nm (±4 nm) in diameter and with a contractile tail with a length of
173
192 nm (±12 nm) (Fig 1). The phage was named in accordance with the bacterial virus
174
nomenclature set out by Kropinski et al (2009). The most unusual feature of Y3 was the hair-
175
like structures along the surface of its contractile tail, a characteristic that has only been
AC C
EP
TE D
M AN U
160
8
Page 9 of 36
ACCEPTED MANUSCRIPT
observed on a small number of phages to date: these being the jumbo Pseudomonas phage
177
Lu11 [16] which shares protein homology with Y3, and those belonging to jumbo phage
178
Rak2-like group such as Pectobacterium phage vB_PccM_CBB [7] and Escherichia phage
179
121Q (accession no. NC_025447.1; [47]), and phages of unknown genome sizes and lineages
180
such as X particle and Escherichia phage PhAPEC6 [48,49].
181
RI PT
176
General genome features
183
The genome size of Y3 (261,365 bp) places it among the double stranded DNA jumbo
184
phages. The assembly pattern of the reads obtained from sequencing indicated that its
185
genome is likely to be circularly permuted with terminal repeats like that of coliphage T4
186
[50]. Due to this observation, the start position of its Genbank file was set to match that of
187
Erwinia phage Yoloswag, its closest relative available on public databases. The average G+C
188
content of Y3 was found to be 47%, which is lower than that which would typically be found
189
for its host chromosomal DNA at c.53% [51,52].
190
A total of 333 ORFs were identified with putative gene products ranging in size of 39 to 1951
191
amino acids. These ORFs were largely divided into two major gene clusters (Fig 2). The
192
largest region 1 was situated between Y3_75 to 221, with ORFs mostly in the anticlockwise
193
direction and region 2 situated from Y3_001 to 74 and Y3_222 to 333 with ORFs mostly in
194
the clockwise direction. The GC skew correlates well with the ORF orientation of the
195
proposed clusters [53]. The ORFs were found to be tightly organised having little intergenic
196
space, with a number of examples of neighbouring ORFs’ start and end points overlapping.
197
Most ORFs were found to start with AUG, while twenty and three ORFs were predicted to
198
start with GUG and UUG, respectively. Of the gene products of the identified ORFs, seventy
AC C
EP
TE D
M AN U
SC
182
9
Page 10 of 36
ACCEPTED MANUSCRIPT
could be allocated a possible function (supplementary information 1, Table S1) with others
200
identified as putative transmembrane proteins (fourteen) putative lipoproteins (two) and
201
hypothetical proteins (247). No tRNA genes were identified on the genome.
202
In terms of lysogeny, no integrase, excisionase or repressor genes were identified. However,
203
one gene product (Y3_031) was found to possess the AntA/AntB antirepressor domain
204
(IPR013557), which has been associated with anti-repressor proteins such as that encoded
205
by LO142 of temperate phage 993W [54]. These proteins are believed to be involved in the
206
lysis/lysogenic decision making process of the phage, possibly suggesting a temperate
207
ancestry [55], with a homolog of this protein also evident in Yoloswag.
208
A potential promoter with a consensus sequence of CTGTAAATA was identified at thirty two
209
locations in the genome of Y3 (supplementary information 2, Fig S1, Table S2), with a highly
210
similar promoter (ACTGTAAATA[N7]A) also having been predicted in the genome of
211
Pseudomonas phage Lu11 [16]; and like that of Lu11, this promotor is mostly situated before
212
ORFs with products predicted to play roles in nucleotide metabolism and DNA replication. In
213
addition, forty-five potential rho factor independent terminators were located throughout
214
the genome of Y3 (supplementary information 2 Table S3).
SC
M AN U
TE D
EP
AC C
215
RI PT
199
216
Phylogenetic position of phage Y3 within the family Myoviridae
217
At the DNA level, Y3 was found to have little or no homology to phage genomes currently
218
available on public databases, with its closest match being Yoloswag (coverage 6%, identity
219
75%) using BLASTN analysis. However, during BLASTP analysis, a small number of other
220
jumbo shared a number of homologs (large terminase, major capsid protein, etc). These
10
Page 11 of 36
ACCEPTED MANUSCRIPT
were phage NTCB, Pseudomonas phages Lu11 and PaBG and Ralstonia phage ΦRSL1, which
222
hinted at a possible evolutionary relationship (Fig 3). Like Y3 at the DNA level, these five
223
phages (Yoloswag, NCTB, Lu11, PaBG and ΦRSL1) also exhibited no homology to each other
224
indicating that they are at most only distantly related.
225
To determine the relatedness to each other among above six phages, a phylogenetic analysis
226
was conducted using four shared proteins (portal vertex, DNA polymerase, large terminase
227
and ATP dependent DNA helicase) to generate phylogenetic trees with confident bootstrap
228
values (Fig 4). Similar phylogenetic markers have been used in studies of other jumbo phages
229
[4,7,56]. These trees showed that among the six, two separate clades were formed: one
230
included Y3 and Yoloswag; and the second was comprised of those phages infecting the
231
genus Pseudomonas (Lu11 and PaBG). Phages NCTB and ΦRSL1 were outliers. The NCTB
232
position on the tree is somewhere between the Erwinia and Pseudomonas phages. Phage
233
ΦRSL1 appeared on a branch that sits outside of all the other five phages implying a far
234
more distant relatedness. This interpretation was further supported by the analysis of the
235
phages proteomes using Coregene (Table 2). The E. amylovora phages were found to share
236
225 proteins with each other; the Pseudomonas phages shared 136; NTCB shared between
237
88 and 112 proteins with both the Erwinia clade and the Pseudomonas clade respectively.
238
ΦRSL1 was still the most distantly related, sharing only 55 to 65 homologues with the other
239
five phages. This distant evolutionary relationship could possibly be explained in the context
240
of host range with the genera Erwinia and Pseudomonas belonging to the class of
241
Gammaproteobacteria while Ralstonia belongs to the class Betaproteobacteria [57].
242
Finally, phylogenetic analysis was preformed using the large terminase and portal protein of
243
all jumbo myoviruses of E. amylovora that have had their genome sequences submitted to
AC C
EP
TE D
M AN U
SC
RI PT
221
11
Page 12 of 36
ACCEPTED MANUSCRIPT
Genbank to date. Both these proteins have been used as phylogenetic markers in such
245
studies of distantly related phages of Enterobacteriaceae [58]. The resulting phylogram
246
shows that there is a minimum of four clades of these phages infecting this bacterium. With
247
Y3, along with phage Yoloswag, forming a novel clade that is highly distinct from the other
248
three clades identified, as the branch that these phages reside on in phylograms sits outside
249
of those representing the other clades (supplementary information 2, Table S4, Fig 5).
250
Morphogenesis proteins of phage Y3
251
The majority of the putative structural proteins of Y3 were situated in a cluster with a length
252
of 65 kbp (Y3_104 -158) within region 1, with majority of these shared with phages
253
Yolosway, NTCB, PaBG and Lu11. These putative proteins were the tail tube (Y3_132, 134),
254
the tail sheath (Y3_135, 158), the baseplate (113, 114, 124), the tail fibres (Y3_104, 108, 109,
255
110), the tail-collar fibre protein (Y3_110) and the major capsid protein (Y3_139). A putative
256
portal protein (Y3_006) was located outside of this region, being adjacent to the large
257
terminase (Y3_004). Interestingly, the homology of the portal protein to that of Bacillus
258
phage G (gp14, 5E-8, 23%) was found to be high when using BLASTP. Proteins Y3_106 and
259
Y3_143 also shared strong homology with gp431 (2e-9, 28%) and gp19 (3e-09, 33%)
260
respectively of Bacillus phage G. Y3_106 is situated between ORFs predicted to form the tail
261
fibres that possess collagen domains [59]. This gene product was predicted to be a tail fibre
262
hinge protein due a presence of an ILEI domain (PF15711.4, 1.7E-7). This domain belongs to
263
the FAM3 superfamily to which the T4-like phage tail hinge gp35 protein is distantly related
264
[60]. Three of the structural proteins that have been identified in phage ΦRSL1 (ΦRSL1_135,
265
136, 140) as confirmed by N-terminal sequencing [15], were found to have homologues in
AC C
EP
TE D
M AN U
SC
RI PT
244
12
Page 13 of 36
ACCEPTED MANUSCRIPT
phage Y3 (namely Y3_135, 139, 140, 158) providing further evidence that they play a role in
267
the virion structure of Y3.
268
Nucleotide metabolism, DNA replication and DNA modification
269
Typical of large myoviruses, Y3 has a number of proteins with functions predicted to
270
influence the nucleotide pool within its host [3]. These include a CMP deaminase (Y3_035)
271
allowing the hydrolysis of dCMP to dUMP and a putative dUTPase (Y3_300) allowing the
272
conversion of dUTP to dUMP. The dUMP can then be acted upon by a thymidylate synthase
273
(Y3_045), which can catalyse the methylation of dUMP to dTMP, which a dTMP kinase
274
(Y3_118) can in turn convert to dTDP. Another feature typical of large phages is the
275
possession of a number of proteins involved in DNA replication. It is worth mentioning that
276
smaller phages typically use a lesser number of proteins to recruit the host replication
277
machinery to carry out this role [61]. Phage Y3 was identified to possess helicases (Y3_001,
278
168, 176), primases (Y3_024, 227), a DNA ligase (Y3_148), topoisomerase subunits (Y3_327,
279
329), as well as class 1 polymerases (Y3_011, 173) and the delta component (PF13177.5,
280
5.7E-26) of a class III polymerase (Y3_209). Y3 also has proteins involved in DNA repair and
281
maintenance including RecA (Y3_243), UV damage endonuclease UvsE (Y3_287) and MmcB-
282
like DNA repair protein (Y3_119) as well as those involved in DNA recombination (Y3_186)
283
and DNA degradation (Y3_229). DNA methylation is a well-known strategy used by some
284
phages to protect against host restriction endonucleases [62] and accordingly, Y3 appears to
285
be able to methylate both cytosine and adenine (Y3_29, 50).
AC C
EP
TE D
M AN U
SC
RI PT
266
286
287
13
Page 14 of 36
ACCEPTED MANUSCRIPT Other metabolic functions of Y3
289
While the genome of phage Y3 does not possess tRNA genes, it does possess proteins
290
potentially involved in their maturation such as RNA 2'-phosphotransferase (Y3_272) and a
291
carboxy-S-adenosyl-L-methionine synthase (Y3_017). Curiously, the phage was found to
292
possess an asparagine synthase (Y3_020), which would allow the conversion of aspartate to
293
asparagine. Such auxiliary phage genes that produce proteins that compliment a host’s
294
metabolism have been identified in a number phages of marine environments; and the role
295
of their encoded proteins is understood to be the removal of potential bottle-necks in their
296
host metabolism that could potentially hinder the lytic cycle [63].
297
Selfish genetic elements
298
Inteins are selfish genetic elements that self-cleave and remove themselves from a
299
translated protein [64]. Proteins identified with these elements in phage Y3 are the large
300
terminase (Y3_004), C-5 cytosine methyltransferase (Y3_050) and one of the four predicted
301
tail fibre proteins (Y3_110). Also identified was a homing endonuclease of the HNH family
302
(Y3_179), situated between two hypothetical proteins. These homing endonuclease are
303
mobile genetic elements with endonuclease activity that only promote the spread of their
304
own encoding gene [65].
305
Cell wall degrading enzymes
306
Analysis of the protein Y3_301 showed that it possesses an N-terminal transmembrane
307
domain (12-34 aa) with a high alanine content (23%), with one basic residue (arginine), and a
308
C-terminal soluble lytic transglycosylase (SLT) domain (IPR008258; 108-218 aa). The latter
309
domain degrades peptidoglycan by cleavage of the β-1,4 glysosidic bond that results in the
AC C
EP
TE D
M AN U
SC
RI PT
288
14
Page 15 of 36
ACCEPTED MANUSCRIPT
formation of a 1,6-anhydrobond in the muramic acid residue. A homolog of this protein was
311
also found in phage Yoloswag (YOLOSWAG_297). Due the composition of the domains in this
312
protein, it was suspected to be an endolysin that followed the signal arrest (SAR) system like
313
gp45 of φKMV and lyz of phage P1 [66,67]. Analysis of the protein using SignalP 3.0, showed
314
that it possessed an N-terminal signal sequence with a probability of 0.729. However, the
315
more recent version of SignalP did not identify this domain suggesting that the N-terminal
316
secretory signal is not removed after export, but rather remains tethered to the inner
317
membrane after translocation by the host sec translocon system.
318
For number of these types of endolysin, an inducible system with E. coli permits
319
demonstration of SAR endolysin activity using a detectable reduction in culture turbidity
320
upon induction [67,68]. A similar experiment was conducted here for protein Y3_301.
321
Accordingly, gene Y3_301 was cloned into pET28a and transformed into E coli Lemo21 (DE3).
322
Expression of the cloned gene was lethal to cells, as addition of IPTG after 4 hours of growth
323
caused the OD600 to drop from ~0.480 to ~0.320 at 8 hours (Fig 6). No lytic activity was
324
detected after the removal of the signal sequence as seen with the induction of the
325
truncated transmembrane domain Y3_301 (rTM-Y3_301). As the toxic activity of this protein
326
was found to be dependent on the presence of its transmembrane domain, it could be
327
concluded that this protein behaves as a typical SAR endolysin, requiring the
328
transmembrane domain to act as a signal sequence for its translocation enabling the
329
catalytic domain of the protein to reach the periplasm where it acts on the peptidoglycan of
330
the bacterial cell resulting in lysis.
331
An important feature of SAR endolysins is the regulation of their enzymatic activity while in
332
their membrane-tethered form. The two main mechanisms that have been identified for
AC C
EP
TE D
M AN U
SC
RI PT
310
15
Page 16 of 36
ACCEPTED MANUSCRIPT
those with the lysozyme domain are (a) those which utilise steric hindrance of catalytic
334
residues due to the proximity of the bilayer to which the enzyme is embedded (as seen with
335
R21 of lambdoid phage 21) and (b) those which regulate activity by the formation of
336
disulphide bridges as seen with LyzP1 of coliphage P1 [69]. Little is understood of how SAR
337
endolysins with transglycosylase domains regulate their activity. However, the formation of
338
disulphide bridges is a possible explanation of how Y3_301 might regulate itself. Five
339
cysteine resides were identified in its amino acid sequence, one situated within the
340
transmembrane domain (Cys23) and the four-remaining located within its catalytic domain
341
(Cys127, 147, 182 & 188).
342
A second protein, namely Y3_197 was found to possess a soluble lytic transglycosylase (SLT)
343
domain (PF05838), with a peptidoglycan binding domain (IPR018537). A homolog of this
344
protein was also found in phage Yoloswag (YOLOSWAG_189). To date, we have not
345
succeeded in demonstrating its predicted enzymatic activity after cloning in E. coli (data not
346
shown).
347
Conclusion
348
As mentioned above, a number of hairy Myoviridae jumbo phages to date have been placed
349
in the Rak2-like group. The possession of whisker-like structures is shared with the unrelated
350
E. amylovora phage Y3, suggesting that these may be more widespread in the Myoviridae
351
family than previously thought. For the Rak2-like Escherichia phage 121Q, this feature has
352
been linked to a protein which resembles a phage tail fibre [47]. The exact gene products
353
responsible for the atypical whisker-like structures of Y3 is yet unknown. However, a few
354
genes have been predicted to encode tail fibre structures (Y3_104, 106, 108, 109, 110), and
355
any of these could potentially be responsible. The role of these atypical hairs of these phages
AC C
EP
TE D
M AN U
SC
RI PT
333
16
Page 17 of 36
ACCEPTED MANUSCRIPT
is still unknown, but is has been speculated to play a role in enhancing phage ability to
357
detect and infect its host by facilitating the phage adsorption processes [7,47].
358
Phylogenetic studies show that phages Y3 and Yoloswag establish a novel clade of jumbo
359
phages infecting E. amylovora (Fig 5). In the context of shared homologous proteins, Lavigne
360
et al (2009) defined phage subfamily and genus boundaries at ≥20% and ≥40%, respectively.
361
Based on these cut-off values, it would place phages Y3 and Yoloswag within a genus within
362
a subfamily to which NTCB, PaBG and Lu11 are also members and which is peripherally
363
related to ΦRSL1 (Table 2).
364
Little has been described about SAR endolysins in jumbo phages to date. However, a small
365
number of jumbo phages have been predicted to possess them. These include members of
366
the Rak2-like group such as Klebsiella phage vB_KleM-RaK2 (gp506), Pectobacterium phage
367
CBB (CBB_187), Cronobacter phage GAP32 (gp180) and coliphage Q121 (gp532) [5,70]; and
368
upon examination within our study, Erwinia phages Ea35-70 (Ea357_029), Deimos-Minion
369
(accession no. ANH52127.1; DM_29) and Simmy50 (accession no. KU886223.1; gp_30).
370
Phages Y3 and Yoloswag can be included in this group. These endolysins are known to be
371
highly prevalent in phages that infect Enterobacteriaceae [71]; and this may also be the case
372
for their jumbo phages, as a number of such endolysins have already been identified among
373
them.
374
Conflict of interest
375
There is no conflict of interest.
AC C
EP
TE D
M AN U
SC
RI PT
356
376
377
17
Page 18 of 36
ACCEPTED MANUSCRIPT Acknowledgements
379
†Alan Lucid passed away on 31 August 2017 during the preparation of the manuscript and
380
this paper is dedicated to his memory.
381
TEM imagines of phage Y3 were prepared by Rudi Lurz at the Max Planck Institue for
382
Molecular Genetic, Berlin.
383
This work was supported by Science Foundation Ireland Project Ref:12/R1/2335,
384
References
385
[1]
SC
Hatfull GF. Bacteriophage genomics. Curr Opin Microbiol 2008;11:447–53.
[3]
TE D
doi:10.1016/j.mib.2008.09.004.
Hendrix RW. Jumbo bacteriophages. Curr Top Microbiol Immunol 2009;328:229–40.
[4]
EP
doi:10.1007/978-3-540-68618-7-7.
390
391
M AN U
[2]
388
389
Ackermann H-W, Prangishvili D. Prokaryote viruses studied by electron microscopy. Arch Virol 2012;157:1843–9. doi:10.1007/s00705-012-1383-y.
386
387
RI PT
378
Yuan Y, Gao M. Jumbo Bacteriophages: An Overview. Front Microbiol 2017;8:403. doi:10.3389/fmicb.2017.00403.
394
AC C
392
395
46. doi:10.1016/j.virol.2014.05.003.
393
396 397
[5]
Abbasifar R, Griffiths MW, Sabour PM, Ackermann HW, Vandersteegen K, Lavigne R,
et al. Supersize me: Cronobacter sakazakii phage GAP32. Virology 2014;460–461:138–
[6]
Casey E, Fitzgerald B, Mahony J, Lugli GA, Ventura M, van Sinderen D. Genome Sequence of Serratia marcescens Phage BF. Genome Announc 2017;5:e00211-17.
18
Page 19 of 36
ACCEPTED MANUSCRIPT doi:10.1128/genomeA.00211-17.
398
399
[7]
Buttimer C, Hendrix H, Oliveira H, Casey A, Neve H, McAuliffe O, et al. Things Are Getting Hairy: Enterobacteria Bacteriophage vB_PcaM_CBB. Front Microbiol
401
2017;8:44. doi:10.3389/fmicb.2017.00044. [8]
McManus PS, Stockwell VO, Sundin GW, Jones AL. Antibiotic use in plant agriculture.
403
Annu Rev Phytopathol 2002;40:443–65.
404
doi:10.1146/annurev.phyto.40.120301.093927. [9]
Nischwitz C, Dhiman C. Streptomycin Resistance of Erwinia amylovora Isolated from
M AN U
405
SC
402
RI PT
400
406
Apple (Malus domesticus) in Utah. Plant Heal Prog 2013. doi:10.1094/PHP-2013-1025-
407
01-RS.
408
[10]
Tancos KA, Villani S, Kuehne S, Borejsza-Wysocka E, Breth D, Carol J, et al. Prevalence of streptomycin-resistant Erwinia amylovora in New York apple orchards. Plant Dis
410
2016;100:802–9. [11]
Yagubi AI, Castle AJ, Kropinski AM, Banks TW, Svircev AM. Complete Genome
EP
411
TE D
409
Sequence of Erwinia amylovora Bacteriophage vB_EamM_Ea35-70. Genome Announc
413
2014;2:e00413-14. doi:10.1128/genomeA.00413-14.
415
AC C
412
416
applicability of phages for the treatment of fire blight. FEMS Microbiol Lett
417
2014;350:25–7.
414
418 419
[12]
Meczker K, Dömötör D, Vass J, Rákhely G, Schneider G, Kovács T. The genome of the
Erwinia amylovora phage PhiEaH1 reveals greater diversity and broadens the
[13]
Dömötör D, Becságh P, Rákhely G, Schneider G, Kovács T. Complete genomic sequence of Erwinia amylovora phage PhiEaH2. J Virol 2012;86:10899. 19
Page 20 of 36
ACCEPTED MANUSCRIPT doi:10.1128/JVI.01870-12.
420
421
[14]
Esplin IND, Berg JA, Sharma R, Allen RC, Arens DK, Ashcroft CR, et al. Genome Sequences of 19 Novel Erwinia amylovora Bacteriophages. Genome Announc
423
2017;5:1–3.
424
[15]
RI PT
422
Yamada T, Satoh S, Ishikawa H, Fujiwara A, Kawasaki T, Fujie M, et al. A jumbo phage infecting the phytopathogen Ralstonia solanacearum defines a new lineage of the
426
Myoviridae family. Virology 2010;398:135–47. doi:10.1016/j.virol.2009.11.043. [16]
Adriaenssens EM, Mattheus W, Cornelissen A, Shaburova O, Krylov VN, Kropinski AM,
M AN U
427
SC
425
428
et al. Complete Genome Sequence of the Giant Pseudomonas Phage Lu11. J Virol
429
2012;86:6369–70. doi:10.1128/JVI.00641-12.
430
[17]
Pfreundt U, Spungin D, Hou S, Voß B, Berman-Frank I, Hess WR. Genome of a giant bacteriophage from a decaying Trichodesmium bloom. Mar Genomics 2016.
432
doi:10.1016/j.margen.2017.02.001. [18]
Sykilinda NN, Bondar AA, Gorshkova AS, Kurochkina LP, Kulikov EE, Shneider MM, et
EP
433
TE D
431
al. Complete Genome Sequence of the Novel Giant Pseudomonas Phage PaBG.
435
Genome Announc 2014;2:e00929-13-e00929-13. doi:10.1128/genomeA.00929-13.
436
[19]
437
[20]
AC C
434
Adams MH. Bacteriophages. Bacteriophages 1959. Born Y, Fieseler L, Marazzi J, Lurz R, Duffy B, Loessner MJ. Novel virulent and broad-
438
host-range Erwinia amylovora bacteriophages reveal a high degree of mosaicism and
439
a relationship to Enterobacteriaceae phages. Appl Environ Microbiol 2011;77:5945–
440
54. doi:10.1128/AEM.03022-10.
20
Page 21 of 36
ACCEPTED MANUSCRIPT 441
[21]
Sambrook J, Russell DW. Purification of Bacteriophage Lamda Particles by Isopycnic
442
Centrifugation through CsCl Gradients. Mol. Cloning A Lab. Manual, Vol. 1, Cold Spring
443
Harbor Laboratory Press; 2001, p. 2.47. [22]
Klumpp J, Dorscht J, Lurz R, Bielmann R, Wieland M, Zimmer M, et al. The terminally
RI PT
444
redundant, nonpermuted genome of Listeria bacteriophage A511: a model for the
446
SPO1-like myoviruses of gram-positive bacteria. J Bacteriol 2008;190:5753–65.
447
doi:10.1128/JB.00461-08. [23]
Laboratory Press; 2001.
449
450
Sambrook J, Russell DW. Molecular cloning : a laboratory manual. Cold Spring Harbor
[24]
M AN U
448
SC
445
Becker SA, Palsson B?, Best AA, DeJongh M, Disz T, Edwards RA, et al. The RAST Server: Rapid Annotations using Subsystems Technology. BMC Microbiol 2005;5:8.
452
doi:10.1186/1471-2180-5-8. [25]
[26]
Finn RD, Coggill P, Eberhardt RY, Eddy SR, Mistry J, Mitchell AL, et al. The Pfam protein families database: towards a more sustainable future. Nucleic Acids Res
456
2015;44:D279-85. doi:10.1093/nar/gkv1344.
457
458
EP
identification with GLIMMER. Nucleic Acids Res 1999;27:4636–41.
454
455
Delcher AL, Harmon D, Kasif S, White O, Salzberg SL. Improved microbial gene
AC C
453
TE D
451
[27]
Mitchell A, Chang H-Y, Daugherty L, Fraser M, Hunter S, Lopez R, et al. The InterPro
459
protein families database: the classification resource after 15 years. Nucleic Acids Res
460
2014;43:D213-21. doi:10.1093/nar/gku1243.
461 462
[28]
Söding J, Biegert A, Lupas AN. The HHpred interactive server for protein homology detection and structure prediction. Nucleic Acids Res 2005;33:W244-8. 21
Page 22 of 36
ACCEPTED MANUSCRIPT doi:10.1093/nar/gki408.
463
464
[29]
Käll L, Krogh A, Sonnhammer ELL. A combined transmembrane topology and signal peptide prediction method. J Mol Biol 2004;338:1027–36.
466
doi:10.1016/j.jmb.2004.03.016. [30]
peptides from transmembrane regions. Nat Meth 2011;8:785–6.
468
469
Petersen TN, Brunak S, von Heijne G, Nielsen H. SignalP 4.0: discriminating signal
[31]
Juncker AS, Willenbrock H, von Heijne G, Brunak S, Nielsen H, Krogh A. Prediction of
SC
467
RI PT
465
lipoprotein signal peptides in Gram-negative bacteria. Protein Sci 2003;12:1652–62.
471
doi:10.1110/ps.0303703. [32]
genes in genomic sequence. Nucleic Acids Res 1997;25:955–64. [33]
nucleotide sequences. Nucleic Acids Res 2004;32:11–6. doi:10.1093/nar/gkh152. [34]
Bioinformatics 2005;21:537–9. doi:10.1093/bioinformatics/bti054.
477
478
[35]
[36]
483
Zuker M. Mfold web server for nucleic acid folding and hybridization prediction. Nucleic Acids Res 2003;31:3406–15.
481
482
Naville M, Ghuillot-Gaudeffroy A, Marchais A, Gautheret D. ARNold: a web tool for the
prediction of Rho-independent transcription terminators. RNA Biol 2011;8:11–3.
479
480
Stothard P, Wishart DS. Circular genome visualization and exploration using CGView.
EP
475
476
Laslett D, Canback B. ARAGORN, a program to detect tRNA genes and tmRNA genes in
TE D
473
474
Lowe TM, Eddy SR. tRNAscan-SE: a program for improved detection of transfer RNA
AC C
472
M AN U
470
[37]
Bailey TL, Boden M, Buske FA, Frith M, Grant CE, Clementi L, et al. MEME SUITE: tools for motif discovery and searching. Nucleic Acids Res 2009;37:W202-8.
22
Page 23 of 36
ACCEPTED MANUSCRIPT doi:10.1093/nar/gkp335.
484
[38]
Bioinformatics 2011;27:1009–10. doi:10.1093/bioinformatics/btr039.
486
487
Sullivan MJ, Petty NK, Beatson SA. Easyfig: a genome comparison visualizer.
[39]
Kumar S, Stecher G, Tamura K. MEGA7: Molecular Evolutionary Genetics Analysis
RI PT
485
488
Version 7.0 for Bigger Datasets. Mol Biol Evol 2016;33:1870–4.
489
doi:10.1093/molbev/msw054. [40]
Jones DT, Taylor WR, Thornton JM. The rapid generation of mutation data matrices
SC
490
from protein sequences. Bioinformatics 1992;8:275–82.
492
doi:10.1093/bioinformatics/8.3.275.
493
[41]
M AN U
491
Whelan S, Goldman N. A general empirical model of protein evolution derived from multiple protein families using a maximum-likelihood approach. Mol Biol Evol
495
2001;18:691–9.
496
[42]
TE D
494
Turner D, Reynolds D, Seto D, Mahadevan P. CoreGenes3.5: a webserver for the determination of core genes from sets of viral and small bacterial genomes. BMC Res
498
Notes 2013;6:140. doi:10.1186/1756-0500-6-140. [43]
Schlegel S, Löfblom J, Lee C, Hjelm A, Klepsch M, Strous M, et al. Optimizing
Membrane Protein Overexpression in the Escherichia coli strain Lemo21(DE3). J Mol
500
Biol 2012;423:648–59. doi:10.1016/J.JMB.2012.07.019.
501
502
AC C
499
EP
497
[44]
Falkenstein H, Bellemann P, Walter S, Zeller W, Geider K. Identification of Erwinia
503
amylovora, the Fireblight Pathogen, by Colony Hybridization with DNA from Plasmid
504
pEA29. Appl Environ Microbiol 1988;54:2798–802.
23
Page 24 of 36
ACCEPTED MANUSCRIPT 505
[45]
2007;152:227–43. doi:10.1007/s00705-006-0849-1.
506
507
Ackermann H-W. 5500 Phages examined in the electron microscope. Arch Virol
[46]
Kropinski AM, Prangishvili D, Lavigne R. Position paper: The creation of a rational scheme for the nomenclature of viruses of Bacteria and Archaea. Environ Microbiol
509
2009;11:2775–7. doi:10.1111/j.1462-2920.2009.01970.x. [47]
[48]
Ackermann HW, Auclair P, Basavarajappa S, Konjin HP, Savanurmath C. Bacteriophages from Bombyx mori. Arch Virol 1994;137:185–90.
513
514
SC
Pittsburgh, 2016.
511
512
Hua J. Capsid structure and DNA packing in jumbo bacteriophage. University of
M AN U
510
RI PT
508
[49]
Tsones J. Use of Bacteriophages as Antimicrobials Against Avian Pathogenic Escherichia coli Infections in Poultry : from Isolation to Therapeutic. KU Leuven and
516
Vrije Universiteit Brussel, 2014.
517
[50]
TE D
515
Casjens SR, Gilcrease EB. Determining DNA Packaging Strategy by Analysis of the Termini of the Chromosomes in Tailed-Bacteriophage Virions. In: Clokie MRJ,
519
Kropinski AM, editors. Bacteriophages Methods Protoc. Vol. 2 Mol. Appl. Asp.,
520
Totowa, NJ: Humana Press; 2009, p. 91–111. doi:10.1007/978-1-60327-565-1_7.
522
AC C
EP
518
523
Conserved Core. PLoS One 2013;8:e55644. doi:10.1371/journal.pone.0055644.
521
524
[51]
Mann RA, Smits THM, Bühlmann A, Blom J, Goesmann A, Frey JE, et al. Comparative
Genomics of 12 Strains of Erwinia amylovora Identifies a Pan-Genome with a Large
[52]
Powney R, Smits THM, Sawbridge T, Frey B, Blom J, Frey JE, et al. Genome sequence of
525
an Erwinia amylovora strain with pathogenicity restricted to Rubus plants. J Bacteriol
526
2011;193:785–6. doi:10.1128/JB.01352-10. 24
Page 25 of 36
ACCEPTED MANUSCRIPT 527
[53]
Marín A, Xia X. GC skew in protein-coding genes between the leading and lagging
528
strands in bacterial genomes: New substitution models incorporating strand bias. J
529
Theor Biol 2008;253:508–13. doi:10.1016/j.jtbi.2008.04.004. [54]
Sandt CH, Hopper JE, Hill CW. Activation of Prophage eib Genes for Immunoglobulin-
RI PT
530 531
Binding Proteins by Genes from the IbrAB Genetic Island of Escherichia coli ECOR-9. J
532
Bacteriol 2002;184:3640–8. doi:10.1128/JB.184.13.3640-3648.2002. [55]
Trotter M, McAuliffe O, Callanan M, Edwards R, Fitzgerald GF, Coffey A, et al. Genome
SC
533
analysis of the obligately lytic bacteriophage 4268 of Lactococcus lactis provides
535
insight into its adaptable nature provides insight into its adaptable nature provides
536
insight into its adaptable nature. Gene 2006;366:189–99.
537
doi:10.1016/j.gene.2005.09.022. [56]
Šimoliūnas E, Kaliniene L, Truncaite L, Zajančkauskaite A, Staniulis J, Kaupinis A, et al.
TE D
538
M AN U
534
539
Klebsiella Phage vB_KleM-RaK2 — A Giant Singleton Virus of the Family Myoviridae.
540
PLoS One 2013;8. doi:10.1371/journal.pone.0060717.
Springer; 2004.
542
543
[58]
Grose JH, Casjens SR. Understanding the enormous diversity of bacteriophages: the
tailed phages that infect the bacterial family Enterobacteriaceae. Virology 2014;468–
544
470:421–43. doi:10.1016/j.virol.2014.08.024.
545
546
Garrity GM. Bergey’s manual of systematic bacteriology. Vol. 2, The proteobacteria.
EP
[57]
AC C
541
[59]
Ghosh N, McKillop TJ, Jowitt TA, Howard M, Davies H, Holmes DF, et al. Collagen-Like
547
Proteins in Pathogenic E. coli Strains. PLoS One 2012;7:e37872.
548
doi:10.1371/journal.pone.0037872.
25
Page 26 of 36
ACCEPTED MANUSCRIPT 549
[60]
Guo J, Cheng H, Zhao S, Yu L. GG: A domain involved in phage LTF apparatus and
550
implicated in human MEB and non-syndromic hearing loss diseases. FEBS Lett
551
2006;580:581–4. doi:10.1016/j.febslet.2005.12.076.
2002;61:471–80. doi:10.1006/tpbi.2002.1590.
553
[62]
Opin Microbiol 2016;31:161–8. doi:10.1016/j.mib.2016.04.002.
557
558
[64]
[65]
Edgell DR, Gibb EA, Belfort M. Mobile DNA elements in T4 and related phages. Virol J 2010;7:290. doi:10.1186/1743-422X-7-290.
561
[66]
Xu M, Struck DK, Deaton J, Wang I-N, Young R. A signal-arrest-release sequence
563
mediates export and control of the phage P1 endolysin. Proc Natl Acad Sci
564
566
AC C
562
Tori K, Perler FB. Expanding the Definition of Class 3 Inteins and Their Proposed Phage Origin. J Bacteriol 2011;193:2035–41. doi:10.1128/JB.01407-10.
559
560
Hurwitz BL, U’Ren JM. Viral metabolic reprogramming in marine ecosystems. Curr
M AN U
[63]
SC
bacterial defences. Nat Rev Microbiol 2013;11:675–87. doi:10.1038/nrmicro3096.
555
556
Samson JE, Magadán AH, Sabri M, Moineau S. Revenge of the phages: defeating
TE D
554
Hendrix RW. Bacteriophages: Evolution of the Majority. Theor Popul Biol
RI PT
[61]
EP
552
567
30. doi:10.4161/bact.1.1.14868.
565
568 569
2004;101:6415–20. doi:10.1073/pnas.0400957101.
[67]
Briers Y, Peeters LM, Volckaert G, Lavigne R. The lysis cassette of bacteriophage ϕKMV
encodes a signal-arrest-release endolysin and a pinholin. Bacteriophage 2011;1:25–
[68]
Lynch KH, Abdu AH, Schobert M, Dennis JJ. Genomic characterization of JG068, a novel virulent podovirus active against Burkholderia cenocepacia. BMC Genomics
26
Page 27 of 36
ACCEPTED MANUSCRIPT 2013;14:574. doi:10.1186/1471-2164-14-574.
570
571
[69]
disulfide caging. J Bacteriol 2010;192:5682–7. doi:10.1128/JB.00674-10.
572
[70]
Buttimer C, O’Sullivan L, Elbreki M, Neve H, McAuliffe O, Ross RP, et al. Genome
RI PT
573
Kuty GF, Xu M, Struck DK, Summer EJ, Young R. Regulation of a phage endolysin by
574
Sequence of Jumbo Phage vB_AbaM_ME3 of Acinetobacter baumanni. Genome
575
Announc 2016;4:e00431-16. doi:10.1128/genomeA.00431-16. [71]
Oliveira H, Melo LDR, Santos SB, Nóbrega FL, Ferreira EC, Cerca N, et al. Molecular
SC
576
aspects and comparative genomics of bacteriophage endolysins. J Virol
578
2013;87:4558–70. doi:10.1128/JVI.03277-12.
M AN U
577
579
583
584
585
EP
582
AC C
581
TE D
580
586
587
588
27
Page 28 of 36
ACCEPTED MANUSCRIPT 589
Table 1. Host range of bacteriophage vB_EamM_Y3 tested on strains of Erwinia amylovora
590
and closely related genera and species, as determined by spot testing with dilution.
RI PT
Sensitivity + + + + + + + + + + + + + + + + + -
SC
Strain CFBP 1430 Erwinia amylovora CFBP 1232 Ea153 1/74 1/79 ACW 38899 ACW 56400 ACW 44274 Ea Rac 3075 ACW 55500 ACW 55835 ACW 55955 IPV 1077/7 01SFR-BO LA 469 LA 071 LA 411 LA 468 LA 477 ACW 40943 Erwinia persicina ACW 40560x ACW 41072 Erwinia billingiae Pagg B90 Eh 42 Pantoea agglomerans Em 406 Em 283 Pantoea vagans C9-1 Pantoea ananatis 351 Lys Results recorded as +, sensitive; −, no infecqon.
591
592
AC C
EP
TE D
M AN U
Species
593
28
Page 29 of 36
ACCEPTED MANUSCRIPT 594
Table 2. Homologous proteins shared between the phages related to Erwinia amylovora
595
phage Y3 (Yoloswag, NTCB, Pseudomonas phages PaBG and Lu11 and Ralstonia phage
596
ΦRSL1).
597
Yoloswag NCTB PaBG Lu11 ΦRSL1 225 112 105 87 55 Y3 (333) (67.57%) (33.63%) (31.53%) (26.13%) (16.52%) Yoloswag 225 113 102 87 53 (334) (67.37%) (26.05%) (15.87%) (33.83%) (30.54%) 112 113 95 88 53 NCTB (301) (37.21%) (37.54%) (31.56%) (29.24%) (17.61) 105 102 95 136 65 PaBG (308) (34.09%) (33.12%) (30.84%) (44.16%) (21.1%) 87 87 88 136 62 Lu11 (391) (22.25%) (22.25%) (22.51) (34.78%) (15.86%) 55 53 53 65 62 ΦRSL1 (343) (16.03%) (15.45%) (15.45) (18.95%) (18.08%) Number in bracket next phage name is total predict proteins with percentages representing
598
the portion of shared proteins within the proteome of the phage in question. Analyses were
599
conducted using Coregenes with the BLASTP threshold set to 75%.
AC C
EP
TE D
M AN U
SC
RI PT
Y3
29
Page 30 of 36
600
TE D
M AN U
SC
RI PT
ACCEPTED MANUSCRIPT
Fig 1. Electron micrograph of Erwinia amylovora phage Y3. Phage particles were negatively
602
stained with 2% ammonium molybdate. Hair-like structures are evident around the tails.
603
Scale bar represents 100 nm
AC C
EP
601
30
Page 31 of 36
AC C
604
EP
TE D
M AN U
SC
RI PT
ACCEPTED MANUSCRIPT
605
Fig 2. Summary of the genomic organisation of the 261,365 bp genome of Erwinia amylovora
606
phage Y3. On the outer ring, putative ORFs are represented by purple arrowheads labelled
607
with predicted function where possible. On the middle ring is shown GC content relative to
608
the mean GC content of the genome. On the inner ring, GC skew is illustrated where green
609
represents positive skew and purple a negative skew.
31
Page 32 of 36
M AN U
SC
RI PT
ACCEPTED MANUSCRIPT
610
Fig 3. Comparison of the genomes of phage Y3 with jumbo phages Erwinia amylovora phage
612
Yoloswag, phage NCTB, Pseudomonas phages Lu11 and PaBG and Ralstonia phage ΦRSL1
613
using currently available annotations employing TBLASTX and visualised with Easyfig. The
614
blue/red bar chart (top) shows the G+C skew of the Y3 genome. The genome maps comprise
615
of arrows indicating locations and orientation of ORFs. Lines between genome maps indicate
616
level of homology. To assist in the comparison, the largest gene of Y3 was set to the first
617
position of its map with the corresponding homologous gene being set to the first position
618
for the other phage genomes. Level of amino acid identity is shown via the gradient scale.
AC C
EP
TE D
611
32
Page 33 of 36
RI PT
ACCEPTED MANUSCRIPT
M AN U
SC
619
621
AC C
EP
TE D
620
33
Page 34 of 36
RI PT
ACCEPTED MANUSCRIPT
622
Fig 4. Phylogenetic analysis using the portal vertex protein (phylogram A), DNA polymerase
624
(B), large terminase (C) and ATP dependent DNA helicase (D) using the six phages Y3,
625
Yoloswag, NCTB, Lu11, PaBG and ΦRSL1. The amino acid sequences were compared using
626
MUSCLE. The tree was constructed using the maximum likelihood algorithm. The
627
percentages of replicate trees were assessed with the bootstrap test (1000).
AC C
EP
TE D
M AN U
SC
623
628
34
Page 35 of 36
M AN U
SC
RI PT
ACCEPTED MANUSCRIPT
629
Fig 5. Phylogenetic analysis using the large terminase (phylogram A) and portal protein (B)
631
of jumbo Erwinia amylovora myoviruses with genome sequences submitted to Genbank to
632
date. The amino acid sequences were compared using MUSCLE. The tree was constructed
633
using the maximum likelihood algorithm. The percentages of replicate trees were assessed
634
with the bootstrap test (1000)
EP
AC C
635
TE D
630
35
Page 36 of 36
SC
RI PT
ACCEPTED MANUSCRIPT
M AN U
636
Fig 6. Growth curve of E. coli Lemo21 (DE3) carrying pET28a with the SAR endolysin Y3_301
638
and its truncated derivative without the transmembrane domain (rTM-Y3_301). The culture
639
was induced at OD600 ~0.450 with 400 µM IPTG (indicated with arrow). All OD measurements
640
were taken in triplicate and the averages are represented.
641
Supplementary material
642
Table S1, Table S2 and Table S3, Table S4
AC C
EP
TE D
637
36