Accepted Manuscript Complete genome sequence and description of Lactococcus garvieae M14 isolated from Algerian fermented milk Meryem Moumene, Fatima Drissi, Olivier Croce, Bilel Djebbari, Catherine Robert, Emmanouil Angelakis, Djamel Eddine Benouareth, Didier Raoult, Dr. Vicky Merhej PII:
S2052-2975(16)00011-1
DOI:
10.1016/j.nmni.2016.01.009
Reference:
NMNI 122
To appear in:
New Microbes and New Infections
Received Date: 20 November 2015 Revised Date:
28 December 2015
Accepted Date: 14 January 2016
Please cite this article as: Moumene M, Drissi F, Croce O, Djebbari B, Robert C, Angelakis E, Benouareth DE, Raoult D, Merhej V, Complete genome sequence and description of Lactococcus garvieae M14 isolated from Algerian fermented milk, New Microbes and New Infections (2016), doi: 10.1016/j.nmni.2016.01.009. This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
ACCEPTED MANUSCRIPT Complete genome sequence and description of Lactococcus garvieae M14 isolated from Algerian fermented milk Meryem Moumene1,2*, Fatima Drissi1*, Olivier Croce1, Bilel Djebbari1, Catherine Robert1, Emmanouil Angelakis1, Djamel Eddine Benouareth2, Didier Raoult1,3, Vicky Merhej1# Aix Marseille Université, URMITE, UM63, CNRS 7278, IRD 198, INSERM 1095, Institut
RI PT
1
Hospitalo-Universitaire, Marseille, France. University 08 may 1945, Guelma, Algeria
3
Special Infectious Agents Unit, King Fahd Medical Research Center, King Abdulaziz University, Jeddah, Saudi
Arabia * Both authors contributed equally to this study.
Dr. Vicky Merhej
M AN U
# Corresponding author :
SC
2
Aix-Marseille Université, Faculté de Médecine, 27 Bd Jean Moulin, 13385 Marseille cedex 5 Email:
[email protected] Phone: (33) 491 38 55 17
TE D
Fax: (33) 491 83 03 90
Keywords: Lactococcus garvieae, fermented milk, lactic acid bacteria, genome, sequencing, taxonomy.
AC C
EP
Running head: Lactococcus garvieae genome.
1
ACCEPTED MANUSCRIPT Complete genome sequence and description of Lactococcus garvieae M14 isolated from Algerian fermented milk
Keywords: Lactococcus garvieae, fermented milk, lactic acid bacteria, genome, sequencing.
AC C
EP
TE D
M AN U
SC
RI PT
Running head: Lactococcus garvieae genome.
ACCEPTED MANUSCRIPT Abstract
2
We describe using a polyphasic approach that combines proteomic by MALDI-TOF spectra
3
analysis, genomic data and phenotypic characterization the features of Lactococcus garvieae
4
strain M14 newly isolated from the fermented milk (Raib) of an Algerian cow. The 2,188,835
5
bp containing genome sequence displays a metabolic capacity to form acid fermentation that
6
is very useful for industrial applications and encodes for two bacteriocins responsible for its
7
eventual bioprotective properties.
AC C
EP
TE D
M AN U
SC
RI PT
1
ACCEPTED MANUSCRIPT 8
Introduction Lactococcus garvieae is a LAB that has commonly been used in manufacture of many
10
varieties of cheese and other fermented milk products [1,2] and meat products [3]. The ability
11
of some LAB to produce proteins with bactericidal properties called bacteriocins, led to their
12
potential utilization as biopreservatives in food industry against a range of pathogenic
13
bacteria, including Listeria sp. and Clostridium sp. [4,5]. Also, because of bacteriocins, some
14
LAB are thought to act as bioprotective organisms that play a major role in the composition of
15
the microbiota [6]. First isolated from cases of bovine mastitis [7], L. garvieae has gained
16
recognition as a potential pathogen of various fish species, including rainbow trout [8].
17
Moreover, L. garvieae has been involved in many clinical cases including infective
18
endocarditis associated with septicemia, spondylodiscitis, urinary and skin infections [9-15].
19
Genomic interspecies microarray hybridization and pan-genome comparative analysis of the
20
pathogenic strain Lg2 and the non-virulent strain YT-3 identified genes encoding for host
21
colonization and the development of pathogenesis including a capsule gene cluster and genes
22
encoding for a myosin-cross-reactive antigen and haemolysin [16,17]. The phenotypic
23
diversity of L. garvieae seems to be related to the specific animal host they colonize [18,19].
24
Altogether, the analysis of L. garvieae genomes isolated from a dairy product and its
25
comparison with pathogenic isolates seems to be necessary to unveil the global genetic
26
variations that may justify, at least in part, the observed phenotypic differences.
SC
M AN U
TE D
EP
AC C
27
RI PT
9
In this work, we have isolated and identified a new strain of Lactococus garvieae from
28
the fermented milk product of an Algerian cow, as a part of the study of LAB and revealed
29
their antibacterial activity. A total of 47 different bacterial species including Lactococcus and
30
Streptococcus spp. as identified by API 50CHL and mass spectrometry, were isolated from
31
the milk product specimens (unpublished data). Lactococcus spp. was the only species with
32
antibacterial effect as shown by the agar well-diffusion assay. We accomplished deep studies
ACCEPTED MANUSCRIPT including phenotypic, polyphasic taxonomy, genotypic and phylogenetic analyses. We display
34
below a set of features for the identified Lactococcus garvieae strain M14 together with the
35
description of the complete genome sequence and annotation. We present a comparative
36
genome analysis of all available sequences of closely related species to L. garvieae. This
37
integrative approach dealing with a large set of data helds the potential to explore the
38
relationship between the presence of L. garvieae in dairy and food safety in order to recognize
39
and prevent a potential hazard for the consumers.
SC
40
RI PT
33
Materials and methods
42
Samples Collection
43
The raw cow milk samples were collected from a farm localized at Guelma (in the Est of
44
Algeria) in sterile glass bottles and transported in an isotherm container to the laboratory. The
45
200 ml of milk samples was allowed to clot at room temperature to promote the development
46
of endogenous lactic flora, according to Zadi-Karam and Karam [20].
47
Strain isolation
48
M17 agar media (SIGMA ALDRICH) prepared in accordance with manufacturer’s
49
instructions was used for the growth of LAB strain screened in this investigation. Thus, 0.1 ml
50
of fermented milk sample was plated onto M17 to promote the bacterial flora cultivation in
51
aerobic conditions by incubation at 37°C for 24 hours. Lactococcus garvieae strain M14 was
52
then isolated and stored at -20°C until its further use.
53
Phenotypic, genotypic and phylogenetic analyses
AC C
EP
TE D
M AN U
41
ACCEPTED MANUSCRIPT We used 16S ribosomal RNA gene sequencing (16S rRNA) to provide genus and species
55
identification for the isolate [21] and taxonomic classification of strain M14. A comparison of
56
nucleotides query sequences against the nucleotide sequence database was also performed
57
using Basic Local Alignment Search Tool (BLAST). The 16S rRNA sequences of all
58
Lactococcus strains with draft genome, were searched within the scaffolds using RNAmmer
59
server [22]. The phylogenetic tree based on almost complete 16S rRNA gene sequences with
60
a minimum length of 1,517 nucleotides were reconstructed using distance matrix (neighbor-
61
joining) within the MEGA software version 5 [23]. Sequences were aligned using Clustal X
62
version 2.0. [24].
63
Different temperatures (room temperature, 28°C, 37°C and 45°C) were tested to determine
64
growth temperature range and the optimal growth temperature for the strain. Growth of the
65
strain was tested on 5% sheep blood agar under anaerobic and microaerophilic conditions
66
using the GENbag anaer and GENbag microaerosystems, respectively (BioMérieux, Marcy
67
l’Etoile, France), and in aerobic conditions, with or without 5% CO2.
68
L. garvieae strain M14 morphology was characterized by transmission electron spectroscopy
69
(TEM) using a Morgani 268D (Philips) spectrometer with operating voltage of 60kV.
70
Polyphasic taxonomic identification by manufactured kits is widely used, the API 50CH
71
carbohydrate fermentation strips and API ZYM enzyme test system (bioMérieux, France)
72
were used to determine the biochemical profile of strain M14 according to the manufacturer’s
73
instructions.
74
Protein
75
desorption/ionization time-of-flight (MALDI-TOF) technique as previously described [25],
76
using a Microflex spectrometer (Bruker Daltonics, Leipzig, Germany). Twelve distinct
77
deposits were made for strain M14 from 12 isolated colonies.
AC C
EP
TE D
M AN U
SC
RI PT
54
Mass
spectroscopy
analysis
was
carried
out
by
Matrix-assisted
laser-
ACCEPTED MANUSCRIPT The twelve collected spectra for M14 were imported into the MALDI BioTyper software
79
(version 2.0, Bruker) and analyzed by standard pattern matching (with default parameter
80
settings) against 7.289 bacterial spectra including 26 spectra from three L. garvieae species,
81
used as reference data, in the BioTyper database. Interpretation of scores as established by the
82
manufacturer Bruker Daltonics was as follows: a score > 1.9 to a validly published species
83
enabled the identification at the species level, a score > 1.7 but < 1.9 enabled the
84
identification at the genus level; and a score < 1.7 did not enable any identification.
85
Growth conditions and genomic DNA preparation
86
L. garvieae was grown on 5 % sheep blood-enriched Columbia agar (BioMerieux, France) at
87
37°C in aerobic atmosphere. Bacteria grown on three Petri dishes were harvested and
88
resuspended in 4x100µL of TE buffer. Then, 200 µL of this suspension was diluted in 1ml TE
89
buffer for lysis treatment. After a lysozyme incubation of 30 minutes at 37°C the lysis was
90
performed with lauryl sarcosyl by 1% final and RNAse A treatment at 50µG/µL final
91
concentration during 1hr at 37°C, followed by an overnight Proteinase K incubation at 37°C.
92
Extracted DNA was then purified using three successive phenol-chloroform extractions and
93
ethanol precipitation at -20°C overnight. After centrifugation, the DNA was resuspended in
94
70 µL TE buffer. The yield and concentration were measured by the Quant-it Picogreen kit
95
(Invitrogen) on the Genios-Tecan fluorometer at 113ng/µl.
96
Genome sequencing and assembly
97
Genomic DNA of L. garvieae was sequenced using MiSeq Technology sequencer (Illumina
98
Inc, San Diego, CA, USA) with the mate pair strategy. The gDNA was barcoded in order to
99
be mixed with 11 others projects with the Nextera Mate Pair sample prep kit (Illumina). The
100
DNAg was quantified by a Qubit assay with the high sensitivity kit (Life technologies,
101
Carlsbad, CA, USA) to 40.8/µl .The mate pair library was prepared with 1µg of genomic
AC C
EP
TE D
M AN U
SC
RI PT
78
ACCEPTED MANUSCRIPT DNA using the Nextera mate pair Illumina guide. The genomic DNA sample was
103
simultaneously fragmented and tagged with a mate pair junction adapter. The profile of the
104
fragmentation was validated on an Agilent 2100 BioAnalyzer (Agilent Technologies Inc,
105
Santa Clara, CA, USA) with a DNA 7500 labchip. The DNA fragments exhibited a mean of
106
4.5 kb (4,486 pb). No size selection was performed and only 308.9 ng of tagmented fragments
107
were circularized. The circularized DNA was mechanically sheared to small fragments with
108
an optimal size of 652 bp on the Covaris device S2 in microtubes (Covaris, Woburn, MA,
109
USA).The library profile was visualized on a High Sensitivity Bioanalyzer LabChip (Agilent
110
Technologies Inc, Santa Clara, CA, USA. The libraries were normalized at 2 nM and pooled.
111
After a denaturation step and dilution at 10 pM, the pool of libraries was loaded onto the
112
reagent cartridge and then onto the instrument along with the flow cell. Automated cluster
113
generation and sequencing run were performed in a single 42-hours run in a 2x251-bp.Total
114
information of 8.6 Gb was obtained from a 950 K/mm2 cluster density with a cluster passing
115
quality control filters of 93.12% (18,182,000 clusters). Within this run, the index
116
representation for L. garvieae was determined to 9.73%. The whole-genome shotgun strategy
117
using Illumina sequencing technology gave 3,294,808 reads. llumina reads where trimmed
118
using Trimmomatic [26], then assembled thought Spades software [27,28]. Contigs obtained
119
were combined together by SSpace [29] and Opera software v1.2 [30] helped by GapFiller
120
V1.10 [31] to reduce the set. Some manual refinements using CLC Genomics v7 software
121
(CLC bio, Aarhus, Denmark) and homemade tools in Python improved the genome.
122
Genome annotation
123
Non-coding genes and miscellaneous features were predicted using RNAmmer [22],
124
ARAGORN [32], Rfam [33], PFAM [34], Infernal [35]. Coding DNA sequences (CDSs)
125
were predicted using Prodigal [36] and functional annotation was achieved using BLAST+
126
[37] and HMMER3 [38] against the UniProtKB database [39]. The KEGG orthology [40]
AC C
EP
TE D
M AN U
SC
RI PT
102
ACCEPTED MANUSCRIPT annotation was done by KAAS online server [41] using the SBH method. The pathways in
128
which each gene might be involved were derived from the best KO hit. Gene functions were
129
assigned by Clusters of Orthologous Groups (COGs) [42,43]. The bacteriocin database of the
130
URMITE, named BUR database, was used to annotate genes encoding for bacteriocins [44].
131
Results and discussion
132
Organism classification and features
133
The sequenced 16S rRNA gene of strain M14 was deposited in the GenBank database under
134
the accession number LK985397. A BLAST search against the nucleotide database showed
135
that strain M14 was most closely related to Lactococcus species with gene sequence identity
136
value of 99.7 % with Lactococcus garvieae YT-3. Based on the comparative sequence
137
analysis of 16S rRNA gene sequence, strain M14 belongs to the already described species L.
138
garvieae [45,46].
139
Neighbour-joining phylogenetic tree, based on almost-complete 16S rRNA gene sequences of
140
Lactococcus garvieae M14 strain and closely related species is shown in Fig. 1. Sequences of
141
the closest species including Lactococcus garvieae strains and strains of Lactococcus lactis
142
subsp. lactis (NR_103918), Lactococcus lactis subsp. cremoris (NR_074949), Lactobacillus
143
rhamnosus GG (NR_102778), Lactobacillus sakei subsp. sakei 23K (NR_075042),
144
Lactobacillus plantarum WCFS1 (NR_075041), Lactobacillus fermentum IFO 3956
145
(NR_075033), Lactobacillus salivarius UCC118 (NR_074589) and Lactobacillus ruminis
146
ATCC 27782 (NR_102839) were aligned with the 16S rRNA gene sequence of the strain
147
M14. The strain M14 formed together with Lactococcus garvieae strains and the L. lactis
148
species a common lineage supported by a high bootstrap value of 99% (Fig. 1).
149
Growth occurred for all the tested temperatures, but the optimal growth was observed at 37°C.
150
The colonies were 1- 6 mm in diameter and moderately opaque in facultative anaerobic
AC C
EP
TE D
M AN U
SC
RI PT
127
ACCEPTED MANUSCRIPT conditions on enriched-blood Columbia agar (BioMerieux) and appeared whitish in color at
152
28°C. The motility test was negative. Gram staining showed Gram positive non sporulating
153
cocci. Cells grown on agar exhibit length ranging from 0.79 to 0.93 µm (mean 0.86 µm) and a
154
diameter ranging from 0.59 to 0.63 µm (mean 0.61 µm) as determined by negative staining
155
TEM micrograph.
156
Strain M14 did neither have catalase nor oxidase activity. Using the API 50CH system a
157
positive reaction was observed for D-ribose, D-glucose, D-fructose, D-mannose, D-galactose,
158
D-mannitol, amygdalin, arbutin, N-acetylglucosamine, esculin, salicine, D-cellobiose, D-
159
lactose, D-saccharose and D-trehalose. Negative reactions were observed for glycerol,
160
erythriol,
161
xylopyranoside,
162
mannopyranoside, Metyl-αD-glucopyranoside, D-maltose, D-melibiose, inulin, D-melezitose,
163
D-raffinose, starch, glycogen, xylitol, Gentiobiose, D-turanose, D-lyxose, D-tagatose, D-
164
fucose, L-fucose, D-arabitol and gluconate. Using the API ZYM system, negative reactions
165
were observed for alkaline phosphatase, cystine arylamidase (proteases), trypsin, alpha-
166
galactosidase (melibiase), beta-glucosidase (cellulase), alpha-mannosidase, alpha-fucosidase,
167
positive reactions were observed for esterase, esterase lipase, lipase, leucine and valine
168
arylamidase, alpha-chemotrypsin, acid phosphatase, beta-galactosidase, beta-glucuronidase,
169
alpha- and beta-glucosidase. The urease reaction, nitrate reduction and indole production were
170
negative. When compared to the phylogenetically close species from Lactococcus and
171
Lactobacillus [17,47-53] L. garvieae strain M14 exhibited the phenotypic differences detailed
172
in Table 1. L. garvieae was susceptible to amoxicilline, imipenem, piperacilin, ciprofloxacine,
173
ceftriaxon,
174
rifampicin but resistant to cefoxitin and cotrimoxazol.
175
Extended features descriptions
D-arabinose,
M AN U
SC
RI PT
151
L-arabinose,
L-rhamnose,
L-xylose,
Dulcitol,
Inositol,
D-adonitol,
Methyl-βD-
D-sobitol,
Methyl-αD-
AC C
EP
TE D
L-sorbose,
D-xylose,
erythromycin,
vancomycin,
nitrofurantoin,
nitrofurantoin,
metronidazol,
ACCEPTED MANUSCRIPT MALDI-TOF analysis results of strain M14 showed scores ranging from 2.177 to 2.343 with
177
Lactococcus spp., suggesting that our isolate was a member of Lactococcus species yet not a
178
member of known strains. We incremented our database with the spectrum of strain M14.
179
Spectrum differences with others of phylogenetically close species are shown in Figure 2-a.
180
The gel view (Fig. 2-b) displays the raw spectra of loaded spectrum files arranged in a
181
pseudo-gel like look. The x-axis records the m/z value. The left y-axis displays the running
182
spectrum number originating from subsequent spectra loading. The peak intensity is
183
expressed by a Gray scale scheme code. The color bar and the right y-axis indicate the
184
relation between the color a peak is displayed with and the peak intensity in arbitrary units.
185
The compared species are indicated on the left.
186
Genome project history
187
Lactocococcus garvieae is repeatedly used in the industry of dairy products manufactured
188
from raw milk. Moreover, the M14 strain has shown bactericidal effects against several
189
bacteria (unpublished data) which indicate the eventual role played by L. garvieae in the
190
composition of the gut microbiota. The sequencing of this strain is part of a study of the
191
human digestive flora aiming at describing the force balance that shapes its composition
192
including the antibacterial potency. Indeed, L. garvieae strain M14 was the 46th genome from
193
the genus Lactococcus and the 16th genome of L. garvieae sp. The EMBL accession number
194
is CCXC01000001-CCXC01000013 and consists of 13 contigs without gaps including four contigs
195
that have been assigned to four plasmids (Table 2).
196
Genome properties
197
The draft genome of L. garvieae M14 consists of 13 contigs of sizes ranging between 889 and
198
1,512,971 bp. The genome of M14 is composed of a single linear chromosome (2,188,835
199
bp; 37.69 % G+C content) and four plasmids ranging in size from 42,306 to 1,095 bp,
AC C
EP
TE D
M AN U
SC
RI PT
176
ACCEPTED MANUSCRIPT including one circular plasmid (Fig.3, Table 3 and Table 4). The chromosome contains 91
201
predicted RNA including 5 rRNA (one 16S, one 23S and three 5S), 45 tRNA, 1 tmRNA and
202
40 miscellaneous RNA and 2,214 protein coding genes which represent 1,934,957 (88.40 % of
203
the genome). A total of 1,515 genes (68.42 %) were assigned a putative function (by Clusters
204
of Orthologous Groups COGs) [44,45]. We found that 23.63 % of the genes encode for
205
information storage and processing (J, A, K, L and B categories), 21.11 % were involved in
206
cellular processes and signaling (D, V, T, M, N, U, O and X categories), 40.29 % participate
207
in metabolism (C, G, E, F, H, I, P and Q categories) and 14.97 % were poorly characterized
208
(R and S categories). The distribution of genes into COGs functional categories is presented
209
in the figure 4.
210
Genome comparison
211
When comparing L. garvieae M14 with nine Lactobacillus species and two Lactococcus
212
strains that have similar 16S rRNA sequences, we found that the genome sequence of L.
213
garvieae M14 is smaller than those of Lactobacillus plantarum WCFS1, Lactobacillus
214
rhamnosus GG, Lactococcus lactis subsp. cremoris SK11 and Lactococcus lactis subsp. lactis
215
Il1403 (3.35, 3.01, 2.60 and 2.37 MB respectively), but larger than those of Lactobacillus
216
salivarius UCC118, Lactobacillus fermentum IFO 3956, Lactobacillus ruminis ATCC 27782,
217
L. garvieae Lg2, L.garvieae YT-3, and Lactobacillus sakei subsp. sakei 23K (2.13, 2.10, 2.07,
218
1.96, 1.95 and 1.88 MB, respectively) (Table 5). The G+C content of L. garvieae M14 is
219
smaller than those of L. fermentum IFO 3956, L. rhamnosus GG, L. plantarum WCFS1, L.
220
ruminis ATCC 27782, L. sakei subsp. sakei 23K, L. garvieae YT-3 and L. garvieae Lg2
221
(51.47, 46.69, 44.42, 43.47, 41.26, 38.83, 38.76 % respectively), but larger than those of L.
222
lactis subsp. cremoris SK11, L. lactis subsp. lactis Il1403 and L. salivarius UCC118 (35.78,
223
35.33 and 33.04, and % respectively) (Table 5). The gene content of L. garvieae M14 is
224
smaller than those of L. plantarum WCFS1, L. rhamnosus GG and L. lactis subsp. cremoris
AC C
EP
TE D
M AN U
SC
RI PT
200
ACCEPTED MANUSCRIPT SK11 and L. lactis subsp. lactis Il1403 (3,063, 2,944, 2,504 and 2,277 respectively), but
226
larger than those of L. salivarius UCC118, L. sakei subsp. sakei 23K, L. ruminis ATCC 27782
227
and L. fermentum IFO 3956 (2,014, 1,885, 1,862 and 1,843 respectively) (Table 5). The
228
proportion of gene count (in percentage) related to each COG categories was similar among
229
the studied strains of L. garvieae. However, the distribution of genes into COG categories was
230
not entirely similar in all the compared genomes (Fig. 4). L. garvieae M14 have an important
231
number of genes participating in carbohydrate transport and metabolism (175 genes), yet less
232
important than those of L. plantarum WCFS1 and L. rhamnosus GG (267 and 263 genes
233
respectively). L. garvieae M14 and L. rhamnosus GG, have the highest number of genes
234
involved in defense mechanisms (51 and 63 genes, respectively), compared to the other
235
analyzed genomes (37 genes in average). Unlike L. garvieae YT-3, L. garvieae strain M14
236
possess plasmids which sequences were closely related to L. garvieae strain 21881 plasmid
237
pGL5, L. garvieae strain IPLA31405 plasmid plG42, L. lactis plasmid pSRQ900 and L. lactis
238
strain MJC15 plasmid pCD4 sequences. The genes of the plasmids encode for a type IV
239
secretory pathway and for proteins with hypothetical functions.
240
Energy Metabolism and transporters
241
The CDSs annotated by the COG database revealed as much as 10.5% of the genomes
242
corresponding to genes involved in carbohydrate transport and metabolism. Like all obligately
243
homofermentative strains, L. garvieae M14 was found to possess the fructose-bisphosphate
244
aldolase (EC 4.1.2.13) in its genome, which is a key enzyme of the glycolysis pathway,
245
whereas it lacks the phosphoketolase enzyme (EC 4.1.2.9) of the pentose phosphate pathway,
246
present only in heterofermentative bacteria genomes. All the genes required for the
247
degradation of the glucose to pyruvate are present in the genome, as well as the lactate
248
dehydrogenase gene that allow the conversion of pyruvate into lactic acid. Several enzymes
249
acting on pyruvate, including α-acetolactate synthase, pyruvate-formate lyase, lactate
AC C
EP
TE D
M AN U
SC
RI PT
225
ACCEPTED MANUSCRIPT dehydrogenase and pyruvate oxidase, have also been identified in strain M14 genome.
251
Further, genome examination indicates that some enzymes needed for the full citrate cycle
252
and for the gluconeogenesis are missing. PTS systems for fructose, galactose, mannose,
253
maltose, lactose, sucrose, trehalose, mannitol and cellobiose were present in the genome,
254
while PTS systems for xylose, gluconate and ribose were absent. Based on its metabolic
255
profile, L. garvieae M14 produces primarily lactic acid from hexoses, using the glycolysis.
256
This ability of homolactic fermentation is very useful for industrial applications.
257
Defense mechanism
258
We identified in the genome of L. garvieae M14 several phage-related genes and 51 proteins
259
involved in defense features including a glycopeptide antibiotics resistance protein and two
260
bacteriocins that are localized in the chromosome. The first bacteriocin has a length of 64
261
amino acids and the use of the BUR database allowed us to identify a very similar bacteriocin
262
sequence in the genomes of L. garvieae strains YT-3, Lg2 and TRF1. These sequences have
263
been previously annotated as encoding for hypothetical proteins in the genomes of L. garvieae
264
strains YT-3 and Lg2. The second bacteriocin has a length of 184 aa and corresponds to a
265
colicin V, also found in the genome of the strain Lg2. Garviecin L1-5 was the first bacteriocin
266
detected in a Lactococcus garvieae strain [54]. It has been shown to inhibit the growth of
267
species relatively closely related to the producer, but also of the human pathogen Listeria
268
monocytogenes. Altogether, the production of bacteriocins gives L. garvieae strains a
269
competitive advantage within their environment, allowing them to directly inhibit other
270
bacteria and proliferate.
271
Conclusions
272
We have presented the phenotypic, phylogenetic and genomic analyses that allowed the
273
description of Lactococcus garvieae strain M14. This new bacterial strain was isolated from
AC C
EP
TE D
M AN U
SC
RI PT
250
ACCEPTED MANUSCRIPT fermented milk sample of an Algerian cow is essential in the manufacture of diary products
275
and seems to play a major role as a biopreservative in food industry and as a potential
276
probiotic if the infectious risk is assuredly ruled out.
277
Acknowledgements
278
The authors thank Xegen Company (www.xegen.fr) for automating the genomic annotation
279
process. VM was supported by a Chairs of Excellence program from the CNRS (Centre
280
National de Recherche Scientifique). This study was funded by the Mediterranee Infection
281
Foundation.
282
Competing interests
283
The authors declare that they have no competing interests.
AC C
EP
TE D
M AN U
SC
RI PT
274
ACCEPTED MANUSCRIPT References 1. El-Baradei G, Delacroix-Buchet A, Ogier JC (2007) Biodiversity of bacterial ecosystems in traditional Egyptian Domiati cheese. Appl Environ Microbiol 73 (4):1248-1255. doi:10.1128/AEM.01667-06
RI PT
2. El-Baradei G, Delacroix-Buchet A, Ogier JC (2008) Bacterial biodiversity of traditional Zabady fermented milk. International Journal of Food Microbiology 121 (3):295-301. doi:http://dx.doi.org/10.1016/j.ijfoodmicro.2007.11.014
SC
3. Aquilanti L, Santarelli S, Silvestri G, Osimani A, Petruzzelli A, Clementi F (2007) The microbial ecology of a typical Italian salami during its natural fermentation. International of
Food
Microbiology
120
(1–2):136-145.
M AN U
Journal
doi:http://dx.doi.org/10.1016/j.ijfoodmicro.2007.06.010
4. Schoster A, Kokotovic B, Permin A, Pedersen PD, Bello FD, Guardabassi L (2013) In vitro inhibition of Clostridium difficile and Clostridium perfringens by commercial probiotic
TE D
strains. Anaerobe 20:36-41. doi:http://dx.doi.org/10.1016/j.anaerobe.2013.02.006 5. Monteagudo-Mera A, Rodríguez-Aparicio L, Rúa J, Martínez-Blanco H, Navasa N, GarcíaArmesto MR, Ferrero MÁ (2012) In vitro evaluation of physiological probiotic properties of
EP
different lactic acid bacteria strains of dairy and human origin. Journal of Functional Foods 4 (2):531-541. doi:http://dx.doi.org/10.1016/j.jff.2012.02.014
AC C
6. Bernbom N, Licht TR, Brogren C-H, Jelle B, Johansen AH, Badiola I, Vogensen FK, Nørrung B (2006) Effects of Lactococcus lactis on Composition of Intestinal Microbiota: Role
of
Nisin.
Applied
and
Environmental
Microbiology
72
(1):239-244.
doi:10.1128/AEM.72.1.239-244.2006 7. Collins MD, Farrow JAE, Phillips BA, Kandler O (1983) Streptococcus garvieae sp. nov. and
Streptococcus
plantarum
sp.
doi:doi:10.1099/00221287-129-11-3427
nov.
J
Gen
Microbiol
129
(11):3427-3431.
ACCEPTED MANUSCRIPT 8. Vendrell D, Balcázar JL, Ruiz-Zarzuela I, de Blas I, Gironés O, Múzquiz JL (2006) Lactococcus garvieae in fish: A review. Comparative Immunology, Microbiology and Infectious Diseases 29 (4):177-198. doi:http://dx.doi.org/10.1016/j.cimid.2006.06.003 9. Fihman V, Raskine L, Barrou Z, Kiffel C, Riahi J, Bercot B, Sanson-Le Pors MJ (2006)
RI PT
Lactococcus garvieae endocarditis: identification by 16S rRNA and sodA sequence analysis. J Infect 52 (1):e3-6. doi:10.1016/j.jinf.2005.04.021
10. Chan JF, Woo PC, Teng JL, Lau SK, Leung SS, Tam FC, Yuen KY (2011) Primary
SC
infective spondylodiscitis caused by Lactococcus garvieae and a review of human L. garvieae infections. Infection 39 (3):259-264. doi:10.1007/s15010-011-0094-8
M AN U
11. Wang CY, Shie HS, Chen SC, Huang JP, Hsieh IC, Wen MS, Lin FC, Wu D (2007) Lactococcus garvieae infections in humans: possible association with aquaculture outbreaks. Int J Clin Pract 61 (1):68-73. doi:10.1111/j.1742-1241.2006.00855.x 12. Vinh DC, Nichol KA, Rand F, Embil JM (2006) Native-valve bacterial endocarditis by
Lactococcus
garvieae.
Diagn
Microbiol
Infect
Dis
56
(1):91-94.
TE D
caused
doi:10.1016/j.diagmicrobio.2006.02.010
13. Wilbring M, Alexiou K, Reichenspurner H, Matschke K, Tugtekin SM (2011)
EP
Lactococcus garvieae causing zoonotic prosthetic valve endocarditis. Clin Res Cardiol 100 (6):545-546. doi:10.1007/s00392-011-0286-3
AC C
14. Li W-K, Chen Y-S, Wann S-R, Liu Y-C, Tsai H-C (2008) Lactococcus garvieae Endocarditis with Initial Presentation of Acute Cerebral Infarction in a Healthy Immunocompetent
Man.
Internal
Medicine
47
(12):1143-1146.
doi:10.2169/internalmedicine.47.0795 15. Yiu KH, Siu CW, To KK, Jim MH, Lee KL, Lau CP, Tse HF (2007) A rare cause of infective
endocarditis;
Lactococcus
doi:10.1016/j.ijcard.2005.11.092
garvieae.
Int
J
Cardiol
114
(2):286-287.
ACCEPTED MANUSCRIPT 16. Aguado-Urda M, Lopez-Campos GH, Fernandez-Garayzabal JF, Martin-Sanchez F, Gibello A, Dominguez L, Blanco MM (2010) Analysis of the genome content of Lactococcus garvieae by genomic interspecies microarray hybridization. BMC Microbiol 10:79. doi:10.1186/1471-2180-10-79
RI PT
17. Morita H, Toh H, Oshima K, Yoshizaki M, Kawanishi M, Nakaya K, Suzuki T,
Miyauchi E, Ishii Y, Tanabe S, Murakami M, Hattori M (2011) Complete genome sequence and comparative analysis of the fish pathogen Lactococcus garvieae. PLoS One. 6(8):e23184.
SC
doi: 10.1371/journal.pone.0023184.
18. Miyauchi E, Toh H, Nakano A, Tanabe S, Morita H (2012) Comparative Genomic
M AN U
Analysis of Lactococcus garvieae Strains Isolated from Different Sources Reveals Candidate Virulence Genes. Int J Microbiol 2012:728276. doi: 10.1155/2012/728276. 19. Kawanishi M, Yoshida T, Yagashiro S, Kijima M, Yagyu K, Nakai T, Murakami M, Morita H, Suzuki S (2006) Differences between Lactococcus garvieae isolated from the genus
TE D
Seriola in Japan and those isolated from other animals (trout, terrestrial animals from Europe) with regard to pathogenicity, phage susceptibility and genetic characterization. J Appl Microbiol 101 (2):496-504. doi:10.1111/j.1365-2672.2006.02951.x
EP
20. Zadi-Karam H, Karame N-E (2005) Lactic acid bacteria from Camel milk. Paper presented at the Renc. Rech. Ruminants,
AC C
21. Fox GE, Magrum LJ, Balch WE, Wolfe RS, Woese CR (1977) Classification of methanogenic bacteria by 16S ribosomal RNA characterization. Proceedings of the National Academy of Sciences 74 (10):4537-4541 22. Lagesen K, Hallin P, Rodland EA, Staerfeldt HH, Rognes T, Ussery DW (2007) RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res 35 (9):3100-3108. doi:10.1093/nar/gkm160
ACCEPTED MANUSCRIPT 23. Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S (2011) MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and
maximum
parsimony
methods.
Mol
Biol
Evol
28
(10):2731-2739.
doi:10.1093/molbev/msr121
RI PT
24. Larkin MA, Blackshields G, Brown NP, Chenna R, McGettigan PA, McWilliam H, Valentin F, Wallace IM, Wilm A, Lopez R, Thompson JD, Gibson TJ, Higgins DG (2007) Clustal
W
and
Clustal
X
version
2.0.
Bioinformatics
(21):2947-2948.
SC
doi:10.1093/bioinformatics/btm404
23
25. Seng P, Drancourt M, Gouriet F, La Scola B, Fournier PE, Rolain JM, Raoult D (2009)
M AN U
Ongoing revolution in bacteriology: routine identification of bacteria by matrix-assisted laser desorption ionization time-of-flight mass spectrometry. Clin Infect Dis 49 (4):543-551. doi:10.1086/600885
26. Lohse M, Bolger AM, Nagel A, Fernie AR, Lunn JE, Stitt M, Usadel B (2012) RobiNA: a
TE D
user-friendly, integrated software solution for RNA-Seq-based transcriptomics. Nucleic Acids Res 40 (Web Server issue):W622-627. doi:10.1093/nar/gks540 27. Nurk S, Bankevich A, Antipov D, Gurevich AA, Korobeynikov A, Lapidus A, Prjibelski
EP
AD, Pyshkin A, Sirotkin A, Sirotkin Y, Stepanauskas R, Clingenpeel SR, Woyke T, McLean JS, Lasken R, Tesler G, Alekseyev MA, Pevzner PA (2013) Assembling single-cell genomes
AC C
and mini-metagenomes from chimeric MDA products. J Comput Biol 20 (10):714-737. doi:10.1089/cmb.2013.0084 28. Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, Lesin VM, Nikolenko SI, Pham S, Prjibelski AD, Pyshkin AV, Sirotkin AV, Vyahhi N, Tesler G, Alekseyev MA, Pevzner PA (2012) SPAdes: a new genome assembly algorithm and its applications
to
single-cell
doi:10.1089/cmb.2012.0021
sequencing.
J
Comput
Biol
19
(5):455-477.
ACCEPTED MANUSCRIPT 29. Boetzer M, Henkel CV, Jansen HJ, Butler D, Pirovano W (2011) Scaffolding preassembled
contigs
using
SSPACE.
Bioinformatics
27
(4):578-579.
doi:10.1093/bioinformatics/btq683 30. Gao S, Sung WK, Nagarajan N (2011) Opera: reconstructing optimal genomic scaffolds high-throughput
paired-end
sequences.
J
Comput
doi:10.1089/cmb.2011.0170
Biol
18
(11):1681-1691.
RI PT
with
31. Boetzer M, Pirovano W (2012) Toward almost closed genomes with GapFiller. Genome
SC
Biol 13 (6):R56. doi:10.1186/gb-2012-13-6-r56
32. Laslett D, Canback B (2004) ARAGORN, a program to detect tRNA genes and tmRNA
M AN U
genes in nucleotide sequences. Nucleic Acids Res 32 (1):11-16. doi:10.1093/nar/gkh152 33. Griffiths-Jones S, Bateman A, Marshall M, Khanna A, Eddy SR (2003) Rfam: an RNA family database. Nucleic Acids Res 31 (1):439-441
34. Punta M, Coggill PC, Eberhardt RY, Mistry J, Tate J, Boursnell C, Pang N, Forslund K,
TE D
Ceric G, Clements J, Heger A, Holm L, Sonnhammer EL, Eddy SR, Bateman A, Finn RD (2012) The Pfam protein families database. Nucleic Acids Res 40 (Database issue):D290-301. doi:10.1093/nar/gkr1065
EP
35. Nawrocki EP, Kolbe DL, Eddy SR (2009) Infernal 1.0: inference of RNA alignments. Bioinformatics 25 (10):1335-1337. doi:10.1093/bioinformatics/btp157
AC C
36. Hyatt D, Chen GL, Locascio PF, Land ML, Larimer FW, Hauser LJ (2010) Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics 11:119. doi:10.1186/1471-2105-11-119 37. Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, Madden TL (2009)
BLAST+:
architecture
doi:10.1186/1471-2105-10-421
and
applications.
BMC
Bioinformatics
10:421.
ACCEPTED MANUSCRIPT 38. Eddy SR (2011) Accelerated Profile HMM Searches. PLoS Comput Biol 7 (10):e1002195. doi:10.1371/journal.pcbi.1002195 39. UniProt C (2011) Ongoing and future developments at the Universal Protein Resource. Nucleic Acids Res 39 (Database issue):D214-219. doi:10.1093/nar/gkq1020
RI PT
40. Kanehisa M, Goto S (2000) KEGG: Kyoto Encyclopedia of Genes and Genomes. Nucleic Acids Research 28 (1):27-30. doi:10.1093/nar/28.1.27
41. Moriya Y, Itoh M, Okuda S, Yoshizawa AC, Kanehisa M (2007) KAAS: an automatic
issue):W182-185. doi:10.1093/nar/gkm321
SC
genome annotation and pathway reconstruction server. Nucleic Acids Res 35 (Web Server
M AN U
42. Tatusov RL, Galperin MY, Natale DA, Koonin EV (2000) The COG database: a tool for genome-scale analysis of protein functions and evolution. Nucleic Acids Research 28 (1):3336. doi:10.1093/nar/28.1.33
43. Tatusov RL, Koonin EV, Lipman DJ (1997) A genomic perspective on protein families.
TE D
Science 278 (5338):631-637. doi:10.1126/science.278.5338.631
44. Drissi F, Buffet S, Raoult D, Merhej V (2015) Common occurrence of antibacterial agents in human intestinal microbiota. Front Microbiol 6:441. doi:10.3389/fmicb.2015.00441
EP
45. Stackebrandt E, Ebers J (2006) Taxonomic parameters revisited: tarnished gold standards. Microbiol Today 33:152-155
AC C
46. Tindall BJ, Rossello-Mora R, Busse HJ, Ludwig W, Kampfer P (2010) Notes on the characterization of prokaryote strains for taxonomic purposes. Int J Syst Evol Microbiol 60 (Pt 1):249-266. doi:10.1099/ijs.0.016949-0 47. Schleifer KH, Kraus J DC, Kilpper-Bälz R, Collins MD, W. Fischer ( 1985) Transfer of Streptococcus lactis and Related Streptococci to the Genus Lactococcus gen.nov. Syst Appl Microbiol 6 (2):183-195. doi: 10.1016/S0723-2020(85)80052-7.
ACCEPTED MANUSCRIPT 48. Morita H, Toh H, Oshima K, Murakami M, Taylor TD, Igimi S, Hattori M (2009) Complete genome sequence of the probiotic Lactobacillus rhamnosus ATCC 53103. J Bacteriol 191 (24):7630-7631. doi:10.1128/JB.01287-09 49. Chaillou S, Champomier-Verges MC, Cornet M, Crutz-Le Coq AM, Dudez AM, Martin
RI PT
V, Beaufils S, Darbon-Rongere E, Bossy R, Loux V, Zagorec M (2005) The complete genome sequence of the meat-borne lactic acid bacterium Lactobacillus sakei 23K. Nat Biotechnol 23 (12):1527-1533. doi:10.1038/nbt1160
SC
50. Kleerebezem M, Boekhorst J, van Kranenburg R, Molenaar D, Kuipers OP, Leer R, Tarchini R, Peters SA, Sandbrink HM, Fiers MW, Stiekema W, Lankhorst RM, Bron PA,
M AN U
Hoffer SM, Groot MN, Kerkhoven R, de Vries M, Ursing B, de Vos WM, Siezen RJ (2003) Complete genome sequence of Lactobacillus plantarum WCFS1. Proc Natl Acad Sci U S A 100 (4):1990-1995. doi:10.1073/pnas.0337704100
51. Morita H, Toh H, Fukuda S, Horikawa H, Oshima K, Suzuki T, Murakami M, Hisamatsu
TE D
S, Kato Y, Takizawa T, Fukuoka H, Yoshimura T, Itoh K, O'Sullivan DJ, McKay LL, Ohno H, Kikuchi J, Masaoka T, Hattori M (2008) Comparative genome analysis of Lactobacillus reuteri and Lactobacillus fermentum reveal a genomic island for reuterin and cobalamin
EP
production. DNA Res 15 (3):151-161. doi:10.1093/dnares/dsn009 52. Claesson MJ, Li Y, Leahy S, Canchaya C, van Pijkeren JP, Cerdeno-Tarraga AM, Parkhill
AC C
J, Flynn S, O'Sullivan GC, Collins JK, Higgins D, Shanahan F, Fitzgerald GF, van Sinderen D, O'Toole PW (2006) Multireplicon genome architecture of Lactobacillus salivarius. Proc Natl Acad Sci U S A 103 (17):6718-6723. doi:10.1073/pnas.0511060103 53. Forde BM, Neville BA, O'Donnell MM, Riboulet-Bisson E, Claesson MJ, Coghlan A, Ross RP, O'Toole PW (2011) Genome sequences and comparative genomics of two Lactobacillus ruminis strains from the bovine and human intestinal tracts. Microb Cell Fact 10 Suppl 1:S13. doi:10.1186/1475-2859-10-S1-S13
ACCEPTED MANUSCRIPT 54. Villani F, Aponte M, Blaiotta G, Mauriello G, Pepe O, Moschetti G (2001) Detection and characterization of a bacteriocin, garviecin L1-5, produced by Lactococcus garvieae isolated from raw cow’s milk. Journal of Applied Microbiology 90 (3):430-439. doi:10.1046/j.1365-
AC C
EP
TE D
M AN U
SC
RI PT
2672.2001.01261.
ACCEPTED MANUSCRIPT Figure legends Fig. 1. Phylogenetic tree highlighting the position of L. garvieae strain M14 (LK985397) relative to other phylogenetically close strains within the genus Lactococcus and Lactobacillus, with Lysinibacillus sphaericus as the outgroup. Numbers at the nodes are
RI PT
percentages of bootstrap values obtained by repeating 500 times the analysis to generate a majority consensus tree. The scale bar represents 2% nucleotide sequence divergence.
Fig. 2. a. Reference mass spectrum from L. garvieae strain M14 and b. gel view comparing L.
SC
garvieae M14 spectra with Lactococcus species (Lactococcus lactis ssp lactis, Lactococcus lactis ssp cremoris and two strains of Lactococcus garvieae) and with Lactobacillus species
M AN U
(Lactobacillus sakei ssp sakei, Lactobacillus salivarus, Lactobacillus ruminis, Lactobacillus rhamnosus, Lactobacillus plantarum and Lactobacillus fermentum). Fig. 3. Circular representation of the Lactococcus garvieae strain M14 genome. Circles from the center to the outside: GC screw (green/purple), GC content (grey/black), scaffolds in grey
TE D
arrows, tRNA (blue), rRNA (red), tmRNA (light blue), miscellaneous RNA (brown) on forward strand, genes on forward strand colored according to categories determined in the cluster of orthologous group (COG), genes on reverse strand colored by COGs categories,
strand.
EP
tRNA (blue), rRNA (red), tmRNA (light blue), miscellaneous RNA (brown) on reverse
AC C
Fig. 4. Functional classification of the genes encoded by Lactococcus garvieae M14 and its comparison with Lactococcus garvieae YT-3 and Lactococcus garvieae Lg2. The proteincoding sequences are classified according to the COGs categories.
ACCEPTED MANUSCRIPT
L. garvieae YT-3 DSM 6783
L. lactis subsp. lactis DSM 20481
Gram stain
Positive
Positive
Cell shape
Characteristics
L. rhamnosus DSM 20021
L. sakei subsp. sakei DSM 20017
L. plantarum DSM 20174
L. fermentum DSM 20052
L salivarius DSM 20555
Positive
L. lactis subsp. cremoris DSM 20069 Positive
Positive
Positive
Positive
Positive
Positive
Rod
Rod
Rod
Rod
Rod
-
-
+
-
-
Cocci
Rod
Rod
-
-
-
-
D-ribose
+
+
+
D-glucose
+
+
+
D-Galactose
+
+
+
D-mannitol
+
+
+
Amygdalin
+
+
+
Arbutin
+
+
+
Esculin
+
+
D-Cellobiose
+
+
D-lactose
+
-
Inulin
-
-
D-Melezitose
-
-
Glycogen
-
-
+
α-Glucosidase
+
N-acetyl-βglucosaminidase α-mannosidase
-
+
+
-
+
+
+
+
+
+
+
+
+
+
+
+
-
+
-
+
-
+
-
+
-
-
-
+
-
+
-
-
+
+
+
+
+
+
+
+
+
+
-
+
+
-
+
-
+
-
+
+
+
+
-
-
-
+
-
-
-
-
+
-
+
-
-
-
-
-
-
-
-
-
M AN U
-
-
EP
-
Acid phosphatase
+
+
-
-
+
-
-
-
-
+
-
+
+
-
-
-
+
+
+
-
+
-
-
+
+
-
+
-
+
-
-
+
-
-
+
-
-
-
+
+
-
AC C
Alkaline phosphatase
+
+
TE D
Cocci
L-Arabinose
SC
L. garvieae M14 DSM 29394
RI PT
Table 1: Differential phenotypic characteristics between L. garvieae strain M14 and phylogenetically close species. The following type strains of related lactic acid bacteria species were obtained from the DSMZ culture collection (Braunschweig, Germany). All of these strains were cultured according to the recommendations given in the DSMZ catalogue of strains.
1
ACCEPTED MANUSCRIPT Table 1. Project information.
AC C
EP
TE D
M AN U
SC
MIGS-13
Property Term Finishing quality High quality draft Libraries used 1 mate-paired Sequencing platforms MiSeq Illumina Sequencing coverage 110 Assemblers Spades Gene calling method Prodigal Genbank ID CCXC01000001-CCXC01000013 Genbank Date of Release Oct, 2014 Source material identifier M14 Project relevance Potential probiotic and biopreservative
RI PT
MIGS ID MIGS-31 MIGS-28 MIGS-29 MIGS-31.2 MIGS-30 MIGS-32
2
ACCEPTED MANUSCRIPT Table 3. Nucleotide content and gene count levels of the genome. Value 2,188,835 1,934,957 827,233 2,264 5 45 1 40 2,214 1,651 1,515
*
% of Total* 100.00 88.40 37.79 100.00 0.21 1.90 0.04 1.69 97.79 72.92 68.42
RI PT
Attribute Genome size (bp) DNA coding region (bp) DNA G+C content (bp) Total genes rRNA tRNA tmRNA miscRNA Protein-coding genes Genes with function prediction Genes assigned to COGs
AC C
EP
TE D
M AN U
SC
The total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome.
3
ACCEPTED MANUSCRIPT Table 2. Nucleotide content and gene count levels of the plasmids. Value 64,869 (42,306; 16,485; 4,983; 1,095)# 55,608 22,256 75 0 75
Size (bp) DNA coding region (bp) DNA G+C content (bp) Total genes rRNA Protein-coding genes Genes with function prediction Genes assigned to COGs
20 10
100 85.72 34.31 100 0 100
35.09 17.54
SC
*
% of Total*
RI PT
Attribute
AC C
EP
TE D
M AN U
The total is based on either the size of the plasmids in base pairs or the total number of protein coding genes in the annotated sequences. # The sizes of the different plasmids are represented between brackets.
4
ACCEPTED MANUSCRIPT Table 5. General genome features. G+C Percent 44.42
Gene Content 3,063
Lactobacillus rhamnosus GG
3.01
46.69
2,944
Lactococcus lactis subsp. cremoris SK11
2.60
35.78
2,504
Lactococcus lactis subsp. lactis Il1403
2.37
35.33
2,277
Lactococcus garvieae M14
2.19
37.79
2,214
Lactobacillus salivarius UCC118
2.13
33.04
2,014
Lactobacillus fermentum IFO
2.10
51.47
1,843
Lactobacillus ruminis ATCC 27782
2.07
43.47
1,862
Lactococcus garvieae Lg2
1.96
38.76
1,968
Lactococcus garvieae YT-3
1.95
38.83
1,947
Lactobacillus sakei subsp. sakei 23K
1.88
41.26
1,885
SC
RI PT
Size (Mb) 3.35
AC C
EP
TE D
M AN U
Strains Lactobacillus plantarum WCFS1
5
RI PT
ACCEPTED MANUSCRIPT
24
42 96
22
60
0.01
AC C
EP
TE D
M AN U
SC
Lactococcus garvieae M14 (LK985397.1) Lactococcus garvieae 21881 (AFCC01) 53 Lactococcus garvieae 8831 (AFCD01) Lactococcus garvieae Lg-ilsanpaik-gs201105 (JPUJ01) Lactococcus garvieae TB25 (AGQX01) 53 Lactococcus garvieae Lg2 (AB267897.1) Lactococcus garvieae UNIUD074 (AFHF01) Lactococcus garvieae YT-3 (NC_015930.1) 54 Lactococcus garvieae NBRC 100934 (BBJW01) 63 Lactococcus garvieae IPLA 31405 (AKFO01) 100 Lactococcus garvieae Tac2 (AMFE01) Lactococcus garvieae LG9 (AGQY01) Lactococcus garvieae I113 (AMFD01) 98 Lactococcus garvieae DCC43 (AMQS01) Lactococcus lactis subsp. lactis Il1403 str. IL1403 (NR_103918) 100 Lactococcus lactis subsp. cremoris SK11 str. SK11 (NR_074949) Lactobacillus ruminis ATCC 27782 (NR_102839) Lactobacillus salivarius UCC118 (NR_074589) Lactobacillus fermentum IFO 3956 (NR_075033) Lactobacillus plantarum WCFS1 (NR_075041) Lactobacillus rhamnosus GG (NR_102778) Lactobacillus sakei subsp. sakei 23K (NR_075042) Lysinibacillus sphaericus (EU855791.1)
AC C
EP
TE D
M AN U
SC
RI PT
ACCEPTED MANUSCRIPT
AC C
EP
TE D
M AN U
SC
RI PT
ACCEPTED MANUSCRIPT
ACCEPTED MANUSCRIPT
J:Translation, ribosomal structure and biogenesis 200
S:Function unknown
K:Transcription
180
140
Q:Secondary metabolites biosynthesis, transport and catabolism
120 100 80
P:Inorganic ion transport and metabolism
M AN U
60
L:Replication, recombination and repair
RI PT
160
SC
R:General function prediction only
D:Cell cycle control, cell division, chromosome partitioning
V:Defense mechanisms
40 20
I:Lipid transport and metabolism
T:Signal transduction mechanisms
TE D
0
M:Cell wall/membrane biogenesis
EP
H:Coenzyme transport and metabolism
N:Cell motility
AC C
F:Nucleotide transport and metabolism
U:Intracellular trafficking and secretion, and vesicular transport
E:Amino acid transport and metabolism
G:Carbohydrate transport and metabolism
C:Energy production and conversion
Lactococcus garviae M14
O:Posttranslational modification, protein turnover, chaperones X:Mobilome: prophages, transposons
Lactococcus garvieae YT-3
Lactococcus garvieae Lg2