Complete genome sequence and description of Lactococcus garvieae M14 isolated from Algerian fermented milk

Complete genome sequence and description of Lactococcus garvieae M14 isolated from Algerian fermented milk

Accepted Manuscript Complete genome sequence and description of Lactococcus garvieae M14 isolated from Algerian fermented milk Meryem Moumene, Fatima ...

1000KB Sizes 7 Downloads 38 Views

Accepted Manuscript Complete genome sequence and description of Lactococcus garvieae M14 isolated from Algerian fermented milk Meryem Moumene, Fatima Drissi, Olivier Croce, Bilel Djebbari, Catherine Robert, Emmanouil Angelakis, Djamel Eddine Benouareth, Didier Raoult, Dr. Vicky Merhej PII:

S2052-2975(16)00011-1

DOI:

10.1016/j.nmni.2016.01.009

Reference:

NMNI 122

To appear in:

New Microbes and New Infections

Received Date: 20 November 2015 Revised Date:

28 December 2015

Accepted Date: 14 January 2016

Please cite this article as: Moumene M, Drissi F, Croce O, Djebbari B, Robert C, Angelakis E, Benouareth DE, Raoult D, Merhej V, Complete genome sequence and description of Lactococcus garvieae M14 isolated from Algerian fermented milk, New Microbes and New Infections (2016), doi: 10.1016/j.nmni.2016.01.009. This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

ACCEPTED MANUSCRIPT Complete genome sequence and description of Lactococcus garvieae M14 isolated from Algerian fermented milk Meryem Moumene1,2*, Fatima Drissi1*, Olivier Croce1, Bilel Djebbari1, Catherine Robert1, Emmanouil Angelakis1, Djamel Eddine Benouareth2, Didier Raoult1,3, Vicky Merhej1# Aix Marseille Université, URMITE, UM63, CNRS 7278, IRD 198, INSERM 1095, Institut

RI PT

1

Hospitalo-Universitaire, Marseille, France. University 08 may 1945, Guelma, Algeria

3

Special Infectious Agents Unit, King Fahd Medical Research Center, King Abdulaziz University, Jeddah, Saudi

Arabia * Both authors contributed equally to this study.

Dr. Vicky Merhej

M AN U

# Corresponding author :

SC

2

Aix-Marseille Université, Faculté de Médecine, 27 Bd Jean Moulin, 13385 Marseille cedex 5 Email: [email protected] Phone: (33) 491 38 55 17

TE D

Fax: (33) 491 83 03 90

Keywords: Lactococcus garvieae, fermented milk, lactic acid bacteria, genome, sequencing, taxonomy.

AC C

EP

Running head: Lactococcus garvieae genome.

1

ACCEPTED MANUSCRIPT Complete genome sequence and description of Lactococcus garvieae M14 isolated from Algerian fermented milk

Keywords: Lactococcus garvieae, fermented milk, lactic acid bacteria, genome, sequencing.

AC C

EP

TE D

M AN U

SC

RI PT

Running head: Lactococcus garvieae genome.

ACCEPTED MANUSCRIPT Abstract

2

We describe using a polyphasic approach that combines proteomic by MALDI-TOF spectra

3

analysis, genomic data and phenotypic characterization the features of Lactococcus garvieae

4

strain M14 newly isolated from the fermented milk (Raib) of an Algerian cow. The 2,188,835

5

bp containing genome sequence displays a metabolic capacity to form acid fermentation that

6

is very useful for industrial applications and encodes for two bacteriocins responsible for its

7

eventual bioprotective properties.

AC C

EP

TE D

M AN U

SC

RI PT

1

ACCEPTED MANUSCRIPT 8

Introduction Lactococcus garvieae is a LAB that has commonly been used in manufacture of many

10

varieties of cheese and other fermented milk products [1,2] and meat products [3]. The ability

11

of some LAB to produce proteins with bactericidal properties called bacteriocins, led to their

12

potential utilization as biopreservatives in food industry against a range of pathogenic

13

bacteria, including Listeria sp. and Clostridium sp. [4,5]. Also, because of bacteriocins, some

14

LAB are thought to act as bioprotective organisms that play a major role in the composition of

15

the microbiota [6]. First isolated from cases of bovine mastitis [7], L. garvieae has gained

16

recognition as a potential pathogen of various fish species, including rainbow trout [8].

17

Moreover, L. garvieae has been involved in many clinical cases including infective

18

endocarditis associated with septicemia, spondylodiscitis, urinary and skin infections [9-15].

19

Genomic interspecies microarray hybridization and pan-genome comparative analysis of the

20

pathogenic strain Lg2 and the non-virulent strain YT-3 identified genes encoding for host

21

colonization and the development of pathogenesis including a capsule gene cluster and genes

22

encoding for a myosin-cross-reactive antigen and haemolysin [16,17]. The phenotypic

23

diversity of L. garvieae seems to be related to the specific animal host they colonize [18,19].

24

Altogether, the analysis of L. garvieae genomes isolated from a dairy product and its

25

comparison with pathogenic isolates seems to be necessary to unveil the global genetic

26

variations that may justify, at least in part, the observed phenotypic differences.

SC

M AN U

TE D

EP

AC C

27

RI PT

9

In this work, we have isolated and identified a new strain of Lactococus garvieae from

28

the fermented milk product of an Algerian cow, as a part of the study of LAB and revealed

29

their antibacterial activity. A total of 47 different bacterial species including Lactococcus and

30

Streptococcus spp. as identified by API 50CHL and mass spectrometry, were isolated from

31

the milk product specimens (unpublished data). Lactococcus spp. was the only species with

32

antibacterial effect as shown by the agar well-diffusion assay. We accomplished deep studies

ACCEPTED MANUSCRIPT including phenotypic, polyphasic taxonomy, genotypic and phylogenetic analyses. We display

34

below a set of features for the identified Lactococcus garvieae strain M14 together with the

35

description of the complete genome sequence and annotation. We present a comparative

36

genome analysis of all available sequences of closely related species to L. garvieae. This

37

integrative approach dealing with a large set of data helds the potential to explore the

38

relationship between the presence of L. garvieae in dairy and food safety in order to recognize

39

and prevent a potential hazard for the consumers.

SC

40

RI PT

33

Materials and methods

42

Samples Collection

43

The raw cow milk samples were collected from a farm localized at Guelma (in the Est of

44

Algeria) in sterile glass bottles and transported in an isotherm container to the laboratory. The

45

200 ml of milk samples was allowed to clot at room temperature to promote the development

46

of endogenous lactic flora, according to Zadi-Karam and Karam [20].

47

Strain isolation

48

M17 agar media (SIGMA ALDRICH) prepared in accordance with manufacturer’s

49

instructions was used for the growth of LAB strain screened in this investigation. Thus, 0.1 ml

50

of fermented milk sample was plated onto M17 to promote the bacterial flora cultivation in

51

aerobic conditions by incubation at 37°C for 24 hours. Lactococcus garvieae strain M14 was

52

then isolated and stored at -20°C until its further use.

53

Phenotypic, genotypic and phylogenetic analyses

AC C

EP

TE D

M AN U

41

ACCEPTED MANUSCRIPT We used 16S ribosomal RNA gene sequencing (16S rRNA) to provide genus and species

55

identification for the isolate [21] and taxonomic classification of strain M14. A comparison of

56

nucleotides query sequences against the nucleotide sequence database was also performed

57

using Basic Local Alignment Search Tool (BLAST). The 16S rRNA sequences of all

58

Lactococcus strains with draft genome, were searched within the scaffolds using RNAmmer

59

server [22]. The phylogenetic tree based on almost complete 16S rRNA gene sequences with

60

a minimum length of 1,517 nucleotides were reconstructed using distance matrix (neighbor-

61

joining) within the MEGA software version 5 [23]. Sequences were aligned using Clustal X

62

version 2.0. [24].

63

Different temperatures (room temperature, 28°C, 37°C and 45°C) were tested to determine

64

growth temperature range and the optimal growth temperature for the strain. Growth of the

65

strain was tested on 5% sheep blood agar under anaerobic and microaerophilic conditions

66

using the GENbag anaer and GENbag microaerosystems, respectively (BioMérieux, Marcy

67

l’Etoile, France), and in aerobic conditions, with or without 5% CO2.

68

L. garvieae strain M14 morphology was characterized by transmission electron spectroscopy

69

(TEM) using a Morgani 268D (Philips) spectrometer with operating voltage of 60kV.

70

Polyphasic taxonomic identification by manufactured kits is widely used, the API 50CH

71

carbohydrate fermentation strips and API ZYM enzyme test system (bioMérieux, France)

72

were used to determine the biochemical profile of strain M14 according to the manufacturer’s

73

instructions.

74

Protein

75

desorption/ionization time-of-flight (MALDI-TOF) technique as previously described [25],

76

using a Microflex spectrometer (Bruker Daltonics, Leipzig, Germany). Twelve distinct

77

deposits were made for strain M14 from 12 isolated colonies.

AC C

EP

TE D

M AN U

SC

RI PT

54

Mass

spectroscopy

analysis

was

carried

out

by

Matrix-assisted

laser-

ACCEPTED MANUSCRIPT The twelve collected spectra for M14 were imported into the MALDI BioTyper software

79

(version 2.0, Bruker) and analyzed by standard pattern matching (with default parameter

80

settings) against 7.289 bacterial spectra including 26 spectra from three L. garvieae species,

81

used as reference data, in the BioTyper database. Interpretation of scores as established by the

82

manufacturer Bruker Daltonics was as follows: a score > 1.9 to a validly published species

83

enabled the identification at the species level, a score > 1.7 but < 1.9 enabled the

84

identification at the genus level; and a score < 1.7 did not enable any identification.

85

Growth conditions and genomic DNA preparation

86

L. garvieae was grown on 5 % sheep blood-enriched Columbia agar (BioMerieux, France) at

87

37°C in aerobic atmosphere. Bacteria grown on three Petri dishes were harvested and

88

resuspended in 4x100µL of TE buffer. Then, 200 µL of this suspension was diluted in 1ml TE

89

buffer for lysis treatment. After a lysozyme incubation of 30 minutes at 37°C the lysis was

90

performed with lauryl sarcosyl by 1% final and RNAse A treatment at 50µG/µL final

91

concentration during 1hr at 37°C, followed by an overnight Proteinase K incubation at 37°C.

92

Extracted DNA was then purified using three successive phenol-chloroform extractions and

93

ethanol precipitation at -20°C overnight. After centrifugation, the DNA was resuspended in

94

70 µL TE buffer. The yield and concentration were measured by the Quant-it Picogreen kit

95

(Invitrogen) on the Genios-Tecan fluorometer at 113ng/µl.

96

Genome sequencing and assembly

97

Genomic DNA of L. garvieae was sequenced using MiSeq Technology sequencer (Illumina

98

Inc, San Diego, CA, USA) with the mate pair strategy. The gDNA was barcoded in order to

99

be mixed with 11 others projects with the Nextera Mate Pair sample prep kit (Illumina). The

100

DNAg was quantified by a Qubit assay with the high sensitivity kit (Life technologies,

101

Carlsbad, CA, USA) to 40.8/µl .The mate pair library was prepared with 1µg of genomic

AC C

EP

TE D

M AN U

SC

RI PT

78

ACCEPTED MANUSCRIPT DNA using the Nextera mate pair Illumina guide. The genomic DNA sample was

103

simultaneously fragmented and tagged with a mate pair junction adapter. The profile of the

104

fragmentation was validated on an Agilent 2100 BioAnalyzer (Agilent Technologies Inc,

105

Santa Clara, CA, USA) with a DNA 7500 labchip. The DNA fragments exhibited a mean of

106

4.5 kb (4,486 pb). No size selection was performed and only 308.9 ng of tagmented fragments

107

were circularized. The circularized DNA was mechanically sheared to small fragments with

108

an optimal size of 652 bp on the Covaris device S2 in microtubes (Covaris, Woburn, MA,

109

USA).The library profile was visualized on a High Sensitivity Bioanalyzer LabChip (Agilent

110

Technologies Inc, Santa Clara, CA, USA. The libraries were normalized at 2 nM and pooled.

111

After a denaturation step and dilution at 10 pM, the pool of libraries was loaded onto the

112

reagent cartridge and then onto the instrument along with the flow cell. Automated cluster

113

generation and sequencing run were performed in a single 42-hours run in a 2x251-bp.Total

114

information of 8.6 Gb was obtained from a 950 K/mm2 cluster density with a cluster passing

115

quality control filters of 93.12% (18,182,000 clusters). Within this run, the index

116

representation for L. garvieae was determined to 9.73%. The whole-genome shotgun strategy

117

using Illumina sequencing technology gave 3,294,808 reads. llumina reads where trimmed

118

using Trimmomatic [26], then assembled thought Spades software [27,28]. Contigs obtained

119

were combined together by SSpace [29] and Opera software v1.2 [30] helped by GapFiller

120

V1.10 [31] to reduce the set. Some manual refinements using CLC Genomics v7 software

121

(CLC bio, Aarhus, Denmark) and homemade tools in Python improved the genome.

122

Genome annotation

123

Non-coding genes and miscellaneous features were predicted using RNAmmer [22],

124

ARAGORN [32], Rfam [33], PFAM [34], Infernal [35]. Coding DNA sequences (CDSs)

125

were predicted using Prodigal [36] and functional annotation was achieved using BLAST+

126

[37] and HMMER3 [38] against the UniProtKB database [39]. The KEGG orthology [40]

AC C

EP

TE D

M AN U

SC

RI PT

102

ACCEPTED MANUSCRIPT annotation was done by KAAS online server [41] using the SBH method. The pathways in

128

which each gene might be involved were derived from the best KO hit. Gene functions were

129

assigned by Clusters of Orthologous Groups (COGs) [42,43]. The bacteriocin database of the

130

URMITE, named BUR database, was used to annotate genes encoding for bacteriocins [44].

131

Results and discussion

132

Organism classification and features

133

The sequenced 16S rRNA gene of strain M14 was deposited in the GenBank database under

134

the accession number LK985397. A BLAST search against the nucleotide database showed

135

that strain M14 was most closely related to Lactococcus species with gene sequence identity

136

value of 99.7 % with Lactococcus garvieae YT-3. Based on the comparative sequence

137

analysis of 16S rRNA gene sequence, strain M14 belongs to the already described species L.

138

garvieae [45,46].

139

Neighbour-joining phylogenetic tree, based on almost-complete 16S rRNA gene sequences of

140

Lactococcus garvieae M14 strain and closely related species is shown in Fig. 1. Sequences of

141

the closest species including Lactococcus garvieae strains and strains of Lactococcus lactis

142

subsp. lactis (NR_103918), Lactococcus lactis subsp. cremoris (NR_074949), Lactobacillus

143

rhamnosus GG (NR_102778), Lactobacillus sakei subsp. sakei 23K (NR_075042),

144

Lactobacillus plantarum WCFS1 (NR_075041), Lactobacillus fermentum IFO 3956

145

(NR_075033), Lactobacillus salivarius UCC118 (NR_074589) and Lactobacillus ruminis

146

ATCC 27782 (NR_102839) were aligned with the 16S rRNA gene sequence of the strain

147

M14. The strain M14 formed together with Lactococcus garvieae strains and the L. lactis

148

species a common lineage supported by a high bootstrap value of 99% (Fig. 1).

149

Growth occurred for all the tested temperatures, but the optimal growth was observed at 37°C.

150

The colonies were 1- 6 mm in diameter and moderately opaque in facultative anaerobic

AC C

EP

TE D

M AN U

SC

RI PT

127

ACCEPTED MANUSCRIPT conditions on enriched-blood Columbia agar (BioMerieux) and appeared whitish in color at

152

28°C. The motility test was negative. Gram staining showed Gram positive non sporulating

153

cocci. Cells grown on agar exhibit length ranging from 0.79 to 0.93 µm (mean 0.86 µm) and a

154

diameter ranging from 0.59 to 0.63 µm (mean 0.61 µm) as determined by negative staining

155

TEM micrograph.

156

Strain M14 did neither have catalase nor oxidase activity. Using the API 50CH system a

157

positive reaction was observed for D-ribose, D-glucose, D-fructose, D-mannose, D-galactose,

158

D-mannitol, amygdalin, arbutin, N-acetylglucosamine, esculin, salicine, D-cellobiose, D-

159

lactose, D-saccharose and D-trehalose. Negative reactions were observed for glycerol,

160

erythriol,

161

xylopyranoside,

162

mannopyranoside, Metyl-αD-glucopyranoside, D-maltose, D-melibiose, inulin, D-melezitose,

163

D-raffinose, starch, glycogen, xylitol, Gentiobiose, D-turanose, D-lyxose, D-tagatose, D-

164

fucose, L-fucose, D-arabitol and gluconate. Using the API ZYM system, negative reactions

165

were observed for alkaline phosphatase, cystine arylamidase (proteases), trypsin, alpha-

166

galactosidase (melibiase), beta-glucosidase (cellulase), alpha-mannosidase, alpha-fucosidase,

167

positive reactions were observed for esterase, esterase lipase, lipase, leucine and valine

168

arylamidase, alpha-chemotrypsin, acid phosphatase, beta-galactosidase, beta-glucuronidase,

169

alpha- and beta-glucosidase. The urease reaction, nitrate reduction and indole production were

170

negative. When compared to the phylogenetically close species from Lactococcus and

171

Lactobacillus [17,47-53] L. garvieae strain M14 exhibited the phenotypic differences detailed

172

in Table 1. L. garvieae was susceptible to amoxicilline, imipenem, piperacilin, ciprofloxacine,

173

ceftriaxon,

174

rifampicin but resistant to cefoxitin and cotrimoxazol.

175

Extended features descriptions

D-arabinose,

M AN U

SC

RI PT

151

L-arabinose,

L-rhamnose,

L-xylose,

Dulcitol,

Inositol,

D-adonitol,

Methyl-βD-

D-sobitol,

Methyl-αD-

AC C

EP

TE D

L-sorbose,

D-xylose,

erythromycin,

vancomycin,

nitrofurantoin,

nitrofurantoin,

metronidazol,

ACCEPTED MANUSCRIPT MALDI-TOF analysis results of strain M14 showed scores ranging from 2.177 to 2.343 with

177

Lactococcus spp., suggesting that our isolate was a member of Lactococcus species yet not a

178

member of known strains. We incremented our database with the spectrum of strain M14.

179

Spectrum differences with others of phylogenetically close species are shown in Figure 2-a.

180

The gel view (Fig. 2-b) displays the raw spectra of loaded spectrum files arranged in a

181

pseudo-gel like look. The x-axis records the m/z value. The left y-axis displays the running

182

spectrum number originating from subsequent spectra loading. The peak intensity is

183

expressed by a Gray scale scheme code. The color bar and the right y-axis indicate the

184

relation between the color a peak is displayed with and the peak intensity in arbitrary units.

185

The compared species are indicated on the left.

186

Genome project history

187

Lactocococcus garvieae is repeatedly used in the industry of dairy products manufactured

188

from raw milk. Moreover, the M14 strain has shown bactericidal effects against several

189

bacteria (unpublished data) which indicate the eventual role played by L. garvieae in the

190

composition of the gut microbiota. The sequencing of this strain is part of a study of the

191

human digestive flora aiming at describing the force balance that shapes its composition

192

including the antibacterial potency. Indeed, L. garvieae strain M14 was the 46th genome from

193

the genus Lactococcus and the 16th genome of L. garvieae sp. The EMBL accession number

194

is CCXC01000001-CCXC01000013 and consists of 13 contigs without gaps including four contigs

195

that have been assigned to four plasmids (Table 2).

196

Genome properties

197

The draft genome of L. garvieae M14 consists of 13 contigs of sizes ranging between 889 and

198

1,512,971 bp. The genome of M14 is composed of a single linear chromosome (2,188,835

199

bp; 37.69 % G+C content) and four plasmids ranging in size from 42,306 to 1,095 bp,

AC C

EP

TE D

M AN U

SC

RI PT

176

ACCEPTED MANUSCRIPT including one circular plasmid (Fig.3, Table 3 and Table 4). The chromosome contains 91

201

predicted RNA including 5 rRNA (one 16S, one 23S and three 5S), 45 tRNA, 1 tmRNA and

202

40 miscellaneous RNA and 2,214 protein coding genes which represent 1,934,957 (88.40 % of

203

the genome). A total of 1,515 genes (68.42 %) were assigned a putative function (by Clusters

204

of Orthologous Groups COGs) [44,45]. We found that 23.63 % of the genes encode for

205

information storage and processing (J, A, K, L and B categories), 21.11 % were involved in

206

cellular processes and signaling (D, V, T, M, N, U, O and X categories), 40.29 % participate

207

in metabolism (C, G, E, F, H, I, P and Q categories) and 14.97 % were poorly characterized

208

(R and S categories). The distribution of genes into COGs functional categories is presented

209

in the figure 4.

210

Genome comparison

211

When comparing L. garvieae M14 with nine Lactobacillus species and two Lactococcus

212

strains that have similar 16S rRNA sequences, we found that the genome sequence of L.

213

garvieae M14 is smaller than those of Lactobacillus plantarum WCFS1, Lactobacillus

214

rhamnosus GG, Lactococcus lactis subsp. cremoris SK11 and Lactococcus lactis subsp. lactis

215

Il1403 (3.35, 3.01, 2.60 and 2.37 MB respectively), but larger than those of Lactobacillus

216

salivarius UCC118, Lactobacillus fermentum IFO 3956, Lactobacillus ruminis ATCC 27782,

217

L. garvieae Lg2, L.garvieae YT-3, and Lactobacillus sakei subsp. sakei 23K (2.13, 2.10, 2.07,

218

1.96, 1.95 and 1.88 MB, respectively) (Table 5). The G+C content of L. garvieae M14 is

219

smaller than those of L. fermentum IFO 3956, L. rhamnosus GG, L. plantarum WCFS1, L.

220

ruminis ATCC 27782, L. sakei subsp. sakei 23K, L. garvieae YT-3 and L. garvieae Lg2

221

(51.47, 46.69, 44.42, 43.47, 41.26, 38.83, 38.76 % respectively), but larger than those of L.

222

lactis subsp. cremoris SK11, L. lactis subsp. lactis Il1403 and L. salivarius UCC118 (35.78,

223

35.33 and 33.04, and % respectively) (Table 5). The gene content of L. garvieae M14 is

224

smaller than those of L. plantarum WCFS1, L. rhamnosus GG and L. lactis subsp. cremoris

AC C

EP

TE D

M AN U

SC

RI PT

200

ACCEPTED MANUSCRIPT SK11 and L. lactis subsp. lactis Il1403 (3,063, 2,944, 2,504 and 2,277 respectively), but

226

larger than those of L. salivarius UCC118, L. sakei subsp. sakei 23K, L. ruminis ATCC 27782

227

and L. fermentum IFO 3956 (2,014, 1,885, 1,862 and 1,843 respectively) (Table 5). The

228

proportion of gene count (in percentage) related to each COG categories was similar among

229

the studied strains of L. garvieae. However, the distribution of genes into COG categories was

230

not entirely similar in all the compared genomes (Fig. 4). L. garvieae M14 have an important

231

number of genes participating in carbohydrate transport and metabolism (175 genes), yet less

232

important than those of L. plantarum WCFS1 and L. rhamnosus GG (267 and 263 genes

233

respectively). L. garvieae M14 and L. rhamnosus GG, have the highest number of genes

234

involved in defense mechanisms (51 and 63 genes, respectively), compared to the other

235

analyzed genomes (37 genes in average). Unlike L. garvieae YT-3, L. garvieae strain M14

236

possess plasmids which sequences were closely related to L. garvieae strain 21881 plasmid

237

pGL5, L. garvieae strain IPLA31405 plasmid plG42, L. lactis plasmid pSRQ900 and L. lactis

238

strain MJC15 plasmid pCD4 sequences. The genes of the plasmids encode for a type IV

239

secretory pathway and for proteins with hypothetical functions.

240

Energy Metabolism and transporters

241

The CDSs annotated by the COG database revealed as much as 10.5% of the genomes

242

corresponding to genes involved in carbohydrate transport and metabolism. Like all obligately

243

homofermentative strains, L. garvieae M14 was found to possess the fructose-bisphosphate

244

aldolase (EC 4.1.2.13) in its genome, which is a key enzyme of the glycolysis pathway,

245

whereas it lacks the phosphoketolase enzyme (EC 4.1.2.9) of the pentose phosphate pathway,

246

present only in heterofermentative bacteria genomes. All the genes required for the

247

degradation of the glucose to pyruvate are present in the genome, as well as the lactate

248

dehydrogenase gene that allow the conversion of pyruvate into lactic acid. Several enzymes

249

acting on pyruvate, including α-acetolactate synthase, pyruvate-formate lyase, lactate

AC C

EP

TE D

M AN U

SC

RI PT

225

ACCEPTED MANUSCRIPT dehydrogenase and pyruvate oxidase, have also been identified in strain M14 genome.

251

Further, genome examination indicates that some enzymes needed for the full citrate cycle

252

and for the gluconeogenesis are missing. PTS systems for fructose, galactose, mannose,

253

maltose, lactose, sucrose, trehalose, mannitol and cellobiose were present in the genome,

254

while PTS systems for xylose, gluconate and ribose were absent. Based on its metabolic

255

profile, L. garvieae M14 produces primarily lactic acid from hexoses, using the glycolysis.

256

This ability of homolactic fermentation is very useful for industrial applications.

257

Defense mechanism

258

We identified in the genome of L. garvieae M14 several phage-related genes and 51 proteins

259

involved in defense features including a glycopeptide antibiotics resistance protein and two

260

bacteriocins that are localized in the chromosome. The first bacteriocin has a length of 64

261

amino acids and the use of the BUR database allowed us to identify a very similar bacteriocin

262

sequence in the genomes of L. garvieae strains YT-3, Lg2 and TRF1. These sequences have

263

been previously annotated as encoding for hypothetical proteins in the genomes of L. garvieae

264

strains YT-3 and Lg2. The second bacteriocin has a length of 184 aa and corresponds to a

265

colicin V, also found in the genome of the strain Lg2. Garviecin L1-5 was the first bacteriocin

266

detected in a Lactococcus garvieae strain [54]. It has been shown to inhibit the growth of

267

species relatively closely related to the producer, but also of the human pathogen Listeria

268

monocytogenes. Altogether, the production of bacteriocins gives L. garvieae strains a

269

competitive advantage within their environment, allowing them to directly inhibit other

270

bacteria and proliferate.

271

Conclusions

272

We have presented the phenotypic, phylogenetic and genomic analyses that allowed the

273

description of Lactococcus garvieae strain M14. This new bacterial strain was isolated from

AC C

EP

TE D

M AN U

SC

RI PT

250

ACCEPTED MANUSCRIPT fermented milk sample of an Algerian cow is essential in the manufacture of diary products

275

and seems to play a major role as a biopreservative in food industry and as a potential

276

probiotic if the infectious risk is assuredly ruled out.

277

Acknowledgements

278

The authors thank Xegen Company (www.xegen.fr) for automating the genomic annotation

279

process. VM was supported by a Chairs of Excellence program from the CNRS (Centre

280

National de Recherche Scientifique). This study was funded by the Mediterranee Infection

281

Foundation.

282

Competing interests

283

The authors declare that they have no competing interests.

AC C

EP

TE D

M AN U

SC

RI PT

274

ACCEPTED MANUSCRIPT References 1. El-Baradei G, Delacroix-Buchet A, Ogier JC (2007) Biodiversity of bacterial ecosystems in traditional Egyptian Domiati cheese. Appl Environ Microbiol 73 (4):1248-1255. doi:10.1128/AEM.01667-06

RI PT

2. El-Baradei G, Delacroix-Buchet A, Ogier JC (2008) Bacterial biodiversity of traditional Zabady fermented milk. International Journal of Food Microbiology 121 (3):295-301. doi:http://dx.doi.org/10.1016/j.ijfoodmicro.2007.11.014

SC

3. Aquilanti L, Santarelli S, Silvestri G, Osimani A, Petruzzelli A, Clementi F (2007) The microbial ecology of a typical Italian salami during its natural fermentation. International of

Food

Microbiology

120

(1–2):136-145.

M AN U

Journal

doi:http://dx.doi.org/10.1016/j.ijfoodmicro.2007.06.010

4. Schoster A, Kokotovic B, Permin A, Pedersen PD, Bello FD, Guardabassi L (2013) In vitro inhibition of Clostridium difficile and Clostridium perfringens by commercial probiotic

TE D

strains. Anaerobe 20:36-41. doi:http://dx.doi.org/10.1016/j.anaerobe.2013.02.006 5. Monteagudo-Mera A, Rodríguez-Aparicio L, Rúa J, Martínez-Blanco H, Navasa N, GarcíaArmesto MR, Ferrero MÁ (2012) In vitro evaluation of physiological probiotic properties of

EP

different lactic acid bacteria strains of dairy and human origin. Journal of Functional Foods 4 (2):531-541. doi:http://dx.doi.org/10.1016/j.jff.2012.02.014

AC C

6. Bernbom N, Licht TR, Brogren C-H, Jelle B, Johansen AH, Badiola I, Vogensen FK, Nørrung B (2006) Effects of Lactococcus lactis on Composition of Intestinal Microbiota: Role

of

Nisin.

Applied

and

Environmental

Microbiology

72

(1):239-244.

doi:10.1128/AEM.72.1.239-244.2006 7. Collins MD, Farrow JAE, Phillips BA, Kandler O (1983) Streptococcus garvieae sp. nov. and

Streptococcus

plantarum

sp.

doi:doi:10.1099/00221287-129-11-3427

nov.

J

Gen

Microbiol

129

(11):3427-3431.

ACCEPTED MANUSCRIPT 8. Vendrell D, Balcázar JL, Ruiz-Zarzuela I, de Blas I, Gironés O, Múzquiz JL (2006) Lactococcus garvieae in fish: A review. Comparative Immunology, Microbiology and Infectious Diseases 29 (4):177-198. doi:http://dx.doi.org/10.1016/j.cimid.2006.06.003 9. Fihman V, Raskine L, Barrou Z, Kiffel C, Riahi J, Bercot B, Sanson-Le Pors MJ (2006)

RI PT

Lactococcus garvieae endocarditis: identification by 16S rRNA and sodA sequence analysis. J Infect 52 (1):e3-6. doi:10.1016/j.jinf.2005.04.021

10. Chan JF, Woo PC, Teng JL, Lau SK, Leung SS, Tam FC, Yuen KY (2011) Primary

SC

infective spondylodiscitis caused by Lactococcus garvieae and a review of human L. garvieae infections. Infection 39 (3):259-264. doi:10.1007/s15010-011-0094-8

M AN U

11. Wang CY, Shie HS, Chen SC, Huang JP, Hsieh IC, Wen MS, Lin FC, Wu D (2007) Lactococcus garvieae infections in humans: possible association with aquaculture outbreaks. Int J Clin Pract 61 (1):68-73. doi:10.1111/j.1742-1241.2006.00855.x 12. Vinh DC, Nichol KA, Rand F, Embil JM (2006) Native-valve bacterial endocarditis by

Lactococcus

garvieae.

Diagn

Microbiol

Infect

Dis

56

(1):91-94.

TE D

caused

doi:10.1016/j.diagmicrobio.2006.02.010

13. Wilbring M, Alexiou K, Reichenspurner H, Matschke K, Tugtekin SM (2011)

EP

Lactococcus garvieae causing zoonotic prosthetic valve endocarditis. Clin Res Cardiol 100 (6):545-546. doi:10.1007/s00392-011-0286-3

AC C

14. Li W-K, Chen Y-S, Wann S-R, Liu Y-C, Tsai H-C (2008) Lactococcus garvieae Endocarditis with Initial Presentation of Acute Cerebral Infarction in a Healthy Immunocompetent

Man.

Internal

Medicine

47

(12):1143-1146.

doi:10.2169/internalmedicine.47.0795 15. Yiu KH, Siu CW, To KK, Jim MH, Lee KL, Lau CP, Tse HF (2007) A rare cause of infective

endocarditis;

Lactococcus

doi:10.1016/j.ijcard.2005.11.092

garvieae.

Int

J

Cardiol

114

(2):286-287.

ACCEPTED MANUSCRIPT 16. Aguado-Urda M, Lopez-Campos GH, Fernandez-Garayzabal JF, Martin-Sanchez F, Gibello A, Dominguez L, Blanco MM (2010) Analysis of the genome content of Lactococcus garvieae by genomic interspecies microarray hybridization. BMC Microbiol 10:79. doi:10.1186/1471-2180-10-79

RI PT

17. Morita H, Toh H, Oshima K, Yoshizaki M, Kawanishi M, Nakaya K, Suzuki T,

Miyauchi E, Ishii Y, Tanabe S, Murakami M, Hattori M (2011) Complete genome sequence and comparative analysis of the fish pathogen Lactococcus garvieae. PLoS One. 6(8):e23184.

SC

doi: 10.1371/journal.pone.0023184.

18. Miyauchi E, Toh H, Nakano A, Tanabe S, Morita H (2012) Comparative Genomic

M AN U

Analysis of Lactococcus garvieae Strains Isolated from Different Sources Reveals Candidate Virulence Genes. Int J Microbiol 2012:728276. doi: 10.1155/2012/728276. 19. Kawanishi M, Yoshida T, Yagashiro S, Kijima M, Yagyu K, Nakai T, Murakami M, Morita H, Suzuki S (2006) Differences between Lactococcus garvieae isolated from the genus

TE D

Seriola in Japan and those isolated from other animals (trout, terrestrial animals from Europe) with regard to pathogenicity, phage susceptibility and genetic characterization. J Appl Microbiol 101 (2):496-504. doi:10.1111/j.1365-2672.2006.02951.x

EP

20. Zadi-Karam H, Karame N-E (2005) Lactic acid bacteria from Camel milk. Paper presented at the Renc. Rech. Ruminants,

AC C

21. Fox GE, Magrum LJ, Balch WE, Wolfe RS, Woese CR (1977) Classification of methanogenic bacteria by 16S ribosomal RNA characterization. Proceedings of the National Academy of Sciences 74 (10):4537-4541 22. Lagesen K, Hallin P, Rodland EA, Staerfeldt HH, Rognes T, Ussery DW (2007) RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res 35 (9):3100-3108. doi:10.1093/nar/gkm160

ACCEPTED MANUSCRIPT 23. Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S (2011) MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and

maximum

parsimony

methods.

Mol

Biol

Evol

28

(10):2731-2739.

doi:10.1093/molbev/msr121

RI PT

24. Larkin MA, Blackshields G, Brown NP, Chenna R, McGettigan PA, McWilliam H, Valentin F, Wallace IM, Wilm A, Lopez R, Thompson JD, Gibson TJ, Higgins DG (2007) Clustal

W

and

Clustal

X

version

2.0.

Bioinformatics

(21):2947-2948.

SC

doi:10.1093/bioinformatics/btm404

23

25. Seng P, Drancourt M, Gouriet F, La Scola B, Fournier PE, Rolain JM, Raoult D (2009)

M AN U

Ongoing revolution in bacteriology: routine identification of bacteria by matrix-assisted laser desorption ionization time-of-flight mass spectrometry. Clin Infect Dis 49 (4):543-551. doi:10.1086/600885

26. Lohse M, Bolger AM, Nagel A, Fernie AR, Lunn JE, Stitt M, Usadel B (2012) RobiNA: a

TE D

user-friendly, integrated software solution for RNA-Seq-based transcriptomics. Nucleic Acids Res 40 (Web Server issue):W622-627. doi:10.1093/nar/gks540 27. Nurk S, Bankevich A, Antipov D, Gurevich AA, Korobeynikov A, Lapidus A, Prjibelski

EP

AD, Pyshkin A, Sirotkin A, Sirotkin Y, Stepanauskas R, Clingenpeel SR, Woyke T, McLean JS, Lasken R, Tesler G, Alekseyev MA, Pevzner PA (2013) Assembling single-cell genomes

AC C

and mini-metagenomes from chimeric MDA products. J Comput Biol 20 (10):714-737. doi:10.1089/cmb.2013.0084 28. Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, Lesin VM, Nikolenko SI, Pham S, Prjibelski AD, Pyshkin AV, Sirotkin AV, Vyahhi N, Tesler G, Alekseyev MA, Pevzner PA (2012) SPAdes: a new genome assembly algorithm and its applications

to

single-cell

doi:10.1089/cmb.2012.0021

sequencing.

J

Comput

Biol

19

(5):455-477.

ACCEPTED MANUSCRIPT 29. Boetzer M, Henkel CV, Jansen HJ, Butler D, Pirovano W (2011) Scaffolding preassembled

contigs

using

SSPACE.

Bioinformatics

27

(4):578-579.

doi:10.1093/bioinformatics/btq683 30. Gao S, Sung WK, Nagarajan N (2011) Opera: reconstructing optimal genomic scaffolds high-throughput

paired-end

sequences.

J

Comput

doi:10.1089/cmb.2011.0170

Biol

18

(11):1681-1691.

RI PT

with

31. Boetzer M, Pirovano W (2012) Toward almost closed genomes with GapFiller. Genome

SC

Biol 13 (6):R56. doi:10.1186/gb-2012-13-6-r56

32. Laslett D, Canback B (2004) ARAGORN, a program to detect tRNA genes and tmRNA

M AN U

genes in nucleotide sequences. Nucleic Acids Res 32 (1):11-16. doi:10.1093/nar/gkh152 33. Griffiths-Jones S, Bateman A, Marshall M, Khanna A, Eddy SR (2003) Rfam: an RNA family database. Nucleic Acids Res 31 (1):439-441

34. Punta M, Coggill PC, Eberhardt RY, Mistry J, Tate J, Boursnell C, Pang N, Forslund K,

TE D

Ceric G, Clements J, Heger A, Holm L, Sonnhammer EL, Eddy SR, Bateman A, Finn RD (2012) The Pfam protein families database. Nucleic Acids Res 40 (Database issue):D290-301. doi:10.1093/nar/gkr1065

EP

35. Nawrocki EP, Kolbe DL, Eddy SR (2009) Infernal 1.0: inference of RNA alignments. Bioinformatics 25 (10):1335-1337. doi:10.1093/bioinformatics/btp157

AC C

36. Hyatt D, Chen GL, Locascio PF, Land ML, Larimer FW, Hauser LJ (2010) Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics 11:119. doi:10.1186/1471-2105-11-119 37. Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, Madden TL (2009)

BLAST+:

architecture

doi:10.1186/1471-2105-10-421

and

applications.

BMC

Bioinformatics

10:421.

ACCEPTED MANUSCRIPT 38. Eddy SR (2011) Accelerated Profile HMM Searches. PLoS Comput Biol 7 (10):e1002195. doi:10.1371/journal.pcbi.1002195 39. UniProt C (2011) Ongoing and future developments at the Universal Protein Resource. Nucleic Acids Res 39 (Database issue):D214-219. doi:10.1093/nar/gkq1020

RI PT

40. Kanehisa M, Goto S (2000) KEGG: Kyoto Encyclopedia of Genes and Genomes. Nucleic Acids Research 28 (1):27-30. doi:10.1093/nar/28.1.27

41. Moriya Y, Itoh M, Okuda S, Yoshizawa AC, Kanehisa M (2007) KAAS: an automatic

issue):W182-185. doi:10.1093/nar/gkm321

SC

genome annotation and pathway reconstruction server. Nucleic Acids Res 35 (Web Server

M AN U

42. Tatusov RL, Galperin MY, Natale DA, Koonin EV (2000) The COG database: a tool for genome-scale analysis of protein functions and evolution. Nucleic Acids Research 28 (1):3336. doi:10.1093/nar/28.1.33

43. Tatusov RL, Koonin EV, Lipman DJ (1997) A genomic perspective on protein families.

TE D

Science 278 (5338):631-637. doi:10.1126/science.278.5338.631

44. Drissi F, Buffet S, Raoult D, Merhej V (2015) Common occurrence of antibacterial agents in human intestinal microbiota. Front Microbiol 6:441. doi:10.3389/fmicb.2015.00441

EP

45. Stackebrandt E, Ebers J (2006) Taxonomic parameters revisited: tarnished gold standards. Microbiol Today 33:152-155

AC C

46. Tindall BJ, Rossello-Mora R, Busse HJ, Ludwig W, Kampfer P (2010) Notes on the characterization of prokaryote strains for taxonomic purposes. Int J Syst Evol Microbiol 60 (Pt 1):249-266. doi:10.1099/ijs.0.016949-0 47. Schleifer KH, Kraus J DC, Kilpper-Bälz R, Collins MD, W. Fischer ( 1985) Transfer of Streptococcus lactis and Related Streptococci to the Genus Lactococcus gen.nov. Syst Appl Microbiol 6 (2):183-195. doi: 10.1016/S0723-2020(85)80052-7.

ACCEPTED MANUSCRIPT 48. Morita H, Toh H, Oshima K, Murakami M, Taylor TD, Igimi S, Hattori M (2009) Complete genome sequence of the probiotic Lactobacillus rhamnosus ATCC 53103. J Bacteriol 191 (24):7630-7631. doi:10.1128/JB.01287-09 49. Chaillou S, Champomier-Verges MC, Cornet M, Crutz-Le Coq AM, Dudez AM, Martin

RI PT

V, Beaufils S, Darbon-Rongere E, Bossy R, Loux V, Zagorec M (2005) The complete genome sequence of the meat-borne lactic acid bacterium Lactobacillus sakei 23K. Nat Biotechnol 23 (12):1527-1533. doi:10.1038/nbt1160

SC

50. Kleerebezem M, Boekhorst J, van Kranenburg R, Molenaar D, Kuipers OP, Leer R, Tarchini R, Peters SA, Sandbrink HM, Fiers MW, Stiekema W, Lankhorst RM, Bron PA,

M AN U

Hoffer SM, Groot MN, Kerkhoven R, de Vries M, Ursing B, de Vos WM, Siezen RJ (2003) Complete genome sequence of Lactobacillus plantarum WCFS1. Proc Natl Acad Sci U S A 100 (4):1990-1995. doi:10.1073/pnas.0337704100

51. Morita H, Toh H, Fukuda S, Horikawa H, Oshima K, Suzuki T, Murakami M, Hisamatsu

TE D

S, Kato Y, Takizawa T, Fukuoka H, Yoshimura T, Itoh K, O'Sullivan DJ, McKay LL, Ohno H, Kikuchi J, Masaoka T, Hattori M (2008) Comparative genome analysis of Lactobacillus reuteri and Lactobacillus fermentum reveal a genomic island for reuterin and cobalamin

EP

production. DNA Res 15 (3):151-161. doi:10.1093/dnares/dsn009 52. Claesson MJ, Li Y, Leahy S, Canchaya C, van Pijkeren JP, Cerdeno-Tarraga AM, Parkhill

AC C

J, Flynn S, O'Sullivan GC, Collins JK, Higgins D, Shanahan F, Fitzgerald GF, van Sinderen D, O'Toole PW (2006) Multireplicon genome architecture of Lactobacillus salivarius. Proc Natl Acad Sci U S A 103 (17):6718-6723. doi:10.1073/pnas.0511060103 53. Forde BM, Neville BA, O'Donnell MM, Riboulet-Bisson E, Claesson MJ, Coghlan A, Ross RP, O'Toole PW (2011) Genome sequences and comparative genomics of two Lactobacillus ruminis strains from the bovine and human intestinal tracts. Microb Cell Fact 10 Suppl 1:S13. doi:10.1186/1475-2859-10-S1-S13

ACCEPTED MANUSCRIPT 54. Villani F, Aponte M, Blaiotta G, Mauriello G, Pepe O, Moschetti G (2001) Detection and characterization of a bacteriocin, garviecin L1-5, produced by Lactococcus garvieae isolated from raw cow’s milk. Journal of Applied Microbiology 90 (3):430-439. doi:10.1046/j.1365-

AC C

EP

TE D

M AN U

SC

RI PT

2672.2001.01261.

ACCEPTED MANUSCRIPT Figure legends Fig. 1. Phylogenetic tree highlighting the position of L. garvieae strain M14 (LK985397) relative to other phylogenetically close strains within the genus Lactococcus and Lactobacillus, with Lysinibacillus sphaericus as the outgroup. Numbers at the nodes are

RI PT

percentages of bootstrap values obtained by repeating 500 times the analysis to generate a majority consensus tree. The scale bar represents 2% nucleotide sequence divergence.

Fig. 2. a. Reference mass spectrum from L. garvieae strain M14 and b. gel view comparing L.

SC

garvieae M14 spectra with Lactococcus species (Lactococcus lactis ssp lactis, Lactococcus lactis ssp cremoris and two strains of Lactococcus garvieae) and with Lactobacillus species

M AN U

(Lactobacillus sakei ssp sakei, Lactobacillus salivarus, Lactobacillus ruminis, Lactobacillus rhamnosus, Lactobacillus plantarum and Lactobacillus fermentum). Fig. 3. Circular representation of the Lactococcus garvieae strain M14 genome. Circles from the center to the outside: GC screw (green/purple), GC content (grey/black), scaffolds in grey

TE D

arrows, tRNA (blue), rRNA (red), tmRNA (light blue), miscellaneous RNA (brown) on forward strand, genes on forward strand colored according to categories determined in the cluster of orthologous group (COG), genes on reverse strand colored by COGs categories,

strand.

EP

tRNA (blue), rRNA (red), tmRNA (light blue), miscellaneous RNA (brown) on reverse

AC C

Fig. 4. Functional classification of the genes encoded by Lactococcus garvieae M14 and its comparison with Lactococcus garvieae YT-3 and Lactococcus garvieae Lg2. The proteincoding sequences are classified according to the COGs categories.

ACCEPTED MANUSCRIPT

L. garvieae YT-3 DSM 6783

L. lactis subsp. lactis DSM 20481

Gram stain

Positive

Positive

Cell shape

Characteristics

L. rhamnosus DSM 20021

L. sakei subsp. sakei DSM 20017

L. plantarum DSM 20174

L. fermentum DSM 20052

L salivarius DSM 20555

Positive

L. lactis subsp. cremoris DSM 20069 Positive

Positive

Positive

Positive

Positive

Positive

Rod

Rod

Rod

Rod

Rod

-

-

+

-

-

Cocci

Rod

Rod

-

-

-

-

D-ribose

+

+

+

D-glucose

+

+

+

D-Galactose

+

+

+

D-mannitol

+

+

+

Amygdalin

+

+

+

Arbutin

+

+

+

Esculin

+

+

D-Cellobiose

+

+

D-lactose

+

-

Inulin

-

-

D-Melezitose

-

-

Glycogen

-

-

+

α-Glucosidase

+

N-acetyl-βglucosaminidase α-mannosidase

-

+

+

-

+

+

+

+

+

+

+

+

+

+

+

+

-

+

-

+

-

+

-

+

-

-

-

+

-

+

-

-

+

+

+

+

+

+

+

+

+

+

-

+

+

-

+

-

+

-

+

+

+

+

-

-

-

+

-

-

-

-

+

-

+

-

-

-

-

-

-

-

-

-

M AN U

-

-

EP

-

Acid phosphatase

+

+

-

-

+

-

-

-

-

+

-

+

+

-

-

-

+

+

+

-

+

-

-

+

+

-

+

-

+

-

-

+

-

-

+

-

-

-

+

+

-

AC C

Alkaline phosphatase

+

+

TE D

Cocci

L-Arabinose

SC

L. garvieae M14 DSM 29394

RI PT

Table 1: Differential phenotypic characteristics between L. garvieae strain M14 and phylogenetically close species. The following type strains of related lactic acid bacteria species were obtained from the DSMZ culture collection (Braunschweig, Germany). All of these strains were cultured according to the recommendations given in the DSMZ catalogue of strains.

1

ACCEPTED MANUSCRIPT Table 1. Project information.

AC C

EP

TE D

M AN U

SC

MIGS-13

Property Term Finishing quality High quality draft Libraries used 1 mate-paired Sequencing platforms MiSeq Illumina Sequencing coverage 110 Assemblers Spades Gene calling method Prodigal Genbank ID CCXC01000001-CCXC01000013 Genbank Date of Release Oct, 2014 Source material identifier M14 Project relevance Potential probiotic and biopreservative

RI PT

MIGS ID MIGS-31 MIGS-28 MIGS-29 MIGS-31.2 MIGS-30 MIGS-32

2

ACCEPTED MANUSCRIPT Table 3. Nucleotide content and gene count levels of the genome. Value 2,188,835 1,934,957 827,233 2,264 5 45 1 40 2,214 1,651 1,515

*

% of Total* 100.00 88.40 37.79 100.00 0.21 1.90 0.04 1.69 97.79 72.92 68.42

RI PT

Attribute Genome size (bp) DNA coding region (bp) DNA G+C content (bp) Total genes rRNA tRNA tmRNA miscRNA Protein-coding genes Genes with function prediction Genes assigned to COGs

AC C

EP

TE D

M AN U

SC

The total is based on either the size of the genome in base pairs or the total number of protein coding genes in the annotated genome.

3

ACCEPTED MANUSCRIPT Table 2. Nucleotide content and gene count levels of the plasmids. Value 64,869 (42,306; 16,485; 4,983; 1,095)# 55,608 22,256 75 0 75

Size (bp) DNA coding region (bp) DNA G+C content (bp) Total genes rRNA Protein-coding genes Genes with function prediction Genes assigned to COGs

20 10

100 85.72 34.31 100 0 100

35.09 17.54

SC

*

% of Total*

RI PT

Attribute

AC C

EP

TE D

M AN U

The total is based on either the size of the plasmids in base pairs or the total number of protein coding genes in the annotated sequences. # The sizes of the different plasmids are represented between brackets.

4

ACCEPTED MANUSCRIPT Table 5. General genome features. G+C Percent 44.42

Gene Content 3,063

Lactobacillus rhamnosus GG

3.01

46.69

2,944

Lactococcus lactis subsp. cremoris SK11

2.60

35.78

2,504

Lactococcus lactis subsp. lactis Il1403

2.37

35.33

2,277

Lactococcus garvieae M14

2.19

37.79

2,214

Lactobacillus salivarius UCC118

2.13

33.04

2,014

Lactobacillus fermentum IFO

2.10

51.47

1,843

Lactobacillus ruminis ATCC 27782

2.07

43.47

1,862

Lactococcus garvieae Lg2

1.96

38.76

1,968

Lactococcus garvieae YT-3

1.95

38.83

1,947

Lactobacillus sakei subsp. sakei 23K

1.88

41.26

1,885

SC

RI PT

Size (Mb) 3.35

AC C

EP

TE D

M AN U

Strains Lactobacillus plantarum WCFS1

5

RI PT

ACCEPTED MANUSCRIPT

24

42 96

22

60

0.01

AC C

EP

TE D

M AN U

SC

Lactococcus garvieae M14 (LK985397.1) Lactococcus garvieae 21881 (AFCC01) 53 Lactococcus garvieae 8831 (AFCD01) Lactococcus garvieae Lg-ilsanpaik-gs201105 (JPUJ01) Lactococcus garvieae TB25 (AGQX01) 53 Lactococcus garvieae Lg2 (AB267897.1) Lactococcus garvieae UNIUD074 (AFHF01) Lactococcus garvieae YT-3 (NC_015930.1) 54 Lactococcus garvieae NBRC 100934 (BBJW01) 63 Lactococcus garvieae IPLA 31405 (AKFO01) 100 Lactococcus garvieae Tac2 (AMFE01) Lactococcus garvieae LG9 (AGQY01) Lactococcus garvieae I113 (AMFD01) 98 Lactococcus garvieae DCC43 (AMQS01) Lactococcus lactis subsp. lactis Il1403 str. IL1403 (NR_103918) 100 Lactococcus lactis subsp. cremoris SK11 str. SK11 (NR_074949) Lactobacillus ruminis ATCC 27782 (NR_102839) Lactobacillus salivarius UCC118 (NR_074589) Lactobacillus fermentum IFO 3956 (NR_075033) Lactobacillus plantarum WCFS1 (NR_075041) Lactobacillus rhamnosus GG (NR_102778) Lactobacillus sakei subsp. sakei 23K (NR_075042) Lysinibacillus sphaericus (EU855791.1)

AC C

EP

TE D

M AN U

SC

RI PT

ACCEPTED MANUSCRIPT

AC C

EP

TE D

M AN U

SC

RI PT

ACCEPTED MANUSCRIPT

ACCEPTED MANUSCRIPT

J:Translation, ribosomal structure and biogenesis 200

S:Function unknown

K:Transcription

180

140

Q:Secondary metabolites biosynthesis, transport and catabolism

120 100 80

P:Inorganic ion transport and metabolism

M AN U

60

L:Replication, recombination and repair

RI PT

160

SC

R:General function prediction only

D:Cell cycle control, cell division, chromosome partitioning

V:Defense mechanisms

40 20

I:Lipid transport and metabolism

T:Signal transduction mechanisms

TE D

0

M:Cell wall/membrane biogenesis

EP

H:Coenzyme transport and metabolism

N:Cell motility

AC C

F:Nucleotide transport and metabolism

U:Intracellular trafficking and secretion, and vesicular transport

E:Amino acid transport and metabolism

G:Carbohydrate transport and metabolism

C:Energy production and conversion

Lactococcus garviae M14

O:Posttranslational modification, protein turnover, chaperones X:Mobilome: prophages, transposons

Lactococcus garvieae YT-3

Lactococcus garvieae Lg2