Rapid detection and differentiation of human noroviruses using RT-PCR coupled to electrospray ionization mass spectrometry

Accepted Manuscript

Rosalee S. Hellberg, Feng Li, Rangarajan Sampath, Irene J. Yasuda, Heather E. Carolan, Julia M. Wolfe, Michael K. Brown, Richard C. Alexander, Donna M. Williams-Hill, William B. Martin

PII: S0740-0020(14)00123-3
DOI: 10.1016/j.fm.2014.05.017
Reference: YFMIC 2176
To appear in: Food Microbiology
Received Date: 26 September 2013
Revised Date: 25 April 2014
Accepted Date: 25 May 2014

Please cite this article as: Hellberg, R.S., Li, F., Sampath, R., Yasuda, I.J., Carolan, H.E., Wolfe, J.M., Brown, M.K., Alexander, R.C., Williams-Hill, D.M., Martin, W.B., Rapid Detection and Differentiation of Human Noroviruses using RT-PCR coupled to Electrospray Ionization Mass Spectrometry, Food Microbiology (2014), doi: 10.1016/j.fm.2014.05.017. This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

Rosalee S. Hellberg (a,*), Feng Li (b), Rangarajan Sampath (b), Irene J. Yasuda (b), Heather E. Carolan (b), Julia M. Wolfe (c), Michael K. Brown (c), Richard C. Alexander (c), Donna M. Williams-Hill (d), William B. Martin (d)

(a) Chapman University, Schmid College of Science and Technology, Food Science and Nutrition, One University Drive, Orange, CA 92866

(b) Ibis Biosciences, Abbott, 2251 Faraday Ave., Suite 150, Carlsbad, CA 92008

(c) Orange County Public Health Laboratory, 1729 West 17th Street, Santa Ana, CA 92706

(d) U.S. Food and Drug Administration, Office of Regulatory Affairs, Pacific Regional Laboratory Southwest, 19701 Fairchild, Irvine, CA 92612

*Corresponding author: e-mail: [email protected], Ph: 1-714-628-2811

Disclaimer: The views in this publication represent those of the authors. The inclusion of specific trade names or technologies does not imply endorsement by the U.S. Food and Drug Administration nor is criticism implied of similar commercial technologies not mentioned within.

Conflict of Interest

F.L., R.S., I.Y., and H.C. are employees of Ibis Biosciences, Abbott, the commercial manufacturer of the technology described here.

Abstract

The goal of this study was to develop an assay for the detection and differentiation of noroviruses using RT-PCR followed by electrospray ionization mass spectrometry (ESI-MS). Detection of hepatitis A virus was also considered. Thirteen primer pairs were designed for use in this assay and a reference database was created using GenBank sequences and reference norovirus samples. The assay was tested for inclusivity and exclusivity using 160 clinical norovirus samples, 3 samples of hepatitis A virus and 3 other closely related viral strains. Results showed that the assay was able to detect norovirus with a sensitivity of 92% and a specificity of 100%. Norovirus identification at the genogroup level was correct for 98% of samples detected by the assay and for 75% of a subset of samples (n = 32) compared at the genotype level. Identification of norovirus genotypes is expected to improve as more reference samples are added to the database. The assay was also capable of detecting and genotyping hepatitis A virus in all 3 samples tested. Overall, the assay developed here allows for detection and differentiation of noroviruses within one working day and may be used as a tool in surveillance efforts or outbreak investigations.

Keywords

Norovirus; hepatitis A virus; RT-PCR; electrospray ionization mass spectrometry; genotyping

1. Introduction

Noroviruses are the leading cause of foodborne disease among all known pathogens, with an estimated 5.5 million foodborne illnesses annually in the United States (Scallan et al., 2011). These viruses are genetically diverse members of the Caliciviridae family (genus Norovirus) and consist of at least 32 genetic clusters organized into 5 genogroups (GI-GV) (Zheng et al., 2006; Zheng et al., 2010). Genogroups I, II, and IV have been associated with human infection, with the majority of outbreaks due to GII strains, particularly GII.4 variants. The ability to identify noroviruses at the genotype and strain levels is important for tracing the source and spread of outbreaks as well as for routine surveillance. Norovirus surveillance programs have revealed that some norovirus strains are only found within geographically limited regions, while others experience widespread distribution over specific time periods (Siebenga et al., 2009; Vega et al., 2011). Evidence has also been found for the presence of multiple strains associated with a single infection or outbreak, with reports of 3-12% of outbreaks being associated with strains from both GI and GII (Blanton et al., 2006; Green et al., 2001; Hall et al., 2012; Matthews et al., 2012). Foodborne and waterborne outbreaks have been found to be more likely than person-to-person outbreaks to be associated with strains from multiple genogroups (Matthews et al., 2012).

Because noroviruses have not been successfully cultivated in vitro, nucleic acid-based methods are widely used for their detection and differentiation (Hall et al., 2011). Noroviruses have a small, single-stranded RNA genome of ~7.5 kb with three open reading frames: ORF1, ORF2, and ORF3 (Green et al., 2001). ORF1 contains several genes coding for nonstructural proteins, including RNA-dependent RNA polymerase (RdRp); ORF2 contains the gene coding for the major capsid protein (VP1); and ORF3 contains the gene coding for the minor capsid protein (VP2). Reverse-transcriptase polymerase chain reaction (RT-PCR) analysis of regions of the VP1 or RdRp genes is generally used to presumptively detect noroviruses at the organismal and genogroup level, while DNA sequencing can be used for confirmation and for differentiation of genotypes and strains (Zheng et al., 2006). Although standardized methods for differentiation of noroviruses have been proposed based on the complete sequence of the VP1 gene (Zheng et al., 2006), many studies sequence smaller regions of either the RdRp or VP1 genes and use a variety of primer sets, sequence analysis methods, and nomenclature, leading to inconsistencies in data reporting and strain/genotype identification (Kroneman et al., 2013; Kroneman et al., 2011; Mattison et al., 2009). Furthermore, frequent recombination events at the ORF1-ORF2 region have resulted in different genotype assignments depending on which gene region is targeted for sequencing (Bull et al., 2007) and rapid evolution of GII.4 noroviruses has led to the emergence of highly virulent strains that have not previously been characterized (Siebenga et al., 2007; Siebenga et al., 2009; Zheng et al., 2010). Infections consisting of multiple strains are also difficult to analyze using traditional sequencing because this method is not able to discern multiple sequences in a mixed sample.

The use of RT-PCR followed by electrospray ionization mass spectrometry (ESI-MS) provides a potential means for rapid detection and differentiation of norovirus genotypes and strains (Ecker et al., 2008; Sampath et al., 2007b). With this method, fragments of the genome are first amplified by RT-PCR and then analyzed by ESI-MS to determine the precise mass and corresponding base composition (numbers of A, G, C, and T) of each PCR product. These base compositions are then compared to a reference database to allow for the identification and differentiation of organisms. While this method does not provide the ordered sequence of the nucleotides, knowledge of the base composition alone has proven to be sufficient in many cases to allow for organism identification, as PCR/ESI-MS assays utilize primers that bind to conserved regions flanking highly variable sequences among the organism(s) of interest (Ecker et al., 2008; Sampath et al., 2007b). In addition to the potential to detect noroviruses and characterize them at the genotype and strain levels, PCR/ESI-MS allows for the identification of organisms in mixed samples, and analysis can be completed within one working day. Furthermore, this method allows for multiplexing of primers so that several regions of the genome can be targeted simultaneously, resulting in greater potential for the identification of new strains and recombinants. PCR/ESI-MS assays have been published for the detection and characterization of a number of organisms, including respiratory pathogens (Ecker et al., 2005; Sampath et al., 2005; Sampath et al., 2007a; Sampath et al., 2007b), vector-borne pathogens (Crowder et al., 2012; Crowder et al., 2010; Eshoo et al., 2010; Eshoo et al., 2007; Grant-Klein et al., 2010), biothreat agents (Jacob et al., 2012; Sampath et al., 2012; Van Ert et al., 2004) and common bacterial food pathogens (e.g., Salmonella, E. coli, and Campylobacter) (Hannis et al., 2008; Pierce et al., 2012; Shen et al., 2013), but not for noroviruses. PCR/ESI-MS could potentially be used alongside current sequence-based techniques to rapidly identify norovirus genotypes and variants in surveillance and outbreak situations.
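
To make the base-composition idea concrete, the following minimal Python sketch counts the A, G, C, and T in an amplicon and looks the resulting signature up in a toy reference table. The sequences, table entries, and function names are invented for illustration and are not taken from the assay or its database. A real PCR/ESI-MS platform infers the composition from the measured amplicon mass rather than from a known sequence.

```python
from collections import Counter

def base_composition(seq):
    """Return (A, G, C, T) counts for a DNA amplicon sequence."""
    counts = Counter(seq.upper())
    return (counts["A"], counts["G"], counts["C"], counts["T"])

# Hypothetical reference table: base-composition signature -> label.
REFERENCE = {
    base_composition("ATGGCGTCAA"): "toy strain X",
    base_composition("ATGGTGTTAA"): "toy strain Y",
}

def identify(seq):
    """Look up an amplicon's base composition; None if no entry matches."""
    return REFERENCE.get(base_composition(seq))

print(base_composition("ATGGCGTCAA"))
# A different nucleotide order with the same counts gives the same signature,
# which is why base composition alone does not recover the ordered sequence.
print(identify("AACTGCGGTA"))
```

Because the signature ignores nucleotide order, discrimination in such a scheme depends on choosing amplicons whose compositions differ between the organisms of interest.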

The goal of this study was to develop a novel RT-PCR/ESI-MS assay for the rapid detection and genetic differentiation of noroviruses. Because of the ability of the assay to test for multiple pathogens simultaneously, detection of hepatitis A virus was also considered for incorporation into this assay. Like norovirus, hepatitis A virus is a single-stranded RNA virus spread through the fecal-oral route (Nainan et al., 2006). There are three known genotypes of hepatitis A virus (I-III) that infect humans, with subgenotype IA being most commonly associated with disease (Sanchez et al., 2007). Although hepatitis A virus has a relatively low incidence among foodborne viruses in the U.S., it has the highest rates of hospitalization (31.5%) and death (2.4%) (Scallan et al., 2011). As current detection techniques for hepatitis A virus generally rely on RT-PCR assays specific for the virus (Sanchez et al., 2007), an assay that allows for simultaneous identification of both noroviruses and hepatitis A viruses would likely prove advantageous when screening for these viruses.

2. Materials and Methods

2.1 Assay Design

Over 5000 norovirus sequences and 150 hepatitis A virus sequences were downloaded from GenBank for use in assay design. In cases where genotype information was not available for a particular norovirus sequence, the Norovirus Automated Genotyping Tool was used (Kroneman et al., 2011). Sequences were aligned with Clustal W (Thompson et al., 1994) in BioEdit Sequence Alignment Editor, v. 7.1.3.0 (Hall, 1999) and examined for regions of high variability flanked by regions of conserved primer-binding sites. Based on these alignments, a set of primers was designed for this study to collectively amplify all human noroviruses (Table 1). Norovirus primers were designed for maximal differentiation amongst the various human norovirus genotypes, as well as differentiation among GII.4 strains. Primers targeting human hepatitis A viruses (Table 1) were also used in this study for the purpose of universal amplification and detection at the virus level. The hepatitis A virus primers had been designed previously by Ibis Biosciences, but had not been published. A set of primer pairs was also designed for amplification of the internal control. Primers were analyzed in silico for parameters such as primer-dimer formation, %GC, and annealing temperature. The primers were separated into 8 wells, with 5 of the wells containing multiple primer pairs, and then arranged in a 96-well plate format, allowing for analysis of 12 samples per plate (Fig. 1).
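
The plate arithmetic described above (8 primer wells per sample, 12 samples per 96-well plate) can be sketched as follows. This is only an illustration: the orientation (one sample per column, one primer well per row) is an assumption, since the layout of Fig. 1 is not reproduced in the text, and the sample names and function name are hypothetical.

```python
import string

PRIMER_WELLS_PER_SAMPLE = 8  # from the text: primers split across 8 wells
PLATE_ROWS = string.ascii_uppercase[:PRIMER_WELLS_PER_SAMPLE]  # rows A-H
PLATE_COLUMNS = range(1, 13)                                   # columns 1-12

def plate_map(sample_ids):
    """Map each well ID to (sample, primer-well number).

    Assumes one sample per column and one primer well per row; the
    orientation actually used by the assay is not stated in the text.
    """
    if len(sample_ids) > len(PLATE_COLUMNS):
        raise ValueError("a 96-well plate holds at most 12 samples")
    layout = {}
    for column, sample in zip(PLATE_COLUMNS, sample_ids):
        for primer_well, row in enumerate(PLATE_ROWS, start=1):
            layout[f"{row}{column}"] = (sample, primer_well)
    return layout

layout = plate_map([f"sample_{i}" for i in range(1, 13)])
print(len(layout))    # number of wells used on a full plate
print(layout["C5"])   # sample and primer well assigned to well C5
```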

2.2 Specimen collection and preparation

A total of 206 stool samples from the Orange County Public Health Laboratory (Santa Ana, CA) were obtained for use in this project. Use of biological specimens was approved by the U.S. Food and Drug Administration (FDA) Research Involving Human Subjects Committee (RIHSC #12-009A). The specimens were derived from various outbreaks and single cases of norovirus illness reported to the public health laboratory during 2006-2011. This collection consisted of 196 stool samples that previously tested positive for norovirus and 10 stool samples that previously tested negative for norovirus. Norovirus detection and genogroup information were based on the results of real-time RT-PCR testing at the time of illness using a previously described method (Kageyama et al., 2003). A subset of these samples (n = 36) had previously undergone sequencing-based genotyping (Vinje et al., 2004) as part of CaliciNet, a national outbreak surveillance network developed by the Centers for Disease Control and Prevention (CDC) (Vega et al., 2011). The genotyped samples were used to build the RT-PCR/ESI-MS database (Table 2). The remaining norovirus samples were used to test the RT-PCR/ESI-MS assay for inclusivity.

All stool samples underwent clarification prior to RNA extraction as follows: approximately 0.1 g of stool was added to a glass vial containing 1 ml of Vertrel XF (DuPont, Wilmington, DE) and 1 ml of sterile water. Samples were then mixed by vortex for 1 min and centrifuged at 1700 × g for 10 min. The aqueous phase containing the virus was then removed by pipetting and stored at -70°C until RNA extraction. RNA was extracted from clarified samples using the QIAamp Viral RNA Mini Kit (Qiagen, Valencia, CA), Spin Column Protocol, according to the manufacturer’s instructions. Reagent blanks were included in the RNA extractions as negative controls.

Hepatitis A virus, feline calicivirus, murine norovirus, and poliovirus from the American Type Culture Collection (ATCC) were also used for inclusivity and exclusivity testing of the RT-PCR/ESI-MS assay. Three samples of hepatitis A virus were tested from strains HAS-15 (ATCC #VR-2281), HM-175/18f (ATCC #VR-1402) and PA21 (ATCC #VR-1357), representing human genotypes IA, IB and IIIA, respectively. One sample of each of the following was used for exclusivity testing: feline calicivirus strain F9 (ATCC #VR-782), murine norovirus-1 (ATCC #PTA-5935), and poliovirus type 3, strain LSC (ATCC #VR-1001AS/HO).

2.3 RT-PCR/ESI-MS

All samples underwent RT-PCR/ESI-MS using the assay designed in this study. One-step RT-PCR was performed with each reaction well having the following components: 5 µl RNA or non-template control, 2 U Superscript III Reverse Transcriptase (Life Technologies, Carlsbad, CA), 2 ng/µl sonicated poly A RNA (Sigma-Aldrich, St. Louis, MO), 1 ng/µl random hexamers (Life Technologies), 10 mM dithiothreitol (DTT, Life Technologies), 0.01 U/µl SUPERase In RNase Inhibitor (Life Technologies), 5 U AmpliTaq Gold DNA Polymerase (Roche Molecular Systems, Pleasanton, CA), 200 µM each dATP, dCTP, and dTTP (Bioline, Taunton, MA), 200 µM ¹³C-enriched dGTP (Cambridge Isotope Laboratories, Andover, MA), 1.4 mM MgCl₂ (Life Technologies), 250 nM each primer (Table 1; Integrated DNA Technologies, Coralville, IA), 20 mM Tris (pH 8.3), 75 mM KCl, 0.4 M betaine, and 20 mM sorbitol (Sigma-Aldrich) in a total volume of 40 µl. RT-PCR was carried out with a Mastercycler proS thermocycler (Eppendorf, Hauppauge, NY) using the following cycling conditions: 60°C for 5 min; 4°C for 10 min; 55°C for 45 min; 95°C for 10 min; 8 cycles of 95°C for 30 s, 48°C for 30 s (increase 0.9°C for each cycle), and 72°C for 30 s; 37 cycles of 95°C for 15 s, 56°C for 20 s, and 72°C for 20 s; and a final extension of 72°C for 2 min followed by a 4°C hold. Each reaction also contained an internal positive control (“calibrant”) made from synthetic DNA (Blue Heron Biotechnology, Bothell, WA). The calibrant was present in each reaction at a level of 200 copies and allowed for a semi-quantitative estimate of the starting number of genome copies in each PCR well.
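
As a worked illustration of the cycling conditions above, the short Python sketch below lists the annealing temperature for each of the 8 stepped cycles and sums the programmed hold times. It assumes the 0.9°C increase is applied after each cycle, starting from 48°C, and it ignores ramp times and the final 4°C hold, so the total is a lower bound on instrument time. The function names are illustrative only.

```python
# Hold steps before cycling: (temperature in °C, duration in seconds)
INITIAL_HOLDS = [(60, 5 * 60), (4, 10 * 60), (55, 45 * 60), (95, 10 * 60)]

def stepped_annealing(start=48.0, step=0.9, cycles=8):
    """Annealing temperature (°C) for each cycle of the first cycling stage."""
    return [round(start + step * i, 1) for i in range(cycles)]

def programmed_minutes():
    """Total programmed hold time in minutes (no ramping, no final 4 °C hold)."""
    seconds = sum(duration for _, duration in INITIAL_HOLDS)
    seconds += 8 * (30 + 30 + 30)    # stage 1: 8 cycles of 30 s / 30 s / 30 s
    seconds += 37 * (15 + 20 + 20)   # stage 2: 37 cycles of 15 s / 20 s / 20 s
    seconds += 2 * 60                # final extension at 72 °C
    return seconds / 60

print(stepped_annealing())
print(round(programmed_minutes(), 1))
```

Under these assumptions the annealing temperature rises from 48.0°C to 54.3°C over the first stage, and the programmed holds add up to roughly 118 min.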

Following RT-PCR, the assay plate was loaded onto the PCR/ESI-MS platform (Abbott Molecular, Des Plaines, IL) for amplicon desalting and ESI-MS analysis, as described previously (Ecker et al., 2008; Hofstadler et al., 2005; Jiang and Hofstadler, 2003). The analysis software determined the base compositions of each amplicon based on analysis of both the forward and reverse strands. The resulting base compositions were queried against an ESI-MS database populated with the expected base compositions for the norovirus and hepatitis A virus sequences downloaded from GenBank (described in section 2.1) and the base compositions acquired in this study for the 36 previously genotyped norovirus samples (Table 2). The top database matches, approximate levels of genome copies per well, and associated Q-scores were recorded for each sample. The Q-score, a rating between 0 (low) and 1 (high), represents a relative measure of the strength of the data supporting identification (Sampath et al., 2012; Simner et al., 2013). For this assay, a Q-score ≥ 0.85 was considered to be a reportable result. To reduce the possibility of false positives, a threshold of ≥ 25 genome copies per well was also required for a reportable result. In cases where the ESI-MS result did not match previous identifications, the genotypes of sequences derived from GenBank were checked with the Norovirus Automated Genotyping Tool to ensure proper nomenclature was used (Kroneman et al., 2011).
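
The two reporting criteria stated above (Q-score ≥ 0.85 and ≥ 25 genome copies per well) amount to a simple filter, sketched below in Python. The `Match` record, its field names, and the example values are hypothetical and do not reflect the output format of the platform's analysis software.

```python
from dataclasses import dataclass

Q_SCORE_MIN = 0.85   # reportable Q-score threshold stated in the text
COPIES_MIN = 25      # reportable genome copies per well stated in the text

@dataclass
class Match:
    """One database match for a sample (field names are hypothetical)."""
    identification: str
    q_score: float
    copies_per_well: float

def is_reportable(match):
    """True when a match meets both reporting thresholds."""
    return match.q_score >= Q_SCORE_MIN and match.copies_per_well >= COPIES_MIN

# Invented example matches, not data from the study.
matches = [
    Match("GII.4", 0.96, 1200),
    Match("GI.3", 0.80, 300),    # fails the Q-score threshold
    Match("GII.12", 0.91, 10),   # fails the copy-number threshold
]
reportable = [m.identification for m in matches if is_reportable(m)]
print(reportable)
```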

2.4 Comparison to recognized method

The ability of the RT-PCR/ESI-MS assay plate designed in this study to detect and differentiate norovirus samples was compared to a DNA sequencing method for genotyping (Vinje et al., 2004). A subset of 41 of the norovirus samples listed above was chosen for method comparison. Samples were selected to include both GI and GII noroviruses detected over the course of several years (2007-2011). These samples had previously been identified as norovirus GI or GII by real-time RT-PCR, but had not been genotyped. All samples underwent both sequencing-based genotyping and RT-PCR/ESI-MS genotyping using the newly developed plate and the identifications from each method were compared. The DNA sequences obtained through sequencing-based genotyping were submitted to GenBank (Accession No. KF569765-KF569799).

2.5 Statistical analysis

Concordance among RT-PCR/ESI-MS-determined genotypes and strains for norovirus samples involved in outbreaks was assessed statistically by assigning a numerical value to each outbreak. Concordant outbreaks were assigned a score of 2, semi-concordant outbreaks were assigned a score of 1, and discordant outbreaks were assigned a score of 0. An outbreak was considered to be concordant when all samples from that outbreak showed the exact same top match or matches to the ESI-MS database, resulting in the exact same identification. An outbreak was considered to be semi-concordant when all samples from that outbreak shared a top match to the ESI-MS database, but one or more samples also showed a top match to another entry in the database. Discordant outbreaks were those in which none of the top matches to the ESI-MS database were shared by all samples from a given outbreak. After each outbreak was assigned a concordance value, the samples were randomly sorted into new outbreak groups and the same scoring system was carried out. Analyses were performed separately based on concordance among top genotype matches and concordance among top strain matches. For the purposes of this paper, strains were defined as distinct signatures in the ESI-MS database. The scores of the randomized samples were compared to the original samples using a Pearson’s chi-square test in IBM SPSS Statistics 21 (Armonk, NY), with a pre-determined significance value of p < 0.05, two-tailed.
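
The scoring rules above can be expressed as a small Python sketch, shown here under stated assumptions rather than as the authors' procedure. Each sample is represented by the set of its top database matches. The random regrouping keeps the original group sizes, which is an assumption because the text does not say how the new groups were sized, and the chi-square comparison performed in SPSS is not reproduced. All names and example data are invented.

```python
import random

def score_outbreak(top_matches):
    """Score one outbreak from its samples' sets of top database matches.

    2 = concordant (identical sets), 1 = semi-concordant (at least one
    match shared by all samples), 0 = discordant (no shared match).
    """
    sets = [frozenset(m) for m in top_matches]
    if all(s == sets[0] for s in sets):
        return 2
    if frozenset.intersection(*sets):
        return 1
    return 0

def shuffled_groups(outbreaks, seed=None):
    """Randomly reassign samples to new groups of the original sizes."""
    pooled = [sample for outbreak in outbreaks for sample in outbreak]
    random.Random(seed).shuffle(pooled)
    groups, start = [], 0
    for outbreak in outbreaks:
        groups.append(pooled[start:start + len(outbreak)])
        start += len(outbreak)
    return groups

# Invented example data: three outbreaks of two samples each.
outbreaks = [
    [{"GII.4"}, {"GII.4"}],            # concordant
    [{"GI.3"}, {"GI.3", "GII.12"}],    # semi-concordant
    [{"GII.6"}, {"GII.1"}],            # discordant
]
print([score_outbreak(o) for o in outbreaks])
print([score_outbreak(g) for g in shuffled_groups(outbreaks, seed=1)])
```

The two score lists correspond to the "original" and "randomized" distributions that would then be compared with a chi-square test.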

3. Results and Discussion

3.1 Assay and database design

A total of 13 primer pairs were designed to amplify ~50-150 bp fragments of norovirus RNA, hepatitis A virus RNA, and an internal control in a 96-well plate format (Table 1; Fig. 1). The ESI-MS database was successfully populated with the expected base compositions of over 5000 sequences downloaded from GenBank. In many instances, these sequences did not include information for all gene fragments targeted in this assay, in which case the expected signature could only be generated for a portion of the fragments. In a few cases, incomplete sequences that shared base counts with many other database entries and were only amplified by one set of primers were removed from the database to improve data interpretation. RT-PCR/ESI-MS base composition signatures were successfully obtained for the 36 previously genotyped norovirus samples analyzed in this study (Table 2). These samples had a total of 21 unique RT-PCR/ESI-MS signatures that were added to the database prior to inclusivity and exclusivity testing.

Due to limitations in sample collection, the reference samples used in the database only included information for 5 genotypes: GI.3, GII.1, GII.4, GII.6, and GII.12, with 78% of samples identified as GII.4 Minerva or GII.4 New Orleans. However, these samples represent some of the most predominant genotypes associated with outbreaks. For example, GII.12 accounted for 20% of noroviruses submitted to CDC from October 2009 to June 2010 (Vega and Vinje, 2011) and GII.4 strains were found to be responsible for 44% of U.S. outbreaks over the course of 1994-2006 (Zheng et al., 2010) and 62% of global outbreaks over 2001-2007 (Siebenga et al., 2009). The GII.4 strains Minerva (also known as GII.4 2006b) and New Orleans have been predominant outbreak strains in the U.S., with Minerva surfacing in late 2005/early 2006 and New Orleans surfacing in 2009 (Vega et al., 2011). Interestingly, the RT-PCR/ESI-MS signatures determined for these two GII.4 strains showed a number of subtypes due to differences in base compositions of the amplified regions; the GII.4 Minerva samples revealed 6 unique signatures and the GII.4 New Orleans samples revealed 10 unique signatures (Table 2). These differences are likely due to the fact that the RT-PCR/ESI-MS assay targets multiple regions of the norovirus genome and therefore is able to determine sequence variation at multiple locations.

3.2 Inclusivity and exclusivity testing

Norovirus samples used for inclusivity testing included representatives of GI (n = 16), GI/GII (n = 1), and GII (n = 126), as well as 17 samples with no genogroup information (Table 3). Among these samples, 92% were identified as norovirus and 8% could not be detected by RT-PCR/ESI-MS. Previous PCR/ESI-MS studies have found similar to slightly higher levels of sensitivity, ranging from 92 to 100% (Blyn et al., 2008; Eshoo et al., 2010; Sampath et al., 2012; Sampath et al., 2007a). The estimated number of genomic copies per norovirus-positive well ranged from 26 to >2000 copies. It should be noted that these levels are only approximate and are based on relative amplification in comparison to the calibrant. Furthermore, above 2000 genome copies per well, the levels of target organism far exceed those of the calibrant, resulting in an inaccurate estimate of the target concentration (Jacob et al., 2012; Sampath et al., 2012). In these cases, it can only be determined that the amplicon is present in the well at high levels (i.e., >2000 copies). Q-scores ranged from 0.85 to 0.99, with an average of 0.96, indicating high-quality matches to the database entries.

Among the samples detected by RT-PCR/ESI-MS for which genogroup information was originally available, the RT-PCR/ESI-MS genogroup designation showed high concordance (98%) with the original real-time RT-PCR-based identification (Table 3). All 125 samples of GII detected by RT-PCR/ESI-MS were correctly identified as belonging to this genogroup and the 1 sample previously identified as GI/GII also matched both GI and GII signatures in the RT-PCR/ESI-MS database, indicating a case of illness involving multiple norovirus genogroups. As mentioned previously, GI and GII noroviruses have been reported to occur together in a small percentage (3-12%) of outbreaks (Hall et al., 2012; Matthews et al., 2012). Among the 10 GI samples detected by RT-PCR/ESI-MS, 8 were identified as GI and 2 showed top matches to both GI and GII signatures within the RT-PCR/ESI-MS database. Considering that the assay was designed with a focus on GII noroviruses, particularly GII.4 strains, it is not surprising that a greater identification success rate was observed with this genogroup as compared to the GI noroviruses.

The 13 norovirus samples that could not be detected by RT-PCR/ESI-MS were relatively old samples within the dataset, having originally been collected during 2007-2008, and included 6 samples with no genogroup information, 6 originally identified as GI and 1 originally identified as GII. These samples likely failed due to poor quality, as the nucleic acid may have degraded over time. Alternatively, it is possible that polymorphisms in the primer-binding areas prevented amplification of the target regions.

The 147 samples that tested positive for norovirus with RT-PCR/ESI-MS (Table 3) were also assigned a genotype based on the top match(es) to entries in the ESI-MS database. Within GI, the following genotypes were detected: GI.2, GI.3, GI.4, GI.P3/GI.11, and GI.14. Within GII, the genotypes that were detected included: GII.1, GII.4, GII.5, GII.6, GII.12, GII.13, GII.16, GII.17, and several recombinants: GII.Pa-GII.3, GII.P12-GII.10, GII.P16-GII.2, and GII.P19-GII.5. Nomenclature for recombinant noroviruses is as described in Kroneman et al. (2013), where the RdRp-based genotype is listed first (with a capital P for ‘polymerase’), followed by the VP1-based genotype. The majority of samples (n = 119) matched only one genotype in the database, with GII.4 having the greatest number of matches (n = 90). The remaining samples showed top matches to 2-3 genotypes and/or recombinant genotypes in the database.

All three of the hepatitis A virus samples were correctly identified by RT-PCR/ESI-MS (Table 3). Hepatitis A virus has been classified into three human (I-III) and three simian (IV-VI) genotypes, with genotypes I and III most commonly isolated from humans (Nainan et al., 2006). The human genotypes have been further divided into subtypes A and B, based on genetic relatedness. Although this assay was only designed with the goal of detecting hepatitis A virus, each of the three samples tested (IA, IB, and IIIA) was also correctly identified at the subgenotype level by RT-PCR/ESI-MS. While these results are promising, additional testing will be necessary to determine whether or not the assay is a reliable indicator of hepatitis A virus genotypes and/or subgenotypes.

As shown in Table 3, no cross-reactivity was observed during exclusivity testing with murine norovirus, feline calicivirus, and poliovirus, with all samples testing negative for norovirus and hepatitis A virus. Furthermore, the norovirus primers did not show any cross-reactivity when tested against hepatitis A virus and the hepatitis A virus primers did not cross-react with norovirus samples. The 10 stool samples previously identified as being negative for norovirus with real-time RT-PCR all tested negative for norovirus and hepatitis A virus, as did the 11 reagent blanks from RNA extraction and the 23 non-template controls (Table 3). These results demonstrate the high specificity (100%) of the assay developed here, with no false positives observed. Previous PCR/ESI-MS studies have also reported high levels of specificity, with values of 94-100% (Blyn et al., 2008; Crowder et al., 2012; Eshoo et al., 2010; Sampath et al., 2012; Sampath et al., 2007a).
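
For readers who want to retrace the headline figures, the sketch below recomputes sensitivity and specificity from the counts given in this section. It assumes that sensitivity is the fraction of the 160 previously positive norovirus samples detected by RT-PCR/ESI-MS, and that the specificity denominator pools the exclusivity viruses, negative stools, reagent blanks, and non-template controls. The text does not state exactly which negatives were pooled, so that grouping is an assumption, as are the variable names.

```python
def percent(numerator, denominator):
    """Percentage rounded to the nearest whole number."""
    return round(100 * numerator / denominator)

# Counts reported in the text
norovirus_tested = 16 + 1 + 126 + 17   # GI, GI/GII, GII, no genogroup information
norovirus_detected = 147

# Assumed pooling of negatives: exclusivity viruses, negative stools,
# reagent blanks, and non-template controls.
negatives_tested = 3 + 10 + 11 + 23
false_positives = 0

sensitivity = percent(norovirus_detected, norovirus_tested)
specificity = percent(negatives_tested - false_positives, negatives_tested)

print(norovirus_tested, sensitivity, specificity)
```

With these inputs the sensitivity rounds to 92% and the specificity is 100%, matching the values reported above.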

3.3 Comparison to recognized method

SC

M AN U

A subset (n = 41) of the norovirus stool samples tested above with RT-PCR/ESI-MS was also genotyped using a currently recognized sequencing-based method (Vinje et al., 2004) for comparison (Table 4). Out of the 41 samples tested, initially 31 were identified by both methods and 22 of these showed concordant identifications, meaning that the sample matched the exact same genotype when tested with RT-PCR/ESI-MS as it did when tested with sequencing-based genotyping. When the results of repeat RNA extraction and sequencing analyses were included for two samples identified as possible mix-ups, a total of 32 samples were detected by both methods and 24 showed concordant identifications. Subsequent discussion of results incorporates these two updated identifications. Five samples showed semi-concordant identifications, meaning that the genotype identified by sequencing was also among the top matches to the ESI-MS database, but additional genotypes were also among the top database matches. Three of the samples were discordant, meaning that the genotype identified by sequencing was not among the top matches identified by ESI-MS. Among the samples that could not be identified by one or both methods, 5 could not be identified by PCR/ESI-MS, 3 could not be identified by sequencing and 1 could not be identified by either method.
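The three categories defined above can be expressed as a simple set comparison. The Python sketch below is only an illustration of those definitions. The function name `classify` and the representation of the ESI-MS result as a set of genotype labels are assumptions made for this example, and samples that one or both methods could not identify are assumed to be excluded before classification.

```python
from typing import Iterable


def classify(sequencing_genotype: str, esi_ms_top_matches: Iterable[str]) -> str:
    """Compare one sequencing genotype with the top ESI-MS database matches."""
    matches = set(esi_ms_top_matches)
    if sequencing_genotype not in matches:
        # Sequencing genotype is absent from the top ESI-MS matches.
        return "discordant"
    if matches == {sequencing_genotype}:
        # Both methods give exactly the same genotype.
        return "concordant"
    # Sequencing genotype is present, but other genotypes also match.
    return "semi-concordant"


# Examples taken from the results described in this section.
print(classify("GII.4", ["GII.4"]))           # both methods agree
print(classify("GI.3", ["GI.3", "GII.12"]))   # extra ESI-MS match
print(classify("GI.3", ["GI.4"]))             # no overlap
```

With this rule, the 32 samples detected by both methods divide into the 24 concordant, 5 semi-concordant and 3 discordant results reported above.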

Within the GI noroviruses (n = 16), 10 samples were genotyped by both methods: 7 samples showed complete concordance, 2 were semi-concordant, and 1 was discordant (Table 4). The concordant samples included 2 GI.2 noroviruses and 5 GI.3 noroviruses detected with Q-scores of 0.93-0.99 and at levels of 48-1586 genomes/well. One of the semi-concordant samples was identified as GI.3 by sequencing, but had multiple database matches (GI.3 and GII.12) when tested with RT-PCR/ESI-MS. The primary match was to GI.3, with a Q-score of 0.98 and a level of 1564 genomes/well compared to the secondary match to GII.12 with a Q-score of 0.91 and a level of 172 genomes/well. The other semi-concordant sample was also identified as GI.3 by sequencing, but had database matches to both GI.3 and GII.4 when tested with RT-PCR/ESI-MS. As with the other sample, the GI.3 identification showed a higher Q-score and level compared to the GII.4 identification. In both cases with semi-concordant results, the secondary match was to a database entry originating from GenBank with limited sequence information, whereas the top match was to a database entry obtained from one of the GI.3 reference norovirus samples used to initially build the database (Table 2). The one discordant sample was identified as GI.3 by sequencing, but RT-PCR/ESI-MS identified it as GI.4 (Q-score = 0.88; level = 488 genomes/well). The low Q-score indicates a poor quality match that may be improved with the incorporation of GI.4 reference samples into the RT-PCR/ESI-MS database. RT-PCR/ESI-MS was unable to detect five of the samples identified by sequencing as GI.3 and both methods were unable to detect one sample originally identified by real-time RT-PCR as a GI norovirus. The inability of RT-PCR/ESI-MS to amplify some of the GI.3 samples may reflect a lack of primer specificity for this genotype. Additionally, all six samples that failed to amplify were collected during the years 2007-2008 and the viral nucleic acid may have degraded over time.

As expected, GII noroviruses showed greater success with the RT-PCR/ESI-MS assay, with detections obtained for all 24 GII samples (Table 4). Three of these samples could not be identified with the sequencing-based method even though they previously tested positive for norovirus GII with real-time RT-PCR. In all 3 cases, RT-PCR/ESI-MS was able to identify these noroviruses at the genotype level (Q-scores 0.96-0.99, 564-1662 copies/well). Among the remaining 21 samples, 17 showed complete concordance between the RT-PCR/ESI-MS genotype and the sequencing-based genotype, with the majority of samples (n = 10) identified as GII.4. Samples of GII.1 and GII.12 also showed concordant genotype identifications between the two methods. As shown in Table 4, 2 of the GII samples showed semi-concordant genotype identifications and 2 showed discordant genotype identifications. In both instances of semi-concordant genotype identifications, the ESI-MS identification with the highest Q-score was concordant with the sequencing result, while secondary ESI-MS identifications with lower Q-scores did not agree with the sequencing result. This trend was also observed with the GI noroviruses (discussed above), indicating the importance of considering Q-score in the interpretation of results.
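One way to apply this observation when reading multi-match results is to rank the database matches for a sample by Q-score and to flag a weak primary match. The Python sketch below is a hedged illustration only. The `Match` record, the `rank_matches` function and the 0.90 cut-off are assumptions introduced for this example. The study describes Q-scores of 0.88 and 0.89 as low but does not state a formal threshold.

```python
from typing import List, NamedTuple, Tuple


class Match(NamedTuple):
    genotype: str
    q_score: float
    genomes_per_well: int


def rank_matches(
    matches: List[Match], low_q_threshold: float = 0.90
) -> Tuple[Match, List[Match], bool]:
    """Return (primary match, secondary matches, low-quality flag).

    The threshold is a hypothetical cut-off, not a value from the study.
    """
    if not matches:
        raise ValueError("no database matches to rank")
    ordered = sorted(matches, key=lambda m: m.q_score, reverse=True)
    primary = ordered[0]
    return primary, ordered[1:], primary.q_score < low_q_threshold


# Semi-concordant GI sample described in the text (sequenced as GI.3).
primary, secondary, low_quality = rank_matches(
    [Match("GII.12", 0.91, 172), Match("GI.3", 0.98, 1564)]
)
print(primary.genotype, [m.genotype for m in secondary], low_quality)
```

For the discordant GI sample (GI.4, Q-score 0.88), the same function would return the flag as true, marking the identification for closer review.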

Among the discordant results for GII noroviruses, one sample was identified by sequencing as GII.4 and by RT-PCR/ESI-MS as GII.16 and the recombinant GII.P12/GII.10 (Q-score = 0.89, >2000 copies/well). The low Q-score for this sample indicates a relatively poor quality match to entries in the ESI-MS database, a result which may be improved upon as more reference samples are added to the database. Indeed, a previous study using PCR/ESI-MS for the detection of fungi reported that an upgrade to the fungal database reduced the number of misidentifications observed with the assay (Simner et al., 2013). The third discordant sample was identified as GII.4 by RT-PCR/ESI-MS (Q-score = 0.99, >2000 copies/well) and as Bacteroides spp. by sequencing, even though it was previously identified as a GII norovirus by real-time RT-PCR. The most likely explanation for this discordance is that the original stool sample contained both norovirus GII and Bacteroides spp., which are significant inhabitants of the gastrointestinal tract and are found in human feces (Wexler, 2007). This particular strain of Bacteroides appears to have been preferentially amplified by the norovirus sequencing primers but not by the RT-PCR/ESI-MS assay.

Much of the discordance between the sequencing-based method and the RT-PCR/ESI-MS method is likely due to the use of incomplete sequences derived from GenBank to populate the ESI-MS database. In many cases, these sequences did not provide full coverage of the regions targeted by the assay and therefore only provided partial matches to the ESI-MS signatures. As more reference samples with complete information for the target regions are added to the database, the occurrence of semi-concordant or discordant results is likely to be reduced. An additional source of discordance may be differences in genome coverage between the two methods. While the sequencing-based method assigns genotype based on only a portion of the norovirus genome (< 260 bp) within the VP1 gene (Vinje et al., 2004), RT-PCR/ESI-MS targets multiple regions of the genome within short stretches (50-150 bp) of both the RdRp and VP1 genes. Recombination of norovirus genomes at the ORF1-ORF2 junction is common and can result in a different genotype for the RdRp region as compared to the VP1 region (Bull et al., 2007). Therefore, it is possible that a sample identified as one genotype by sequencing may show multiple matches by RT-PCR/ESI-MS due to recombination. While this results in the potential for RT-PCR/ESI-MS to show conflicting results when compared to sequencing, it also allows for greater potential for discovery of variants and recombinants that may not have been detected with sequencing alone. Furthermore, RT-PCR/ESI-MS is able to detect multiple strains within one sample, which may contribute to some discordance when comparing identifications with sequencing-based results, especially if one strain is present in lower concentrations.

Taken together, the results presented above show that the RT-PCR/ESI-MS assay has a number of advantages and disadvantages when compared to the sequencing-based method. One advantage, which has been illustrated in previous RT-PCR/ESI-MS studies (Crowder et al., 2010; Sampath et al., 2005), is the ability to identify mixtures of multiple genotypes in one sample. Additionally, by covering multiple regions of the norovirus genome, the RT-PCR/ESI-MS assay has potential for a higher level of differentiation among strains than that achieved with sequencing. Considering the high level of diversity and rapid evolution of norovirus strains, especially GII.4 strains (Siebenga et al., 2007; Siebenga et al., 2009; Zheng et al., 2010), coverage of multiple regions also improves the possibility of detecting norovirus using a single assay. Similarly, previous studies found RT-PCR/ESI-MS to be advantageous for the simultaneous detection and differentiation of arboviruses, which are also genetically diverse RNA viruses that often require multiple assays for their detection (Eshoo et al., 2007; Grant-Klein et al., 2010). A further advantage of ESI-MS is its ease of use, as it involves fewer steps than sequencing-based methods and relies primarily on automation. However, a disadvantage of the RT-PCR/ESI-MS assay is that the results are not always concordant with the sequencing-based assay, especially in the case of GI or recombinant noroviruses. This is likely to improve as the database is populated with more reference samples, but it may never reach full concordance due to the multiple regions targeted by the RT-PCR/ESI-MS assay and the ability of this assay to detect multiple strains in one sample. With the above in mind, it is recommended that the RT-PCR/ESI-MS assay be used to assist with the rapid detection and differentiation of noroviruses, but that traditional sequencing should also be performed for confirmation purposes. As more reference sample sequences are added to the database, it is expected that the RT-PCR/ESI-MS assay will become an increasingly powerful tool for norovirus testing.

3.4 Outbreak concordance

As a further step in evaluating the usefulness of the RT-PCR/ESI-MS assay for norovirus typing in outbreak and surveillance situations, the top ESI-MS database matches for norovirus samples in this study associated with outbreaks were compared for genotype and strain concordance within each outbreak (Figs. 2a and 2b). These samples (n = 131) were associated with 50 different outbreaks in Orange County, CA, including 26 outbreaks with 2 samples per outbreak and 24 outbreaks with 3 or more samples per outbreak. Complete concordance among genotype identifications was observed for 24 of the 26 outbreaks represented by 2 cases, meaning that both samples from the same outbreak showed the exact same top match or matches to the ESI-MS database. The majority (n = 21) of the outbreaks with concordant genotype results were associated with GII.4, while a few were associated with other genotypes, including GI.3, GII.1, GII.12, and GII.17. One of the outbreaks represented by 2 cases showed semi-concordant genotype results, in which one sample showed top matches to GII.4 and GII.17 and the other sample showed top matches to GII.4, GII.17, and GII.P16/GII.2. The one outbreak represented by 2 cases that showed discordant genotype results was associated with one sample that had a top match to GII.4 and another sample with a top match to GII.1. When the 26 outbreaks represented by 2 cases each were compared based on identification of samples at the strain level, there were 17 outbreaks with concordant results, 4 outbreaks with semi-concordant results, and 5 outbreaks with discordant results. The majority (82%) of concordant outbreaks were associated with GII.4 strains, while the remaining concordant outbreaks were associated with GI.3, GII.1, or GII.12 strains. The outbreaks with semi-concordant strain identifications included the 1 outbreak described above with semi-concordant genotype results, as well as 3 involving strains from GII.4 and GII.17. With the exception of the outbreak described above involving 2 different genotypes, outbreaks with discordant strain identifications were all associated with GII.4, but were linked to different GII.4 strains in the database.

Outbreaks represented by 3 or more cases are shown graphically (Figs. 2a and 2b). As shown in Fig. 2a, samples from the same outbreak had a high level of genotype concordance, with 18 of 24 outbreaks showing 100% concordance among all samples analyzed. These outbreaks were each represented by 3-5 samples and were primarily outbreaks of GII.4 (n = 14), but also included one outbreak of each of the following: GII.1, GII.6, GII.12, and a set of recombinants (GII.P16/GII.2 and GII.Pa/GII.3). An additional 5 outbreaks showed concordance among genotype results for the majority of samples, but had one sample with semi-concordant genotype results. For example, 2 of the samples associated with outbreak no. 2 showed top matches to GII.4 and a third sample showed a top match to database entries corresponding to GII.4 and GII.17. Only 1 of the 24 outbreaks had samples that showed discordant genotype results (outbreak no. 7). This outbreak had 2 samples with top matches to both GII.16 and GII.P12/GII.10, while a third sample had a top match to GII.4.

As shown in Fig. 2b, when the results were compared on the basis of the top strain(s) in the database matching each sample, the number of concordant outbreaks decreased. Out of the 18 outbreaks that showed concordance on the basis of genotype, only 9 continued to show complete concordance with strain identifications. For most (80%) of the outbreaks showing semi-concordance or discordance, the majority of samples showed complete concordance for the top strain match, with just one sample showing a semi-concordant or discordant result. For example, all 5 samples from outbreak no. 17 showed a top match to one of the ESI-MS signatures for GII.4 New Orleans, but 1 of the samples also matched another ESI-MS signature linked to a different sample of GII.4 New Orleans found in the database. Since strains are defined here as ESI-MS database entries with distinct signatures, this sample was determined to be semi-concordant.

The results of statistical analysis on all outbreak samples revealed that both genotype and strain concordance values were significantly greater (p < 0.05, two-tailed) for the original ESI-MS identifications as compared to the randomized dataset. Based on a scale of 0 (discordant) to 2 (fully concordant), the average genotype concordance score for the original dataset was 1.8 ± 0.6 compared to 1.0 ± 1.0 in the randomized dataset, while the average strain concordance score for the original dataset was 1.3 ± 0.8 compared to 0.1 ± 0.4 in the randomized dataset. These results show that RT-PCR/ESI-MS allows for a high level of agreement among outbreak samples based on genotype, with an average score close to that for full concordance. The lower level of agreement found with strains is likely due to the fact that single nucleotide polymorphisms among samples associated with the same outbreak can result in different matches to the database, especially when the database is composed largely of fragmented sequences from GenBank. As the database is updated with additional reference samples, it is likely that the concordance score will improve. Overall, the results of the outbreak concordance analysis indicate the potential usefulness of this assay for norovirus detection and differentiation in surveillance and outbreak applications, especially with regard to genotype identifications.
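The comparison above can be pictured as scoring each outbreak on the 0 to 2 scale and repeating the scoring after shuffling identifications across outbreaks. The Python sketch below is an illustration under stated assumptions and not the study's statistical procedure. The scoring rule (2 when all samples share identical top matches, 1 when they share at least one match, 0 otherwise), the outbreak-level rather than sample-level scoring, the function names and the fixed random seed are all choices made for this example. The example outbreaks reuse match patterns described in this section.

```python
import random
from statistics import mean


def outbreak_score(samples):
    """Return 2 (concordant), 1 (semi-concordant) or 0 (discordant)."""
    sets = [frozenset(s) for s in samples]
    if not sets:
        raise ValueError("an outbreak needs at least one sample")
    if all(s == sets[0] for s in sets):
        return 2  # every sample has the same top match or matches
    if frozenset.intersection(*sets):
        return 1  # at least one match is shared by all samples
    return 0      # no match common to all samples


def shuffled(outbreaks, seed=0):
    """Reassign identifications at random while keeping outbreak sizes."""
    rng = random.Random(seed)
    pool = [s for outbreak in outbreaks for s in outbreak]
    rng.shuffle(pool)
    it = iter(pool)
    return [[next(it) for _ in outbreak] for outbreak in outbreaks]


outbreaks = [
    [{"GII.4"}, {"GII.4"}],
    [{"GII.4", "GII.17"}, {"GII.4", "GII.17", "GII.P16/GII.2"}],
    [{"GII.4"}, {"GII.1"}],
    [{"GII.16", "GII.P12/GII.10"}, {"GII.16", "GII.P12/GII.10"}, {"GII.4"}],
]
original_scores = [outbreak_score(o) for o in outbreaks]
random_scores = [outbreak_score(o) for o in shuffled(outbreaks)]
print(mean(original_scores), mean(random_scores))
```

In a full analysis the shuffle would be repeated many times, or the two score distributions compared with a two-tailed test, to judge whether the observed concordance exceeds what chance grouping would give.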

4. Conclusions

An RT-PCR/ESI-MS assay was developed in this study for the detection and differentiation of human noroviruses as well as the detection of hepatitis A virus. The assay was successful at identifying human norovirus in 92% of clinical samples and showed a specificity of 100% when tested against closely related viruses. In addition to detection of norovirus, the assay was able to correctly identify norovirus at the genogroup level in 98% of amplified samples and at the genotype level in 71-75% of samples. As more reference samples are added to the database, the genotyping ability of the assay is expected to improve. This assay shows potential to reduce the time and labor needed to detect and differentiate noroviruses and may enhance the ability of public health scientists to identify the source and the spread of norovirus illnesses related to outbreak situations. In addition to identification of strains currently in circulation, this assay allows for the classification of recombinants and new norovirus strains due to its range of target amplicons. Importantly, this assay would not be expected to replace the current sequencing-based genotyping method, but rather it would provide an additional tool for rapid characterization of noroviruses in outbreak or surveillance situations.

Acknowledgments

Funding for this study was provided by the FDA Center for Food Safety and Applied Nutrition (CFSAN). We would like to thank the FDA Commissioner's Fellowship Program; Steven Musser, John Callahan, Rebecca Bell, Marc Allard, and Erik Burrows at FDA/CFSAN for their support; Lee-Ann Jaykus at North Carolina State University for assistance with this project; and Natasha Fazel for help with sample processing.

References

Blanton, L. H., Adams, S. M., Beard, R. S., Wei, G., Bulens, S. N., Widdowson, M. A., Glass, R. I., & Monroe, S. S. (2006). Molecular and epidemiologic trends of caliciviruses associated with outbreaks of acute gastroenteritis in the United States, 2000−2004. Journal of Infectious Diseases, 193(3), 413-421.

Blyn, L. B., Hall, T. A., Libby, B., Ranken, R., Sampath, R., Rudnick, K., Moradi, E., Desai, A., Metzgar, D., Russell, K. L., Freed, N. E., Balansay, M., Broderick, M. P., Osuna, M. A., Hofstadler, S. A., & Ecker, D. J. (2008). Rapid detection and molecular serotyping of adenovirus by use of PCR followed by electrospray ionization mass spectrometry. Journal of Clinical Microbiology, 46(2), 644-651.

Bull, R. A., Tanaka, M. M., & White, P. A. (2007). Norovirus recombination. Journal of General Virology, 88(12), 3347-3359.

Crowder, C. D., Matthews, H. E., Rounds, M. A., Li, F., Schutzer, S. E., Sampath, R., Hofstadler, S. A., Ecker, D. J., & Eshoo, M. W. (2012). Detection of heartworm infection in dogs via PCR amplification and electrospray ionization mass spectrometry of nucleic acid extracts from whole blood samples. American Journal of Veterinary Research, 73(6), 854-859.

Crowder, C. D., Matthews, H. E., Schutzer, S., Rounds, M. A., Luft, B. J., Nolte, O., Campbell, S. R., Phillipson, C. A., Li, F., Sampath, R., Ecker, D. J., & Eshoo, M. W. (2010). Genotypic variation and mixtures of Lyme Borrelia in Ixodes ticks from North America and Europe. Plos One, 5(5), e10650.

Ecker, D. J., Sampath, R., Blyn, L. B., Eshoo, M. W., Ivy, C., Ecker, J. A., Libby, B., Samant, V., Sannes-Lowery, K. A., Melton, R. E., Russell, K., Freed, N., Barrozo, C., Wu, J., Rudnick, K., Desai, A., Moradi, E., Knize, D. J., Robbins, D. W., Hannis, J. C., Harrell, P. M., Massire, C., Hall, T. A., Jiang, Y., Ranken, R., Drader, J. J., White, N., McNeil, J. A., Crooke, S. T., & Hofstadler, S. A. (2005). Rapid identification and strain-typing of respiratory pathogens for epidemic surveillance. Proceedings of the National Academy of Sciences, 102(22), 8012-8017.

Ecker, D. J., Sampath, R., Massire, C., Blyn, L. B., Hall, T. A., Eshoo, M. W., & Hofstadler, S. A. (2008). Ibis T5000: a universal biosensor approach for microbiology. Nature Reviews Microbiology, 6(7), 553-558.

Eshoo, M. W., Crowder, C. D., Li, H., Matthews, H. E., Meng, S., Sefers, S. E., Sampath, R., Stratton, C. W., Blyn, L. B., Ecker, D. J., & Tang, Y.-W. (2010). Detection and identification of Ehrlichia species in blood by use of PCR and electrospray ionization mass spectrometry. Journal of Clinical Microbiology, 48(2), 472-478.

Eshoo, M. W., Whitehouse, C. A., Zoll, S. T., Massire, C., Pennella, T.-T. D., Blyn, L. B., Sampath, R., Hall, T. A., Ecker, J. A., Desai, A., Wasieloski, L. P., Li, F., Turell, M. J., Schink, A., Rudnick, K., Otero, G., Weaver, S. C., Ludwig, G. V., Hofstadler, S. A., & Ecker, D. J. (2007). Direct broad-range detection of alphaviruses in mosquito extracts. Virology, 368(2), 286-295.

Grant-Klein, R. J., Baldwin, C. D., Turell, M. J., Rossi, C. A., Li, F., Lovari, R., Crowder, C. D., Matthews, H. E., Rounds, M. A., Eshoo, M. W., Blyn, L. B., Ecker, D. J., Sampath, R., & Whitehouse, C. A. (2010). Rapid identification of vector-borne flaviviruses by mass spectrometry. Molecular and Cellular Probes, 24(4), 219-228.

Green, K. Y., Chanock, R. M., & Kapikian, A. Z. (2001). Human Caliciviruses. In D. M. Knipe, P. M. Howley, D. E. Griffin, M. A. Martin, R. A. Lamb, B. Roizman & S. E. Straus (Eds.), Field's Virology (4th ed., pp. 841-874). Philadelphia: Lippincott Williams and Wilkins.

Hall, A. J., Eisenbart, V. G., Etingüe, A. L., Gould, L. H., Lopman, B. A., & Parashar, U. D. (2012). Epidemiology of foodborne norovirus outbreaks, United States, 2001–2008. Emerging Infectious Diseases, 18(10), 1566–1573.

Hall, A. J., Vinjé, J., Lopman, B. A., Park, G. W., Yen, C., Gregoricus, N., & Parashar, U. D. (2011). Centers for Disease Control and Prevention. Updated norovirus outbreak management and disease prevention guidelines. Morbidity and Mortality Weekly Report, 60(3), 1-15.

Hall, T. A. (1999). BioEdit: a user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT. Nucleic Acids Symposium Series, 41, 95-98.

Hannis, J. C., Manalili, S. M., Hall, T. A., Ranken, R., White, N., Sampath, R., Blyn, L. B., Ecker, D. J., Mandrell, R. E., Fagerquist, C. K., Bates, A. H., Miller, W. G., & Hofstadler, S. A. (2008). High-resolution genotyping of Campylobacter species by use of PCR and high-throughput mass spectrometry. Journal of Clinical Microbiology, 46(4), 1220-1225.

Hofstadler, S. A., Sampath, R., Blyn, L. B., Eshoo, M. W., Hall, T. A., Jiang, Y., Drader, J. J., Hannis, J. C., Sannes-Lowery, K. A., Cummins, L. L., Libby, B., Walcott, D. J., Schink, A., Massire, C., Ranken, R., Gutierrez, J., Manalili, S., Ivy, C., Melton, R., Levene, H., Barrett-Wilt, G., Li, F., Zapp, V., White, N., Samant, V., McNeil, J. A., Knize, D., Robbins, D., Rudnick, K., Desai, A., Moradi, E., & Ecker, D. J. (2005). TIGER: the universal biosensor. International Journal of Mass Spectrometry, 242(1), 23-41.

Jacob, D., Sauer, U., Housley, R., Washington, C., Sannes-Lowery, K., Ecker, D. J., Sampath, R., & Grunow, R. (2012). Rapid and high-throughput detection of highly pathogenic bacteria by Ibis PLEX-ID technology. Plos One, 7(6), e39928.

Jiang, Y., & Hofstadler, S. A. (2003). A highly efficient and automated method of purifying and desalting PCR products for analysis by electrospray ionization mass spectrometry. Analytical Biochemistry, 316(1), 50-57.

Kageyama, T., Kojima, S., Shinohara, M., Uchida, K., Fukushi, S., Hoshino, F. B., Takeda, N., & Katayama, K. (2003). Broadly reactive and highly sensitive assay for norwalk-like viruses based on real-time quantitative reverse transcription-PCR. Journal of Clinical Microbiology, 41(4), 1548-1557.

Kroneman, A., Vega, E., Vennema, H., Vinjé, J., White, P., Hansman, G., Green, K., Martella, V., Katayama, K., & Koopmans, M. (2013). Proposal for a unified norovirus nomenclature and genotyping. Archives of Virology, 1-10.

Kroneman, A., Vennema, H., Deforche, K., Avoort, H., Peñaranda, S., Oberste, M., Vinjé, J., & Koopmans, M. (2011). An automated genotyping tool for enteroviruses and noroviruses. Journal of Clinical Virology, 51(2), 121-125.

Matthews, J. E., Dickey, B. W., Miller, R. D., Felzer, J. R., Dawson, B. P., Lee, A. S., Rocks, J. J., Kiel, J., Montes, J. S., Moe, C. L., Eisenberg, J. N. S., & Leon, J. S. (2012). The epidemiology of published norovirus outbreaks: a review of risk factors associated with attack rate and genogroup. Epidemiology & Infection, 140(07), 1161-1172.

Mattison, K., Grudeski, E., Auk, B., Charest, H., Drews, S. J., Fritzinger, A., Gregoricus, N., Hayward, S., Houde, A., Lee, B. E., Pang, X. L. L., Wong, J. L., Booth, T. F., & Vinje, J. (2009). Multicenter comparison of two norovirus ORF2-based genotyping protocols. Journal of Clinical Microbiology, 47(12), 3927-3932.

Nainan, O. V., Xia, G., Vaughan, G., & Margolis, H. S. (2006). Diagnosis of hepatitis A virus infection: a molecular approach. Clinical Microbiology Reviews, 19(1), 63-79.

Pierce, S. E., Bell, R. L., Hellberg, R. S., Cheng, C. M., Chen, K. S., Williams-Hill, D. M., Martin, W. B., & Allard, M. W. (2012). Detection and Identification of Salmonella enterica, Escherichia coli, and Shigella spp. via PCR-electrospray ionization mass spectrometry: isolate testing and analysis of food samples. Applied and Environmental Microbiology, 78(23), 8403-8411.

Sampath, R., Hofstadler, S. A., Blyn, L. B., Eshoo, M. W., Hall, T. A., Massire, C., Levene, H. M., Hannis, J. C., Harrell, P. M., Neuman, B., Buchmeier, M. J., Jiang, Y., Ranken, R., Drader, J. J., Samant, V., Griffey, R. H., McNeil, J. A., Crooke, S. T., & Ecker, D. J. (2005). Rapid identification of emerging pathogens: coronavirus. Emerging Infectious Diseases, 11(3), 373-379.

Sampath, R., Mulholland, N., Blyn, L. B., Massire, C., Whitehouse, C. A., Waybright, N., Harter, C., Bogan, J., Miranda, M. S., Smith, D., Baldwin, C., Wolcott, M., Norwood, D., Kreft, R., Frinder, M., Lovari, R., Yasuda, I., Matthews, H., Toleno, D., Housley, R., Duncan, D., Li, F., Warren, R., Eshoo, M. W., Hall, T. A., Hofstadler, S. A., & Ecker, D. J. (2012). Comprehensive biothreat cluster identification by PCR/electrospray-ionization mass spectrometry. Plos One, 7(6), e36528.

Sampath, R., Russell, K. L., Massire, C., Eshoo, M. W., Harpin, V., Blyn, L. B., Melton, R., Ivy, C., Pennella, T., Li, F., Levene, H., Hall, T. A., Libby, B., Fan, N., Walcott, D. J., Ranken, R., Pear, M., Schink, A., Gutierrez, J., Drader, J., Moore, D., Metzgar, D., Addington, L., Rothman, R., Gaydos, C. A., Yang, S., St. George, K., Fuschino, M. E., Dean, A. B., Stallknecht, D. E., Goekjian, G., Yingst, S., Monteville, M., Saad, M. D., Whitehouse, C. A., Baldwin, C., Rudnick, K. H., Hofstadler, S. A., Lemon, S. M., & Ecker, D. J. (2007a). Global surveillance of emerging influenza virus genotypes by mass spectrometry. Plos One, 2(5), e489.

Sampath, R. A., Hall, T. A., Massire, C., Li, F., Blyn, L. B., Eshoo, M. W., Hofstadler, S. A., & Ecker, D. J. (2007b). Rapid identification of emerging infectious agents using PCR and electrospray ionization mass spectrometry. Annals of the New York Academy of Sciences, 1102, 109-120.

Sanchez, G., Bosch, A., & Pinto, R. M. (2007). Hepatitis A virus detection in food: current and future prospects. Letters in Applied Microbiology, 45(1), 1-5.

Scallan, E., Hoekstra, R. M., Angulo, F. J., Tauxe, R. V., Widdowson, M. A., Roy, S. L., Jones, J. L., & Griffin, P. M. (2011). Foodborne illness acquired in the United States - major pathogens. Emerging Infectious Diseases, 17(1), 7-15.

Shen, J., Wang, F., Li, F., Housley, R., Carolan, H., Yasuda, I., Burrows, E., Binet, R., Sampath, R., Zhang, J., Allard, M. W., & Meng, J. (2013). Rapid identification and differentiation of non-O157 shiga toxin–producing Escherichia coli using polymerase chain reaction coupled to electrospray ionization mass spectrometry. Foodborne Pathogens and Disease, 10(8), 737-743.

Siebenga, J. J., Vennema, H., Renckens, B., de Bruin, E., van der Veer, B., Siezen, R. J., & Koopmans, M. (2007). Epochal evolution of GGII.4 norovirus capsid proteins from 1995 to 2006. Journal of Virology, 81(18), 9932-9941.

Siebenga, J. J., Vennema, H., Zheng, D. P., Vinje, J., Lee, B. E., Pang, X. L., Ho, E. C. M., Lim, W., Choudekar, A., Broor, S., Halperin, T., Rasool, N. B. G., Hewitt, J., Greening, G. E., Jin, M., Duan, Z. J., Lucero, Y., O'Ryan, M., Hoehne, M., Schreier, E., Ratcliff, R. M., White, P. A., Iritani, N., Reuter, G., & Koopmans, M. (2009). Norovirus illness is a global problem: emergence and spread of norovirus GII.4 variants, 2001-2007. Journal of Infectious Diseases, 200(5), 802-812.

Simner, P. J., Uhl, J. R., Hall, L., Weber, M. M., Walchak, R. C., Buckwalter, S., & Wengenack, N. L. (2013). Broad-range direct detection and identification of fungi by use of the PLEX-ID PCR-electrospray ionization mass spectrometry (ESI-MS) system. Journal of Clinical Microbiology, 51(6), 1699-1706.

Thompson, J. D., Higgins, D. G., & Gibson, T. J. (1994). CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Research, 22(22), 4673-4680.

Van Ert, M. N., Hofstadler, S. A., Jiang, Y., Busch, J. D., Wagner, D. M., Drader, J. J., Ecker, D. J., Hannis, J. C., Huynh, L. Y., Schupp, J. M., Simonson, T. S., & Keim, P. (2004). Mass spectrometry provides accurate characterization of two genetic marker types in Bacillus anthracis. BioTechniques, 37(4), 642-651.

Vega, E., Barclay, L., Gregoricus, N., Williams, K., Lee, D., & Vinje, J. (2011). Novel surveillance network for norovirus gastroenteritis outbreaks, United States. Emerging Infectious Diseases, 17(8), 1389-1395.

Vega, E., & Vinje, J. (2011). Novel GII.12 norovirus strain, United States, 2009-2010. Emerging Infectious Diseases, 17(8), 1516-1518.

Vinje, J., Hamidjaja, R. A., & Sobsey, M. D. (2004). Development and application of a capsid VP1 (region D) based reverse transcription PCR assay for genotyping of genogroup I and II noroviruses. Journal of Virological Methods, 116, 109-117.

Wexler, H. M. (2007). Bacteroides: the good, the bad, and the nitty-gritty. Clinical Microbiology Reviews, 20(4), 593-621.

Zheng, D. P., Ando, T., Fankhauser, R. L., Beard, R. S., Glass, R. I., & Monroe, S. S. (2006). Norovirus classification and proposed strain nomenclature. Virology, 346(2), 312-323.

Zheng, D. P., Widdowson, M. A., Glass, R. I., & Vinje, J. (2010). Molecular epidemiology of genogroup II-genotype 4 noroviruses in the United States between 1994 and 2006. Journal of Clinical Microbiology, 48(1), 168-177.

Figure Captions

Figure 1. Layout of the RT-PCR/ESI-MS assay plate developed in this study. This assay plate is able to process 12 samples per run and can simultaneously test for the presence of norovirus and hepatitis A virus. PP, PCR primer pair(s).

Figure 2a. Concordance among the genotypes identified by RT-PCR/ESI-MS for samples (n = 79) from the same outbreak.

Figure 2b. Concordance among strains identified by RT-PCR/ESI-MS for samples (n = 79) from the same outbreak.

Tables

Table 1. Primers designed for use in the RT-PCR/ESI-MS assay described in this study. PCR primer pair (PP) groupings indicate singleplex or multiplex arrangements. RdRp, RNA-dependent RNA polymerase; VP1, major capsid protein.

| Primer pair name | PCR primer pair (PP) grouping | Target organism | Target gene | Product length | Primer sequences, 5'-3' (F, forward; R, reverse) |
|---|---|---|---|---|---|
| 3047 | PP1 | Norovirus GII and GIV | RdRp | 57 bp | F: TGGGAGGGCGATCGCAATCT; R: TCATTCGACGCCATCTTCATTCAC |
| 5742 | PP2 | Norovirus GI and GII | RdRp | 101 bp | F: TAGGCCATGTTCCGCTGGAT; R: TGTCCTTCGACGCCATCATCATT |
| 5698 | PP3 | Norovirus GII, including all GII.4 | VP1 | 141 bp | F: TCAGAGGTCAACAATGAGGTTATGGC; R: TACTGTAAACTCTCCACCAGGGGC |
| 5699 | PP4 | Norovirus GII.4 | VP1 | 126 bp | F: TCAGGAATGGGTGCAGCACTT; R: TAGCCTGATTTATGAAGCTTGCACTC |
| 3035 | PP4 | Hepatitis A virus | Protease gene/RNA polymerase gene | 62 bp | F: TGAAAGTCAGAGAATGATGAAAGTGGA; R: TGCGTTTTGGAGACTACATTCATTGAACA |
| 5656 | PP5 | Norovirus GI | VP1 | 67 bp | F: TTGGAGTCTTTGTCTTTGTTTCTTGGGT; R: TCAGGCAGTTCCCACAGGCTT |
| 3043 | PP5 | Hepatitis A virus | RNA polymerase gene | 79 bp | F: TCCAGGGATGTGTGGTGGGGC; R: TCCAGCAACATGAATGCCCAAAATGGCATTCTG |
| 5701 | PP6 | Norovirus GII | VP1 | 120 bp | F: TTGGCTGGGAATGCGTTCAC; R: TACATCAATAATCACATGAGGGCACAT |
| 5746 | PP6 | Norovirus GII | VP1 | 73 bp | F: TGGTTACTTCAGGTTTGATTCTTGGGT; R: TCGCCCAGTTCCAGTTCCCA |
| 5702 | PP7 | Norovirus GI | VP1 | 101 bp | F: TACACCCGGTGATGTTTTGTTTGA; R: TCATATTGCCAACCCAGCCATTATACAT |
| 5744 | PP7 | Norovirus GII.4 | VP1 | 135 bp | F: TCAGGCTATGTCACAGTGGCTCA; R: TCGTCTACGCCCCGTTCCA |
| 5651 | PP8 | Norovirus GIV | RdRp | 120 bp | F: TCCTTCTATGGTGATGATGAGATTGTGTC; R: TGGGCCCTCTGTCTTGTCTGG |
| 4437 | PP8 | Internal Positive Control | N/A | 77 bp | F: TGACGAGTTCATGAGGGCAGGC; R: TCTGGCCTTTCAGCAAGTTTCCAAC |

Table 2. Norovirus samples (n = 36) used to build the RT-PCR/ESI-MS database. All samples were derived from human stool specimens associated with norovirus illness and were previously genotyped through CaliciNet based on sequencing (Vinje et al., 2004).

| Genotype | Strain | Samples (n) | Outbreak setting(s) | Collection year(s) | No. of unique RT-PCR/ESI-MS signatures |
|---|---|---|---|---|---|
| GI.3 | N/A | 2 | Institution | 2007 | 1 |
| GII.1 | N/A | 3 | Hotel, long-term care facility | 2011 | 2 |
| GII.4 | Minerva | 9 | Institution, school, long-term care facility | 2008-2011 | 6 |
| GII.4 | New Orleans | 19 | Institution, hospital, school, long-term care facility | 2010-2011 | 10 |
| GII.6 | N/A | 2 | Long-term care facility | 2010 | 1 |
| GII.12 | N/A | 1 | School | 2011 | 1 |

Table 3. Results of inclusivity, exclusivity and negative control testing with the RT-PCR/ESI-MS assay designed in this study.

| Test | Sample type | N | RT-PCR/ESI-MS ID (n) |
|---|---|---|---|
| Inclusivity | Norovirus GI | 16 | Norovirus GI (n = 8); Norovirus GI/GII (n = 2); Not detected (n = 6) |
| Inclusivity | Norovirus GI/GII | 1 | Norovirus GI/GII |
| Inclusivity | Norovirus GII | 126 | Norovirus GII (n = 125); Not detected (n = 1) |
| Inclusivity | Norovirus, genogroup unknown | 17 | Norovirus GI (n = 3); Norovirus GII (n = 8); Not detected (n = 6) |
| Inclusivity | Hepatitis A virus | 3 | Hepatitis A virus |
| Exclusivity | Murine norovirus-1 | 1 | No detections |
| Exclusivity | Feline calicivirus | 1 | No detections |
| Exclusivity | Poliovirus type 3 | 1 | No detections |
| Negative Control | Norovirus-negative stool sample | 10 | No detections |
| Negative Control | RNA extraction reagent blank | 11 | No detections |
| Negative Control | Non-template control | 23 | No detections |
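The detection counts in Table 3 can be tallied directly. The following is a minimal sketch, not part of the original study. It assumes that sensitivity is the share of norovirus inclusivity samples with any norovirus detection, and that specificity is the share of exclusivity and negative control samples with no detection. The variable and function names are illustrative only.

```python
# Counts transcribed from Table 3 as (samples tested, samples with a detection).
norovirus_samples = {
    "Norovirus GI": (16, 8 + 2),                   # 8 GI and 2 GI/GII calls, 6 not detected
    "Norovirus GI/GII": (1, 1),
    "Norovirus GII": (126, 125),                   # 1 not detected
    "Norovirus, genogroup unknown": (17, 3 + 8),   # 3 GI and 8 GII calls, 6 not detected
}

# Exclusivity and negative control samples; none produced a detection.
non_target_samples = {
    "Murine norovirus-1": (1, 0),
    "Feline calicivirus": (1, 0),
    "Poliovirus type 3": (1, 0),
    "Norovirus-negative stool sample": (10, 0),
    "RNA extraction reagent blank": (11, 0),
    "Non-template control": (23, 0),
}


def percent(part, whole):
    """Return part/whole expressed as a percentage."""
    return 100.0 * part / whole


# Assumed definition: sensitivity = detected norovirus samples / all norovirus samples.
tested = sum(n for n, _ in norovirus_samples.values())
detected = sum(d for _, d in norovirus_samples.values())
sensitivity = percent(detected, tested)

# Assumed definition: specificity = samples without detection / all non-target samples.
non_targets = sum(n for n, _ in non_target_samples.values())
false_positives = sum(d for _, d in non_target_samples.values())
specificity = percent(non_targets - false_positives, non_targets)

print(f"Sensitivity: {detected}/{tested} = {sensitivity:.1f}%")
print(f"Specificity: {non_targets - false_positives}/{non_targets} = {specificity:.1f}%")
```

Under these assumptions the tally gives 147 detections among 160 norovirus samples, which rounds to 92%, and no detections among 47 non-target samples.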


Table 4. Identification of norovirus samples (n = 41) by real-time RT-PCR, RT-PCR/ESI-MS and a sequencing-based method for norovirus genotyping (Vinje et al., 2004).

| N | Original identification with real-time RT-PCR | Sequencing-based genotype | RT-PCR/ESI-MS genotype | Genotyping concordance |
|---|---|---|---|---|
| 2 | GI | GI.2 | GI.2 | Concordant |
| 5 | GI | GI.3 | GI.3 | Concordant |
| 1 | GI | GI.3 | GI.4 | Discordant |
| 1 | GI | GI.3 | GI.3 and GII.12 | Semi-concordant |
| 1 | GI | GI.3 | GI.3 and GII.4 | Semi-concordant |
| 5 | GI | GI.3 | Not detected | N/A |
| 1 | GI | Unable to genotype | Not detected | N/A |
| 1 | GI/GII | GI.3 | GI.14, GI.P3/GI.11^a and GII.4 | Semi-concordant |
| 3 | GII | GII.1^b | GII.1 | Concordant |
| 10 | GII | GII.4 | GII.4 | Concordant |
| 1 | GII | GII.4 | GII.4, GII.12, and GII.17 | Semi-concordant |
| 1 | GII | GII.4 | GII.16 and GII.P12/GII.10 | Discordant |
| 4 | GII | GII.12^c | GII.12 | Concordant |
| 1 | GII | GII.12 | GII.4 and GII.12 | Semi-concordant |
| 1 | GII | Bacteroides spp. | GII.4 | Discordant |
| 1 | GII | Negative | GII.6 | N/A |
| 1 | GII | Negative | GII.4 | N/A |
| 1 | GII | Negative | GII.P19/GII.5 | N/A |

^a Recombinant nomenclature according to Kroneman et al. (2013).
^b One of these samples was originally found to have a mixed sequence, but repeat RNA extraction and sequencing resulted in a GII.1 identification, concordant with the RT-PCR/ESI-MS genotype.
^c One of these samples was originally identified as GII.1 by sequencing, but repeat RNA extraction and sequencing resulted in a GII.12 identification, concordant with the RT-PCR/ESI-MS genotype.
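The concordance categories in Table 4 can be summed per category as a quick consistency check against the stated sample total (n = 41). This is an illustrative sketch only. It assumes that each sample count in the N column pairs with the concordance label in the same row position, and the names used are hypothetical.

```python
from collections import Counter

# (number of samples, genotyping concordance) for each row of Table 4, in row order.
rows = [
    (2, "Concordant"), (5, "Concordant"), (1, "Discordant"),
    (1, "Semi-concordant"), (1, "Semi-concordant"), (5, "N/A"),
    (1, "N/A"), (1, "Semi-concordant"), (3, "Concordant"),
    (10, "Concordant"), (1, "Semi-concordant"), (1, "Discordant"),
    (4, "Concordant"), (1, "Semi-concordant"), (1, "Discordant"),
    (1, "N/A"), (1, "N/A"), (1, "N/A"),
]

# Sum sample counts per concordance category.
tally = Counter()
for n, category in rows:
    tally[category] += n

total = sum(tally.values())
# Rows marked N/A lack a genotype from one of the two methods, so they are not comparable.
comparable = total - tally["N/A"]

for category, count in tally.most_common():
    print(f"{category}: {count}")
print(f"Total samples: {total}; comparable samples: {comparable}")
```

With this pairing, the category totals add up to the 41 samples named in the table caption, of which 32 have a genotype from both methods.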


[Figure 2a and Figure 2b plots. Axes: Outbreak No. (1-24) versus No. of Samples (0-7). Legend: Concordant, Semi-concordant, Discordant.]

Highlights

• This paper describes a novel method for the identification of human noroviruses.
• RT-PCR was combined with mass spectrometry to develop a foodborne viral assay.
• This assay showed a sensitivity of 92% for detection of norovirus in 160 samples.
• Genogroup and genotype were correctly identified in the majority of viral samples.
• The assay showed 100% specificity and was also able to detect hepatitis A virus.