Accepted Manuscript Rapid Detection and Differentiation of Human Noroviruses using RT-PCR coupled to Electrospray Ionization Mass Spectrometry Rosalee S. Hellberg , Feng Li , Rangarajan Sampath , Irene J. Yasuda , Heather E. Carolan , Julia M. Wolfe , Michael K. Brown , Richard C. Alexander , Donna M. Williams-Hill , William B. Martin PII:
S0740-0020(14)00123-3
DOI:
10.1016/j.fm.2014.05.017
Reference:
YFMIC 2176
To appear in:
Food Microbiology
Received Date: 26 September 2013 Revised Date:
25 April 2014
Accepted Date: 25 May 2014
Please cite this article as: Hellberg, R.S., Li, F., Sampath, R., Yasuda, I.J., Carolan, H.E., Wolfe, J.M., Brown, M.K., Alexander, R.C., Williams-Hill, D.M., Martin, W.B., Rapid Detection and Differentiation of Human Noroviruses using RT-PCR coupled to Electrospray Ionization Mass Spectrometry, Food Microbiology (2014), doi: 10.1016/j.fm.2014.05.017. This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
ACCEPTED MANUSCRIPT
1
Rapid Detection and Differentiation of Human Noroviruses using RT-PCR coupled to
2
Electrospray Ionization Mass Spectrometry
3 Rosalee S. Hellberga*, Feng Lib, Rangarajan Sampathb, Irene J. Yasudab, Heather E. Carolanb,
5
Julia M. Wolfec, Michael K. Brownc, Richard C. Alexanderc, Donna M. Williams-Hilld,
6
William B. Martind
7 8
a
9
Nutrition, One University Drive, Orange, CA 92866
SC
RI PT
4
M AN U
Chapman University, Schmid College of Science and Technology, Food Science and
10
b
11
c
12
d
13
Laboratory Southwest, 19701 Fairchild, Irvine, CA 92612
Ibis Biosciences, Abbott, 2251 Faraday Ave., Suite 150, Carlsbad, CA 92008
Orange County Public Health Laboratory, 1729 West 17th Street, Santa Ana, CA 92706
TE D
U.S. Food and Drug Administration, Office of Regulatory Affairs, Pacific Regional
14 15
*Corresponding author: e-mail:
[email protected], Ph: 1-714-628-2811
EP
16
Disclaimer: The views in this publication represent those of the authors. The inclusion of
18
specific trade names or technologies does not imply endorsement by the U.S. Food and Drug
19
Administration nor is criticism implied of similar commercial technologies not mentioned
20
within.
21
Conflict of Interest
22
F.L., R.S., I.Y., and H.C. are employees of Ibis Biosciences, Abbott, the commercial
23
manufacturer of the technology described here.
AC C
17
1
ACCEPTED MANUSCRIPT
Abstract
25
The goal of this study was to develop an assay for the detection and differentiation of
26
noroviruses using RT-PCR followed by electrospray ionization mass spectrometry (ESI-MS).
27
Detection of hepatitis A virus was also considered. Thirteen primer pairs were designed for
28
use in this assay and a reference database was created using GenBank sequences and
29
reference norovirus samples. The assay was tested for inclusivity and exclusivity using 160
30
clinical norovirus samples, 3 samples of hepatitis A virus and 3 other closely related viral
31
strains. Results showed that the assay was able to detect norovirus with a sensitivity of 92%
32
and a specificity of 100%. Norovirus identification at the genogroup level was correct for
33
98% of samples detected by the assay and for 75% of a subset of samples (n = 32) compared
34
at the genotype level. Identification of norovirus genotypes is expected to improve as more
35
reference samples are added to the database. The assay was also capable of detecting and
36
genotyping hepatitis A virus in all 3 samples tested. Overall, the assay developed here allows
37
for detection and differentiation of noroviruses within one working day and may be used as a
38
tool in surveillance efforts or outbreak investigations.
39
EP
TE D
M AN U
SC
RI PT
24
Keywords
41
Norovirus; hepatitis A virus; RT-PCR; electrospray ionization mass spectrometry; genotyping
42 43 44
AC C
40
45 46
2
ACCEPTED MANUSCRIPT
47 48
1. Introduction Noroviruses are the leading cause of foodborne disease among all known pathogens, with an estimated 5.5 million foodborne illnesses annually in the United States (Scallan et al.,
50
2011). These viruses are genetically diverse members of the Caliciviridae family (genus
51
Norovirus) and consist of at least 32 genetic clusters organized into 5 genogroups (GI-GV)
52
(Zheng et al., 2006; Zheng et al., 2010). Genogroups I, II, and IV have been associated with
53
human infection, with the majority of outbreaks due to GII strains, particularly GII.4 variants.
54
The ability to identify noroviruses at the genotype and strain levels is important for tracing the
55
source and spread of outbreaks as well as for routine surveillance. Norovirus surveillance
56
programs have revealed that some norovirus strains are only found within geographically
57
limited regions, while others experience widespread distribution over specific time periods
58
(Siebenga et al., 2009; Vega et al., 2011). Evidence has also been found for the presence of
59
multiple strains associated with a single infection or outbreak, with reports of 3-12% of
60
outbreaks being associated with strains from both GI and GII (Blanton et al., 2006; Green et
61
al., 2001; Hall et al., 2012; Matthews et al., 2012). Food and waterborne outbreaks have been
62
found to be more likely associated with strains from multiple genogroups than person-to-
63
person outbreaks (Matthews et al., 2012).
SC
M AN U
TE D
EP
Because noroviruses have not been successfully cultivated in vitro, nucleic acid-based
AC C
64
RI PT
49
65
methods are widely used for their detection and differentiation (Hall et al., 2011).
66
Noroviruses have a small, single-stranded RNA genome of ~7.5 kb with three open reading
67
frames: ORF1, ORF2, and ORF3 (Green et al., 2001). ORF1 contains several genes coding
68
for nonstructural proteins, including RNA-dependent RNA polymerase (RdRp); ORF2
69
contains the gene coding for the major capsid protein (VP1); and ORF3 contains the gene
3
ACCEPTED MANUSCRIPT
coding for the minor capsid protein (VP2). Reverse-transcriptase polymerase chain reaction
71
(RT-PCR) analysis of regions of the VP1 or RdRp genes is generally used to presumptively
72
detect noroviruses at the organismal and genogroup level, while DNA sequencing can be used
73
for confirmation and for differentiation of genotypes and strains (Zheng et al., 2006).
74
Although standardized methods for differentiation of noroviruses have been proposed based
75
on the complete sequence of the VP1 gene (Zheng et al., 2006), many studies sequence
76
smaller regions of either the RdRp or VP1 genes and use a variety of primer sets, sequence
77
analysis methods, and nomenclature, leading to inconsistencies in data reporting and
78
strain/genotype identification (Kroneman et al., 2013; Kroneman et al., 2011; Mattison et al.,
79
2009). Furthermore, frequent recombination events at the ORF1-ORF2 region have resulted
80
in different genotype assignments depending on which gene region is targeted for sequencing
81
(Bull et al., 2007) and rapid evolution of GII.4 noroviruses has led to the emergence of highly
82
virulent strains that have not previously been characterized (Siebenga et al., 2007; Siebenga et
83
al., 2009; Zheng et al., 2010). Infections consisting of multiple strains are also difficult to
84
analyze using traditional sequencing because this method is not able to discern multiple
85
sequences in a mixed sample.
EP
TE D
M AN U
SC
RI PT
70
The use of RT-PCR followed by electrospray ionization mass spectrometry (ESI-MS)
87
provides a potential means for rapid detection and differentiation of norovirus genotypes and
88
strains (Ecker et al., 2008; Sampath et al., 2007b). With this method, fragments of the
89
genome are first amplified by RT-PCR and then analyzed by ESI-MS to determine the precise
90
mass and corresponding base composition (numbers of A, G, C, and T) of each PCR product.
91
These base compositions are then compared to a reference database to allow for the
92
identification and differentiation of organisms. While this method does not provide the
AC C
86
4
ACCEPTED MANUSCRIPT
ordered sequence of the nucleotides, knowledge of the base composition alone has proven to
94
be sufficient in many cases to allow for organism identification, as PCR/ESI-MS assays
95
utilize primers that bind to conserved regions flanking highly variable sequences among the
96
organism(s) of interest (Ecker et al. 2008; Sampath et al. 2007b). In addition to the potential
97
to detect noroviruses and characterize them at the genotype and strain levels, PCR/ESI-MS
98
allows for the identification of organisms in mixed samples, and analysis can be completed
99
within one working day. Furthermore, this method allows for multiplexing of primers so that
SC
RI PT
93
several regions of the genome can be targeted simultaneously, resulting in greater potential
101
for the identification of new strains and recombinants. PCR/ESI-MS assays have been
102
published for the detection and characterization of a number of organisms, including
103
respiratory pathogens (Ecker et al., 2005; Sampath et al., 2005; Sampath et al., 2007a;
104
Sampath et al., 2007b), vector-borne pathogens (Crowder et al., 2012; Crowder et al., 2010;
105
Eshoo et al., 2010; Eshoo et al., 2007; Grant-Klein et al., 2010), biothreat agents (Jacob et al.,
106
2012; Sampath et al., 2012; Van Ert et al., 2004) and common bacterial food pathogens (e.g.,
107
Salmonella, E. coli, and Campylobacter) (Hannis et al., 2008; Pierce et al., 2012; Shen et al.,
108
2013), but not for noroviruses. PCR/ESI-MS could potentially be used alongside current
109
sequence-based techniques to rapidly identify norovirus genotypes and variants in
110
surveillance and outbreak situations.
TE D
EP
AC C
111
M AN U
100
The goal of this study was to develop a novel RT-PCR/ESI-MS assay for the rapid
112
detection and genetic differentiation of noroviruses. Because of the ability of the assay to test
113
for multiple pathogens simultaneously, detection of hepatitis A virus was also considered for
114
incorporation into this assay. Like norovirus, hepatitis A virus is a single-stranded RNA virus
115
spread through the fecal-oral route (Nainan et al., 2006). There are three known genogroups
5
ACCEPTED MANUSCRIPT
of hepatitis A virus (I-III) that infect humans, with genotype IA being most commonly
117
associated with disease (Sanchez et al. 2007). Although hepatitis A virus has a relatively low
118
incidence among foodborne viruses in the U.S., it has the highest rates of hospitalization
119
(31.5%) and death (2.4%) (Scallan et al. 2011). As current detection techniques for hepatitis
120
A virus generally rely on RT-PCR assays specific for the virus (Sanchez et al. 2007), an assay
121
that allows for simultaneous identification of both noroviruses and hepatitis A viruses would
122
likely prove advantageous when performing screening for these viruses.
123
2. Materials and Methods
124
2.1 Assay Design
SC
M AN U
125
RI PT
116
Over 5000 norovirus sequences and 150 hepatitis A virus sequences were downloaded from GenBank for use in assay design. In cases where genotype information was not
127
available for a particular norovirus sequence, the Norovirus Automated Genotyping Tool was
128
used (Kroneman et al., 2011). Sequences were aligned with Clustal W (Thompson et al.,
129
1994) in BioEdit Sequence Alignment Editor, v. 7.1.3.0 (Hall, 1999) and examined for
130
regions of high variability flanked by regions of conserved primer-binding sites. Based on
131
these alignments, a set of primers was designed for this study to collectively amplify all
132
human noroviruses (Table 1). Norovirus primers were designed for maximal differentiation
133
amongst the various human norovirus genotypes, as well as differentiation among GII.4
134
strains. Primers targeting human hepatitis A viruses (Table 1) were also used in this study for
135
the purpose of universal amplification and detection at the virus level. The hepatitis A virus
136
primers had been designed previously by Ibis Biosciences, but had not been published. A set
137
of primer pairs was also designed for amplification of the internal control. Primers were
138
analyzed in silico for parameters such as primer-dimer formation, %GC, and annealing
AC C
EP
TE D
126
6
ACCEPTED MANUSCRIPT
temperature. The primers were separated into 8 wells, with 5 of the wells containing multiple
140
primer pairs, and then arranged in a 96-well plate format, allowing for analysis of 12 samples
141
per plate (Fig. 1).
142
2.2 Specimen collection and preparation
RI PT
139
A total of 206 stool samples from the Orange County Public Health Laboratory (Santa
144
Ana, CA) were obtained for use in this project. Use of biological specimens was approved by
145
the U.S. Food and Drug Administration (FDA) Research Involving Human Subjects
146
Committee (RIHSC #12-009A). The specimens were derived from various outbreaks and
147
single cases of norovirus illness reported to the public health laboratory during 2006-2011.
148
This collection consisted of 196 stool samples that previously tested positive for norovirus
149
and 10 stool samples that previously tested negative for norovirus. Norovirus detection and
150
genogroup information were based on the results of real-time RT-PCR testing at the time of
151
illness using a previously described method (Kageyama et al., 2003). A subset of these
152
samples (n = 36) had previously undergone sequencing-based genotyping (Vinje et al., 2004)
153
as part of CaliciNet, a national outbreak surveillance network developed by the Centers for
154
Disease Control and Prevention (CDC) (Vega et al., 2011). The genotyped samples were
155
used to build the RT-PCR/ESI-MS database (Table 2). The remaining norovirus samples
156
were used to test the RT-PCR/ESI-MS assay for inclusivity. All stool samples underwent
157
clarification prior to RNA extraction as follows: approximately 0.1 g of stool was added to a
158
glass vial containing 1 ml of Vertrel XF (DuPont, Wilmington, DE) and 1 ml of sterile water.
159
Samples were then mixed by vortex for 1 min and centrifuged at 1700 x g for 10 min. The
160
aqueous phase containing the virus was then removed by pipetting and stored at -70ºC until
161
RNA extraction. RNA was extracted from clarified samples using the QIAamp Viral RNA
AC C
EP
TE D
M AN U
SC
143
7
ACCEPTED MANUSCRIPT
162
Mini Kit (Qiagen, Valencia, CA), Spin Column Protocol, according to the manufacturer’s
163
instructions. Reagent blanks were included in the RNA extractions as negative controls.
164
Hepatitis A virus, feline calicivirus, murine norovirus, and poliovirus from the American Type Culture Collection (ATCC) were also used for inclusivity and exclusivity
166
testing of the RT-PCR/ESI-MS assay. Three samples of hepatitis A virus were tested from
167
strains HAS-15 (ATCC #VR-2281), HM-175/18f (ATCC #VR-1402) and PA21 (ATCC
168
#VR-1357), representing human genotypes IA, IB and IIIA, respectively. One sample of each
169
of the following was used for exclusivity testing: feline calicivirus strain F9 (ATCC #VR-
170
782), murine norovirus-1 (ATCC #PTA-5935), and poliovirus type 3, strain LSC (ATCC
171
#VR-1001AS/HO).
172
2.3 RT-PCR/ESI-MS
M AN U
SC
RI PT
165
All samples underwent RT-PCR/ESI-MS using the assay designed in this study. One-step
174
RT-PCR was performed with each reaction well having the following components: 5 µl RNA
175
or non-template control, 2 U Superscript III Reverse Transcriptase (Life Technologies,
176
Carlsbad, CA), 2 ng/µl sonicated poly A RNA (Sigma-Aldrich, St. Louis, MO), 1 ng/µl
177
random hexamers (Life Technologies), 10 mM dithiothreitol (DTT, Life Technologies), 0.01
178
U/µl SUPERase In RNase Inhibitor (Life Technologies), 5 U AmpliTaq Gold DNA
179
Polymerase (Roche Molecular Systems, Pleasanton, CA), 200 µM each dATP, dCTP, and
180
dTTP (Bioline, Taunton, MA), 200 µM 13C-enriched dGTP (Cambridge Isotope Laboratories,
181
Andover, MA), 1.4 mM MgCl2 (Life Technologies), 250 nM each primer (Table 1; Integrated
182
DNA Technologies, Coralville, IA), 20 mM Tris (pH 8.3), 75 mM KCl, 0.4 M betaine, and 20
183
mM sorbitol (Sigma-Aldrich) in a total volume of 40 µl. RT-PCR was carried out with a
184
Mastercycler proS thermocycler (Eppendorf, Hauppauge, NY) using the following cycling
AC C
EP
TE D
173
8
ACCEPTED MANUSCRIPT
conditions: 60ºC for 5 min; 4ºC for 10 min; 55ºC for 45 min; 95ºC for 10 min; 8 cycles of
186
95ºC for 30 s, 48ºC for 30 s (increase 0.9°C for each cycle), and 72ºC for 30 s; 37 cycles of
187
95ºC for 15 s, 56ºC for 20 s, and 72ºC for 20 s; and a final extension of 72ºC for 2 min
188
followed by a 4ºC hold. Each reaction also contained an internal positive control (“calibrant”)
189
made from synthetic DNA (Blue Heron Biotechnology, Bothell, WA). The calibrant was
190
present in each reaction at a level of 200 copies and allowed for a semi-quantitative estimate
191
of the starting number of genome copies in each PCR well.
SC
192
RI PT
185
Following RT-PCR, the assay plate was loaded onto the PCR/ESI-MS platform (Abbott Molecular, Des Plaines, IL) for amplicon desalting and ESI-MS analysis, as described
194
previously (Ecker et al., 2008; Hofstadler et al., 2005; Jiang and Hofstadler, 2003). The
195
analysis software determined the base compositions of each amplicon based on analysis of
196
both the forward and reverse strands. The resulting base compositions were queried against
197
an ESI-MS database populated with the expected base compositions for the norovirus and
198
hepatitis A virus sequences downloaded from GenBank (described in section 2.1) and the
199
base compositions acquired in this study for the 36 previously genotyped norovirus samples
200
(Table 2). The top database matches, approximate levels of genome copies per well, and
201
associated Q-scores were recorded for each sample. The Q-score, a rating between 0 (low)
202
and 1 (high), represents a relative measure of the strength of the data supporting identification
203
(Sampath et al., 2012; Simner et al., 2013). For this assay, a Q-score ≥ 0.85 was considered to
204
be a reportable result. To reduce the possibility of false positives, a threshold of ≥ 25 genome
205
copies per well was also required for a reportable result. In cases where the ESI-MS result
206
did not match previous identifications, the genotypes of sequences derived from GenBank
AC C
EP
TE D
M AN U
193
9
ACCEPTED MANUSCRIPT
207
were checked with the Norovirus Automated Genotyping Tool to ensure proper nomenclature
208
was used (Kroneman et al. 2011).
209
2.4 Comparison to recognized method The ability of the RT-PCR/ESI-MS assay plate designed in this study to detect and
RI PT
210
differentiate norovirus samples was compared to a DNA sequencing method for genotyping
212
(Vinje et al., 2004). A subset of 41 of the norovirus samples listed above was chosen for
213
method comparison. Samples were selected to include both GI and GII noroviruses detected
214
over the course of several years (2007-2011). These samples had previously been identified
215
as norovirus GI or GII by real-time RT-PCR, but had not been genotyped. All samples
216
underwent both sequencing-based genotyping and RT-PCR/ESI-MS genotyping using the
217
newly developed plate and the identifications from each method were compared. The DNA
218
sequences obtained through sequencing-based genotyping were submitted to GenBank
219
(Accession No. KF569765-KF569799).
220
2.5 Statistical analysis
M AN U
TE D
221
SC
211
Concordance among RT-PCR/ESI-MS-determined genotypes and strains for norovirus samples involved in outbreaks was assessed statistically by assigning a numerical value to
223
each outbreak. Concordant outbreaks were assigned a score of 2, semi-concordant outbreaks
224
were assigned a score of 1, and discordant outbreaks were assigned a score of 0. An outbreak
225
was considered to be concordant when all samples from that outbreak showed the exact same
226
top match or matches to the ESI-MS database, resulting in the exact same identification. An
227
outbreak was considered to be semi-concordant when all samples from that outbreak shared a
228
top match to the ESI-MS database, but one or more samples also showed a top match to
229
another entry in the database. Discordant outbreaks were those in which none of the top
AC C
EP
222
10
ACCEPTED MANUSCRIPT
matches to the ESI-MS database were shared by all samples from a given outbreak. After
231
each outbreak was assigned a concordance value, the samples were randomly sorted into new
232
outbreak groups and the same scoring system was carried out. Analyses were performed
233
separately based on concordance among top genotype matches and concordance among top
234
strain matches. For the purposes of this paper, strains were defined as distinct signatures in
235
the ESI-MS database. The scores of the randomized samples were compared to the original
236
samples using a Pearson’s chi-square test in IBM SPSS Statistics 21 (Armonk, NY), with a
237
pre-determined significance value of p < 0.05, two-tailed.
238
3. Results and Discussion
239
3.1 Assay and database design
SC
M AN U
240
RI PT
230
A total of 13 primer pairs were designed to amplify ~50-150 bp fragments of norovirus RNA, hepatitis A virus RNA, and an internal control in a 96-well plate format
242
(Table 1; Fig. 1). The ESI-MS database was successfully populated with the expected base
243
compositions of over 5000 sequences downloaded from GenBank. In many instances, these
244
sequences did not include information for all gene fragments targeted in this assay, in which
245
case the expected signature could only be generated for a portion of the fragments. In a few
246
cases, incomplete sequences that shared base counts with many other database entries and
247
were only amplified by one set of primers were removed from the database to improve data
248
interpretation. RT-PCR/ESI-MS base composition signatures were successfully obtained for
249
the 36 previously genotyped norovirus samples analyzed in this study (Table 2). These
250
samples had a total of 21 unique RT-PCR/ESI-MS signatures that were added to the database
251
prior to inclusivity and exclusivity testing.
AC C
EP
TE D
241
11
ACCEPTED MANUSCRIPT
252
Due to limitations in sample collection, the reference samples used in the database only included information for 5 genotypes: GI.3, GII.1, GII.4, GII.6, and GII.12, with 78% of
254
samples identified as GII.4 Minerva or GII.4 New Orleans. However, these samples represent
255
some of the most predominant genotypes associated with outbreaks. For example, GII.12
256
accounted for 20% of noroviruses submitted to CDC from October 2009 to June 2010 (Vega
257
and Vinje, 2011) and GII.4 strains were found to be responsible for 44% of U.S. outbreaks
258
over the course of 1994-2006 (Zheng et al., 2010) and 62% of global outbreaks over 2001-
259
2007 (Siebenga et al., 2009). The GII.4 strains Minerva (also known as GII.4 2006b) and
260
New Orleans have been predominant outbreak strains in the U.S., with Minerva surfacing in
261
late 2005/early 2006 and New Orleans surfacing in 2009 (Vega et al., 2011). Interestingly,
262
the RT-PCR/ESI-MS signatures determined for these two GII.4 strains showed a number of
263
subtypes due to differences in base compositions of the amplified regions; the GII.4 Minerva
264
samples revealed 6 unique signatures and the GII.4 New Orleans samples revealed 10 unique
265
signatures (Table 2). These differences are likely due to the fact that the RT-PCR/ESI-MS
266
assay targets multiple regions of the norovirus genome and therefore is able to determine
267
sequence variation at multiple locations.
268
3.2 Inclusivity and exclusivity testing
SC
M AN U
TE D
EP
Norovirus samples used for inclusivity testing included representatives of GI (n = 16),
AC C
269
RI PT
253
270
GI/GII (n = 1), and GII (n = 126), as well as 17 samples with no genogroup information
271
(Table 3). Among these samples, 92% were identified as norovirus and 8% could not be
272
detected by RT-PCR/ESI-MS. Previous PCR/ESI-MS studies have found similar to slightly
273
higher levels of sensitivity, ranging from 92 to 100% (Blyn et al., 2008; Eshoo et al., 2010;
274
Sampath et al., 2012; Sampath et al., 2007a). The estimated number of genomic copies per
12
ACCEPTED MANUSCRIPT
norovirus-positive well ranged from 26 to >2000 copies. It should be noted that these levels
276
are only approximate and are based on relative amplification in comparison to the calibrant.
277
Furthermore, above 2000 genome copies per well, the levels of target organism far exceed
278
those of the calibrant, resulting in an inaccurate estimate of the target concentration (Jacob et
279
al., 2012; Sampath et al., 2012). In these cases, it can only be determined that the amplicon is
280
present in the well at high levels (i.e., >2000 copies). Q-scores ranged from 0.85 to 0.99, with
281
an average of 0.96, indicating high-quality matches to the database entries. Among the
282
samples detected by RT-PCR/ESI-MS for which genogroup information was originally
283
available, the RT-PCR/ESI-MS genogroup designation showed high concordance (98%) with
284
the original real-time RT-PCR-based identification (Table 3). All 125 samples of GII
285
detected by RT-PCR/ESI-MS were correctly identified as belonging to this genogroup and the
286
1 sample previously identified as GI/GII also matched both GI and GII signatures in the RT-
287
PCR/ESI-MS database, indicating a case of illness involving multiple norovirus genogroups.
288
As mentioned previously, GI and GII noroviruses have been reported to occur together in a
289
small percentage (3-12%) of outbreaks (Hall et al., 2012; Matthews et al., 2012). Among the
290
10 GI samples detected by RT-PCR/ESI-MS, 8 were identified as GI and 2 showed top
291
matches to both GI and GII signatures within the RT-PCR/ESI-MS database. Considering
292
that the assay was designed with a focus on GII noroviruses, particularly GII.4 strains, it is not
293
surprising that a greater identification success rate was observed with this genogroup as
294
compared to the GI noroviruses. The 13 norovirus samples that could not be detected by RT-
295
PCR/ESI-MS were relatively old samples within the dataset, having originally been collected
296
during 2007-2008, and included 6 samples with no genogroup information, 6 originally
297
identified as GI and 1 originally identified as GII. These samples likely failed due to poor
AC C
EP
TE D
M AN U
SC
RI PT
275
13
ACCEPTED MANUSCRIPT
298
quality, as the nucleic acid may have degraded over time. Alternatively, it is possible that
299
polymorphisms in the primer-binding areas prevented amplification of the target regions.
300
The 147 samples that tested positive for norovirus with RT-PCR/ESI-MS (Table 3) were also assigned a genotype based on the top match(es) to entries in the ESI-MS database.
302
Within GI, the following genotypes were detected: GI.2, GI.3, GI.4, GI.P3/GI.11, and GI.14.
303
Within GII, the genotypes that were detected included: GII.1, GII.4, GII.5, GII.6, GII.12,
304
GII.13, GII.16, GII.17, and several recombinants: GII.Pa-GII.3, GII.P12-GII.10, GII.P16-
305
GII.2, and GII.P19-GII.5. Nomenclature for recombinant noroviruses is as described in
306
Kroneman et al. (2013), where the RdRp-based genotype is listed first (with a capital P for
307
‘polymerase’), followed by the VP1-based genotype. The majority of samples (n = 119)
308
matched only one genotype in the database, with GII.4 having the greatest number of matches
309
(n = 90). The remaining samples showed top matches to 2-3 genotypes and/or recombinant
310
genotypes in the database.
SC
M AN U
TE D
311
RI PT
301
All three of the hepatitis A virus samples were correctly identified by RT-PCR/ESIMS (Table 3). Hepatitis A virus has been classified into three human (I-III) and three simian
313
(IV-VI) genotypes, with genotypes I and III most commonly isolated from humans (Nainan et
314
al., 2006). The human genotypes have been further divided into subtypes A and B, based on
315
genetic relatedness. Although this assay was only designed with the goal of detecting
316
hepatitis A virus, each of the three samples tested (IA, IB, and IIIA) was also correctly
317
identified at the subgenotype level by RT-PCR/ESI-MS. While these results are promising,
318
additional testing will be necessary to determine whether or not the assay is a reliable
319
indicator of hepatitis A virus genotypes and/or subgenotypes.
AC C
EP
312
14
ACCEPTED MANUSCRIPT
320
As shown in Table 3, no cross reactivity was observed during exclusivity testing with murine norovirus, feline calicivirus, and poliovirus, with all samples testing negative for
322
norovirus and hepatitis A virus. Furthermore, the norovirus primers did not show any cross-
323
reactivity when tested against hepatitis A virus and the hepatitis A virus primers did not cross-
324
react with norovirus samples. The 10 stool samples previously identified as being negative
325
for norovirus with real-time RT-PCR all tested negative for norovirus and hepatitis A virus, as
326
well as the 11 reagent blanks from RNA extraction and 23 non-template controls (Table 3).
327
These results demonstrate the high specificity (100%) of the assay developed here, with no
328
false positives observed. Previous PCR/ESI-MS studies have also reported high levels of
329
specificity, with values of 94-100% (Blyn et al., 2008; Crowder et al., 2012; Eshoo et al.,
330
2010; Sampath et al., 2012; Sampath et al., 2007a).
331
3.3 Comparison to recognized method
SC
M AN U
A subset (n = 41) of the norovirus stool samples tested above with RT-PCR/ESI-MS
TE D
332
RI PT
321
were also genotyped using a currently recognized sequencing-based method (Vinje et al.,
334
2004) for comparison (Table 4). Out of the 41 samples tested, initially 31 were identified by
335
both methods and 22 of these showed concordant identifications, meaning that the sample
336
matched the exact same genotype when tested with RT-PCR/ESI-MS as it did when tested
337
with sequenced-based genotyping. When the results of repeat RNA extraction and
338
sequencing analyses were included for two samples identified as possible mix-ups, a total of
339
32 samples were detected by both methods and 24 showed concordant identifications.
340
Subsequent discussion of results incorporates these two updated identifications. Five samples
341
showed semi-concordant identifications, meaning that the genotype identified by sequencing
342
was also among the top matches to the ESI-MS database, but additional genotypes were also
AC C
EP
333
15
ACCEPTED MANUSCRIPT
among the top database matches. Three of the samples were discordant, meaning that the
344
genotype identified by sequencing was not among the top matches identified by ESI-MS.
345
Among the samples that could not be identified by one or both methods, 5 could not be
346
identified by PCR/ESI-MS, 3 could not be identified by sequencing and 1 could not be
347
identified by either method.
348
RI PT
343
Within the GI noroviruses (n = 16), 10 samples were genotyped by both methods: 7 samples showed complete concordance, 2 were semi-concordant, and 1 was discordant (Table
350
4). The concordant samples included 2 GI.2 noroviruses and 5 GI.3 noroviruses detected with
351
Q-scores of 0.93-0.99 and at levels of 48-1586 genomes/well. One of the semi-concordant
352
samples was identified as GI.3 by sequencing, but had multiple database matches (GI.3 and
353
GII.12) when tested with RT-PCR/ESI-MS. The primary match was to GI.3, with a Q-score
354
of 0.98 and a level of 1564 genomes/well compared to the secondary match to GII.12 with a
355
Q-score of 0.91 and a level of 172 genomes/well. The other semi-concordant sample was also
356
identified as GI.3 by sequencing, but had database matches to both GI.3 and GII.4 when
357
tested with RT-PCR/ESI-MS. As with the other sample, the GI.3 identification showed a
358
higher Q-score and level compared to the GII.4 identification. In both cases with semi-
359
concordant results, the secondary match was to a database entry originating from GenBank
360
with limited sequence information, whereas the top match was to a database entry obtained
361
from one of the GI.3 reference norovirus samples used to initially build the database (Table 2).
362
The one discordant sample was identified as GI.3 by sequencing, but RT-PCR/ESI-MS
363
identified it as GI.4 (Q-score = 0.88; level = 488 genomes/well). The low Q-score indicates a
364
poor quality match that may be improved with the incorporation of GI.4 reference samples
365
into the RT-PCR/ESI-MS database. RT-PCR/ESI-MS was unable to detect five of the
AC C
EP
TE D
M AN U
SC
349
16
ACCEPTED MANUSCRIPT
samples identified by sequencing as GI.3 and both methods were unable to detect one sample
367
originally identified by real-time RT-PCR as a GI norovirus. The inability of RT-PCR/ESI-
368
MS to amplify some of the GI.3 samples may reflect a lack of primer specificity for this
369
genotype. Additionally, all six samples that failed to amplify were collected during the years
370
2007-2008 and the viral nucleic acid may have degraded over time.
371
RI PT
366
As expected, GII noroviruses showed greater success with the RT-PCR/ESI-MS assay, with detections obtained for all 24 GII samples (Table 4). Three of these samples could not
373
be identified with the sequencing-based method even though they previously tested positive
374
for norovirus GII with real-time RT-PCR. In all 3 cases, RT-PCR/ESI-MS was able to
375
identify these noroviruses at the genotype level (Q-scores 0.96-0.99, 564-1662 copies/well).
376
Among the remaining 21 samples, 17 showed complete concordance between the RT-
377
PCR/ESI-MS genotype and the sequencing-based genotype, with the majority of samples (n =
378
10) identified as GII.4. Samples of GII.1 and GII.12 also showed concordant genotype
379
identifications between the two methods. As shown in Table 4, 2 of the GII samples showed
380
semi-concordant genotype identifications and 2 showed discordant genotype identifications.
381
In both instances of semi-concordant genotype identifications, the ESI-MS identification with
382
the highest Q-score was concordant with the sequencing result, while secondary ESI-MS
383
identifications with lower Q-scores did not agree with the sequencing result. This trend was
384
also observed with the GI noroviruses (discussed above), indicating the importance of
385
considering Q-score in the interpretation of results.
M AN U
TE D
EP
AC C
386
SC
372
Among the discordant results for GII noroviruses, one sample was identified by
387
sequencing as GII.4 and by RT-PCR/ESI-MS as GII.16 and the recombinant GII.P12/GII.10
388
(Q-score = 0.89, >2000 copies/well). The low Q-score for this sample indicates a relatively
17
ACCEPTED MANUSCRIPT
poor quality match to entries in the ESI-MS database, a result which may be improved upon
390
as more reference samples are added to the database. Indeed, a previous study using
391
PCR/ESI-MS for the detection of fungi reported that an upgrade to the fungal database
392
reduced the number of misidentifications observed with the assay (Simner et al., 2013). The
393
third discordant sample was identified as GII.4 by RT-PCR/ESI-MS (Q-score = 0.99, >2000
394
copies/well) and as Bacteroides spp. by sequencing, even though it was previously identified
395
as a GII norovirus by real-time RT-PCR. The most likely explanation for this discordance is
396
that the original stool sample contained both norovirus GII and Bacteroides spp., which are
397
significant inhabitants of the gastrointestinal tract and are found in human feces (Wexler,
398
2007). This particular strain of Bacteroides appears to have been preferentially amplified by
399
the norovirus sequencing primers but not by the RT-PCR/ESI-MS assay.
SC
M AN U
400
RI PT
389
Much of the discordance between the sequencing-based method and the RT-PCR/ESIMS method is likely due to the use of incomplete sequences derived from GenBank to
402
populate the ESI-MS database. In many cases, these sequences did not provide full coverage
403
of the regions targeted by the assay and therefore only provided partial matches to the ESI-
404
MS signatures. As more reference samples with complete information for the target regions
405
are added to the database, the occurrence of semi-concordant or discordant results is likely to
406
be reduced. An additional source of discordance may be due to differences in genome
407
coverage between the two methods. While the sequencing-based method assigns genotype
408
based on only a portion of the norovirus genome (< 260 bp) within the VP1 gene (Vinje et al.,
409
2004), RT-PCR/ESI-MS targets multiple regions of the genome within short stretches (50-150
410
bp) of both the RdRp and VP1 genes. Recombination of norovirus genomes at the ORF1-
411
ORF2 junction is common and can result in a different genotype for the RdRp region as
AC C
EP
TE D
401
18
ACCEPTED MANUSCRIPT
compared to the VP1 region (Bull et al., 2007). Therefore, it is possible that a sample
413
identified as one genotype by sequencing may show multiple matches by RT-PCR/ESI-MS
414
due to recombination. While this results in the potential for RT-PCR/ESI-MS to show
415
conflicting results when compared to sequencing, it also allows for greater potential for
416
discovery of variants and recombinants that may not have been detected with sequencing
417
alone. Furthermore, RT-PCR/ESI-MS is able to detect multiple strains within one sample,
418
which may contribute to some discordance when comparing identifications with sequencing-
419
based results, especially if one strain is present in lower concentrations.
SC
RI PT
412
Taken together, the results presented above show that the RT-PCR/ESI-MS assay has
421
a number of advantages and disadvantages when compared to the sequencing-based method.
422
One advantage, which has been illustrated in previous RT-PCR/ESI-MS studies (Crowder et
423
al., 2010; Sampath et al., 2005), is the ability to identify mixtures of multiple genotypes in one
424
sample. Additionally, by covering multiple regions of the norovirus genome, the RT-
425
PCR/ESI-MS assay has potential for a higher level of differentiation among strains than that
426
achieved with sequencing. Considering the high level of diversity and rapid evolution of
427
norovirus strains, especially GII.4 strains (Siebenga et al., 2007; Siebenga et al., 2009; Zheng
428
et al., 2010), coverage of multiple regions also improves the possibility of detecting norovirus
429
using a single assay. Similarly, previous studies found RT-PCR/ESI-MS to be advantageous
430
for the simultaneous detection and differentiation of arboviruses, which are also genetically
431
diverse RNA viruses that often require multiple assays for their detection (Eshoo et al., 2007;
432
Grant-Klein et al., 2010). A further advantage of ESI-MS is its ease of use, as it involves
433
fewer steps than sequencing-based methods and relies primarily on automation. However, a
434
disadvantage of the RT-PCR/ESI-MS assay is that the results are not always concordant with
AC C
EP
TE D
M AN U
420
19
ACCEPTED MANUSCRIPT
the sequencing-based assay, especially in the case of GI or recombinant noroviruses. This is
436
likely to improve as the database is populated with more reference samples, but it may never
437
reach full concordance due to the multiple regions targeted by the RT-PCR/ESI-MS assay and
438
the ability of this assay to detect multiple strains in one sample. With the above in mind, it is
439
recommended that the RT-PCR/ESI-MS assay be used to assist with the rapid detection and
440
differentiation of noroviruses, but that traditional sequencing should also be performed for
441
confirmation purposes. As more reference sample sequences are added to the database, it is
442
expected that the RT-PCR/ESI-MS assay will become an increasingly powerful tool for
443
norovirus testing.
444
3.4 Outbreak concordance
M AN U
SC
RI PT
435
As a further step in evaluating the usefulness of the RT-PCR/ESI-MS assay for
446
norovirus typing in outbreak and surveillance situations, the top ESI-MS database matches for
447
norovirus samples in this study associated with outbreaks were compared for genotype and
448
strain concordance within each outbreak (Figs. 2a and 2b). These samples (n = 131) were
449
associated with 50 different outbreaks in Orange County, CA, including 26 outbreaks with 2
450
samples per outbreak and 24 outbreaks with 3 or more samples per outbreak. Complete
451
concordance among genotype identifications was observed for 24 of the 26 outbreaks
452
represented by 2 cases, meaning that both samples from the same outbreak showed the exact
453
same top match or matches to the ESI-MS database. The majority (n = 21) of the outbreaks
454
with concordant genotype results were associated with GII.4, while a few were associated
455
with other genotypes, including GI.3, GII.1, GII.12, and GII.17. One of the outbreaks
456
represented by 2 cases showed semi-concordant genotype results, in which one sample
457
showed top matches to GII.4 and GII.17 and the other sample showed top matches to GII.4,
AC C
EP
TE D
445
20
ACCEPTED MANUSCRIPT
GII.17, and GII.P16/GII.2. The one outbreak represented by 2 cases that showed discordant
459
genotype results was associated with one sample that had a top match to GII.4 and another
460
sample with a top match to GII.1. When the 26 outbreaks represented by 2 cases each were
461
compared based on identification of samples at the strain level, there were 17 outbreaks with
462
concordant results, 4 outbreaks with semi-concordant results, and 5 outbreaks with discordant
463
results. The majority (82%) of concordant outbreaks were associated with GII.4 strains, while
464
the remaining concordant outbreaks were associated with GI.3, GII.1, or GII.12 strains. The
465
outbreaks with semi-concordant strain identifications included the 1 outbreak described above
466
with semi-concordant genotype results, as well as 3 involving strains from GII.4 and GII.17.
467
With the exception of the outbreak described above involving 2 different genotypes,
468
outbreaks with discordant strain identifications were all associated with GII.4, but were linked
469
to different GII.4 strains in the database.
SC
M AN U
Outbreaks represented by 3 or more cases are shown graphically (Figs. 2a and 2b). As
TE D
470
RI PT
458
shown in Fig. 2a, samples from the same outbreak had a high level of genotype concordance,
472
with 18 of 24 outbreaks showing 100% concordance among all samples analyzed. These
473
outbreaks were each represented by 3-5 samples and were primarily outbreaks of GII.4 (n =
474
14), but also included one outbreak of each of the following: GII.1, GII.6, GII.12, and a set of
475
recombinants (GII.P16/GII.2 and GII.Pa/GII.3). An additional 5 outbreaks showed
476
concordance among genotype results for the majority of samples, but had one sample with
477
semi-concordant genotype results. For example, 2 of the samples associated with outbreak no.
478
2 showed top matches to GII.4 and a third sample showed a top match to database entries
479
corresponding to GII.4 and GII.17. Only 1 of the 24 outbreaks had samples that showed
AC C
EP
471
21
ACCEPTED MANUSCRIPT
480
discordant genotype results (outbreak no. 7). This outbreak had 2 samples with top matches
481
to both GII.16 and GII.P12/GII.10, while a third sample had a top match to GII.4.
482
As shown in Fig. 2b, when the results were compared on the basis of the top strain(s) in the database matching each sample, the number of concordant outbreaks decreased. Out of
484
the 18 outbreaks that showed concordance on the basis of genotype, only 9 continued to show
485
complete concordance with strain identifications. For most (80%) of the outbreaks showing
486
semi-concordance or discordance, the majority of samples showed complete concordance for
487
the top strain match, with just one sample showing a semi-concordant or discordant result.
488
For example, all 5 samples from outbreak no. 17 showed a top match to one of the ESI-MS
489
signatures for GII.4 New Orleans, but 1 of the samples also matched another ESI-MS
490
signature linked to a different sample of GII.4 New Orleans found in the database. Since
491
strains are defined here as ESI-MS database entries with distinct signatures, this sample was
492
determined to be semi-concordant.
SC
M AN U
TE D
493
RI PT
483
The results of statistical analysis on all outbreak samples revealed that both genotype and strain concordance values were significantly greater (p < 0.05, two-tailed) for the original
495
ESI-MS identifications as compared to the randomized dataset. Based on a scale of 0
496
(discordant) to 2 (fully concordant), the average genotype concordance score for the original
497
dataset was 1.8 ± 0.6 compared to 1.0 ± 1.0 in the randomized dataset, while the average
498
strain concordance score for the original dataset was 1.3 ± 0.8 compared to 0.1 ± 0.4 in the
499
randomized dataset. These results show that RT-PCR/ESI-MS allows for a high level of
500
agreement among outbreak samples based on genotype, with an average score close to that for
501
full concordance. The lower level of agreement found with strains is likely due to the fact
502
that single nucleotide polymorphisms among samples associated with the same outbreak can
AC C
EP
494
22
ACCEPTED MANUSCRIPT
result in different matches to the database, especially when the database is largely comprised
504
of fragmented sequences from GenBank. As the database becomes updated with additional
505
reference samples, it is likely that the concordance score will improve. Overall, the results of
506
the outbreak concordance analysis indicate the potential usefulness of this assay for norovirus
507
detection and differentiation in surveillance and outbreak applications, especially with regard
508
to genotype identifications.
509
4. Conclusions
SC
RI PT
503
An RT-PCR/ESI-MS assay was developed in this study for the detection and
511
differentiation of human noroviruses as well as the detection of hepatitis A virus. The assay
512
was successful at identifying human norovirus in 92% of clinical samples and showed a
513
specificity of 100% when tested against closely related viruses. In addition to detection of
514
norovirus, the assay was able to correctly identify norovirus at the genogroup level in 98% of
515
amplified samples and at the genotype level in 71-75% of samples. As more reference
516
samples are added to the database, the genotyping ability of the assay is expected to improve.
517
This assay shows potential to reduce the time and labor needed to detect and differentiate
518
noroviruses and may enhance the ability of public health scientists to identify the source and
519
the spread of norovirus illnesses related to outbreak situations. In addition to identification of
520
strains currently in circulation, this assay allows for the classification of recombinants and
521
new norovirus strains due to its range of target amplicons. Importantly, this assay would not
522
be expected to replace the current sequencing-based genotyping method, but rather it would
523
provide an additional tool for rapid characterization of noroviruses in outbreak or surveillance
524
situations.
525
Acknowledgments
AC C
EP
TE D
M AN U
510
23
ACCEPTED MANUSCRIPT
Funding for this study was provided by the FDA Center for Food Safety and Applied
527
Nutrition (CFSAN). We would like to thank the FDA Commissioner's Fellowship
528
Program; Steven Musser, John Callahan, Rebecca Bell, Marc Allard, and Erik Burrows at
529
FDA/CFSAN for their support; Lee-Ann Jaykus at North Carolina State University for
530
assistance with this project; and Natasha Fazel for help with sample processing.
531
References
532
Blanton, L. H., Adams, S. M., Beard, R. S., Wei, G., Bulens, S. N., Widdowson, M. A., Glass,
SC
RI PT
526
R. I., & Monroe, S. S. (2006). Molecular and epidemiologic trends of caliciviruses
534
associated with outbreaks of acute gastroenteritis in the United States, 2000−2004.
535
Journal of Infectious Diseases, 193(3), 413-421.
M AN U
533
Blyn, L. B., Hall, T. A., Libby, B., Ranken, R., Sampath, R., Rudnick, K., Moradi, E., Desai,
537
A., Metzgar, D., Russell, K. L., Freed, N. E., Balansay, M., Broderick, M. P., Osuna,
538
M. A., Hofstadler, S. A., & Ecker, D. J. (2008). Rapid detection and molecular
539
serotyping of adenovirus by use of PCR followed by electrospray ionization mass
540
spectrometry. Journal of Clinical Microbiology, 46(2), 644-651.
543 544 545
EP
542
Bull, R. A., Tanaka, M. M., & White, P. A. (2007). Norovirus recombination. Journal of General Virology, 88(12), 3347-3359. Crowder, C. D., Matthews, H. E., Rounds, M. A., Li, F., Schutzer, S. E., Sampath, R.,
AC C
541
TE D
536
Hofstadler, S. A., Ecker, D. J., & Eshoo, M. W. (2012). Detection of heartworm
infection in dogs via PCR amplification and electrospray ionization mass spectrometry
546
of nucleic acid extracts from whole blood samples. American Journal of Veterinary
547
Research, 73(6), 854-859.
24
ACCEPTED MANUSCRIPT
548
Crowder, C. D., Matthews, H. E., Schutzer, S., Rounds, M. A., Luft, B. J., Nolte, O., Campbell, S. R., Phillipson, C. A., Li, F., Sampath, R., Ecker, D. J., & Eshoo, M. W.
550
(2010). Genotypic variation and mixtures of Lyme Borrelia in Ixodes ticks from North
551
America and Europe. Plos One, 5(5), e10650-e10650.
552
RI PT
549
Ecker, D. J., Sampath, R., Blyn, L. B., Eshoo, M. W., Ivy, C., Ecker, J. A., Libby, B., Samant, V., Sannes-Lowery, K. A., Melton, R. E., Russell, K., Freed, N., Barrozo, C., Wu, J.,
554
Rudnick, K., Desai, A., Moradi, E., Knize, D. J., Robbins, D. W., Hannis, J. C.,
555
Harrell, P. M., Massire, C., Hall, T. A., Jiang, Y., Ranken, R., Drader, J. J., White, N.,
556
McNeil, J. A., Crooke, S. T., & Hofstadler, S. A. (2005). Rapid identification and
557
strain-typing of respiratory pathogens for epidemic surveillance. Proceedings of the
558
National Academy of Sciences, 102(22), 8012-8017.
M AN U
559
SC
553
Ecker, D. J., Sampath, R., Massire, C., Blyn, L. B., Hall, T. A., Eshoo, M. W., & Hofstadler, S. A. (2008). Ibis T5000: a universal biosensor approach for microbiology. Nature
561
Reviews Microbiology, 6(7), 553-558.
562
TE D
560
Eshoo, M. W., Crowder, C. D., Li, H., Matthews, H. E., Meng, S., Sefers, S. E., Sampath, R., Charles W. Stratton, Blyn, L. B., Ecker, D. J., & Tang, Y.-W. (2010). Detection and
564
identification of Ehrlichia species in blood by use of PCR and electrospray ionization
565
mass spectrometry. Journal of Clinical Microbiology, 48(2), 472-478.
567 568
AC C
566
EP
563
Eshoo, M. W., Whitehouse, C. A., Zoll, S. T., Massire, C., Pennella, T.-T. D., Blyn, L. B., Sampath, R., Hall, T. A., Ecker, J. A., Desai, A., Wasieloski, L. P., Li, F., Turell, M. J.,
Schink, A., Rudnick, K., Otero, G., Weaver, S. C., Ludwig, G. V., Hofstadler, S. A., &
569
Ecker, D. J. (2007). Direct broad-range detection of alphaviruses in mosquito extracts.
570
Virology, 368(2), 286-295.
25
ACCEPTED MANUSCRIPT
571
Grant-Klein, R. J., Baldwin, C. D., Turell, M. J., Rossi, C. A., Li, F., Lovari, R., Crowder, C. D., Matthews, H. E., Rounds, M. A., Eshoo, M. W., Blyn, L. B., Ecker, D. J., Sampath,
573
R., & Whitehouse, C. A. (2010). Rapid identification of vector-borne flaviviruses by
574
mass spectrometry. Molecular and Cellular Probes, 24(4), 219-228.
575
RI PT
572
Green, K. Y., Chanock, R. M., & Kapikian, A. Z. (2001). Human Caliciviruses. In D. M.
Knipe, P. M. Howley, D. E. Griffin, M. A. Martin, R. A. Lamb, B. Roizman & S. E.
577
Straus (Eds.), Field's Virology (4th ed., pp. 841-874). Philadelphia: Lippincott
578
Williams and Wilkins.
Hall, A. J., Eisenbart, V. G., Etingüe, A. L., Gould, L. H., Lopman, B. A., & Parashar, U. D.
M AN U
579
SC
576
580
(2012). Epidemiology of foodborne norovirus outbreaks, United States, 2001–2008.
581
Emerging Infectious Diseases, 18(10), 1566–1573.
582
Hall, A. J., Vinjé, J., Lopman, B. A., Park, G. W., Yen, C., Gregoricus, N., & Parashar, U. D. (2011). Centers for Disease Control and Prevention. Updated norovirus outbreak
584
management and disease prevention guidelines. Morbidity and Mortality Weekly
585
Report, 60(3), 1-15.
TE D
583
Hall, T. A. (1999). BioEdit: a user-friendly biological sequence alignment editor and analysis
587
program for Windows 95/98/NT. Nucleic Acids Symposium Series, 41, 95-98.
588
Hannis, J. C., Manalili, S. M., Hall, T. A., Ranken, R., White, N., Sampath, R., Blyn, L. B.,
590 591 592
AC C
589
EP
586
Ecker, D. J., Mandrell, R. E., Fagerquist, C. K., Bates, A. H., Miller, W. G., & Hofstadler, S. A. (2008). High-resolution genotyping of Campylobacter species by use
of PCR and high-throughput mass spectrometry. Journal of Clinical Microbiology, 46(4), 1220-1225.
26
ACCEPTED MANUSCRIPT
Hofstadler, S. A., Sampath, R., Blyn, L. B., Eshoo, M. W., Hall, T. A., Jiang, Y., Drader, J. J.,
594
Hannis, J. C., Sannes-Lowery, K. A., Cummins, L. L., Libby, B., Walcott, D. J.,
595
Schink, A., Massire, C., Ranken, R., Gutierrez, J., Manalili, S., Ivy, C., Melton, R.,
596
Levene, H., Barrett-Wilt, G., Li, F., Zapp, V., White, N., Samant, V., McNeil, J. A.,
597
Knize, D., Robbins, D., Rudnick, K., Desai, A., Moradi, E., & Ecker, D. J. (2005).
598
TIGER: the universal biosensor. International Journal of Mass Spectrometry, 242(1),
599
23-41.
SC
600
RI PT
593
Jacob, D., Sauer, U., Housley, R., Washington, C., Sannes-Lowery, K., Ecker, D. J., Sampath, R., & Grunow, R. (2012). Rapid and high-throughput detection of highly pathogenic
602
bacteria by Ibis PLEX-ID technology. Plos One, 7(6), e39928.
603
M AN U
601
Jiang, Y., & Hofstadler, S. A. (2003). A highly efficient and automated method of purifying and desalting PCR products for analysis by electrospray ionization mass spectrometry.
605
Analytical Biochemistry, 316(1), 50-57.
606
TE D
604
Kageyama, T., Kojima, S., Shinohara, M., Uchida, K., Fukushi, S., Hoshino, F. B., Takeda, N., & Katayama, K. (2003). Broadly reactive and highly sensitive assay for norwalk-
608
like viruses based on real-time quantitative reverse transcription-PCR. Journal of
609
Clinical Microbiology, 41(4), 1548-1557.
611 612
Kroneman, A., Vega, E., Vennema, H., Vinjé, J., White, P., Hansman, G., Green, K., Martella,
AC C
610
EP
607
V., Katayama, K., & Koopmans, M. (2013). Proposal for a unified norovirus
nomenclature and genotyping. Archives of Virology, 1-10.
613
Kroneman, A., Vennema, H., Deforche, K., Avoort, H., Peñaranda, S., Oberste, M., Vinjé, J.,
614
& Koopmans, M. (2011). An automated genotyping tool for enteroviruses and
615
noroviruses. Journal of Clinical Microbiology, 51(2), 121-125.
27
ACCEPTED MANUSCRIPT
Matthews, J. E., Dickey, B. W., Miller, R. D., Felzer, J. R., Dawson, B. P., Lee, A. S., Rocks,
617
J. J., Kiel, J., Montes, J. S., Moe, C. L., Eisenberg, J. N. S., & Leon, J. S. (2012). The
618
epidemiology of published norovirus outbreaks: a review of risk factors associated
619
with attack rate and genogroup. Epidemiology & Infection, 140(07), 1161-1172.
620
RI PT
616
Mattison, K., Grudeski, E., Auk, B., Charest, H., Drews, S. J., Fritzinger, A., Gregoricus, N., Hayward, S., Houde, A., Lee, B. E., Pang, X. L. L., Wong, J. L., Booth, T. F., & Vinje,
622
J. (2009). Multicenter comparison of two norovirus ORF2-based genotyping protocols.
623
Journal of Clinical Microbiology, 47(12), 3927-3932.
SC
621
Nainan, O. V., Xia, G., Vaughan, G., & Margolis, H. S. (2006). Diagnosis of hepatitis A virus
625
infection: a molecular approach. Clinical Microbiology Reviews, 19(1), 63-79.
M AN U
624
Pierce, S. E., Bell, R. L., Hellberg, R. S., Cheng, C. M., Chen, K. S., Williams-Hill, D. M.,
627
Martin, W. B., & Allard, M. W. (2012). Detection and Identification of Salmonella
628
enterica, Escherichia coli, and Shigella spp. via PCR-electrospray ionization mass
629
spectrometry: isolate testing and analysis of food samples. Applied and Environmental
630
Microbiology, 78(23), 8403-8411.
TE D
626
Sampath, R., Hofstadler, S. A., Blyn, L. B., Eshoo, M. W., Hall, T. A., Massire, C., Levene, H.
632
M., Hannis, J. C., Harrell, P. M., Neuman, B., Buchmeier, M. J., Jiang, Y., Ranken, R.,
633
Drader, J. J., Samant, V., Griffey, R. H., McNeil, J. A., Crooke, S. T., & Ecker, D. J.
635 636
AC C
634
EP
631
(2005). Rapid identification of emerging pathogens: coronavirus. Emerging Infectious Diseases, 11(3), 373-379.
Sampath, R., Mulholland, N., Blyn, L. B., Massire, C., Whitehouse, C. A., Waybright, N.,
637
Harter, C., Bogan, J., Miranda, M. S., Smith, D., Baldwin, C., Wolcott, M., Norwood,
638
D., Kreft, R., Frinder, M., Lovari, R., Yasuda, I., Matthews, H., Toleno, D., Housley,
28
ACCEPTED MANUSCRIPT
639
R., Duncan, D., Li, F., Warren, R., Eshoo, M. W., Hall, T. A., Hofstadler, S. A., &
640
Ecker, D. J. (2012). Comprehensive biothreat cluster identification by
641
PCR/electrospray-ionization mass spectrometry. Plos One, 7(6), e36528. Sampath, R., Russell, K. L., Massire, C., Eshoo, M. W., Harpin, V., Blyn, L. B., Melton, R.,
RI PT
642
Ivy, C., Pennella, T., Li, F., Levene, H., Hall, T. A., Libby, B., Fan, N., Walcott, D. J.,
644
Ranken, R., Pear, M., Schink, A., Gutierrez, J., Drader, J., Moore, D., Metzgar, D.,
645
Addington, L., Rothman, R., Gaydos, C. A., Yang, S., St. George, K., Fuschino, M. E.,
646
Dean, A. B., Stallknecht, D. E., Goekjian, G., Yingst, S., Monteville, M., Saad, M. D.,
647
Whitehouse, C. A., Baldwin, C., Rudnick, K. H., Hofstadler, S. A., Lemon, S. M., &
648
Ecker, D. J. (2007a). Global surveillance of emerging influenza virus genotypes by
649
mass spectrometry. Plos One, 2(5), e489.
M AN U
650
SC
643
Sampath, R. A., Hall, T. A., Massire, C., LI, F., Blyn, L. B., Eschoo, M. W., Hofstadler, S. A., & Ecker, D. J. (2007b). Rapid identification of emerging infectious agents using PCR
652
and electrospray ionization mass spectrometry. Annals of the New York Academy of
653
Sciences, 1102, 109-120.
656 657 658 659
EP
655
Sanchez, G., Bosch, A., & Pinto, R.M. 2007. Hepatitis A virus detection in food: current and future prospects. Letters in Applied Microbiology, 45(1), 1-5. Scallan, E., Hoekstra, R. M., Angulo, F. J., Tauxe, R. V., Widdowson, M. A., Roy, S. L.,
AC C
654
TE D
651
Jones, J. L., & Griffin, P. M. (2011). Foodborne illness acquired in the United States -
major pathogens. Emerging Infectious Diseases, 17(1), 7-15.
Shen, J., Wang, F., Li, F., Housley, R., Carolan, H., Yasuda, I., Burrows, E., Binet, R.,
660
Sampath, R., Zhang, J., Allard, M. W., & Meng, J. (2013). Rapid identification and
661
differentiation of non-O157 shiga toxin–producing Escherichia coli using polymerase
29
ACCEPTED MANUSCRIPT
662
chain reaction coupled to electrospray ionization mass spectrometry. Foodborne
663
Pathogens and Disease, 10(8), 737-743. Siebenga, J. J., Vennema, H., Renckens, B., de Bruin, E., van der Veer, B., Siezen, R. J., &
665
Koopmans, M. (2007). Epochal evolution of GGII.4 norovirus capsid proteins from
666
1995 to 2006. Journal of Virology, 81(18), 9932-9941.
RI PT
664
Siebenga, J. J., Vennema, H., Zheng, D. P., Vinje, J., Lee, B. E., Pang, X. L., Ho, E. C. M.,
668
Lim, W., Choudekar, A., Broor, S., Halperin, T., Rasool, N. B. G., Hewitt, J.,
669
Greening, G. E., Jin, M., Duan, Z. J., Lucero, Y., O'Ryan, M., Hoehne, M., Schreier,
670
E., Ratcliff, R. M., White, P. A., Iritani, N., Reuter, G., & Koopmans, M. (2009).
671
Norovirus illness is a global problem: emergence and spread of norovirus GII.4
672
variants, 2001-2007. Journal of Infectious Diseases, 200(5), 802-812.
673
Simner, P. J., Uhl, J. R., Hall, L., Weber, M. M., Walchak, R. C., Buckwalter, S., &
M AN U
SC
667
Wengenack, N. L. (2013). Broad-range direct detection and identification of fungi by
675
use of the PLEX-ID PCR-electrospray ionization mass spectrometry (ESI-MS) system.
676
Journal of Clinical Microbiology, 51(6), 1699-1706. Thompson, J. D., Higgins, D. G., & Gibson, T. J. (1994). CLUSTAL W: improving the
EP
677
TE D
674
sensitivity of progressive multiple sequence alignment through sequence weighting,
679
position-specific gap penalties and weight matrix choice. Nucleic Acids Research,
680
AC C
678
22(22), 4673-4680.
681
Van Ert, M. N., Hofstadler, S. A., Jiang, Y., Busch, J. D., Wagner, D. M., Drader, J. J., Ecker,
682
D. J., Hannis, J. C., Huynh, L. Y., Schupp, J. M., Simonson, T. S., & Keim, P. (2004).
683
Mass spectrometry provides accurate characterization of two genetic marker types in
684
Bacillus anthracis. BioTechniques, 37(4), 642-651.
30
ACCEPTED MANUSCRIPT
685
Vega, E., Barclay, L., Gregoricus, N., Williams, K., Lee, D., & Vinje, J. (2011). Novel
686
surveillance network for norovirus gastroenteritis outbreaks, United States. Emerging
687
Infectious Diseases, 17(8), 1389-1395.
689 690
Vega, E., & Vinje, J. (2011). Novel GII.12 norovirus strain, United States, 2009-2010. Emerging Infectious Diseases, 17(8), 1516-1518.
RI PT
688
Vinje, J., Hamidjaja, R. A., & Sobsey, M. D. (2004). Development and application of a capsid VP1 (region D) based reverse transcription PCR assay for genotyping of genogroup I
692
and II noroviruses. Journal of Virological Methods, 116, 109-117.
694
Wexler, H. M. (2007). Bacteroides: the good, the bad, and the nitty-gritty. Clinical
M AN U
693
SC
691
Microbiology Reviews, 20(4), 593-621.
Zheng, D. P., Ando, T., Fankhauser, R. L., Beard, R. S., Glass, R. I., & Monroe, S. S. (2006).
696
Norovirus classification and proposed strain nomenclature. Virology, 346(2), 312-323.
697
Zheng, D. P., Widdowson, M. A., Glass, R. I., & Vinje, J. (2010). Molecular epidemiology of
TE D
695
698
genogroup II-genotype 4 noroviruses in the United States between 1994 and 2006.
699
Journal of Clinical Microbiology, 48(1), 168-177. Figure Captions.
701
Figure 1. Layout of the RT-PCR/ESI-MS assay plate developed in this study. This assay
702
plate is able to process 12 samples per run and can simultaneously test for the presence of
703
norovirus and hepatitis A virus. PP, PCR primer pair(s).
704
Figure 2a. Concordance among the genotype identified by RT-PCR/ESI-MS for samples (n =
705
79) from the same outbreak.
706
Figure 2b. Concordance among strains identified by RT-PCR/ESI-MS for samples (n = 79)
707
from the same outbreak.
AC C
EP
700
31
ACCEPTED MANUSCRIPT
Tables
RI PT
Table 1. Primers designed for use in the RT-PCR/ESI-MS assay described in this study. PCR primer pair (PP) groupings indicate singleplex or multiplex arrangements. RdRp, RNA-dependent RNA polymerase; VP1, major capsid protein. PCR primer pair (PP) grouping
Target organism
Target gene
Product length
Primer sequences, 5’-3’ (F, forward; R, reverse)
3047
PP1
Norovirus GII and GIV
RdRp
57 bp
F: TGGGAGGGCGATCGCAATCT R: TCATTCGACGCCATCTTCATTCAC
5742
PP2
Norovirus GI and GII
RdRp
101 bp
F: TAGGCCATGTTCCGCTGGAT R: TGTCCTTCGACGCCATCATCATT
5698
PP3
Norovirus GII, including all GII.4
VP1
141 bp
F: TCAGAGGTCAACAATGAGGTTATGGC R: TACTGTAAACTCTCCACCAGGGGC
5699
PP4
Norovirus GII.4
VP1
3035
PP4
Hepatitis A virus
62 bp
5656
PP5
Norovirus GI
Protease gene/RNA polymerase gene VP1
3043
PP5
Hepatitis A virus
79 bp
5701
PP6
Norovirus GII
RNA polymerase gene VP1
TE D
M AN U
SC
Primer pair name
AC C
EP
126 bp
67 bp
120 bp
F: TCAGGAATGGGTGCAGCACTT R: TAGCCTGATTTATGAAGCTTGCACTC F: TGAAAGTCAGAGAATGATGAAAGTGGA R: TGCGTTTTGGAGACTACATTCATTGAACA
F: TTGGAGTCTTTGTCTTTGTTTCTTGGGT R: TCAGGCAGTTCCCACAGGCTT F: TCCAGGGATGTGTGGTGGGGC R: TCCAGCAACATGAATGCCCAAAATGGCATTCTG F: TTGGCTGGGAATGCGTTCAC R: TACATCAATAATCACATGAGGGCACAT
32
ACCEPTED MANUSCRIPT
VP1
73 bp
5702
PP7
Norovirus GI
VP1
101 bp
5744
PP7
Norovirus GII.4
VP1
135 bp
5651
PP8
Norovirus GIV
RdRp
120 bp
4437
PP8
Internal Positive Control
N/A
77 bp
F: TGGTTACTTCAGGTTTGATTCTTGGGT R: TCGCCCAGTTCCAGTTCCCA F: TACACCCGGTGATGTTTTGTTTGA R: TCATATTGCCAACCCAGCCATTATACAT F: TCAGGCTATGTCACAGTGGCTCA R: TCGTCTACGCCCCGTTCCA F: TCCTTCTATGGTGATGATGAGATTGTGTC R: TGGGCCCTCTGTCTTGTCTGG F: TGACGAGTTCATGAGGGCAGGC R: TCTGGCCTTTCAGCAAGTTTCCAAC
RI PT
Norovirus GII
SC
PP6
M AN U
5746
Table 2. Norovirus samples (n = 36) used to build the RT-PCR/ESI-MS database. All samples were derived from human stool specimens associated with norovirus illness and were previously genotyped through CaliciNet based on sequencing (Vinje et al., 2004). Strain
Samples (n)
GI.3 GII.1
N/A N/A
2 3
GII.4
Minerva
9
GII.4
New Orleans
19
GII.6
N/A
2
GII.12
N/A
Institution Hotel, long-term care facility Institution, school, long-term care facility Institution, hospital, school, long-term care facility Long-term care facility School
EP
AC C 1
Outbreak setting(s)
TE D
Genotype
Collection year(s)
2007 2011
No. of unique RTPCR/ESI-MS signatures 1 2
2008-2011
6
2010-2011
10
2010
1
2011
1
33
ACCEPTED MANUSCRIPT
AC C
EP
TE D
M AN U
SC
RI PT
Table 3. Results of inclusivity, exclusivity and negative control testing with the RT-PCR/ESI-MS assay designed in this study. Test Sample type N RT-PCR/ESI-MS ID (n) Inclusivity Norovirus GI 16 Norovirus GI (n = 8) Norovirus GI/GII (n = 2) Not detected (n = 6) Inclusivity Norovirus GI/GII 1 Norovirus GI/GII Inclusivity Norovirus GII 126 Norovirus GII (n = 125) Not detected (n = 1) Inclusivity Norovirus, genogroup 17 Norovirus GI (n = 3) unknown Norovirus GII (n = 8) Not detected (n = 6) Inclusivity Hepatitis A virus 3 Hepatitis A virus Exclusivity Murine norovirus-1 1 No detections Exclusivity Feline calicivirus 1 No detections Exclusivity Poliovirus type 3 1 No detections Negative Control Norovirus-negative stool 10 No detections sample Negative Control RNA extraction reagent 11 No detections blank Negative Control Non-template control 23 No detections
34
ACCEPTED MANUSCRIPT
Table 4. Identification of norovirus samples (n = 41) by real-time RT-PCR, RT-PCR/ESI-MS and a sequencing-based method for norovirus genotyping (Vinje et al., 2004). RT-PCR/ESI-MS genotype
Genotyping concordance
GI.2 GI.3 GI.3 GI.3 GI.3 GI.3 Unable to genotype GI.3 GII.1b GII.4 GII.4
GI.2 GI.3 GI.4 GI.3 and GII.12 GI.3 and GII.4 Not detected Not detected GI.14, GI.P3/GI.11a and GII.4 GII.1 GII.4 GII.4, GII.12, and GII.17
Concordant Concordant Discordant Semi-concordant Semi-concordant N/A N/A Semi-concordant Concordant Concordant Semi-concordant
1 4 1 1 1 1 1
GII GII GII GII GII GII GII
GII.4 GII.12c GII.12 Bacteroides spp. Negative Negative Negative
a
EP
TE D
M AN U
SC
RI PT
Sequencing-based genotype
2 5 1 1 1 5 1 1 3 10 1
Original identification with real-time RT-PCR GI GI GI GI GI GI GI GI/GII GII GII GII
GII.16 and GII.P12/GII.10 GII.12 GII.4 and GII.12 GII.4 GII.6 GII.4 GII.P19/GII.5
Discordant Concordant Semi-concordant Discordant N/A N/A N/A
AC C
N
Recombinant nomenclature according to Kroneman et al. (2013). One of these samples was originally found to have a mixed sequence, but repeat RNA extraction and sequencing resulted in a GII.1 identification, concordant with the RT-PCR/ESI-MS genotype. c One of these samples was originally identified as GII.1 by sequencing, but repeat RNA extraction and sequencing resulted in a GII.12 identification, concordant with the RT-PCR/ESI-MS genotype. b
35
AC C
EP
TE D
M AN U
SC
RI PT
ACCEPTED MANUSCRIPT
ACCEPTED MANUSCRIPT
RI PT
7
SC M AN U
5
Discordant Semiconcordant
TE D
4
Concordant
EP
3
2
AC C
No. of Samples
6
1
0 1
2
3
4
5
6
7
8
9
10
11
12
13
14
Outbreak No.
15
16
17
18
19
20
21
22
23
24
ACCEPTED MANUSCRIPT
RI PT
7
SC M AN U
5
Discordant Semiconcordant
TE D
4
Concordant
EP
3
2
AC C
No. of Samples
6
1
0 1
2
3
4
5
6
7
8
9
10
11
12
13
14
Outbreak No.
15
16
17
18
19
20
21
22
23
24
ACCEPTED MANUSCRIPT
Highlights
EP
TE D
M AN U
SC
RI PT
This paper describes a novel method for the identification of human noroviruses. RT-PCR was combined with mass spectrometry to develop a foodborne viral assay. This assay showed a sensitivity of 92% for detection of norovirus in 160 samples. Genogroup and genotype were correctly identified in the majority of viral samples. The assay showed 100% specificity and was also able to detect hepatitis A virus.
AC C
• • • • •