Sequence conservation of gene 8 between human and porcine group C rotaviruses and its relationship to the VP7 gene of group A rotaviruses

Sequence conservation of gene 8 between human and porcine group C rotaviruses and its relationship to the VP7 gene of group A rotaviruses

VIROLOGY 182, 562-569 Sequence (1991) Conservation of Gene 8 between Human and Porcine Group C Rotaviruses and Its Relationship to the VP7 Gene of...

1MB Sizes 0 Downloads 53 Views

VIROLOGY

182, 562-569

Sequence

(1991)

Conservation of Gene 8 between Human and Porcine Group C Rotaviruses and Its Relationship to the VP7 Gene of Group A Rotaviruses’

YUAN QlAN,* BAOMING JIANG,t LINDA J. SAIF,i’ SHIEN Y. KANG,t YOSHIRO lSHlMARU,+ YASUTAKA YAMASHITAJ MITSUAKI OSETO,” AND KIM Y. GREEN*v2 *Laboratory of infectious Diseases, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, Maryland 20892; tFood Animal Health Research Program, Ohio Agricultural Research and Development Center, The Ohio State University, Wooster, Ohio 4469 1; +lshimaru Pediatric Clinic, 6-5-l Sanbancho, Matsuyama, Ehime 790, Japan,’ QEhime Prefectural Institute of Public Health, 234-8 Sanban-cho. Matsuyama, Ehime 790, Japan; and “Matsuyama Prefectural Institute of Public Health, 132 Kita-Mochida-cho, Matsuyama, Ehime 790, Japan Received December

13, 1990; accepted

February

12, 199 1

cDNA libraries from porcine group (Gp) C rotavirus strain Cowden and a human Gp C rotavirus strain were generated. The complete nucleotide sequence of gene 8 from the Cowden strain was determined from gene 8-specific clones and viral transcript RNA. A full-length gene 8 clone was generated from the human Gp C virus by polymerase chain reaction (PCR) using primers deduced from the 3’ and 5’ ends of the Cowden strain gene 8, and the sequence of the human Gp C gene 8 was determined from this clone and gene 8 clones in the cDNA library. The gene 8 from the Cowden or the human Gp C strain is 1063 nucleotides in length and contains a long open reading frame beginning at the 49th nucleotide from the 5’ end and terminating with a stop codon 16 bases from the 3’ end. The encoded protein contains 332 amino acids (predicted molecular weight of 37.3 kDa) with two potential N-linked glycosylation sites in the porcine strain and three in the human strain. The polypeptide products derived from in vitro translation of the transcript RNA generated from a porcine gene 8 clone containing the entire open reading frame were analogous in size with the Gp A VP7. The gene 8 of porcine and human Gp C rotaviruses exhibited considerable nucleotide and deduced amino acid sequence identity (83.8 and 88.0%, respectively). Comparison of the Gp C gene 8 protein sequence with the VP7 protein of Gp A rotavirus revealed structural similarities, although the overall amino acid identity was low (~30%). These data suggest that the gene 8 of the porcine or human Gp C rotavirus encodes a protein corresponding to the VP7 outer capsid glycoprotein of Gp A rotaviruses and that the eighth gene is highly conserved in the porcine and human Gp o 1991 Academic Press. Inc C strains examined in this study.

al., 1989; Ushijima et al., 1989) suggest that these viruses could emerge as important pathogens. Gp C rotaviruses were initially detected in piglets with diarrhea (Saif et al., 1980). The Gp C viruses were identified as rotaviruses because of similarities to Gp A rotaviruses in morphology by negative stain electron microscopy and genomic structure (i.e., 11 segments of double-stranded RNA). However, the lack of serologic cross-reactivity with Gp A rotavirus caused them to be designated “atypical” rotaviruses or “pararotaviruses” (Bohl et a/., 1982). Limited serologic surveys have suggested that Gp C rotavirus infections are common in pigs in North America, Europe, and Australia (Saif et al., 1980; Saif and Theil, 1985; Nagesha et a/., 1988). In humans, Gp C rotavirus has been associated with sporadic cases and outbreaks of diarrhea in several countries, suggesting that this group is globally distributed. The unavailability of diagnostic reagents has hampered epidemiologic studies of Gp C rotavirus infection in humans. Hybridization studies have shown that the human and porcine Gp C rotaviruses are closely related geneti-

INTRODUCTION Rotaviruses are etiological agents of severe diarrhea in the young of humans and many animal species (Kapikian and Chanock, 1990). These viruses are members of the family Reoviridae and have been assigned to at least seven groups designated A through G (Estes and Cohen, 1989; Saif, 1990). Among them, Gp A is clearly the most important in diarrhea1 disease (Kapikian et al., 1989). The importance of “non-group A” rotaviruses in diarrhea1 disease has not been established. However, reports of severe gastroenteritis in adults and children associated with Gp B or C rotaviruses (Hung et a/., 1984; Beards et al., 1989; Bonsdorff and Svensson, 1988; Brown et a/., 1988; Caul et al., 1990; Chen et a/., 1988; Dimitrov et a/., 1986; Penaranda et a/., 1989; lshimaru et al., 1990; Matsumoto et

’ The nucleotlde sequence data reported in this paper have been submitted to the GenBank nucleotide sequence database and have been assigned Accession Nos. M61100 and M6 1101. 2 To whom reprint requests should be addressed. 0042-6822/91

$3.00

Copyright 0 1991 by Academic Press, Inc. All rights of reproduction in any form reserved.

562

SEQUENCE

CONSERVATION

OF GENE 8

563

tally (Bridger et al., 1986; Qian et al., 1991). In this study, cDNA libraries of a porcine Gp C reference strain and a human Gp C rotavirus were generated in order to further investigate the molecular basis for the genetic relatedness within Gp C as well as the lack of relatedness among different rotavirus groups and to develop reagents for epidemiological studies of Gp C rotaviruses. MATERIALS

AND METHODS

Viruses. Porcine Gp C rotavirus reference strain Cowden was propagated in gnotobiotic piglets as described (Bohl eta/., 1982). Human Gp C rotavirus strain 88-220 was purified from a stool specimen collected from a child with diarrhea in Matsuyama, Japan, in 1988 (Ishimaru et a/., 1990). Construction of cDNA libraries. Single-stranded (ss) RNA of porcine or human Gp C rotavirus was produced by in vitro transcription of purified virus particles (Qian et a/., 1991). The cDNA Synthesis Plus kit (Amersham Co.) was used for cDNA synthesis from the RNA transcripts of the human or porcine Gp C virus. A “random primer” included in the kit was used to initiate cDNA synthesis. In addition, transcripts of the porcine Cowden strain were polyadenylated (Sippel, 1973) and oligo(dT) was used to initiate cDNA synthesis. Following first-strand cDNA synthesis using reverse transcriptase, second-strand cDNA synthesis was accomplished using Escherichia co/i RNase and DNA polymerase included in the kit. The double-stranded (ds) cDNA was blunt ended with T4 DNA polymerase, ligated into the unique Smal site of vector pTZ18R (Pharmacia Co.) and used for transformation of DH-5 a! E. co/i cells (BRL). Positive clones were identified by colony hybridization using 32P-labeled RNA transcripts produced from homologous virus particles as probe (Qian et al., 1991). Determination of gene specificity of positive cDNA clones by Northern blot hybridization. Northern blot hybridization was pet-formed as described (Qian et a/., 1991). Positive cDNA clones were labeled with 32P by nick translation (Maniatis et al., 1975) and used as probes against viral dsRNA. DNA fragments were generated from clones of known gene specificity using PCR (GeneAmp, Perkin Elmer Cetus), radiolabeled with 32P by nick translation, and used as probes for further characterization of clones in the libraries by dot blot hybridization. Sequencing. Dideoxynucleotide sequencing was performed on both transcript RNA and cloned DNA templates (Sanger et a/., 1977). The sequence of cloned DNA was determined using a T7 DNA polymerase sequencing kit (Pharmacia). Synthetic oligonucleo-

AA..c.....

..C..T...C 850

R..R.....T 860

CTTATCAGAa TRTCTACRTC .T..A..... ..A..A.... 920 910 CRRACAACAC GATCATTTAA . . . . . . . ..R . . ..G..... 970 980 ATT‘ATTACA TCAATACIVLT . . . . . . ..T. .T........ 1030 1040 ‘MGGTTGGR TGCTGAGATb ..T....G.. . . . ..G....

.G..TC....

. . . ..G.... 870

880

G..T..T... 890

900

ATClULGTTTT GATAATTCAT TGTCACCATT AMT‘RTGGA ,,..,,.,.. . . ..A.C..C 950 960 930 940 AATARATGCA AAhRARTGGT GGACGATATT TTACACTATR ..G....... . . ..T..... . ..T..A... 1020 990 loo0 1010 TATRCP.AACT RTGACTCCGA GACRTCGGGC CRTTTRTCCA .G.....G.A . . . . . . ..c. A......... 1050 1060 CGCGT-MC.4 RGATCATGTG GCT T.........

FIG. 1. Complete nucleotide sequences of the gene 8 of Gp C porcine Cowden strain and the human 88-220 strain. The sequences correspond to the plus strand (mRNA sense) of the dsRNA in the 5’to 3’direction. The m-frame start codon beginning at nucleotide 49 and the stop codon beginning at nucleotide 1045 are underlined. Only nucleotides which differ are shown in the sequence of the human strain. The nucleotide (n) sequence of the gene 8 from the Cowden strain was determined from ssRNA transcripts (n l-l 039) and the following overlapping clones from the cDNA library: CA2-28 (n243-1063); CA4-6 (n589-1063); CAl-30 (n270-1063); and PC866 (nl-1062). The sequence of the gene 8 from the human 88-220 strain was determined from the following overlapping clones: HC8F23 (nl-1020); HC4-47 (n42-816); HC9-76 (n42-816); HC5-39 (nl 14-982); and two full-length clones (HC8F-1 and HC8F-3).

tides approximately 18 bases long were deduced from the cDNA insert sequences and used to initiate sequencing on viral transcript RNA in the presence of reverse transcriptase as described (Gorziglia et al., 1986). For initiation of sequencing at the 3’ end of the gene 8 transcript, an oligonucleotide (5’-AGCCACATGATClTG-3’) deduced from the sequence adjacent to the poly(A) tract was used. Construction of a full-length clone of the gene 8 of porcine or human Gp C rotaviruses. A full-length gene 8 cDNA of porcine or human Gp C rotavirus was produced using PCR (Larralde and Flores, 1990). Briefly,

564

QIAN ET AL.

130 IYIx*rxvs .I.....PT.

110 DNDSvcmTC ..IN......

150 160 SYNIWIPDS PlwSESTERI . . . . . . . . . . &zJ...P.

170 AEwILNvwnc

180 DDnNl,DIYTY . . . ..E....

190 200 210 220 230 240 E**G*DNLwA Am3sDcD”s” Cm,rrnmGI GCSPASTETI EVL D LA LLNVVDNVKH ..I..N.... . . . . . ..I.. . . . . ..s... ..“....... p .T R. 260 250 270 280 290 300 RIQnNT*SCK LWCIKOEIR LNT*hIRIST ssslDNsLoP LNDGQTTRSF KINAKKWWTI . . . .._. ,.N....... ,..,._,.,_ . . . . . ..P.. 320 310 330 FmfIDYINT IIPTHTPRHR ALYPEGumLR IA . ....... . . . . . . . . . .“.A......

FIG. 2. The deduced amino acid sequences of the gene 8 proteins of Cowden porcine and 88-220 human Gp C strains. The arrow indicates the potential cleavage site of the predicted protein. The potential N-linked glycosylation sites are boxed. Only amino acids which differ are shown in the sequence of the human strain.

an oligonucleotide (V-AGCCACATGATCTTGTTTACGC-3’) deduced from the 3’ end of the porcine gene 8 was used to initiate cDNA synthesis from the porcine or human Gp C gene 8 transcripts in the presence of reverse transcriptase prior to PCR. An additional oligonucleotide (5’-GCATTTAAAAAAGAAGAAGCTGT-3’) deduced from the 5’ end of the porcine gene 8 was used in the PCR. The amplified DNA fragment was purified on a 1.2% low melting point agarose gel (BRL) and chromatographed on a NACS column (BRL). The dsDNA ends were blunt ended with T4 DNA polymerase, ligated into the Smal site of the plasmid pTZ18, and transformed into DH 5 Q f. co/i cells as described above. In vitro transcription, translation and immunoprecipitation. For in vitro transcription and translation experiments, two porcine gene 8 clones were selected. One (PC8-38) contained a cDNA insert with the correct T7 promoter orientation as well as a full open reading frame, and the other (PC8-18) contained a full open reading frame in the opposite orientation. Transcript RNA was obtained in vitro using a Riboprobe System kit (Promega) following the manufacturer’s instruction. In vitro translation was performed using a nucleasetreated rabbit reticulocyte lysate in the presence or absence of canine microsomal membranes (Promega). One microgram of transcript RNA and 0.8 mCi of [35S]methionine/ml (>800 CVmmol; 1 Ci = 37 GBq) were added to 50 ~1of the reaction mixture. After incubation at 30” for 1 hr, the in vitro translation products were immunoprecipitated with either hyperimmune antiserum raised in gnotobiotic piglets against the porcine Cowden strain or piglet preimmunization serum and

analyzed in a 12% polyacrylamide gel containing 0.1% SDS (Laemmli, 1970). A Cowden virus-infected MA104 cell lysate containing radiolabeled viral polypeptides prepared as described previously was included in some experiments (Jiang et al., 1990). After electrophoresis, the gel was soaked in an enhancer (Enlightning, Du Pont) for 30 min, dried, and exposed to X-Omat film (Kodak) overnight. RESULTS Comparison of the gene 8 sequences. Several partial clones specific for the gene 8 of either porcine or human Gp C rotavirus were selected by Northern blot hybridization or dot blot hybridization from the gene libraries and sequenced. Because polyadenylation was employed in the cloning of the porcine virus from transcript RNA, the sequence of the 3’ end of the porcine gene 8 was considered to begin at the first base after the poly(A) tract. Initiating with an oligonucleotide

4 3 2 I 0 -1 -2

4 3 2 1 0 -1 -2

I 3 2 1 0 -1 -* -3

FIG. 3. Comparison of the hydropathicity profiles of the gene 8 protein of porcine and human Gp C rotaviruses and the VP7 glycoprotein of Gp A rotavirus by the method of Kyte and Doolittle (1982). Hydropathicity profiles of (A) the deduced gene 8 protein of Gp C porcine Cowden strain; (B) the deduced gene 8 protein of the human Gp C 88-220 strain; and (C)the VP7 glycoprotein of the Gp A porcine strain Gottfried. The nucleotide sequence of the VP7 gene of the Gottfried strain was reported previously (Gorziglia et al., 1988). Hydrophobic areas are shown above the horizontal axes and hydrophilic areas are below.

SEQUENCE

CONSERVATION

565

OF GENE 8

326

261 a

c 2

I

“, ,

196

c5

/ /‘. L’ 6i

1%

202

2&Q

Human Group C

68

135

Porcine

262

2&Q

Group C

FIG. 4. Amino acid identity between Gp C porcine and human rotavirus gene 8 proteins, and between the gene 8 protein of the Gp C Cowden strain and the VP7 protein of the Gp A porcine strain Gottfried illustrated by matrix plot analysis. (A) Plot of the human Gp C gene 8 protein vs the porcine Gp C gene 8 protein. (B) Plot of the porcine Gp C gene 8 protein vs the porcine Gp A VP7 protein. Matrix analysis was performed using the MacMolly computer program (Soft Gene, Berlin, Germany).

deduced from the sequence next to the poly(A) tract, the remaining nucleotide sequence of the Cowden strain gene 8 was determined using primer extension on the viral transcript RNA template and the cDNA inserts from the clones. The sequence of the last 24 nucleotides from the 3’ end of the gene was determined only from the cDNA clones. The sequence of the gene 8 of human Gp C was determined by sequence analysis of several full-length clones generated by PCR as well as partial clones from the human Gp C gene library. The complete nucleotide sequences of the gene 8 of the Cowden strain and the human Gp C rotavirus are shown in Fig. 1. The gene 8 of the porcine Cowden strain or the human Gp C rotavirus is 1063 nucleotides in length and contains a long open reading frame beginning at the 49th nucleotide from the 5’ end and terminating with a stop codon 16 bases from the 3’ end. For each gene, the first ATG at bases 49 through 5 1 is conserved (TAAAAG). An additional in-phase ATG is located at bases 124 through 126 in both gene 8s (AAAAAT). Neither sequence is consistent with Kozak’s strong initiation sequence (AXXmG) (Kozak, 1981). Beginning with the first ATG at the 49th nucleotide from the 5’end, the predicted protein contains 332 amino acids. The aligned sequence comparison showed that the first 79 nucleotides from the 5’ end and the last 22 nucleotides from the 3’ end are completely conserved between the gene 8 of porcine and human Gp C virus. Within the coding region, nucleotide substitutions are scattered with the exception of one clustered divergent region located at nucleotides 135 through 183. The overall nucleotide identity of these two genes is 83.8%.

Comparison of the predicted gene 8 proteins. The aligned amino acid sequences deduced from the porcine and human Gp C virus gene 8 showed extensive conservation between these two proteins (Fig. 2). The overall identity of the deduced amino acid sequences is 88.0%. The basic structural features of these two proteins appeared similar, as shown by the hydropathicity profile of Kyte and Doolittle (1982) (Figs. 3A and 3B). Each of the two in-phase initiation codons precedes a region of hydrophobic amino acids at the Nterminus of the protein. The structural similarity was also shown by the presence and location of cysteine and proline residues (1 1 cysteines and 11 prolines conserved in both strains). The amino acid identity of the porcine and human GpC gene 8 proteins is illustrated by matrix plot analysis (Fig. 4A). In general, the amino acid substitutions were scattered, with the exception of one clustered divergent region located at amino acids 31 through 45 which corresponded to the clustered nucleotide divergent region (135 through 183). This clustered amino acid divergent region is located in the second hydrophobic region. There were two conserved potential N-linked glycosylation sites (Asn-X-Ser/Thr) located at amino acid residues 67 through 69 and 225 through 227 in both gene 8 proteins. In the human rotavirus gene 8 protein there was an additional potential glycosylation site at amino acid residues 152 through 154. In vitro translation and immunoprecipitation. The translation products produced in vitro from transcripts derived from porcine gene 8 clones PC8-38 (correct T7 promoter orientation) and PC8-18 (opposite orientation) are shown in Fig. 5A. The lane containing the product obtained by in vitro translation of transcripts from

566

QIAN ET AL.

abcdefghi

M

69

-

46

-

30

-

21.5-

14.3.

A

B

FIG.5. Polyacrylamide gel electrophoresis analysis of radiolabeled in vitro translation products from transcripts generated by in vitro transcription of the gene 8 clone of the porcine Gp C Cowden strain. Clone PC8-38 (contains nucleotides 13 to 1059) was in the correct T7 promoter orientation and clone PC8-18 (contains nucleotides 4 to 1061) was in the opposite orientation. Samples were subjected to electrophoresis in a 12% polyacrylamide gel, and the bands were visualized by fluorography. (A) 35S-labeled translation products from the reactions in the presence (with) or absence (w/o) of canine microsomal membranes from (lane a) clone PC8-18 (w/o); (lane b) clone PC8-18 (with); (lane c) clone PC8-38 (w/o); (lane d) clone PC8-38 (with); (lane f) clone PC8-38 (w/o) precipitated with hyperimmune serum to porcine Gp C rotavirus; (lane g) clone PC8-38 (with) precipitated with hyperimmune serum to porcine Gp C rotavirus; (lane h) clone PC8-38 (w/o) precipitated with preimmune serum; and (lane i) clone PC8-38 (with) precipitated with preimmunization serum. ‘Ylabeled molecular weight markers (Amersham) are shown in lane e. “M” on the left indicates the molecular weight (kDa) of the molecular weight marker and on the right indicates the molecular weight of the in vitro translation product. (B) Comparison of the in vitro translation products from the PC8-38 clone with the 35S-labeled lysate of MA104 cells infected with the Cowden strain. “M” on the left indicates the molecular weight of the “C-labeled molecular weight markers (lane a) and on the right the molecular weight of the 33-kDa outer capsid protein from the porcine Gp C strain Cowden in an infected MA-1 04 cell lysate (lane C) identified by Jiang et al. (1990). Lane b contains the in vitro translation products from the clone PC8-38 prepared in the absence of canine microsomal membranes.

clone PC8-38 in the absence of canine microsomal membranes showed two bands with approximate molecular weights of 34.5 and 37.0 kDa (lane c). In the presence of canine microsomal membranes in the translation reaction of clone PC8-38, three bands with approximate molecular weights of 34.5, 37, and 39 kDa were observed (lane d). No polypeptide product was detected from clone PC8-18 in the presence or absence of microsomal membranes (lanes a and b, respectively). The translation products from clone PC8-38 obtained either in the absence or in the presence of canine microsomal membranes were immunoprecipitated with hyperimmune serum (lanes f and g,

respectively), whereas these products were not precipitated with preimmune serum (lanes h and i, respectively). The 34.5-kDa in vitro translation product from the clone PC8-38 obtained in the absence of microsomal membranes was consistent in size with an outer capsid protein of 33 KDa identified in a radiolabeled lysate of MA-l 04 cells infected with the Gp C Cowden strain (Fig. 5B). Comparison of the gene 8 protein of Gp C with the l/f7 of Gp A rotavirus. Comparison of the gene 8 protein of the porcine Gp C rotavirus strain Cowden with the outer capsid glycoprotein VP7 of the porcine Gp A rotavirus strain Gottfried revealed striking similarities in structure and organization between these proteins, although overall amino acid identity was low (less than 30%). The hydrophobicity profile of the Gp A VP7 was similar to that of the Gp C gene 8 protein (Fig. 3C). However, the matrix plot comparing the amino acid sequence of the gene 8 protein of the Gp C porcine strain with the VP7 of Gp A porcine strain Gottfried demonstrated considerable divergence in amino acid identity (Fig. 4B). The two in-phase initiation codons, each of which precedes a hydrophobic region at the N-terminus of the gene 8 protein of the Gp C strains, are conserved in all Gp A VP7 genes examined (Estes and Cohen, 1989). A glutamine at amino acid residue 51 of the Gp AVP7 is conserved in all strains described thus far and has been identified as a cleavage site in the protein (Estes and Cohen, 1989; Stirzaker et al., 1987). This glutamine is conserved at amino acid residue 50 in both human and porcine gene 8 proteins (Fig. 2). The potential N-linked glycosylation sites in the gene 8 protein of the Gp C strains are also similar in location to those of the VP7 of Gp A rotavirus. In addition, when the aligned amino acid sequences of the gene 8 protein of the Gp C strain Cowden and the VP7 of Gp A porcine strain Gottfried were compared, several conserved amino acid regions were observed (Fig. 6). Among these are amino acid residues 105 through 1 15 in the VP7 of Gottfried strain with amino acid residues 110 through 120 of the gene 8 protein of Cowden strain; 135 through 142 in Gottfried with 140 through 147 in Cowden; 156 through 207 in Gottfried with 161 through 212 in Cowden; and 288 through 304 in Gottfried with 293 through 309 in the Cowden strain. Of interest, all these conserved regions were located in regions of amino acid sequence in the VP7 of Gp A rotaviruses that are conserved among different serotypes (Estes and Cohen, 1989). DISCUSSION The gene coding assignments for Gp C rotavirus have not been established. However, the sequence analysis of individual gene segments should provide

SEQUENCE

CONSERVATION

567

OF GENE 8

166 207 ADLILNEWLCNPMDITLYYYQQTGEANKWISMGSSCTVKVCPLNTQTLGIGC AEWILNVWFCDDMNLDIYTYEQTGIDNLWAAFGSDCDVSVCPLDTTMNGIGC 212

161

1

A

B

C

326

FIG. 6. Location of conserved amino acid regions between the Gp C Cowden gene 8 protein and the Gp A Gottfried VP7 depicted on a schematic representation of the Gp A VP7. Regions defined as divergent in the ammo acid sequence among VP7 proteins from different serotypes of Gp A rotaviruses are indicated with a solid box. The stippled line represents highly conserved amino acid sequences among Gp A VP7s. A, B, and C are antigenic sites where serotype-specific epitopes have been identified in the Gp A VP7 (Dyall-Smith eta/., 1986). The open boxes contain comparisons of amino acid sequences of the Gp A VP7 (upper line) which are conserved with amino acid sequences of the Gp C gene 8 protein (lower line). The conserved amino acrd regions between the Gp A and the Gp C proteins are numbered and identical amino acids are in bold

information relating to the encoded protein function when comparisons are made with Gp A rotaviruses (Bremont et a/., 1990). The gene segment 7, 8, or 9 of Gp A rotavirus (depending on the strain) encodes the outer capsid glycoprotein VP7 which has been identified as one of the proteins of Gp A rotaviruses involved in virus neutralization (Estes and Cohen, 1989). A 39or 37-kDa glycoprotein had previously been identified in the outer capsid of purified Gp C Cowden virus particles in structural protein studies; this protein was analogous in size and location to the VP7 glycoprotein of Gp A rotavirus (Bremont et al., 1988; Jiang eta/., 1990). The structure and organization of the predicted gene 8 protein of the human and porcine Gp C strains in this study resemble those of the VP7 protein of Gp A rotavirus which is consistent with a coding assignment of gene 8 for VP7 in these two Gp C strains. In addition, in vitro translation of transcripts generated from a gene 8 clone in the presence of canine microsomal membranes suggested that the Cowden gene 8 encodes a glycoprotein of similar size to the Gp A VP7. It is of interest that the in vitro translation products from the porcine gene 8 clone contained two bands. It is possible that these two proteins resulted from initiation at the two in-frame start codons. A similar observation has been made from in vitro translation studies of the Gp A VP7 gene (Chan et a/., 1986, Stirzaker et a/., 1987). In the presence of canine microsomal membranes, an additional translation product from the porcine gene 8 clone was detected with a molecular weight of 39.0 kDa, which was consistent with modification by glycosylation and with the size reported by Bremont et al. (1988) for a 39-kDa outer capsid glycoprotein identified in purified virions.

In general, among VP7s from different Gp A serotypes, the overall amino acid sequences are conserved. However, several discrete regions of variable amino acid sequence have been defined upon alignment of the VP7 sequences from different serotypes (Gunn et al., 1985; Green et al., 1989). It is thought that these variable regions are involved in the determination of serotype specificity. Only one clustered amino acid variable region was found between the two deduced Gp C gene 8 proteins but it was located in a hydrophobic area of the protein which, if analogous to the Gp A VP7, is not likely involved in the antigenic structure because it is removed by cleavage (Stirzaker et a/., 1987). Thus, the overall 88% homology between the human and porcine Gp C gene 8 proteins was consistent with shared serotype specificity. However, further studies will be required to confirm the antigenic relationships among Gp C rotaviruses. Previous studies showed the lack of genetic relatedness among rotaviruses from different groups (Bridger et al., 1986; Eiden et al,, 1986; Qian et a/., 1991; Saif and Theil, 1985; Saif, 1990). The data in this study demonstrate that although the overall nucleotide identity is low, there are some structural similarities between the corresponding VP7 genes of Gps A and C. Furthermore, regions of amino acid conservation between the gene 8 protein of Gp C and the VP7 of Gp A rotaviruses were identified and these were located in conserved regions of the VP7 of Gp A rotaviruses. It is possible that these regions provide a common essential function. For example, amino acid residues 133 to 146, which are conserved in all Gp A VP7 proteins examined, were reported to share similarities with Cap-binding sites of other proteins (Estes and Cohen,

568

QIAN ET AL.

1989). A similar region of amino acid sequence was found in the gene 8 proteins of the Gp C strains in this study. It has been proposed that human Gp C rotaviruses were originally derived from pigs through interspecies transmission (Penaranda et a/., 1989). Based upon the sequence analysis in this study, it is likely that the human and porcine Gp C rotaviruses did indeed evolve from a common ancestral source. The data in this study also suggest that Gps A and C rotaviruses may have evolved from a more distant common ancestral source. Additional sequence analysis may yield further insight into evolutionary relationships. Whether Gp C rotaviruses are only of minor importance as etiologic agents of human disease and will remain so or whether they are emerging as pathogens is not clear. Molecular studies should allow the development of reagents to address a number of important epidemiological issues. In addition, molecular studies of rotaviruses that are morphologically similar but antigenitally diverse may lead to a better understanding of mechanisms that are responsible for the important role of rotaviruses in diarrhea1 disease.

ACKNOWLEDGMENTS We extend our appreciation to Drs. Albert Z. Kapikian and Robert M. Chanock for advice and critical review of the manuscript; to Dr. Jerry Keith for assistance with the computer analysis; to Mr. Myron Hill and Dr. Peter Collins for synthesis of primers; and to Johnna Sears for assistance in preparation of the photographs.

REFERENCES BEARDS, G. M., DESSELBERGER,U., and FLEWE~, T. H. (1989). Temporal and geographical distributions of human rotavirus serotypes, 1983-l 988. J. C/in. Microbial. 27, 2827-2833. BOHL, E. H., SAIF, L. J., THEIL, K. W., AGNES, A. G., and CROSS, R. F. (1982). Porcine pararotavirus: Detection, differentiation from rotavirus, and pathogenesis in gnotobiotic pigs. J. C/in. Microbial. 15, 312-319. BONSDORFF, CH. V., and SVENSSON, L. (1988). Human serogroup C rotavirus in Finland. Stand. J. Infect. Dis. 20, 475-478. BREMONT, M., COHEN, J., and MCCRAE, M. A. (1988). Analysis of the structural polypeptides of a porcine group C rotavirus. 1. Viral. 62, 2183-2185. BREMONT, M., CHABANNE-VAUTHEROT, D., VANNIER, P., MCCRAE, M. A., and COHEN, J. (1990). Sequence analysis of the gene (6) encoding the major capsid protein (VP6) of group C rotavirus: Higher than expected homology to the corresponding protein from group A virus. Virology 178, 579-583. BRIDGER,J. C., PEDLEY, S., and MCCRAE, M. A. (1986). Group C rotaviruses in humans. J. C/in. Microbial. 23, 760-763. BROWN, D. W. G., MATHAN, M. M., MATHEW, M., MARTIN, R., BEARDS, G. M., and MATHAN, V. I. (1988). Rotavirus epidemiology in Vellore, South India: Group, subgroup, serotype, and electropherotype. 1. C/in. Microbial. 26, 241 O-2414. CAUL, E. O., ASHLEY, C. R.. DARVILLE,1. M., and BRIDGER,J. C. (1990).

Group C rotavirus associated with fatal enteritis in a family outbreak. J. Med. Viral. 30, 201-205. CHAN, W.-K., PENARANDA, M. E., CRAWFORD.S. E., and ESTES, M. K. (1986). Two glycoproteins are produced from the rotavirus neutralization gene. Virology 151, 243-253. CHEN, G., FAN, R., Guo. X., and HUNG, T. (1988). Group C rotavirus found in sporadic diarrhea in China. J. Exp. C/in. Viral. 2, 1-3. [in Chinese with English abstract]. DIMITROV, D. H., SHINDAROV,L. M., and RANGELOVA,S. (1986). Occurrence of antigenically distinct rotaviruses in infants in Bulgaria. Eur. J. C/in. Microbial. 5, 47 l-473. DYALL-SMITH, M. L., LAZDINS, I., TREGEAR,G. W., and HOLMES, I. H. (1986). Location of the major antigenic sites involved in rotavirus serotype-specific neutralization. Proc. Nat/. Acad. Sci. USA 83, 3465-3468. EIDEN, J., VONDERFECHT,S., THEIL, K., TORRES-MEDINA, A., and YOLKEN, R. H. (1986). Genetic and antigenic relatedness of human and animal strains of antigenically distinct rotaviruses. /. Infect. Dis. 154,972-982. ESTES, M. K.. and COHEN, J. (1989). Rotavirus gene structure and function. Microbial. Rev. 53, 410-499. GORZIGLIA.M., HOSHINO, Y., BUCKLER-WHITE,A., GLASS, R., FLORES,J., KAPIKIAN,A. Z., and CHANOCK, R. M. (1986). Conservation of amino acid sequence of VP8 and cleavage region of 84.kDa outer capsid protein among rotaviruses recovered from asymptomatic neonatal infection. Proc. Nat/. Acad. Sci. USA 83, 7039-7043. GORZIGLIA, M., NISHIKAWA, K., GREEN, K., and TANIGUCHI, K. (1988). Gene sequence of the VP7 serotype specific glycoprotein of Gottfried porcine rotavirus. Nucleic Acids Res. 16, 775. GREEN, K. Y., HOSHINO, Y., and IKEGAMI,N. (1989). Sequence analysis of the gene encoding the serotype-specific glycoprotein (VP7) of two new human rotavirus serotypes. Virology 168, 429-433. GUNN, P. R., SATO, F., POWELL, K. F. H., BELLAMY, A. R., NAPIER,J. R., HARDING, D. R. K., HANCOCK, W. S., SIEGMAN, L. J., and BOTH, G. W. (1985). Rotavirus neutralizing protein VP7: Antigenic determinants investigated by sequence analysis and peptide synthesis. J. Virol. 54,791-797. HUNG, T., WANG, C., FANG, Z., CHOU, Z., CHANG, X., LIONG, X., CHEN, G.. YAO, H.. CHAO, T., YE, W., DEN, S.. and CHANG, W. (1984). Waterborne outbreak of rotavirus diarrhea in adults in China caused by a novel rotavirus. Lancet 1, 1139-l 142. ISHIMARU,Y., NAKANO, H., OSETO, M., YAMASHITA, Y., KOBAYASHI,N., and URASAWA, S. (1990). Group C rotavirus infection and infiltration. Acta Paediatr. Jpn. 32, 523-529. JIANG, B. M., SAIF, L. J., KANG, S.-Y., and KIM, 1. H. (1990). Biochemical characterization of the structural and nonstructural polypeptides of a porcine group C rotavirus. 1. viral. 64, 3171-3178. KAPIKIAN, A. Z., FLORES,J., MIDTHUN, K., HOSHINO, Y., GREEN, K. Y., GORZIGLIA,M., NISHIKAWA,K., CHANOCK, R. M., POTASH, L. P., and PEREZ-SCHAEL,I. (1989). Strategies for the development of a rotavirus vaccine against infantile diarrhea with an update on clinical trials of rotavirus vaccines. Adv. Exp. Med. Biol. 257, 67-89. KAPIKIAN, A. Z., and CHANOCK, R. M. (1990). Rotaviruses. In “Virology” (B. N. Fields et a/., Eds.), 2nd ed., pp. 1353-1404. Raven Press, New York. KOZAK, M. (1981). Possible role of flanking nucleotide in recognition of the ATG initiator codon by eukaryotic ribosomes. Nucleic Acids Res. 9, 5233-5252. KYTE, J., and DOOLIITLE, R. F. (1982). A simple method for displaying the hydropathic character of a protein. 1. Mol. Biol. 157, 105-l 32. LAEMMLI, U. K. (1970). Cleavage of structural proteins during the assembly of the head of bacteriophage T4. Nature 227,680-685. LARRALDE,G., and FLORES,J. (1990). Identification of gene 4 alleles among human rotaviruses by polymerase chain reaction-derived probes. Virology 179, 469-473.

SEQUENCE

CONSERVATION

MANIATIS, T., JEFFREY,A., and KLEID, D. G. (1975). Nucleotide sequence of the rightward operator of phage lambda. Proc. Nat/. Acad. Sci. USA 72, 1184-l 188. MATSUMOTO, K., HATANO, M., KOBAYASHI. K., HASEGAWA,A., YAMAZAKI, S., NAKATA, S., CHIBA, S., and KIMURA, Y. (1989). An outbreak of gastroenteritis associated with acute rotaviral infection in schoolchildren. J. Infect. Dis. 160, 6 1 1-6 15. NAGESHA, H. S.. HUM, C. P., BRIDGER,1. C., and HOLMES, I. H. (1988). Atypical rotaviruses in Australian pigs. Arch. viral. 102, 91-98. PENARANDA, M. E., CUBII-~, W. D.. SINARACHATANANT, P., TAYLOR, D. N.. LIKANONSAKUL,S., SAIF, L., and GLASS, R. I. (1989). Group C rotavirus infections in patients with diarrhea in Thailand, Nepal, and England. J. Infect. Dis. 160, 392-397. QIAN, Y., SAIF, L. J., KAPIKIAN,A. Z.. KANG, S. Y.. JIANG, B., ISHIMARU,Y., YAMASHITA, Y., OSETO, M., and GREEN, K. Y. (1991). Comparison of human and porcine group C rotaviruses by Northern blot hybridization analysis. Arch. Viral. 118, in press. SAIF, L. J., BOHL, E. H., THEIL, K. W., CROSS, R. F., and HOUSE, J. A. (1980). Rotavirus-like, calicivirus-like and 23-nm virus-like particles associated with diarrhea in young pigs. 1. C/in. Microbial. 12, 105111.

OF GENE 8

569

SAIF. L. J., and THEIL, K. W. (1985). Antigenically distinct rotaviruses of human and animal origin. In “Infectious Diarrhoea in the Young: Strategies for Control in Humans and Animals” (S. Tzipori, Ed.), pp. 208-2 14. Amsterdam: Elsevier, Amsterdam/New York. SAIF, L. J. (1990). Nongroup A rotaviruses. ln “Viral Diarrheas of Man and Animals” (L. 1. Saif and K. W. Theil, Eds.), pp. 73-95. CRC Press, Boca Raton, FL. SANGER, F., NICKLEN, S., and COULSON, A. R. (1977). DNA sequencing with chain-terminating inhibitors. froc. Nat/. Acad. Sci. USA 74, 5463-5467. SIPPEL, A. E. (1973). Purification and characterization of adenosine triphosphate: Ribonucleic acid adenyltransferase from Escherichia coli. Eur. J. Biochem. 37, 3 l-40. STIRZAKER,S. C., WHITFELD, P. L., CHRISTIE,D. L., BELLAMY,A. R., and BOTH, G. W. (1987). Processing of rotavirus glycoprotein VP7: Implications for the retention of the protein in the endoplasmic reticulum. J. Cell Biol. 105, 2897-2903. USHIJIMA, H., HONMA, H., MUKOYAMA, A., SHINOZAKI, T., FUJITA, Y., KOBAYASHI, M., OHSETO, M., MORIKAWA, S., and KITAMURA, T. (1989). Detection of group C rotavirus in Tokyo. 1. Med. Viral. 27, 299-303.