A gene-sized DNA molecule encoding the catalytic subunit of DNA polymerase a in the macronucleus of Oxytricha nova

A gene-sized DNA molecule encoding the catalytic subunit of DNA polymerase a in the macronucleus of Oxytricha nova

Gene, 144 (1994) 1555161 0 1994 Elsevier Science B.V. All rights reserved. 0378-l 119/94/$07,tX~ 155 GENE 07946 A gene-sized DNA molecule encoding...

948KB Sizes 0 Downloads 55 Views

Gene, 144 (1994) 1555161 0 1994 Elsevier Science B.V. All rights reserved.

0378-l 119/94/$07,tX~

155

GENE 07946

A gene-sized DNA molecule encoding the catalytic subunit of DNA polymerase a in the macronucleus of Oxytricha nova (Hypotrichous

ciliates; gene structure;

Sam J. Mansour,

DNA replication;

David C. Hoffman

molecular

evolution;

non-universal

genetic code)

and David M. Prescott

Department qfA4olecu/ar, Cellular and Developmental Biology, University of Colorado, Boulder, CO 80309-0347, USA Received by A.-M. Skalka: 20 December

1993; Revised/Accepted:

4 February/7

February

1994; Received at publishers:

10 March

1994

SUMMARY

We have isolated

a gene-sized

molecule

encoding

the catalytic

subunit

of DNA polymerase

c1 from a macronuclear

genomic library of Oxytricha nova, by using a 0.7-kb fragment of the corresponding human gene as a hybridization probe. Two different versions of the gene are present in the macronucleus, one with an EcoRI site (RI+) and one without an EcoRI site (RI-). The cloned RI- version has been characterized. It is 4938 bp in length, excluding telomeres. It consists of a 329-bp 5’ leader, a 4479-bp coding region and a 130-bp 3’ trailer. The deduced amino-acid sequence shares conserved regions with the yeast and human polypeptides. We also demonstrate by Southern analysis that gene-sized molecules of similar size, homologous to the isolated 0. noua gene are present in the mat genome of closely and distantly related hypotrichs.

INTRODUCTION

Hypotrichous ciliates contain two types of nuclei, a micronucleus (mic) and a macronucleus (mat). The mic genome contains typical chromosomes with hundreds of genes per chromosome. It serves as the germline genome. The mat genome contains millions of gene-sized molecules (0.4-15 kb), each with a single coding function (Klobutcher and Prescott, 1986). Seventy-four molecules have been characterized in various species of the hypotrichs Oxytricha, Stylonychia and Euplotes. These moleCorrespondence to: Dr. D.M. Prescott, University 492-8381;

of Colorado, Fax (l-303)

Boulder, 492-7744;

Department

CO 80309-0347,

of MCD

Biology,

USA. Tel. (l-303)

e-mail: [email protected]

Abbreviations: aa, amino acid(s); bp, base pair(s); E., Euplotes; EtdBr, ethidium bromide; kb. kilobase or 1000 bp; mat, macronucleus( mic, micronucleus( nt, nucleotide(s); O., Oxytricha; ORF, open reading frame; l?, Paramecium; polcc, DNA polymerase(s) a; RI’, version of polcc containing an EcoRI site; RI-, cloned version of polcl without an EcoRI site; S., Stylonychia; SC, Saccharomyces cerevisiae; SDS, sodium dodecyl sulfate; SSC, 0.15 M NaCl/O.OlS M Naacitrate pH 7.6; T., Tetrahymena. SSDI 0378-l

119(94)00157-N

cules conform to one structural form with a 5’ leader, a coding segment and a 3’ trailer. All molecules are flanked by telomeric sequences (Klobutcher and Prescott, 1986). Because of the single coding function, macronuclear DNA molecules are referred to as gene-sized molecules. After cell mating, a mic genome is converted to a new mat genome in which various mic DNA sequences are cut, spliced, eliminated, reordered and amplified to produce gene-sized molecules to which are added telomeric repeats (Klobutcher and Prescott, 1986). In the mature mat of 0. nova, each different gene-sized molecule is present on average in 1000 copies. Characterization of the gene-sized molecule encoding the large catalytic subunit of DNA polymerase CL(pola) in the mat of 0. noua is the subject of this paper. This work is the foundation for a comparison of the mic and mat versions of the gene to be described in a later paper. We chose pola because it is twice the size of the largest scrambled gene identified to date, and it did prove to show a more complex scrambled pattern in the mic than the shorter genes studied. The gene encoding polcl has been cloned from six other eukaryotic species

156 (Braithwaite and Ito, 1993; White et al., 1993). The holoenzyme is composed of four subunits, the largest of which (studied here) is a 160-180-kDa protein that is directly involved in the catalysis of DNA synthesis (Wang, 1991; So and Downey, 1992). The aa sequence comparisons show the presence of several conserved regions dispersed along the polypeptide (Wang et al., 1989; White et al., 1993). In addition to sequencing the large catalytic subunitencoding gene and comparing its putative product to the subunit of other eukaryotes, we have compared the genesized molecules encoding the subunit in several hypotrich species.

RESULTS

1

b

2

1

a

6.5 -

6.5

4.3 -

4.3

2

2.3 2.0 -

AND DISCUSSION

(a) Cloning of the mat gene

The polcl of 0. nova shows a high degree of aa identity in twelve conserved regions (designated I-VII and A-E) associated with DNA polymerases in general and a-type polymerases in particular (see section e below, and Fig. 3). Using a 0.7-kb probe corresponding to the portion of the human pola-encoding gene for regions VI-IIII-C-VII-V (the HindIII-BamHI fragment from pcD-KB pola, a cDNA clone kindly provided by Teresa S.-F. Wang), we probed blots of native 0. now mat DNA by hybridization. ~acronuclear DNA ranges in size from 0.4 kb to 15 kb (Fig. la, lane 1). In Southern analysis, the human probe hybridizes to a 5.0-kb gene-sized molecule (Fig. la, lane 2). Hybridization of the human probe to EcoRI-cut mat DNA detected two molecules, one of 5 kb and one of 2.3 kb (data not shown). Analysis of the pattern indicates that two versions of the pola-encoding gene are present, one that lacks EcoRI sites (RI-) (5-kb molecule) and one with at least a single EcoRI site (RI ‘), which generated the 2.3-kb fragment. No hybridization to other fragments was observed for the RI+ version, indicating that either the probe does not span the EcoRI site, or that the probe does not share significant sequence identity with more than the single 2.3-kb fragment. Subsequent hybridization of the full-length cloned mat pola-encoding gene to EcoRI-cut mat DNA showed three bands, one at 5.0 kb (full-length gene-sized molecule without an EcoRI site) and two at 2.7 kb and 2.3 kb (Fig. lb, lane 2) (the two fragments produced by cutting at a single EcoRI site). (b) Characteristics product

of the mat gene and its encoded

Screening a mat genomic library of 0. nova yielded a single positive h clone designated DC8 Sequencing showed that the insert in the DC8 clone is 4978-bp long

Fig. 1. Southern

analysis

0.7-kb HindHI-BamHf EcoRI insert in DC8 stained

agarose

of 0. notru mat

DNA

probed

with (a) the

fragment of the human gene, and (b) the 5.0 kb Methods: (a) lanes: 1, Native 0. nooa mat (EtdBr

gel). 2, Native

0. nou~l mat (Southern

blot). Native 0.

noau mat DNA (3 ug) was electrophoresed in a 1% agarose gel and stained with EtdBr, after which the DNA was blotted onto Zetabind membrane (Life Science Products, Denver, CO, USA) (Ausubel et al., 1987). The blot was prehybridized at 42-C for 6 h in 20% formamide~3

x SSCj5 x Denhardt’s

solution/i

mM N~pyrophosphate/

1% SDSjlOtI ug/per ml of E. coli tRNA, and then incubated at 42°C for 19 h with the 32P-labeled human probe (Random primed DNA labeling

kit, US Biochemical,

Cleveland,

OH, USA) in the presence

of

5% dextran sulfate. The membrane was washed twice at room temperature for 15 min in 2 x SSC/O.l % SDS, followed by two similar washes at 60°C. Hybridization

signals were visualized

by autoradiography

with

a 21 h exposure period. (b) Lanes: 1, Native and 2, EcoRi digested. Macronuclear DNA (3 ug) was blotted onto Zetabind. prehybridized at 65°C in

I M NaCI/SO mM Tris 8.0/l mM EDTA/O.l%

SDS/ZOO pig

per ml sheared salmon sperm DNA, and then incubated at 65°C for 19 h with the 32P-labeled 5.0-kb insert in DC8. The blot was washed under stringent conditions at 65 ‘C in 0. I x SSC/O.l% SDS. The autoradiograph was developed after a 19-h exposure. Size markers are Hind111 digests of h phage DNA (US Biochemical).

Lengths

are indicated

in kb.

(Fig. 2b). Telomeric repeat sequences (20 bp each) occur at the ends of the insert, indicating the h clone contained a complete gene-sized molecule. The gene-sized molecule contains a 5’ leader of 329 bp extending from the inner end of the 5’ telomere to the ATG start codon (nt 350), and a 3’ trailer of 130 bp extending from the TGA stop codon (nt 4828) to the inner end of the 3’ telomere. The assigned ATG start codon is the fourth ATG triplet from the 5’ end, and is the only one followed by a long ORF. It defines an ORF that begins with an aa region that is conserved at the N terminus in the large subunit of four polcl proteins (Fig. 3, region A) (White et al., 1993). The 5’ leader lacks TATAA, CCAAT and GC boxes.

157

a) sense strand

anti-sense strand

b)

. . . . . ccccaaaaccccaaaaccc~tctagtt=~t=~~=g~g==~t=g=~~tt~=t~g~tt==~g=tttt~~ .

.

.

.

.

.

120

cattttgaacttaaaatagagtatgagcgacaactatactatactactgccc~tgcagagaagtaaagaaaggtggtaaaggagaagttgttaaaaagggattagcagctttgagagcagccaga gaaggtggag~caag~gaaCtg~ct~gtac~C~gta~gtat~~~gcattaatgCtaat~atctttctga~tggCttttc~aaaat~ttgcaacatcatcctcctttctcATGA~~CA MIKHl

240 360

CATTTTATAGGTIGAAGAAAC

480

AAAMAGA’ICTITGAAGAGATAGA~ACMCGAATA~AMAAA

rrTAAGATTrCCGCAAAAA~~A~~~A~~A~~A~ATA

ILQVEETKKIFEEIDDNEYEKIQDSRKNDDFIVDDDGYGY44 TCGCGATCACGGAGGAGAAA~~CAGAGA'PGOAGCC RDHGGEIWDRDGDVEEVGKKKKKKQNNHDPNENIMNYMMP84

600

‘PGCAAAAGATGTAGA~AC~M~~~M~~GC

840

mAGAGTCAAGAGAAGAGCAAGAGCGCGCCGTCGAGA LESREEQERRRQSEQLKQQANIGQNQSDVNPFSKKRKLDE204

960

GlTCTAGCAAGTTCAAGCT~~CATAACT FQQVQANSYQSKQNSHSVSKSKPGDHEMANHADGVDLNLL244

1080

TGCTA?TGACGACACTAAGATGACAGATAGTCACCCTAGC AIDDTKMTDSHPSEIITQNQRASAVSSVNQQIMESTNGKN284

1200

CCAGCTCGAGAAAAA CGATACCGAATOGCMTAGATGAAGGAG~~~C~~~~A~AC~~G~~C~T~~A~~~TM~~C~GACTA~C~~C QLEKNDTEWQQMKEKNAVFNQDLRHNDNALMSNQQDYPLP324

1320

~~GA~AACTCTTTCrrPCTA~~TAGAT LNEDGTLSFYWIDAHEENNGADLFVFGKIYQHEERKFVSC364

1440

CTCTATCAAAGTGAATGGTATGCAAAGAGAOCTATTPGTTA ELFVLPKMSGKSRAAMTTEEEKEQARKVMME404 SIKVNGMQR

1560

ACTTGAAGGAGTAGAAAGAGATTCCAGCTATCACTA LEGVRKRFPAITKWRCKPVTRKYAFELPIQHGEHQFVKIK444

1680

CAAAAACA~PG?TCG~'CAAGAAATGAACAGACCAGTI(;GTCAC AKDVDELEDVNEAKNMFVQEMNRPVAFNKEEDFNNRYSVT164

ATACGA~CAACCTTCCCTTCrrrrCCTAGCACAGTACAG~C YDATFPSLPSTVQGNTFECIFGA

NQSMLESFI

L

K

R

K

I

R

G

1800 P484

A~C~ATGACTATTAGAAACCCTCAAAAGGTAAC? CWMTIRNPQKVTDFRRTWCKQEILVSNPKDIEITLDDLNK524

1920

AACTGAGCTACCTCCTTTGACCTCTGTCACATPTGCCATAG TELPPLTSVTFAIKTCRSSQNTNEIAHLSCIVNENISQEG564

2040 GCCTATGCCAATIGAATAlGAGAGAGCTTl'CAGAGATAAGAAAGAT'I'XT'TCATTTAG~CTT PIEYERAFRD KK DSFIQFF604

TCC~CAAAAATPGA~'PCCACAAA?rA?TCACmACTC PSKIDVHKSFTLLRKLDGKPM CTAACATGAGAGACAACTTAT?GAAGCC~G~CTAAA QHERQLIEAFVAKIYQLDPDLMVAHNLCGGMFD

L

L

L

A

R

I

2160

2280 Q644

GA~CTCAAAATTAGTCAC~G~CAG~~~GCCT~G~~CT~~CCT~C~~~AGA~~GC~AGCC~~A~AGG~~~~~~CC~~T~GT PNKKSDQSGANYGGSQWIPRQV684 HLKISHWSRIGRLKKNQI

2400

TACTPGTGGAAGATPGTTAGTTGATACTPTCTPGACTGCC~~GC~A~A~~~CT~CTA~C~ACCCA~TAGCT~~TA~C~T~G~TA~~AC~A RETNYDLTHLAKVQLQKDRIDFD724 TCGRLLVDTFLTAKELI

2520

TGA~ACCmTGCCAACTTA~TCC~~C~C~~~C~~~ACAC~AG~GACGCCTATCTCAC~~T~~~~~ACC~TAGG~A~C~T LFQLIDHTEKDAYLTLQLMNHLQVIPL764 DDLLPTFYVQMAK

2640

GACTAAGCMCTCACACA~T~~~~A~~~~~CGCCA~GC~~G~~~C~TAC~CAC~G~~G~~G~~~ACC TKQLTNIAGNLWFRSLQNARAERNEHLLLHEFKKKKFVLP804

2760

AGACAAAAAGCAACTTAACGCCAAAGA~~G~G~CA~~CA~~AGTACGM~~G~~~~~~G~~~GA~G~GCAGCGTA~CAG~~~T EYEEGDGKTKGGKRKKAAYAGG DKKQLNAKDLKKNMFAD TGTGA~CCCAAffiCTGG'ITTrTATGATAATATCATC VIEPKAGFYDNII

2880 L 844 3000

IQEYNLCFTTVNRRP884

LLLDFNSLYPSI

TACCAAGAACTTCG4~GC~~~~G~CTAGTAC~G~~~~~~~GAGGTAGATA~~~GC~~~CC~AC~~~T~C~G~~C ENGEEEVDIEEADLPDKNVNLKDA924 TKNFDGSEMKNQYKKG AG'PXlTCCCATGGT-lT%AGAGACCTCGTCTiUAAGAGAAAAGC~l-CAAGGACAAGAnXAAAC VKDKMKTEKDHVKLSQLE VLPMVLRDLVQKRKA

lWdAAAGA%!ACGTCAAGTTATCGCAATTAGAAA!FZAGATAAAAGGCTATTAA I R Q K A I

3120 3240 K964

158 3360 1004 tGCAGAAAAT~GCTTGGATPCAATGTAGTGTATGGn;ATACTT AENKLGFNVVYGDTDSIMINTGSNQLQQALEMGKRLKGEV

3480 1044

~GAAGTATGCAGCTCTG?diGTAl'JAAAACTICCTAAGTCCAGCTGAAGT AAACTGC'lTGTACAAATGCCTlGAAATAGAUiTAGATGGAGTTTIl'AAGAGlT'FXTCTTA NCLYKCLEIEIDGVFKSLLLLKKKKYAALKYENFLSPAEV

3600 1084 3720 1124 3840 1164

3960 RLRDQGKSENQLVNNFIPYVICQQTYGDT1204 IKGQPHVAVAK * . . . . . . . . . 0 . TACCAAATCAAGCACAGCACTCTCAGATAAGGC~ACCACCCAGAXiAAGTGATrAGCTCAAGAGGAAAAGTGACTATAGATAGl?.3AC'lWZTATG?GTCTACCTAATItXTlK!CTCCCAT 4080 TKSSTALSDKAYHPDEVISSRGKVTIDSDWYVSTQLLPPIl244 4200 1284 4320 1324 AGGMTCC~~CCCAGCrrAAACAATACA~GCrrACA

GILVPSSNNTELTGLACVKCNQR

TCTlT'XTXAAGCAACT IPDAYMLNRLNLFLKQL

4440 1364 4560 1404 4680 1444

GCCTM~CMTCATTCAAGC~~~GG~~~~A~~GM~~ATAC~C~G~GA~AGGCMTA~GG~A~AGC~G~AGCT~~C PNFQSFKELQKKVESFMIRSGYNKVDLGNIFGFMSKGANP

4800 1484

TCACTAAC~~GCA~~Aatggcggcattttgatcagagttaatttattttactcaaattccttaagaacaagataaacctcgcgccttttaagaggttatctgcattta H Q Q K M S I F"*

4920 1492

atgcattcatttagtttataaatttacacaaaaaacttggggttttggggttttgggg-3'

4978

Fig. 2. Nucleotide sequence of the polcc-encoding gene and the deduced aa sequence. (a) The RI- version of the gene-sized molecule. Black boxes at the ends of the DNA molecule denote telomeric sequences; the single-stranded extension of telomeres was removed during cloning. The extent of sequence generated from each strand is indicated with arrows. Lengths are indicated in kb. The hatched box denotes the 0.7-kb fragment of the human pola-encoding gene, and is depicted along its target hybridization region in both the RI- and RI+ versions, The EcoRI site (E) in RI+ is shown. The RI’ version has not been isolated. (b) Complete nt sequence of the RI version, and the deduced aa sequence. The ORF is in upper case; the leader and trailer are in lower case. The sequence has been submitted to the GenBank database under the accession No. UO2001. A mat genomic library of 0. nova was constructed by inserting the gene-sized molecules into the unique EcoRl site of hgt 10 as described elsewhere (Klobutcher et al., 1984). About 1.2 x 10’ plaques representing five times the sequence complexity of mat DNA were screened with the 0.7-kb human probe. The screen yielded one positive h clone, DC8. The 5.0-kb insert in this clone was recovered after EcoRl digestion, indicating that it represents the RIversion. The 5.0-kb insert was subcloned in both orientations into the unique EuoRI site of pTZI8R (US Biochemical). Procedures for subcloning and for the generation of nested deletions were according to established protocois (Ausubei et al., 1987). Sequence determination was performed on double-stranded DNA templates with a Sequenase version 2.0 kit (US Biochemical) (Hsiao, 1991).

There are four TATAA-like boxes located in the region 52-279 bp upstream from the translation start codon; however, since the leader is A +T-rich (66% A+T), TATAA-like sequences are expected to occur by chance. Comparison of the leader sequences of different genes within a hypotrich species or between the same gene in different hypotrichs shows little or no sequence similarity among them with the exception of a histone H4-encoding gene in S. herniae and 0. rz~~a(Herrick, 1992). This suggests that hypotrichs may define promoters in other ways. The TGA stop codon at nt 48264828 is followed in the same reading frame by another TGA codon at nt 4841-4843. The 130-bp trailer sequence does not contain the poly(A) addition signal, 5’-AATAAA. The sequence S-ATAAA occurs 109 bp downstream from the translation stop codon, but this may be only by chance in the 71% A i-T 3’ trailer. Of the 47 hypotrich mat sequences deposited in the GenBank database, 17 sequences lack

the poly(A)-addition signal (Prescott, 1994). Thus it is not unexpected that this signal sequence is not present in the pola gene-sized molecule. It is not clear what guides poly(A) addition in hypotrich genes lacking the conventional signal. The ORF is 4479-bp long, including the stop codon, and is 62% A +T. It is not interrupted by A+ T-rich regions that suggest the presence of introns, which average 80% ACT in hypotrichs. The ORF specifies a putative polypeptide of 1492 aa (approx. 172 kDa). It contains 23 TAA and 18 TAG codons. In the ciliate genera Oxytricha, Stylonychia, Tetrahymena and Paramecium, TAA and TAG codons specify Glu rather than signalling translation termination as in most other organisms (Helftenbein, 1985; Preer et al., 1985; Hanyu et al., 1986; Herrick et al., 1987; Gray et al., 1991). Comparison of the sequence of the 0.7-kb human probe with the target sequence reveals an overall identity of

159 Region

A

Region

E

Region

D

10

Region

IV

Region

B

Region

Region

VI

Region

III

Region

Fig. 3. Conserved

II

aa sequence

10

20

10

20

I 5

motifs shared

Region

30

40

30

C

40

Region

VII

Region

V

10

between

the catalytic

subunits

10

of polcc from 0. nova (On), human

(h) and SC. Regions

are presented

in

the order they appear along the polymerase polypeptide (from N to C terminus): A-E-D-IV-B-II-VI-III-I-C-VII-V. Identical aa in all three polypeptides are shown in shaded boxes, while aa shared between any two of the polypeptides are shown in open boxes. The regions encompass the following aa: A: aa 33-48 (On), 76-91 (h), 56-71 (SC); E: aa 349-353

(On), 361-365

(h), 368-372

(SC); D: aa 483-487

(On), 495-499

(h), 493-497

(SC); IV: aa 601-642

(On), 609-650 (h), 610-651 (SC); B: aa 772-807 (On), 772-807 (h), 781-816 (SC); II: aa 840-880 (On), 839-879 (h), 843-883 (SC); VI: aa 924-940 909-925 (h), 904-920 (SC); III: aa 9577998 (On), 943-984 (h), 937-978 (SC); I: aa 1013P1020 (On), 998-1005 (h), 992-999 (SC); C: aa 1051-1064 1036-1049

(h). 1030-1043

(SC); VII: aa 1067P1071

(On), 1052-1056

(h), 1046-1050

56%, with a 185-bp region exhibiting 70% sequence identity. The homologous region maps to the 3’ half of the sense strand (Fig. 2a), spanning the portion of the gene between nt 3110 and 3820. The EcoRI site in the RI+ version is further upstream in the vicinity of nt 2700

(Fig. 2a). (c) Evolutionary conservation of aa sequence motifs The deduced aa sequence of the 0. nova protein was aligned with sequences of the human and Saccharomyces cerevisiae (SC) pola proteins (Fig. 3). Sequence comparisons show the presence of several conserved motifs associ-

(SC); V: aa 1087-1101

(On), 1071-1085

(h), 1065-1079

(On), (On),

(SC).

ated with eukaryotic, viral, and some prokaryotic replicative DNA polymerases, including domains I through VII, and motifs associated specifically with cl-type DNA polymerases, including domains A through E (Wong et al, 1988; Damagnez et al., 1991; White et al., 1993). The overall arrangement of the sequence blocks (from N to C terminus of the protein) is A-E-D-IV-B-IIVI-III-I-C-VII-V (White et al., 1993), a pattern that is also observed in the 0. nova polypeptide (Fig. 3). Regions I through VI are present in seven eukaryotic pola, and in the replicative DNA polymerases of herpes simplex virus, human cytomegalovirus, Epstein-Barr

160 virus and vaccinia

1993). The prokaryotic merases

examined

III, or regions Region

Based

and mitochondrial

contain

VII is also present viral

DNA

suggested

pola, as 6 and in

polymerases

in most

single aa substitutions sensitivity analogs,

(Spicer Wang

have

a set of core

polymerases.

Indeed,

in regions II and III confer altered

to a variety suggesting

DNA

et al., 1988).

et al. (1989)

of nucleotide

these regions

play an important

role

in DNA pblymerization (Wang et al., 1989). Regions A through E are conserved only in the seven eukaryotic region

pola examined

A, which

is missing

to date, with the exception from the T. brucei

and

of P.

fulciparum

polypeptides (White et al., 1993; and this paper), and are thus thought to be associated only with a-type DNA polymerases (Damagnes et al., 1991). A single aa substitution in region D alters the stability of the DNA polymerase-primase complex in SC, implicating this region in protein-protein interactions with other

components of the et al., 1988). Functions determined. Another conserved tides is the Cys-rich nova polypeptide also

replication machinery (Pizzagalli of A, B, C, and E remain to be

Fig. 4. Southern probed

aa 13 14 to 1398, where the spacing between Cys (C) residues is CX,CX,,CX2CX,,CX,CX,,cX,C, where X represents any aa other than Cys. This arrangement closely matches the spacing in the SC (CX,CX,,CX,CX,, CX,CX,JX,C) and the human (CX,CX,,CX,CX,, CX,CX,,CX,C) polypeptides. The Cys residues may form DNA-binding zing-fingers (Wang et al., 1989). (d) Homologous gene-sized molecules coding for pola in other hypotrichs Using the 0. nova mat gene as probe, Southern analysis of native mat DNA from ten closely and distantly related ciliates identified homologous gene-sized molecules ranging from 4.5 kb to 5.2 kb in every species tested (Fig. 4). The size variation observed is not remarkable because DNA polymerases are generally composed of multiple conserved regions separated by variable length spacers (Wang et al., 1989); it may also result from differences in the length of leader or trailer sequences. Lynn and Sogin (1988) have compiled a phylogenetic tree for the ciliates based upon alignments of 16s rRNA sequences that indicates the genera Oxytricha and Stylonychia are closely related, while the genus Euplotes is more distantly related to both. Our Southern analysis confirms this relationship;

4

5

6

7

8

9

10

analysis

of native

mat

DNA from ten ciliate species

insert of DC8. Species examined are 0. trifallax (lane I), 0. sp. (WR) (I ane 2), S. lemnae (lanes 3 and 5), S. mytilus (lane 4), Urost$u grundis (lane 6), Paraurostylu weissii (lane 7),

with the 5.0-kb EcoRl

sp. (lane 8), E.

Holosticha

10). Methods:

Cultures

temperature

Prescott, extracted

octocarinatus (lane 9) and E. aediculutus (lane

of the various

hypotrich

species were grown

at

using the algae Chlorogonium as food. Preparation

of mat was performed

according

to established

procedure

(Zahler

and

1988). The mat suspension was lysed in 0.3% SDS and phenolat 23°C in order to eliminate or inactivate nucleases. The

aqueous phase was subsequently treated with proteinase K (Sigma, St. Louis, MO, USA) and extracted twice with phenol and once with chloroform/isoamyl

alcohol

(24:l).

Macronuclear

DNA

was subse-

quently ethanol precipitated and resuspended in TE ( 10 mM Tris/l EDTA pH 7.5). The native mat DNA was then electrophoresed

mM in a

1% agarose

was

gel and stained

with EtdBr,

after which

the DNA

blotted onto a Zetabind membrane (Life Science Products) (Ausubel et al., 1987). The blot was hybridized at 65’C for 18 h in NES (0.5 M NaH,PO, 1989)

pH 7.2/l mM EDTA

with

the

probe

labeled

pH 8.0/7% with

SDS) (Sambrook

32P using

random

et al.,

hexamers

(Pharmacia, Cambridge, UK) (Sambrook et al., 1989). Lanes 1 through 8 were washed to a final stringency of 0.1 x SSC/O.l% SDS at 53 C for 20 min, and visualized 9 and

feature among the pola polypepregion at the C terminus. The 0. contains a Cys-rich region between

3

n”-~ nl, _‘ Gs .n:*: :”

room

and pyrophosphate

2

5.0 4.3 -

et al., 1989).

in seven eukaryotic

that regions I, II and III comprise present

poly-

I, 11 and

T4, SC DNA polymerase

on these comparisons,

sequences

DNA

only either regions

I, II, III, IV and V (Wang

well as bacteriophage several

1

virus (Wang et al., 1989; White et al.,

by autoradiography

10 were washed

to a final stringency

room temperature, and visualized posure. Background hybridization presence washing

after a 20-h exposure.

Lanes

of 0.1 x SSCjO.1 %SDS

at

by autoradiography after a 17-h exin lanes 9 and 10 resulted from the

of telomeres on the probe fragment and the low-stringency conditions. Size markers are the 0. notrcl gene and Hind111

digest of h phage DNA (US Biochemical).

Lengths

are indicated

in kb.

the bands in both E. octocarinatus and E. aediculatus (Fig. 4, lanes 9 and 10) are visible only after lowstringency washes (0.1 x SSC/O.l% SDS at room temperature). In contrast, bands in the remaining lanes are visible after washing at much higher stringency (0.1 x SSC/O.l% SDS at 53°C). The catalytic subunit of pola in 0. nova contains multiple, highly conserved regions associated with DNA polymerases in general and with cl-type DNA polymerases in particular (Fig. 3). Because of the unique structure and organization of hypotrich genes, studies in 0. nova may yield useful information about transcriptional regulation of the pola-encoding gene.

ACKNOWLEDGEMENTS

This work was supported by NIGMS Grant No. GM19199 to D.M.P. The authors wish to thank Gayle Prescott for assistance with manuscript preparation.

161 among

REFERENCES

ciliated

protists

using partial

rived from reverse transcripts. Ausubel,

F.M., Brent, R., Kingston,

Smith, J.A. and Struhl,

K. (Eds.): Current

Biology, Greene Publishing York, 1987. Braithwaite, ships

R.E., Moore,

Associates

of

DNA

polymerases.

Protocols

served amino Gray,

in Molecular

and phylogenetic

Nucleic

Acids

Res.

New

relation21

(1993)

G.: The POLI

A.-M. and Baldacci,

gene from the fission yeast, Schizosaccharomyces alpha.

J.G.,

and Wiley-Interscience,

D.K. and Ito, J.: Compilation

787-802. Damagnez, V., Tillit, J., de Recondo,

D.D., Seidman,

pombe, shows con-

acid blocks specific for eukaryotic

DNA polymerase

expression specific

D.W., Price, C.M. and Cech, T.R.: Cloning for the Oxytrichu

of genes subunit

interactions

in the

telomere-binding telomere

and

protein:

complex.

Cell

67

(1991) 807-814. Hanyu, N., Kuchino, Y. and Nishimura, S.: Dramatic evolution: alteration of UAA and UAG termination mine

codons

due

tRNAsG’“. EMBO Helftenbein,

to anticodon

mutations

events in ciliate codons to gluta-

in two

Trtrahyniena

sequence

of a macronuclear

coding for a-tubulin from the ciliate codon usage: TAA is not a translation Acids Res. 13 (1985) 415-433. Herrick, Cl., Cartinhour, C.W.,

DNA molecule

Srylonychia termination

Williams,

K.R.

and

lemnae. Special codon. Nucleic Kotter,

K.P.:

Multiple sequence versions of the Oxytricha ,fa/lax Sl-MAC ternate processing family. J. Protozool. 34 (1987) 4299434. Herrick,

G.: Non-coding

trichous

DNA in macronuclear

ciliates. J. Protozool.

L.A., Jahn,

are eliminated ciliated

CL.

procedure for sequencing double Nucleic Acids Res. 19 (1991) 2787.

and Prescott,

from genes during

protozoan

of hypo-

39 (1992) 309-312.

Hsiao, K.-C.: A fast and simple stranded DNA with Sequenase. Klobutcher,

chromosomes

al-

D.M.:

macronuclear

Internal

sequences

development

in the

Oxytricha nova. Cell 36 (1984) 1045-1055.

Klobutcher, L.A. and Prescott, D.M.: The special case of the hypotrichs. In: Gall, J. (Ed.), The Molecular Biology of Ciliated Protozoa. Academic Press, Orlando, FL, 1986, pp. 111-154. Lynn, D.H. and Sogin, M.L.: Assessment of phylogenetic

relationships

P., Plevani,

with other DNA polymerases. 377223776.

RNA sequences

P. and Lucchini,

G.: DNA poly-

cerevisiae: nucleotide mutation, and protein

sequence, homology

B.M. and Barnett,

A.J.: Deviation

from the universal code shown by the gene for surface protein in Paramecium. Nature 314 (1985) 188-190. Prescott, D.M.: The DNA (1994) in press. J., Fritsch,

Laboratory

of ciliated

E.F. and

Manual,

Cold Spring Harbor,

protozoa.

Maniatis,

Microbial.

T.: Molecular

2nd ed. Cold Spring Harbor

Cloning.

Laboratory

A

Press,

NY, 1989.

Spicer, E.K., Rush, J., Fung, lutionary

51A

Rev. 58

So, A.G. and Downey, K.M.: Eukaryotic DNA replication. Biochem. Mol. Biol. 2 (1992) 1299155. Konigsberg,

de-

21 (1988) 2499254.

Proc. Natl. Acad. Sci. USA 85 (1988)

Preer Jr., J.R., Preer, L.B., Rudman,

C., Reha-Krantz,

W.H.: Primary relatedness

polymerases.

J. 5 (1986) 130771311.

E.: Nucleotide

A., Valsasnini,

merase I gene of Saccharomyces mapping of a temperature-sensitive

Sambrook,

Mol. Gen. Genet. 226 (1991) 1822189.

J.T., Celander,

Pizzagalli,

ribosomal

Biosystems

structure

to eucaryotic

Crit. Rev.

L.J., Karam,

J.D. and

of T4 DNA polymerase: and

other

procaryotic

evoDNA

J. Biol. Chem. 263 (1988) 747887486.

Wang, TS-F., Wong, S.W. and Kern, D.: Human DNA polymerase CL: predicted functional domains and relationships with viral DNA polymerases. FASEB J. 3 (1989) 14-21. Wang, T.S.-F.: Eukaryotic DNA polymerases.

Annu. Rev. Biochem.

60

(1991) 5133552. White, J.H., Kilbey, B.J., de Vries, E., Goman, M., Alano, P., Cheesman, S., McAIeese, S. and Ridley, R.G.: The gene encoding DNA polymerase c1 from Plasmodium falciparum. 364333646. Wong, SW., Wahl, A.F., Yuan, K.-I.,

Kern,

D., Hunkapiller,

DNA polymerase and its primary

Acids

Res. 21 (1993)

P.-M., Arai, N., Pearson, M.W. and

rx gene expression

structure

Nucleic

Wang,

is cell proliferation

is similar to both prokaryotic

B.E., Arai.

T.S.-F.:

Human

dependent and eukaryo-

tic replicative DNA polymerases. EMBO J. 7 (1988) 37-47. Zahler, A.M. and Prescott, D.M.: Telomere terminal transferase activity in the hypotrichous ciliate Oxytricha now and a model for replication of the ends of linear DNA molecules. Nucleic Acids Res. 16 (1988) 695336972.