Cloning and sequencing of the gene for the DNA-binding 17 K protein of Escherichia coli

Cloning and sequencing of the gene for the DNA-binding 17 K protein of Escherichia coli

117 Gene, 67 (1988) 117-124 Elsevier GEN 02441 Cloning and sequencing of the gene for the DNA-binding 17K protein of Escherichia coli (Recombinant p...

892KB Sizes 0 Downloads 121 Views

117

Gene, 67 (1988) 117-124 Elsevier GEN 02441

Cloning and sequencing of the gene for the DNA-binding 17K protein of Escherichia coli (Recombinant phobicity;

DNA;

genomic fragment;

phage ,I library; plasmid vector; oligodeoxynucleotide

probe; hydro-

over-production)

A. Holck and K. Kleppe Department of Biochemistry and Laboratory of Biotechnology, University of Bergen, Bergen (Norway) Tel. + 47-(OS)200100 Received Revised

27 December 10 February

1987 1988

Accepted

18 February

1988

Received

by publisher

28 March

1988

SUMMARY

The skp gene encoding the 17 K protein, a basic DNA-binding nucleoid-associated protein of Escherichia coli, was cloned as part of a 2.3-kb genomic fragment. The gene was sequenced and a polypeptide of 161 amino acids (aa) was deduced from the nucleotide sequence. The primary translation product was processed by cutting off the N-terminal 20 aa residues, yielding a mature polypeptide of 141 aa. The M, of the mature polypeptide was 15 674. An E. coli transformant containing the skp gene on the plasmid pGAH317 was shown to overproduce the gene product some 20-fold.

INTRODUCTION

The DNA E. coli exists in the cell as an irregularly shaped body called the nucleoid. Several lines of evidence indicate that proteins may play important roles in the in vivo structure of this nucleoid. We have developed a method for isolating bacterial chromatin from the nucleoid (Sjastad et al., 1982). Correspondence to: Dr. Norwegian AS-NLH

Food (Norway)

Abbreviations:

17K protein,

protein

of approx.

phate;

acid(s);

17 kDa

ssb, single-strand Computer

at his

Institute,

present

P.O.

address:

Box 50, N-1432

bp, base pair; d, deletion;

a basic DNA-binding

ORF, open reading

Genetics

Holck

Tel. + 47-(02)940860.

aa, amino

inosine; otide(s);

A.

Research

of E. coli; kb, 1000 bp; nt, nucleframe;

binding;

Group;

I,

nucleoid-associated

SDS, sodium

UWGCG,

[ 1,designates

dodecyl

sul-

Univ. of Wisconsin plasmid-carrier

state.

The protein components of the bacterial chromatin have been thoroughly investigated (Lossius et al., 1984; Holck et al., 1987a). Purified DNA-protein complexes from bacterial chromatin have been shown to contain several different proteins, among these a group of basic proteins in the molecular size range of 16-18 kDa. Together, they comprise approx. 20% of the total DNA-binding proteins. One of these, the 17K protein, was also among the major DNA-cellulose-binding proteins of E. coli (Holck et al., 1987b). This protein was shown to be different from the ssb protein (Sigal et al., 1972), protein Hl (Cukier-Kahn et al., 1972), and HLPl (Lathe et al., 1980). The 17 K protein was purified to homogeneity, and the N-terminal sequence was determined by edman degradation. We describe here the cloning and the nucleotide sequence of the gene encoding the 17 K protein, here-

0378-l 119/88/$03.50 0 1988 Elsevier Science Publishers B.V. (Biomedical Division)

118

after termed the skp gene. This represents a first step towards determining the three-dimensional structure of the 17 K protein and a better ~derst~ding of the nature of the DNA-protein interaction to elucidate the structure of the bacterial nucleoid.

(c) Constrnction coli library

MATERIALS AND METHODS

(a) Bacterial strains, enzymes and chemicals E. co& B, E. coIi K802 (hsdR _, hsdM+ , gal-, met-, supE) and E. co& HB 101 (F -, hsdS20 (rg , rn, ), recA13, ara-14, proA2, lacY1, galK2, rpsL20, (SmR), ~~1-5,mtl-1, supE44, a-) were from the laboratory stock of the Laboratory of Biotechnology, University of Bergen. E. coli JM105 (Alac-pro, thi, strA,

endA,

sbcB15,

hsdR4~F’~a~36,

preparation of plasmid DNA was according to Frei et al. (1985), except that the plasmids were purified by phenol-chloroform extractions instead of CsCl density-gradient centrifugations. Small-scale preparations of plasmids were prepared by the boiling method (Holmes and Quigley, 1981).

proAl?+,

lacl~ZAM15]) was a gift from Marine Genetics AS, Bergen. E. coli JA200 (F+, thr-1, leu36, trpE63, recA56, thi-1, ara-14, lacY1, galK2, gaET22, ~~1-5, m&l, 1-, supE44, glnV44) harbouring the plasmid pLC26-43, from the Clarke-Carbon E. coli hybrid

ColEl plasmid collection, was supplied by Dr. Barbara Bachmann at the E. coli Genetic Stock Center, New Haven, CT. Enzymes used in the cloning procedures were from New England Biolabs, Beverly, MA and Amersham International (U.K.). Phage IEMBL3 cloning vector and Packagene I DNA packaging system were from Promega Biotec, Madison, WI [ Y-~~P]ATP (3000 Ci/mmol) and [ a-35S]ATP (400 Ci/mmol) were supplied by Amersham International (U.K.). The Gemseq sequencing system was obtained from Promega Biotech. Colicin El was from Sigma Chemical Company, St. Louis, MO. (b) Isolation of DNA E. cob’ B DNA was deprotei~zed, RNase A-treated and purified by isopycnic density-gradient centrifugation in CsCl. Large-scale isolation of specific phages was carried out by precipitation of the lysates from 400 ml liquid cultures with polyethylene glycol. Phage partictes were disrupted by trea~~t with proteinase K in the presence of SDS (Vande Woude et al., 1979) and the released DNA was extracted with phenol-chloroform. Large-scale

and screening

of the ~~~~e~ie~iu

E. coli B DNA was partially digested with Sau3AI

to yield an average fragment length of 13-20 kb. AEMBL3 vector DNA was digested with BamHI foflowed by treatment with calf intestine alkaline phosphatase. The E. coli DNA fragments were ligated to the IEMBL3 arms and packed by a modification of the method of Sternberg et al. (1977), employing the Packagene 1 DNA packaging system. Recombin~t phages (5 x lo5 phages/pg DNA) were plated on E. coli K802 and screened by the method of Benton and Davis (197’7). Imprints on nitrocellulose filters were hybridized as described in section d, below. (d) Southern b~otti~

Genomic, plasmid and 1 DNA preparations were exhaustively digested with a number of restriction enzymes, and the fragments were electrophoresed on 0.7-1.5x agarose gels and blotted onto nitrocellulose according to the method of Southern (1975). The filters were hybridized with a 32P-labelled synthesized oligodeoxynucleotide (the 17 KI probe, a 29-mer) prepared on the basis of knowledge of the N-terminal amino acid sequence of the 17 K protein as shown in Fig. 1 (Holck et al., 1987b) or with a synthetic probe (the 17 KII probe, a 29”mer), hybridizing from nt position 436 to nt position 464 (see Fig. 3). The sequence of the 17KII probe was thus 5’-GAT-GCA-AAC-GCC-GTT-GCT-TACAAC-AGC-AG-3’. (e) Nucleotide

sequencing and computer analysis

DNA fragments were inserted into plasmid vectors pGEM3 or pGEM4 and sequenced according to the chain-termination procedure (Sanger et al., 1977) with the Gemseq sequencing system. The inserts were sequenced using oligodeoxynucleotides

119

from the two flanking and the 17KI Alternatively, M13mp18 essentially performed

SP6 and T7 promoter

and the 17KII DNA

probes

fragments

regions into

and M13mp19 vectors and sequenced as above. The computer analyses were on an IBM personal computer, using the

using

the

UWGCG

AND DISCUSSION

as primers.

were inserted

Staden-Plus sequence analysis programme (Amersham International, U.K.), and on a VAX/780 computer

RESULTS

programme

package

(a) Characterization

of the gene for the 17K protein

To identify and characterize the gene encoding the 17 K protein, and to find restriction sites suitable for cloning and subcloning

of the gene, purified chromo-

somal E. coli DNA was digested with a series of restriction enzymes either alone or in combinations.

(Devereux et al., 1984). The hydropathic index in Fig. 4 is plotted as described by Kyte and Doolittle

The resulting DNA fragments were subjected to electrophoresis and blotted onto nitrocellulose filters.

(1982), using their hydrophobicity

The filters were then hybridized

window

is slid along the sequence

values. In short, a and each amino

acid is assigned a value depending on the residue itself and on the adjacent residues. The diagram is scaled according to the maximum and minimum values in the chain. The charge index curve is obtained using a similar approach. The Robson prediction of the secondary structure is based on the method of Garnier et al. (1978). Protein structures are divided into four classes: turn, coil, sheet, helix. Each residue is assigned a probability of being found in each of these classes. The most likely structure (the highest probability) at each position is indicated by a dot in the Robson prediction plot. (f) Gel electrophoresis

and immunoblotting

One-dimensional SDS-polyacrylamide-gel electrophoresis was carried out using the buffer system of Laemmli and Favre (1973), with 15 y0 polyacrylamide gels. The gels were either stained using the silver staining method or electroblotted as described by Towbin et al. (1979). The antigen was detected employing monospecilic antibodies against the 17K protein and the Bio-Rad Immunoblot (GAR-HRP) Assay kit (Bio-Rad Laboratories, Munich, F.R.G.). The immunoblots were scanned with a Zeineh softlaser scanning densitometer (Biomed Instruments, Fullerton, CA) and the data were recorded on a Hewlett Packard Integrator 3390 A. Various amounts of E. coli overnight cultures were sonicated and the protein content was measured with the Bio-Rad protein assay prior to gel electrophoresis.

oligodeoxynucleotide

prepared

to a mixed synthetic on

the

basis

of

knowledge of the N-terminal amino acid sequence of the 17 K protein (the 17 KI probe). The amino acid sequence and the corresponding synthetic oligodeoxynucleotide are shown in Fig. 1. The probe gave one strong signal and did not hybridize to control DNA (human placenta DNA) confirming the specificity of the hybridization and indicating the presence of a unique single-copy gene. Taken together with the findings that the 17 K protein subunits migrate as one polypeptide upon two-dimensional gel electrophoresis and that the N-terminal sequence of the protein showed no ambiguities, the 17 K protein seemed to consist of four identical subunits in contrast to other known low-M, basic proteins suspected to interact with DNA such as HU which exists as heterotypic dimer (Rouviere-Yaniv and Kjeldgaard, 1979) and Hl, which exist in several forms: Hla, Hlb and Hlc (Spassky et al., 1984). The restriction map of the skp gene showed some similarities with that reported for the JivA gene (Bend& and Friesen, 1981). No signals were obtained when the 17 KI probe was hybridized to the plasmid pLC 26-43 reported to contain the JirA gene (Neidhardt et al., 1983). However, one cannot exclude the possibility that these proteins may show partial homology in other parts of the peptide chain.

NH*-Ala-Asp-Lys-Ile-Ala-Ile-Val-Asn-Met-Gly5'-GCI.GA+.;*ATI.GCI-ATI-GTPAAZ-ATG.GG-3' Fig. 1. The N-terminal

the corresponding nucleotide, inosine.

sequence

sequence

the 17KI

probe,

of the mature

17 K protein

of the mixed synthetic used to identify

and

oligodeoxy-

the &II gene. I,

120 All81

Haelll

Pstl

Sau3A Pstl

Pstl

Alul

Sau3A

Pstl

Alul

Sau3A Rsa I

Sau3A

-

-

17Kll

17KI

100

bp

Fig. 2. Restriction map of the skp gene and the strategy used for sequencing. The bold and the open lines indicate the structural gene and the signal sequence, respectively; transcription is from left to right. The long arrows indicate the direction and extent ofeach sequence determination. Short arrows labelled 17KI and 17KII indicate sequences homologous to the synthetic primers.

(b) Cloning and sequencing of the skp gene Partially Sau3AI-digested E. coli B DNA and IEMBL3 vector DNA were ligated and an E. coli library was constructed as described in MATERIALS AND METHODS, section c. Upon screening the library with the 17KI probe, 12 positive recombinants were found. A 2.3-kb HindIII-Sal1 fragment from one of the positive A clones was subcloned into pGEM3 plasmid vector, resulting in a recombinant plasmid, pGAH317, more suitable for further characterization and for direct sequencing of the gene (the Sal1 site originates from the AEMBL3 polylinker). The pGAH317 plasmid was then digested with restriction enzymes recognizing four bp to find fragments of suitable lengths for direct sequencing after subcloning into pGEM4 or M 13mp 18 and M13mp19 plasmid vectors. The results are summarized in Fig. 2, showing both the position of restriction sites and the sequencing strategy used for determining the skp gene. The positions of the restriction sites were confirmed by the sequence determination. The nucleotide sequence of the skp gene is shown in Fig. 3. A 483-bp ORF is coded from the alternative start codon GTG at nt position 25 to the stop codon at nt position 508. This sequence would encode a polypeptide of 161 aa. A subsequent cleaving at aa position 84 would yield a mature polypeptide consisting of 141 aa with an N-terminal sequence identical to that determined by edman degradation,

starting with the Ala encoded at aa position 85. The M,. of this polypeptide is 15 674, which is approximately the size of the 17 K subunit as determined by SDS-polyacrylamide-gel electrophoresis. The gene contains a nucleotide sequence of 60 nt at the 5’ end encoding a hydrophobic oligopeptide. It is thought that the signal peptide functions as a signal for the initial steps in the protein translocation. Such sequences are thus often, but not always, found on proteins destined for export or integration into membranes. However, when crude membranes and outer and cytoplasmic membranes were isolated (Bakken and Jensen, 1986) and subjected to immunoblotting, only trace amounts of the 17K protein were detected (results not shown). The significance of the hydrophobic N-terminal sequence of the nascent polypeptide thus remains unknown. One might speculate that the hydrophobic sequence of the nascent 17 K protein directs the protein to some kind of hydrophobic scaffold structure. Preceding the alternative start codon is a proposed ribosome-binding site almost identical to the Shine-Dalgarno consensus sequence (AGGAGGT), which would presumably act as a strong translation-stimulating sequence. This might be reflected in the abundance of the 17 K protein (1200 tetramers/cell). The end of the 17 K structural gene is followed by two regions of hyphenated dyad symmetry (nt 523-553 and nt 565-590) which may be involved in the termination of the 17K transcript. However, a conventional terminator structure where

121

50 30 10 AAATGGGATGGTAAGGAGTTTATTGTGAAAAAGTGGTTATTAGCTGCAGGTCTCGGTTTA MetlLysLysTrpLeuLeuAlaAlaGlyLeuGlyLeu 110 v 90 70 GCACTGGCAACTTCTGCTCAGGCGGCTGACAAAATTGCAATCGTCAACATGGGCAGCCTG AlaLeuAlaThrSerAlaGlnAlaAlaAspLy~IleAlaIleValAsnMetGlySerLeu 130 150 170 TTCCAGCAGGTAGCGCAGAAAACCGGTGTTTCTAACACGCTGGAAAATGAGTTCAAAGGC PheGlnGlnValAlaGlnLy~ThrGlyValSerAsnThrLeuGluAsnGluPheLysGly 190 210 230 CGTGCCAGCGAACTGCAGCGTATGGAAACCGATCTGCAGGCTAAAATGAAAAAGCTGCAG ArgAlaSerGluLeuGlnArgMetGluThrAspLeuGlnAlaLysMetLysLy5LeuGln 290 270 250 TCCATGAAAGCGGGCAGCGATCGCACTAAGCTGGAAAAAGACGTGATGGCTCAGCGCCAG SerMetLysAlaGlySerAspArgThrLy~LeuGluLysAspValMetAlaGlnArgGln 350 330 310 ACTTTTGCTCAGAAAGCGCAGGCTTTTGAGCAGGATCGCGCACGTCGTTCCAACGAAGAA ThrPheAlaGlnLysAlaGlnAlaPheGluGlnAspArgAlaArgArgSerAsnGluGlu 410 390 370 CGCGGCAAACTGGTTACTCGTATCCAGACTGCTGTGAAATCCGTTGCCAACAGCCAGGAT ArgGlyLysLeuValThrArgIleGlnThrAlaValLy~SerValAlaAsnSerGlnA~p 470 450 430 ATCGATCTGGTTGTTGATGCAAACGCCGTTGCTTACAACAGCAGCGATGTAAAAGACATC IleAspLeuValValAspAlaAsnAlaValAlaTyrAsnSerSerA~pValLysAspIle 530 510 490 ACTGCCGACGTACTGAAACAGGTTAAATAAGTAATGCCTTCAATTCGACTGGCTGATTTA * ThrAlaAspValLeuLysGlnValLysEnd 590 570 550 GCGCCGGTTGGATGCAGAACTACACGGTGATGGCGATATCGTCATCACCGGCGTTGCGTC Y_ w 650 630 610 CATGCAATCTGCACAAACAGGTCACATTACGTTCATGGTTAACCCAAAATACCGTGAGCA 6’70 TTTAGGCTTGTGCAGGCGT Fig. 3. Nucleotide The numbering indicated horizontal

sequence

of the skp gene and the deduced

amino acid sequence.

is from the 5’ end (with the last digits corresponding

by the downward divergent

arrows

arrowhead.

A possible

with the centers

Shine-Dalgarno

of symmetry

Only the antisense

to the nt). The cleavage sequence

is underlined,

strand

(mRNA-like)

site to yield the mature and dyad symmetries

is shown.

polypeptide

is

are indicated

by

shown by dots.

the dyad symmetries are followed by a G/C stretch and a T stretch as described by Rosenberg and Court (1980), was not found. A second ORF starting at nt position 552 was found out of phase 3’ of the proposed coding region of the skp gene. This ORF would encode a polypeptide of 37 aa. It is thus possible that the skp gene is

transcribed as part of a polycistronic mRNA. These questions will be settled only after the size of the RNA and the actual point of transcription termination have been determined in vitro. The sequence of the skp gene was unique when compared to 65 1 E. coli sequences (798 259 nt) from the EMBL data library.

122

TABLE

I

The search for codon

Amino

acid composition

Amino

acid

of Escherichia coli 17K protein

Subunit Amino

composition

Nucleotide

sequence b

table of codon frequency Ala

18

17

Arg

12

9

Asx Asp

18

(18) 11

Asn 0

0

Glx Glu

25

(22) 8

Gln 7

5

His

1

0

Ile

4

5

Leu

10

9 15

Lys

10

Met

3

5

Phe

4

4

Pro

ND

0 10

Ser

11

Thr

9

8

Trp

ND

0

Tyr

1

1

Val

12

13

145

141

Total

Fig. 3. Values

in parentheses

4o

are

(asp + asn)

I

A

in Fig. 3 turned

Only in this reading frame could the N-ter-

of the deduced

out to be the only one

protein

be identical

to that

The assignment of gene for the 17K

protein is thus convincingly confirmed. The evidence for this is further strengthened when comparing the deduced amino acid composition with that determined for the 17K protein. These results are shown in Table I. The predicted and determined values correspond well. The hydrophobicity and the charge distribution as well as the suggested secondary structure of the proposed nascent polypeptide are displayed in Fig. 4. The polypeptide is apparently divided into three domains. In the hydropathic plot both ends of the polypeptide chain show a hydrophobic character, whereas the middle part of the chain is clearly more hydrophilic. This is reflected also in the charge plot, which indicates that the charged areas are concentrated in the middle of the polypeptide chain. Likewise, a high degree of secondary structure is

or

(glu + gin).

0

suggested

found by Edman degradation. this sequence as the structural

a Values from Holck et al. (1987b). ND, not determined. b From

representative

probable. minal

14

Gly

supposedly

of the skp gene. When the found nucleotide sequence was translated in all three reading frames, the frame

7

Cys

in a nucleotide

sequence is a method for detecting coding sequences (Staden and McLachlan, 1982). The codon frequencies of the ompA, which codes for one of the most abundant E. coli proteins, was used to establish a

from:

acid analysis a

preference

I

I

I

1

I

I

i

r

2:

o”-.‘=

L -0

2

-40 4OrB

_I

-40 E* dF on IY

..

t -c

.. .

C-

h -

. .

..

s-

the dots, representing

I

I

I

I

I

I

20

40

60

80

100

120

140

one aa residue each, are separated section e.)

_ -

I

index (A), charge index (B) and suggested

chain. The second level indicates AND METHODS,

. . ..

...

.

-..........-........

Amino Fig. 4. Hydropathic

.

acid

secondary

number structure

of the proposed

nascent

into four levels. The top level suggests the positions

coils (c), the third sheets (s), and the bottom

level helix structures

polypeptide ofturns

(C). In panel C,

(t) in the polypeptide

(h). (For details, see MATERIALS

123

proposed towards the ends of the polypeptide, predominately random coils and sheet structures, whereas the middle part of the chain is dominated by a long a-helix structure. Pabo and Sauer (1984) have pointed out that DNA-binding proteins often use a-helical regions for DNA

recognition.

A possible

DNA

binding

would thus be in the area from about aa residues

site 65

to 80. This area also has a high number of basic amino acid residues. However, one should keep in mind that the method for prediction, although regarded among the better ones, was correct for only about 56% of the tested amino acid residues (Kabsch and Sander, 1983). (c) Expression of the 17K protein E. coli HB 10 1 transformed with plasmid pGAH3 17, containing the skp gene, was grown under conditions giving a high yield of plasmid. Total sonicated E. coli lysates were then subjected to SDSpolyacrylamide-gel electrophoresis, blotted onto nitrocellulose and the 17K protein detected with monospecific antibodies as described in MATERIALS AND METHODS, section f. The results are presented in Fig. 5. An approx. 20-fold increase in the amount of the 17 K protein was observed in the transformed bacteria compared to the non-transformed bacteria. High levels of expression are obviously not deleterious to the cells. A band corresponding to an &I,. of about 19000 was visible on the immunoblots. This band contained 20 times less protein than that corresponding to the 17 K protein and might well be the precursor of the mature polypeptide. These cells might be used to study the influence of the 17K protein on growth rate and on the nucleoid structure and appearance. Moreover, they will be a good source of 17 K protein for a thorough study of protein structure and protein-DNA interaction. (d) Conclusions Cloning and sequencing of DNA-binding proteins are initial steps in determining the tertiary and quaternary structure of a protein and obtaining a sound understanding of the nature of the proteinDNA interaction. Ultimately these studies may be of importance when elucidating the structure and organization of the bacterial nucleoid.

Fig. 5. Synthesis of 17 K protein. E. coli HB 101 was transformed with plasmid pGAH3 17 containing the gene for the 17 K protein. The bacteria were lysed and subjected to 0.1% SDS-15 % polyacrylamide gel electrophoresis and the 17K protein was identified by immunoblotting. Lanes A and B, silver-stained total E. coli protein. Lanes C and D, immunoblots of identical preparations as A and B, respectively. Lanes A and C, E. coli HB 101; lanes B and D, HBlOl[pGAH317].

The skp gene encoding the 17K protein, a basic DNA-binding protein of E. coli was identified and cloned using a synthetic oligodeoxynucleotide. This structural gene consisted of an ORF of 483 bp. The nascent translation product is processed yielding a polypeptide of 141 aa residues. A possible binding domain for DNA is in the hydrophilic area from about aa residues 65 to 80.

ACKNOWLEDGEMENTS

We want to thank Dr. A. Fjose and his group for preparing 1 and plasmid vectors and A. Nerland and R. Aasland for their help in sequencing and com-

124

puting. This study was supported

by grants from the

Norwegian Humanities.

for

Research

Council

Science

and

Lathe, R., But, H., Lecocq, J.-P. and Bautz, E.K.F.: Procaryotic histone-like

protein interacting

Natl. Acad.

Sci. USA 77 (1980) 3548-3552.

Lossius,

I., Sjastad,

dimensional present

with RNA polymerase.

K., Haarr,

L. and

gel electrophoretic

Kleppe,

separation

Proc.

K.: Two-

of the proteins

of Escherichia coli. J. Gen. Microbial.

in chromatin

130 (1984) 3153-3157. Neidhardt, REFERENCES

F.C., Vaughn,

Gene-protein

T.A. and Bloch,

P.L.:

index of Escherichia coli K-12. Microbial.

V., Phillips,

Rev.

47 (1983) 231-284. Bakken,

V. and Jensen,

H.B.:

Outer

membrane

proteins

Fusobacterium nucleatum Fevl. J. Gen. Microbial.

of

132 (1986)

1069-1078. Bendiak,

J.D.: Organization

of genes in the

four minute region of the Escherichia coli chromosome: dence that rpsB and tsf are co-transcribed.

evi-

Mol. Gen. Genet.

D.W.

and

Davis,

clones by hybridization

R.W.:

Screening

Igt

to single plaques

recombinant

in situ. Science 196

Cukier-Kahn,

R., Jacquet,

low molecular stimulate

from Escherichia coli that

proteins

DNA-directed

RNA synthesis:

Proc. Natl. Acad.

J., Haeberli,

P. and Smithies, 0.: A comprehensive

analysis programs

set

for the VAX. Nucl. Acids Res.

12 (1984) 387-395. Frei, E., Baumgartner,

S., Edstrbm,

J.-E. and Noll, M.: Cloning

of the extra sex combs gene of Drosophila and its identification by P-element-mediated

gene transfer.

EMBO

J. 4 (1985)

J., Osguthorpe,

accuracy

D.J. and Robson,

and implications

the secondary

structure

B.: Analysis

of simple methods

of globular proteins.

of the

of predicting

J. Mol. Biol. 120

DNA-

and

I., Aasland,

RNA-binding

R., Haarr, proteins

L. and Kleppe, of

chromatin

K.: from

Escherichia coli. Biochim. Biophys. Acta 908 (1987a) 188-199. Holck, A., Lossius,

I., Aasland,

and characterization

R. and Kleppe,

of the 17 K protein,

from Escherichia coli. Biochim.

protein

K.: Purification a DNA-binding

Biophys.

Acta

914

D.S. and Quigley,

preparation

of bacterial

M.: A rapid boiling method plasmids.

Anal. Biochem.

for the

114 (1981)

W. and Sander,

secondary

structure?

Kyte, J. and Doolittle, hydropathic

C.: How good are predictions

of protein

character

of a protein.

for displaying

the

J. Mol. Biol. 157 (1982)

Sanger, F., Nicklen,

U.K. and Favre,

teriophage

dimer. FEBS

M.: Maturation

inhibitors.

Proc. Natl. Acad.

Sigal, N., Delius, H., Kornberg, A DNA unwinding

with

Sci. USA 74

with DNA

Natl. Acad. Sjastad,

T., Gefter, M.L. and Alberts, B.:

protein

from Escherichia coli: its

isolated

and with DNA

polymerases.

Proc.

Sci. USA 69 (1972) 3537-3541.

K., Fadnes,

P., Kruger,

P.G., Lossius, I. and Kleppe, K.:

Isolation,

properties

and nucleolytic

degradation

of chroma-

tin from

Escherichia cob’. J. Gen.

Microbial.

128 (1982)

3037-3050. Southern,

E.M.: Detection separated

of specific

sequences

by gel electrophoresis.

among

DNA

J. Mol. Biol. 98

(1975) 503-517. Spassky,

A., Rimsky,

S., Garreau,

E. coli DNA-binding

H. and But,

H.: Hla,

protein which accumulates

compacts

an

in stationary

DNA in vitro. Nucl. Acids Res. 12

(1984) 5321-5340. Staden,

R. and McLachlan,

in identifying

A.D.: Codon preference

and its use

protein coding regions in long DNA sequences.

Nucl. Acids Res. 10 (1982) 141-156. Sternberg,

N., Tiemeier,

of a Dam

vector

D. and Enquist, containing

Towbin, H., Staehelin, fer of proteins

L.: In vitro packaging

EcoRI

T. and Gordon,

DNA

fragments

of

1 (1977) 255-280. J.: Electrophoretic

from polyacrylamide

trans-

gels to nitrocellulose

and some applications.

Proc. Natl. Acad.

Sci. USA 76 (1979) 4350-4354. M., Enquist,

Sullivan,

M. and

Fischinger,

Moloney

sarcoma

proviral

P.J.: DNA

Communicated

L.W., Nomura,

by J.K.C. Knowles.

S.,

Cloning

of integrated

sequences

in bacterio-

Sci. USA 76 (1979) 4464-4468.

of the head of bac-

T4. J. Mol. Biol. 80 (1973) 575-599.

Lett. 106 (1979)

S. and Coulson, A.R.: DNA sequencing

phage 1. Proc. Natl. Acad.

105-132. Laemmli,

Annu.

N.O.: Native Escherichia coli

J. and Kjeldgaard, is a heterotypic

Vande Woude, G.F., Oskarsson,

FEBS Lett. 155 (1983) 179-182. R.F.: A simple method

involved in

13 (1980) 319-353.

sheets: procedure

193-197. Kabsch,

sequences

of RNA transcription.

Escherichia coli and phage Pl. Gene

(198713) 49-54. Holmes,

and termination

phase, strongly

(1978) 97-120. A., Lossius,

Annu.

297-300.

fragments

979-987.

Holck,

Rouviere-Yaniv,

interaction

Sci. USA 69 (1972) 3643-3647. of sequence

M. and Court, D.: Regulatory

the promotion Rev. Genet.

recognition.

(1977) 5463-5467.

M. and Gros, F.: Two heat-resistant,

weight

R.T.: Protein-DNA

53 (1984) 291-321.

chain-terminating

(1977) 180-182.

Garnier,

and Sauer,

HU protein

181 (1981) 356-362.

Devereux,

CO.

Rev. Biochem. Rosenberg,

D.S. and Friesen,

Benton,

Pabo,