91
Gene, 37 (1985) 91-99 Elsevier GENE 1346
Nucleotide sequence of the glnA-glnL intercistronic region of Escherichiu coli (Recombin~t
DNA; gtnALG operon; glutamine synthetase adenylylation site; te~ination;
REP sequences)
Mario Rocha, Martha Wzquez, Alejandro Garciarrubio aud Aiejandra A. Covarrubias* Gentro de Investigacibn sobre Fijacitin de Nitrcigeno, Universidad National Autcinoma de MPxico, Apartado Postal 565-A, Cuernavaca, Morelos (Mexico) Tel. (9073) 139-877 (Received January 4th, 1985) (Revision received February 20th, 1985) {Accepted March lst, 1985)
SUMMARY
The nucleotide (nt) sequence of a 6X2-bp fragment containing the 3’ end of the glnA gene, the region between the glnA and glnL genes, and the 5’ end of the glnL gene from Escherichiu coli was determined. This segment contains the region coding for the last 107 amino acids (aa) of glutamine synthetase, including the adenylylation site of this enzyme. The analysis of this sequence revealed two REP sequences, a Rho-independent terminator, the putative ghzL promoter and the possible binding site for the gtnG product, NR, .
INTRODUCTION
In enteric bacteria the synthesis of GS, as well as that of many proteins that transport or degrade nitrogen compounds (Ntr systems), is regulated in response to the quality and abundance of the nitrogen source in the growth medium (Woolfolk et al., 1966; Prival and Magasanik, 1971; Prival et al., 1973; Kustu et al., 1979a; Magasanik, 1982). Tbis regulation seems to be exerted at the level of transcription through the products of the glnF, glnG * To whom correspondence addressed.
and reprint requests should be
Abbreviations: aa, amino acid(s); bp, base pairs; GS, glutamine synthetase; IPTG, isopropyl-fi-D-thiogalactoside; NR, , nitrogen regulator; nt, nucleotide(s); Ntr, nitrogen-regulated; PolIk, Klenow fragment of the E. cc& DNA polymerase I; RBS, ribosomebinding site; REP, repetitive extragenic p~indromi~; Xgal, 5-brom~-chloro-3-indolyI-~-~g~actoside. 0378-l 119/85/$03.30 0 I985 Eisevier Science Publishers
and g&L genes (for reviews see Magasanik, 1982; Merrick, 1982). Evidence from different laboratories suggests that these gene products function both positively and negatively (Garcia et al., 1977; Kustu et al., 1979a; Pahel and Tyler, 1979; McFarland et al., 1981; MacNeil et al., 1982a; Alvarez-Morales et al., 1984). Both g&G and g1n.i.together with g&A, the glutamine synthetase structural gene, form a complex operon (Pahel et al., 1982; ~ajewska-G~nkiewicz and Kustu, 1984; Alvarez-Morales et al., 1984), whereas the glnFgene is unlinked (Garcia et al., 1977; Pahel et al., 1978; Gaillardin and Magasanik, 1978; Leonardo and Goldberg, 1980). It has been shown that the transcription of the glnALG operon proceeds through glnA, glnL and glnG in that order (Rothstein et al., 1980; Backman et al, 1981; MacNeil et al., 1982b; Pahel et al,, 1982; Espin et al., 1981; Krajewska-G~nkie~~z and Kustu, 1983). The transcription of g&G can be initiated at the promoter for
glnA as well as at a site located glnA (MacNeil processes
downstream
exerted by the pro-
duct of glnG (NR,). This gene product scription et al.,
or absence
initiated
1982;
represses,
of the glnF product,
at the downstream
Backman
et al.,
1983;
et al., 1983; Reitzer and Magasanik, ber of observations region, with promoter
have
suggested
in
tran-
and ligation reactions
to Maniatis
was as in Messing
were performed
et al. (1982). Transformation
et al. (1981).
(d) Nucleotide sequencing
1983). A numthat
a DNA
activity, is localized
between
1982; Covarrubias
Restriction according
site (Pahel Ueno-Nishio
gfnA and glnG (Chen et al., 1982; Goldie and Magasanik,
(c) Cloning procedures
et al., 1982b; Pahel et al., 1982). Both
are subject to control
the presence
from
and Bastarrachea,
Nucleotide
sequencing
was carried out using the
chain termination procedure (Sanger et al., 1977; Messing et al., 198 1; Messing and Vieira, 1982).
1983).
Ueno Nishio et al. (1983) have located a DNA region with the functional properties of a promoter and operator in a 270-bp DNA fragment. This fragment lies at the end of, or distal to, glnA and contains the site at which translation of glnL is initiated. One of our approaches to understand the control of the gInALG operon has been the analysis of its regulatory regions. In this study, we report the complete nucleotide sequence of a 682-bp BarnHI-CfuI fragment containing the 3’ end ofgInA, theglti-glnL intergenic region, as well as the 5’ end of glnL from E. coli.
(e) RNA isolation The MX614 strain was grown on glucose NN minimal medium (Covarrubias et al., 1980b) with glutamine 1 mg/ml as nitrogen source. The culture was harvested at Aeoo = 0.5, and total RNA extracted according to Aiba et al. (198 1).
RESULTS
was
AND DISCUSSION
(a) Sequencing strategy
MATERIALS
AND METHODS
(a) Reagents, media and enzymes Electrophoresis reagents were from Bio-Rad, deoxynucleotides from Sigma Chemical Co., and dideoxynucleotides from PL-Biochemicals. Deoxyadenosine 5-[ a-35S Jthio-triphosphate was purchased from Amersham International. IPTG and Xgal were purchased from Sigma Chemical Co. All the other reagents and media were of analytical grade. Restriction endonucleases, T4 DNA ligase and PolIk were purchased from commercial sources. (b) Bacterial strains and cloning vehicles The E. coli K-12 strain, JMlOl (Messing et al., 1981) was used as host for vector phages M 13mpS and M13mp9 (Messing and Vieira, 1982). The wildtype strain used for RNA isolation, MX614, was described previously (Osorio et al., 1984).
As mentioned above, it has been shown that the presence of a promoter-operator region in a portion of the glnALG operon located between glnA and glnL intercistronic region should be contained in the 691bp BamHI-EcoRI fragment from plasmids pACR.5 (Covarrubias and Bastarrachea, 1983) (Fig. 1). To obtain the nt sequence of the gInL 5’-control region, this BarnHI-EcoRI fragment was cloned in phages M 13mp8 and M 13mp9. The nt sequence of the two ends of this fragment was determined. Fragments of about 300 bp from the glnALG region obtained by sonication (Deininger, 1983) were cloned into SmaI-digested M13mp8. Appropriate clones were identified by dot hybridization (Maniatis et al., 1982) using the 682-bp BamHI-C/u1 fragment (Fig. 1) as probe. At the same time, the purified BamHI-C/a1 fragment was digested with HinfI. The three resulting fragments of 283, 177 and 222 bp were filled-in using PolIk, and cloned into M13mp8 digested with SmaI. These clones allowed us to determine the sequence of both strands of the BamHI-ClaI fragment by the Sanger chain termination procedure (Sanger et al., 1977). The nt sequence obtained is shown in Fig. 2.
93
Fig. 1. Physical map of the gin genes of E. coli and of the pACR5 plasmid. (A) Restriction map of the glnALG operon. The black bar represents the BarnHI-CloI fragment which was sequenced. The positions ofglnA,glnL andglnG are shown by the open bars; the open arrows indicate the direction of transcription. Some of the restriction sites are indicated; the vertical arrows show the Hinfl sites internal to the Bum HI-ClaI fragment. The map positions of the genes are based on the M,s of the peptides synthesized by minicells containing plasmids with multiple deletions of the glnALG DNA region (not shown). The locations assigned to these genes are in agreement with those reported in the literature (Backman et al., 1981; Pahel et al., 1982; Chen et al., 1982; Ueno-Nishio et al., 1983). (B) Map of the pACR5 plasmid (Covarrubias and Bastarrachea, 1983). The dark bar represents DNA from pBR322. The single line indicates DNA derived from wild-type E. coli.
While this work was in progress, Ueno-Nishio et al. (1984) reported part of this sequence, comprising the region from position 281 to position 678 (Fig. 2).
(c) The glnA-gZnL intercistronic region Analysis of the glnA-glnl intercistronic shows four interesting sequences.
region
(b) The C terminus of glutamine synthetase From the first 322 nt we deduced the aa sequence of the C terminus of GS, as well as its adenylylation site. The deduced sequence agrees completely with the known seven C-terminal aa of GS (Ginsburg and Stadtman, 1973). Furthermore, the aa sequence around the adenylylation site, indicated in Fig. 2, is almost identical with that obtained through peptide sequence by Heinrikson and Kindgdon (1971). However, some modifications have been introduced to the reported aa sequence (R.L. Heinrikson, personal communication) which makes it identical to the deduced aa sequence given here. In Fig. 3 the deduced aa of this region obtained from E. coli and Salmonella typhimurium (Hanau et al., 1983; J. Brenchley, personal communication) were compared. Two regions are highly conserved. One, encoded from positions 92-163, has the adenylylation site; the other, encoded from positions 260-325, contains the C terminus. This could be relevant in delimiting the functional domains of the enzyme.
(1) The REP sequences Two regions are highly homologous to the consensus REP sequence (Stern et al., 1984). One of them is located in the upper strand at positions 395-430, and the other is located in the lower strand at positions 423-454 (Fig. 2). The localization of the REP sequence found here is similar to that found in various operons from E. coli and S. typhimurium (Stem et al., 1984). In Fig. 4 the REP sequences we found are compared with the consensus (Higgins et al., 1982a; Stem et al., 1984). An 8-bp overlap exists between the two REP sequences. Two possible RNA secondary structures can be formed which have a free energy of -23.9 kcal/mol and -28.5 kcal/mol, respectively (Fig. 4). This suggests that either one of these structures can be formed in vivo. Stem et al. (1984) proposed that the REP sequences are involved in chromosome structure and organization. We do not know as yet if the sequences described here play any role in regulating the expression of the glnALG operon. However, the analysis of
94
so
. . .
. . .
. . .
. . . TTG GCC ATG G:\C ACG AAA CGA CGC GAC GAC ‘TAC CCG CCA G/U 100
Lys AAG TTC
Ile XTC T:\G
His Pro CA’C CCG GT.~ Gee
Gly GGC ccG
Glu GAA CTT
.A-\ia blct GCC ATG cGG TAC
Asp GAC CTG
Lys AAA ~77
Asn AAC TTG
CT,1 CC;\
I‘AG TTC ‘I’TG
Glu GAA CTT
Lys AAA rri‘
AMP Leu CTG GAC
‘Tbr TAT ATA
Asp CAC CTG
Leu CTG GAC
Pro CCC Gee
Pro CC,\ GGT
Glu. (;,\A 07
Ala CCC CGC
? I)0
150 Pro CCA GGT
Gln cAG GTC
Glu Ile GAG ATC c,rC ‘rA(;
Val
Ala
Gly
Ser
Lcu
Glu
CAA
CGT
CCG
AGA
GAC
CTT
GTT GcA GcicII~ CTG m4
Glu GAA CTT
Ala GCA CGT
Leu 0-G GAC
Asn AAC TTG
Glu GAA CTT
I.eu CT(; GAC
Asp GAT CTA
Leu CTG GAC
Asp GAc CTG
Arg Glu CGC (;Ac GCG (:TC
Phr Leu 7’7x1 c,rG AAG GAC
l.ys AAA TTT
Ala CCC CGG
Leu Arg CTG CGT G.4C GCA
Arg CGC GCC
Glu CAA CTT
Asp GAT CTA
Arg Val CGC GTG GCG CAC
LSO Giy Giy Val GGT GGC GTG CC.4 CCG CAC Hinf
Phe TTC AAG
Thr ACT TGA
Asp GAC CTG
Glu GAA CTT
Ala GCA CGT
IIe ATT TAA
Asp CAT CTA
Ala Tyr GCG TAC CCC ATG
Ilc ATC TAG
Ala GCT CGA
Glu GAA CTT
Asp GAC CTG
350
3uo
1
Arg Met Thr Pro His CGT ATG .4CT CCG CAT GCA TAC TGA GGC GTA
Pro Val CCG GTA GGC CAT
Glu Phe GAG TTT CTC AAA
Glu GAG CTC
Leu CTG GAC
Tyr TAC ATG
Tyr TAC ATC
Scr AGC TCG
Val GTC CAG
End TAA ATT
GTGTTTTAGTTGCCGTGGAAACTTTTC CACAAAATCAACGGCACCTT’I’GAAAAG
400
4
GCCTGTCTCTGGCAGGCCTGGCATCGGTGGCAAGCACATCACGCCGGATGCGACGCA~ATGCG’rCTTATCCGGCCTA~ACG(;‘rGATG~TGT CGGACAGt\GACC(;TCCG(;f\CCCTA(;CCACCtiT’~C(;TGTAGTGCGGCC’rACGC’rGC(;T’rTACG(~A(:AA’rAG~;(:CGGA’rGTGCCACTACTACA k 450
Hinf -
I
,
500
P
1
GGTAGGCCGGr\GCAGG’;‘GAGTCGCTCTCCAACGTGA~GT’rTG’rCAG~TATCTG,r.4G~CCATC’rC’rG~A’rGG~;CTTT~TTCTCCGTC~AI.TC CCATCCGGCCTCGTCCACTCAGCGAGAGGTTCCACTTCACTTCAAACAGTCGA’rAGACA’~CGGGTAGAGA(~G’rACCCGAAAAAAGAGGCAGTTAAG
600
550
.
4
* Met Ala Thr Gl ATG GCA ACA GG TAC CGT TGT cc
CACTAAAATGGTGCAACCTGTKmACTGCTTT GTGATTTTACCACGTTGGACAAGTCCKKACGAAA
TCTGATGCTTCGCGCTTTTTATCCGTAAAAAG AGACTACGAAGCGCGAAAAATAGGCATTTTTC 650
Y Thr Gln Pro Asp Ala Gly Gln Ile Leu Asn Ser Leu Ile Asn Ser Ile Leu Leu Ile Asp c XCG CAG ccc GAT GCT GGG CAG ATC CTC AAC TCG CTG xrr AAC AGT ATT TTG TTA ATC GAT G TGC GTC GGG CTA CGA CCC GTC TAG GAG TTG AGC GAC TAA TTG TCA TAA A.. . . . . . . . . .
REP2
t,, Pg!&
Fig. 2. Nucleotide The REP sequences
sequence
of the glnA 3’ end, the ghA-glnL
are shown by the solid arrows.
by the open facing arrows.
The possible
NR, binding site. The wavy line indicates their respective Heinrikson presumptive
codons.
intercistronic
The inverted
repeat
region and the glnnt 5’ end. Numbering in the rho-independent
-10 region of the glnL promoter the putative
RBS. The deduced
is boxed. The hatched aa sequences
glnA terminator
1971). The drawing (tA), glnL promoter
below the nt sequence
shows
(PRlnL.), and NR, binding
the relative
site (O,,n,).
is from the 5’ end. sequence
bar shows location
for the correct
Symbol AMP points to the tyrosyl residue to which AMP is covalently
and Kingdon,
terminator-like
bound
location
reading
is indicated
of the postulated
frames are shown above
(Shapiro
and Stadtman,
of the REP sequences,
1968; and the
95
E.C. S.t.
CG SC
Gly
Glu
Ala
Met
Asp
Lys
Asn
Leu
AMP I Tyr
Asp
Leu
Pro
Pro
Glu
Glu
Ala
Lys
Glu
Ile
Pro
Gln
Val
Ala
GGC
GAA
GCC
ATG
GAC
AAA
AAC
CTG
TAT
GAC
CTG
CCG
CCA
GAA
GAA
GCG
AAA
GAG
ATC
CCA
CAG
GTT
GCA
tt*
l
***
***
l
**
GCG
AAA
GAG
ATC
CCA
*** CAG
* GTA
::G
Glu GAA *** GAA
Glu GAA *** GAA
Asp GAT l *t GAT
Asp GAC ttt GAC
Arq CGC *** CGC
Val GTG t** GTG
Arq CGT *** CGT
-%G
CG;
AG:
Arq
Ser
.:;:
6;:
1::
::;
:;a
;:I
k::
:;;;
:;a
::G
2::
t::
Gly
Ser
Leu
Glu
Glu
Ala
Leu
Asn
Glu
Le"
Asp
Leu
Asp
Arg
E.C.
GGC
TCT
CTG
GAA
GAA
GCA
CTG
AAC
GAA
CTG
GAT
CTG
GAC
CGC
66bp
s.t.
::T
;:;
:;z;
5:;
AG: A+g
AGC Ser
GCT Ala
TGG Trp
A:C Asn
GC: Ala
CTG Leu
AAA Lys
CT: Leu
GAA Glu
n.d.
Met
Thr
Pro
His
Pro
Val
Glu
Phe
Glu
Leu
Tyr
Tyr
Ser
Val
End
ATG
ACT
CCG
CAT
CCG
GTA
GAG
TTT
GAG
CTG
TAC
TAC
AGC
GTC
TAA
l *t
l t
l *.
l *
ttt
l *t
l **
t**
tt*
l t*
**t
t**
***
t*
l **
ATG
ACC
CCG
CAC
CCG
GTA
GAG
TTT
GAG
CTG
TAC
TAC
AGC
GTT
TAA
E.C.
s.t.
E.C.
**
172bp
Y AGCCCATCTCTGCATGGGCTTTTT
158bp
AGCCCATCCCAAGATGGGCTTTTT
**t**t**
*
CACTAAAATGGTGCAACCTGTTC
TCTCCGTCAATTCTCTGATGCTTCGCGCTTTTTATCCGTAAAAAG t**tt *** l **.** l ***t*tt*tt
l
t***t*tt***
A?%??ACTGCTTT
s.t.
E.C.
s.t.
"et
Ala
Thr
Gly
Thr
Gln
Pro
Asp
Ala
Gly
Gl"
Ile
Leu
Asn
Ser
Leu
Ile
Asn
Ser
ATG et* ATG
GCA t** GCA
ACA
GGC *a* GGC
ACG
CAG **a CAG
CCC l ** CCC
GAT l tt GAT
GCT l ** GCT
GGG l ** GGG
CAG *** CAG
ATC l ** ATC
CTC
AAC
TCG
CTG
ATT
AAC
AGT
**(t
l *
l *t
CTC
AAT
TCG
T;A
:;C
;\A;
::C
l
AGC Ser
Fig. 3. Comparison
between
l
ATA Il.2
of the glnA 3’ end, the glnA-glnL intercistronic
the nt sequences
and that of S. typhimurium (S.t.)DNAs. Only the sequences
of the antisense
strands
region and the glnL 5’ end of E. coli (E.G.)
are shown for both cases. The deduced
codons for the E. coli sequence
for the correct reading frames are shown immediately
above their respective
The asterisks
A 66-bp DNA region has not been sequenced
region
indicate
identities
in the E. coli sequence
between
sequences.
downstream
the grnA gene shows
very few homologies
S. typhimurium. For other symbols see Fig. 2. The sources ofthe S. typhimurium nt sequences for the first 113-bp, and Hanau
CONSENSU3
Gee+
ATG
aa sequences
and below for S. typhimurium.
(nd.) for S. typhimurium. A 172-bp
to the corresponding were J. Brenchley
158-bp
(personal
sequence
in
communication)
et al. (1983) for the rest of the sequence.
. cG~cG$
($GFCTTATC~GGCCTAC AG =-
Kcol mold
23.9
GAU REP2
3’-
GCCGG I 423
ATG
TGCCACTACTACACC
ATCCGGCCT-5’
t
I
:
G
C-G’ A-U C-G’
454
ig
E”c G-C 5:
C-G C-G
UAU-AGC-3’
dG = - 28-S
Fig. 4. The REP sequence with the REP consensus
in thegM-glnL sequence
of Fig. 2. Possible RNA secondary and Borer et al. (1974).
intercistronic
(Higgins structures
region. The REP sequences
Kcal mol -’
found in this region (REP1 and REPZ) are compared
et al., 1982a; Stern et al., 1984). Numbers
below indicate
are also shown. The dG values were calculated
according
their position to Cantor
in the nt sequence
and Schimmel(l980)
the nt sequence intercistronic
of the S. typhimurium
dent terminator
gInA-glnL
region
stem and loop structure
does not show any sequence to the REP sequences. This could sug-
homologous
gest that they are not relevant
for the expression
1978) with a theoretical
of
it is followed
the operon. This type of sequence is also absent from
involvement
the upstream
duction
region of the gbzA gene.
sequence
located
(Adhya
and Gottesman,
LIG of - 13.5 kcal/mol
by a row of six T’s of a termination
of transcription
process
observed
and
(Fig. 2). The in the re-
between
the g/nA
and glnG genes have been suggested (Pahel et al., 1982; Reitzer and Magasanik, 1983). The experi-
(2) A possible glnA terminator The
since it might form a G + C-rich
at
positions
(Fig. 2) has all the characteristics
498-522
ment shown in Fig. 5 demonstrates
of a rho-indepen-
gen limitation,
that, under nitro-
the main glnA transcript
z
h,
I!
is approx.
E 8 t c
0
I
1624 Fig. 5. Northern by Maniatis
blot hybridization.
hybridization
were carried
out as described
glnA, labeled with [32P]dCTP dpm/pg.
of agarose
by Thomas
by nick translation.
gels containing
was added
2.2 M formaldehyde
to all buffers.
rotavirus
(SAE
The probe was used at a concentration
The transcription
M. Rocha,
P. Leon, F. Bastarrachea structure
et al., 1980a; Osorio wavy lines indicate
Arrows indicate
markers:
and A.A. Covarrubias,
submitted
here. The restriction
et al., 1984). The initiation the DNA and the mRNA,
for publication).
sites were localized
Distances
through
filter
internal
to
of 5 x 10’
et al., 1984) was hybridized
in the lower part shows primer extension
RNA from
the predicted
glnA
(A. Garciarrubio,
The stop codon (TAA) and the Rho-independent by conventional
codon (ATG) was taken from Covarrubias respectively.
MX614 (Osorio
2904 nt and 1541 nt, and denaturated
start point (symbol I at the left end of the map) was obtained
(r+,) are described
fragment
of 10 ng/ml and a specific activity
strain
E. colirRNA:
were as described
as well as the nitrocellulose
the 700-bp BarnHI-EcoRI
11): 3700 nt, 1620 nt, 1350 nt, 1120 nt and 660 nt. The scheme
transcript.
terminator-like
fragment.
and electrophoresis
The gel transfers
(1980). We used as a probe
4 pg of total RNA were used per gel slot. Total RNA from the wild-type
against the EcoRI-BumHI-generated human
Preparation
et al. (19X2), except that 2 mM NaKHPO,
restriction
and Bastarrachea
are in bp and the size of the transcript
mapping
(Covarrubias
(1983). The straight is in nt.
and
97
1640 nt long. The existence of a largely preferred site for transcription termination is also suggested since transcripts of other sizes are not evident. Assuming an average M, of 120 per aa residue, the coding requirement for the GS monomer (M, = 55 000; Bender and Streicher, 1979) is approx. 1375 bp. With these considerations in mind, the 3’ end of the 1640-nt transcript should be located not farther than 300 bp after the GS stop codon. A more precise calculation based on the exact location of the 1640-nt transcript 5’ end (A. Garciarrubio, M. Rocha, P. Leon, F. Bastarrachea and A.A. Covarrubias, submitted for publication) allowed us to localize the 3 ’ end of this transcript within 40 nt of the site where the terminator-like sequence is found. Although a more detailed study is needed it seems probable that this sequence corresponds to a functional terminator. This terminator sequence shows high homology with the one found in the S. typhimurium intercistronic region (Hanau et al., 1983; Fig. 3). However, the structure found in S. typhimurium should be more stable than the one found in E. coli since the theoretical free energy of the former is -16.1 kcal/mol. (3) The presumptive glnL promoter Genetic data strongly suggest the existence of a promoter between glnA and glnL (Chen et al., 1982, Goldie and Magasanik, 1982; Ueno-Nishio et al., 1983). Indeed we found a conserved Pribnow box and a putative -35 sequence in this region (from positions 542-573). Recently, Ueno-Nishio et al. (1984) have reported the location of the glnL transcription start site at position 579,6 bp downstream from the conserved Pribnow box shown in Fig. 2. This supports the role of this sequence (TATAAT) as part of the glnL promoter. -..+
c-c-
l
5’. TTGCACCAACATGGTGCTTAATGmGAAGCACTATATTGGTGCAAC ..**** . t.fft.ff. ..**, .f ..*..*, 5’. TGCACTAAAATSGTGCAAC 5’. TGCACTAAA>ITGGTGCAAC -3’ **577, 191 511 -.
Fig. 6. Comparison
between
present
and glnL control
sequence
in the &A
above corresponds
bias and Bastarrachea,
the possible
below refer to the positions
binding
sites
The nucleotide
to theglnA control region (Covarru-
1983). The asterisks
with the 19-mer found in the glnL control dyad symmetries.
Plnl
591
NR,
regions.
UN&
3’ -3’
indicate
identities
region. The numbers
in Fig. 2. The arrows
indicate
the
The -10 region of one of the gInA promoters
is boxed (A. Garciarrubio, and A.A. Covarrubias,
M. Rocha, submitted
P. Leon, F. Bastarrachea
for publication).
(4) A possible NR, binding site A 19-bp sequence that contains an inverted repeat was found close to the presumptive glnL promoter (from positions 573-591). The same sequence is also found in the glnA control region as shown in Fig. 6. In addition, this sequence is also present in the S. typhimurium dhuA promoter (Higgins et al., 1982b), which is subject to nitrogen regulation (Kustu et al., 1979b; Higgins and Ames, 1981; Stern et al., 1984). The fact that the glnL as well as the glnA promoters are repressed by NR, (Goldie and Magasanik, 1982; Ueno-Nishio et al., 1983; Wei and Kustu, 1981; Pahel et al., 1982) is supported by data which show that NR, is able to bind specifically to a DNA fragment containing the glnL promoter (Reitzer and Magasanik, 1983) or the glnA promoter (A. Garciarrubio, M. Rocha, P. Leon, F. Bastarrachea and A.A. Covarrubias, submitted for publication). In addition, the finding that a DNA fragment containing this 19-bp sequence is protected from DNase I digestion (Ueno-Nishio et al., 1984) further supports the role of this sequence in DNA recognition by NR, . Interestingly, we found two possible NR, binding sites in the glnA control region. These two 19-mer sequences are part of a 5 1-bp region which contains an imperfect dyad symmetry (Fig. 6). The differences between the double glnA and single glnL NR, binding sites could mean that NR, binds with a distinct conformation to the glnA and glnL regulatory regions. As shown in Fig. 3, these 19-mers are also present, at the same positions, in the S. typhimurium glnAglnL intergenic sequence. (d) Expression
of the glnL gene
A number of observations suggest that, under nitrogen-limiting conditions, the glnL protein level is lower than that of NR, even though both genes are translated from the glnALG transcript (Pahel et al., 1982; Goldie and Magasanik, 1982; Guterman et al., (1982). Post-transcriptional and translational control mechanisms have been proposed to explain the diminished synthesis of proteins encoded by internal genes in various E. coli operons (Barry et al., 1980; Dean and Nomura, 1980; Gold et al., 1981; Smiley et al., 1982; Gouy and Gautier, 1982). The lack of an efficient RB S was discarded since a recognizable AGGAG sequence (Fig. 2; Shine and Dalgamo, 1974) precedes the glnL gene. Also, there are
no si~~~cant stem and loop structures that could hide this RBS. The most stable one (at positions 566-615) has a theoretical dG of -5.7 kcal/mol. However, the possibility that a protein could stabilize one of these structures cannot be discarded. Another mechanism to maintain the glnL protein at low levels might depend on a high incidence of infrequently used codons. We are presently exploring this possibility.
Chcn, Y.M., Backman, regulation teriol.
the product
of nitrogen A.A., Rocha,
co/i K-12. Gene Covarrubias,
M., Bolivar,
cherichiu coli gexles involved
R.. Osorio, A., Bolivar, l-.
hybrid
F.: Nucleotide
nation
Annu. Rev. Biochem.
Aiba, H., Adhya,
B.: Evidence
for two
in intact Escherichicr co/i cells. J. Biol.
functionalgalpromoters
Chem.256(1981)11905-11910. Alvarez-Morales, negative
A., Dixon,
M.: Feedback
control
of the grnA m&C
~~e~~o~zj~e. EMBO Backman,
K., Chen,
M.: Positive
and
in K~ebsiellu
regulon
Magasanik,
B.: Physical
and
of the g&A-ginG region of the Esche-
genetic characterization richiu c&chromosome.
Proc. Nati. Acad. Sci. USA 78 (1981)
Deininger,
P.L.: Random
tion to shotgun
subcloning
DNA sequence
ofsonicated analysis.
K., Chen, Y.M., Ueno-Nishio.
The product nitrogen Barry,
ofglnL is not essential
assimilation.
G., Squires,
cessing
S. and Magasanik, for regulation
J. Bacterial.
C. and Squires,
of bacterial
Attenuation
Espin, G., Alvarez-Morales, tation analysis
Natl.
Acad.
Sci.
IJSA
unit of 77 (1980)
R.A. and Streicher,
lation, adenyly~ation poiyacryiamide
S.L.: Glutamine
synthetase
state, and strain specificity
gel etectrophoresis.
J. Bacterial.
regu-
analyzed
by
137 (1979)
.A. and Merrick,
of@&-linked
mutations
CM. and Magasanik,
P.N., Dengler,
Stability
B., Tinoco
of ribonucleic
Jr., 1. and Uhlenbeck,
acid double-stranded
O.C.:
helices. J. Mol.
CR.
and
Vol. 111. Freeman,
Schimmel,
P.R.:
San Francisco,
Biophysical
M.: Complemen
which atfect nitrogen
B.: Involvement
of the product
regulation
of glutamine
in KlebsielIa aerogenes. J. Bacterial.
mrmation
E.. Bancroft,
glutamine Ginsburg,
S., Rhee, S.G. and Kustu,
133
S.: The product
gene, glnF, is required
for synthesis
in Sulmonella. Proc. Natl. Acad.
synthetase
A. and Stadtman,
E.T.: Regulation
in Escherichia coli, in Prusiner,
thetase
of Sci.
of Glutamine
Chemistry,
1980, pp. 1201-1210.
of glutamine
syn-
S. and Stadtman,
E.R.
Metabolism.
Academic
Press. New York. 1973, pp. 9-43. Gold, L., Pribnow,
D., Scheir. T., Schindling,
G. Translat~~~nal
Rev. Microbioi. Goldie,
initiation
H. and Magasanik,
B.: Effects ofglnL andother
suppression
tations.
J. Bacterial.
R.. Koduri,
Higgins,
150 (1982) 1314-1321.
t~ph~uri~~. J. Bacterial.
155 (1983) 82-89.
H.S.: Primary
synthetase,
structure
II. The complete
of a tryptic heneicosapeptide
containing
ofEs&eamino acid covalently
acid. J. Biol. Chem. 246 (1971) 1099-1106.
C.F. and Ames, which interact
show extensive
J.: Nucleotide
region for the g!nA and g[rrL genes of
R.L. and Kingdon.
adenylic
in the ,&A by rho mu-
phenotype
R.K., Ho, N. and Brenchley,
richiu coli glutamine bound
G. and Tyler, B.: Polarity of the Reg
of the control
Salmonella
correlation
Nucl. Acids Res. 10 (1982) 7055-7074.
S.K., Roberts.
operon:
regulato-
150 (1982) 231-238.
C.: Codon usage in bacteria:
with gene expressivity.
Heinrikson,
Annu.
of gInA-lac2 fusions in
of transcription
Gouy. M. and Gautier,
Hanau,
S., Singer B.S. and
in prokaryotes.
35 (1981) 365-403.
ry loci on regulation
proteins
Biol. 86 (1974) 843-853. Cantor,
I29
1329-1338.
sequence
1000-1007. Borer,
DNA: applica-
Anal. Biochem.
213-217. Gaillardin,
sequence
3331-3335. Bender,
Sci.
fixation in KlebsieNapneurnc~nicle.Mol. Gen. Genet. 184 ( 198 1)
Guterman.
and pro-
of RNA from the rpLJ1 -rpoBC transcription
Escherichia coli. Proc.
B.:
154 (lY83) 516-519. CL.:
Acad.
(1983) 216-223.
Klehsiella uerogenes. J. Bacterial.
3143-3747. Backman,
of r-protein
Natl.
USA 77 (1980) 3590-3594.
Stormo,
J. 3 (1984) 501-507. Y.M. and
regulation
in Escherichiu coli. Proc.
(Eds.), The Enzymes R. and Merrick,
01 190
US.4 74 (1977) 1662-1666.
47 (1978) 967-996.
S. and de Crombrugghe,
sequence
171-175.
of a newly identified termi-
ET-
of’giutamatc
of E~~~lerj~hiflcoli. Mol. Gen. Genet.
gene expression
Garcia. of transcription
containing
3 ( 1980b) 1X-164.
Plasmid
D. and Nomura,
(1978)
M.: Control
plasmid
in the biosynthesis
A.A. and Bastarrachea.
synthetase
S. and Gottesman,
F.:
239-251.
F.: ColEI
and glutamine.
F. and Bastarrnchea,
of the ginF gent in the autogenous
Adhya,
in the
of the &A gene ofE.d~eric~hitr
mapping
1 I (1980a)
and Bastarrachea,
(1083)
REFERENCES
is involved
in &sc+zrrichic~ &i. J. Bat-
AA., Sanchez-Pescador,
the glm4 control
We thank Drs. J. Leemans, R. Palacios, F. Sanchez and L. Segovia for their critical review of this manuscript, P. Leon for her enthusiastic collaboration during this work, Dr. F. Bastarrachea for valuable discussions, and Ms. E. Miranda and D. Cuellar for typing the manuscript. This work was supported by Consejo National de Ciencia y Tecnolo&a (Mexico) grant PCCBBNA-020413.
utiiization
Cloning and physical
Dean,
B.: Ctlaractcrii,;ltt~}il
of which
150 (1982) 21 I-220.
Covarrubias,
Covarrubias, ACKNOWLEDGEMENTS
K. and Magasanik.
of a gene. &L,
G.F.-L.:
homology:
Proc. Natl. Acad.
Two periplasmic
with a common complete
membrane nucleotide
Sci. USA 78 (1981) 6038-6042.
transport receptor sequences.
99 Higgins, C.F., Ames, G.F.-L., Barnes, W.M., Clement, J.M. and Hofnung, M.A.: A novel intercistronic regulatory element of procaryotic operons. Nature 298 (1982a) 760-762. Higgins, C.F., Haag, P.D., Nikaido, K., Ardeshir, F., Garcia, Cl. and Ames, G.F.-L.: Complete nucleotide sequence and identification of membrane components of the histidine transport operon of S. typhimurium. Nature 298 (1982b) 123-121. Krajewska-Grynkiewicz, K. and Kustu, S.: Regulation of transcription of girrA, the structural gene encoding glutamine synthetase, in ginA :: Mud1 (ApR, lac) fusion strains of Salmonella typhimu~um. Mol. Gen. Genet. 192 (1983) 187-197. Krajewska-Grynkiewicz, K. and Kustu, S.: Evidence that nitrogen regulatory gene ntrC of Salmonella typhimurium is transcribed from the glnA promoter as well as from a separate ntr promoter. Mol. Gen. Genet. 193 (1984) 135-142. Kustu, S., Burton, D., Garcia, E., McCarter, L. and McF~land, N.: Nitrogen control in Salmonella: regulation by theglnR and glnF gene products. Proc. Natl. Acad. Sci. USA 76 (1979a) 4576-4580. Kustu, S., McFarland, N.C., Hui, S.P., Esmon, B. and Ames, G.F.-L.: Nitrogen control in SalmoneRa typhimurium: corregulation of synthesis of glutamine synthetase and aminoacid transport systems. J. Bacterial. 138 (1979b) 218-234. Leonardo, J.M. and Goldberg, R.B.: Regulation of nitrogen metabolism in glutamine auxotrophs of Klebsiellapneumoniae. J. Bacterial. 142 (1980) 99-l 10. MacNeil, T., MacNeil, D. and Tyler, B.: Fine structure deletion map and complementation analysis of the gin-gel-ginG region in~~~~erj~~a cub. J. Bacterial. 150 (1982a) 1302-1313. MacNeil, T., Roberts, G.P., MacNeil, D. and Tyler, B.: The products of glnL and glnG are bifunctional regulatory proteins. Mol. Gen. Genet. 188 (1982b) 325-333. McFarland, N., McCarter, L., Artz, S. and Kustu, S.: Nitrogen regulatory locus ‘*g&R” of enteric bacteria is composed of cistrons n&B and nnC: identification of their protein products. Proc. Natl. Acad. Sei. USA 78 (1981) 2135-2139. Magasanik, B.: Genetic control of nitrogen assimilation in bacteria. Annu. Rev. Genet. 16 (1982) 135-168. Maniatis, T., Fritsch, E.F. and Sambrook, J.: Molecular Cloning. A Laboratory Manual. Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 1982, pp. 324-328. Merrick, M.J.: A new model for nitrogen control. Nature 297 (1982) 362-363. Messing, J., Crea, P. and Seeburg, P.H.: A system for shotgun DNA sequencing. Nucl. Acids Res. 9 (1981) 309-321. Messing, J. and Vieira, J.: A new pair of M 13 vectors for selecting either DNA strand of double-digest restriction fragments. Gene 19 (1982) 269-276. Osorio, A.A., Servin-Gonzalez, L., Rocha, M., Covarrubias, A.A. and Bastarrachea, F.: c&dominant, glutamine synthetase constitutive mutations of Eschertchia coli independent of activation by the glnG and gbtF products. Mol. Gen. Genet. 194 (1984) 114-123. Pahef, G., Zelenetz, A.D. and Tyler, B.: g&Bgene and regulation ofnitrogen metabolism by glutamine synthetase in Escherichia coli. J. Bacterial. 133 (1978) 139-148.
Pahel, G. and Tyler, B.: A new g/d-linked regulatory gene for glutamine synthetase in Escherichia coli. Proc. Natl. Acad. Sci. USA 76 (1979) 4544-4548. Pahel, G., Rothstein, D.M. and Magasanik, B.: Complex gmAglnL-g&G operon of Eschertchia c&i. J. Bacterial. 150 (1982) 202-213. Prival, J.M. and Magasanik, B.: Resistance to catabolite repression of histidase and proline oxidase during nitrogen limited growth of Klebsiella aerogenes. J. Biol. Chem. 246 (1971) 6288-6296. Prival, J.M., Brenchley, J. and Magasanik, B.: Glutamine synthetase and the regulation of histidase formation in Ktebsiella aerogenes. J. Biol. Chem. 248 (1973) 4334-4344. Reitzer, L.J. and Magasanik, B.: Isolation of the nitrogen assimilation regulator NR,, the product of the gZnG gene of Escherichja cob. Proc. Natl. Acad. Sci. USA 71 (1983) 1342-1346. Rothstein, D.M., Pahel, G., Tyler, B. and Magasanik, B.: Regulation of expression from the glnA promoter ofEscherichia colt’ in the absence ofglutamine synthetase. Proc. Natl. Acad. Sci. USA 77 (1980) 7372-7376. Sanger, F., Nicklen, S. and Co&son, A.R.: DNA sequencing with chain terminating inhibitors. Proc. Natl. Acad. Sci. USA 74 (1977) 5463-5467. Shapiro, B.M. and Stadtman, E.R.: 5’-Adenylyl-0-tyrosine: the novel phosphodiester residue of adenylylated glutamine synthetase from Escherichia cob. J. Biol. Chem. 243 (1968) 3769-3771. Shine, J. and Dalgarno, L.: The 3’-terminal sequence OfEscherichia coli 16s ribosomal RNA: complementarity to nonsense triplets and ribosome binding sites. Proc. Natl. Acad. Sci. USA 71 (1974) 1342-1346. Smiley, B.L., Lupski, J.R., Suet, P.S., McMacken, R. and Godson, G.N.: Sequences of the Escherichia cob dnaG primase gene and regulation of its expression Proc. Natl. Acad. Sci. USA 79 (1982) 4550-4554. Stern, M.J., Ames, G.F.-L., Smith, N.H., Robinson, EC. and Higgins, CF.: Repetitive extragenic palindromic sequences: a major component of the bacterial genome. Cell 37 (1984) 1015-1026. Thomas, P.S.: Hyb~d~zation of denaturated RNA and small DNA fragments transferred to nitrocelfulose. Proc. Natl. Acad. Sci. USA 77 (1980) 5201-5205. Ueno-Nishio, S., Backman, KC. and Magasanik, B.: Regulation of the g&,-operator-promoter of the complex gInAL G operon of Escherichia colt. J. Bacterial. 153 (1983) 1247-1251. Ueno-Nishio, S., Mango, S. Reitzer, L.J. and Magasanik, B.: Identification and regulation of the g&L operator-promoter ofthe complex glnALG operon ofEscherichia coli. J. Bacterial. 160 (1984) 379-384. Wei, G.R. and Kustu, S.: Glutamine auxotrophs with mutations in a nitrogen regulatory gene, nrrC, that is near gInA. Mol. Gen. Genet. 183 (1981) 392-399. Woolfolk, CA., Shapiro, B.M. and Stadtman, E.R.: Regulation of glutamine synthesis, I. Puritication and properties of glutamine synthetase of Escherichia colt. Arch. Biochem. Biophys. 116 (1966) 177-192. Communicated by F. Bolivar.