49
Gene, 69 (1988) 49-57 Elsevier GEN 02533
Isolation and characterization Aspergillus nidulans (Recombinant cloning;
DNA;
gene libraries;
of the glyceraldehyde-3-phosphate
heterologous
hybridization;
gene amplification;
dehydrogenase
intron;
nucleotide
gene of
sequencing;
cDNA
phage 1 vector)
Peter J. Punt, Maria A. Dingemanse,
Brigit J.M. Jacobs-Meijsing,
Peter H. Pouwels and Cees A.M.J.J. van
den Hondel TN0 Medical Biological Laboratory, Rijswijk (The Netherlands) Received
6 February
Accepted
2 May 1988
Received
by publisher
1988 18 May 1988
SUMMARY
The isolation and characterization of the highly expressed glyceraldehyde-3-phosphate dehydrogenase (GPD)-coding gene (gpdA) of Aspergillus nidulans is described. The gene was isolated from an A. nidulans 13 gene library with a Saccharomyces cerevisiae GPD-coding gene as a probe. Unlike many other eukaryotes, A. nidulans contains only one GPD-coding gene. At the amino acid level, homology with other GPD enzymes is extensive. The A. nidulans gene contains seven introns, one of which is positioned in the 5’-untranslated part of the gene. The major transcription start point is found at 172 bp upstream from the start codon. Polyadenylation occurs at several sites about 200 bp downstream from the stop codon. Comparison of 5’ and 3’ flanking sequences with flanking sequences of other highly expressed (glycolytic) genes shows several regions of similar sequence.
Glyceraldehyde-3-phosphate dehydrogenase (GPD; EC 1.2.1.12) plays a central role in glycolysis and gluconeogenesis. In glycolysis it converts glyceraldehyde-3-phosphate into biphosphoglycerate, in gluconeogenesis it catalyses the reverse reaction. Much is known about the structure of the enzyme in
different organisms (Harris and Waters, 1976; Skariynski et al., 1987). Also, the nucleotide sequence of the GPD-coding genes of many prokaryotes and eukaryotes has been determined (e.g., Holland and Holland, 1980; Stone et al., 1985a; Tso et al., 1985b; Michels et al., 1986; Yarbrough et al., 1987). Comparison of the structure of the different GPD enzymes and of the nucleotide sequence of
Correspondence 10: Dr. P.J. Punt, TN0
gene coding
INTRODUCTION
ratory,
P.O.
Box 45,
2280 AA
Medical
Rijswijk
Biological
(The
Labo-
Netherlands)
Tel. (015) 138777.
nucleotide(s); gene;
for GPD;
kb, 1000 bp; mc, multiple
pgk, A. nidulans phosphoglycerate
SDS, sodium
dodecyl
0.15 M NaCl + 0.015 M Na, Abbreviations:
bp, base
glyceraldehyde-3-phosphate
pair(s);
ds, double
dehydrogenase;
stranded;
GPD,
&pdA, A. nidulans
gene coding merase-coding state.
037X-l 119/88~$03.50 0 1988 Elsewer
Science Publishers
B.V. (Biomedical
Division)
for GPD;
sulfate; citrate
copies;
ss, single stranded;
SSC,
pH 7.6; tdh, S. cerevisiae
fpiA, A. nidulans triosephosphate
gene; wt, wild type;
nt,
kinase-coding
iso-
[ 1,designates plasmid-carrier
50
their genes among
shows
different
Yarbrough
a high degree
species (Fothergill-Gilmore,
1986;
et al., 1987).
In several species multiple up to several present
of conservation
hundreds)
in the genome
(in higher eukaryotes
GPD-coding
genes
are
(Tso et al., 1985a; Michels
et al., 1986; Yarbrough et al., 1987). In some cases presumably only one of these genes is transcriptionally
the A. nidulans gpdA gene and the amdS
selection
marker.
The strain
contains
about
nine
copies of the gpdA gene (J. Dekker, in preparation). Plasmid pFLl-33 containing the S. cerevisiae GPDcoding
gene
(gap63/tdh2;
1980) was obtained
Holland
and
Holland,
from Dr. L.E. Edens.
(b) Gene libraries
active, the other copies being pseudogenes
(Hanauer
and Mandel,
1984; Fort et al., 1985). In
several other cases GPD tiple genes pressed
contains
which
(McAlister
is synthesized
sometimes
from mul-
are differentially
and Holland,
ex-
1985; Tso et al.,
1985b). In S. cerevisiue and rat muscle up to 5y0 of the total amount of cellular protein consists of GPD (Krebs et al., 1953; Piechaczyk et al., 1984). This implies that the expression signals of the GPDcoding gene(s) are strong, as was demonstrated by Edens et al. (1984). In our research on gene expression and gene regulation in filamentous fungi the expression signals of A. nidulans genes are being analysed (Van Gorcom et al., 1986). Both structural and functional features of these expression signals are under research. In this paper the isolation of the A. nidulans gpdA gene is described. The complete nucleotide sequence of the gene and its 5’ and 3’ flanking regions was determined. Furthermore, the nucleotide sequence of the messenger RNA was determined using cDNA clones and poly(A) + RNA as templates.
An A. nidulans 1Charon provided
AND
METHODS
(a) Strains and plasmids Escherichiu coli K-12 JM109 (Yanisch-Perron et al., 1985) was used for the construction and propagation of vector molecules. A. niduluns strain FGSC4 (Glasgow wild-type; Clutterbuck, 1986) was used for the construction of the A. niduluns cDNA library. Poly(A) + RNA from A. nidufans strain MH 1277[pAN45-lA] 1 was used as a template for mRNA sequence determination. This strain was obtained by transformation of A. nidufuns MH1277 (bti 1, amdS320, amdI18, amdA7, niiA4; Hynes et al., 1983) with plasmid pAN45-lA, which
Dr.
W.E.
4a gene library was kindly Timberlake
(Orr
and
Timberlake, 1982). A partial cDNA library was constructed using poly(A) + RNA isolated from a culture ofA. niduluns FGSC4 cultivated in minimal medium (Pontecorvo et al., 1953) with 2”$ galactose as a carbon source. The ds cDNA was prepared and cloned as described by Teeri et al. (1987). (c) DNA/RNA
manipulations
A. nidulans poly(A)‘RNA was isolated as described by Teeri et al. (1987). Primer extension experiments were performed as described for firststrand cDNA synthesis (Teeri et al., 1987). Heterologous hybridization experiments were carried out at 56°C with final washes in 3 x SSC, 0.1% SDS, 0.1% Na . pyrophosphate at 56’ C, as described by Van Hartingsveldt et al. (1987). All other DNA/RNA manipulations were carried out as described in Maniatis et al. (1982).
RESULTS
MATERIALS
by
AND DISCUSSION
(a) Isolation of the Aspergillus
nidulans gpdA gene
The A. nidulans gpdA gene was isolated from an A. nidulans FGSC4 1 library by heterologous hybridization with a DNA fragment containing one of the S. cerevisiue GPD-coding genes (gap63/tdh2; Holland and Holland, 1980) as a probe. From 25000 A clones screened, five positive clones were obtained. Restriction enzyme analysis revealed that the inserts in these clones had a 4.0-kb BglII-Hind111 fragment in common. Southern-blot analysis showed that only this fragment hybridized with the S. cerevisiae probe under heterologous hybridization conditions (results not shown), suggesting that a complete A. nidzdans gpd gene is located on the fragment.
51
Two lines of evidence confirmed nucleotide quence
sequence of a part
Fig. 2. The major part of the nucleotide
that the complete
A. nidufuns gpdA gene had been isolated. and predicted
First, the
amino
of the BgZII-Hind111
the gpdA mRNA complete
acid sefragment
revealed a clear similarity with S. cerevisiue tdh2 and other gpd sequences (for details see RESULTS AND
cDNA
sequence
was also determined,
of
using
in-
clones and poly(A) + RNA as tem-
plates.
By comparison
mRNA
sequences
of the
the presence
genomic
and
of five introns
the
could
of
be established in the 5’ part of the transcribed region of the A. niduluns gpdA gene (Fig. 3). Comparison of
several copies of the putative gpdA gene in A. nidulans resulted in an increased GPD enzyme activity. The
the genomic sequence with that of the cDNA clones revealed the presence of two additional introns in the
DISCUSSION,
section b). Second,
level of GPD enzyme
introduction
activity appeared
3’ part of the transcribed
to correlate
region (results not shown).
with the number of gene copies (J. Dekker, in preparation). This indicates that the BglII-Hind111 frag-
As can be seen in Fig. 1, a small part of the mRNA
ment contains
donor
a functional
sequence
copy of the gpdA gene
together with its expression signals. The 4.0-kb BglII-Hind111 fragment of one of the A clones was subcloned into a pBR322 derivative, containing a polylinker with a unique BglII and Hind111 site, resulting in plasmid pAN5-22 (Fig. 1). Southern-blot analysis of genomic A. nidulans DNA digested with BglII + Hind111 showed that a single band of 4.0 kb hybridized with pAN5-22 under stringent conditions (results not shown), indicating that the A. nidulans genome, unlike S. cerevisiae and many other eukaryotic genomes, contains only one GPD-coding gene.
The sequence strategy used to determine the nucleotide sequence of the gpa!A gene and its flanking regions is given in Fig. 1. The sequence is shown in
1 St
ss
intron
in the corre-
H
P l,Y_
YII
ss
E St
SS
6
H
SC
-
-
01 kb
Fig. 1. Nucleotide termination
methods
1987) as templates. represents
sequencing
restriction
Nucleotide
for the A. nidulans &pdA gene. Nucleotide
(Sanger sequences
et al., 1977) ds-DNA were analysed
the map of the 4.0-kb BglII-Hind111 regions by narrow
enzyme sites are indicated
the position,
length and direction
clones, those marked
pAN5-22.
strategy
with ss-DNA
the ?I’- and 3’-noncoding
cDNA
LlllIIP
SC
ss
No obvious
sites were found
sponding part of the nucleotide sequence. The features ofthe introns are described in RESULTS AND DISCUSSION, sectionc. The similarity between the predicted amino acid sequence of the GPD polypeptide of A. nidufans (Fig. 2) and the sequences of Nicotiana tabacum (cytosolic) (Shih et al., 1986), Drosophila melanogaster (Tso et al., 1985b), chicken (Stone et al., 1985a), man (Tso et al., 1985a), rat (Tso et al., 1985a), S. cerevisiue (Holland and Holland, 1980) and E. coli (Branlant and Branlant, 1985) is 65-70%. Between the A. nidulans GPD polypeptide and those of Bacillus stearothermophilus (Walker et al., 1980a) and Thermus aquaticus (Walker et al., 1980b) the similarity is 50-55%. In parts of the GPD polypeptide known to be essential for enzymatic activity (Harris and Waters, 1976) similarity is almost 100%. Such relatively high percentages of
(b) Structure of the gpdA gene and GPD enzyme
69
was not investigated.
or acceptor
using UWGCG
insert in pAN5-22.
bars. The introns
as follows: B,BamHI;
of the nucleotide
with ‘m’ sequences
obtained
analysis
data were obtained
1985) and poly(A)+ programs
using dideoxy
by hatched
(Devereux
determined.
from poly(A)+
bars numbered
H,HindIII;
Arrows
marked
with roman
with ‘c’ represent
et al.,
et al., 1984). The top line by open wide bars, numbers.
SC, ScaI; Ss, SsfI; St, StuI. Arrows
RNA. All other sequences
chain-
RNA (Johanningmeier
The coding region of the gpdA gene is indicated
are indicated
Bg, BglII; E,EcoRI;
sequence
sequence
(Chen and Seeburg,
sequences
were obtained
Relevant indicate
obtained
from
from subclones
of
52
-712
-532
-352
-172
369
45Y
549 ggagctaattatgtttagcteacgtcgtcgtagCCCGGGCACCATTGA .. . intron V ]A Y M L K Y
D
S
Q
H
G
Q
F
K
G
T
I
E
639 GACCTACGACGAGCCTCTTATTGTCAACGGCAAGAAGATCCGCTTCCACACCGAGCGTGACCCCGCCMCATCCCCTGGGGCCAGGACGG TYDEGLIVNGKKIRFHTERDPANIPWGQDG 729 TGCTGAATACATTGTCGAGTCCACCGGTGTCACTACCGCTTGTCAT AEYIVESTGVFTTQ EKASAHLKGGAKKVVI I319 CTCTGCCCCATCTGCTCATGCCCCTATGTTCGTCATGGGTGTC~C~~CGAGACCTAC~G~GCACATTCAGGTCCTCTCC~CGCTTC SAPSADAPHFVHCVNNETYKKDIQVLSNAS 909 ~TGCACCACCAACTGCCTTGCCCCTCTCGCC~GGTCATC~CGAC~CTTCCGTATCATCGAGGGTCTGATGACCACCGTCCACTCCTA CTTNCLAPLAKV INDNFGI IEGLHTTVHSY 999 CACTGCTACCCACAAGGTCCTCGACGGCCMGGCATCATCCCCTCCTCCAC TATQKVVDGPSAKDWRGGRTAATNIIPSST 1089 TGGTGCTCCCAAGCCTGTCGGCAAGGTCATTCCTTCGCTC~TGGC~GCTCACCGGCATGGCGATGCGTGTTCCCACCTCC~CGTCTC GAAKAVGKVIPSLNGKLTGMAMRVPTSNVS 1179 CG~GTTGACCTGACCGTCCGCACCGAGAAGGCTGTTACCTACGACCAGATC~GGATGCCGTC~G~GGCTTCTGAG~CGAGCTC~ VVDLTVRTEKAVTYDQIKDAVKKASENELK 1269 GGgtastgtgaatgtgcCtttgctgttggacaCcttcgact~actagttgntttagGCATCCTTGGCTACACCGAGGACGACATCGTCTC . Incran vI........."" JILGYTEDDIVS GI"“" 1359 TACCGACCTCAACGGTGACACCCGCTCTTCCATCTTCGATGCT~GGCGGCTATTGCCCTClZACTCCAACTTCATCAAGCTCGTTTCCTG TDLNGDTRSSIFDAKAGIALNSNFIKLVSW 1449 GTACCACMCGAGTGGGGTTACTCCCGCCGTGTTGTTGACCTCATCAgta~gtc=t=~g~~~g~tgga~c~ttttgtg~gttg~t=~~t= inCron "II.. Y D N E W G Y S R R V V D L I S[""". 1539 gcacccagCCTACATCTCCAGGTTGATGCCCAATAGgaaacagg~cgg~agccaatggccaggagctccttgtaaaa~aat~~t~~ttg "'.."I Y I S K V D A Q * 1629
Fig.2. Nucleotide sequence of the gpdA gene ofA.
nidulans and the predicted amino acid sequence. Coding regions are indicatedin
upper-case letters, all other sequencesinlower-case letters. Below the coding regions,the
the standard by roman
one-letter
numbers
code. Nucleotides
are numbered
(see Fig, l), are indicated
sites( + 1778, + 178 1,
by dotted
box (-616 underlining.
region (between
to -593)
underlining.
to the transcription
The major transcription
predicted
amino acid sequence
isgiven using
start point (+ 1). The introns,
numbered
start point ( + 1) and the polyadenylation
+ 1784) are indicated by asterisks. The putative TATA box (-52 to -47) is overlined. The C + T-rich region (-47
to -1) is overlinedwith a dashed line.The putative the 3’noncoding
with reference
showing
polyadenylation
+ 1640 and + 1730) are indicated
a clear similarity
to a sequence
signal ( + 1760 to + 1766) is underlined. by pairs of convergent
upstream
arrows. The sequence
The inverted upstream
repeats
in
of the TATA
from the TATA box of the A. nidulans pgk gene is indicated
by
53
GA
similarity
RNA
DNA T
G
C
A
T
between
different
homologous
tides have also been found
CwtmC
zymes
(Pichersky
et al.,
Forthergill-Gilmore,
polypep-
for other glycolytic 1984; Tani
et al.,
en1985;
1986).
Codon usage in the A. niduluns gpdA gene is clearly biased, with a preference for a pyrimidine in the third position.
Of all codons
that position
79% have a pyrimidine
(C:55%,
T:24%,
and when a choice between is allowed, chosen. highly
in 93%
G:20%,
in
A:2%),
a purine or a pyrimidine
of the cases
a pyrimidine
is
This bias is similar to that found for other expressed genes in lilamentous fungi
(Kinnaird
and
1983 ; Clements
Fincham,
and
Roberts, 1986; May et al., 1987) but clearly different from that in highly expressed genes of S. cerevisiae (Bennetzen and Hall, 1982).
an
(c) Introns Of the seven introns detected by sequence analysis (Fig. 1) introns II to VII interrupt the coding region of the gene, intron I is part of the 5’-noncoding region of the gene. The nucleotide sequence of the exon-intron boundaries of all introns of the gpdA
from A. niduluns MH1277[pAN45-lA]
and poly(A)‘RNA dicated
by ‘RNA’) primed
1 (in-
with an oligodeoxynucleotide
(CTCAATGGTGCCCTTGAACTGACCGTGC)
primer
complemen-
tary to a part of the coding region 3’ of intron V. In the sequence ladder
the exon-intron
indicated
boundaries
by arrowheads
ladder obtained is indicated
numbers.
lation start codon
arrowheads.
read from bottom
RNA sequence
polarity,
as CAT in the figure). In the RNA sequence structure point(s) 6
of the RNA. of aspecific
To determine
bands)
Poly(A)’
primer extension
containing
gpdA gene, was used for primer and ‘mc’ lanes extension Fig. 3. Sequence
analysis and primer extension
the transcription
start point and the position of introns.
A, T, C show the products quencing
reactions
of dideoxy
with pAN5-22
analysis to locate Lanes G,
chain-termination
DNA (indicated
se-
by ‘DNA’)
are shown.
products
multiple
extension
start
as a con-
experiments
longer exposure.
to additional The rightmost
used as size markers
were
primer used for copies
(wt) and
(mc) of the
reactions.
In the ‘wt’
from these RNA preparations
In both cases one major band was observed.
wt bands (identical products
the transcription
in the RNA sequence
RNA from A. nidulans FGSC4
MH1277[pAN45-lA]l,
to the
thus ATG is read
several bands across
out with the same oligodeoxynucleotide
sequencing.
(note that
due to stops caused by secondary
(which were obscured
sequence carried
probably
of the trans-
by an asterisk
to top, is complementary
and has the opposite
all four lanes appear,
of these introns
The position
(ATG) is indicated
are
In the sequence
with poly(A) + RNA the position
by numbered
the sequence,
of the first five introns
and roman
Minor
mc bands) can only be seen after lane A shows sequence for the primer extension
reaction products.
54
gene fit (with one or two mismatches) to the consensus sequence for fungal introns (Ballance, 1986).
region has been reported
The size of the introns
eukaryotes,
Comparison
varies from 50 to 120 bp.
of the position
GPD-coding
of introns
genes (Fig. 4) shows that intron V of
the A. nidulans gene coincides chicken
in different
GPD-coding
Furthermore,
gene
(Stone
chicken
et al., well
in
nt upstream
known. These results do not support the hypothesis that introns originally mediated exon assembly and thus are present in homologous genes at corresponding positions at boundaries of regions encoding structural domains. Such a hypothesis was proposed on the basis of analysis of different triosephosphate isomerase genes (Straus and Gilbert, 1985; Gilbert et al., 1986) and the chicken GPD-coding gene (Stone et al., 1985b). The presence of very small exons in the A. nidulans gpdA gene (too small to be structural domains) is not consistent either with this hypothesis, although introns very close to each other may have resulted from duplication and movement of early introns (Gilbert et al., 1986). The presence of an intron outside of the coding
9,
*
nidulans
e
~~~~~~~aster
Fig. 4. Position
was determined
*
of the introns
et al., 1987) and D.
sequences of the 3’ end of the gpdA mRNA by analysis
out of eight cDNA
clones
analysed,
In five
the poly(A)
found in one of the cDNA clones. A putative polyadenylation signal (AAUACA) was found 11-17 bp upstream from the poly(A) track. This sequence is related to a sequence (AAUAAA) found at comparable sites in many other eukaryotic messengers (Proudfoot and Brownlee, 1976). The 3’-noncoding sequence of the messenger shows stretches of dyad symmetry, as indicated in Fig. 2. Comparable results were obtained for other A. nidulans genes (Clements and Roberts, 1986; Ward and Turner, 1986). These sequences might be functional in 3’ processing and polyadenylation of the precursor of the messenger RNA (Platt, 1986).
4f
I 7
10
14
17
25
27
31’
227
334
*__‘-:I: in the GPD-coding
genes ofA. niduluns (this report),
chicken (Stone et al., 1985a), C. elegans (Yarbrough
melanogaster(Tso et al., 1985b). The location of the introns in the genes is indicated
A. nidulans gene. The total number region of the GPD-coding
of gpdA cDNA.
track started at nt position + 207 downstream from the stop codon (UAG), in two clones at nt position + 204 and in one clone at + 201 (Fig. 2). Thus, some heterogeneity was observed at the site of polyadenylation. The length of the poly(A) track is not known, but is at least 70 nt since this length was
*
L
et al., 1985b) coding
I
I Chicken
(Stone
(Tso et al., outside the
1985a). as
from the start codon (ATG) (Stone et al., 1985a; Tso et al., 1985b). None of the other introns in the A. nidulans gene coincides with an intron found in one of the other GPD-coding genes of which the mRNA and/or genomic nucleotide sequence is
4 21 ‘IpI
chicken
region have been observed.
The sequence
at IO-20
the
D. melanogaster genes, introns
(d) 3’-Noncoding
an intron
as
including
1985a) and GPD-coding
a
contains
the
et al., 1988). In several genes of higher
D. melanogaster GPD-coding gene the 5’ part of the gene corresponding to the untranslated region of the mRNA
in
with intron III of the
(Saloheimo
for one other fungal gene
genes.
of codons
in each gene is also given. Asterisks
indicate
the position
by the codon numbers
of the
of an intron in the 5’.noncodmg
55
(e) 5’-Noncoding
sequences
Analysis messenger
The transcription
start point(s)
transcription
(corresponding mRNA)
upstream
(ATG)
(Fig. 3).
start point is localized
to a leader
region
Some
minor
sites
172 bp
(ACAAUGG) role
start codon were
in the
eukaryotes
found
sensus
major
AAAAUGG
region of about
*****
*
of a repetitive
the
is thought
efficiency
of translation
ACCUG
1986) in
AUG
and
initiation
resembles
in higher the
genes
of
in
the coneukaryotes
consensus
glycolytic
codon
to play an important
1986) closely
(Cigan and Donahue,
50 bp was observed. This sequence is preceded by a putative TATA element (TATTTT). Similar sequences have been observed upstream from other Aspergillus and N. crussu genes (Ballance, 1986). In the highly expressed A. nidulans oliC (Ward and Turner, 1986) and N. crussa am (Kinnaird and Fincham, 1983) genes the C + T-rich region is particularly long. It has been suggested that the length of the C + T-rich region between the TATA box and the major transcription site in A. niduluns is related to the level of transcription (Ballance, 1986).
(A)
which
sequence
(Kozak,
C + T-rich
1986). around
(Kozak,
between 140 and 170 bp in front of the ATG codon. In the sequence immediately upstream from the site a prominent
presence
one to several copies of which
(Ward and Turner, The sequence
of 56 nt in the
from the translation
the
have also been found in many other A. niduluns genes
were localized by sequence analysis of the gpdA messenger and by primer extension analysis. The major
showed
oligomer (CCAUCU),
of the @dA gene
sequence of the gpdA
of the 5’-noncoding
sequence S. cerevisiae
1987).
Comparison of the sequences between the putative TATA box and the major transcription start point of three A. nidulans glycolytic genes (gp&t, this paper; pgk,
Clements
McKnight
et al.,
and
Roberts,
1986)
(Fig. 5A). Comparison the TATA
1986;
shows
and
a clear
of sequences
tpiA,
similarity
upstream
from
box of the gpdA gene and the pgk gene
shows a region of similar sequence around 500-600 nt upstream from the transcription start point (Fig. 5B).
*
****
**
**
**
A.nidulans
tpiA
+1 TTATTTTCGTCATTCCTCCTTCCCAACCTTCACTCTTCC..aGTTTCCAACT
A. nidulans
Egk
TTATTTA.
A. nidulans
pvdA
ATATTTT....
***
**
. TCCCTGGTCTCTCCCCACTAG..CTGTTCCTGCCcGTCCATCT CCTG..CTCTCCCCACCAG..CTGCTCT
-592 (B) A 2
nidulans
ppdA
TGGCGCTCTGAGGTGCAGTGGATG *
A 2
nidulans
a
*
*
*******
*
*
*
*
*
*
TGCTATTTTGAGGTGTAATGCATG
- 501 Fig. 5. Comparison of sequences of the 5’ flanking region of glycolytic genes of A. niduluns. (Part A) Comparison of the region around the transcription start point. The sequences of the gpdA (this paper), pgk (Clements and Roberts, 1986) and tpiA(McKnight et al., 1986) genes are aligned for maximal similarity by introducing gaps (indicated by dots). Nucleotides identical for all three genes are indicated by asterisks. The major transcription start point is indicated by an underlined lower-case letter. (Part B) Comparison of 5’ upstream sequences of the &p&4and pgk genes. The distance (in nt) from the transcription start point is given. Identical nucleotides are indicated by asterisks.
Clements,
(f) Conclusions
J.M. and Roberts,
signals
The dete~ination
of the nucleotide
sequence
of
the 5’ flanking region of the A. njd~~ans gpdA gene and the comparison of this sequence with that of other genes suggests the presence tory regions. Detailed
functional
gene fusions (Van Gorcom to correlate
functional
analysis
signals
using lacZ
and structural
has
Clutterbuck,
Glasgow
1986. Fungal
Devereux,
J., Haeberli,
of sequence
stock
list of Aspergilius
P. and Smithies,
analysis
plant thaumatin P., Marty,
by gpdA
Jeanteur,
demonstrated
in
express
a
gene from ~~~~~~~~
News Lett. 33 (1986) 59-69. programs
L.E., Born, J., Ledeboer,
controlled
number of Aspergirrus species (Van Gorcom et al., 1986; Punt et al., 1987; Mattern et al., 1987; Mullaney et al., 19SS), Penicilfium chvysogenum (Kolar et al., 1988), Trichoderma reesei (PenttilB: et al., 1987) and in other tilamentous fungi, showing that the expression signals of the A. nidulans gpdA gene are a useful tool in heterologous gene expression in tilamentous fungi.
(PGK)
0.: A comprehensive
set
for the VAX. Nucleic Acids
Res. 12 (1984) 387-395.
Fort,
been
A.J.:
strains
Visser, C. and Verrips,
features of the
and processing
kinase
Aspergilfus nidufans. Gene 44 (1986) 97-105.
Edens,
et al., 1986) is in progress
5’ flanking region. Heterologous gene expression expression
of several regula-
C.F.: Transcription
in the 3-phosphoglycerate
A.M., Maat. J., Toonen,
CT.: Synthesis
M.Y.,
and processing
of the
in yeast. Cell 37 (1984) 629-633.
L., Piechaczyk,
M., El Sabouty,
P. and Blanchard, only one major
J.M.: Various mRNA
species
aldehyde-3-phosphatedehydrogenase
S., Dani, C.,
rat adult tissues from
the glycer-
multigenic
family, Nu-
cleic Acids Res. 13 (1985) 1431-1442. Foth~rgill-Gilmore,
L.A.: The evolution
ways. Trends Gilbert,
Biochem.
W., Marchionni,
of introns. Hanauer,
path-
M. and McKnight,
G.: On the antiquity
Cell 46 (1986) 151-154.
A. and Mandel.
dehydrogenase
J.L.: The glyceraldehyde
gene family: structure
and of an X-chromosome plexity
of the glycofytic
Sci. 11 (1986) 47-51.
3 phosphate
of the human
linked pseudogcne~
of the gene family
in mouse.
cDNA
amazing
EMBO
com-
J. 3 (1984)
2621-2633. Harris,
J.I. and Waters,
hydrogenase. Academic
ACKNOWLEDGEMENTS
Holland,
We gratefully acknowledge the valuable advice and assistance with regard to the construction of the cDNA clones by J.K.C. Knowles and T.T. Teeri. The authors wish to thank R.F.M. van Gorcom, F. Bleichrodt and P. Crowley for critical reading of the manuscript.
M.: Glyceraldehyde
3 phosphate
In Boyer, P.D. (Ed.), The Enzymes, Press, New York,
J.P. and Holland,
de-
Vol. 13C.
1976, pp. l-49.
M.J.: Structural
comparison
of two
yeast glyceraldehyde-3-phosphate
de-
nontandemly
repeated
hydrogenase
genes. J. Biol. Chem. 255 (1980) 2596-2605.
Hynes, M.J., Corrick,
C.M. and King, J.A.: Isolation
their
use in the analysis
nlutations.
ofgenomic
the umdS gene of Aspergillus nidu/ans and
clones containing
of the structural
and regulatory
Mol. Ceil. Biol. 3 (1983) 1430-1439.
Johanningmeier,
LJ., Bodner, U. and Wildner,
G.F.: A new muta-
tion in the gene coding for the herbicide
binding
protein
in
Chlamydomonas. FEBS Lett. 211 (1987) 221-224. Kinnaird,
J.H. and Fincham,
sequence
important
for gene expression
in tila-
J.L. and Hall, B.D.: Codon selection
in yeast. J. Biol.
Chem. 257 (1982) 3026-3031. Branlant,
G. and
Branlant,
C.: Nucleotide
domain
Biochem.
sequence
evolutionary
of the
plasmid
A.M.
Donahue,
T.F.:
and expression
of an Escherichia coli
3 phosphate
.I.
200 (1953) 479-492.
a fast and
DNA. DNA 4 (1985)
Sequence
features
associated
yeast -
a review. Gene 59 (1987) 1-18.
with translational
and
initiator
structural regions
dehydrogenase.
T., Fritsch,
A Laboratory Spring
Harbor,
Yeast protein.
E.F. and Sambrook,
Manual.
by cukaryotic
J.: Molecular
Cold Spring Harbor
I.E., Unkles, S.. Kinghorn,
using the A. niger 460-46 1.
II. J. Biol. Chem. Cloning.
Laboratory,
Cold
IVY, 1982.
den Hondel, C.A.M.J.J.: in
flanking the AUG
translation
Cell 44 (1986) 283-292.
Eur.
sequencing:
define a sequence modulates
Krebs, E.G., Rafter, G.W. and Junge, J.M.: Yeast glyceraldehyde
Mattern, and
that
of
165-l 70. Cigan,
ribosomes.
Maniatis,
for sequencing
codon
of
150 (1985) 61-66.
simple method
M.: Point mutations
domain
dehydrogenase.
P.H.: Supercoil
markers
behaviour
and of the catalytic
D-glyceraldehyde-3-phosphate Chen, E.Y. and Seeburg,
Kozak,
initiation
Escherichia coligap gene. Different the NAD * -binding
nant selection
IucZ fusion gene. Gene 62 (1988) 127-134.
fungi. Yeast 2 (1986) 229-236.
Bennetzen,
nucleotide
of Penicillium chry.wgenum using domi-
H.: Transformation D.J.: Sequences
mentous
The complete
glutamate dehydrogenase) gene. Gene 26 ( 1983) 253-260. Kolar, M., Punt, P.J., Van den Handel, C.A.M.J.J. and Schwab,
REFERENCES Ballance,
J.R.S.:
Neurosporu crassa urn (NADP-specific
of the
J.R., Pouwels,
Transformation
P.H. and Van
ofAspergiNus or,vzae
pyrC gene. Mol. Gen. Genet. 210 (1987)
51
May,G.S.,Tsang,
M.L.-S., Smith, H., Fidel, S. and Morris, N.R.:
Aspergillus niduluns fi-tubulin
genes are unusually
Gene 55 (1987) 231-243. McAlister,
yeast G.L.,
sequence
M.J.: Differential
of the
introns.
dehydrogenase
P.J. and Parker,
triosephosphate
J., Wierenga,
tandemly
Nucleotide
isomerase
gene
Acad.
from
for a differential
Mullaney,
loss of
DNA-mediated Microbial.
K.A., Misset,
O., Van
identical
genes
F.R.:
for the glycosomal in Trypanosoma
dehydrogenase
P.J. and Van den
Hondel,
C.A.M.J.J.:
of Aspergillus ficuum. Appl.
transformation
Biotechnol.
(1988) in press,
Orr, W.C. and Timberlake,
W.E.: Clustering
of spore
genes in Aspergilius nidulans. Proc. Natl. Acad.
specific
Sci. USA 79
(1982) 5976-5980. Penttill,
H.,
J.: A versatile
lulolytic
tilamentous
Ratto,
M.,
transformation
Salminer, system
E. and
for the cel-
Trichoderma reesei. Gene
fungus
61
(1987) 155-164. Pichersky,
E., Gottlieb,
L.D. and Hess, J.F.: Nucleotide
Mol. Gen. Genet. M.,
Panabieres,
sequence
gene of Escherichia coli.
isomerase
195 (1984) 314-320.
Blanchard,
J.M.,
F., El Sabouty,
Posttranscriptional
Marty,
S., Fort,
regulation
phate dehydrogenase
of
Dani,
C.,
P. and Jeanteur,
L.,
P.:
in rat tissues.
Nucleic
termination
Annu.
and the regulation
Rev. Biochem.
of gene
L.J., MacDonald,
K.D.
5 (1953) 141-238. in eukaryotic
G.G.:
3’ Non-coding
messenger
region
RNA. Nature
se-
263 (1976)
211-214. Van den Hondel, on
the
C.A.M.J.J.:
M.A., Pouwels,
of Aspergillus
Transformation
hygromycin
B
P.H. and
resistance
marker
from
Escherichia coli. Gene 56 (1987) 117-124. Saloheimo, Stahlberg,
M.,
Lehtovaara,
J., Johansson,
Tomme, P. and Knowles,
P.,
Penttila,
G., Petterson,
M.,
Sanger, F., Nicklen, chain terminating
genases. Skariynski,
T.T., M.,
J.K.C.: EGIII, a new endoglucanase
S. and Coulson, inhibitors.
and
glycer-
Proc.
Schwartz,
Natl.
R.J.:
glyceraldehyde
Intron
phosphate
313 (1985b) 498-500. engineering
of the chicken
in the pre-
triosephosphate
iso-
gene. Mol. Cell. Biol. 5 (1985) 3497-3506.
Tani, K., Singer-Sam,
J., Munns,
cloning
and structure
human
phosphoglycerate
Teeri, T.T., Kumar,
of cDNA
frequency
cloning
Anal. Biochem. and
kinase.
libraries
gene for
P. and Knowles, by blunt-end
of long cDNA’s
J.: Con-
ligation:
high-
from tilamentous
fungi.
164 (1987) 60-67. Kao,
T.-H.,
Reece,
characterization
aldehyde-3-phosphate complexity
A.: Molecular
processed
Gene 35 (1985) 1 l-18.
V., Lehtovaara,
struction
Isolation
M. and Yoshida.
of an autosomal
K.S. and Wu, R.:
of rat
and
dehydrogenase
and molecular
evolution
human
glycer-
cDNA’s:
genomic
of the gene.
Nucleic
Tso, J.Y., Sun, X.-H. and Wu, R.: Structure
Van Gorcom, Hondel,
R.F.M.,
Punt,
C.A.M.J.J.:
P.J., Pouwels,
P.H. and Van den
A system for the analysis
Van Hartingsveldt,
W., Mattern,
homologous
of expression
I.E., Van Zeyl, C.M.J., Pouwels, C.A.M.J.J.:
Walker,
J.E., Carne,
Harris,
A.F.,
M.J.,
Bridgen,
J.1.: D-Glyceraldehyde-3-phosphate amino acid sequence
J.E., Wonacott,
of an
206 (1987) 71-75.
Runswick,
dehydrogenase:
from Bacillus
108 (1980a)
A.J. and Harris,
I. and
dehydrogenase:
of the enzyme
stearothermophilus. Eur. J. Biochem. Walker,
Development
system for Aspergillus niger based
tranformation
549-565.
J.1.: D-Glyceraldehyde-
complete
aminoacid
sequence
from Thermus aquaricus. Eur. J. Biochem.
of the enzyme
108
(1980b) 581-586. Ward,
M. and Turner,
G.: The ATP synthase
Genet.
of the
subunit
and transcription.
9 gene of Mol. Gen.
205 (1986) 331-338.
Yanisch-Perron,
C., Vieira, J. and Messing,
M13mp18
and host strains:
and
with
Yarbrough,
Sci. USA 74
Klass,
M.R. and Hecht,
phate
dehydrogenase
H.M.: Evidence primary
glyceraldehyde-3-phosphate
in favor
structure
pUC19
and
P.C.E. and Wonacott,
P.O., Hayden.
J.: Improved
nucleotide
vectors.
M.A., Dunn.
Gene
Ml3
sequences 33 (1985)
gene family in the nematode
rhabditis elegans: isolation
and characterization
genes. Biochim.
Acta 908 (1987) 21-33.
Communicated A.J.: Structure
of
L.A.. Vermersch,
Biophys.
P.S.,
R.M.: The glyceraldehyde-3-phos-
dehydro-
Cell 47 (1986) 73-80. T., Moody,
dehy-
genes. J. Biol. Chem. 260 (1985b) 8220-8228.
103-l 19.
A.R.: DNA sequencing
origin of chloroplasts:
of tobacco
of two unlinked
Drosophila melanogaster glyceraldehyde-3-phosphate
phage cloning vectors
of both gene
Proc. Natl. Acad.
(1977) 5463-5467. Shih, M.-C., Lazar, G. and Goodman, evolution
Teeri,
G., Claeyssens,
Gene 63 (1988) 11-21.
of the symbiotic
gene.
W.: Genetic
Aspergilius nidulans: sequence
from Trichoderma reesei: the characterization and enzyme.
and
gene. Nature
3-phosphate
Punt, P.J., Oliver, R.P., Dingemanse, based
K.N.
structure
complete
N.J. and Brownlee,
quences
Cambrian:
T.M.
1628-1632.
of chicken
D. and Gilbert,
Kuo,
of the chicken
on the pyrG gene. Mol. Gen. Genet.
55 (1986) 339-372.
and Bufton, A.W.J.: The genetics ofAspergiilus nidulans. Adv. Genet.
Rothblum, evolution
P.H. and Van de Handel,
G., Roper, J.A., Hemmons,
Proudfoot,
Straus,
M.C.,
signals in Aspergillus. Gene 48 (1986) 211-217.
glyceraldehyde-3-phos-
gene expression
Platt, T.: Transcription expression.
E.M.,
dependent
drogenase
Acids Res. 12 (1984) 6951-6963.
Pontecorvo,
Stone,
Alevy, sequence
Acids Res. 13 (1985a) 2485-2502.
of the triosephosphate Piechaczyk,
J. Mol. Biol. 193
dehydrogenase
Tso, J.Y., Sun, X.-H.,
M., Nevalainen,
Knowles,
K.N.,
R.J.: Complete
Sci. USA 82 (1985a)
merase
J. 5 (1986) 1049-1056.
E.J., Punt,
Rothblum,
dehydrogenase
A., Osinga,
glyceraldehyde-phosphate hrucei. EMBO
E.M.,
aldehyde-3-phosphate M.L.:
R.K., Borst, P. and Opperdoes,
linked
Stone,
Schwartz,
Cell 46 (1986) 143-147.
P.A.M., Poliszczak,
Beeumen, Two
of the
260 (1985) 15019-15027.
O’Hara,
Aspergillus nidulans. Implications Michels,
expression
glyceraldehyde-3-phosphate
genes. J. Bacterial. McKnight,
from Bacil-
dehydrogenase
(1987) 171-187.
L. and Holland,
three
hologlyceraldehyde-3-phosphate
lus stearothermophilus at 1.8 A resolution.
divergent.
by J.K.C. Knowles.
Caeno-
of one of the