Vol. 157, No. 3, 1988
BIOCHEMICAL AND 81OPHYSICAL RESEARCH COMMUNICATIONS Pages 937-943
December 30, 1988
HUMAN
HEXOKINASE:
SEQUENCES HALVES
Shigeo Nishi,
OF AMINO-
ARE
AND
CARBOXYL-TERMINAL
HOMOLOGOUS
Susumu Seino and Graeme I. Bell
Howard Hughes Medical
Institute,
Departments of B i o c h e m i s t r y and M o l e c u l a r Biology and M e d i c i n e The U n i v e r s i t y of Chicago 5841 S. M a r y l a n d Ave., Chicago,
IL
Box 391
60637
Received September 27, 1988
SUMMARY cDNA clones encoding human hexokinase have been isolated from an adult kidney library. A n a l y s i s of this 917 amino acid p r o t e i n (Mr = 102,519) indicates that the sequences of the NH 2and COOH-terminal halves, corresponding to the r e g u l a t o r y and catalytic domains, respectively, are homologous; and that eukaryotic h e x o k i n a s e s evolved by d u p l i c a t i o n of a gene e n c o d i n g a p r o t e i n of "450 amino acids. The C O O H - t e r m i n a l half of the p r o t e i n created by this gene d u P l i c a t i o n retained the glucose b i n d i n g site and glucose p h o s p h o r y l a t i n g activity while the substrate binding sites of the N H 2 - t e r m i n a l half evolved into a new a l l o s t e r i c effector site. ©1988AcademicPress,lnc.
The ~ontrol t ere
enzymatic point
only and
in glycolysis.
of
glucose
In humans,
I,
II,
III
and
is
a
principal
rats and other mammals,
is a family of glucose p h o s p h o r y l a t i n g
hexokinase have
phosphorylation
glucokinase
enzymes,
(1-3).
The
designated hexokinases
a m o l e c u l a r mass of ~i00,000 daltons whereas g l u c o k i n a s e ~50,000 tissue
regulated
protein
disorder.
different
in in
diabetes its
or As
phosphorylating
In
an
mellitus,
abnormal
perhaps a
addition,
mechanisms.
metabolism
having
development
Each enzyme has unique kinetic properties
distribution. by
perturbed involved
daltons.
is
first enzymes,
to
we
As
we
are
determine
sequence
influence step
the
in
glucose
the
each
is
metabolism
is
examining if
the
could
of
key
contribute
the
isolated
enzymes
synthesis
pathophysiology
examining
have
activity
role
and
to of
of
of
a
the this
glucose
characterized
normal adult kidney cDNA clones encoding human hexokinase.
937
0006-291 X/88 $1.50 Copyright © 1988 by Academic Press, Inc. All rights of reproduction in any form reserved.
Vol. 157, No. 3, 1988
BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS
MATERIALS
AND METHODS
General Methods. Standard procedures were carried out as d e s c r i b e d in M a n i a t i s et al. (4). D N A s e q u e n c i n g w a s d o n e b y the dideoxynucleotide-chain-termination procedure (5) after subcloning appropriate DNA fragments into M l 3 m p l 8 or M l 3 m p l 9 . T h e s e q u e n c e s of b o t h s t r a n d s a n d a c r o s s all r e s t r i c t i o n sites were determined.
c D N A Cloning. O n e m i l l i o n p h a g e f r o m an a d u l t h u m a n k i d n e y c D N A library having inserts >2 kbp and consisting of "250,000 different clones (6) w e r e screened with a 32p-labeled EcoRI fragment encoding nucleotides 1-2114 [the n u m b e r i n g is from S c h w a b and W i l s o n (7)] of t h e c a t a l y t i c d o m a i n of a rat b r a i n hexokinase I cDNA (prHKl-l, unpublished). This probe was i s o l a t e d f r o m a rat b r a i n c D N A l i b r a r y (Clontech, RLI002, Palo Alto, CA) u s i n g s p e c i f i c o l i g o n u c l e o t i d e p r o b e s b a s e d u p o n the c D N A s e q u e n c e r e p o r t e d b y S c h w a b and W i l s o n (7). P r i o r to the publication of t h e p a r t i a l r a t h e x o k i n a s e I c D N A sequence, we were attempting to isolate hexokinase cDNA clones using oligonucleotide probes based upon the fragmentary amino acid sequence that was available (8) b u t w i t h o u t success. T h e rat b r a i n c D N A p r o b e w a s h y b r i d i z e d to the h u m a n c D N A l i b r a r y u s i n g low stringency conditions: 37°C; 25% formamide, 5 X SSC, 2 X D e n h a r d t ' s solution, 2 0 m M s o d i u m p h o s p h a t e buffer, p H 6.5, 0.1% N a D o d S O 4 , i00 # g / m l of s o n i c a t e d and d e n a t u r e d s a l m o n t e s t e s DNA, 10% d e x t r a n s u l f a t e a n d 1 X 106 c p m / m l of probe, t h e f i l t e r s w e r e w a s h e d in 2 X SSC and 0.1% N a D o d S O 4 at r o o m t e m p e r a t u r e a n d t h e n for o n e h o u r at 4 0 ° C b e f o r e a u t o r a d i o g r a p h y .
RESULTS Sequence in
the
of H u m a n adult
hexokinase of
these
complete 358
human
i)
kidney
I c D N A probe. IhHEX-12
and
sequence
nucleotides
(Fig.
Hexokinase.
of of
contained
nucleotides
encoding
well
bp
as
81
respectively. 93.4% (7), 85.3%
similarity we believe nucleotide
library
Eighteen -15
a
was
765
with
large
amino
bp
of
largest
composite
protein
have
is h u m a n between
phage
the
further
inserts. well
3,598
as
bp
frame
(M r
89.2%
the
5'
2,751 as
sequence, and
I sequence I
(there
corresponding
is
coding
Figure I. Composite nucleotide sequence of human kidney hexokinase cDNA and predicted amino acid sequence of the protein. The number of the nucleotide at the end of each line is indicated. The insert in IhHEX-15 includes nucleotides 38-3598. The 5'-EcoRI fragment of IhHEX-12, nucleotides 1-358, was also sequenced.
938
and
sequence of
identity
hexokinase hexokinase
rat
The
= 102,519)
3'-untranslated
rat b r a i n
identity
as
open-reading
467-917
this protein
sequence
with
characterized
5'-and
the partial
1 X 106
hybridized
the
acid
acids
of the
determined
The
single
a 917
were
contained
lhHEX-12.
amino
that
cDNA
IhHEX-15
and As
Seventy-nine
Vol. 157, No. 3, 1988
BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS i Met Ile Ala A1a Gln Leu L e u Ala Tyr ATG A?C CCC GCG CAG CTC CTG GCC TAT
108
10 20 30 Tyr Phe ThE GIu Leu Lye Asp Asp Gln Val Lys Lys Ile Asp bys Tyr Leu Tyr Ale Met Arg Leu SeE Asp G1u ThE Leu Ile Asp Ile TAC TTC ACG GAG CTG AAG GAT GAC CAG GTC AAA AAG ATT GAC AAG TAT CTG TAT GCC ATG CGG CTC TCC GAT GAA ACT CTC ATA GAT ATC
198
40 50 00 Met ThE Arg Phe Azg Lys Glu Met Lys Asn Gly Leu SeE Arg ASp Phe ASh Pro ThE Ala ThE Val Lys Met LeU Pro ThE Phe Val Arg ATG ACT CGC TTC AGG AAG GAG ATG AAG AAT GGC CTC TCC CGG GAT TTT AAT CCA ACA GCC ACA GTC AAG ATG TTG CCA ACA TTC GTA AGG
208
70 80 90 Ser Ile Pro ASp Gly SeE Glu LyS Gly Asp Phe Ile Ala Leu ASp Leu Gly Gly Ser SeE Phe Arg Ile Leu Arg Val Gln val ASh His TCC ATT CCT GAT GGC TCT GAA AAG GGA GAT TTC ATT GCC CTG GAT CTT GGT GGG TCT TCC TTT CGA ATT CTG CGG GTG CAA GTG AAT CAT
378
IOO Ii0 120 Glu Lys Ash Gln Asn Val His Met G%u SeE Glu Val Tyr Asp ThE Pro GIu ASh Ile Val His Gly SeE Gly SeE Gln Leu Phe ASp His GAG AAA AAC CAG AAT GTT CAC ATG GHG TCC GAG GTT TAT GAC ACC CCA GAG AAC ATC GTG CAC GGC AGT GGA AGC CAG CTT TTT GAT CAT
468
130 148 150 Val Ala GIU Cys Leu G1y Asp Phe Met GIu Lys A~g Lys Ile Lys Asp LyS Lys Leu Pro Val Gly Phe Thr Phe SeE Phe P~o Cys Gln GTT GCT GAG TGC CTG GGA GAT TTC ATG GAG AAA AGG AAG ATC AAG GAC AAG HAG TTA CCT GTG GGA TTC ACG TTT TCT TTT CCT TGC CAA
558
160 170 180 Gln SeE Lys Ile Asp Glu Ala Ile Leu Ile ThE T r p ThE Lys Arg Phe 5ys Ala SeE Gly Val GI0 Gly Ala ASp Val Val Lys Leu Leu CAA TCC AAH ATA GAT GAG GCC ATC CTG ATC HCC TGG ACA AAG CGA TTT AHA GCG AGC GGA GTG GAH GGA GCA GAT GTG GTC AAA CTG CTT
648
190 200 210 ASh Lys Ala 11e Lye Lys Arg GIy ASp Tyr ASp Ala ASh lle Val Ala Val Val ASh Asp Thr val Gly ThE Met Met ThE Cys Gly Tyr AAC AAA GCC ATC AAA AAG CGH GGG GAC TAT GAT GCC AAC ATC GTA GCT GTG GTG AAT GAC HCA GTG GGC ACC ATG ATG ACC TGT GGC TAT
739
220 230 240 Asp Asp Gin His Cys GIu Val Gly Leu Ile Ile GIy Thr Gly Thr ASh Ala CyS Tyr Met GIu GIu Leu Arg His Ile Asp Leu Val Glu GAC GAC CAG CAC TGT GAA GTC GGC CTG ATC ATC GGC ACT GGC ACC HAT GCT TGC TAC ATG GAG GAA CTG AGG CAC ATT GAT CTG GTG GAA
828
250 260 270 Gly Asp GIu Gly Arg Met Cys Ile ASh ThE Glu Trp Gly Ala Phe Gly Asp ASp Gly SeE Leu GIu Asp Ile AEg Th( Glu Phe Asp AE 0 GGA GAC GAG GGG AGG ATG TGT ATC HAT ACA G A A TGG GGA GCC TTT GGA GAC GAT GGA TCA TTA GAA GAC ATC CGG ACA GAG TTT GAC AGG
918
280 290 300 GIu Ile Asp Arg Gly SeE Leu ASh Pro Gly Lye G l n Leu Phe Glu Lys Met Val Ser Gly Met Tyr Leu Gly GIu Leu Val AE 0 Leu Ile GAG ATA GAC CGG GGA TCC CTC AAC CCT GGA AAA CAG CTG TTT GAG AAG ATG GTC AGT GGC ATG TAC TTG GGA GAG CTG GTT CGA CTG ATC
1008
310 320 330 Leu vaZ Lys Met Ala LFS Glu Gly Leu Heu Phe Glu Gly Arg Ile ThE PEO Glu Leu Leu ThE Arg Gly Dys Phe Asn ThE SeE Asp Val CTA GTC AAG ATG GCC AAG GAG GGC CTC TTA TTT GAA GGG CGG ATC ACC CCG GAG CTG CTC ACC CGA GGG AAG T T T AAC HCC AGT GAT GTG
1098
340 390 360 SeE Ala Ile GIu Lys ASh Lys GIu Gly Leu His Ash Ala Lye Glu Ile Leu ThE Arg Leu Gly Val GIu Pro SeE ASp Asp A s p Cys Val TCA GCC ATC GAA AAG AAT AAG GAA GGC CTC CAC AAT GCC AAA GAA ATC CTG ACC CGC CTG GGA GTG GAG CCG TCC GAT GAT GAC TGT GTC
1188
•CGCCGGAGGAC•ACGGCTCGCCAGGG•TGCGGAGGACCGA••GTCC••ACGC•TG•CGC••CGcGACCC•GA•CGCCAGC
g~ Val
390
990
Gln His Val Cys ThE Ile Val SeE Phe Arg SeE Ala ASh Leu Val Ala Ala Thr Leu G1y Ale Ile Leu Asn keg Leu Ar 9 Asp TCA GTC CAG CAC GTT TGC ACC ATT GTC TCA TTT CGC TCA GCC AAC TTG GTG GCT GCC ACA CTG GGC GCC ATC TTG AAC CGC CTG CGT GAT
1278
400 410 420 Asn LyS Gly ThE P~o Arg Leu Arg Thr Thr Val Gly Val Asp Gly SeE Leu Tyr LyS ThE His Pro Gln TyE Ser Arg Arg Phe His Hys AAC AAG GGC ACA CCC AGG CTG CGG ACC ACG GTT GGT GTC GAC GGA TCT CTT TAC AAG ACG CAC CCA CAG TAT TCC CGG CGT TTC CAC HAG
1368
430 440 450 ThE LeO Arg Ar 9 Leu Val Pro ASp SeE ASp Val Arg Phe Leu Leu SeE Glu Ser Gly Ser Gly Lys Gly Ala Ala Met Val ThE Ala Val ACT CTA AGG CGC TTG GTG CCA GAC TCC GAT GTG CGC TTC CTC CTC TCG GAG AGT GGC AGC GGC AAG GGG GCT GCC ATG GTG ACG GCG GTG
1458
460 470 480 Ala Tyr Arg Leu Ala Glu Gln His Arg Gln Ile GIu GIu ThE Leu Ala 81s Phe His Leo Thr Lys Asp Met Leu Leu Glu Val Lys Lys GCC TAC CGC TTG GCC GAG CAG CAC CGG CAG ATA GAG GAG ACC CTG GCT CAT TTC CAC CTC ACC AAA GAC ATG CTG CTG GAG GTG AAG AAG
1548
490 500 530 Arg Met Arg Ale GIU Met Glu Leu Gly Leu Arg Lys Gin ThE Hie Hsn ASh Ala Val Val Lys Set Leu P~o Ser Phe Val AEg Arg ThE AGG ATG CGG GCC GAG ATG GAG CTG GGG CTG AGG AAG CAG ACG CAC AAC AAT GCC GTG GTT AAG ATG CTG CCC TCC TTC GTC CGG AGA ACT
1638
520 530 540 Pro ASp Gly Thr GIU ASh Gly ASp Phe Leo Ala Leo Asp Leu Gly Gly ThE ASh Phe Arg Val Heu Leo Val Lys Ile Arg SeE Gly bys CCC GAC GGG ACC GAG AAT GGT GAC TTC TTG CCC CTG GAT CTT GGA GGA ACC AAT TTC CGT GTG CTG CTG GTG AAA ATC CGT AGT GGG AAA
1728
SSO 560 570 Hys Arg ThE Val GlU Met His ASh Lys Ile Tyr Ala Ile P~O Ile Glu Iie Met Gln Gly ThE Gly GIu GIU Leu Phe ASp His Ile Val AAG AGA ACG GTG GAA ATG CAC AAC AAG ATC TAC GCC ATT CCT ATT GAA ATC ATG CAG GGC ACT GGG GAA GAG CTG TTT GAT CAC ATT GTC
1819
500 590 600 Ser Cys Ile Ser Asp Phe Leu A S p Tyr Met GIy Ile Lys Gly Pro Ar 0 Met Pro Leu Gly Phe ThE Phe Ser Phe Pro Cys Gln Gln Thr TCC TGC ATC TCT GAC TTC TTG GAC TAC ATG GGG ATC AAA GGC CCC AGG ATG CCT CTG GGC TTC ACG TTC TCA TTT CCC TGC CAG CAG ACG
1908
610 630 630 SeE Leu Asp Ala Gly lle Leu Ile ThE Trp Thr Lys Gly Phe Lys Ala Thr Asp Cys Val Gly Sls Asp Val Val Thr Leu Leu Arg Asp AGT CTG GAC GCG GGA ATC TTG ATC ACG TGG ACA AAG GGT TTT AAG GCA ACA GAC TGC GTG GCC CAC GAT GTA GTC ACC T T A CTA AGG GAT
1998
640 650 660 Ala Ile Lys Arg Ar 0 Glu Glu Phe Asp Leu ASp Val Val Ala Val Val ASh Asp ThE Val Gly ThE Met Met Thr Cys Ala Tyr Glu G1u GCG ATA AAA AGG AGA GAG GAA TTT GAC CTG GAC GTG GTG GCT GTG GTC AAC GAC ACA GTG GGC ACC ATG ATG ACC TGT GCT TAT GAG GAG
2088
680 690 Th~ Cys Glu val Gly Leu Ile Val Gly Thr Gly SeE ASh Ala Cys Tyr Met Glu GIU Met Lys ASh Val Glu Met Val GIu Gly Asp CCC ACC TGT GAG GTT GGA CTC ATT GTT GGG ACC GGC AGC AAT GCC TGC TAC ATG GAG GAG ATG AAG AAC GTG GAG ATG GTG GAG GGG GAC
2178
700 710 720 Gln Gly Gln Met Cys Ile ASh Met GIu Trp Gly Ala Phe GIy Asp ASh Gly Cys Leu Asp Asp Ile Arg Thr His Tyr A s p Arg Leu Val CAG GGG CAG ATG TGC ATC AAC ATG GAG TGG GGG GCC TTT GGG GAC AAC GGG TGT CTG GAT GAT ATC AGG ACA CAC TAC GAC AGA CTG GTG
2288
670
Fro
~sn30
740 750 GIu Tyr 8er Leu ASh Ala Gly Lys Gln Arg Tyr Glu Lys Met Ile SeE Gly Met Tyr Leu Gly GIu 11e Val Arg Ash Ile Leu Ile AAC GAA TAT TCC CTA AAT GCT GGG AAA CAA AGG TAT GAG AAG ATG ATC AGT GGT ATG TAC CTG GGT GAA ATC GTC CGC AAC ATC TTA ATC
2358
760 770 780 ASp Phe Thr bys Lys Gly Phe Leu Phe Arg GIy Gln Ile SeE Glu ThE Met Lys ThE Arg Gly i l e Phe Glu Thr LyS Phe Leu SeE Gin GAC TTC ACC AAG AAG GGA TTC CTC TTC CGA GGG CAG ATC TCT GAG ACG ATG AAG ACC CGG GGC ATC TTT GAG ACC AAG TTT CTC TCT CAG
2448
790 800 810 ile Glu Ser ASp Arg Leu Ala Leu Leu Gln Val Arg Ala Ile Leo Gln Gln Leu Gly Leu Asn SeE ThE Cys ASp Asp Ser Ile Leu Val ATC GAG AGT GAC CGA TTA GCA CTG CTC CAG GTC CGG GCT ATC CTC CAG CAG CTA GGT CTG AAT AGC ACC TGC GAT GAC AGT ATC CTC GTC
2538
820 830 840 Lys ThE Val Cys Gly Val Val Se~ Arg Arg Ala Ala Gln Leu Cys Gly A1a Gly Met Ala Ala Val Val Asp Lys Ile Arg Glu ASh Arg AAG ACA GTG TGC GGG GTG GTG TCC AGG AGG GCC GCA CAG CTG TGT GGC GCA GGC ATG GCT GCG GTT GTG GAT HAG ATC CGC GAG AAC AGA
2628
850 860 870 GIy Leu Asp Arg Leu ASh Val Th~ Val G1y Val Asp Gly Thr Heu Tyr LyS Leo His P~o His Phe Ser Arg lle Met His Gln Thr Val GGA CTG GAC CGT CTG AAT GTG ACT GTG GGA GTG GAC GGG ACA CTC TAC AAG CTT CAT CCA CAC TTC TCC AGA ATC ATG CAC CAG ACG GTG
2718
880 890 900 Lys GIu Leu Ser Pro Lys Cys Ash Val Ser Phe Leu Leu SeE GIu A s p Gly SeE Gly Lys G1y Ala Ala Leu Ile ThE Ala Val Gly Val HAG GAA CTG TCA CCA AAA TGT AAC GTG TCC TTC CTC CTG TCT GAG GAT GGC AGC GGC AAG GGG GCC GCC CTC ATC ACG GCC GTG GGC GTG
2808
910 917 Arg Leu Arg ThE Glu Ala SeE SeE OC CGG TTA CGC ACA GAG GCA AGC AGC TAA GA~T~GGGAT~CCAG~CTA~T~CTCT~A~cA~TTCTCT~TT~AAG~GG~GA~C~CTA~CTc~AG~GAGT~G~TGG
2919
GAGACGCTGGCGCCAOGC-CCTGCCGGCGCGGGGAGG~AAGC~.~AATCCAACT~ATGG~ATATATTGTAGGGTACAGAATAGAG CGTGTGCTGTTGATAATATC~TCACC CGGATCCCr C 3039 CTCA CTTG CCCTG CCACTTTG CATGGTTTGATTTTG AC CTG GTCCCCCACG TGTGAAGTGTAGTG GCATC CATTTCT ~
TA~ CA~ATC
CAACAGAGTTA~TAT~
G GAAAAT CACAC CACCTGAC AGGC CTTC TGGC~CTCC AAAC~CCCATCCTTGGGGTTCCCCCTC CCTGTGTGAAATGTATTATCA C C AGCAGA CA C T G C C G G ~ C T C C C ~ C TGCCTGAAGGCGAGTGTGGGCATAGCATTAGCTG
CT TCCT CCCCT CCTG GCAC C C A C T G T G C - C C T G G C A T C G C A T C G T G ~ G T G T C ~
CC A C ~
CTAGC CGCGTGTGAC AGT CTTGCATTCTGTTTGTCT C G T G G G G G G A G G T G G A C A G T C C T G C G G A A A T G T G T C T T G T C T T C C A T T T G G A T A A A ~ A C C ~ C CTGGAATTTC CCAC CGCTTTG TGAGC CGTG TCGTATGACCT AGTAAACTTTGTAC CAATTC A ~ A A A A A A ~ A A
939
GCTG GAGAT 3159 ~GGGGCAC
3279
GTGTGTC C G ~ G A A C CAG TC 3398 C~C~AC~T~
C ATC A 3519 3588
Vol. 157, No. 3, 1988
BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS
i MIAAQLLAYYFTELKDDQVKKIDKYLYAMRLSDETLIDIMTRFRKEMKNGLSRDFNPTATVKMLPTFVRSIP
I* I *I * *I I * l l l * * ** ** I * *****I*** * 464 - AEQHRQIEETLAHFHLTKDMLLEVKKRMRAEMELGLRKQTHNNAVVKMLPSFVRRTP 73 DGSEKGDFIALDLGGSSFRILRVQVNHEKNQNVHMESEVYDTPENIVHGSGSQLFDHVAECLGDFMEKRKIK •*I* ***I******I **I* * I * * * I* * *I *I* ****I *I **II ** 521
DGTENGDFLALDLGGTNFRVLLVKIRSGKKRTVEMHNKIYAIPIEIMQGTGEELFDHIVSCISDFLDYMGIK
145 D K K L P V G F T F S F P C Q Q S K I D E A I L I T W T K R F K A S G V E G A D V V K L L N K A I K K R G D Y D A N I V A V V N D T V G T M M T
II*I**********I I* ******* * * * I * *** ** ***I* II* I************* 593 GPRMPLGFTFSFPCQQTSLDAGILITWTKGFKATDCVGHDVVTLLRDAIKRREEFDLDVVAVVNDTVGTMMT 217 CGYDDQHCEVGLIIGTGTNACYMEELRHIDLVEGDEGRMCINTEWGAFGDDGSLEDIRTEFDREIDRGSLNP • *II ******I***I*******I[ III**** * **** ******* * *I**** I** I *** 665
CAYEEPTCEVGLIVGTGSNACYMEEMKNVEMVEGDQGQMCINMEWGAFGDNGCLDDIRTHYDRLVNEYSLNA
289
GKQLFEKMVSGMYLGELVRLILVKMAKEGLLFEGRITPELLTRGKFNTSDVSAIEKNKEGLHNAKEILTRLG
•** I***I*******I** **I * * * * * *I I *** * * I* ** I * I I ** ** 737 GKQRYEKMISGMYLGEIVRNILIDFTKKGFLFRGQISETMKTRGIFETKFLSQIESDRLALLQVRAILQQLG 361 VEPSDDDCVSVQHVCTIVSFRSANLVAATLGAILNRLRDNKGTPRLRTTVGVDGSLYKTHPQYSRRFHKTLR I I ** I * ** I** * *I* * I *II If*l'i* ** ******I*** ** I** * *II 809
LNSTCDDSILVKTVCGVVSRRAAQLCGAGMAAVVDKIRENRGLDRLNVTVGVDGTLYKLHPHFSRIMHQTVK
433
RLVPDSDVRFLLSESGSGKGAAMVTAVAYRL - 463 • * * ***** *******II*** ** ELSPKCNVSFLLSEDGSGKGAALITAVGVRLRTEASS
881
- 917
Figure 2. Comparison of the sequences of the NH 2- and COOHterminal halves of human kidney hexokinase. Amino acids are indicated by their single-letter abbreviations. The upper sequence is the NH2-terminal 463 amino acids and the lower sequence is the COOH-terminal 454 residues. Asterisks denote identical amino acids and vertical lines indicate chemically similar residues. The number of the amino acid at the beginning of each line is noted.
regions
of
the
NH2-terminus
cDNA
of
sequences).
this
direct
The
protein
is
also
of
rat
determined
by
Ile-(Ala,
Gln)-Ala-Leu-Leu-Ala-Tyr]
indicates
that
Sequences
of
It a
has
been
process
the
NH 2
readily
N H 2-
gene
and
homologous
slightly
more
to
confirms
identity
domain
respectively)
than
of with
of
similar
the
to
that
I
[Met-
hexokinase
this
halves
identity
and
also
of
is
between human
yeast
A
of
regulatory
940
B
domain
(30.8%
N H 2domain
There
and
(33.9%
(the
the
(10-12). B
and and
be
67.9%
enzyme Both
A
of
can
catalytic
hexokinases
hexokinase
respectively).
this
by
human
identity,
60.1%).
and
the
sequences
hexokinase
51.9%
COOH-terminal
hexokinases
evolved
of
The
human
is
Homologous.
have
Analysis
regions
the
are
hexokinases
There two
the
Hexokinase
suggestion.
identity
domain
brain
(1-3,7). this
2).
these
yeast
of
mammalian
duplication
nucleotide
regulatory
catalytic
that
between
very
(9);
COOH-Halves
(Fig.
sequence
NH2-terminus.
COOH-terminal
aligned
corresponding
are
and
sequence -
is t h e
suggested
similarity,
terminal
Met-i
of
hexokinase
sequencing
predicted
and
is the
35.1%, 29.0%,
Vol. 157, No. 3, 1988
BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS
DISCUSSION Mammalian of
~I00,000
hexokinases
daltons.
glucose,
they
glucose-6-PO 4 .
are
subject
not
to
allosteric
By
glucose-phosphorylating
do
of a single p o l y p e p t i d e
contrast,
enzyme,
allosteric
observations
led
other and
by
or
duplication.
yeast
investigators
hexokinase
Subsequently,
ancestral
fusion
function.
The
the
the
yeast
glucose-6-PO 4 . to
postulate
m a m m a l i a n h e x o k i n a s e s evolved from an ancestral glucokinase
by
mammalian
chains of -50,000 daltons
inhibition
several
inhibition the
glucokinase,
are single p o l y p e p t i d e
show
chain
In addition to binding and p h o s p h o r y l a t i n g
product,
hexokinases
consist
by
a
and
These
that
the
enzyme similar to
process
involving
gene
one of the two catalytic sites in the
protein
evolved
studies
of
to
Wilson
acquire
and
his
a
regulatory
colleagues
have
c o n f i r m e d many of the features of this model
for the structure of
mammalian
have
hexokinase
NH2-terminal domain
and
half
(3,7-9,
of
contains
rat
the
brain
and
of
the
enzyme
ATP
(13,14).
terminal
-i0 kDa,
binding
In
They
hexokinase
allosteric
and that the COOH-terminal portion
13-15).
site
the
addition,
they
or -80 amino acids,
of brain
is
for
hexokinase
to the
binding have
the
that
the
regulatory
glucose-6-PO 4
half of the molecule
containing
shown
(15)
is the catalytic sites
shown
functions
for
that
glucose the
NH 2-
in the reversible
outer m i t o c h o n d r i a l
membrane
(8). The
human
indicates acids
hexokinase
that
was
a
halves
suggests
that
interaction from
this
657,
708
one
of
binding I
and
residues
and 742)
two
yeast
(Ser 447)
155,
the
are
260
and
GSGKGA
residues
-450
amino
and
COOH-
in
which
Figure
involved
(7)
in
the
are also evident in
binding in
The
(residues
the
of
both
the (603,
sequence 448-453
in both domains, precede
2
membrane.
and catalytic
of human hexokinase. site,
of
conserved 294)
and
NH 2-
implicated
(7)
conserved and present acidic
those
model
mitochondrial
of h e x o k i n a s e
residues
209,
domains
ATP-binding
is also
site
four
the
presented
likely
features
hexokinase
(residues
the
are with
this
protein of
hexokinase
1-15
functional
a
alignment
hexokinase
yeast
putative
896-901)
human
confirms
encoding
The
alignment;
by
regulatory
the
of
of
other
glucose
region
duplicated.
terminal
Several
gene
sequence
of and
however,
putative
ATP-
in the catalytic domain of human and rat h e x o k i n a s e hexokinases
in the
A
and
B has
regulatory domain 941
been
replaced
by
of human hexokinase.
a
serine Schwab
Vol. 157, No. 3, 1988
and Wilson
BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS
(7) have
suggested
that this pair
might serve to orient the phosphate bound the
glucose.
Could
regulatory
substitutions
the
domain
side-chain
serine to
one
the
It
is
perhaps
phosphofructokinase in
the
and/or
duplication half
of
and
regulatory
of
and
new
of
into
It
seems may
sites
be
(phosphofructokinase)
allosteric
effector
molecules
are
a
this
to the
and
enzymes
allosteric of
gene
NH2-terminal domain,
(16)
the
evolved
process
efficient
enzymes
key
process
regulatory
monomeric
have
altered
which
whereas
an
the
hexokinase
subject
that
in both
of
of
acid
of this domain?
by
the
region
amino
both
phosphofructokinase
tetrameric substrates
and
evolved However,
divergence
effector
that
eukaryotes,
developed
half
the
evolution
glycolysis
divergence.
function.
duplication
higher
inhibition,
hexokinase
COOH-terminal
evolving
of
regulation
activation
interesting
in this
of
substrate binding and regulatory properties
residues
of ATP towards the
replacement
represent
contributing
of acidic
of
a
gene
mechanism
for
(hexokinase)
and
especially
structures
when similar
the to
for the enzyme.
ACKNOWLEDGEMENTS The assistance of Ms. Julie Dicig in the preparation of this manuscript is greatfully appreciated. This research was supported by the Howard Hughes Medical Institute.
REFERENCES i. 2. 3. 4. 5. 6. 7. 8. 9. i0.
Purich, D.L., Fromm, H.J. & Rudolph, F.B. (1973) Adv. in Enzymol. 35, 249-326 Ureta, T. (1982) Comp. Biochem. Physiol. 71B, 549-555 Wilson, J.E. (1980) Current Topics in Cellular Regulation 16, 1-44 Maniatis, T., Fritsch, E.F. & Sambrook, J. (1982) Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Press, Cold Spring Harbor, NY Sanger, F., Coulson, A.R., Barrell, B.G., Smith, A.J.H. & Roe, B.A. (1980) J. Mol. Biol. 143, 161-178 Bell, G.I., Fong, N.M., Stempien, M.M., Wormsted, M.A., Caput, D., Ku, L., Urdea, M.S., Rall, L.B. & SanchezPescador, R. (1986) Nucleic Acids Res. 14, 8427-8446 Schwab, D.A. & Wilson, J.E. (1988) J. Biol. Chem. 263, 3220-3224 Schirch, D.M. & Wilson, J.E. (1987) Arch. Biochem. Biophys. 257, 1-12 Polakis, P.G. & Wilson, J.E. (1985) Arch. Biochem. Biophys. 236, 328-337 Frohlich, K., Entian, K. & Mecke, D. (1985) Gene 36, 105iii
942
Vol. 157, No. 3, 1988
ii. 12. 13. 14. 15. 16.
BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS
Kopetzki, E., Entian, K. & Mecke, D. (1985) Gene 39, 95102 Stachelek, C., Stachelek, J., Swan, J., Botstein, D. & Konigsberg, W. (1986) Nucleic Acids Res. 14, 945-963 Nemat-Gorgani, M. & Wilson, J.E. (1986) Arch. Biochem. Biophys. 251, 97-103 Schirch, D.M. & Wilson, J.E. (1987) Arch. Biochem. Biophys. 254, 385-396 White, T.K. & Wilson, J.E. (1987) Arch. Biochem. Biophys. 259, 402-411 Poorman, R.A., Randolph, A., Kemp, R.G. & Heinrikson, R.L. (1984) Nature 309, 467-469
943