G’ene, 102 (1991) 249-254 0
1991 Elsevier
GENE
Science
Publishers
249
B.V. 0378-I 119/91/$03.50
04065
The cDNA sequence and primary structure of the chicken transferrin receptor (Recombinant
Elizabeth
DNA;
deduced
amino acid sequence;
M. Gerhardt a, Lee-Nien
evolutionary
conservation;
primer extension)
L. Chan *, Shuqian Jing b, Meiying Qib and Ian S. Trowbridgeb
a Department of Human Biological Chemistry and Genetics, University of Texas Medical Branch, Galveston, TX 77550-2774 (U.S.A.), bDepartment of Cancer Biology, The Salk Institute for Biological Studies, San Diego, CA 92186-5800 (U.S.A.) Tel. (619)453-4100 Received by S.T. Case: 16 November Accepted: 24 January 1991
and
1990
SUMMARY
Recombinant cDNA clones encoding the chicken transferrin receptor (cTR) have been isolated and sequenced. Comparison of the deduced primary structure of cTR with those of the human transferrin receptor (hTR) and mouse transferrin receptor (mTR) shows that their size, hydropathy profile, location of sites for posttranslational modifications, and domain organization are highly similar. The cytoplasmic domain of cTR contains the motif Tyr-Xaa-Arg-Phe (YXRF) that is the recognition signal for high-efficiency endocytosis of hTR. The cTR has several highly conserved regions within its extracellular domain, including those flanking the putative N-glycosylation sites. Overall, however, the extracellular domain of cTR is only 53 % identical to the extracellular domains of hTR and mTR. The cTR also lacks three of the six Cys residues found in the extracellular domains of the mammalian TRs. These differences can account for functional and structural properties that distinguish cTR and m~m~ian TRs.
INTRODUCTION
Iron is required for a variety of vital cellular functions and is transported into cells by means of TR-mediated endocytosis of the iron-transferrin complex (Huebers and Finch, 1987). TR expression is regulated in a feedback fashion by the level of intracellular iron (Harford et al., 1991). The hTR has been extensively studied and is a dimeric transmembrane glycoprotein consisting of two identical disulfidebonded subunits of 95 kDa (Omary and Trowbridge, 198 1; Schneider et al., 1982). Biochemical studies of mTR and cTR indicate that they have a similar structure also
Correspondenceto: Dr. L.-N. L. Ghan, Department Chemistry
and Genetics,
TX 775.50-2774 (U.S.A.) Abbreviations: plementary
aa, ammo
University
ofTexas
of Human Biological Medical Branch, Galveston,
Tel. (409)761-2761; acid(s);
to RNA; cTR, chicken
Fax (409)761-5159.
bp, base pair(s);
cDNA,
DNA com-
TR; hTR, human TR; kb, kilobase
or 1000 bp; mTR, mouse TR; nt, nucleotide(s); TR, transferrin receptor; TR, gene (cDNA) aa sequence motif Tyr-Xaa-Arg-Phe.
RBC, red blood cell(s); encoding
TR; YXRF,
the
(Trowbridge et al., 1982; Schmidt et al., 1985). The aa sequences of the hTR and mTR have been deduced from the nt sequences of their cDNAs (McClelland et al., 1984; Schneider et al., 1984; Stearne et al., 1985; Trowbridge et al., 1988). The cytoplasmic domain of the hTR has been implicated in high efficiency endocytosis (Rothenberger et al., 1987) and, recently, the aa sequence within the cytoplasmic domain that determines rapid internalization has been identified (Alvarez et al., 1990; Jing et al., 1990; McGraw and Maxfield, 1990; Collawn et al., 1990). To define other functionally significant regions in TR, comparisons of the TR aa sequence of different species can be useful since such regions are usually conserved through evolution. This situation is especially true for avian and mammalian TRs since they have diverged to the extent that important conserved sequences can be ‘highlighted’ against a background of dissimilar sequences. The nt sequence of the 3’ noncoding region of the cTR cDNA and the identification of two highly conserved regions in this region of TR mRNA which are implicated in the post-transcriptional regulation of cellular TR expression have been reported
250 CCGCGFAGGGGGCGGCGGAGTGTG~AGCGGCGGGCG~CCTTCAGTC~GTT~TGGGTACTCCTCCGG~GGCA
77
ATG FAT CAT GCC AGA GCA GCA TTG TCT MC TTG TTC AGC GTC GAG CCG ATG TCG TAC ACA CGT TTC AGC ATT GCT CGG CAA ACA GAT GGA 1 Mst Asp His Ala Arg Ala Ala Leu Ser Asn Leu Phe Ser Val Glu Pro Met Ser Tyr Thr Arg Phe Ser Ile Ala Arg Gin Thr Asp Gly
167
GAC AAC AGC CAC GTG GAG ATG A4G CTG TCT 31 Asp Asn Ser His Val Glu Met Lys Leu Ser * CAG CCA GAG AGA AAT GGC AAG AGA CTC TGC 61 Gin Pro Gin Arg Am Gly Lys Arg Leu CYS
GCT GAT GAT GAG GAA GGA GGG GAC ATT GA.4 HGG CCA GAG CAC ATG CAT GTC AGT ATG GCT Ala Asp Asp Glu Glu Gly Gly Asp Ile Glu Arg Pro Glu His Met Hxs Val Ser Met Ala
257
TTC TTG GTC ATT GCA GCT GTT CTC CTC CTT TTG ATT GGG Phe Leu Va 1 Ile Ala Ala Val Leu Leu Leu Leu Ile Gly
TTT CTT ATT GGC TAC TTG AGT Phe Leu Ile Gly Tyr Leu Ser
347
TAT CFT GM 91 Tyr Arg Gly
T& CYS
GAG ATA ACT CCT ACT GCG TCG TAC TTA GTG GAT GGT Glu Ile Thr Pro Thr Ala Ser Tyr Leu "al Asp Gly
437
GA.4 GGA ACT GTG GA.4 GAA GAG ATT CAA GGA CCG CCT GTC ATC TTC TGG CCT GAA CTC AAA GCC ATG CTG TCA ,UA AAG CTG TCA Gee AAG 121 Glu Gly Thr "al Glu Glu Glu Ile Gin Gly Pro Pro Val Ile Phe Trp Pro Glu Leu Lys Ala Met Leu Ser Lys Lys Leu Ser Ala Lys
52,
AAT CTT GTA GAC AK 151 Asn Leu Val Asp Asn
TTG AGG LPU Arg
TGG AGG GTA GGT GTG GAC TCC TTT GAG GCT GGT GAG GCT GA.4 GAT ACA AK ATG GCC ACC TAC ATT CAT Trp Arg Val Gly Vnl Asp Ser Phe Glu Ala Gly Glu Ala Glu Asp Thr Asn Met ALa Thr Tyr Ile HIS
617
GAG FAA TTC AGG AK 181 Glu GLu Phe Arg Asn
TTC TTG GAT AAA GTG TGG AX FAT GAA CAC TAT ATC AAG TTG CAA GTC AGA GGC AGC ACC AAG AAC CA.4 GTG TCC Phe Leu Asp Lys Val Trp Asn Asp Glu HIS Tyr Ile Lys Leu Gin Val Arg Gly Ser Thr Lys AS,, GLn Val Ser
707
ATT TCG ATC AAT GGT AA.4 GAG GAG ATC TTG GAG ACT CCT GAT WA TAC GTT GCA TAC AGC GAG AGT GGC TCT GTT TCT GGC u.4 CCT GTC Ile Ser Ile Asn Gly Lys Glu Glu Ile Leu Glu Thr Pro Asp Ala TYK Val Ala Tyr Set Glu Ser Gly Ser Val Sex Gly Lys Pro "al
79:
211
TAT GTG MC Tyr Val Asn
TAC GGG CTG AAA AA.4 GAT TTT GAG ATC ATA CAG AAG GTC GTG GCT TCA CTG AAT GGA ACC ATA GTC ATT GTC AGA GCT GGA Tyr Gly Len Lys Lys Asp Phe Glu Ile Ile Gin LYS VaL Val Ala Se= Leu Asn Gly Thr Ile Val Ile Val Arg Ala Fly
887
241
AU, ATA ACA CTT GCT GAG AAG GTT GCA AAT GCC AAA GAG GCA GGA GCA GCT GGA GTC CTC ATG TAC GTG GAT TCA CTC AAG TAT GGA ATA 271 Lys Ile Thr Leu Ala Glu Lys Val Ala Asn Ala Lys Glu Ala Gly Ala Ala Gly Val Leu Mnt Tyr Val Asp Ser Leu Lys Tyr Gly Ile
977
CGA ATG CAG CTG GCT GCC AGG Arg Met Gin Leu Ala Ala A=&
CAA SAT GGA AGT GGC GGG T& GLn Asp Gly Se= G1.y Giy Cys
AC.4 GAT ACA CTT A-C CCA TTC GGA CAT GCC CAC CTT GGA ACT GGA GAC CCT TAC ACC CCA GGC TTC CCT TCG TIC AK 301 Thr Asp Thr Leu Iie Pro Phe Gly h‘is Ala His it?= Gly Thr Gly Asp Pro Tyr Thr Fro Gly Phe Pro Ser Phe Am CC.4 CCA GTT GAA TCT TCA GGA CTA CCC CAC ATT GCT GTT CAG Pro Pro Val Glu Ser Ser Gly Leu Pro His Ile Ala Val Gin * GAC ACA T:C TCT GAA GGT TGG AAA GGT GCG ATC CAT TCC TGT 361 Asp Thr Cys Ser Glu Gly Trp Lys GLy Ala Ile Nls Ser Cys 331
P&C AAT TCC ATG 391 Asn Asn Ser Met
CAC ACC CAG TTT 1067 HIS Thr Gin Phe
ACC ATC TCT AGC AGT GCA GCA GCC AGG CTG TTC AGI: AAA ATG GAT GGA 1157 Thr Ile Ser Ser Ser Ala ALa ala Arg Leu Phe Ser Lys Met asp Gly AAG GTG ACA ACA AAG CAC GAG AGC CAG ATA ATG GTG A&A CTA GAT GTG Lys VaL Thr Thr Lys Hxs Glu Ser Gin Ile Met Val Lys Leu Asp Val
1247
AA.4 GAC AGG AAG ATT CTG h4C ATC TTC GGT GCT ATC GAG GGA TTT GA.4 GPA CCA GAT CGG TAT GTT GTG ATT GGA FCC 1337 Lys Asp Arg Lys ILe Len Am Ile Phe Gly Ala 110 Gin Gly Phe Glu GLu Pro Asp Arg Tyr "a,. Val Ile Gly .&la
CAG AGA GAC TCC TGG GGC CCA GGA GTG GCT AA!, GCT GGC ACT GGA ACT GCT ATA TTG TTG GAA CTT GCC CGT GTG ATC TCA GAC ATA FTG 421 Gin Arg Asp Ser Trp Gly Pro Fly Val Ala Lys Ala Gly Thr Fly Thr Ala Ile Leu Leu Glu Leu Ala Arg Val ile Ser Asp Ile Val
142,
AA.4 AK GAG GGC 451 Lys Asn Glu Gly
1517
TAC AA.4 CCG AGG CGA AGC ATC ATC TTT GCT AGC TGG AGT GCA GGA GAC TAC GGA GCT GTG GGT GCT ACT GA.4 TGG CTG Tyr LYS Pro Arg Arl: Se= Ile Ile Phe Ala Se= Trp Ser Ala GLy Asp Tyr Gly Ala Val Gly ALQ Thr Glu Trp Leu
GAG GGG TAC TCT GCC ATG CTG CAT GCC AAA GCT TTC ACT TAC ATC AGC TTG GAT GcT CCA GTC CTG GGA GCA AGC CAT GTC AAG ATT TCT 1607 481 Glu Gly Tyr Ser Ala Met Leu His Ala LYS Alu. Phe Thr Tyr Ile Ser Leu Asp Ala Pro Vnl Leu Gly Ala Ser His Val Lys Ile Sex GCC AFC CCC TTG CTG TAT ATG CTG CTG GGG AGT ATT ATG f&G GGG GTG AAG AA? CC.4 GCA GCA GTC TCA GAG AGC CTC TAT AAC AGA CTT 169, 511 Ala Ser Pro Leu Leu Tyr Met, Leu Leu Giy Sex Ile Met Lys Gly VaL Lys Asn Pro Ala ALa Val Ser Glu Ser Leu Tyr AS* Arg Leu GGC CCA GAC TGG GTA AAA GCA GTT GTT CCT CTT GGC CTG GAT AAT GCA GCG TTC CCT TTC CTG GCG TAC TCA GGA ATT CCA GTG TTG TCT 278, 541 Gly Pro Asp Trp Val LYS Ala Val Val Pro Leu Gly Leu Asp As" Ala Ala Phe Pro Phe Leu ALa Tyr Ser Gly Il,e Pro "al ~eu Ser TTT GCT TTC TAC AAT AAA FAT GAG GAA TAT CGC 571 Phe Gly Phe Tyr Asn LYS Asp Glu Glu Tyr Arg
TTC CTG GAC ACT AAG GGT GAC ACA CTG GAG AAC CTG AGG AAA ATT GAT MT CTG GAT Phe Leu Asp Thr Lys Gly Asp Thr Leu Glu Asn Leu Arg Lys ILc Asp Asn Leu Asp
1877
GCT CTT CTG GCT GCT GCT GCA GA.4 GTA GCT GGA CAA GCA GCT CTC AGG CTG KC CAT GAT CAT GAG CTC TTC CTG GAC AK GGG AGA TAC 601 Ala Leu Leu Ala Ala Ala Ala Glu VaL Ala Fly Gin Ala Ala Leu A% Leu Thr HE Asp His Flu Leu Phe Leu Asp Ile Gly AXE ~yr
196,
AGT GAA GA.4 TTA CTG GCA 631 Ser Glu Glu Leu Leu Ala GCC CGT GGT GAC 661 Ala Arg Gly Asp
691
TAC CAG GAG GAG TTT TTG CC? TAC ATT AAG F&4 GTG CGG GAG CTG GGG TTG ACC TTG GAC TGG CTG TTT TTT 205, Tyr Gin Glu Glu Phe Leu Pro Tyr Ile LYS Glu Val Arg Glu Leu Gly Leu Thr Leu Asp Trp Leu Phe Phe
TTC CAG CGA GCT GTA ACT GCA CTG AGA AGA GAC ATT GCA AK AGT GAC GGG GAG AK AGG GTC AK CGC AGG GCC CTG 2147 Phe Gin Arg Ala Val Thr Ala Leu Arg Arg Asp Ile Ala As" Ser Asp Gly Glu As,, Ark "al IL@ Arg Arg Ala Leu
AAT GAC AGG ATG ATG AAG GTG GAG TAT GAC TTC CTG TCC CCG TAT CTC TCA CCA AAA GAT GTC CCT TTT CGC CAC ATC TTC TTT GGC AAA 2237 Asn Asp Arg Met Met LYS Val Glu Tyr Asp Phe Leu Ser Pro Tyc Leu Ser Pro Lys Asp "al. Pro Phe Ar& His Ile Phe Phe Gly ~ys
CGC CCC CAC ACC CTG CGG AGT CTG GTG GAG CAT CTG CAG CTG TTG AA4 ACC AX AGG AGC AGC GTG GA? CTG MC TTG CTF AGG GAG CAC- 2327 721 GLy Pro His Thr Ler: Ar8 Ser Leu Val Glu His Leu Gin Leu Lru Lys Thr Asn Arh Ser Ser "al Asp Leu ~sr> Leu Leu /,rg flu Gin CTG CCC CTA GCA ACG TGG ACC ATT AAA GGG GCG GCC AAT GCC TTG GGA GGT GAT ATC TGG GAA ACT GAC MT G.&A TTC TAG ACACTGCMGC 751 Leu Ala Leu Ala Thr Trp Thr Ile LYS Gly Ala Ala Asn Ala Leu Gly Fly Asp Ile Trp Glu Thr Asp Asn Glu phi> END ACGTGGTTAAGGTAACAGGGT
Fig. 1. The nt sequence domain
2440
of the 5’noncoding
is singly overlined
2419
and coding regions of CTRcDNAl,
and the potential
N-linked
glycosylation
a full-length
sites are underlined.
cDNA clone encoding
Cys codons
are marked
the cTR. The putative with asterisks,
transmembrane
and the internalization
recognition signal is marked by bold-face letters and doubly overlined. A Igtl 1 chicken cDNA library prepared from chicken erythroblasts was screened by standard methods (Maniatis et al., 1982) using CTRZ, a cTR genomic clone (Chan et al., 1989), as probe. A S-kb Eli-length cTR cDNA clone, CTRCDNA
1,wasisolated. Three EcoRI fragments
span the 5’-noncoding of overlapping
fragments
and coding regions were prepared
(0.6, 1.2 and 0.7 kb in size) and two EcoRI partial-digestion
were subcloned
using the Cyclone
into bacteriophage kit (International
M13mp19
for single-stranded
Biotechnologies,
fragments
(1.8and 1.9 kb in size) which
DNA sequencing
Inc., New Haven,
in both directions.
CT) and sequencing
was performed
Sets by
the dideoxy method (Sanger et al., 1977) as previously described (Chan et al., 1989). The sequence data were analyzed by the IBI/Pustell DNA Sequence Analysis Program (Pusteil and Kafatos, 1986) with an AT&T PC. The CTRcDNAl sequence has been deposited in the EMBL Database (accession No. X55348). This sequence
was obtained
in one laboratory
(E.M.G.
and L.-N.L.C.).
The complete
nt sequence
of cDNAs
spanning
region of the cTR was independently obtained in the other laboratory (S.J., M.Q. and I.S.T.). cDNAs encoding the C-terminal screening a chicken primary bursal LL6 lymphoma cDNA library, kindly provided by Dr. Carol Nottenburg (Fred Hutchinson WA), with an hTR cDNA probe at low stringency. from the same &I 1 chicken erythroblast library
A cDNA containing the nt region encoding as CTRcDNAl. The nt sequence encoding
the entire coding
447 aa were obtained by Cancer Center, Seattle,
the remaining 329 N-terminal aa of the cTR was obtained this region of the cTR obtained in both laboratories was
identical. Four confirmed sequence differences that may represent polymorphic variations were found in the coding region obtained from cDNAs the bursal lymphoma library: A’8’” + G, C’“@‘+ T, A*‘“‘-+ C and AZ3’* + G. This leads to two aa changes, Arg5s’ ---+His and Lys7jh -+Gln.
from
251 previously (Chan et al., 1989; Koeller et al., 1989; Harford et al., 1991). In this study, we describe the nt sequence of the coding region of the chicken cDNA and the deduced primary structure of cTR. Comparison of the aa sequence of cTR, mTR and hTR reveals structural features that are likely to be important in various aspects of TR function.
EXPERIMENTAL
1234
nt
AND DISCUSSION
(a) The nt sequence of chicken TR cDNA The nt sequence of the coding and 5’-noncoding regions of CTRcDNAl, a recombinant cTR cDNA clone, is shown in Fig. 1. In the 5’-noncoding region 77 nt have been sequenced. The coding sequence contains 2328 nt corresponding to 776 aa. A single start codon, ATG, and one stop codon, TAG, are present. The sequence of the 3’-noncoding region of cTR cDNA has been described already (Chan et al., 1989). The complete cTR cDNA contains 5019 nt, which includes a second polyadenylation site. To define the 5’-terminus of cTR cDNA, primer extension experiments were performed (Fig. 2). When the 20-nt primer that is complementary to nt 19-39 (Fig. 1) was hybridized to RNA from chick embryonic RBC, a primer extension product of 85 nt was produced (Fig. 2, lane 4). Negative controls, namely, hybridization of complementary primer to globin mRNA and hybridization of noncomplementary primer to chick-embryonic RBC RNA, gave no discernable extension products (Fig. 2, lanes 2 and 3, respectively). These results indicate that the cTR 5’-noncoding region contains 124 nt and that cTR mRNA from embryonic RBC has another 47 nt upstream from the 5’-end of CTRcDNAl.
Fig. 2. Analysis
of the 5’ terminus
from globin
mRNA
from
embryonic
chick
4, extension mentary
and complementary
product
RBC from
(b) Primary structure of cTR The aa sequence of cTR deduced from the cDNA sequence is shown in Fig. 1 also. The hydropathy plot of cTR (data not shown) indicates that the only region of high hydrophobicity, which likely corresponds to the transmembrane domain of the receptor, is located close to the N terminus of the polypeptide. cTR contains five Cys residues, one at the junction of the cytoplasmic domain and transmembrane region (Cys”) and four others (Cys residues 101, 108, 363, 374) in the external domain. Four potential N-linked glycosylation sites are located at Asn residues 261, 326, 391, and 738. The M, of the unglycosylated cTR predicted from its primary sequence is 85 72 1. This M, is consistent with what was previously reported for cTR isolated from chicken erythroblasts transformed with avian erythroblastosis virus (Schmidt et al., 1985). Northern-blot analyses show that cTR mRNA is consistently 4.9 kb in size in several different chick embryonic tissues (Chan et al., 1989; Gerhardt and Chan, 1990).
(Chan
were synthesized Peptide
and
noncomplementary
embryonic
RBC
product product primer;
and comple-
analyzed
in the same gel. Total RBC as previ-
et al., 1989). Oligodeoxyribonucleotide
Laboratory.
thereof.
RNA
from 12-day chick embryonic of Texas Medical
The primers
contain
region of the cTR cDNA sequence
complement
3, extension
on the right margin show the size of DNA
by the University
Synthesis
noncoding
by primer extension.
primer;
against DNA markers
cellular RNA was prepared ously described
RNA
chick
primer. The numbers
(in nt) as measured
of cTR mRNA
are: 1, primer (20-mer) alone; 2, extension
The lane designations
Primer extension
primers
Branch
DNA and
nt 19-38 in the 5’-
(see Fig. 1) and the reverse
analyses
were performed
accord-
ing to established procedures (Ausubel et al., 1989). The primers were end-labeled with [y-32P]ATP using T4 polynucleotide kinase. Hybridization of labeled primers to globin mRNA, which was used as a control, to RNA from
12-day chick embryonic
Reverse transcriptase the primers. and sequence
from avian myeloblastosis
The reaction
a 6% polyacrylamide
RBC was carried
products
sequencing
virus was used to extend
were purified gel together
and
out at 30°C.
and then analyzed
with DNA of known
on size
as markers.
(c) Evolutionary conservation of TR aa sequence and organization Comparison of the deduced primary structure of cTR with hTR and mTR (McClelland et al., 1984; Schneider et al., 1984; Trowbridge et al., 1988) reveals several common structural features (Fig. 3). The cTR is 776 aa compared with 760 and 763 aa for the hTR and mTR, respec-
252 t 1 96 HUMAN NNDQARSAFS NLFGGEPLSY TRISLARQVD GDNSHVFMKL AVDEEENAD. ...NNTKANV TKPKRCSGSI CYGTIAVIVF FLIGFMIGYL GYCKGVEPKT l
MOUSE CHICKEN consensus
t
MNDQARSAFS NLFGGEPLSX TRTSLARQVD GDNSHVEMKL AADEEENAD. ...NNMKASV RKPKRFNGRL CFAAIALVIF FLIGFMSGYL GYCKRVEQKE .MDHARAALS NLFSVEPMSY TIVSIARQTD GDNSHVEMKL SADDEEGGDI ERPEHMHVSM AQPQRNGKRL CFLVIAAVLL LLIGFLIGYL SYRGRMQLAA -MDqARsAfS NLFggEPlSY =SlARQvD 'GDNSHVEMKL a-DEEEnaD- ---nn-ka-v -kPkr--g-- C---IA-V-f fLIGFm-GYL g'lck-ve-kt
186 LWENQFREF KLSKVWRDQH YYIENQFHEF KFSKVWRDEH TYIHEEFRNF .LDKVWNDEH -Y-e-qF-eF kL-KVWrD-H
MOUSE CHICKEN Consensus
ECERLAGTES ECVKLAETEE RCQDGSGGCE eC__la_te_
PV .... ...R TD .......K ITPTASYLVD __________
TDFTSTIKL. 1EFADTIK.Q KNLVDNLRWR --f-tik--
LNENSYVPRE LSQNTYTPRE VGVDSF...E 1--n-y-prE
HUMAN MOUSE CHICKEN Consensus
FVKIQVKDS. YVKIQVKSSI YIKLQVRGST YvKiQVk-S-
.AQNSVIIVD KNGRLVYLVE NPGGYVAYSK AATVTGKLVH ANFGTKKDFE .GQNMVTIVQ SNGNLD.PVE SPEGYVAFSK PTEVSGKLVH ANFGTKKDFE KNQVSISING KEE ....ILE TPDAYVAYSE SGSVSGKPVY VNYGLKKDFE --Qn-v-Iv- -ng---- vE -P-gYVA-Sk ---V-GKlVh aNFGtKKDFE
...DLYTPw ...ELSYSm IIQKVVASM ----1---vN
TSIVIVRAGK ~LVIVRAGE ~IVIVRAGK Gs-VIVRAG-
HUMAN MOUSE CHICKEN Consensus
YMoQTKFPIV NAeLSFFGHA YMDKNKFPW EADLALFGHA Y'JDSLKYGIT DTLIP.FGHA YmD--Kfp-v -a-l--FGHA
HLGTGDPYTP HLGTGDPYTP HLGTGDPYTP HLGTGDPYTP
GFPSF-F GFPSF-F GFPSF-F GFPSFNHTQF
PPSRSSGLPN PPSQSSGLPN PPVESSGLPH PPs-SSGLPn
IPVQTISRAA IPVQTISRAA IAVQTISSSA IpVQTISraA
AEKLFGNMEG AEKLFGKMEG AARLFSKMDG AekLFg-MeG
D.CPSDWKTD .STCRMVTSE .SCPARWNNID .SSCKLELSQ DTCSEGWKGA IHSCKVTTKH --Q--W--d -s-C----s-
SKN..VKLTV NQN..VKLIV ESQIMVKLDV --n--VKL-V
HUMAN MOUSE CHICKEN Consensus
SNVLKEIKIL NIFGVIKGFV KNVLKERRIL NIFGVIKGYE Bl,iMKDRKIL NIFGAIQGFE -NvlKe--IL NIFGvIkG--
EPDHYVWGA EPDRWVVGA EPDRYWIGA EPD-YWvGA
QRDAWGPG.A AKSGVGTALL QP.DALGAGVA AKSSVGTGLL QRDSWGPG.V AKAGTGTAIL QRDa-G-G-a AKs-vGT-1L
LKLAQMFSDH LKLAQVFSDM LELARVISDI LkLAq-fSDm
VLKDGFQPSR ISKDGFRPSR VKNEGYKPRR --kdGf-PsR
SIIFASWSAG SIIFASWTAG SIIFASWSAG SIIFASW-AG
DFGSVGATEW DFGAVGATEW DYGAVGATEW DIG-VGATEW
476 LEGYLSSLHL LEGYLSSLHL LEGYSAMLHA LEGYlssLHl
HUMAN MOUSE CHICKEN Consensus
KAFTYINLDK KAFTYINLDK KAFTYISLDA KAFTYInLDk
AVLGTSNFKV WLGTSNFKV PVLGASHVKI -VLGtSnfkv
SASPLLYTLI SASPLLYTLM SASPLLYMLL SASPLLYtL-
EKTMQNVKHP GKIMQDVKHP GSIMKGVKNP -k-Mq-VKhP
VT.GQFLYQ. VD.GKSLY.R AAVSESLYNR v--g --LY--
.DSNWASKVF, KLTLDNAAFP .DSNWISKVE KLSFDNAAYP LGPDWVKAW PLGLDNAAFP -dsnW-skVe kL--DNA&-P
FLAYSGIPAV FLAYSGIPAV FLAYSGIPVL FLAYSGIPav
SFCFCE.DTD SFCFCE.DAD SFGFYNKDEE SFcFce-D-d
572 YPYLGTTMDT YPYLGTRLDT YRFLDTKGDT YpyLgT--DT
HUMAN MOUSE CHICKEN Consensus
YKJZLIERIPE LNKVARAAAE YEALTQKVPQ LNQMVRTAAE LENL.RKIDN LDALLAAAAE y-L----Ln---r-AAE
VAGQFVIKLT VAGQLIIKLT VAGQAALRLT VAGQ--lkLT
HDVELNLDYE HDVELNLDYE HDHELFLDIG HDvELnLDye
RYNSQLLSFV MYNSKLLSFM RYSEELLAYQ -Yns-LLsF-
RDLNQYRADI KDLNQFKTDI EEFLPYIKEV -dlnq ---di
KEMGLSLQWL RDMGLSLQWL RELGLTLDWL -emGLsLqWL
YSARGDFFRA YSARGDYFRA FFARGDFQRA ysARGD-fRA
TSRLTTDFGN TSRLTTDFHN VTALRRDIAN tsrLttDf-N
672 AEKTDRFVMK AEKTNRFVMR SDGENRVIRR aekt-Rfvm-
HUMAN MOUSE CH1CKF.N Consensus
KLNDRVMRVE EINDRIMKVE ALNDRMMKW --NDR-M-VE
KESPFRHVFW RESPFRHIFW KDVPFRHIFF -esPFRHIFW
GSGSHTLPAL GSGSHTLSAL GKGPHTLRSL GsGsHTL-aL
LENLKLRKQN NGAFWLFR VENLKLRQW mFmLFR VEHIQLLKTbI XSVDLNLLR -EnLkLr--N --afnetLFR
NQLALATWTI NQLALATWTI EQIJUATWTI nQLALATWT1
QGAANALSGD QGVANALSGD KGAANALGGD qG-ANALsGD
760 VWDIDNEF IWNIDNEF IWETDNEF -W-IDNEF
HUMAN
EEPG..EDFP SETMETEDVP GEGTVEEEIQ -E----Ed-p
AARRLYWDDL TSSRLYWADL GPPVIFWPEL ---rlyW-dL
KRKLSEKLDS KTLLSEKLNS KAMLSKKLSA K--LSeKL-s
AGSQKDENLA AGSQKDESLA AGEAEDTNMA AGsqkDe-lA
+ ++
+++
ITFAEKVANA ITFAEKVANA ITLAEKVANA ITfAEKVANA
c
+++
Fig.3. Comparison
of the aa sequences
of cTR, mTR, and hTR. Dots represent
gaps introduced
31-l
l
l
YHFLSPYVSP YHFLSPYVSP YDFLSPYLSP YhFLSPYvSP
281 ESLNAIGVLI QSFNAIGVLJ KEAGAAGVIJ4 ---nAiGVLI
to align sequences.
Capital
l
letters in the consensus
sequence represent aa common to all sequences, lower-case letters represent aa present only in hTR and mTR sequences, dashes indicate positions at which hTR and mTR sequences differ. Cys residues in the hTR are marked with asterisks and conserved Cys are shown in bold-face letters. Putative glycosylation
sites are underlined
and their positions
in the hTR sequence
marked
with plus signs. The position
and the internalization recognition signal sequence is shown in bold-face letters and doubly overlined. hTR. The mTR sequence has been deposited in the EMBL Database (accession No. X57349).
tively. All three proteins lack a leader sequence and the overall hydropathy profiles of avian and mammalian TRs are essentially superimposable. Each receptor is organized into a small N-terminal cytoplasmic domain, a single transmembrane region, and a large C-terminal extracellular domain that are 68,23, and 685 aa, respectively, in the case of the cTR. Although the cytoplasmic domain of the cTR has only 62% identity with the hTR compared with 93% identity between the cytoplasmic tails of mTR and hTR, the YXRF tetrapeptide recognition structure for high-efficiency
of transmembrane
The numbers
correspond
regions
is overlined
to the aa residues
in
endocytosis of the hTR (Collawn et al., 1990) is conserved in the cTR. This structural conservation is consistent with the rapid internalization of hTRs expressed in chicken embryo fibroblasts (Jing et al., 1990). The phosphorylation site at Ser24 and the acylation site at CYSTSin the hTR are both conserved in the cTR (Schneider et al., 1982; Rothenberger et al., 1987; Jing and Trowbridge, 1990). However, both the mTR and cTR lack CyP that is another lipid attachment site in the hTR (Jing and Trowbridge, 1987; 1990). Several features of the extracellular domain of
253 the cTR are significant.
Only three of the six Cys residues
found in the extracellular domains of the mTR and hTR are conserved in the cTR. CYSTS and CYSTS form two intermolecular disullide bonds between subunits of hTR (Jing and Trowbridge, 1987) but only Cys9* is conserved in the cTR. This finding is consistent with the observation that a fraction of cTRs, similar to mutant hTRs in which either CYSTSor Cys 98 has been modified to Set-, are not disulfidebonded dimers (Schmidt et al., 1985; Jing and Trowbridge, 1987). The three glycosylation sites found in mammalian TRs are conserved in the cTR and are located in, or adjacent to, regions of relatively high sequence similarity. Some regions-of similarity ‘may be important for receptor dimerization which is known to involve noncovalent interactions between the external domains of TR subunits (Omary and Trowbridge, 1981). The region of approx. 150 aa of the extracellular domain of the TRs proximal to the cell membrane are poorly conserved, and both the mTR and cTR lack the tryptic cleavage site at Arg’“’ in the hTR (Turkewitz et al., 1988). Overall, the cTR extracellular domain has 53 y0 identity to the extracellular domain of the hTR compared with 77% identity between the mTR and hTR. The sequence divergence of the cTR extracellular domain accounts for the fact that cTR does not bind most mammalian TRs, whereas most mammalian TRs can utilize other mammalian transferrins but not ovotransferrin (Shimo-Oka et al., 1984). The construction and functional analysis of human-chicken hybrid TRs in which different regions of the hTR are replaced by cTR sequences may provide information that will aid in the localization of the transfer+ binding site.
Ghan,
L.-N.L.,
Chicken
Grammatikakis,
transferrin
sequences
N., Banks,
receptor
and expression
gene:
J.M. and Gerhardt,
conservation
in erythroid
E.M.:
of 3’ noncoding
cells. Nucleic
Acids
Res. 17
(1989) 3763-3771. Collawn, J.F., Stangel, M., Kuhn, L.A., Esekogwu, V., Jing, S., Trowbridge, I.S. and Tainer, J.A.: Transferrin receptor internalization sequence
YXRF implicates
motif for endocytosis. Gerhardt,
EM.
chicken
and Chan.
transferrin
a tight turn as the structural
recognition
Cell 63 (1990) 1061-1072. L.-N.L.:
receptor
Structure
and expression
of the
gene. J. Cell Biol. 111 (1990) 120a.
Harford, J.B., Casey, J.L., Koeller, D.M. and Klausner, R.D.: Structure, function, and regulation oftransferrin receptor: insights from molecular biology.
In: Steer,
Trafficking
C.J. and
of Proteins.
Hanover,
Cambridge
J. (Eds.),
University
Intracellular
Press,
Cambridge,
U.K., 1991, in press. Huebers,
H.A. and Finch, CA.: The physiology
ferrin receptors.
Physiol.
Jing, S. and Trowbridge,
IS.: Identi~cation
fide bonds of the human
of transferrin
and trans-
Rev. 67 (1987) 520-582. transferrin
of the intermolecular
receptor
disul-
and its lipid-attachment
site. EMBO J. 6 (1987) 327-331. Jing, S. and Trowbridge, I.S.: Nonacylated human transferrin receptors are rapidly internalized and mediate iron uptake. J. Biol. Chem. 265 (1990) 11555-11559. Jing, S., Spencer,
T., Miller, K., Hopkins,
ofthe human transferrin localization
receptor
of a specific
C. and Trowbridge,
cytoplasmic
signal sequence
1,s.: Role
domain in endocytosis:
for internalization.
J. Cell
Biol. 110 (1990) 283-294. Koeller,
D.M.,
Casey,
L.-N.L., Klausner, to structural ferrin
J.L.,
elements
receptor
Hentze,
M.W.,
R.D. and Harford,
Gerhardt,
J.B.: A cytosolic
within the iron regulatory
mRNA.
Proc.
Natl.
Acad.
E.M.,
Chan,
protein
binds
region of the transSci. USA
86 (1989)
3.574-3578. Maniatis,
T., Fritsch,
Laboratory
E.F. and
Manual.
Harbor,
NY, 1982.
McClelland,
A., Kuhn,
Sambrook,
Cold Spring
L.C. and Ruddle,
receptor
gene:
genomic
structure
of the receptor
J.: Molecular
Harbor
and
transferrin
the complete
from a cDNA
A
Cold Spring
F.H.: The human
organization, deduced
Cloning.
Laboratory,
primary
sequence.
Cell 39
(1984) 267-274. McGraw, ACKNOWLEDGEMENTS
T.E. and Maxfield,
nalization
is partially
cytoplasmic
We thank Dr. B. Vennstrom for the chicken erythroblast cDNA library and Dr. Carol Nottenburg for the chicken bursal lymphoma cDNA library. We also thank Ms. N. Tovar and Joan Stewart for their assistance in the preparation of this manuscript. This work was supported in part by a grant from the American Heart Association, a University of Texas Medical Branch Small Grant, a grant from the John Sealy Memorial Endowment Fund, and grant CA34787 from the National Cancer Institute.
Omary,
domain.
Pustell,J.
in cultured
and Kafatos,
transferrin
upon an aromatic
IS.: Biosynthesis
ofthe human transferrin
F.C.: A convenient
and adaptable
for DNA and protein sequence
sis. Nucleic
Acids Res. I4 (1986) 479-488. S., Iacopetta,
transferrin
receptor
phorylation
inter-
cells. J. Biol. Chem. 256 (1981) 12888-12892.
environment Rothenber~er,
receptor
amino acid on the
Cell Reg. 1 (1990) 369-377.
M.B. and Trowbridge,
receptor
F.R.: Human
dependent
B.J. and Kuhn,
and analy-
L.C.: Endocytosis
requires the cytoplasmic
site. Cell 49 (1987) 423-43
microcomputer
manipulation
of the
domain but not its phos-
1.
Sanger, F., Nicklen, S. and Coulson. A.R.: DNA sequencing with chainterminating inhibitors. Proc. Natl. Acad. Sci. USA 74 (1977) 5463-5467. Schmidt, J.A., Marshall, terization
J. and Hayman,
of the chicken
transferrin
M.J.: Identification receptor.
Biochem.
and characJ. 232 (1985)
735-741. REFERENCES Alvarez,
E., Girones,
plasmic
domain
Schneider, N. and Davis,
Biochem. J. 267 (1990) 31-35. Ausubel, F.M., Brent, R., Kingston, Smith, J.A. and Struhl, Wiley, New York,
R.J.: A point mutation
of the transferrin
1989.
receptor
inhibits
in the cytoendocytosis.
features by the
K.: Current
Protocols
D.D., Seidman, in Molecular
J.G.,
Biology.
R., Newman,
R. and Greaves,
of the cell surface receptor for transferrin monoclonal antibody OKT9. J. Biol.
M.: Structural
that is recognized Chem 257 (1982)
85 16-8522. Schneider,
R.E., Moore,
C., Sutherland,
structure
C., Owen, of human
M.J., Banville, transferrin
D. and Williams,
receptor
sequence. Nature 311 (1984) 675-678. Shimo-Oka, T., Hagiwara, Y. and Ozawa.
deduced
J.G.:
Primary
from the mRNA
E.: Class specificity
of trans-
254 ferrin as a muscle tropic factor. J. Cell. Physiol. Stearne,
P.A., Pieterz,
murine adjacent Trowbridge,
transferrin regions.
Cell Physiol. surface
receptor: J. Immunol.
and the transferrin
126 (1986) 341-351.
J.W.: cDNA
sequence
cloning
of the
of trans-membrane
and
R.: Murine
studies with an anti-receptor
cell surface
monoclonal
trans-
antibody.
J.
112 (1982) 403-410.
I.S., Domingo, molecules
D.L., Thomas,
of the hematopoietic
M.L. and Chain, system:
A.: Cell
T200 glycoprotein
receptor
as models for proteins
and differentiation.
In: MacDermott,
Disease,
Medica
Excerpta
vier, Amsterdam,
134 (1985) 3474-3479.
I.S., Lesley, J. and Schulte.
ferrin receptor: Trowbridge,
G.A. and Goding,
Turkewitz, Schwartz,
A.P.,
involved in growth
R.P. (Ed.), Inflammatory
International
Congress
Bowel
Series, 775. Else-
1988, pp. 441-447.
Amatruda,
Borhani,
D., Harrison,
A.L.: A high yield purification
of the human
properties
J.F.,
receptor
and
Immunol.
263 (1988) 8318-8325.
of its major
extracellular
S.C. and transferrin fragment.
J.