Structure of the knob protein (KP) gene of Plasmodium falciparum

Structure of the knob protein (KP) gene of Plasmodium falciparum

Molecular and Biochemical Parasitology, 26 (1987) 11-16 Elsevier 11 MBP 00863 Structure of the knob protein (KP) gene of Plasmodium falciparum Yagy...

467KB Sizes 0 Downloads 85 Views

Molecular and Biochemical Parasitology, 26 (1987) 11-16 Elsevier

11

MBP 00863

Structure of the knob protein (KP) gene of Plasmodium falciparum Yagya D. Sharma and Araxie Kilejian The Public Health Research Institute, New York, NY, U.S.A. (Received 9 June 1987; accepted 10 June 1987)

We have determined the nucleotide sequence of the gene encoding the knob protein (KP) of Plasmodium falciparum (FCR3/Gambia). The gene is interrupted by an intron which contains 34 imperfect tandemly repeated ATTTT sequences. The first exon encodes 33 amino acids with a hydrophobic core typical of signal peptides. The second exon has an open translational reading frame for 597 amino acids. The deduced protein sequence indicates that KP has multiple structural domains; unlike the N-terminal histidine-rich domain which we described previously, the C-terminal half is rich in lysine residues. Consistent with the apparent association of KP with the cytoplasmic surface of the host erythrocyte membrane, the protein is highly charged and hydrophilic. Key words: Plasmodiumfalciparum; Knob protein; Malaria; Histidine-rich protein

Introduction T h e k n o b protein (KP) of Plasmodium falciparum is essential for the f o r m a t i o n of the characteristic knob-like protrusions on the host erythrocyte membrane. It has been identified in several geographical isolates and has b e e n characterized as a red cell m e m b r a n e - a s s o c i a t e d polypeptide which incorporates relatively large a m o u n t s of e x o g e n o u s histidine [1-7]. In a previous study we described a c D N A clone e n c o d i n g the N-terminal histidine-rich d o m a i n of the K P of isolate F C R - 3 [8]. W e now report the c o m p l e t e nucleotide sequence of the gene and the d e d u c e d primary structure of KP.

cloned k n o b - p r o d u c i n g G a m b i a n isolate F C R - 3 [1]. The D N A was digested with E c o R I and fractionated on a 10-40% sucrose gradient. Aliquots of the fractions were analyzed on a 0.8% agarose gel and transferred to nitrocellulose paper [9]. The filter was p r o b e d with nick-translated pfc43 [10]. A fraction which gave a strong hybridization signal was ligated to C h a r o n 4 A E c o R I arms, packaged in vitro and used to infect Escherichia coli LE392 [10]. R e c o m b i n a n t phage were screened with pfc43 and positives were plaque-purified. R e c o m b i n a n t phage D N A was isolated [10] and PstI and B a m H I fragments were subcloned into p U C 8 [11]. T h e 3' end fragment of a PstI subclone was nick-translated and used to rescreen the c D N A library [8] for the isolation of clone pfc24.

Materials and Methods Isolation of c D N A and genomic D N A clones. The isolation and characterization of a c D N A clone, pfc43, from a c D N A library was described previously [8]. D N A was p r e p a r e d [8] f r o m an unCorrespondence address: Araxie Kilejian, The Public Health Research Institute, 455 First Avenue, New York, NY 10016, U.S,A. Abbreviations: bp, base pair; KP, knob protein; HRP, histidine-rich protein.

Sequence analysis. The chemical degradation [12] and the dideoxy chain termination m e t h o d s [13] were used for sequence analysis. Results A c D N A clone, pfc43, encoding the N-terminal half of KP was used to isolate a cross-hybridizing E c o R I genomic D N A fragment f r o m a partial library constructed in Charon 4A. To facilitate further analysis, PstI and B a m H I subfragments of

0166-6851/87/$03.50 © 1987 Elsevier Science Publishers B.V. (Biomedical Division)

12

m

f

I

':

.+,r+

I

i

---

;+. I

.,, I

II

,..

. + . .

I

--I 0

!

500

4 p.--

!

bp

Bg43

.

,

Fig. 1. Partial restriction map of the KP genc and sequencing strategy. The boxed areas indicate the coding regions: the first exon coding the signal peptide, S, separated by intron, I, from second exon, C. Closed circles indicate the origin of end-labeled fragments sequenced by the method of Maxam and Gilbert [12] in the direction and to the extent indicated by the arrows. Arrows with no circles indicate sequencing by the dideoxy chain termination method. The origin of the sequenced fragments is indicated with the underlined respective cDNA (pfc24) and genomic DNA clones (Pg43 and Bg43) described in the text.

the insert D N A were cloned in plasmid pUC8. A PstI subclone, denoted Pg43, and a B a m H I subclone, denoted Bg43, were used for further analysis. All of the c D N A clones which we had identified previously [8] proved to be lacking the 3' end of the KP gene. Therefore, a 3' end fragment of Pg43 was used to isolate a c D N A clone, pfc24, encoding the C-terminal half of the KP gene. The structure of the KP gene was determined by comparative restriction enzyme analysis and sequencing of selected fragments of the genomic and c D N A clones as summarized in Fig. 1. The complete nucleotide and deduced amino acid sequence shown in Fig. 2 was derived by combining our previous data on c D N A pfc43 [8] and the sequence of fragments indicated in Fig. 1. A comparison of the 5' end of pfc43 [8] with that of genomic clone Pg43 clearly shows that the gene is interrupted with a 438 bp intron. The first exon encodes 33 amino acids with a hydrophobic core typical of signal peptides. The sequences of the splice sites (Fig. 2, boxed) are similar but not identical to published consensus sequences for 5' (AG { G T A ) and 3' ( T X C A G { ) junctions [14]. The intron shows a unique feature: it contains 34 imperfect repeats of A T T T T (Fig. 2, overlined). The second e x o n of the KP gene has an open translational reading frame of 1791 bp encoding 597 amino acids, with a calculated molecular weight of 64 843. In the absence of chemical data

on the N-terminal amino acid of mature KP, we have n u m b e r e d the first amino acid following the intron as 1 for the convenience of discussion (Fig. 2). On polyacrylamide gels the precursor of KP has a molecular weight of 75000 [15]. The apparent difference in the estimated and calculated molecular mass may be due to anomalous electrophoretic mobility as has been noted for several other malarial polypeptides. The deduced structure of KP shows some interesting features; it is distinctly a multidomain polypeptide with limited regions of repeated sequences (Fig. 2, underlined). The outstanding repeats are the three polyhistidine stretches characterizing the N-terminal histidine-rich domain (residues 24-80), three lysine-rich fragments (residues 336-398), and three imperfect repeats of the sequence Ala Thr Lys Glu Ala Ser Thr Ser Lys (residues 507-532). The latter two repeated sequences form the rough boundaries of a lysinerich domain. The multidomain nature of KP is more clearly apparent in Fig. 3 which shows the hydropathy index and charge distribution of the polypeptide; KP is clearly a highly charged and hydrophilic protein. The amino acid composition of KP deduced from the nucleotide sequence is as follows: Lys 84, Gly 65, Ala 55+ His 49, Glu 48, Asn 43, Set 43, Thr 39+ Gin 30, Asp 30, Pro 22, Tyr 20, Val 19, Leu 13, 1/2 Cys 11, Arg 10, Phe 9+ lle 4, Met 3. Although the 49 histidine residues account for

TTTAATTATAAx~aaI`ATAATTATACAAT~TATATATTTATTATTTTTATATGTTTATTAACACAz~x~x~z~TATATTTCATTATTAT~TTATATT~x~zz~TTTTTTTT TATAAATATATAACATATATTAGTTATTTTTTACACCTAAAGAAAAATATAATTAxx~xxxARAGGAAAAGTATTAT

CGTACATAATTATT~

TATATATTATATATTTT~TTTATATATT

P h e L y s A en L y s ASh T hx LeU ~ x g A x g L y s L y e ~ l a p h e P r o V a l r b e T h ~ L y e I l e T T T AAGAAC A A A A A T ACT TTGAGGAGA~AAG GCT T T C C C T G T T T T T A C T A A A A T T

~t Lye set ATGAAJ%AGT

L~U L e u V a l C T T T T A GTC

S e t P h e L e u V a l T r p V a l 1 ~ u Lys ~ m S e t A ~ n A S h TCT TTT TTA GTA TGG GTT TTG AAG TGC TCT AAT ~...~GTTCATARAT~ATATATATATATATATATATATATATATAATTATGTGTATTTTATARCTTAGACATAT GTGTATATTCTTTA~xxx~;ACCTAATATTTTATATATATATTTATATATTAGTTTATTTCJU~CJ~ATATTTACTCCATATATTTATTTCATATATATATATATATAT&TATATATATAT ATATGTGATGTTTRAARAATATGAAJ%TATATATTTATATATTATTATTTTCTTATTTTA~;~ATTTTATTTTATTTTATTTTAATTTTATTTTATTTTATTTTA~A~ TA~TTTAATTTTATTTTAATXTTATTTTATTTTAATTTTATTTTATTTTATTTTAATTTTATTTTATTTTATTTTAATTTTA~A~AT~

1 10 CyS a s n a s n ~ l y a s h g l y 8 e r q l y a s p s e t p h e a s p p h e a r g a e n l y s a r q t h r TGC AAT AAT G~AA~C GG~ TCC GGT GAC TCC TTC CAT TTC *~T*~Jq~T A.~G /qC,J ~ - T

ATTTTTTTTTTCA

20 leu ala gln lys gln his qlu his his his TTA GCJq ~ AAG C.~J~CAT G~&CAC CAT CAC

30 40 $0 h i s hil h l s h i * g l n h i m g l n h i s g l n h i s q l n ala p r o h £ * g l n ala h i a h i s b l m h i s h i B h i s q l y Q 1 u w a l a s h h l s g l n ala p r o

CAC CAT CAC CAT CAA CAT CAA CAC ~

CAC CA). GCT CCA CAC CAA (;CA CAC CAC CAT CAT CAT CAT GGA GAA GTA AAT CAC CAA GCA ¢:CA

60 70 80 g l l l Vl~. h 2 s ~111 q ] ~ V i i h~S q l y Qlll l i p q l n a14 h i l h i s h 2 1 h i s h ~ l ~L~J h ~ l h i t h ~ l Q1D I I U g~Jm p r o g i n g l n l e u ~ l n g l y CAG GTT CAC CAR, CAA GTA CAT GGT CAA GAC CAA GCA CAC CAT CAC CAT CAT ~ ¢AC CAT CAT CJ4A TTA CAA CCT CAA CAA CTC tAG GGA 90 100 t h r w a l a l a a e n p r o p r o s e t a s h q l u p r o v a l v a l l y e t h r q l n v a l ]:be a r q G l u a l a ACA GTT GCT JUqT CCT CCT ~ ; T AAT GAA CCA GTT GTA ~ ACC ~ G'TA TTC AGG GAA ~

120 glu lyJ tyr

~ l u set l y s h i s t y r

ll0 arq pro qly gly Qly phe lye ala ~ CCJq GGT GG/q GGT TTC JqJ~A ~

130 140 l y s l e u l y s q l u ash v a l v a l asp g l y l y s l y s asp cy* sap q l u l y s t y r

tyr qlu TAT

~llu a l a a18 8sn t y r

GAA AAA TAC GAA TCA AAA CAC TAT AAA TTA AAG GAA AAT GTT GTC GAT GGT AAA AAA GAT TGT CAT GAA A~A TAC GAA GCT GCC AAT TAT

ala

150 phe Jet

qlu

glu eye pro tyr

thr

val

160 ash asp tyr

s e ~ G i n q l ~ a s p q l y p r o man l l e

170 ~he ala

leu

arq l y : a r q p b e p r o l e u

GCT TTC T C C G A A G A G T G C CC.% T A C A C C G T A A A C GAT T A T A G C C A A G A A R A T G G T C C A A A T A T A T T T G C C T T A A G A A A A A G A T T C C C T C T T

180 190 200 g l y l e t 8sn asp g l u asp g l u g l u g l y l y s g l u a l * l e u a l a L i e l y s asp l y e l e u p r o Qly q l y l e u asp g l u r y e q l n ash g i n l e u GC~ ATG ~ T GAT GAA GAT GAA ~ GGT ~ GAA GCA TTA ~ ATA AAA G~T ~ ~ A CCA GGT GGT ~"fA ~ T GAA TAC CAA ~ CAA TTA

tyr

210 gly lie

220 cys a|n

glu thr

cy* thr

thr

230

cy# qly pro ala

thr

lie

aep t y r

val

pro ala

asp ala

pro ash qly tyr

ala

tyr

qly

TAT GGA ATA TGT RAT GAG ACA TGT ACC ACA TGT GGA CCT GCC ACT ATA GAT TAT GTT CCA GCA GAT GCA CCA AAT GGC TAT GCT TAT GGA

240

250

260

qly set ala h£s asp qly set his Qly ash leu arg qly him Qly ash lys gly ser glu gly tyr gly tlrr qlu GGA AGT GCA CAC G~T GGT TCT CAC GGT ~ T TTA AG~ GG~ C~C GGT ~ T ~N~ GGT TC~ GAA GGT TAT GG~ TAT G ~

270

280

gly phe a~n gly ala

pro gly

elm pro tyr ash pro GCT CCA TAT ~ C CCA

2~0

~ e r a s h g Z y me~ g i n a s n t y r

val pro pro h~s qly

ala

qly tyr

8er ala

pro tyr

qly vaZ pro h~s

G G A T T T A ~ T G G T G C T C C T G G A A G T A A T G G T A T G C A A A A T T A T GTC C C A C C C C A T G G T C ~ A G G C T A T T C A G C T C C A T A C G G A G T T C C A C A T

300

310

320

qly al8 ala h~s gly set arq tyz set set phe set sez val ash lys GGT GCA GCC CAT GGT TCA AGA TAT AGT TCA TTC AGT TCC GTA ~J~T ~

tyr ~ly lye his qly asp ~lu ly* his his see set lye lys TAT GG~ ~ CAC GGT G~T CJ~ ~ CAC CAT TCC I ~ T A ~ ~ G

330 340 350 h i s g l u q l y aim asp q l y g l u q l y g l u l y s l y J l y : l y s set l y : l y s h i = l y s asp h i = asp g l y g l u l y J l y e lym s e t l y e lye* h i s ~.J~T GJ~q GGA AAT GAC GGT GAA GGA GAA .qJ~. /q~G AJqA .qAA TCA AAA A,~q CJ~C ,q.q.q GAC CA~ ~ T ~ ~ ~ ~ ~ ~ ~ ~ ~

lye

360 asp a*n gZu asp ala

qlu

Jet

val

lys

set

3"/0 lys ly:

hi8 lys

met his

asp cyJ qlu lye

3110 lye lyJ ~

A ~ A G A C ~ A T G A A G A T G C A G A A A G C G T & A A A T C A A A A A A A CAC A A A A G C C A C G A T T G T G A A A~G ~

met lys TC~ ~

lye his lys ~ CAC ~

asp asn GAC A A T

3~0 400 410 q l u ssp a l a q l u 8er v a l l y 8 set l y J l y s ser v a l l y s g l u l y s g l y g l u l y s hLs asn g l y l y s l y e p r o c y * met l y s l y ~ t h r

ash

G A A C A T G C A G A A A G C G T A A A A T C A A A A A A A A G T GTT A A A G A A A A G G G A G A A AAG C A T A ~ T G G A A A A A A A CCA T G C A G C A A A ~ A A A C T AAC

glu

420 glu a~n lys

430 ash lys

glu lys

~hr ash ~8n leu

ly*

ser

asp gly

8mr l y ~ a l a

440 91u ly~ ly~ ~lu

his

asn ~lu thr

1y8 asn thr

G A A G A A A A T A A A A P T A A A G A A A A A ~ C C A A T R A T T T A A A A T C A G A T G G A T C A A R A G C T C A T G A A A ~ A A A ~ GAP. A A T G A A A C A AJ%A A A C A C C

450 ala

qly glu ash

lys

lys

G~T GGA GA~ AAT AA~ ~

qly

480 lys thr

ala

asp set

~hr ser

460 81a asp ash lys

470 |mr thr

ash ala

ala

~ h r Cbx g l y a l ~

490 asp

lys

thr

lys

G C A G A T T C T A C T T C A GCT G A T R A T A & A T C A A C A A A T G C T G C T ACC ACA GGC G ~ ~

asp ly~ thr

qln qly

G~T RAA ACT CAA GGA

SO0

q l y a l a s e t ~ h r a s h a l ~ * l a ~ h r a s h l y 8 g l y q l ~ CyS a l a a l a q l u g l y t h r

th~

ly8 gly tl~

thr

lys

G G A A A A A C T G A C A ~ A A C A G G A G C A A G T A C T AAT GCC G C A AC.~m.A A T A ~ A G G A C A ~ T G T G C T G C T G A A G G A RC~A A C T A A G G G A A C A A C T A A A

Sl0

520

$30

G I U ala ser E h r 8er lys g l u ala thr lys g l u ala :er thr ser lys q l y ala t h r lys G I U a l a set t h r ~ h r g l u q l y a l a t h r lys G A A G C ~ A G T A C T T C T A A A G A A G C A A C A A A A G A A G C A A G T A C T T C T A A A G G A G ~ A A C T A & A GAJ% ~ ~ ~ ~ ~ ~ ~ ~ ~

.540 gly ala ~er ~hr thr

ala

qly

m e: t h r

thr

550 qly al~ thr

thr

qly ala

8sn el8 val

gin

560 ~er lys sap gly thr

ala

asp ly*

ash ala

G G A G C A A G T A C T A C T G C A G G T T C A A C T A C A G G A G C A A C T A C A G G A Gt'T R A T G C A G T A C A A T C T A A A G A T G ~ A A C T G~C G A T A ~ A R ~ T G C T

*la

S?0 asn ann gly

glu gln val met Jet

580 arg qly gin ala

gin

R A T A A T G G T G A A C-AA G T A A T G T C A A G A G G A C A R GC~ C ~

leu gin glu ala

~ly

lys

lys

T T A C A A G A A G C A G G A JU~G~

590 597 lys lys 1y8 ~rg qly c~s cy8 gly *** AAG ~ ~ AGA GGA TG~ TGT GOT TAA

ATAATCTCTGCAI~'TGATTCATAAAATATAATT&CTTTTTGAJqTTAGAAAGATATACCAATAAATATATAZ ~ i ~ i ~ iAAAA

Fig. 2. Nucleotide sequence of the KP gene and the deduced amino acid sequence of the coding region. The gaps in the sequence analysis shown in Fig. 1 were completed with our previously published data [8]. The splice junction sequences are boxed. The tandemly repeated elements of the intron are overlined. The major repeated coding sequences are underlined. Asterisks mark the termination codon.

14 A

tlillllllt I I IglI II II ,II,, llllll I ,u~

B

~':" I His-Rich

"~

I

loo

Lys-Rich

2oo

iltSl0Ut

aoo

411o

! t ]

5ol1

NUMBER

Fig, 3. Charge distribution (A) and hydropathy index (B) of

the knob protein. Basic residues of arginine and lysine are represented by long bars and histidine by half bars. The positions of acidic aspartic and glutamic acid residues are represented by long bars. The hydropathy index was calculated by the method of Kyte and Doolittle [28]. only 8% of the total amino acids, considering the fact that the average histidine content of analyzed polypeptides is around 2% [14], KP is a relatively histidine-rich protein. Discussion

We have determined the nucleotide sequence of the KP gene of P. falciparum and thereby deduced the amino acid sequence of the polypeptide. KP is one of 4 known malarial histidine-rich proteins (HRPs). The structure of each of the three other proteins, the H R P of Plasmodium lophurae [17,18] and two closely related alaninehistidine rich proteins ( S H A R P or P f H R P I I and PfHRPIII) of Plasmodium falciparum [19,20], llas been described previously. Although a c o m m o n physiological function cannot be assigned to these proteins, the structural organization of the 5' end of their genes shows a striking similarity; they are

all interrupted and the first exon contains a short open translational reading frame with hydrophobic amino acids typical of signal peptides. A similar organization has been reported for the gene of the R E S A antigen [21]. The only apparent c o m m o n attribute of KP, P f H R P l l and R E S A is their eventual association with the host erythrocyte m e m b r a n e [5,7,22]. It seems unlikely that a spliced leader peptide is correlated with the processing of malarial polypeptides through particular c o m p a r t m e n t s of infected erythrocytes; the sequences of the leader peptides of these proteins do not show any obvious similarity. Furthermore, the H R P of P. lophurae is localized in m e m b r a n e bound granules and there is no experimental evidence for the export of this polypeptide to the host cell m e m b r a n e [23]. The only m e m b r a n e barrier which these granules may possibly cross is the m e m b r a n e of the food vacuole; they apparently accumulate within residual bodies of mature schizonts. The repeated A T T T T elements of the Jntron of the KP gene appear to be a unique feature; an identical sequence is also present in the KP gene of a Honduran isolate HB3 [24]. However, the splice junction sequences of all the H R P s are identical, while that of the R E S A shows a single C for T substitution (Table I). The structure of KP is very different from the other HRPs. The H R P of P. lophurae as well as the two small histidine-alanine-rich polypeptides of P. falciparurn are dominated by tandem repeats of a few amino acids, while KP shows distinctly different structural domains with limited areas of repeated sequences. This may have functional relevance; although the details of the interactions of KP with the erythrocyte m e m b r a n e are not known, it has been suggested that it is associated with the cytoskeleton of erythrocytes [4].

TABLE I Splice junction sequences 5' junction Consensus HRP, P. lophurae PfHRP, P. ,[~llciparum RESA, P. falciparum KP. P. falciparurn

AG { GTA GC ~,GTA AT { GTA AC ~,GTA

3' junction TXCAG ,L TATAG $ TATAG ~ CATAG $ TATAG ,L

References [14] [ 17] [20] [21] This study

15 We speculate that the different structural domains of KP may be essential for specific interactions with cytoskeletal polypeptides which lead to formation of knobs. A computer comparison of the amino acid sequence of KP with sequences of other proteins revealed limited areas of homology with several proteins. The most extensive homology was with the histidine-rich glycoprotein of human serum [25]. This is expected in view of the presence of histidine-rich domains in these polypeptides.. The HRPs of P. falciparum, KP and the two related small histidine-alanine rich polypeptides, are very different from each other; however, they each share some common sequences with the H R P of P. lophurae. The small polypeptides and H R P contain the repeated AlaHisHis sequence while the relationship of KP and H R P resides in repeated polyhistidine sequences. Since incorporation of relatively large amounts of histidine is one of the attributes of KP from several geographical isolates, it could be inferred that this domain is conserved. The evolutionary relationship of the polyhistidine sequences of KP and H R P can be only speculative. Although the function of H R P remains unknown, the gene encoding this peculiar polypeptide (over 70% histidine) has been conserved after more than 4 decades of maintenance of P. lophurae in an experimental host, the duck. The serological cross-reactivity of immune human sera with H R P [26] raises the possibility that the histidine-rich domain of KP may be immunogenic in man. H R P was the first purified malarial polypeptide used to test the feasibility of using a single defined antigen as a malaria vaccine [27]. The role of KP, a functionally important, immunogenic [7,26] and apparently conserved malarial polypeptide, in the immunity to P. falciparum remains to be tested. Since the completion of this study, there have been two reports of the sequence of the KP from two different isolates. The partial sequence of the KP of Honduras I [29] is almost identical to that of FCR-3, with the exception of 3 bp substitutions which change Gly 254 to Asp, Asn 421 and 423 to Lys (Fig. 2). The most notable difference between the complete sequence of the KP of isolate NF7 [30] and FCR-3 is at residues 399-430 (Fig. 2). The apparent difference in the transla-

tional reading frame of this region is due to an additional A in NF7 preceding Ser 399 and the deletion of an A following Lys 426 (Fig. 2); the authors have already discussed the rationale for adding an A at position 399. It is of interest that NF7 lacks the DraI restriction site at position 429-431; our studies indicate that this sequence is conserved in a Thai as well as a Honduran isolate (unpublished). A second difference between the two isolates is the substitution of the sequence Pro Pro His Gly Ala (residues 284-288, Fig. 2) by the sequence His Pro Trp Ser in NF7. The apparent minor differences in the histidine-rich domains of the two isolates are of interest from the point of view of the evolution of differences in repeated sequences of malarial polypeptides. Single bp substitutions have changed Gln 33, 35, 80 and 83 to His, resulting in longer stretches of polyhistidine; furthermore, the sequence Pro His Glu Ala (residues 41-44, Fig. 2) has been tandemly repeated in NF7. Despite minor apparent differences, the basic primary structure of KP appears to be highly conserved in the isolates studied.

Acknowledgments This study was supported by UNDP/World Bank/WHO Special Programme for Research and Training in Tropical Diseases and NIH Grant AI19845. We thank Dr. Habib Karoui for screening the c D N A library and Lauren Naslund for technical assistance.

References 1 Kilejian, A. (1979) Characterization of a protein correlated with the production of knob-like protrusions on membranes of erythrocytes infected with Plasmodiumfalciparum. Proc. Natl. Acad. Sci. USA 76, 4650-4653. 2 Kilejian, A. (1980) Homology between a histidine-rich protein from Plasmodium lophurae and a protein associated with the knob-like protrusions on membranes of erythrocytesinfected with Plasmodiumfalciparum. J. Exp. Med. 151, 1534-1538. 3 Hadley, T.J.. Leech, J.H., Green, T.J., Daniel, W.A., Wahlgren, M., Miller, L.H. and Howard, R.J. (1983) A comparison of knobby (K+) and knobless (K-) parasites from two strains of Plasmodiumfalciparum. Mol. Biochem. Parasitol. 9,271-278.

16 4 Leech, J,tt., Barnwcll, J,W,, Aikawa, M., Miller, L,It. and Howard, R..I. (1984) Plasmodium.[h[ciparum malaria: association of knobs on the surface of infected erythrocytcs with a histidinc-rich protein and the erythrocyte skeleton. J. Cell Biol. 98, 125(~-1264. 5 Vernot-Hemandez, J.-P. and Heidrich, H.-G. (1984) Timecourse of synthesis, transport and incorporation of a protcin identified in puritied membranes of host erythrocytes infected with a knob-forming strain of Plasmodium falciparum. Mol. Biochem. Parasitol. 12, 337-350. 6 Gritzmacher, C.A. and Reese, R.T. (1984) Reversal of knob formation on Plasmodium falciparum-infected erylhrocytes. Science 226, 65-67. 7 Culvenor, J.G., Langford, C,J., Crewthcr, P.E., Saint, R.B., Coppel, R.L., Kemp, D.J.. Anders, R.F. and Brown, G,V. (1987) Plasmodium jalciparum: Identification and localization of a knob protein antigen expressed by a eDNA clone. Exp. Parasitol, 63, 58-67. 8 Kilejian, A., Sharma, Y D . , Karoui, H. and Naslund, k. (1986) Histidine-rich domain of the knob protein of the human malaria parasite Plasmodium falciparum. Proc. Natl. Acad. Sci. USA 83, 7938-7941. 9 Southern, E. (19751 Detection of specitic sequences among DNA fragments scparated by gel clectrophorcsis. J. Mol. Biol. 98, 503-517. l/) Maniatis, T., Fritsch, E.F. and Sambrook, J. (19821 Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, New York. 11 Vieira, J. and Messing. J. (19821 The puC plasmids, an M13mp7-dcrived system for insertion mutagencsis and sequencing with synthetic universal primers. Gene 19, 259-2('~8. 12 Maxam, A.M. and Gilbert, W. (1980) Sequencing end-labelled DNA with base-specific chemical cleavages. Methods Enzymo[. 65, 499-560. 13 Sanger, F., Nicklen, S. and Coulson, A.R. (1977) DNA sequencing with chain-terminating inhibitors. Proc, Natl. Acad. Sci, USA 74, 5463-5467. 14 Lewin, B. (1980) Gene Expression, John Wiley and Sons, Inc., New York. 15 Kilc}ian, A. (19841 The biosynthesis of the knob protein and a 650t10 dalton histidine-rich polypeptide of Plasmodiumfidciparum. Mol. Biochem. Parasitol. 12, 185-194. 16 Chothia, C. (198,1) Principles that determine the structure of proteins. Annu. Rcv. Biochcm. 53,537-572. 17 Ravetch, J.V., Fcdcr, R.. Pavlovcc, A. and Blobel, G. (1984) Primary structure and genomic organization of the histidine-rich protein of the malaria parasite Plasmodium lophurae. Nature 312, 61 (-,-620. 18 Irving, D.O., Cross, G.A.M., Feder, R. and Wallach, M. (19861 Structnrc and organization of the histidine-rich protein genc of Plasmodium lophurae. Mol. Biochem. Parasitol. 18, 223-234.

19 Stahl, It.D,, Kemp, D.J., Crcwthcr, P.E., Scanlon. D.B.. Woodrow, G., Brown, G.V., Bianco. A.E., Anders, R.F. and Coppel, R.L. (1985) Sequence of a eDNA encoding a small polymorphic histidine- and alanine-rich protein from Plasmodium falcq)arum. Nucleic Acids Res. 13, 7837-7846. 2tl Wellcms, T.E. and Howard, R.J. (1986) Homologous genes encode two distinct histidinc-rich proteins in a cloned isolate of Plasmodium falciparum. Proc. Natl. Acad. Sci. USA 83, 6065-6(/69. 21 Favalaro, J.M., Coppel, R.L., Corcoran, L.M., Footc, S.J., Brown, G.V., Anders, R.F. and Kemp, D.J. (19861 Structure of the RESA gene of Plasmodium fah'iparurn. Nucleic Acids Res. 14, 8265-8277. 22 Howard, R.J., Uni, S., Aikawa, M., Aley, S.B., Leech, J.H., Lew, A.M., Wellens, T,E., Rener, J. and Taylor, D.W. (I986) Secretion of a malarial histidine-rich protein (Pf HRP 1I) from Plasmodium ,talciparum-infected erythrocytcs. J. Cell Biol,. 103, 1269-1277. 23 Kilejian, A. (19741 A unique histidinc-rieh polypeptide from the malaria parasite, Plasmodium lophurae. J. Biol. Chem. 249, 4650-4655. 24 Kilejian, A., Sharma. Y.D. and Yang, Y.F. (1987) The knob protein gene of Plasmodiurn talciparurn. In: HostParasite Cellular and Molecular Interactions in Protozoal Infections, NATO ASI Series (Chang, K.P. and Snary, D., cds.), pp. 307 314. Springer-Verlag, Heidelberg. 25 Koidc, T., Foster, D.. Yoshitake, S. and Davie. E.W. (19861 Amino acid sequence of human histidine-rich glycoprotein derived from the nucleotide sequence of its eDNA. Biochemistry 25, 2220--2225. 26 Kilciian, A. (19831 Immunological cross-reactivity of the histidinc-rich protein of Plasmodium Iophurae and the knob protein of Plasmodium falciparum. J. Parasitol. 60, 257 261. 27 Kilejian, A. (1978) Histidine-rieh protein as a model malaria vaccine. Science 2(11, 922-924. 28 Kyte, J. and Doolittle, R.F. (1982) A simple method for displaying the hydropathic character of a protein. J. Mol. Biol. 157, 1115-132. 29 Ardeshir, F., Flint, J., Matsumoto. Y., Aikawa, M., Rccsc, R.T. and Stanley. H. (1987) eDNA sequence encoding a Plasmodium falciparum protein associated with knobs and localization of the protein to electron dense regions in membranes of infected erythrocytes. EMBO J. 6, 1421-1427. 311 Triglia, T., Stahl, H.D., Crewther, P.E., Scanlon, D., Brown, G.V., Anders. R.F. and Kemp, D.J. (1987) The complete sequence of the gene for the knob-associated histidinc-rich protein from Plasmodium falciparum. EMBO J. 6, 1413-1419.