A MAP kinase homologue from the human malaria parasite, Plasmodium falciparum

A MAP kinase homologue from the human malaria parasite, Plasmodium falciparum

GENE AN ~NTERNATIONAL ,JOURNAL ON GENES AND GENOME5 ELSEVIER Gene 177 (1996) 1-6 A MAP kinase homologue from the human malaria parasite, Plasmodiu...

682KB Sizes 0 Downloads 60 Views

GENE AN ~NTERNATIONAL ,JOURNAL ON GENES AND GENOME5

ELSEVIER

Gene 177 (1996) 1-6

A MAP kinase homologue from the human malaria parasite,

Plasmodiumfalciparum Caroline M. Doerig a, Daniel Parzy b, Gordon Langsley c, Paul Horrocks a, Richard Carter e, Christian D. Doerig a., a Unitd I N S E R M 399, Facultd de M~decine de Marseille, 27 Bd. Jean Moulin, 13385 Marseille Cddex 5, France b Institut de Mddecine Tropicale du Service de Santk des Armkes, Le Pharo, 13998 Marseille, France c Institut Pasteur, Experimental Parasitology, 25-28 Rue du Dr. Roux, 75724 Paris, France d Institute of Cell and Molecular Biology, University of Edinburgh, Mayfield Road, Edinburgh, EH9 3JR, UK e Institute of Cell, Animal and Population Biology, University of Edinburgh, West Mains Road, Edinburgh, EH9 3JT, UK

Received 12 February 1996; accepted 12 March 1996

Abstract Pfmap-1, a gene encoding a novel protein kinase, has been identified in the human malaria parasite Plasmodium falciparum, using the polymerase chain reaction with degenerate oligodeoxyribonucleotides designed to hybridise to conserved regions of cdc2-related kinases. Computer comparison with other protein kinases strongly suggests that the protein encoded by this gene is closely related to mitogen-activated protein (MAP) kinases, which play important roles in eukaryotic adaptative response and signal transduction. In addition to the conserved MAP kinase catalytic domain, Pfmap-1 contains a highly charged C-terminal extension that includes two sets of repeated amino acid motifs. Pfmap-1 is located on chromosome 14 of P. falciparum, and its mRNA has a size of 3.7 kb. Keywords: Mitogen-activated protein kinase; Parasitic protozoans; A, adenosine

1. Introduction

The emergence of drug-resistance in the malaria parasite Plasmodiumfalciparum represents a serious threat to tropical public health. The development of novel chemotherapeutic agents would be facilitated by a greater knowledge of the molecular mechanisms regulating parasite growth and differentiation. This prompted us to initiate a study aimed at identifying and analysing regulatory networks operating in P. falciparum. In this context we became interested in protein kinases, especially those related to the cdc2 family. Protein kinases are essential elements of the biochemical pathways regulating cell growth, development and differentiation. The cdc2 gene of Schizosaccharomyces * Corresponding author. Tel. + 33 91 834523;Fax + 33 91 796063. Abbreviations: aa, amino acid(s); bp, base pair(s); kb, kilobase(s) or 1000bp; nt, nucleotides;MAP kinase, mitogen-activatedprotein kinase; ORF, open reading frame; PCR, polymerasechain reaction;Pfmap-1, P. falciparum mitogen-activated protein kinase 1; Pfcrk-1, P. falciparum cdc2-relatedkinase 1; PfPk5, P.falciparum proteinkinase 5; T, thymidine. 0378-1119/96/$15.00© 1996 ElsevierScienceB.V. All rights reserved PH S0378-1119(96)00281-8

pombe (like its homologue in Saccharomyces cerevisiae,

CDC28) encodes a 34-kDa protein kinase that controls both entry into mitosis and G1/S transition (Beach et al., 1982, reviewed in Forsburg and Nurse, 1991). The activity of this kinase is modulated by its own state of phosphorylation and association with positive (cyclins) or negative (ckis) regulatory proteins (reviewed in Peter and Herskowitz, 1994). Homologues of the cdc2 gene have been identified in numerous eukaryotes (e.g. Lehner and O'Farrell, 1990; Feiler and Jacobs, 1990; RossMcDonald et al., 1994). Mitogen-activated protein (MAP) kinases, also called ERK (extracellularly-regulated kinases), form a family of serine/threonine kinases with homology to the cdc2 family. They function at one of the last kinase steps in signal transduction pathways triggered by a wide range of stimuli such as hormone treatment or high osmolarity (reviewed in Neiman, 1993). Their activity is regulated by their own state of phosphorylation (Payne et al,, 1991; Posada and Cooper, 1992; Seger et al., 1991). M A P kinase substrates include cell cycle control elements and transcription factors responsible for expression of genes involved in adaptative

2

CM. Doerig et al./Gene 177 (1996) 1 6

response. For example, mating in Saccharomyces cerevisiae is initiated by a pheromone/receptor interaction that transmits a signal via G proteins to the Stell kinase (a M A P K K K , or MAP kinase kinase kinase). Stell then phosphorylates the MAP kinase kinase Ste7, which in turn phosphorylates the MAP kinases Fus3 and KSS1. Once activated, these MAP kinases phosphorylate substrates involved in mating, such as a CDC28 inhibitor (which results in cell cycle arrest) and the Stel2 transcription factor, which activates the transcription of genes involved in differentiation (reviewed in Bardwell et al., 1994; Errede and Levin, 1993; Oehlen and Cross, 1994). Here we describe a P. falciparum gene, Pfmap-1, that was found during a search for cdc2-related kinases in this organism. Pfmap-I presents all the features required to classify it as a member of the MAP kinase gene family. The putative protein encoded by Pfmap-1 shows a relatively high level of homology with Spkl, a S. pombe homologue of the KSS1/Fus3 kinases involved in S. cerevisiae mating.

2. Experimental and discussion 2.1. Isolation of a cDNA clone encoding a novel protein kinase

clones were isolated from a gametocyte cDNA library, using the C2 insert as a probe. One of these contained an overlap of approximately 800 bp with the 5' end of the C2 insert, plus additional upstream sequences. This allowed the identification of two in-frame ATGs that may function as start codons (positions 100-102 and 127-129); because the former is embedded in A and T homopolymeric stretches which are characteristic of P. falciparum untranslated sequences, and because the latter encodes a methionine followed by proline and lysine residues (the three residues found at the amino-terminus of a yeast MAP kinase, see Fig. 2), we favor the hypothesis that the most downstream of the two ATGs is the functional start codon, although this remains to be determined experimentally. An in-flame stop codon, surrounded by sequences that are rich in A and T homopolymers, is present 30 bp upstream of the first in-flame ATG (nt 67-69), confirming that translation is likely to initiate at one of these two ATGs. As none of the additional clones extended further downstream than the C2 insert, we employed the inverse PCR method (Triglia et al., 1988), which enabled us to identify a stop codon at nt 2605-2607 (see legend to Fig. 1). The polypeptide potentially encoded by the O R F is 826 amino acids long, with a predicted molecular mass of 96 460 Da.

2.2. Pfmap-I is related to the MAP kinase gene family Our purpose was to identify P.falciparum genes encoding cdc2-related protein kinases. Using degenerate primers 1 and 2 (see Fig. 1) designed to hybridise to conserved regions of the p34 cat2 family, we obtained a PCR product of the expected size (approximately 500 bp) from a P. falciparum clone 3D7 (Walliker et al., 1987) gametocyte cDNA library. Southern hybridisation studies using this product as a probe suggested that the 500-bp band was in fact a mixture of several different species of molecules (data not shown). Therefore the PCR products were cloned into the pCRII vector (Invitrogen), and individual clones sequenced. This allowed the identification of several different inserts amplified from different protein kinase genes, including the previously described PfPK5, a P. falciparum putative cdc2 homologue (RossMcDonald et al., 1994), and Pfcrk-1, a novel P. falciparum gene encoding a developmentally regulated cdc2-related protein kinase (Doerig et al., 1995). One clone (hereafter called C1) carried a 516-bp insert containing an uninterrupted open reading frame (ORF) not previously characterised from P.falciparum, with homology to a variety of cdc2- and MAP-related kinases. The C1 insert was used as a probe to screen a P. falciparum 3D7 gametocyte cDNA library constructed in the pJFEDAF vector (Elliot et al., 1990). A cDNA clone with a 1.9-kb insert (clone C2) was isolated and shown to contain an O R F spanning its entire length (see legend to Fig. 1). To define the start and stop codons of the putative protein encoded by the C 2 0 R F , additional

A computer analysis using the M O T I F S algorithm (Devereux et al., 1984) confirmed that the O R F encodes a protein kinase, as both protein kinase signatures (Bairoch, 1988) are present: the sequence V G K G A Y G V V (aa 29-37), which corresponds to the ATP-binding domain of protein kinases, and the sequence L L H R D I K P S N I L V (aa 142-154), located in the central part of the catalytic domain of serine/threonine protein kinases. Furthermore, the O R F contains the 15 invariant amino acids characteristic of serine/threonine protein kinases (Hanks et al., 1988; Hanks and Quinn, 1991) (Fig. 2). TFASTA analysis (Devereux et al., 1984) indicated that the polypeptide potentially encoded by the ORF is a member of the MAP kinase family: most of the 60 highest score entries were MAP kinases. The two residues whose phosphorylation is required for activation of all MAP kinases (Payne et al., 1991; Posada and Cooper, 1992; Seger et al., 1991) are found in the O R F (Fig. 2); only one of these residues is found in cdc2 family members. The catalytic domain of the putative kinase encoded by the ORF is 47% identical to the Spkl (Toda et al., 1990) catalytic domain and 41-44% identical to that of other MAP kinases. Homology to members of the cdc2 family is lower, with 38%, 35% and 37% identity to PfPK5, Pfcrk-1 and S. pombe cdc2 respectively. Moreover, the ORF does not encode a PSTAIRE-like motif characteristic of members of the cdc2 family. On the basis of these

CM. Doerig et al./Gene 177 (1996) 1-6 TT AT CAATTTAAAAC M ~

C CATTAAAAAAAAAAC

P K E D C K CTAAAGAAGATTGCAAGAC

A

T

C TTTTTTTT

TTTTTTTTT

TTTAAATTATTTTAATT~%ATTT

3

T T T G T A T G T T T T A A T C C T T T T C C A C AChieST C

E K S I D S I D E N V L K K Y D r L K K V G K AGAAAAGAGTATAGAC AGTATTGA TGAGAATGTATTAAAAAAATATGATATATTAAAAAAAGTTGGTAAAGGAGC

~

G

A

N

I V A ATTGTAGCT

N I I K L M D V I AA TA TAATAAAATTAATGGATGTTATAAAA K K Y I I Y AAGAAA TATATAATATAT

K

V K K I F G A GTGAAGAAAATATTTGGAGCT

F Q N C T D A Q R TTTC AAAATTGTACAGATGCTCAAAGAAC

A K N D N D I Y GCAAAGAATGATAATGATATATATC

Q L L R A L K Y CAATTATTAAGAGCTTTAAAATATATC

I

L I F D F M E TTATTTTTGATTTCATGGAAACC

H S G G L L H R C ATTCAGGAGGTTTATTACATAGAGA

D

T

T

F R E I GTTTAGAGAAATAAT

D L H E G ATTT ACATGAAG

I K TATTAAAC

P S N CATCAAAT

I

123

Y G V TTATGGTGTTG

.......................

R C K K N K AG ATGTAAGAAAAATAAGAAC

~ V F K TATTTAAA

> primer

G GGT

40 246

D

81 369

H CAT

122 492

V A TAGCA

163 615

1

F L Y E L N G H ATTTTTATATGAATTAAATGGACATGAT

V I K TAATAAAAG

A D L L E E C AGATTTATTAGAAGAGATC

I L V N S E C H I K ATATTAG TAAATTCAGAATGTCATATAAAAG

I

D F G L A R S I S T H V N E N K V P I L T D Y V A T R W Y R A P E I L L G S T H GACTTTGGATTAGCTAGAAGTATTTCAACTCATGTTAATGAAAATAAAGTACCTATACTAACAGATTATGTAGCAACTCGATGGTATAGAGCTCC TGAAATTTTGTTGGGTAGTACGCATTAT ................. > primer 3 primer 2 < ................... T E D V D M AC AGAA GATGTTGATATG

W S L G C TGGTCCTTAGGTTGTATC

P N K K D I E D I R C CGAATAAAAAGGATATAGAAGATATTAGATCAC D

L

L

E

K

L

L

Q

F

N

I

M G E L L C G K ATGGGAGAATTATTATGTGGAAAACC

P

L F T G N GTTATTTAC AGGAAATT

S T M N Q L E K CTACAATGAATCAATTAGAAAAAAT

K

E

E

I Q AATACAAG

204 738

V I G K TAATAGGAAAA

245 861 286 984

S

P F A E K I I S S F V D L K K K N L K D I C Y K A S CTTTTGCCGAAAAAATAATTTCGTCTTTTGTAGATTTAAAAAAAAAAAATTTAAAAGATATATGTTATAAGGCTTCTAACGAATCCTTA

N

E

S

L

P

S

C

R

H

I

K

R

I

S

A

E

N

A

L

K

H

K

Y

V

E

E

F

H

S

I

I

D

GATTTATTAGAAAAGTTGTTACAGTTTAATCCGTCTAAAAGGATAAGTGCAGAAAATGCCCTTAAACATAAATATGTTGAAGAATTTCATTCAATAATTGAC I T I ATTACAATTC

I

Y

E

P

T

1107

P I N D N T K Y R V N F Y R N V V Y F V I M R R N K F H S N V L CCATAAATGATAATACGAAATATAGAGTTAATTTTTATAGGAATGTTGTTTATTTTGTAATTATGCGTAGGAATAAGTTTCACTCTAATGTATTGAACCAAGGGGAAAGTAAA

N

Q

G

E

S

K

K

I

I

F

S

Q

G

K

D

AAAGAAGAAAAGAAGGATC

R

Y

Y

R

R

D

K

D

K

K

K

GATATTATAGAAGGGATAAGGATAAAAAAAAA

C

I

C

E

N

N

K

V

TGTATATGTGAAAATAATAAAGTAC

H

M

E

D

ATATGGAAGAT

327

GAACCTACATGTAGACACATT

K

E

Y

K

N

K

AAAGAATATAAGAATAA

368 1230 409

GATC ATAT TTTC TCAA GGG primer 4 < ........

1353

D K P E D I K E D H K K C K I C D E I H V D T S K Q I D S I N L E H V GATAAACCTGAAGACATAAAGGAAGACCACAAAAAATGTAAAATTTGTGATGAAATTCATGTGGATACATCAAAACAAATTGATTCTATCAACCTAGAGCATGTAGTTCTTAATAACAATGAT

V

L

N

N

N

D

450 1476

V S Y GTATCTTATG

E

R

R

V

F

Y

491 1599

E

S

N

E

M

K

532 1722

G

S

N

T

M

S

V D K K K K T K Y E P V Y K E R G K M K K D K M G N V Y C K D R TTGATAAGAAAAAAAAAACAAAATATGAACCAGTATACAAGGAAAGAGGAAAAATGAAAAAAGATAAAATGGGAAATGTTTACTGTAAGGATAGAGAAAGGAGAGTTTTTTAC

E A R K K E S I T N F T T A T T I S K S D D T E V E M S Q M E I N E I GAGGCTAGAAAAAAGGAGAGCATTACAAATTTTACGACAGCAACGACGATTAGTAAAAGTGATGATACAGAAGTGGAGATGAGTCAGATGGAAATAAATGAAATTGAGTCAAATGAAATGAAA G

K

I

K

E

O

I

K

E

O

GG AAAAATAAAAGAACAAATAAAAGAAC

S

T

I

S

K

T

E

I

K

E

0

;

K

E

Q

I

K

E

AAATAAAAGAACAAATAAAAGAACAAATAAAAGAA

P

N

S

R

N

Y

F

I

N

K

K

R

0

I

K

K

T

CAAA TTAAGAAGAC Mboll V

E

S

F

Y

T

Q

N

N

I

S

K

I

S

I

TC AAAA TAATA TTAG TAA GATT TCCA TAGGGAGT

K

E

R

K

N

N

D

I

L

F

H

AATACAATGAG

A

N

N

K

573 T

1845

K

614

AGCACTATCAGTAAAACTGAACCTAATAGTAGAAACTATTTTATTAATAAGAAAAGAGTTGAATCTTTTTATACTAAAGAAAGAAAAAATAATGACATATTGTTTCATGCTAATAATAAGAAG < ................. V

I

F

F

K

D

K

N

inv K

I

primer K

N

1 H

.................. S

S

E

K

K

K

I

K

Y

K

V

F

Y

G

F

K

K

S

Y

E

N

E

Q

> N

inv V

1968

primer P

H

2 Y

GTTATATTTTTTAAAGACAAAAATAAGATTAAAAATCATAGCTCAGAAAAAAAAAAAATTAAATATAAAGTATTTTATGGTTTTAAAAAATCATATGAAAATGAACAAAATGTACCT•ATTAT

655 2091

A L

E

T

K

C

S

N

T

P

L

G

K

N

Y

Y

Y

K

K

Y

V

D

G

~

~

K

K

Y

V

D

G

G

N

K

K

Y

V

D

G

~

N

_K

TTAGAAACCAAATGTAGTAATACCCCGTTAGGGAAGAATTATTATTATAAAAAATATGTGGATGGAAGTAACAAAAAATATGTGGATGGAGGTAACAAAAAATATGTGGATGGAGGTAACAAA ..................... K

Y

V

D

G

S

N

> primer K

K

Y

5 MboII V

D

G

S

N

R

K

Y

V

D

G

G

N

K

K

Y

V

D

G

S

S

K

K

Y

V

D

G

G

~

K

K

AAATATGTGGATGGAAGTAACAAAAAATATGTGGATGGAAGTAACAGAAAATATGTGGATGGAGGTAACAAAAAATATGTGGACGGAAGTAGTAAAAAATATGTGGATGGAG•TAACAAAAAA Y

V

D

G

S

S

K

TATGTGGACGGAAGTAGT N

H

N

Y

N

K

Y

V

D

E

G

N

K

K

Y

V

D

G

G

N

K

K

Y

V

D

G

G

N

K

K

Y

V

F

N

N

N

Y

H

Y

Y

S

R

K

G

P

L

K

S

V

N

K

K

K

K

N

C

P

Q

K

T

737 2337

D

G

L

N

K

K

Y

AAAATATGTGGATGAAGGTAACAAAAAATATGTGGATGGAGGTAACAAAAAATATGTGGATGGAGGTAACAAAAAATATGTGGATGGACTTAATAAAAAATAT H

696 2214

W

K

778 2460

N

T

P

K

F

819

AACCATAATTATAACCATTTTAATAATAATTACCATTACTATAGTCGTAAAGGCCCTTTGAAAAGCGTAAAC~CTGTCCCCAAAAAACTTGGAAAAATACGcCAAAATTC

2583

T F T ACTTTTACC

826 2655

P C S CCT T

K G

T

A

G

T

A

A

A

~

Fig. 1. Sequence of Pfmap-1. The deduced amino acid sequence is indicated above the nucleotide sequence. The location of PCR primers is indicated by arrows. Start and stop codons discussed in Section 2.1 are in underlined bold characters, and the amino acid repeats discussed in Section 2.2 are underlined. The boundaries of the C2 insert are indicated by A symbols. Methods. The degenerate primers used to amplify cdc2-related protein kinase gene fragments, and hence marking the boundaries of clone C1, were: primer 1, GAAAAAAT(A/T)GG(A/T)GAAGG(A/T)AC(A/T)TA and primer 2, ACTTC(A/T)GG(A/T)GCT CT(A/G)TACCA. PCR conditions with these primers were as previously described (Doerig et al., 1995). The stretch of As indicated in bold characters between positions 933 and 944 contained 13 residues in the C2 cDNA insert; this would result in a frameshift. This region was amplified with primers 3 and 4 (see the figure) from P. falciparum 3D7 genomic DNA and sequenced. There were 12 A residues in the polyadenosine stretch amplified from genomic DNA, suggesting that a frameshift mutation had occurred during the cDNA cloning (other cDNA clones did not contain this frameshift mutation). Sequences upstream of the C2 insert were determined from an overlapping cDNA clone, and sequences downstream of the C2 insert were determined by inverse PCR (Triglia et al., 1988) with primers invl and inv2, using as a template genomic 3D7 DNA that had been cleaved with MboI1 and religated. Sequences beyond the downstream MbolI site were obtained by direct sequencing of PCR products from a cDNA library using primer 5 and a primer in the vector sequence. The nucleotide sequence is available in the EMBL, GenBank and DDBJ databases under the accession No. X82646.

4

C.M. Doeriget al./Gene177 (1996) 1-6 MPKEDQ

KTEKSIDSID

ENV-L-KKYD

Io O ILKK~QKGAy

NKNIVAVKKI

FGAFQNCTDA

A/3GNSN..SV

A.SR.PHTH.

L.FE.PEE.E

MINLI.Q

......

CAALH.P

SGLK

-HP.NHPVFC

L..L...KL.

RHFR-.E...

98

RIirylq. S. - . F Q . . . . . . . .

LKSLL.E

......

CSATH.P

TGE...I.

-EP.DKPLF.

L..L...KI.

KHFK-.E...

72

MARTITFD.P

......

LVDLI.E

....

SGIK..I.

-QP.SKKLFV

T..I...KL.

RYFHE.E...

73

..VR..I.

PfmaD-I SPKI

MASATSTPTI

FUS3

...---

KSSI

Mouse E R K 2 Dros.

ERK-A

MEEFNSSGSV

Arabid.ATMP

MAMLV

SQ.K

o G~g%'FKGRCKK

T.CSAIH.P

MAAAAA

AGP-EMVRGQ

V-FDVGPR.T

N.SYI.E

....

M.CSAYDNL

VNGTGST.VP

QSNAEVIRGQ

I-FEVGPR.I

K.AYI.E

....

M.VSADDTL

EPPNGIKQQG

.HY~.M-WQT

L-FEIDT..V

PI.PI.R

.

.

.

.

.

.

CSSINRE

o

o

......

KLMDVIKA--

-KNDNDIYLI

SIL.ILPPPS

YQELE.V.

FUS3

TIFNIQRPDS

KSSI

Mouse E R K 2 Dros.

SPFEHQTY-C

...L...KI.

LRFR-.E...

82

SPFEHQTY-C

...L...TI.

TRFK-.E...

97

TNER..I.

HNV.E.RV..

L..L...KL.

RHVR-.E.V.

92

IYQLLRALKY

IHSGGLLI~RD

IF,P ~ " ~ Z * - V N S

ECH IKVADFG

T..I

M..A.W...

L .... L.L.A

N.DL

FE.F.EV.I.

QEL.Q

..STQ .....

M.SDD.IQ.F

...T...V.V

L.GSNVI...

L .... L.I..

N.DL.

SIL.KVRPVS

IDKL.AV..V

EEL ..... QK

..NNQNSGFS

T.SDD.VQ.F

T..I

...AQVI

GIN.I.R.PT

IEQMK.V.IV

Q.L ..... YK

LL.TQ

.....

H.SND.IC.F

L. .I. . G . . . . . .

ANV ....

.....

R.SND.IC.F

L..I..G

A/qV . . . .

L .... L.L.K

-. A D P E -

190

....

S.SDD.C..F

L F . . . . G. . . L. . A N I . . . .

L. . G . L .

T-. -QG.

186

.... R

IDQMR.V.IV

QCL ..... YK

LL.TQ

YEL.D

I..SSQ

Pfmap-I

ENKVPI

x x -LTDYVATRW

oo YRAPEILLGS

SPKI

-GGN.GF---

-M.E

FUS3

D.SE.TGQQS

GM.E

KSSI

SRETLVG---

FM.E

-HDHTGF--HDHTGF---

Arabid.ATMP

....

.-Q---F---

.... Q

..... S

......

........

LARS I STHVN

176

..........

TTAQ--

191

.C . . . . . .

I .D E S A A

167

L.L..

N.DL..C

......

CLASSSD

173

L .... L.L.T

T.DL.IC

......

VADPD--

175

T.DL. IC ...... . .A N . D L . I C ......

SLGC IMGELL

CGKPLFTGNS

TMNQLEKIIQ

VIGKPNKKDI

ED- IRSPFAE

KI ISSFVDLK

270

R E . S K A I . L.

.T.

.L A . M .

SAR...P.KD

YHS.ITL.LN

IL.T.TMD.F

S R - .K . A R . R

.Y.K.LPFTP

285

VM.T.

AK. SRAM.V.

.C.

.LA..F

LRR.I.P.RD

YRH..LL.FG

I..T.HSDND

LRC.E..R.R

EY.K.LPMYp

267

............

M.TF

QE. .TAM.I.

.C.

.L A . M V

269

M.N.

KG..KSI.I.

.V.

.LA.M.

.L.T.SFE.F IL.S.SQE.L

E Y .A N L P M ~ p

............

YHH..WL.LE YLD..NH.LG

N Q - .K . K R . K

-..E

S ..... P.RD SNR.I.P.KH

NC-. INLK. R

NYLL.

-..E

............

M.N.

KG..KSI.I.

.V.

.L A . M .

SNR.I.P.KH

YLD..NH.LG

.L.S.SRD.L

.C- . INEK.R

NYLE

L..CC

DN.GTSI.V.

.V.

.FA.I.

GR..I.P.TE

CL...KL..N

.V.SQQES..

RF-.DN.K.R

............ .

.

-M.E..V

.

.

.

.

.

.

.

.

.

........

M.SF

o

Pfmap-i

KKNLKDICYK .VSF.A.FPQ

--..PDAI

FUS3

AAP.EKMFPR

--VNPKGI..

.QRM.V.D.A

...T.KE..E

KSSI

PLP~rETVWS.

TDLNPDMI..

.D.M ..... D

Mouse E R K 2

.VPWNRLFPN

--.DSKA

....

Dros.

NVPWAKLFPN

--.DALA

....

GTH.SNL-.P

Q-.NpLAI..

-ASNESLDL

LEKLLQFNPS .......

T...D

D.M.T...H

KRISAENALK

269

.L P F K P

284

R F . K. L P Y S R

277

%

i00 47

TKDLKDLIWN

348

43

KIMRPEEEEE

VP. EML-. DM

347

42

DDLP-KEKLK

EL. I FEETAR

351

44

RVNFYPdqVVY

FVIMRRNKFH

-P.-.LVDL.

C-.KEDLEIP

-. L K A L I - .

.P.LQTY.DP

N...EGEP.-

-P.SFFEFDH

.... .KEALT

..... AE..R

.P.LAHYYDP

S...EYPPLN

LDDEFWKLDN

...EV.Q..A

.P.L.QYYDP

S...IAE

-A.FKFDMEL

P..AAY.DA

--

identity

358

TIPINDNTKY

....

HKYVEEFHSI

366

IDEPTCRHII S .... ASPM-

...T..E

LPHKN

{

SPKI

ERK-A

..... A

o THYTEDVDMW

o

Arabid.ATMP

o

LLEEIHKKYI P.SDD.CQ.F

RTSFK.V..V

ERK-A

o

.....

..RSQ .....

DIR.ILRVDS

Dros.

o

VIKAD

A.K..MLPAN

ERK2

o

FDFMETDLHE

ATMP

Mouse

84

QEL ..... YR

IV

ERK-A

Arabid.

YELNGHDNII

TNQR..I.

o

Pfmap-i SPKI

QRTFREIIFL

G.M.T...H

...PV.E..A

.P.L.QYYDP

G...VAE

....

V.FRI.MEN

DDIS-.DALK

SL. FEETLKF

.QRM.V.E.T

.... VTD..L

.P.MAGLFEP

GTN.PAH

....

V..SLDIDE

NMEEPVIREM

WNE. LYY--

R

.

370

41

360

44

Fig. 2. Amino acid comparison of the catalytic domain of Pfmap-1 with that of other MAP kinases [S, pombe Spkl (Toda et al., 1991); S. cerevisiae FUS3 (Elion et al., 1990) and KSS1 (Courchesne et al., 1989); Mus musculusERK2 (Her et al., 1991); Drosophila melanogaster ERK-A (Briggs and Zipuski, 1992); Arabidopsis thaliana ATMPK (Mizoguchi et al,, 1993)] (between 4 and 11 residues from the carboxy end of these kinases do not appear on the figure). Dots (.) indicate identity, dashes (-) indicate gaps introduced to optimise alignment. The figure is derived from an alignment performed using the GeneWorks software. The 15 invariant residues of serine/threonine kinases are labelled with an (©). The two protein kinase signatures are indicated in bold characters. The (x) symbols locate the two residues (Thr-184 and Tyr-186) that correspond to regulatory phosphorylation sites in members of the MAP kinase family. The level of homology is expressed as the percentage of identical amino acids, within the catalytic domain, in Pfmap-1 and each MAP kinase mentioned. The boundaries used to define the catalytic domain for this analysis are indicated by vertical dashes (I). observations, we propose that this O R F encodes a P.

(alciparum M A P kinase h o m o l o g u e , hereafter called Pfmap-1. In addition to the kinase catalytic domain, Pfmap-1 contains a C-terminal extension of 503 amino acids that is rich in basic and acidic residues. A prominent feature of this extension is the presence of 5 repeats of the tetramer I K E Q (aa 535-554) and of 13 imperfect repeats of the octamer K K Y V D ( G / S / E ) ( G / S / L ) N (aa 672-775). Such repeats are not u n c o m m o n in P. falciparum proteins (Anders et al., 1988). Pfcrk-1 contains a similar highly charged region with repeated motifs at its N-terminus (Doerig et al., 1995). The function of these elements remains unknown.

c h r o m o s o m e s were separated by pulse-field gradient electrophoresis, transferred to a nylon filter and hybridised to a Pfmap-1 probe. This showed that Pfmap-1 is located on c h r o m o s o m e No. 14 (Fig. 3B). The location was confirmed by hybridisation of the filter to an aldolase probe; the P. falciparum aldolase gene is k n o w n to reside on c h r o m o s o m e 14 (Triglia et al., 1992). A Northern blot performed with total R N A from asexual parasites is presented in Fig. 3C. This confirmed that Pfmap-1 is transcribed during infection of the erythrocyte, and indicated that the Pfmap-1 m R N A has a size of approximately 3.7 kb. Although this is 1.3 kb longer than the ORF, it is line with results from other P. falciparum genes (Levitt et al., 1993).

2.3. Chromosomal mapping and transcript size of Pfmap-1 3. Conclusions

P.falciparum genomic D N A was digested with a panel of restriction enzymes and hybridised with a Pfmap-1 probe (Fig. 3A). The presence of positive hybridisation signals confirmed that Pfmap-I is a P. falciparum gene. In each case, a single band was observed, which is consistent with Pfmap-1 being represented in the P. falciparum genome as a single copy gene. P. falciparum

Pfmap-1, a gene encoding a putative M A P kinase homologue, has been identified in P.falciparum, suggesting that signal transduction networks similar to those found in other eukaryotes may operate in this organism. Pfmap-1 is located on c h r o m o s o m e 14, presumably as a single copy; the Pfmap-1 m R N A detected in asexual

C.M. Doeriget al./Gene 177 (1996) 1 6

5

A I

~,

4

.¢,

6

"7

R

9

10

II

12

I

2

3

!iill¸

No14~).4 6.6-4.3-ii ¸ i! ~iiii=!==

7.5 - - i i ~ ¸~

2.3--

2.0--

4.4

2.4

-, ~iiii~i~!i!i~! ~iill ~

Fig. 3. Hybridisation experiments. Hybridisation conditions were as described (Doerig et al., 1995). (A) Southern blot. Approximately 1/~g of genomic DNA from P. falciparum clone 3D7 was digested with restriction enzymes, transferred to a Hybond N + membrane (Amersham) and hybridised to a Pfmap-1 probe. The probe was a 32p-labelled PCR product amplified from clone C2 with primer 1 and 2 (see Fig. 1). Restriction enzymes were (lanes 1-12): AccI, BgllI, Sau3A, MspI, XhoI, DraI, HinclI,HindlII, AluI, Hinfl, XbaI and NheI. The sizes of comigrating molecular mass markers are indicated to the left in kb. (B) Chromosomal mapping. Chromosomes from P.falciparum K1 isolate (Thaitong and Beale, 1981) were separated by pulse field gradient gel electrophoresis as described previously (Kemp et al., 1985), depurinated and transferred to a Hybond N + membrane. The membrane was probed with the same probe as in A (lane 2), or with a P. falciparum aldolase probe (lane 3). Lane 1 is an ethidium bromide stain of the chromosome gel prior to transfer. (C) Northern blot. 10/zg of total RNA from asexual parasites (K1 P. falciparum isolate) were used for the analysis, which was performed as previously described (Doerig et al., 1995). The probe was a PCR product covering the Pfmap-1 ORF from nt 126 to 2023.

parasites is approximately 3.7 kb long. In addition to the catalytic domain, the gene encodes a large, highly charged C-terminal extension containing two sets of repeated motifs. The function of this gene in the P. falciparum life cycle remains to be determined. By analogy with the functions of MAP kinases in other organisms, Pfmap-1 may be involved in the control of growth and/or sexual differentiation. The relatively high level of homology of Pfmap-I with Spkl, a S. pombe MAP kinase that is activated during response to mating pheromone, lends support to this hypothesis.

Acknowledgement

We thank P. Alano, A. Craig and D. Kaslow for kindly providing cDNA libraries. We were able to use DNA blots prepared by D. Arnot and S. Cheesman. P a r t s o f the s e q u e n c e w e r e v e r i f i e d o n a u t o m a t e d s e q u e n c e r s w i t h t h e h e l p o f C. J o b l e t a n d M . O n a t e (Assistance Publique de Marseille, Programme H o s p i t a l i e r de R e c h e r c h e C l i n i q u e 1995) a n d o f A l a i n R i c o (I.M.T.S.S.A.). T h e Pfmap-1 g e n e was i n d e p e n d e n t l y i s o l a t e d by R. G r a e s e r a n d B. K a p p e s at the

B i o z e n t r u m , U n i v e r s i t y o f Basel, S w i t z e r l a n d , w h o s e w o r k is to be p u b l i s h e d elsewhere. W e t h a n k t h e s e c o l l e a g u e s for freely d i s c u s s i n g the d a t a b e f o r e p u b l i c a tion. T h i s w o r k w a s s u p p o r t e d by a n M R C g r a n t to R.C. a n d b y t h e I n s t i t u t N a t i o n a l de la Sant6 et de la R e c h e r c h e M 6 d i c a l e ( I N S E R M ) U n i t 399.

References

Anders, R.F., Coppel, R.L., Brown, G.V. and Kemp, D.J. (1988) Antigens with repeated amino acid sequences from the asexual blood stages of Plasrnodiumfalciparum. Prog. Allergy 41, 148-172. Bairoch, A. (1988) Sequence patterns in protein kinases. Nature 331, 22. Bardwell, L., Cook, J.G., Inouye, C.J. and Thorner, J. (1994) Signal propagation in the mating pheromone response pathway of the yeast Saccharomyces cerevisiae. Dev. Biol. 166, 363-379. Briggs, W.H. and Zipuski, S.C. (1992) Primary structure, expression and signal-dependant tyrosine phosphorylation of a Drosophila homolog of extracellular signal-regulated kinase. Proc. Natl. Acad. Sci. USA 89, 6295-6299. Courchesne, W.E., Kunisawa, R. and Thorner, J. (1989) A putative protein kinase overcomes pheromone-induced arrest of cell cycling in S. cerevisiae. Cell 58, 1107 1119. Devereux, J., Haeberli, P. and Smithies, O. (1984) A comprehensive set of sequence analysis programmes for the VAX. Nucleic Acids Res. 12, 387-395.

6

C.M. Doerig et al./Gene 177 (1996) 1 6

Doerig, C.D., Doerig, C.M., Horrocks, P. Coyle, J., Carlton, J., Sultan, A., Arnot, D. and Carter, R. (1995) Pfcrk-1, a developmentally regulated cdc2-related protein kinase of Plasrnodium faleiparum. Mol, Biochem. Parasitol. 70, 167-174. Elion, E.A., Grisafi, P.L. and Fink, G.R. (1990) FUS3 encodes a edc2 + / CDC28-related kinase required for the transition from mitosis into conjugation. Cell 60, 649-664. Elliot, J.F., Albrecht, G.R., Gilladoga, A., Handunnetti, S.M., Neequaye, J., Lallinger, G., Minjas, J.N. and Howard, R.J. (1990) Genes for Plasmodium falciparum surface antigens cloned by expression in COS cells. Proc. Natl. Acad. Sci. USA 87, 6363 6367. Errede, B. and Levin, D.E. (1993) A conserved kinase cascade for MAP kinase activation in yeast. Curr. Opin. Cell Biol. 5, 254 260. Forsburg, S.L. and Nurse, P. (1991) Cell cycle regulation in the yeasts Saccharomyces cerevisiae and Schizosaccharomyces pombe. Annu. Rev. Cell. Biol. 7, 227-256. Feiler, H.S. and Jacobs, T.W. (1990) Cell division in higher plants: a cdc2 gene, its 34-kDa product and histone H1 kinase activity in pea. Proc. Natl. Acad. Sci. USA 87, 5397-5401. Hanks, S.K., Quinn, A.M. and Hunter, T. (1988) The protein kinase family: conserved features and deduced phylogeny of the catalytic domains. Science 241, 42-51. Hanks, S.K. and Quinn, A.M. (1991) Protein kinase catalytic domain sequence database: identification of conserved features of primary structure and classification of family members. Methods Enzymol. 200, 38-62. Her, J.H., Wu, J., Rail, T.B., Sturgill, T.W. and Weber, M.J.. Sequence of pp42/MAP kinase, a serine-threonine kinase regulated by tyrosine phosphorylation. Nucleic Acids Res. 19, 3743. Kemp, D.J., Corcoran, L.M., Coppel, R.L., Stahl, H.D., Bianco, A.E., Brown, G.V. and Anders, R.F. (1985) Size variation in chromosomes from independent cultured isolates of Plasmodiumfalciparum. Nature 315, 347-350. Lehner, C.F. and O'Farrell, P.H. (1990) Drosophila cdc2 homologs: a functional homolog is coexpressed with a cognate variant. EMBO J. 9, 3573-3581. Levitt, A. (1993) RNA processing in malaria parasites. Parasitol. Today 9, 465 468. Mizoguchi, T., Hayashida, N., Yamaguchi-Schinokazi, K., Kamada, H. and Shinokuzi, K. (1993) ATMPKs: a gene family of plant MAP kinases in Arabidopsis thaliana. FEBS Lett. 336, 440-444.

Neiman, A.M. (1993) Conservation and reiteration of a kinase cascade. Trends Genet. 9, 390 394. Oehlen, B. and Cross, F.R. (1994) Signal transduction in the budding yeast Saceharomyces cerevisiae. Curr. Opin. Cell Biol. 6, 836 841. Payne, D.M, Rossomando, A.J., Martino, P., Erikson, A.K., Her, J.H., Shabanowitz, J., Hunt, D.F., Weber, M.J. and Sturgill, T.W. (1991) Identification of the regulatory phosphorylation sites in pp42/mitogen-activated protein kinase (MAP kinase). EMBO J. 10, 885-892. Peter, M. and Herskowitz, I. (1994) Joining the complex: cyclin-dependent kinase inhibitory proteins and the cell cycle. Cell 79, 181-184. Posada, J. and Cooper, J.A. (1992) Requirements for phosphorylation of MAP kinase during meiosis in Xenopus ooeytes. Science 255, 212-215. Ross-MacDonald, P.B., Graeser, R., Kappes, B., Franklin, R. and Williamson, D.H. (1994) Isolation and expression of a gene specifying a cdc2-1ike protein kinase from the human malaria parasite Plasmodiumfalciparum. Eur. J. Biochem. 220, 693-701. Seger, R., Ahn, N.G., Boulton, T.G., Payanatos, N., Radjiejewska, E., Ericsson, L., Bratlien, R.L., Cobb, M.H. and Krebs, E.G. (1991) Microtubule-associated protein 2 kinases, ERK1 and ERK2, undergo autophosphorylation on both tyrosine and threonine residues: implications for their mechanism of action. Proc. Natl. Acad. Sci. USA 88, 6142 6146. Thaitong, S. and Beale, J.H. (1981) Resistance of ten Thai isolates of Plasmodium falciparurn to chloroquine and pyrimethamine by in vitro test. Trans. R. Soc. Trop. Med. Hyg. 75, 271 273. Toda, T., Shimanuki, M. and Yanagida, M. (1991) Fission yeast genes that confer resistance to staurosporine encode an AP-l-like transcription factor and a protein kinase related to the mammalian ERK1/MAP2 and budding yeast FUS3 and KSS1 kinases. Genes Dev. 5, 60 73. Triglia, T., Peterson, M.G. and Kemp, D.J. (1988) A procedure for in vitro amplification of DNA segments that lie outside the boundaries of known sequences. Nucleic Acids Res. 16, 8186. Triglia, T., Wellems, T.E. and Kemp, D.J. (1992) Towards a high-resolution map of the Plasmodium falciparum genome. Parasitol. Today 8, 225-229. Walliker, D., Quakyi, I.A., Wellems, T.E., McCutchan, T.F., Szarfman, A., London, W.T., Corcoran, L.M., Burkot, T.R. and Carter, R. (1987) Genetic analysis of the hum~in malaria parasite Plasmodium falciparum. Science 236, 1661 1666. ,