I’*) (assuming
Cardiotoxin
Vii from N. mossambica
procedure? Let us remember that density modification was essential to solve the crystal structure, the m.i.r. or s.i.r. maps being uninterpretable. It is surprising that density-modification improved the phases by less than 10” on the average. This confirms the well-known fact that the average phase error should be less than about 60” for the map to be interpretable. Note, however, that this improvement of a few degrees of the phases corresponds to almost doubling the correlation coefficient with the correct density. The refined co-ordinates are deposited with the Brookhaven Protein Data Bank.
3. Results The final R factor for all reflections with Fobs> 3a O-197 in the resolution range 2.5 to 5 A (4002 reflections). It was 6218 in the resolution range 2.5 to 20 A (4800 reflections, local scaling). The corresponding figures for all measured reflections were 0222 (4960 reflections between 2.5 and 5 A, theoretically observable: 5610) and 6237 (5761 reflections between 2.5 and 20 A, theoretically observable: 6421). All restrained geometric parameters were close to their ideal value, with an average r.m.s. deviation always smaller than the assumed standard deviation for each type of “observation”. The r.m.s. deviation of all bond lengths from their target value was 0.015 A, it was 2.4” for the valence angles and 95” for the torsion angle o around the peptide bond. was
(a) Molecular structure Cardiotoxin is a flat molecule with overall dimensions 32 A x 24 A x 15 A (Fig. 1). It is formed essentially of B-strands that interact within two antiparallel P-sheets: a three-stranded sheet, made of residues 20 to 26, 32 to 39 and 48 to 54, and a two-stranded sheet with residues 1 to 5 and 10 to 14. The hydrogen bond network of each P-sheet is
Figure 1. Cardiotoxin
Vi1 A stereoscopic
mossambica
285
continued by at least one additional bond: NH(46). . . OC(49) in the first case, NH(58). . . OC(2) in the second (Fig. 2). The polypeptide strands are held together strongly by four disulphide bridges close to each other in the “bottom’? region of the molecule (in the orientation shown in Fig. l), so that three loops protrude in the opposite direction. These will be called loop I, loop II and loop III. The small /3-sheet is part of loop I, while the three-stranded sheet extends over loop II and one strand of loop III. Three of the disulphide bridges mark the end of the p-sheets. Two of them, 3-21 and 14-38, connect the two p-sheets. The function of the fourth, 54-59, is probably to make the C-terminal loop rigid. Details on the conformation of the four disulphide bridges are given in Table 5; each has a different conformation. Two of them are left-handed, with x3 close to -9O”, and the two others are right-handed, with x3 close to +90”. x1 is close to -6O”, the most probable value a priori (Richardson, 1981), in five cysteine residues, to 180” in one and to the infrequent value of +60” in the two cysteine residues 54 and 59 forming the bridge within the C-terminal loop. This bridge is very peculiar, as it is completely symmetric, and just the mirror-image of the most commonly encountered conformation x1 XX’, z -6W; x2z:x;xx3 x -90” (Richardson, 1981). The shortest main-chain NH. . . 0 distances are reported in Table 6. The hydrogen bond pattern between the two first strands of the large p-sheet is altered locally by a /?-bulge at residue 24. This is of the so-called classic type (Richardson, 1981), with the three side-chains Met24, Va134 and Lys35 pointing in the same direction (seeFig. 3). Figure 2 gives also the values of the local twist angle in the /?-sheets.This is the angle between C?- 1 Cai+l and c’q_,c”i+,) on two consecutive strands of a /I-sheet (Salemme & Weatherford, 1981). It is
view of 1 of the 2 molecules of the asymmetric
unit of the crystal.
286
B. Rees et al.
Figure 2. The main-chain hydrogen bond network. A NH. .O interaction is reported wherever at) least, I of t,hr 2 corresponding N 0 distances determined in the 2 molecules of the asymmetric unit is smaller than 3.5 A (see Table 6). The 2 interactions between Cys53 and Cys53’ connect the 2 monomers of the asymmet)rir unit, thus forming a 6 stranded j-sheet. The Figure also indicates the twist of the 2 /?-sheets. The local intrrchain twist angle between C~~,C~+ 1 and CT- icy, 1 on 2 consecutive strands of a p-sheet (Salemme & Weatherford. 198 I ) is printed between the middle residues i and i’. The angles are given in degrees and are an average between the 2 independent molecules (r.m.8. difference between the 2 molecules, 2.6”).
Table 6 Main-chain
Table 5 The disulphide s-s
Xl
x2
NH
bridges
x3
x;
x;
d(C”. P)
-69 -63
-48 -53
-104 -106
-35 -34
-74 -78
55 c5.7
14-38
-37 -46
-76 -67
-64 -79
-115 -111
-55 -57
5.7 6.0
42-53
-165 -163
-1-72 +64
+87 +s7
f113 f112
-58 -47
60 5.9
+93 +99
+91 +104
f59 f59
69 6.4
3-21
5459
+58 f57
+91 +92
The 5 characteristic torsion angles (deg.), and the distance between the C” atoms (A) are given. x1 and x’, are the torsion angles around the 2 c”-CB bonds, xz and x’* around the U-S bonds, x3 around S-S’. For each disulphide bridge, the 2 lines contain the values refined for the 2 crystallographically independent molecules.
OC
A. Two-stranded 14... 1 3...12 12... 3 5...10 B. Three-stranded 20 39 39 .20 22 .37 37 .22 24 35 35 .24 34.. .24 26. 32 32 .26 21.. .54
Mol.
1
/,I sheet 2.8 2.8
2.6
hydrogen
Mol. 2.9 2.9 2.6 3.1
3.0 /I sheet 3.0 3.3 2.9 2.8 3.3 3.1 :+2 34 “-9 2.9 34 34 ?7 2.6 23 3. I 3.4 3.6 2.9 2%
2 NH
bonds O(
54...21 23 5“I 5%...23 25 .50 50 25 27 .4H
Mol.
-
1
Mol. 2
32 :91 32 3.1 3.1 2.x
C”. Other interaetims 58. 2 30 3-t 8... 5 IO... ‘i 3.3 18. .I5 3.7 46 .49 2.7 59 56 3.0 I). /ntermotrcutar 53 (53’
-.>.(J
The NH 0 distances (A) determined in each of thr 2 molecules of the asymmetric unit are reported, wherever at least 1 of the 2 values is smaller than 35. The shortest NH. 0 distances b&ween the 2 molecules are also given.
Cardiotoxin
V’f, from
N. mossambica
mossambica
287
R
b
Figure 3. The final model, superimposed on the Fourier transform of (a#‘,,,- F,,,)exp(i$,,,). (a) Part of the 3-stranded p-sheet, with the /?-bulge at residue 24. Map contour level, 1.9 times the r.m.s. density. fb) A zone of contact between the 2 molecules of the dimer. The strand from Cys53 to Ser55 of the 2nd molecule occupies the upper right corner. Several short contacts with the 1st molecule can be seen, in particular the main-chain interactions between the 2 Cys53 residues. The interactions involving a bridging water molecule are indicated by broken lines. The side-chain of Asn4.5 is connected t,o a 2nd water molecule seen in the background. Notice on the left side the disulphide bridge 42-53. To make the solvent molecules apparent, the map is contoured here at a lower level, 1.4 r.m.s.
experimentally better defined than the torsion angle 6 = OiCiCi+20i+2 often used to characterize the twist of B-strands (Chou et al., 1982). The right-handed twist is indicated by the minus sign of all the twist angles. The spread of the individual values is rather large, which means that both sheets are not very regular, but it is clear that the two-stranded sheet is much more twisted than the three-stranded sheet, with an average value of 29” for the latter versus 40” for the former. The C-terminal end of the molecule is well structured. It forms a loop held by the disulphide 54-59,
with a b-turn of type II (Richardson, 1981) between residues 56 and 59 (see Table 7). Another type-II p-turn connects residues 15 and 18. The third residue of this loop is the characteristic glycine. A /?-turn of type I ends loop III, but this is somewhat distorted, with a O(46). HN(49) distance as large as 3.6 and 3-7 A in the two molecules of the dimer, while the NH.. . 0 distance between the same residues is significantly shorter (Fig. 2 and Table 6). This distortion could be due to the interactions between the two molecules, discussed below, which involve, for example, the
B. Rees et al.
288
Table 7 Conformation Molecule
A. Loop I
B. Loop II
c. Loop III
The torsion
of turns
1
Molecule
2
4
CL
9
(1,
Leu6 Ile7 Pro8 Ile9
-48 -96 -60 -66
-54 110 -8 -15
-41 -88 -76 -93
-64 113 25 -1
Glu16 Gly17
-96 102
139 -11
-65 86
145 2
Ala27 Ser28 Lys29 Lys30 Met31
-73 -57 58 -99 84
-16 91 -53 - 13 165
-60 -45 108 - 143 -11
-21 72 -95 115 77
-76 158
- 103 -128
-44 141
-45 -16
-64 -120
-29 -18
Asn40 Val41
-69 -120
Ala47 Leu48
-62 -97
Asp57 Arg58
-42 49
129 41
angles C$ and $ of the residues
forming
side-chain of Asn45. There may also be competition between a 0(46) . . .N(49) and a O(46). NS(Lys50) interaction: the latter distance is quite short: 3.2 A and 3.3 A, in the two molecules, respectively. The conformation of loop I is somewhat unusual. Four amino acid residues connect the two strands of the small p-sheet. Their torsion angles 4 and Ic/ are given in Table 7. Three of the four residues, Leu6, Pro8 and Ile9, are in a right-handed helical conformation. Pro8 and Ile9 may even be considered as a fragment of a 3rc helix, which is known to form a tight turn close to a type-I j-turn (Richardson, 1981). Only Ile7 is in a /?-conformation. Its positive II/ angle, probably due to the steric hindrance of Pros, results in an elongation of the polypeptide chain.
-28 63 turns
are given
106 35
Type
of turn
j-Turn
type
II
/&Turn
type
I
P-Turn
type
II
for the 2 independent
molecules.
The electron density is quite clear and the chain tracing unambiguous for loops I and TIT. This is not so for the end loop II, which is probably affected by some structural disorder. There are significant differences in this region between the two molecules of the dimer, in the orientation of the side-chains and even in the conformation of the polypeptide chain (seeTable 7). Very likely, what we seein this region is an average between conformations occurring in the two molecules with a different frequency. A number of side-chains were found at short distances from other side-chain or main-chain atoms indicating probable interactions, principally hydrogen bonds or salt bridges. The most significant interactions are those observed independently in the two molecules of the asymmetric unit. They are reported in Table 8.
Table 8 Hydrogen bonds or salt bridges involving Mol. N(Lys2) N(Asn4) O(Asn4). N’(Asn4) Od(Asn4) O(Leu6) O(Pro8) W(Arg36) N(Gly37) Oa(Asn45) O(Ser46)
Od(Asp57) Od(Asn60) N6(Asn60) O(Asn60) N”(Arg58) N”(Arg36) Y(Arg58) O’(Asn60) N&(AsnGO) N(Tyr61) N’;(Lys50)
‘25 2.7 32 34 3.1 3.1 2.7 3.3 35 29 3.2
I
side-chains Mol.
2
2.8 2.9 31 2.6 28 33 3.0 2.7 3.4 2.9 3.3
Distances are given for pairs of 0. N or S atoms, at least one of them from a side-chain, and separated by at least 2 residues. Only distances ~3.5 A in both independent molecules are reported.
(b) The dimer The two molecules of the asymmetric unit were refined independently (at least in the later stages of the refinement). They possessan essentially identical structure. Only a few external loops show significant differences in the polypeptide chain configuration. Figure 4 shows the two molecules after least-squares superposition of the two polypeptide chains. The overall r.m.s. distance between superimposed main-chain atoms is 0.40 A (0.32 A when residues 28 to 32 of loop II are omitted). Figure 5 shows the average (r.m.s.) distance between corresponding main-chain atoms for each residue after this superposition. The largest differences are seen in the tip of loop II, and are probably related to partial structural disorder. Smaller differences are
Cardiotoxin
Figure 4. Comparison of the 2 molecules of the asymmetric atoms of the 2 polypeptide chains.
observed around residues 9 (loop I), 49 (loop II), 18 and 39 (the 2 “bottom” loops). The differences are of course
larger
for the side-chains:
while
the r.m.s.
distance after superposition is 0.36 A for the 60 C” atoms, it is 1.10 A for the 65 Cy atoms and 1.86 A for the 51 Cd atoms. The two molecules build a fairly strong dimeric association. Two short CO . . NH interactions continue the hydrogen-bond network of the threestranded /l-sheet of each molecule, so that a sixstranded antiparallel pleated sheet extends over the
10
20
Figure 5. Comparison
30
289
V’f, from N. mossambica mossambica
N”CPKNSAL.“KWCCSTDRCN 40 SO
00
of the 2 molecules of the asymmetric unit. (a) The refined B values (in AZ, averaged over the 4 main-chain atoms). (b) The r.m.s. distance (A) between the main-chain atoms of homologous residues, after least-squares superposition.
unit, after least-squares
dimer.
In spite
superposition
of the small
of the N, C”, C and 0
number
of main-chain
hydrogen bonds connecting the two middle strands, the complete B-sheet exhibits the characteristic right-handed twist, with a remarkable continuity of the interchain twist (Fig. 2): C’(36) C’(38) C”(21) C”(23)= -21”, C”(21) C”(23) C”(52) C”(54)= -23”, C”(52) C’(54) Ca(52’) C”(54’)= -21”. Other interactions involve side-chains. The strongest are listed in Table 9. (c) Side-chain
distribution
Cardiotoxin is characterized by continuous patches of hydrophobic residues at the surface of the molecule. The extremity of loop I (5 consecutive residues, from Leu6 to AlalO, followed by Tyrll) is completely hydrophobic, and flanked by a number of basic residues, (lysine and arginine). Furthermore, these five residues are either invariant (Leu6 and Pro8) or mutable only into other hydrophobic side-chains, in all 41 sensu strictu cardiotoxins sequencedso far (this excludes the so-called homologous cardiotoxins, which have more than 60 residues, and show larger differences: see Dufton & Hider, 1988). The second hydrophobic zone covers a large part of loops II and III, and is continuous over the crystallographic dimer. The side-chains are mostly on one side, the “convex” side of the molecule or of the dimer, opposite the C terminus: Va132, Va134, Tyr22, Met24, Leu26, Ala47, Leu48, Val 49 and Tyr51 all point in the same direction, thus filling a flat hydrophobic surface, which is a continuation of the hydrophobic zone of loop I. Note that the two tyrosine residues 22 and 51 are at the border of this surface, as Tyrll marks the limit of the first hydrophobic zone. Met25, Met31, Pro33 and Va152 are the
290
B. Bees et al.
.--
I
ALA
Figure (proline
6. Distribution excepted)
either
of side-chains strictly
of invariant
invariant
hydrophobic
or mutable
into
other
character. hydrophobic
The
Figure shows all hydrophobic amino acids only.
(e) Heavy-atoms
position
Not surprisingly, both platinum sites in the crvstals soaked with PtCl:-are close to rnethioninc residues; one is at a distance of 2.2 A from Sd(Met25) of one molecule of the dimer, the other at’ 2.1 a from the equivalent atom of the second molecule. The sulphur atoms of the two Met31 residues are at 5.1 and 6.0 A, respectively, but a conformational change of t’his side-chain would easily bring them within bonding distance. It is therefore possible that in the derivat’ive each Pt, atom is sandwiched between the t’wo methionine residues 25 and
(d) Solvent structure The positions and occupancies of 48 ordered water molecules were refined, and two clusters of positive density were tentatively assigned to phosphate groups. The strongest interactions with mainchain and side-chain atoms are shown in Table 10. The phosphate groups occupy similar positions in the two molecules of the dimer, each being close to the amine group of Lysl2 and to the hydroxyl
31.
Table 9 interactions
between the 2 molecules Mol.
1
Mol.
2
of the asymrmetric
Mol. 2.
Mol.
N(Cys53)
O(Cys53)
2.9
31
N(Ser5.5) OY(Ser55) OY(Ser55) OY(Ser55)
N6(Asn45) O(Pro43) HN(Asn4.5) Od(Asn45)
322 3.6 27 2.7
34 3c.2 3.3 3.3
N(Thr56) OY(Thr56) OY(Thr56)
N’(Asn45) Nd(Asn45) N<(LysSO)
3.5 3.0 3.1
3.4 2.7 29
Distances
are given
in A
residues
group of Tyr22. A number of significant water moecules were also found in similar locations in the two molecules. In some cases, a water molecule or a small cluster of water molecules bridge two amino acid residues. This is particularly frequent when one residue belongs to the C-terminal region. Also noteworthy is the bifurcated bridge bet,ween the carbonyl groups of residues 43 and 51 and the amide group of Asn45 in one molecule, and the hydroxyl group of Ser55 in the other molecule of the dimer. This is observed twice, in n.c. symmetric locations (Table 10).
only hydrophobic residues of this region that point to the other side of the molecule, but the latter residues are variable, with the exception (probably due to structural reasons) of Pro33, while the former without a single exception are always hydrophobic, and t,hus define an extended invariantly hydrophobic surface. This is quite apparent in Figure 6, where these semi-invariant hydrophobic residues are highlighted. Finally, a third zone is formed by Leul, Leu20, Va141 and Ile39; all lie in the same plane and surround the molecule below the disulphide bridges. Again, the first three residues have their side-chain on the convex side. All four are hydrophobic in all cardiotoxins (except in one case for Leul).
Main
ILL
ALA
I
Cardiotoxin
Table 10 Ordered solvent molecules A. Molecule
1 NH,(l) Lys2 Asn4
~~
NH(7) Lysl2 ~~
WI
CO(14)
w2 w3 w3
Tyrll Arg58
Pl
Tyr22
NH’(38) LyslH Asnl9
~~
w5
Asn40
W8
w9
WlO I
CO(31)
I
w15 w19’ I w13
Wll I W12
Ser28
- CO(55)
COO(60)
Lys23 I
CO(59)
LA,,,
W16
CO(28) I W17
w19 I NH(30)
NH(31)
Lys29
W18
Lys30
(‘O(42)
w20
Asp57
w25
B. Molecule2 Lys2
W26
\M’27 Lys5
Asp57
W28
Tyr22
w32
Glu16
1433 Ser28 ~~~ W34
(1. Molecule
CO(29)~~- w35 /I W36 w37
Lys30
Asn40
w41
~-
1 -molecule
/I w22
W40
291
Vif, from N. mossambica mossambica
~ W42
2 (‘) bridging
CO(43) I CO(51)--~ w21 ~ SIX55 I W23 ~ Asn45 CO(43’) I CO(51’)-- W24 ~ Ser55 I Asn45’
Pl, P2 and Wi are solvent molecules, Pl and P2 probably phosphate groups, the Wi are water molecules. The lines represent distances shorter than 35 A. When the solvent molecule interacts with side-chain atoms, the name of the amino acid is given.
Side-chain conformational changes between native and derivative crystals are probable for the tungstate derivative; no protein atom is at a bonding distance from any of the two W sites. Among the nearest neighbours is Nr(Lys5) (66 and 44 A, respectively). Again, a moderate conformational change of the side-chains, without alteration of the polypeptide backbone, would allow this amine group to make a salt bridge with WO,. Tungstate is a chemical substitute of phosphate, so that this interaction could be a model of the interaction of Lys5 with a phospholipid phosphate.
(f) Comparison
with post-synaptic
neurotoxins
Four X-ray structures of a-neurotoxins are available: neurotoxin B by Tsernoglou & Petsko (1977, in the Brookhaven Protein Data Bank under 1NXB); erabutoxin B by Smith et al. (1988, entry SEBX); a-cobratoxin, by Walkinshaw et al. (1980, entry 1CTX); a-bungarotoxin, by Love & Stroud (1986, entry BABX). The first two are in fact the structure of the same short neurotoxin (62 amino acid residues), determined at high resolution (1.4 A). They are extremely similar, and we will consider only the most recently refined one. The other two compounds are long neurotoxins: 71 amino acid residues in cobratoxin and 74 in bungarotoxin. Their structure was determined at a lower resolution (2.8 and 2.5 A, respectively). Comparison of the primary sequences showed that cardiotoxins and short neurotoxins possess eight identically situated cysteine residues, which are likely to form the same four disulphide bridges. Extensive three-dimensional similarities could therefore be expected for all toxins of the two families. The long neurotoxins have a fifth disulphide bridge, inserted at the extremity of loop II. Additional amino acid residues are inserted at the C-terminal end. X-ray diffraction showed that the four remaining disulphide bridges are at the same location as in short neurotoxins. All neurotoxins have exclusively a b-structure with at least one b-sheet. Figure 7 is the result of a least-squares superposition of the common part of this p-sheet (residues 21 to 24, 35 to 38 and 50 to 54 in cardiotoxin). The degree of conformational identity between cardiotoxin and neurotoxins for this domain can be measured as the r.m.s. distance of the superimposed C” atoms: 0.3 A for erabutoxin, 67 A for cobratoxin and 1.2 A for bungarotoxin. The distances between the C” atoms of all homologous residues are shown in the Figure. The cardiotoxin-bungarotoxin superposition (not shown) gives results similar to the cardiotoxin-cobratoxin superposition, but with larger
distances.
It is clear from Figure 7 that superposition of the central p-sheet brings the largest part of the molecules in fair coincidence. This is particularly so for the N-terminal and C-terminal ends. Larger differences are seen for all loops other than the C terminus, with maximum values at loop II.
B. Rees et al.
292
Erabu
10 20 RICFNHQPQTTKTCSPGESS@&J~SDFRGTII *
:NLCYKhJMLl
20-
LSKKB JVPVKRGC 30 -
20 DIl ‘SKDCPNGH Tl-
5 4 i
60
u
I NV
‘KNSALVKYVCCSTDRCI\
1
40
50
60
10
V-n
40
50
60
WCDFRGH :RVDmAATCPTVKTGVISTDNCN
IIIlId1,d CPE
Cardio
50 GCPTVKPGIKLSCCESEVCX
(a 1
Cobra 6
40 [m
4
Cardio IRCF
30
:NLCYKMMI
,ASKKhW
20-
30 (b)
‘PVKRGC
-
I NVCPP lNSA LVKYVCCSTDRCN
40
50
60
Figure 7. Comparison of the tertiary structures of (a) cardiotoxin and erabutoxin 6 and (b) of caardiotoxin and a-cobratoxin. Distances are shown (in A) between homologous C” atoms. after a least-squares superposition of part of the 3-stranded /?-sheet common to the 3 toxins considered. The aligned sequences are given below and above each diagram. Insertions are indicated by a star. The superimposed C” atoms are underlined (or overlined) in the sequence. The sequence alignment follows that described by Endo & Tamiya (1987) for the neurotoxins. The cardiotoxin alignment was discussed by Rees et al. (1987).
The long neurotoxin is generally less superimposable with cardiotoxin than is the short neurotoxin. An interesting exception is the peptide strand 39-43 (“bottom” of loop III): C” distances between cardiotoxin and cobratoxin are small, which indicates similar conformations, in contrast with erabutoxin, where two residues are deleted in this region and distances of the remaining C” atoms are large. Figure 8 shows the structural changes in a more explicit way. Starting from the superposition of the central /?-cores, other molecular domains may be brought into the best possible coincidence by an additional least-squares superposition. The additional rotations necessary to perform this are represented in the Figure. Strictly speaking, a general displacement may be described as a helicoidal motion, i.e. a rotation followed by a translation along the rotation axis, but in the present case the translation is always very small (less than @5 A). The Figure shows clearly the coincidence of the N-terminal and the C-terminal domains in cardio-
toxin and erabutoxin when t,he central B-cores are superimposed. The superposition of the C-terminal domains (cardiotoxin residues 54 to 60). can be improved only slightly by a rot’ation of 5” (not shown) and the r.m.s. C” distance is then 0.3 A. Cardiotoxin and erabutoxin are both flat and slightly concave molecules, but, the concave side is not the same; it is on the C terminus side in cardiotoxin, on the opposite side in erabutoxin. Figure 8 shows that all three main loops contribute to this change in concavity. The most spectacular change is of course that of loop II. which is turned hy almost 90”. When the rotations shown in the Figure are applied, there is a very good coincidence of the loop I B-sheet (r.m.s. C” distance, 0.4 ,h) and a fail coincidence of loop III (1.1 A). The conformat’ion of loop II is different, and the r.m.s. distance after superposition is larger (1.7 8). Also shown is the rotation that superimposes the polypeptide strand. made of residues 15 to 20 in cardiotoxin, that links the two-stranded and the three-stranded /?-sheets.
Cardiotoxin
V’i from N. mossambica
mossambica
293
Figure 8. Comparison of the tertiary structures of (a) cardiotoxin and erabutoxin b and (b) of cardiotoxin and a-cobratoxin. The thick line is the C” backbone of cardiotoxin, the thin line that of neurotoxin. The common part of the central j-sheet is superimposed by least-squares, as in Fig. 5. The Figure shows the additional rotation (axis and signed angle) necessary to bring into coincidence other parts of the molecule: the 2stranded b-sheet (residues 1 to 5 and 10 to 14 in cardiotoxin), loop II (residues 26 to 32), loop III (residues 42 to 53) and, for the comparison with erabutoxin only, the link between the 2 b-sheets.
The position of this strand is different in the two toxins but its conformation is very similar (the r.m.s. distance after superposition is 0.7 A). Considering now the long neurotoxin, an interesting observation is that the operation that brings loop III into coincidence with cardiotoxin is practically the same as for erabutoxin; the same rotation angle of 27” and similar position of the axis, in spite of the deletion of two residues at the bottom of loop III in erabutoxin. In contrast, the B-sheet of
loop I is displaced in a quite different way in the two neurotoxins; the rotation is small for erabutoxin, larger and in the opposite sense for cobratoxin. (g) Comparison with two-dimensional n.m.r. A complete assignment of the two-dimensional ‘H n.m.r. spectra of cardiotoxin CTXIIb has been carried out and extensive structural constraints
294
B. Rees et al.
derived (Steinmetz et al., 1988). This toxin has 680/b sequence homology with cardiotoxin Vi’. The threedimensional crystal structure of cardiotoxin Vi’ and the structure in aqueous solution of cardiotoxin CTXIIb, as determined by n.m.r., are extremely similar. The agreement includes a number of structural features, such as the existence and extension of the two /?-sheets and the b-turns 46-49 and 56 59. The torsion angles also agree with the indications given by the n.m.r. coupling constants. For example, the bounding values ascribed to three cysteine x1 angles, -20” to - 100” for Cys3, 140” to 220” for Cys42 and 20” to loo” for Cys54, are consistent with t,he X-ray results reported in Table 5. There are, however, several interesting discrepancies, which may be due either to a conformational change in solution or, more likely, to differences in the amino acid sequence; the suggested polyprolinelike conformation of the extremity of loop I could be due t’o the succession of two proline residues at positions 8 and 9. Interestingly, in spite of this difference between t’he two toxins, the n.m.r. distance-geometry calculations indicate that the two-stranded P-sheet is strongly twisted, as observed also in the present X-ray study. In loop IT, the three hydrogen bonds after the p-bulge are not detected by n.m.r., and at the top of the loop the torsion angle 4 at residue 29 is in the interval - 160” to -80” for CTXZZB. in contrast to the positive X-ray values given in Ta’ble 7. Rut in this region the two toxins have a completely different sequence. There is also a small difference in loop III, where a NOE is observed for a O(46) . . N(49) interaction, as expected for a normal b-turn, rather t,han N(46). . O(49) (Table 6). Note, however, that the torsion angles obtained from the distance-geometry calculations with n.m.r. con straint’s (42= -61°. $2= -21”. 43= -91”, $3= -26”. for the “best” solution). match well with t,he X-ray values (Table 7). A larger change seems to occur in the loop connecting the two p-strands (residues 14 to 20); no p-turn 17-l 8 is observed in (‘TXllb (no slowly exchanging proton), and the NOE Ha(18))H”(57) indicating proximity with t’he C-terminal region is at’ complete variance wit#h the X-ray dist(ance of 12.6 A between the respective P/ atoms.
4. Discussion The refined structure of the cardiotoxin molecule is characterized by a well-structured core constituted by the two b-sheets and the C-terminal loop. consolidated by the four disulphide bridges. This core contrasts with less well ordered loops, especially the protruding and rather floppy loop II. This is shown clearly by the variable quality of the electron density maps in the different regions of the molecule, by the magnitude of the temperature factors. and by the more or less exact matching of the two crystallographically independent molecules (Figs 4 and 5).
The C-terminal loop (residues 54 to 60) is in sharp contrast to the other loops; its structure is rein forced by the disulphide bridge 54459 and a @turn. A number of interactions secure it to the rest of the core: Table 8 shows quite clearly that this part of the molecule. and particularly the last residue. Asn60, is tightly bonded to the N-terminal end. Among the C-terminallN-terminal interactions is the NH(58). O(2) hydrogen bond, which may be considered as an extension of the twostranded ,&sheet of loop T. The disulphide bridge 42253 and the interactions of the terminal residue Asn60 with Arg36 cont’ribute to the fastening of the C-terminal loop to t’he central part of the molecule. Ordered solvent molecules add even more to the stabilit)y. The small valur of the thermal motion compared to the rest of the molecule is certainly a consequence of t’his (Fig. 5). Arg36 is invariant, in short neurotoxins as well as in cardiotoxins. Indeed, the irrt,eraction between the guanidyl group of this arginine base and the (l-terminal carboxyl group was also shown in erabutoxin R (Kimball et al., 1979: Tsernoglou B Petsko, 1977). It) is in agreement with t,he unusual pK, value of this carboxyl found by n.m.r. for botjh cardiotoxin Vi’ and a short neurot,oxin; the strong tendency for deprotonation is indicat,ivc of the vicinity of a positive charge (I,aut,erwein of nl.. 1978). The live polypeptide fragments connecting t~hc p-strands are less well ordered and are chara&erized by larger B factors and larger differences between the two molecules of the dimer. This is principal]) so for the large and protruding loop II. which is completely immersed in the solvent. This lack of defined stru&ure correlates with t,hr proba.bIc absence of any functional role (Gatincau et al.. unpublished results), in contrast, to nrurotoxins. where it, is an essential part of the toxic site (Endo & Ta,miya, 1987). Quite different is loop I, which plays a,n important role in the toxic action of cardiotoxin (Dufourcq et al.. 1982; Gatineau et al.. unpublished results). Although a number of interactions connect it to the rest, of the molecule, and more part’icularly to t)he C-terminal end. it) is tjopologically quite distinct. It is characterized by a strongly twisted /?-sheet (Fig. 2). The link between the two strands of this sheet is formed by four residues. the third of which is proline. invariant in all cardiotoxinn sequenced so far. The effect of this proline residue could be to disrupt, t#he otherwise helical conformation of the link and thus cause its elongation, while maintaining a relat,ively rigid structure. This brings the extremities of the B-strands wide apart. and may cause the observed t)wist Another striking feature is the number of intrractions between the two molecules of the asyrn metric unit. which lead t,o formation of a sixstranded ant,iparallel b-sheet. Hydrophobic sidechain interactions. hydrogen bonds between sidechain atoms and bridging water molecules st.rengthen the dimeric association. Whether this assocation is relevant to the toxic activity remains an
Cardiotoxin
V; from N. mossambica
open question. A very similar situation has been observed in the crystal structure of a-bungarotoxin (Love & Stroud, 1986). The comparison between cardiotoxin and the known structures of post-synaptic neurotoxins leads to the same distinction between a stable core and more variable and flexible external loops. The core is made of the central /I sheet and the C-terminal loop, and a few residues at the N terminus. It has the same conformation in cardiotoxin and in the short neurotoxin, and can be very well superimposed, while additional rotations are needed to superimpose the external loops. Essentially the same conclusions are reached when the comparison is made with long neurotoxins, although larger differences are observed in the core, probably due to the additional strand at the C terminus. An interesting observation is that the relative positions of loops II and III are different in cardiotoxin and in neurotoxins, but are the same in the short and long neurotoxins compared. This could be illustrative of a structural invariance of the relative position of those loops in the post-synaptic neurotoxins. Such an invariance could be a biological necessity if the site of interaction with the acetylcholine receptor extends over the two loops II and III, as was wellestablished for several short and long neurotoxins (Kimball et al., 1979; Menez et al., 1982; Inagaki et aE., 1981; Endo & Tamiya, 1987).
x
mossambica
295
Cardiotoxin is characterized by its basicity and its hydrophobicity. Of the 60 amino acid residues, 12 are basic (10 lysine and 2 arginine) and only two are acidic (1 glutamate and 1 aspartate). In many cases, basic and hydrophobic residues are neighbours in the spatial arrangement. This and the peculiar distribution of hydrophobic zones is certainly related with the toxic activity, even if the mechanisms involved, and even the target, are still controversial. In one way or another, the hydrophobic clusters of cardiotoxin certainly interact with the hydrophobic core of the phospholipid membrane, while the basic side-chains form bridges with external negative parts. Here, the contrast with neurotoxins is striking; while the molecular architecture is very similar, the distribution of hydrophobic residues is completely different. For example, in erabutoxin, loop I is essentially hydrophilic. One is rather reminded of scorpion neurotoxins, which, like the cardiotoxins, are membrane toxins and where similar hydrophobic patches exist, in spite of a completely different folding (Fontecilla-Camps et al., 1982). The solvent content of the crystals of cardiotoxin is unusually large for this protein size. This implies an unusual packing of the molecules, which is probably the main reason for the difficulties encountered in growing crystals. This can now be understood by a study of intermolecular contacts. Figure 9 shows
x
Figure 9. Crystal packing. The Figure shows, in the hexagonal unit cell, a dimer (a, a’) forming one asymmetric unit, and its closest neighbours. The contacts are essentially of van der Waals’ type, between hydrophobic side-chains. Each molecule is represented by the c” backbone with the hydrophobic side-chains.
296
B. Rees et al.
the crystal packing. The shortest contacts are observed between one molecular dimer (a, a’ in the Figure) and its two neighbours b and b’ in the 6, helix of the hexagonal packing around the z axis. Almost all those contacts are between hydrophobic residues of the central hydrophobic patch (see above); namely, Met24, Leu26, Va134 and Va149. Leu6 of loop I of each monomer is also involved, in contact with a neighbouring Leu48. The only hydraphilic interactions are given by Lys35; in one monomer it is in close contact (2.9 A) with Ser49, in the other it may interact weakly with Tyr51 (at 3.8 A).
A
smaller
number
of
contacts
is
Correction
I
for Disordered
0.3
? 1
seen
between two different helices. (a with c and a’ with c’ in Fig. 9.) They involve mostly the hydrophobic residues of loop I. Obviously, the crystal packing is dominated by the necessity, for energetic reasons, to hide the exposed hydrophobic side-chains from hydrophilic parts of other molecules or from the solvent. A closer packing would result only in a marginal gain in energy, more than compensated by the loss due to more exposed hydrophobic side-chains. It, would be interesting to know if similar, even if only temporary, molecular associations exist in solution. and their bearing on the mode of action of the cardiotoxins in the venom.
Appendix
0.6
0.2
0.1 20
IO
5
4
3
i 2-5
X(2Sl”8d)
Figure 10. Correction for disordered solvent.
Thr
Figure reports the R factor versus the resolution, after restrained refinements in the resolution range 2.5 to 20 A. This total range wasdivided into 20 domains equidistant in sin20/1’. All reflections with F > &r(F) were included in the calculation of the R factor. (a2(F) = crf + (2F)‘. where ~7~ is the standard deviation from counting statistics.) Dotted lines, the refinement included a correction for disordered solvent. Continuous lines. no solvent correction. overall H factor. Broken lines. no solvent correction. local scaling (scale factor determintd separately for each domain of resolution).
Solvent
A large solvent content of the crystals has a big systematic effect on t,he diffracted intensities, resulting in large variations of the scale factor between Fobsand uncorrected Fca, values, as a function of resolution. The effect of disordered solvent can be approximated by subtracting from the calculated amplitude diffracted by each atom that of the corresponding volume of uniformly distributed solvent. For simplicity, an identical spherical volume is considered for all atoms of the same type (M. S. Lehmann, unpublished results). A common temperature factor is applied to all the solvent volumes, at least equal to the largest temperature factors of the protein atoms. The restrained refinement program PROLSQ was modified to include this contribution of the disordered solvent to the structure factors and to their derivatives. This correction results in a change of a factor of about, 2 in the average scale factor (10.6 instead of 5.6) and an increase of 19 A2 of the apparent temperature factor. Figure 10 shows the variation of the agreement factor R with the resolution, with and without solvent correction, after several cycles of PROLSQ in the 2.5 to 20 A resolution range. Since, in the absence of correction, the scale factor is very dependent on resolution, a local scaling in
small resolution ranges was performed before the calculation of the R factor. In the solvent correction case, this makes no significant difference in the R factor.
It is clear from this Figure that, even with this local scaling, the observed data agree het.ter wit.h the solvent-corrected calculation at low resolution. At intermediate resolution: however, from 7 A to about 4 A, the reverse is true. which seemsto indicate that this solvent model is probably too crude when the resolution is increased. At even higher resolution, the effect>of the correction is just thab of a change in scale and H factor, so that the t,wo models become equivalent. For these reasons. the latest stages of refinement were done without solvent correction, but discarding all low-resolution reflections (low resolution limit, 5 A). Note that, while large on the temperat,ure factor. the effect of the solvent correction on the atomic coordinates is small; after the two PROLSQ refimments mentioned above, the r.m.s. positional difference was 0.09 A for all atoms, and only 0.035 A for the main-chain atoms alone.
Appendix II Correlated Density If one wants to make use of n.c. symmetry in a density-modification approach, it is necessary to define a molecular envelope, i.e. the limits of a three-dimensional region within which the n.c. symmetry applies. The automatic mask determination methods used for solvent levelling cannot be
Cardiotoxin
V’f, from N. mossambica mossambica
used here, since such methods can define the limits between protein and solvent, but not between adjacent proteins. The problem is usually solved by visual inspection of a low-resolution map. Sometimes the density is first averaged about the n.c. axis, since this enhances the region of interest; if pt is the true density, the expected value of the average is pt within the envelope, but close to zero outside, because of the lack of correlation. The concept of correlation can be used in a more systematic way. If p is considered as a statistical variable of mean 0, an estimate of the correlation coefficient between the densities at the n points r=ri,. . . r, related by the n.c. symmetry is: Covariance y(r) = Variance
= ~&+jp(rib(ri)/(n2 -4 xP(ri)21n
=-&(g!$ -1). In the present case, n = 2 and y(r) = 2Arl Wd/ (P(ri)2 +p(r2)2). The correlated density y(r)p(r) has the following advantage over the simple average of densities; for a given value of p(rl), the correlated density is maximum when p(r2) =p(rl), and smaller if p(r2) is either smaller or larger than p(rl). The average density is larger the larger p(r2), even if the matching with p(rl) is poor. An averaged correlated density map may make the molecular boundaries even more clearly apparent.
References Bhat, T. N. & Cohen, G. H. (1984). J. Appl. Crystallogr. 17, 244-248. Bricogne, G. (1976). Acta CrystaZZogr. sect. A, 32, 832-847. Chou, K. C., Pottle, M., Nemethy, G., Ueda, Y. & Scheraga, H. A. (1982). J. Mol. BioZ. 162, 89-112.
Edited
297
Dufourcq, J., Faucon, J. F., Bernard, E., Pezolet, M., Tessier, M., Bougis, P., Van Rietschoten J., Delori, P. $ Rochat, H. (1982). Toxicon, 20, 165-174. Dufton, H. J. & Hider, R. C. (1988). Pharmacol. Ther. 36, 140. Endo, T. & Tamiya, N. (1987). Pharmacol. Ther. 34, 403451. Fontecilla-Camps, J. C., Almassy, R. J., Suddath, F. L. & Bugg, C. E. (1982). Toxicon, 20, l-7. Fujinaga, M., Gross, P. t Van Gunsteren, W. F. (1989). J. AppZ. CrydaZZogr. 22, l-8. Harvey, A. L. (1985). J. ToxicoZ.-Toxin Rev. 4, 41-69. Hendrickson, W. A. & Konnert, J. H. (1980). In Cry8takgraphy (Diamond, R., Computing in Ramaseshan, S. & Venkatesan, K., eda), pp. 13.01-13.23, Indian Inst. Sci., Bangalore. Inagaki, F., Clayden, N. J., Tamiya. N. & Williams, J. P. (1981). Eur. J. B&hem. 120, 313-322. Jones, T. A. (1982). In ComputationaZ CrystaZZography, (Sayre, D., ed.), pp. 3033317, Oxford University Press, Oxford. Kimball, M. R., Sato, A., Richardson, J. S., Rosen, L. S. & Low, B. W. (1979). Biochem. Biophys. Re8. Commun. 88, 950-959. Lauterwein, J., Lazdunski, M. 6 Wiithrich, K. (1978). Eur. J. B&hem. 92, 361-371. Love, R. A. & Stroud, R. M. (1986). Protein Eng. 1,37-46. Menez, A., Boulain, J. C., Faure, G., Couderc, J., Liacopoulos, P., Tamiya, N. & Fromageot, P. (1982). Toxicon, 20, 95103. Rees, B., Samama, J. P., Thierry, J. C., Gilibert, M., Fischer, J., Schweitz, H. & Lazdunski, M. (1987). Proc. Nat. Acad. Sci., U.S.A. 84, 3132-3136. Rice, D. W. (1981). Acta CrystaZZcgr. sect. A, 37, 491-500. Richardson, J. S. (1981). Advan. Protein Chem. 34, 167-339. Salemme, F. R. k Weatherford, D. W. (1981). J. Mol. BioZ. 146, 101-117. Smith, J. L., Corfield, P. W. R., Hendrickson, W. A. & Low, B. W. (1988). Acta Crystallogr. sect. A, 44, 357-368. Steinmetz, W. E., Bougis, P. E., Rochat, H., Redwine, 0. D., Braun, W. & Wiithrich, K. (1988). Eur. J. Biochem. 172, 101-116. Tsernoglou, D. & Petsko, G. A. (1977). Proc. Nat. Acad. Sci., U.S.A. 74, 971-974. Walkinshaw, M. D., Saenger, W. & Maelike, A. (1980). Proc. Nat. Acad. Sci., U.S.A. 77, 2400-2404.
by R. Huber.