Volume 18, number 2
FEBS LETTERS
November 1971
T H E A M I N O ACID S E Q U E N C E O F C Y T O C H R O M E c 3 F R O M DESULFURICANS ( S T R A I N E L A G H E I L A Z, NCIB 8 3 8 0 )
DESULFOVIBRIO
R.P. AMBLER Department o f Molecular Biology, University o f Edinburgh, Edinburgh, EH9 3JR, Scotland
Mireille BRUSCHI Laboratoire de Chimie Bacterienne, C.N.R.S., Marseille, France
and J. LE GALL Department o f Biochemistry, University o f Georgia, Athens, Ga. USA
Received 10 June 1971
In previous communications [ 1, 2] we have reported the amino acid sequences of the cytochromes c3 from two species of the sulphate reducing bacterium Desulfovibrio. The different species in this genus [3] are similar in their general metabolism, but differ markedly in their DNA base composition [4, 5] and morphology. We have now determined the amino acid sequence of the cytochrome from a representative of a third species of the genus (table 1 and fig. 1). The sequence of the protein is sufficiently similar to that of the other two (fig. 2) for there to be no reasonable doubt that the three proteins are homologous. The conditions for growth of the organism and preparation of the cytochrome were similar to those used for the preparation of D. vulgaris cytochrome Ca [6]. The final preparation was judged to be pure by polyacrylamide and starch gel electrophoresis, and by N-terminal groups analysis, and by stoichiometry and absence of leucine in acid hydrolysates of the protein. The apo-protein was prepared as previously described [2], and the amino acid sequence determined by separation and analysis by standard methods [7, 8] of the products of tryptic, chymotryptic and thermolysin digestion (fig. 1). Most of the details of the sequence were determined from a tryptic digest of 2.5 gmole performic-acid oxidised apo-protein, but the large peptide comprising residues 4 1 - 6 2 could not be obtained in good enough yield for complete sequence North-Holland Publishing Company - Amsterdam
determination. The protein is completely lacking in leucine, and three of the five aromatic residues are very close together (residues 68-72), so chymotryptic digestion presented difficulties. By careful choice of conditions (pH 9, enzyme-substrate ratio 1/15 by wt, 4 hr, 37°), it was found possible to digest the unoxidized apo-protein such that cleavage occurred at most of the histidine and methionine residues. Even with very good quality chymotrypsin, pre-treated with soya-bean trypsin inhibitor, it was very easy to overdigest the protein to give a highly complex mixTable 1 Properties of cytochromes c 3 of known amino acid sequence, and of the bacteria from which they were prepared. Organism
D. vulgaris
(strain) G + C%
NCIB 8303 NCIB9332 NCIB 8380 Hildenborough E1 AgheilaZ 61 60 55
D. gigas
D. desulfuricans
Cytochrome c 3
isoelectric 10.5 No. of residues 106 Amino acids absent lie, Trp point
5.2
about 10
110
102
Met, Arg
Leu
347
Volume 18, number 2
FEBS LETTERS
November 1971
~-~A~a-Pr~-a~-Asp~et-~-~e-L~a-P~-A~y-A~-~-~-~r-~-~a~r~-~-Phe~er-H~-~s-G~y2
3
~
H C
5
6
7
~
~ ---~ ~ "
9
10
11
12
13
I~
15
I~
~7
18
• ~ ~ ~
20
21
22
23
24
•
~7 38 39 ~0 ~1 ~
II
~,~ ~
t6
t6
59
60
~I
62
63
64
65
66
67
68
r -Gly-Cys-His-
~7 ~8 ~9 50 51 52 5~ 5~ ~5 56
~
-Ala-£sn.Thr-Glu.Ser.Lys.Lys-Oly-Asp-Asp-Ser-Phe-Tyr 58
28
(n)
~
57
27
.,.
~3 3~ ~5 36
T
~
•
-Nis.Ala-Ser.~a t-Asp-cys .Lys .Thr -Cys -Ris-His-Lys -Trp-Asp-Gly-Ala-Oly-Ala-lle-Olu-Pro-Cys-Oln-Ala-Se
29 30 31 ~
25
~' '-
T
19
-~t
-Met-Ala-Phe-His-Olu-A rg-Lys -Se r-Olu-Lys-Se r-Cys-VaI-OIy~ys69
70
71
72
73
7/~ 75
76 77 78 79 80 81 ~2 83
•
-His-Lys -Set-Me t-Lys-Lys -Oly-Pro -Thr.Lys-Cys -Thr-Olu-Cys -His-Pro-Lys =As n
~5 86 87
g8 89 90 91 92 93 9~, 95 96 97 98 99 100 101 102
Fig. 1. Amino acid sequence of D. desulfuricans cytochrome c3. The peptides isolated from trypsin (T), chymotrypsin (C) and thermolysin (H) digests are shown by bars. Enzymes used for further degradation of certain peptides are shown by the letters in parentheses above the bars. (S) shows digestion with subtilisin B. Where a digest contained two or more peptides derived from the same region of the cytochrome, the lower bar indicates the peptide recovered in highest yield. ture in which many lysyl bonds had been hydrolysed. The T h r - L y s bond (18/19) was unexpectedly found to be very susceptible to chymotryptic hydrolysis. Difficulty was also experienced in estimating the tryptophan content of the protein. This cannot be done by direct spectrophotometry or by specific colour reactions when the haem is present, and it was found that the process of haem removal partially degraded the tryptophan. From the chymotryptic digests of the apo-protein, peptides H i s - L y s - T r p (yield 4%) and L y s - T r p (yield 9%) were recovered with the tryptophan intact, and all peptides containing tryptophan or its derivatives isolated from the protein were compatible with this sequence. Amide residues were located 348
by release with exopeptidases or by consideration of the electrophoretic mobility of simple peptides at pH 6.5. The protein is similar in size to the other c3 type cytochromes of known structure (table 1), and like them contains four cysteine-histidine clusters. The NMR spectra of cytochromes c3 [9] are much more complex than those of simple monohaem c cytochromes and indicate that the proteins contain at least three and possibly four haem residues per molecule. Recent work (Dr. Terry Meyer, personal communication) in which the haem content of D. vulgaris cytochrome ca determined from the spectrum of the pyridine haemochromogen indicates that these proteins
Volume 18. number 2
FEBSLETT'ERS
November 1971
12345678
v.d.a.P.A.d.m.v.
..,.........I..... .. 1
.
-----
___-____
2 I13 I 1 8901 2 3 4 516 7 8 9’0 12 3’4 5 6 7 8 9; :90123 4567 t.k.a.p.-.-.V.a.F.s(H.k.g.H!a.s.m.d/C.k.t.C.H.Hf :.i.k.a.P.A.-.G.a.K.v.-.-.-.-.
V.v.F.n/H.s.t.Hik.s.v.k{C.g.d.C.H.H! a.p.k.a.P.A.d.G.l.K.m.e.-.-.a.t.k.q.p.-.-. v.d.v.P.A.d.G.a.K.i.d.f.i.a.g.g.e,k.n.l(,v,F~n~H(s,t,H~k.d.v.k,~.b.b.~.H.H~ L-_-_-J I--___J
4
r-------T ,5
6
I-----l I7
3 4 5 6 7 8 9iO 12 314 5 6 0123456789~0123456~789012 .k.w.d.g.a.g.a.i.q.ptC.q.a.s.G.C.Hta.n.t.e.s.k.-.K.g,d.d.s.f.Y~m.a.f.H~e.r.k. .p.v.n.g.k.e.d.y.r.k;C.g.t.a.G.C.H~d.s.m.d.k.k.d.K.s.a.k.g.y.Y~~.v.m.H~d.k.n. .z.p.g.b.k.q.-.y.a.giC.t.t.d.G.C.HIn.i.l.d.k.a.d.K.s.v.n.s.w.Y!~~v~v~H~d.a.k. I________J ~_~~~~~~~ 0' 9 8;-----7 6 7 8 9'0 12 789012 3415 011 2 3 4 516 78 9 .s.e.-.K.-.slC.v.g.C.H~k.-.-.-.-.-.-.-.s.m.K.K.g.p.T.k~C.-.-.t.e.C.H~p.k.n .t.k.f.K.-.sjC.v.g.C.H/v.e.v.a.g.a.d.a.a.k.K.K.d.~.T.g~C.k.k.s.k.~.H,a
Fig. 2*.Amino acid sequences of cytochromes cg from D. desulfuricans(l), D. vdguris (2)and D. gigas (3) aligned to match best. Residues common to all three proteins are shown in capitals. Numbering is as in fig. 1. The first eight residues of sequence (1) are arranged to show a possible partial gene duplication. The four putative haem-binding sites are boxed, as are the two other histidine-containing regions. * A one-letter notation recommended by IUPAC-IUB Commission on Biochemical Nomenclature (1968).
probably contain four haem residues per 12,000 M.W., so in the native protein it is very likely that each of these cysteine-histidine clusters binds one haem. The sequences of the three cytochromes c3 are shown aligned in fig. 2, with insertions and deletions put in to try and maximize the matching. Between residues 22 and 56 the three sequences match very well, with histidine residues at positions 26,29 and 39 as well as in each of the putative haem binding sites. In the second of these haem sites, (residues 50-56) the cysteine residues are separated by four residues rather than the two found in typical cytochromes c. There is less similarity between the other parts of the molecules. The third and fourth haem binding sites are much sloser together in the D. desulfiricans sequence than in the other two, and in this protein alone the fourth site has the standard two-residue separation for the cysteines. It is possible that the residue at position
70 and histidine-73 may prove to have a similar structural or functional role to the histidines at positions 26 and 29, but proof will require chemical modification experiments or tertiary structure determination. The first eight residues of the D. desulfiricans sequence in fig. 2 are arranged to show a possible partial gene duplication. There is more difference between the sequence of the D. desulfuricans protein and that from D. vulgaris or D. gigas than between the latter proteins though no attempt is made to quantitate this because of the arbitrary value that would have to be given to deletions. This greater evolutionary separation of D. desuljiiricans is also expressed in the significantly lower G + C content of its DNA (table 1). We await with great interest the sequence of the cytochrome c3 from D. salexigens (C + C = 46%) [ 31.
349
Volume 18, number 2
FEBS LETTERS
Acknowledgements We wish to thank Mlle. GisBle Bovier-Lapierre Miss Elizabeth Clarke for technical assistance.
and
References [l] R.P. Ambler, Biochem. J. 109 (1968) 47P. [2] R.P. Ambler, M. Bruschi and J. Le Gall, FEBS Letters 5 (1969) 115. [3] J.R. Postgate and L.L. Campbell, Bacterial. Rev. 30 (1966) 732.
350
November 197 1
[4] N. Sigal, J.C. Senez, J. Le Gall and M. Sebald, J. Bacterial. 85 (1963) 1315. [S] G.F. Saunders, L.L. Campbell and J.R. Postgate, J. Bacterial. 87 (1964) 1073. [6] T. Horio and M.D. Kamen, Biochim. Biophys. Acta 48 (1961) 266. [ 71 R.P. Ambler and L.H. Brown, Biochem. J. 104 (1967) 784. [8] W.R. Gray, Methods in Enzymology 11 (1967) 469. [ 91 C.C. McDonald, W.D. Phillips, J. Le Gall and S. Vmogradov, 4th. Intern. Conf. on Magnetic Resonance in Biological Systems, Oxford, England, Aug.-Sept. 1970.