Replacement of amino acids in proteins

Replacement of amino acids in proteins

j. Theoret. Biol. (1961) 2, Z&--257 Replacement of Amino MARTYNAS Acids in Proteins YCAS Department of Microbiology, State University of New Yor...

706KB Sizes 0 Downloads 87 Views

j. Theoret. Biol. (1961) 2, Z&--257

Replacement

of Amino MARTYNAS

Acids

in Proteins

YCAS

Department of Microbiology, State University of New York Upstate Medical Center, Syracuse, N. Y., U.S.A. (Received 13 January 1961) Some consequences of the hypothesis that ribonucleic acid codes amino acid sequences with a coding ratio of one are discussed. The hypothesis predicts certain limitations on amino acid replacements in proteins as a result of mutation. Available evidence is reviewed and found to conform with prediction.

Recently I reviewed evidence indicating that information determining the sequence of amino acid residues in viral protein is coded in viral ribonucleic acid with a coding ratio of one, so that one nucleotide corresponds to and determines one amino acid residue. Part of the evidence for this hypothesis is that amino acids can be assigned to nucleotides in such a manner that the mole fraction of each group of amino acids in viral protein is equal to the mole fraction of the corresponding nucleotide in the viral ribonucleic acid. It is thus possible to predict the composition of viral ribonucleic acid from knowledge of the composition of the corresponding viral protein, Assignments producing this result are as follows: Adenylic acid: glutamic acid, glutamine, glycine, leucine, phenylalanine and tryptophan. Uridylic acid: aspartic acid, asparagine, isoleucine, histidine and serine. Guanylic acid: alanine, arginine, tyrosine and valine. Cytidylic acid: cysteine, lysine, methionine, proline, and threonine (YEas, 1960). It is the purpose of this paper to develop certain deductions of the hypothesis as they bear on mutations affecting protein structure, and to show that these deductions are supported by empirical evidence. A coding ratio of one evidently implies that viral ribonucleic acid possessesonly part of the information required to specify a protein, since each nucleotide limits, but does not finally determine, the choice of a residue at any given position.? Supplementary information must be contributed by some structure in the host. As illustrated in Fig. I, the complete information bearing structure is assumed to be composed of two t For evidence against the supposition information see YEas, 1958.

that neighboring 44

nucleotides provide additionai

REPLACEMENT

OF

AMINO

ACIDS

IN

PROTEINS

245

components: a, here viral ribonucleic acid, and p. The ,/3component is as yet chemically unspecified and may itself be a compound structure. On general biological grounds it seems reasonable to postulate that, if this is correct, such a scheme may apply generally: a p and a non-viral a component being always present in the cell for purposes of normal protein synthesis. c( (RNA) P

FIG. 1. Illustration of the concept that the material specifying in proteins is a double structure. An amino acid at any position a nucleotide in RNA and an element X in the p component.

the amino acid sequence is completely specified by

Consider now the result of a mutation in the a or /I component. (The term “mutation” is here used in the sense of any inherited change, whatever its nature.) To change the a component, at least one nucleotide must be replaced by another of a different kind, and the amino acid previously specified at that position must therefore be replaced by one belonging to a group specified by a different nucleotide. For example, if the original nucleotide was adenylic acid, the new amino acid will be one specified by cytidylic, guanylic or uridylic acid, but not by adenylic acid. If, on the other hand, a change is produced in the fl component, the new amino acid will belong to the group determined by the same nucleotide as the one previously occupying that position, since the nucleotide in the a component remains the same. Thus replacements prohibited as a result of “a mutation” are the only ones permitted as a result of “p mutation”, and vice versa. Since the assignments of amino acids to nucleotides are, by hypothesis, known, these predictions are quite definite and can, in principle, be checked by observation. In the case of viruses, it is possible to be certain that all mutations are a mutations, since different mutants, i.e. a components, can be grown in the same host and thus produce proteins with information contributed by the same /3 component. All amino acid replacements in viral protein resulting from mutation must therefore involve amino acids assigned to different nucleotides. The evidence available does not contradict this requirement (Table I), but unfortunately it is as yet too meager to be conclusive. More evidence is available on replacements involving non-viral proteins. As in these cases it is not possible to separate experimentally the u and j3 components, it might seem hopeless, at first sight, to observe separately the processes of a and /3 mutations. However, there remains the possi-

I

Bonito

I IH.LPII 11 IH.Ala

B (‘hain,

Insulin Insulin

(Sauger, (Sangrr,

A Chain* B Chainf

-_---.___-

” : :

:}

Pro-l.Ys.OII

al.,

H.Gly-Ilu-Val H.Phr--Val-Am-Gin I -

(Ynmamotoet

~goo)

igbo)

3

4

5

fJ

7

ef al., 1958) Sperm Whale Horsr s1mp Pig -Glum-Gin - Ly>m-Cys-HIS--LPU Cys-

(Idxhara

1960)

Sri \Vhale

.--_

: \Vhalr,

10

I1

I2

I3

t Cattle. Man. Rahblt Horse. Sheep, Pig, Cattle

',

‘?&=Ser-Thr 1 ! Thr-Ser-IIu I Thr---GlyIlu I ALGly--Val 1 1 Thr-?cr-Ilu -Ala--Ser.--Val-Cys--Scr--Lou-‘1.ye-Gin-Lcu I$-Scr-HIS-Leu --Val-Glu--Ala

.-___

Replacements d$feermtiating Knozun sequences of homologous proteins. references i?l some cases are not to the original zuork, but to conveniertt Fig. 2.

TABLE

14 15

-Leu--T;;

17

IQ

zo

--Glu -&a -Ty--Cys--Aan.OH -I.cu--\:aa--Cys-GIy-Glu-Arg

c_-.

*I

92

-

” recently divergent ” proteins boxed in. The compilations. Amino acid abbreviations as in

VasopresdnJI

1960)

Vasopressins,

brticotropin

MSH Pig MSH Ox Corticotropin

25

(Cys,

Cattle,

t Man,

Pig, Horse,

Vigneaud

Sheep,

et al.,

0 Man > Cd 3%rse , Sheep I/ Frog (Acher ef al., 1960)

Ilu,

(Du

30

Chicken

(White

& Landman,

b; Acher,

~-__ ’ Gly-Ala-Glu-Asp-Asp-Giu-Leu-Ala Asp-Gly-Glu-Ala-Glu-Asp-Q-Ala ) Ala-Gly-blu-Asp-Asp-Glu-Ala-Ser -Asp-Gly-Ala-Glu-Asp-Glu-Leu-Ala32 33 34 35

19533,

31

Pig “A” Ox Sheep

and corticotropins

Pro-Val-Lys-Val-Tyr-Pro26 2, 28 zg

hormones

Tyr,

Oxytocins

pig-Arg-Arg24

Pig “fi”

Melaophore-stimulating

37

Horse Ox

j

(Elliot

& Peart,

-Glu-Ala-Phe-Pro-Leu-Glu-PhcOH 40 41 42 43

peptide

39



1960)

H.AspArg-Val-TyrH.Asp--Arg-Val-Tyr-

38

Acher.

Hypertensive

36

1955;

1956;

44

45

Skeggs

46

et al., 1956)

Y

H.Asp. H.Asp..

Chicken Turkey

wideus

fruffa

Sdm

Sdm

(Felix,

1952)

1952)

::

Ser,,

Ala,,

.__-..---

ic’al,.

~__ 1 Val,,

/ .~ALa.OH .Val.OH .__

SW,, Ala,.

~.~._

Peters

.IAla.OHl Ala-&mOH

1954;

not sequence

(Gly,,

(Gly,,

.Gly-Val-Ala-a

(Thompson,

t Composition,

(Felix,

H.AspH.Arp-b, H.Asp

--Alai

albumins

Protamines

1959)

.Val-Gln-Lys--Cys-

C (Tuppy,

Ilu,

Ilu,

_-

i)f

)t

et al.,

..-_.__ ..- -.._ -

1958)

. ..Val-Gln-Arg-Cys-Ala-Gln-Cys-His-His-Thr-Val-~~lu... .Phe-Lys-Thr-Arg-Cys-Glu-Leu-Cys-His-Thr-Val-Glu.. Rubrum . ..Cys-Leu-Ala-Cys-His-Thr-Phe-Asp-Glu-Gly

Mall Cattle Hone

Serum

Silkworm Yeast Rhodospirrillum

Cytochromes

-.--..

..-_



.-.

Cattle

Man

{,z.;‘--

.~.

(Lorand

hormones

Fibrinogens

Growth

Ala-Asp-Lys

.-

1958;

t 2 chain<

& Li,

& Middlebrook,

(Parcells

1953)

Li

et al.,

1958)

1

z

(Hirs

3

4

37

88

92

93

1x8 *rq I20 12x IZZ IZ3

-Val-His-Phe-Asp-Ala-Ser-Val.OH

124

-.41a 94 95

Cys 96

10

II

I?

13

rq

68

to

-Thr 100 99

69

72

Arc-.4la--Gin-Lys-His-Ilu101 102 104 103

i1

I,

18

fhr-Phe-Val--HI% 45 46 47

16

48

19

21

zz

23

Iog

106

Ilu-%-Ala 107 108 rag

53

24

H3

26

2,

Cys-Asn-Gin

84

115

85

Val--Gln-Ala-Val54 55 56

zj

Cys-Glu-Gly-Asn-ProTyr-Val-Pro 113 I,O III 112 II4

82

-Ser-Ser-Ser-Asn--Tyr~~

C;lu-Ser-Leu-Ala-~Aspp49 40 51 52

20

~~(;ln-Ser-Tyr-Ser-Thr-~~ct-Ser--llu--Thr--Asp-Cy.~-Arg-(;luSer74 75 76 77 73 79 80 81 73

44

15

Thr -Ser-Ser-Asn-Hls-~Iet-Glu-Ala- Ala

Shwp (Anfinsen rt al , 1959) -Asp-Arg-Cy< -Lys-Pro-Val--.Aqm 40 41 42 43 38 39

g

67

I-

C’lu

6

Tyr-Lys-Thr 98 97

36

,

--Thr-Cly-Ser-Lys-Tyr-Pro-Asn 89 90 91

35

6

~-Cys-Ser-Glu-Lys-Asn-Val-Ala--Cys-Lys-Aan-Gly-Thr-Asn-GIn--Cys--Tyr 60 61 62 63 64 65 66 58 sq

s

(Anhnsen el al, 1959) --Ala--L1,~--Phe-Glu-Arg-Ser

1960)

Sheep -Ala-Ala

et al.,

~-Met--Met--Lys--Ser--.4r~-A~n--Leu..-Tl~r-:~~ys29 30 31 32 33 34

Cattk

Ribonuclease

I16

86

57

28

117

87

mosaic

virus protein

90 90

91 91

92 92

93 93

1111

94

o

40 40

94 95

95 96

I?j 126

96 97

98 99

gg IwJ

Ala,

42 42

126 12,

I26 I?0

.-Thr.OH Thr.OH

I2j 123

44 44

tilx

(Anderer (Tsugita

129 13”

100 101

130 rjr

101 103

Thrl Thr

4s 4.5

-Val --Gin 47 47

103 104

lo.* roj

IOj 106

50 so

-Val-Try-Lys-Cal-Try-L?s-I’ro-5er51 52 53 51 52 53

107

7 I04

10s 109

IO!, II0

II0 III

111 II?

141 142

II2 113

142 143

II3 ‘14

143 I?+

xr4 I15

I44 14s

115 II6

145 146

146 147

116 II,

88 88

o -Gly -Ala--Phe-Asp84 85 86 87 85 86 87 83

59 59

Pro--Plo--Ser-G(;ln-Cal-Thr-Val Pro-Gin-Val-Thrm-Val 54 55 56 57 56 54 55 56 57 58

Thr-Leu-Asx;;;-FF-Asp--Ala-Thr-Arg-Arg-Val-Asp-Asp-AlaAla-Thr-Arg-Arg-Val-Asx-Asx-Ala-

Phe -Ser-Gin ml’he-Ser-Gin 4s 49 48 49

Thr \la--Glx-Glu Thr---\lam

‘If, 46

-Arg-Gin

et a!., 1960) Acetylcf d, 1960) Acctyl-

II ~,-.4r~--(;ly-Thr-Gly-Srr---Tvr-.4sn-Arg-Ser-Spr-Ph~-Gl~-S~r-S~~. -Lcu -L~~r-Ar~~C.ly-Thr-G;lySer-Tyr-.4~n--Arg-Ser~Ser-Phe--Glu-Ser-Ser-. I.34 135 I37 139 I40 131 131 I33 136 13s 139 141 132 133 134 I35 I36 137 13s 140

101 102

Am, --I++ I’lo, .4in

43 43

‘I‘MV TMV

-Thr-Val-

-Asn--Ltu-Ilu--\‘al--(,iu-Leu-

97 99

Mutant (Trugna & Framkel-Conrat. 1960) ! z”-; ,---I~ HR (Niu & FraenkelXonrat, 1955) ! (Tbr, Ala) 8’ ( -Se+.Gly-Leu-Val-Try-Thr-;-Ser-Glym/ -Prom -Ala -Ser--Gly-Lcu-Val-Try-Thr-.+r~Gly;l-Prw -Ala.-

Strain

41 41

Gls -Val--Gls AQX (Clu, -C,lt1-~al-GG111-.4w--(;ln~--Ala-

--Thr -Val-Ala-Ilu-.4r~-Ser-4la-r\sxllu -Thr-Val-Ala-llu-Ar~-Ser-Ala-Asp~Iiu--Asn-Leu-Ilu--~‘al--Glu 1x7 IIR II9 I20 IZI I22 r*j 124 IIR I20 I21 I12 IT9 123 124 125

89 89

-Thr-Arg- Asn-Arg-Thr-Arg~Asn-ArR-llu--lilr

-La-Gly-Am-Gin-Phe-Glx-Thr-GZu-Gin-Ala-Arg -Leu-Gly-Asn-Gln-Phe-Gln-Thr~~~-G~-Ala-Ar~-~~hr-~‘al-Gln-Val-Ar~ 31 32 33 34 35 36 37 38 El 31 32 33 34 35 36 37 38

Tobacco

60 60

Seal (Pbora

tdlullina)

(Ingram

homology

Pinback Whale Sperm Whale

Horse

Myoglobins

TO preserve

Kendrew

toriginally

o --LFU--Ser

-Pro-

et al.

published,

1954)

later

of human

corrected

-o-

Ala-

sequence

hemoglobin,

(His-val-Lru-l,eu-Thr--Pr~Glu-Glu-Lys)t

H.Val-

the a and j3 chains

[ __~ H Gly (

GE&;“.

1955;

between

Man, achain (Braunitzer et&. rg6ob) Man, Bchain (Braunitzere(al., xg6ob) IIbS (Hunt & Ingram, 1959) HbC (Hunt & Ingram, rg6o) HbG (Hill & Schwartz, xgsg) Horre (Porter & Sanger. xg4g)j3 Cattle (Porter & Sanger, 1949)

Hemoglobins

hemoglobin

where delctmns

of human

inserted

(Ingram.

24 23

26 2s

25 24

26

27

2s

2,

29

28

30

29

HbJ (Braumtzcr el al., xg6oc)’ .&II I Lys---Ala--Ala -TryCly-I-Lys-I-Val-Gly-Ala-His~Ala Lrtl Try -Cly-Lys- ~-. Val-Asp-Val--Asp31

30

32

31

A (Ingram,

are prewmed Igjg;

Hunt

to occur. & Ingram.

1g5g)

-Ala-Ser Glu-Arg -Met-Phr -Tyr-Gly-[Ala - -Glu) --Val-GlyGly -Ixs;T;-! --Ala ~~Lru~-C,Iy -Arg-Lru.. 1959; Braunitzer el al.. xghaa) I-..-. l.ys ~

-Thr--Asp-oval---

I” the p chain

has bee”

HbE

Asp-Lys

0 -

Gly0

252

MARTYNAS

YCAS

bility that the a and ,8 mutation rates are significantly different, and thus could be distinguished on this basis. Suppose, as an example, that a mutation is the more frequent. Then replacements arising during some short period of time and differentiating originally identical proteins would be mainly of a type prohibited to /3 mutation. However, if the period of observation were lengthened, /3 mutations would begin to accumulate, so that eventually observed differences between two originally identical proteins would involve replacements which were randomly distributed. Providing only that the rate of /3 mutation were not zero, randomization of differences would always result. The same result would be eventually attained if fi mutation were more frequent, with, of course, an initial preponderance of differences prohibited by a mutation. Thus, observation of proteins originally identical and differentiating for varying lengths of time should make it possible to determine if the distinction between a and /3 mutations is real, providing that the rates of these two types of mutations differ. Because of technical difficulties, replacements arising as a result of observed mutations are not known. However, a considerable body of evidence on sequences of amino acids in homologous proteins from different species exists (Table I). Assuming the truth of the theory of evolution, hereditary differences must be assumed to arise as a result of mutations. The taxonomic distance between species can be used to measure, roughly, the length of time the proteins of two species have been diverging from a common ancestor, so that it is also possible to classify homologous proteins as “recently” and “anciently” divergent. As the rate of change of proteins appears to be very slow “recent” and “ancient” is to be measured in units of geological, rather than ordinary time. Before comparing protein sequences, it is necessary to clarify the In the usual sense it refers to proteins meaning of the term “homologous”. of the same general structure and function in different species that have identical sequences except at a few positions, e.g. insulin from man and pig. Clearcut cases of this type will be referred to as class A homology. However, some proteins, while obviously homologous in the sense of having similar sequences, do not exactly fit this definition. For example, oxytocin and vasopressin have seven of nine amino acids identical, but occur in one organism and have different functions. The same is true of the corticotropins, two forms of which are found in the pig, both similar and having an amino acid sequence closely resembling the melanophorestimulating hormones. These instances will be regarded, rather arbitrarily, as class A homologous for present purposes. A homology can also exist between different chains of a single protem

0 a

17-2

254

MARTYNAS

Y 6AS

(class B). In a previous publication in collaboration with Gamow and Rich (Gamow, Rich & YEas, 1956) I have drawn attention to the similarity between the two chains of insulin, as well as to the tendency for the end groups of different chains of single proteins to be identical (Yeas, 1958). As an explanation it was suggested that the several chains were originally one protein specified by a single genetic locus. By reduplication of genetic material the chains began to be controlled by different loci, which gradually diverged by genetic drift. At that time the evidence for this suggestion was slight, but the recent work of Braunitzer, Liebold, Miiller & Rudloff (r96ob) has, at least in part, vindicated this position. They have shown that the a and p chains of human hemoglobin are so similar that a common origin is clearly indicated. As the hemoglobin of the lamprey, presumably representative of the cyclostome ancestor of higher vertebrates, is single chained (Allison et al., 1960), the two chains of hemoglobin must have begun to diverge since about the Devonian or Silurian period, and are an example of homologous proteins long divergent. Since fish insulins are also two-chain proteins (Yamamoto, Kotaki, Okuyama & Satake, 1960) the chains must have diverged at a comparable, probably even earlier period, as might indeed be concluded from the relatively slight similarity remaining between them. Of course the mere presence of two chains must not, by itself, be taken as evidence that the divergence of the chains is ancient. Thus the growth hormone of man is single-chained, while that of cattle is, apparently, double (Table I). Unless the single chain condition is the result of loss, this double-chain structure must be a relatively recent deve1opment.t On the basis of these conclusions I classify known protein sequences into two groups: recently divergent, mostly mammalian so the divergencies are not older, and preponderantly much younger, than the mesozoic; and anciently divergent, which includes the two chains of insulin and hemoglobin and homologous sequences of cytochrome C from Rhodospirillum rubrum, yeast and cattle. These cannot have diverged later than the Devonian, and in the case of the cytochrome certainly vastly earlier.: t An examination of homologous sequences in Table I shows that, besides the wellknown phenomenon of replacement of one amino acid by another, a deletion of residues can also occur, as first pointed out by Sorm and his collaborators (1957). This is clearly demonstrated by the sequences of the two chains of hemoglobin. The known sequences of melanophore-expanding hormones appear to be fragments of an originally longer sequence, which has suffered breakage at various distances from both ends. A C-terminal deletion also seems to have occurred in fish insulin. $ Although in principle data on amino acid sequences are reproducible at will, in practice the great labor involved in obtaining them makes an independent verification difficult, so that most of such data must be accepted on the basis of the work of a single laboratory. Some parts of the sequence of ribonuclease, for example, depend indeed on a single experimental determination (Hirs, Moore & Stein, 1960). It is therefore legitimate to ask to what extent such data may be contaminated by experimental error. No general

REPLACEMENT

OF

AMINO

ACIDS

IN

PROTEINS

25s

It is now possible to consider whether there is, in fact, any difference between the distribution of replacements in recently and anciently divergent proteins. Replacements are considered to be all permutations of different amino acids which occupy one position in homologous proteins, listed in Table I. Thus position 9 in the A chain of insulin is evidence for one replacement, serine-glycine; position 30 in the B chain for three, serine-threonine, If one plots serine-alanine and threonine-alanine. replacements in recently and anciently divergent homologous proteins on a grid, letti.ng the hatched areas represent replacements prohibited as a result of an a mutation, one finds the distributions shown in Fig. 2. if replacements were random, they would be distributed between the two areas approximately in the ratio of 149 : 41. In fact, replacements differentiating the recently divergent proteins arc distributed quite differently. Of 62 such replacements, only two (glutamicglycine, hemoglobins A and G and alanine-valine, serum albumins) occur in the hatched area Fig. 2~). The probability that such a distribution is a chance deviation from what is actually a random distribution is very small (x2 = 12.3), so that Fig. 2a provides strong evidence that avoidance of replacements prohibited as a result of a mutation is rea1.t On the other hand, the distribution of 44 replacements differentiating anciently divergent proteins is in the ratio of 32 : 13 between the two areas (Fig. zb), w h’1ch .1s c 1ose to a random distribution (x2 = I .4). Comparing the two distributions, one can draw the following conclusions: answer can be given to the question, but some idea of the possible size of error can br obtained by considering a few cases where experimental cross-checks have been made. Thus the peptide, now known to be N-terminal, of human hemoglobin A was originall> stated to have the sequence shown in parentheses in Table I (Ingram, 1959) and later corrected to the sequence shown below (Hunt & Ingram, 1959). The differences are by no means small. As another example, the sequence of Tobacco Mosaic Virus protein as determined in two different laboratories is also shown in Table I (Anderer, 7.Thhg. Weber & Schramm, 1960; Tsugita et al., 1960). In this case it is not clear to what extent the differences may be due to the use of slightly different strains of virus, but certainl! at least some must be the result of errors in one laboratory or another. As these examples are taken from the published work of established investigators with a reputation in thei;field, it is clear that a considerable number of errors may exist in the published sequences, introducing what one must consider as “noise” in the data. Such “noise” would, of course, tend to randomize the apparent distribution of replacements. t If the occurrences of any replacement were equally probable, o’ZIj8 of all replacements should be found in the hatched area. However, the actual probability of any given replacement should be proportional to the product of the abundances of the two amino acids in the protein sample. Taking the average abundance of amino acids to be thar given in Table V of my 1958 paper, the sum of such products occurring in the hatched area is O’ZZIZ, virtually identical with the number given above. Replacements forbidden as a result of a mutation are, therefore, not a class whose occurrence is a priori less probable on statistical grounds. On a random basis the probability that of 62 replacements two or less will occur in the hatched area is 0~000035.

MARTYNAS

256

YCAS

I. Replacements differentiating the recently divergent proteins strongly avoid transitions which the coding hypothesis prohibits as a result of mutation in the a component. 2. Randomization of replacements differentiating the anciently divergent proteins indicates that the p component can also mutate, but at a much slower rate than the a component. 3. The random distribution of replacement in Fig. 2b speaks strongly against the view that the non-random distribution in Fig. 2a is due to a selection process, which makes certain transitions unobservable because they result in an inviable organism. This suggestion is in any case implausible, since many replacements plotted in Fig. 2n involve amino acids of very different types, such as glutamic-lysine, or arginine-serine. The concrete assignment of replacements into two categories was based on considerations that had nothing to do with the replacement process (YEas, 1960), so that the observed avoidance of replacements prohibited by a mutation is strong independent evidence in favor of the original coding hypothesis. Since the hypothesis was deduced from data on the composition of viruses, while the data on replacements is mainly from non-viral proteins, it is further evident that a ribonucleic acid analogous in function to that of a virus must be presumed to be present in uninfected cells as part of the normal protein specifying mechanism. Identification of the /3 component remains a problem for future research. This work has been supported by Grant G-9753 from the National Science Foundation.

REFERENCES ACHER, ACHER,

Acta, ALLISON,

(1960). ANDERER, ANFINSEN,

R. (1960). Annu. Rev. Biochem. 29, 542. R., CHAUVET, J., LENCI, M. T., MOREL, F. & MAETZ, J. (1960). Biochinz. hiophys. 42, 379. A. C., CECIL, R., CHARLWOOD, Y. A., GRATZER, W. B., JACOBS, S. & SNOU., S. S. Biochim. biophys. Acta, 42, 43. F. A., UHLIG, H., WEBER, E. & SCHRAMM, G. (1960). Natwe, Lo&. 186, 92~. C. B., AQUIST, S. E. G., COOKE, J. P. & JONSSON, B. (1959). 3’. biol. C’hem. 234,

1118. G., HILSCHMANN, N. & MOLLEH, R. (1961x). Hoppr-Seyl. Z. 318, G., LIEBOLD, B., MUELLER, R. & RUDLOFF,~. (196ob). Hoppe-Seyl. Z. G., RUDLOFF, V., HILSE, K., LIEBOLD, B. & MOLLER, R. (1960~). Seyl. 2. 320, 283. Du VIGNEAUD, V., LAWLER, H. C. & POPENOE, E. A. (1953a). J. Amer. &em. Sot. Du VIGNEAUD, V., RESSLER, C. & TRIPPETT, S. (195313). r. biol. Chem. 205, 949. ELLIOT, D. F. & PEART, W. J. (1956). Nature, Lond. 177, 527. FELIX, K. (1952). Expeuientia, 8, 3 12. GAMOW, G., RICH, A. & YEAS, M. (1956). In “Advances in Biological and Physics” Vol. IV, 23. Academic Press, New York. BRAUNITZER, BRAUNITZER, BRAUNITZER,

284. 320,170. Hopp~7.5, &Xo.

Medicai

REPLACEMENT HILL, R. HIRS, C. HUN,T, J. H~xT, J. INGRAM, INGRAM, ISHIHARA, KENDRE~,

L. H. A. A. V. V.

OF

AMINO

ACIDS

IN

PROTEINS

H. C. (1959). Nature, Land. 184, 641. J. & STEIN, W. H. (1960). r. biol. Chem. & INGRAM, V. M. (1959). Nature, Lond. 184, 640. & INGRAM, V. M. (1960). Biochim. biophys. Acta, 42, M. (1955). Biochim. biophys. Acta, 16, 599. M. (1959). Brit. med. Bull. rg, 27. Y., SAITO, T. Pr FUJINO, M. (1958). Nature, Lond. 181, J. C., PARISH, R. G., MARACK, J. C. & ORLANS, E. S.

257

& SCHWARTZ,

W.,

MOORE,

235, 633. 409.

1468.

(1954). Nature, Lond. 174, 946. L,I, C. H., PARCELLS, A. J. & PAPKOFF, N. (1958). r. biol. Chem. 233, I 143. L,ORAPITI), L. & MIDDLEBROOK, W. R. (1953). Science, 1x8, 515. MONIER, R. & JUTISZ, M. (1954). Biochim. biophys. Acta, 15, 62. NIP, C. I. 2% FRAENKEL-CONRAT, H. (1955). Arch. Biochim. Biophys. 59, 538. PARCELLS, A. J. & LI, C. H. (1958). r. biol. Chem. 233, 1140. PETERS, T., LOGAN, A4. C. & SANFORD, C. A. (1958). Biochim. biophys. Acta, 30, 88. PORTER, R. R. & SANGER, F. (1949). In “Haemoglobin”. (F. J. W. Roughton and J. C. Kendrew, Eds.) Interscience, New York. SANGER, F. (1960). Brit. med. Bull. 16, 183. SKEGGS, L. T., LENTZ, K. E., KAHN, J. R., SHUMWAY, N. P. & WOODS, K. R. (1956). J. e.xp. Med. 104, 193. FORM, F., KEIL, B., HOLEYZOVSK+, V., KNESSLOV,~, V., KoSTKA, V., M~~SIAR, P., MELOUN, B.. MIKE:, O., ToM~~~EK, V. & VANE~EK, J. (1957). Co/l. True,. chim. Tchkosl. 22, I 3 I o. ‘THOMPSON, E. 0. P. (1954). J. biol. Chem. 208, 565. TSUGITA, A. & FRAENKEL-CONRAT, H. (1960). Proc. nut. Acad. Sci. Wash. 46, 636. TSIJGITA, A., GISH, D. T., YOUNG, J., FRAENKEL-CONRAT, H., KNIGHT, C. A. & STANLEY, TV. M. (1960). Proc. nut. Acad. Sci. Wash. 46, 1463. TUPPY, H. (1959). Naturzuissenschaften, 46, 35. WALIXCHMIDT-LEITZ, E. (1953). Angeev. Chem. 65, 349. WHITE, W. F. & LANDMAN, W. A. (1955). J. Amer. them. Sot. 77, 1711. YAMAMOTO, M., KOTAKI, A., OKUYAMA, T. & SATAKE, I;. (1960). r. Biochem. 48, 84. YEAS. M. (1958). “The Protein Text”. Symposium on Information Theory in Biology. (H. P. Yockey, Ed.) Pergamon Press, New York. YCxs, 1LI. (1960). Natuve, Lond. 188, 209.