Arginine as an evolutionary intruder into protein synthesis

Arginine as an evolutionary intruder into protein synthesis

BIOCHEMICAL Vol. 53, No. 3,1973 AND BIOPHYSICAL RESEARCH COMMUNICATIONS ARGININE AS AN EVOLUTIONARY INTRUDER INTO PROTEIN SYNTHESIS Thomas Space ...

287KB Sizes 0 Downloads 77 Views

BIOCHEMICAL

Vol. 53, No. 3,1973

AND BIOPHYSICAL RESEARCH COMMUNICATIONS

ARGININE AS AN EVOLUTIONARY INTRUDER INTO PROTEIN SYNTHESIS

Thomas

Space Sciences of California,

University

Received

June

H. Jukes Laboratory Berkeley,

Calif.

94720

4, 1973

SUMMARY The arginine content of proteins is, on the average, far less than would be anticipated from the fact that 6 of the 61 codons for amino acids in the genetic code are for arginine. In contrast, lysine is more abundant than would be expected from its 2 codons. It is suggested that arginine replaced ornithine in protein synthesis during the evolution of the genetic code. The frequency is not

uniform,

between

and it

The with

distribution

of codons

responsible

basicity

preferentially

Subak-Sharpe

which

et al

organisms is

present

some extent

except

for

the

the amino

than noted

as indicating in four

rejected

reasons

acid

It

arginine

in evolution.

Copyright @ 1973 by Academic Press, Inc. All rights of reproduction in any form reserved.

that

a selective codons. It

The

709

was compared (1969).

an anticipated

are

primarily

that

lysine

l),

despite

basicity

lysine

(pK13

is

appears

fact

of arginine = 10.5).

against

Apparently,

used the

of CpG in the

process

therefore

such divergences.

case of arginine,

with

(Table

each

from

and Jukes

that

the scarcity

for

proteins

appears

of

codons

conspicuous

has 6.

correlation

for

by King

acids

in proteins

divergences

of 4.2% as compared

and arginine

(1966)

a fair

in 53 vertebrate amino

acids

of

to explore

proteins.

is much greater

is

number

in most proteins

has 2 codons

= 12.5)

certain

in

there

and the

each

are

to arginine

lysine

(PK'~

for

and lysine

for

that

amino

the possible

acids

to the extent

Arginine

different

of interest

was found

was present

10.7%.

is

of amino

A good correlation

that

It

acids

and to examine

the number

which

of amino

code.

correlation,

of the

has been noted

the abundance

in the genetic this

of occurrence

DNA of

this

arginine that

doublet, is

the

to concept

Vol.53,No.3,1973

that

"all

BIOCHEMICAL

is

for

the best

AND BIOPHYSICAL

in this

best

possible

RESEARCH COMMUNICATIONS

of all

codes"

may not

be

valid. Perhaps the

arginine

tRNAs for

code.

This

entered

another might

amino

happen

aminoacyl-ligase

system

enzyme chemistry

is

containing

formation

gives

ornithine

for

(homo-ornithine) purposes

might

system

in preference

functions.

in Table

The codons

further

would

to arginine

The proteins

lysine

for

important

in

the

statistical

a single

variable assumed

tend

or specificity

parameter number

are

chain. (S)

to arginine

into

replacing

affinity

than

proteins

could

at the same time, than

lysine

arginine amino

codons

for acid,

for

many ornithine.

such purposes

shown

lengths

and have different

the proportion

in Table

of arginine

and this

(Hill

et al,

1966).

containing

in which

The chains.

and constant from

chain

led

is

The

in arginine

results

of codons,

proteins

descended

The heavy

the

is

pathway.

resulted

lysine

1 are of varying

to have

as a molecule

with

acid,

1962).

and similar

to select

in

1.

and

shown without

treatment.

polypeptide

duplication

missing

tRNA-

dihydrofolic

of arginine but,

genetic

antagonists

have

have

more acceptable

the

acid

had a stronger

advantages,

codons,

total

The imnunoglobulins for

then

could

the

metabolic

would

arginine

have been

for

the

of such a phenomenon

a regular

The entry

special

as a substitute

The living

This if

tRNA.

certain

in

by charging

from

affinity

and Jukes,

evolution

synthesis

displaced

substrate,

(Delmonte

during

synthesis

of folic

the normal

arena.

ornithine

conveyed

affinity

to arginine

the biochemical in protein

have

rise

was thus

An example

greater than

of protein

had a high

in question.

pathway

ornithine

which

arginine

reductase

of this

entering

if

group

dihydrofolic

Ornithine

acid,

the far

a 4-amino

has for

the mechanism

a single

has an S region

of about

710

light

are

These

common ancestral this,

the

contain regions

protein light

of approximately the

responsible

and heavy,

(C) regions.

Following

S and C regions

two genes

same length

chain equal as that

are

by gene evolved length. of the

Vol.

53, No.

BIOCHEMICAL

3, 1973

TABLE

1.

ARGIE!ItIE CONTAIIJING

AND

AND

LYSINE

BIOPHYSICAL

CONTEt!T

50 OR MORE AMINO

RESEARCH

OF 83 rLASSES ACID RFSIDIJES.

-----

----_lm-L-.-.---

F’roteins, --..T ryps

--------_I-i nogen

-.-L-&‘ass , Rovi

3 -

Anti

-

trypsin

rhymot

5

plants

Leghemoqlobin

plants

- human

ryps

i nogen

Z-Lactoqlobul

in,

Bovine

AR

!lyoglobin Aloha

Globin, Cy tech

Chain

Hemonlobin Immuno~lohulins, Constant,

Kaopa,

Constant,

Lambda,

Variable

human

Immunoglobulins,

0.9

6.6

II.7

1.1

76.6

1.3

9.5 4.8

1.7

4.6

1.2 I.4

IQ.0

47.2

7

12

1.7

7.6

2

245

4.5

12.5

1.P

5.1

I

I62

3

15

1.0

1.3

152.9

2.?

I?.6

I.?

12.?

141 .I

3.0

11.6

2.1

8.2

136

3

In

7.2

107 145.2

7.4

15.6

2.2

3.3

II.3

2.3

7.4 14.6 7.1

2

106

2.5

7.5

2.4

7.1

2

1

4.9

7 3.7

I.0

18

IQ5 107.8

4.6

6.7 3.4

id

IOQ

4.n

3.2

3.6

2.9

& mouse

human

heavy

mouse & mouse

t

human

Lambda,

15

chains: human

Kappa,

Variable,

123

2

14

lr)

light

229

?

I

Chain

Lys

I 40

40

c “_ Beta

Residues Percent Am ---

of

I

10

ch i ronomus rome

Ave rane

--e--__ tJ0.

1

8

Hemoglobin

PROTEINS

2!GJ%!!~~A~rn~~~~

I

ne

Lactalbumin Ferredoxin

OF

_--_l--l____-

%7. of Proteins

Eukaryotic

COMMUNICATIONS

& mouse

chains:

Constant,

human

1

336

7

27

2.1

8.0

Constant,

rabbit

I

326

I3

19

3.P

Variable,

human

6

5.8

4.2

5.3

5.7 3.8

I

109.7 84

2

4

2.4

4.8

2

204

5

12.5

2.5

h.1

I

84

2

8

2.6

10.1

I

Protease

Inhibitor

-

Lima

Bean

Caseins Haptoglobin Carboni

Alpha c anhydrase,

Hemerythrin, inhibitor,

G lobin.

lamprey

soybean

Glyceraldehyde

3-PO4

Thyrotropin

fluscle

human

worm

T rypsin

Alcohol

Chain

CL and Dehydrogenase triose

PO4

Adrenodoxi

n,

human

bovine i dase

P rophosphol

i pase

, bovine

scorpion

A2,

pi g

a

7

18

2.7

6.9

113

3

II

2.7

9.7

I

7.0

7

71 146

2.R

4.5

I?

3.1

8.9

2

332.5

9.5

27

2.9

8.1

96

3

IO

3.1

IO.4

30

3.2

8

20

3.2

8.0 8. I

3

6

3.3

6.5

3

a

2

5

1

374 248

1

92.

1

118

4

5

307

11

15

3.4 3.6

4.2

1 1

130

5

9

3.8

6.Y

2.5

5.5

3.9

8.7

1

isomerase

gonadotropin,

Carboxypept

hormone

horse

Chorionic

Neurotoxin,

Dehydrogenase

luteinizing

240

I

2

711

63.5

I2

4.9

Vol. 53, No. 3, 1973

Proteins,

BIOCHEMICAL

_I--. Ribonuclease,

Keratin,

B

5

High-sulfur

protein

Trypsin

inhibitor,pancreatic

Trypsin

inhibitor,

Bee

3

pancreatic

rome

venom

Proinsul

h 2 2 ascaris

phosphol

1

ipase

A

1

3

in

Elastase,

porcine

Lipotropin

1

B and

y

2

Prolactin

nerve

growth

G lutamate Parathyroid

5

9.h

4.0

7.7

87.2

3.6

6.6

4.1

153.5

6.5

0

4.2

7.8 0.n

56

2.5

3.5

4.5

h.2.

66

3 6

7

4.5

10.6

10

4.7

7.@

11

2

4.8

2 .4

128

83.6 240

90.5

II 4.5

212

12

10

5.7

4.p

120

7

8

5.8

6.7

1

gon

30

32

6.0

6.4

1

84

5

9

6.0

10.7

61.2

4.2

5.3

6.2

8.4

9

6.3

7.0’

20

6.4

16.0

10.5

6.4

5.6

4

6.6

5.3

8

Histone

II

1

125

8

2

189.5

1

76

5

1

82

R chorionic

protein

Thyrotropin

f3 and -

Lysozyme Trypsin

luteinizing

hormone

6

basic,

Membrane

Trypsin

Inhibitor

Histone

I I I,

Histone

IV

Encephalitogenic calf

11

4

7.9

2.9

116.5

IO

6.5

8.4

5.6

5

129

11.6

6.2

9.n

4.w

1

protein

Maize

1

thymus

1 2

5% 170

65 135 102 9,656

TOTALS

__-Proteins,

*

139

1

bovine

12

2

human

vertebrates

inhibitor,

Myelin

A

gonadotropin,

4.5

11

128

ahloid

5.0

1

1

Hormone

1.2 11.0

1

chicken

Monkey

5.0

5.6

Avidin,

Growth

2 10 9

Hormone venom

LYS

IO.5

factor

ns , snake

Tit-n

194

Dehydrogenase

Neurotoxi

125

Lys

2

Papain Mouse

RESEARCH COMMUNICATIONS

No. of Ave raqe Proteins No. of In Class ~~~.~---~---------L.-_-_. Res i dues At-q

Eukaryotic

Cy tech

AND BIOPHYSICAL

6

4

In.?

6.9

18.5

12.5

10.9

7.4

8

1

12.3

1.5 ’

18

13

13.3

9.6

14.5

IO.5

13.7

IO.84

402.7

668.7

4.17

6.92

-_-_--------

Prokaryotic ------I_-

Rub redoxi

n

Cy t och rome Cy tech

rome

Cytoch

rome

50s

2 1

c2

3 c551 c3

Rihosomal

protein

Neocarzinostatin

A2,

5.

coli

Streptomyces

Thioredoxin Ribonuclease

T,

712

52.5 112

82

0

3

0.0

5.7’

n

17

0.0

15.2

0.3

1n.2

1

103

0.5

0.3

18.5

8.3

n.6

18.5 ’

1

12D

1

12

0.8

10.0

1

109

1

0

0.9

o.n*

1

108

1

10

0.9

9.3

1

in4

1

1

1.n

1.0

Vol. 53, No. 3,1973

light

chain,

tripling

and

of

mutations are

the

"it

specificity

minants" thus

accounting 1969).

greater

abundance

direction

of

higher

and

a difference

arginine

codons and

higher

in

arginine

of

randomly-occurring

rejected

by

between

the

the

-

clostridial

C regions.

S and

to

5

54.8

77

Penicillinase Ferredoxin, Thermolysin, Nuclease,

yellow

Staph.

mosaic

--aureus

Chromatium --Bacillus

thermoproteolyticus

staphylococcal

Cytochrome

B562

Tryptophan

Synthetase

Aspartate

A

transcarbamylase

Penicillinase, Coat

Protein

Alpha

lytic

Lysozym,

1

turnip

E.

licheniformir

- Tobacco protease,

R Chain

Mosaic

Vi rus

myxobact.

virus

1

I88

1

257

TOTALS

C

that

by there

is

-

At-4

Lys

n.?

0.8

-

Residues Percent At-? LyS

1.0

1.3

3 1.2

10

1.1

12.8

1.2

3.6 P.?

1

4

1.3

5.2

7 43

1.6

3 4

1.6

3.7 16.7

1

81

2

2

2.5

1

316

IO

11

3.2

3.5

1

I 4sr

3.4

15.4

1

23 16

2.5

110

5 4

3.6

14.5

1

267

11

13

4.1

4.9

1

152

8

10

5.3

6.6

1

265

24 1.6

5.7

9.0

6.1

1.3

5 1 2

bacteriophage

the

Average No. of Residues -I-

125.8

-

the

accepted

1 shows

274.5

protein

composition

than

mutations,

4 protein

the

whose

lysine

Azurin carrier

in

the

considerations,

2

Coat

a protein,

one

above

Table

C sequences,"

evolve

of

Subtilisin

Acyl

of

C regions.

No. of Proteins In Class

type

deleterious,

from

in

deter-

usually

it

lower point

available

antigenic

removed

the

and

a large

than

point

changes,

constancy

lead

From

most

immunoglobulins

are

the

lysine

a

evolutionary

regions

should

that

various

are

lower

of

have

with

S and

reasons.

----Ferredoxin

(C) of

Prokaryotic

Proteins,

cope

representing

1969

as to

constraints

functional be

to

variability

long, in

incorporated

constant

arginine

a result

S regions

such

of

should

the

functional

by

as

in

RESEARCH COMMUNICATIONS

regions

advantageous

the

If

constrained

regions

rapidly

antibodies

for

(Jukes,

S regions

are

"those

as

(S)(variable)

different

while

times

I suggested

inunologically

of

three

protein.

and is

AND BIOPHYSICAL

about

ancestral

advantageous,

assortment

the

a C region

the in

because

is

BIOCHEMICAL

157.4 198 160.5 3,630

15 9.6

2

6.1

1.0

12.5

12.5

7.9

7.9

106.3

262.5

2.93

7.23

12

Vol. 53, No. 3,1973

BIOCHEMICAL

The prokaryotic than

those

in Table

of the eukaryotes.

in contrast excretory

proteins

to prokaryotes, products

are somewhat

"less

1 average

Perhaps

this

can dispose

such as urea intolerant"

AND BIOPHYSICAL

is

of arginine

even

lower

because

of arginine

and creatinine,

RESEARCH COMMUNICATIONS

many eukaryotes,

by converting and therefore

than

in arginine

are

it

to

eukaryotes

the prokaryotes.

ACKNOWLEDGMENT Support by NASA grant Berkeley, is acknowledged.

NGR 05-003-460

to the

University

of California,

REFERENCES ;T;;lonr,LL.

and Jukes, T. H. (1962) Pharmacol. Reviews, 14:91. E. ( 1966) Delaney, R., Fellows, R. E., Jr. and Lebovitz,?r. Fro;. t&l. Acad. Sci. USA, 56:1762. Jukes, T.-(1969)calGenetics, H. 3:109. King, J. L. and Jukes, (1969)M, T. H. 164:788. Subak-Sharpe, H., Burk, R. R., Crawford, L. V.,xrrison, J. M., Hay , J. and Keir, H. M. (1966) Cold Spr.Harb..$mp;.Quant Biol., -31:737.

714