Cloning of human gene encoding prostaglandin endoperoxide synthase and primary structure of the enzyme

Cloning of human gene encoding prostaglandin endoperoxide synthase and primary structure of the enzyme

Vol. 165, No. 2, 1989 BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS Pages 888-894 December 15, 1989 OF H U M A N GENE E N C O D I N G P R O S...

379KB Sizes 0 Downloads 62 Views

Vol. 165, No. 2, 1989

BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS Pages 888-894

December 15, 1989

OF H U M A N GENE E N C O D I N G P R O S T A G L A N D I N

CLONING

SYNTHASE AND PRIMARY

Chieko

Department Research

ENDOPEROXIDE

S T R U C T U R E OF T H E E N Z Y M E

Yokoyama

and

Tadashi

of Pharmacology, National Institute, Fujishiro-dai,

Tanabe*

Cardiovascular Suita, Osaka

Center Japan

565,

Received October 27, 1989

SUMMARY: The complete amino acid sequence of human prostaglandin endoperoxide synthase (EC 1.14.99.1, cyclooxygenase) was deduced by cloning and sequence analysis of human genomic DNA coding for the enzyme. The isolated clones covered approximately 40 kilobase pairs of human gene and the protein coding region of the enzyme was distributed into eleven exons, which encoded 599 amino acid residues with a calculated molecular weight of 68,548. Human prostaglandin endoperoxide synthase exhibited 91 % amino acid identity with the sheep enzyme. ®~9~9Ao~d~micpress, ~nc.

Prostaglandin fatty PG

acid

(PG)

cyclooxygenase,

biosynthesis

steroidal

endoperoxide

synthase

has

cDNA

(7-9).

acids

with

been

a

sequence

whom

0006-291~89

of

the

(i0),

all

mature

showed

has

cloned

determined

signal

cloned

was

been

a

enzyme

neither

and

enzyme

protein

is

Though

reported.

correspondence

cDNA

nucleotide Several

should

$1.50

Copyr~ht© 1~9 ~ Aca~micPress,~c. AHr~h~ ~repro~c~onina~rmreserved.

as

primary

66,175.

888

be

of for

in

fatty

by

nonand

sheep

PG

structure

of

the

sequence

of

the

of

The

deduced

the

enzyme the

human

sequence

nor

addressed.

4).

aspirin

composed

groups

the acid

(3,

encoding

the

form

enzyme

inhibited

such

nucleotide

of

its

is

the

precursor

peptide.

key

1.14.99.1,

activity

cDNA

from

weight

the

hydroperoxidase

Recently,

molecular

also

24-residue

*To

PGG

(EC

exhibits

drugs

6).

The

is

enzyme

anti-inflammatory (5,

structure

and

synthase

synthase) The

activity

indomethacin

been

2).

activity

cyclooxygenase

enzyme

PGH

(i,

cyclooxygenase The

endoperoxide

have

576

amino

amino

acid

including enzyme the shown

a has

primary that

Vol. 165, No. 2, 1989

hormones, of

PG

BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS

growth

factors

endoperoxide

which

is

in

by

epidermal factor

(ii,

suppress growth

in

12,

(EGF)

smooth

muscle

the

associated

mouse

the

other

hand,

enhanced

B transforming

cells

from

with

in

levels

type

cAMP,

enzyme On

RNA

synthesis

(11-15). is

16).

and

the

cells

of

messemger

factor

cultured

cultured

levels

cells

induce

stimulation,

RNA

MC3T3EI

corticosteroids

in

B-adrenergic

messenger

osteoblastic

interleukin-I

synthase

produced

increase

and

rat

by growth

thoracic

aortas

(15). In

this

paper,

characterization and from

the

of

primary

nucleotide

we human

report gene

isolation

encoding

structure

of

the

sequences

of

ii

human

PG

and

partial

endoperoxide

enzyme,

which

synthase, was

deduced

exons.

MATERIALS AND METHODS

[~-32p]dCTP 110 TBq/mmol) purchased from Amersham International. Restriction endonucleases, T4 DNA l i g a s e , exonuclase III a n d mung b e a n n u c l e a s e were obtained from Toyobo Co., the large fragment of E. coli DNA p o l y m e r a s e I (Klenow fragment) from Takara Shuzo Co., 7-Deaza-Sequenase kit (version 2.0) from United States Biochemical Co. and BluescriptII vector from Stratagene. T h e h u m a n g e n o m i c DNA l i b r a r y i n EMBL3 ( J a p a n e s e peripheral blood) generated by partial digestion with Sau3AI was kindly provided by Dr. Sakaki (Kyushu University, Fukuoka, Japan). The library was screened with restriction fragments of cDNA f o r s h e e p PG e n d o p e r o x i d e synthase as described3~previously (7). The double stranded probes were labeled with [a- aP]dCTP by random priming method (17). Screening of the library was performed with standard procedures (18). Isolated genomic DNA clones were characterized by restriction mapping and suitable restriction fragments were subcloned into pBluescriptII SK+ vector for further restriction mapping and sequence analysis. Restriction fragments carrying exon sequences were identified by Southern hybridization analysis u s i n g t h e s h e e p cDNA f r a g m e n t s as probes. For sequence analysis of subclones with long inserts, a series of overlapping DNA s e q u e n c e s were prepared using the exonuclease III/mung bean nuclease system. Nucleotide sequence was determined from the double stranded plasmid using the dideoxy chain-termination method (19) as described (20).

RESULTS

We sheep the

have

PG pPESI02

previously

endoperoxide (nucleotides

AND

DISCUSSION

reported synthase

the (7).

111-239),

889

isolation Among deleted

of

the

cDNA

sheep

fragment

clones cDNA of

for

clones, pPES40

by

Vol. 165, No. 2, 1989

Bal

31

nuclease

(nucleotides

(nucleotides

758-1442)

mixture

was

library

in

the

BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS

used

largest

hybridized

to

the

to

the

cDNA

SmaI

the

fragment

The

"gene

of

(1.4

the

hybridization

the

clones

IhPESI3

shown

to

Because

xhPES20 K

E KS

I

, I I(

----HI

PSE

i I

as

probe.

One

and

of

4

E

II

abc d e f g h

I

ATG

the

of

the

6-kb

K p_nI

Southern

IhPESI3 sheep

two

blot did

not

cDNA,

we

I

S

S

B

.~1 ]( I, I..~

I

i

further

5'-terminal

by

xhPES29

I

of

of

the

the

not

'1

BEEE

.. I.

I Jr,

I

with

xhPES27

t k

I

done

of

did

the

was

exons

and

-iii-i0)

sequence

fragment

xhPESl5

IhPESI3

covering

fragments

EcoRI,

hybridized

terminal

carry

any

3'-downstream

i

of

4.l-kb

and

weakly

characterized

with

pPES40

(nucleotides

amino

was

was

also

fragment

SalI

pPESI02,

insert

approach of

IhPES20 clone

the

clone

and

with

of

genomic

characterized.

4.7-kb

fragment

the

kb)

analysis. to

IhPES13

isolate

walking"

clones,

of

their

human the

was

pPES31

and

the and

i),

fragments

To

harboring

of

(kb),

However,

of

[~-32p]dCTP

pairs

SmaI

fragment

isolated

(Fig.

6.5-kb

(pPESI02).

region

positive

were

cDNA

5'-terminal

fragment

hybridize

sheep

SmaI

screening

digestion

fragment.

hybridize

enzyme,

clones

by

the

pPES40

upstream

for

~hPESI3

respectively.

sheep

with

6.5-kilobase obtained

to

labeled

probe

Five

fragments,

pPES31,

a

insert,

Approximately

and

were

as

EMBL3.

116-701)

i

I

j

k TGA

,--, IOObp AATAAA

Fig. i. R e s t r i c t i o n m a p of the g e n o m i c c l o n e s w i t h the l o c a t i o n of exons. O n l y the r e l e v a n t r e s t r i c t i o n s i t e s are s h o w n in the map. A b b r e v i a t i o n s of t h e r e s t r i c t i o n sites are as follows : BamHI, B; EcoRI, E; KpnI, K; PstI, P; SmaI, S. T h e e x o n s are i n d i c a t e d by c l o s e d boxes. The connected e x o n s are i l l u s t r a t e d under the restriction map. The protein coding region is i n d i c a t e d b y o p e n boxes, the 3' u n t r a n s l a t i o n a l r e g i o n by a s o l i d line. T h e e x o n c o n t a i n i n g the t r a n s l a t i o n a l i n i t i a t i o n c o d o n is l a b e l e d a a n d f o l l o w i n g e x o n s a r e l a b e l e d b t h r o u g h k. The point of the t r a n s l a t i o n a l i n i t i a t i o n codon, ATG, t e r m i n a l codon, TGA, and polyadenylation signal, A A T A A A are i n d i c a t e d by a r r o w s . The SalI s i t e s in E M B L 3 w e r e u s e d for p r e p a r a t i o n of t h e inserts.

890

Vol. 165, No. 2, 1989

BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS

screened

the

fragments

(nucleotides

EcoRI

human

genomic 1002-1221,

(nucleotides

fragments

of

pPES28

of

~hPES29

were

genemic

DNA

clones

covered

as

exons

were

described

sequence

in

in

nucleotide

sequences

intron/exon

junctions

with

codon,

that

putative

of

sheep

of

as

the shown

was

assigned

codon

initiation

sites

by

appeared

from

residue

of

termination

The

amino

deduced

from

the

nucleotide

the

human

enzyme

form

of

with

a molecular

PG

acid

weight

endoperoxide

sheep

and

encoding

terminal

both

23-amino

sequence

of

9).

potential

Four

aspirin the sites

the

acetylation

sheep

enzyme

(amino

acetylation

is of

the

acid site

of

The

exhibited

91

nucleotide enzymes

acid signal

showed

a

as

site (7-9). residues

were

529)

the

found

(21). bp

codon,

TGA

%

in

the

sequences

for

103,

143,

409)

were

also

found

891

precursor residues of

human of

open

homology.

amino

not

synthase

that

the

the

typical

(data

acid

to

for

downstream

structure

a

the

A

the

amino

of

in

sequence

sequence

identity

88.5

and

translational

the

reading The

amino

characteristic sheep

glycosylation

Similar 67,

regions

727-732

shows

shown

Determined

surrounding

primary

polypeptide peptide

i.

that

599

%

coding

endoperoxide

sequences

asparagine-linked

(residue

PG

four

gene.

protein

Kozak

of

68,548.

clones,

fragments

The

suggested

composed

PstI-

those

human

consensus

at

human

sequence

synthase

enzyme

frames

sequence

i,

comparing

by

AATAAA,

shown).

of

2.

the

described

the

kb

sequence

signal,

first

Fig.

Fig.

Fig.

with

and

1433-]922)

coding

polyadenylation the

in

nucleotide

agrees

PstI

Two

in

The

shown

in

of

restriction

protein

are

The

40

METHODS"

exons

pPES33,

probe.

suitable

AND

cDNA.

initiation

eukaryotic

from

a

shown

approximately

Ii

ATG

as

As

mixture

(nucleotides

cDNA

obtained.

"MATERIALS

a of

EcoRI

sheep

sequenced

appeared

initiation

and

the

and

with

1223-1310)

1311-1432)

XhPES27

The

library

enzyme

sites acid

(7-

and

the

sequence

of

glycosylation and

the

aspirin

in

the

primary

Vol. 165, No. 2, 1989

BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS

5 ' ~ - - C C G C G C C A T G A G C C gt g a g t g c g a H S R

..........

ca tc tgccagGGAGTCTCTTGCTCCGGTTCTTGCTG S L L L R F L L

II

T T G C T G C T C C T G C T C C C G C C G C T C C C C G T C C T G C T C G C G G A C C C A G G G G C G C C C A C G C C A G g t a g g c g g c c. . . .

L

L

......

L

L

L

P

P

L

P

V

L

L

A

D

P

G

A

P

T

P

V

32

t c a c c c a c agTGAATCCCTGTTGTTACTATCCATGCCAGCACCAGGGCATCTGTGTCCGCTTCGGCCTT N P C C Y Y P C Q H Q G I C V R F G L

51

GACCGCTACCAGTGTGACTGCACCCGCACGGGCTATTCCGGCCCCAACTGCACCATCCgtgagctggg .......

D

R

Y

Q

C

D

C

T

R

T

G

Y

S

G

P

N

C

T

I

P

71

• " "gcccctgcagCTGGCCTGTGGACCTGGCTCCGGAATTCACTGCGGCCCAGCCCCTCTTTCACCCACTTCCTG

O

L

W T

W L

R

N

S

L

R

P

S

P

S

F

T

H

F

L

91

V

L

116

.......... c t c t c t gcagTGCGCTCCAACCTTATCCCCAGTCCCCCCACCTACAACTCT R S N L I P S P P T Y N

S

131

K

156

t a a a & t gg g . . . . . . . . . . c c c c a a c c a g G G A A G A A G C A G T T G C C A G K K Q L P

171

CTCACTCACGGGCGCTGGTTCTGGGAGTTTGTCAATGCCACCTTCATCCGAGAGATGCTCATGCTCCTGGTACTC L T H G R W F W E F V N A T F I R E M L M L L ACAGgtgggtgtgg T V

GCACATGACTACATCAGCTGGGAGTCTTTCTCCAACGTGAGCTATTACACTCGTATTCTGCCCTCTGTGCCTAAA A H D Y I S W E S F S N V S Y Y T R I L P S V GATTGCCCCACACCCATGGGAACCAAAGg D C P T P M G T K

P

GATGCCCAGCTCCTGGCCCGCCGCTTCCTGCTCAGGAGGAAGTTCATACCTGACCCCCAAGGCACCAACCTCATG D A Q L L A R R F L L R R K F I P D P Q G T N

L

M

196

TTTGCCTTCTTTGCACAACACTTCACCCACCAGTTCTTCAAAACTTCTGGCAAGATGGGTCCTGGCTTCACCAAG F A F F A Q H F T H Q F F K T S G K M G P G F

T

K

221

GCCTTGGGCCATGGGg A L G H G

t g a g t ac ct

.

.

.

.

.

.

.

.

.

.

ct g t c c a c a g G T A G A C C T C G G C C A C A T T T A T G G A G A C A A T V D L G H I Y G D

N

CTGGAGCGTCAGTATCAACTGCGGCTCTTTAAGGATGGGAAACTCAAGTACCAGgtagtgctgg .......... L E R Q Y Q L R L F K D G K L K Y Q

236 c 254

a t c c c acagGTGCTGGATGGAGAAATGTACCCGCCCTCGGTAGAAGA~GCGCCTGTGTTGATGCACTACCCCCGA V L D G E H Y P P S V E E A P V L M H Y P R 276 GGCATCCCGCCCCAGAGCCAGATGGCTGTGGGCCAGGAGGTGTTTGGGCTGCTTCCTGGGCTCATGCTGTATGCC G I P P Q S Q M A V G Q E V F G L L P G L M L Y A 301 ACGCTCTGGCTACGTGAGCACAACCGTGTGTGTGACCTGCTGAAGGCTGAGCACCCCACCTGGGGCGATGAGCAG T L W L R E H N R V C D L L K A E H P T W G D E Q 326 CTTTTCCAGACGACCCGCCTCATCCTCATAGgt g a g g a c L F Q T T R L I L I G

tc ..........

ccctgcccagGGGAGACCATCAAG E T I K

341

ATTGTCATCGAGGAGTACGTGCAGCAGCTGAGTGGCTATTTCCTGCAGCTGAAATTTGACCCAGAGCTGCTGTTC I V I E E Y V Q Q L S G Y F L Q L K F D P E L L F 366 GGTGTCCAGTTCCAATACCGCAACCGCATTGCCACGGAGTTCAACCATCTCTACCACTGGCACCCCCTCATGCCT G V Q F Q Y R N R I A T E F N H L Y H W H P L M P 391 GACTCCTTCAAGGTGGGCTCCCAGGAGTACAGCTACGAGCAGTTCTTGTTCAACACCTCCATGTTGGTGGACTAT D S F K V G S Q E Y S Y E Q F L F N T S M L V D Y 416 GGGGTTGAGGCCCTGGTGGATGCCTTCTCTCGCCAGATTGCTGGCCGGgt aagcccc G V E A L V D A F S R Q I A G R

t ..........

ctctcgg

cagATCGGTGGGGGCAGGAACATGGACCACCACATCCTGCATGTGGCTGTGGATGTCATCAGGGAGTCTCGGGAG I G G G R N M D H H I L H V A V D V I R E S R E

432 456

ATGCGGCTGCAGCCCTTCAATGAGTACCGCAAGAGGTTTGGCATGAAACCCTACACCTCCTTCCAGGAGCTCGTA M R L Q P F N E Y R K R F G M K P Y T S F Q E L V 481 Ggtgagcagct G

..........

ctccttgtagGAGAGAAGGAGATGGCAGCAGAGTTGGAGGAATTGTATGGAGAC E K E M A A E L E E L Y G D

496

ATTGATGCGTTGGAGTTCTACCCTGGACTGCTTCTTGAAAAGTGCCATCCAAACTCTATCTTTGGGGAGAGTATG I D A L E F Y P O L L L E K C H P N S I F G E S M

521

ATAGAGATTGGGGCTCCCTTTTCCCTCAAGGGTCTCCTAGGGAATCCCATCTGTTCTCCGGAGTACTGGAAGCCG I E I G A P F S L K G L L G N P I C S P E Y W K P

546

AGCACATTTGGCGGCGAGGTGGGCTTTAACATTGTCAAGACGGCCACACTGAAGAAGCTGGTCTGCCTCAACACC S T F G G E V G F N I V K T A T L K K L V C L N T

571

AAGACCTGTCCCTACGTTTCCTTCCGTGTGCCGGATGCCAGTCAGGATGATGGGCCTGCTGTGGAGCGACCATCC K T C P Y V S F R V P D A S Q D D G P A V E R P S

596

ACAGAGCTCTGAGGGGCAGGAAAG--- 3 ' T E L *

599

Fig. 2~ Protein coding region of the nucleotide sequence of human PG endoperoxide synthase gene and deduced amino acid s e q u e n c e of the e n z y m e . Exon sequences a r e g i v e n in u p p e r c a s e letters. T h e d e d u c e d a m i n o a c i d s are s h o w n u n d e r the n u c l e o t i d e s e q u e n c e a n d n u m b e r e d b e g i n n i n g w i t h the t r a n s l a t i o n a l initiation methionine. I n t r o n s are g i v e n in l o w e r c a s e l e t t e r s a n d I0 b a s e s are p r e s e n t e d at b o t h the d o n o r a n d a c c e p t o r sites. T h e 5' a n d 3 ~ untranslation&l r e g i o n s of e x o n s a a n d k (see Fig. i) are s h o w n by l i m i t e d n u m b e r of r e s i d u e s .

892

Vol. 165, No. 2, 1989

structure amino

of

acid

that

of

the

human

the

exon

the

human

sequence EGF

is

This

has

of

the

enzyme

(22).

TATA

have

box-like

these

are

not

that

the

enzyme

exist

by

in

ATG

is

element the

notion

this

have

region.

Recently, by

cAMP

(12,

14),

endoperoxide

of

of the

domain evolution

5'-

CCAAT

and

functions

of

synthase

it

reported

has

been and

suppressed that

responsive gene.

In

the

cAMP

element

may

order

transcription

the from of ONO on

the

gene

are

now

in

studies

on

its

We thank Dr. T. Watanabe for a critical reading of manuscript. This work has been supported in part by grants the Ministry of Health, and Welfare and the Ministry Education, Science and Culture of Japan, and by grants from Medical Research Foundation, Yamanouchi Foundation of Research Metabolic Disorders, Japan.

of

gene,

and

to

5'-

region

synthase

sheep

endoperoxide

16)

the

of

several

suggesting

hormone

region

PG

PG

region

sequence

However,

of

steroid

mechanisms

the

found

of

the

EGF-like

3-kb

and

domain

of

the

approximately

(13,

the

that

the

induced

5'-flanking

sequence

the

resemble

entire

during

present.

and/or

the

to

EGF-like the

of

the

flanking

of

the

codon

corticosteroids

by

with

transcription

at

the

encoded

part

shown

exon-shuffling

in

the

was

by

start

known

investigate regulation

occured

in

gene

responsive

support

terminal

that

33-71)

sequences

sequences

directly

noteworthy

analyzed

the

aminQ

enzyme

identical

may

enzyme

of

is

completely fact

The

sheep

(residues

the

upstream

the

It

of

We

enzyme.

of

(22).

enzyme c

enzyme.

BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS

progress.

ACKNOWLEDGMENTS

REFERENCES

i. 2. 3. 4.

Pace-Asciak, C.R., and Smith, W.L.(1983) in The Enzymes, Vol. 16, pp.543-603, (Boyer, P.D;, Ed.) Academic Press, New York. Needleman, P., Turk, J., Jakschik, B.A., Morrison, A.R., and Lefkowith, J.B. (1986) Annu. Rev. Biochem. 55, 69-102. Miyamoto, T., Ogino, N., Yamamoto, S., and Hayaishi, O. (1976) J. Biol. Chem. 251, 2629-2636. Ohki, S., Ogino, N., Yamamoto, S., and Hayaishi, O. (1979) J. Biol. Chem. 254, 829-836. 893

Vol. 165, No. 2, 1989

BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS

5.

Van der Ouderaa, F.J., Buytenhek,M., Nugteren, D.H., and Van Dorp, D.A. (1980) Eur. J. Biochem. 109, 1-8. 6. Mizuno, K., Yamamoto, S., and Lands, W.E.M. (1982) Prostaglandins 23, 743-757. 7. Yokoyama, C., Takai, T., and Tanabe, T. (1988) FEBS Lett. 231, 347-351. 8. DeWitt, D.L., and Smith, W.L. (1988) Proc. Natl. Acad. Sci. USA 85, 1412-1416, and correction p.5056. 9. Merlie, J.P., Fagan, D., Mudd, J. and Needleman, P. (1988) J. Biol. Chem. 263, 3550-3553. 10. Hla, T., Farrell, M., Kumar, A., and Bailey, J.M. (1986) Prostaglandins 32, 829-845. ii. Yokota, K., Kusaka, M., Ohshima, T, and Yamamoto, S., Kurihara, N., Yoshino, T., and Kumegawa, M. (1986) J. Biol. Chem. 261, 15410-15415. 12. Kusaka, M., Oshima, T., Yokota, K., Yamamoto, S., and Kumegawa, M. (1988) Biochim. Biophys. Acta 972, 339-346. 13. Pash, J.M. and Bailey, J.M. (1988) FASEB J. 2, 2613-2618. 14. Bailey, J.M., Makheja, A.N., Pash, J., and Verma, M. (1988) Biochem. Biophys. Res. Commun. 157, 1159-1163. 15. Raz, A., Wyche, A., and Needlman, P. (1989) Proc. Natl. Acad. Sci. USA 86, 1657-1661. 16. Oshima, T., Yoshimoto, T., Yamamoto, S., Yokoyama, C. , Tanabe, T., and Kumegawa, M.(1989) Proceedings of Japaneese Conference on the Biochemistry of Lipids 31, 356-359. 17. Feinberg, A.P., and Vogelstein, B. (1983) Anal. Biochem. 132, 6-13. 18. Maniatis, T., Fritsch, E.F., and Sambrook, J. (1982) Molecular Cloning: A Laboratry Manual, pp.312-328, Cold Spring Harbor Laboratry, Cold Spring Harbor, New York. 19. Sanger, F., Nicklen, S., and Coulson, A.R. (1977) Proc. Natl. Acad. Sci. USA 74, 5463-5467. 20. Furutani, Y., Notake, M., Fukui, T., Ohue, M., Nomura, H., Yamada, M., and Nakamura, S. (1986) Nucl. Acids Res. 14, 3167-3179. 21. Kozak, M. (1984) Nucl. Acids Res. 12, 857-873. 22. Toh, H., FEBS Lett., in press.

894