ARCHIVES
OF BIOCHEMISTRY
AND
BIOPHYSICS
Vol. 290, No. 1, October, pp. 1033115, 1991
Multiple Forms of Liver Microsomal Flavin-Containing Monooxygenases: Complete Covalent Structure of Form 2’ Juris
0~01s of Biochemistry,
Department
University
of Connecticut
Health
Center,
Farmington,
Connecticut
06030
Received April 4, 1991, and in revised form June 7, 1991
Hepatic flavin-containing monooxygenases catalyze NADPH-dependent oxygenation of a wide variety of drugs that possess a nucleophilic heteroatom. Two forms of these enzymes (form 1 and 2) have been isolated from rabbit liver microsomes and partially characterized (Ozols, J., 1989, Biochem. Biophys. Res. Commun. 163, 49-55). The complete amino acid sequence of form 2 is presented here. Sequence determination was achieved by pulsed liquid-phase and solid-phase sequencing of 40 peptides generated by chemical and enzymatic cleavages, including CNBr cleavage of tryptophanyl residues. Form 2 monooxygenase contains 533 amino acid residues and has a molecular weight of 60,089. The COOH terminus of this enzyme is very hydrophobic and presumably functions to anchor the protein to the membrane. Form 2 is readily degraded, since a form lacking residues 1 to 278 and a form without the COOH-terminal segment were also isolated from solubilized membrane preparations. The amino acid sequence of form 2 is 52% identical to that of form 1 and shows 55% identity to the sequence of rabbit lung monooxygenase derived from the cDNA data. The putative FAD and NADP binding segments around residues 9 and 190 are conserved in all three forms. Three variable segments can also be identified in these isoforms. These are residues 308 to 321, residues 408 to 421, and the membrane binding domain, residues 505 to 533. A comparison of the presently limited amino acid sequence data of flavin-containing monooxygenases (FMOs) implies that a particular FM0 in different mammalian species may be very similar, but isozymes within a species may exhibit more extensive variability with respect to homology and catalytic activity. This study documents the structural diversity of a second hepatic FM0 from rabbit liver and establishes this class of drugmetabolizing enzymes as a family of related proteins. 0 1991
Academic
0003.9861/91
Press.
Inc.
$3.00
Copyright 0 1991 by Academic Press, Inc. All rights of reproduction in any form reserved.
Studies from several laboratories have shown that mammalian microsomal membranes contain flavoproteins that catalyze NADPH-dependent heteroatom oxygenation of xenobiotics, including antipsychotic and narcotic drugs2 (1). In an earlier communication, I reported the isolation of two structurally related flavin-containing monooxygenases (FM0)3 from rabbit liver microsomal membranes (2). Partial amino acid sequence analysis indicated that these two forms, designated form 1 and 2, were derived from related but distinct genes (2). More recently, I reported the complete amino acid sequence of form 1 (3). This form showed about 90% identity to the recently reported cDNA sequence encoding the enzyme from hog liver microsomes (4). Sequences derived from cDNAs coding for the flavin-containing monooxygenases expressed in rabbit lung and liver have also appeared (5). Very recently, Yamada et al. (6) report the isolation of two forms of FMOs from liver microsomes of guinea pigs. These forms appear to be similar to forms 1 and 2 of the rabbit liver. While the NH2 terminus of the 56-kDa form was blocked, the NH,-terminal sequence of 19 residues of the 54-kDa form showed 90% identity to the rabbit form 2. In an effort to increase our understanding of this group of enzymes, I report here the covalent structure of form
r This work was supported by United States Public Health Service Grant GM26351. The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked “advertisement” in accordance with 18 U.S.C. Section 1734 solely to indicate this fact. ’ Portions of this paper (including Experimental Procedure, Figs. S1 to S-25) are presented in miniprint at the end of this paper. 3 Abbreviations used: HPLC, high performance liquid chromatography; HFBA, heptafluorobutyric acid; GNCl, guanidine hydrochloride; PTH, phenylthiohydantoin; TFA, trifluoroacetic acid, FMO, flavin-containing monooxygenases; SDS, sodium dodecyl sulfate; PAGE, polyacrylamide gel electrophoresis. 103
104 GUY LYs
LYs
Val
Ala
Ile
Ile
Gly
Ala
10 Gly
Ile
SW
Gly
JURIS
OZOLS
Leu
Ser
Ala
Ile
Arg
20 Cys LW
SW
Glu
Glu
-
Glu
Met
Ser
Asp
Asp
Ile
Gly
Gly
Leu
40 Trp
Ser
Thr
Ser
Asp
His
Ala
Glu
Glu
Gly
Glu
Pro
Thr
Cys
30 phe
Gln
Ser
Val
Phe Thr
60 Asn
Gln
Glu
Tyr
90 Ile
Asp
Phe
Ser
120 Val
Gly
His
His
150 Val
Gly
180 Ile
R-2 50 Arg Ala
Ser
Ile
Tyr
R-3
CB-2 Lys
Glu
Met
Met
Cys
Phe
Pro
70 Asp Phe
Phe
Pro
Phe
Pro
Pro
Asp
Phe
Pro
Asn
80 Asn kt
Ala
Arg
Glu
Lys
Asn
Leu
100 Leu
His
Asn
Ser
LYS Leu
v
T-6
I-
Thr
Phe
Leu
-
I Ser
Lys
Gly
Lys
Tyr
Ile
Gln
Phe
Lys
Thr
Leu
Val
110 Ser
Ser
CB-4
Ile
Lys
Lys
His
Pro I
R-4
t
Thr
Gly
Gln
Trp
-
Tyr
m-4;
Val
Ala
Thr
Cys
130 Arg
Asn
Gly
Lys
Lys
Glu
Pm
Asn
-
Ala
140 Phe Asp
Val
Ala
Val
Met
D-2
Leu
Pro
Lys
Asp
Ser
Phe
160 Pro
Gly
Leu
Lys
Gly
Lys
Arg
Lys
His
Val
CB-5;
Phe
Lys
Gly
Lys
Ser
170 Phe
Arg
Gln
Arg
CB-5;
Ile
Ser
-
Ser
Leu
Val
Ile
190 Gly
Phe
Leu
Lys
C&5-
Leu
Gly
Asn
Ser
Glu
Tyr
Lys
Gly
Glu
Asp
Ile
Ala
200 Thr
Pro
230 Trp
Glu
Leu
Ser
Ser
His
Thr
Ala
Asn
Leu
Asn
Gly
Gly
Ser
Trp
Val
Ser
Arg
T-18 t-Leu
Pro
Thr
Ala
250 Ile
Ser
Val
I CB-6
Asp
Trp
Thr
L&u
Arg
Lys
280 Glu
Thr
310 Glu
Pro
Val
Trp
Asp
Asp
Gly
Tyr
Asp
F!et
Leu
Tyr
Val
Thr
Lys
Glu
Phe
Lys
Glu
Phe
Thr
Ser
Thr
CB-9, Gly
Tyr
Trp
Tyr
Val
Lys
Gln
CB-9,
Ala
Gln
Leu
-
Ala
Glu
Lys
Pro
IO; E-6
Thr
Met
261) Asn Ala
Lys
Phe
Lys
His
Glu
Gln
Val
Lys
Gly
Phe Gly
Lys
Trp
Thr
Phe
340 Leu
Asp
Ala
Pro
Ala
290 Arg
Ile
Leu
CYS Gly
Thr
Val
Ser
Ile
Lys
Phe
Glu
Aso
Gly
Thr
320 Val
Phe
Glu
Ala
Asp
Ser
Val
Ile
330 Phe Ala
Asn
350 Asn
Trp
Ile
Asp
Ser
Ile
Ile
Lys
Phe
Leu
Met
Ala
370 Val
Ile
Gly
Leu
Ser
Thr
Pro
Gly
Ala
Arg
VCB-14; -R-l5 bR-I6 Ala -
Leu
Phe CB-I5
Ser
Thr
Cys
400 Thr
Leu
Pro
Ile
Gln
430 Thr
Asp
Tyr
Val
Gln
Ser
Pro
Val
Lys
Asp
Ile
Asn
Tyr
Met
E-3
Asp
Pro
Arg
Leu
460 Ala
Leu
490 Thr
Leu
520 Leu
Leu
Glu
Lys
Val
Thr
Gln
Glu
Leu
Phe
Lys
Gly
Leu
Gly
Ala
380 Ala
Ile
Pro
Thr
Thr
Asp
Leu
Gln
Ala
Ile
Leu
Trp
Ile
360 Pm
Phe
T-30
-
Ala
Arg
Met
Asp
Met
Asn
410 Asp
Ile
His
Glu
Glu
Phe Phe CB-14;
Gly
Pro DP-2
Lys
Met
Gly
Thr
Gln
Trp
Ala
Ile
___ R-18
ASD Arg
b---
Ala
Val
Ser
Leu
Lys
CB-14;W-5-
Leu
Leu
Ile
Lys
Leu
420 Lys
-RR-t3;E-2t----B-13-
Leu
440 Ala
Ser
470 Pro
Met
500 Lys
Ala
530 Val
Ser
Phe
Val
290 Trp
t--
Ile
Gly
Val
Lys
Leu
Asn
Ile
450 Pro
Leu
Val
Gly
Pro
Gly
480 Lys
m-14 Cys
Tyr
Gln
Phe Arq
-
W-4 ___
Pro
Glu
E-6 -
CB-II
t
Tro
Ile
+
CB-I3 Leu
300 Pro
R-12; D-8 ~
-RR-13; -
270 Leu
Leu
-CB-I2 Glu
Ser
Asp
t Ile
Tyr
Aso
R-13
Thr
Asn
CB-8
~
T-30
Ala
240 Thr
T-27
Tyr
Pro
Gln
10
Pro
Phe
CB-7
+CB-9,tO; Tyr
210 Val
I +
IO Gly
Arg
R-9
Phe Asn
CB-9, Val
Val
I- R-IO; E-2 -
I Asn
Gln
I- CB-5; D-7220 Met
+ Pro
Glu
R-7
V D-7 ~ Asn
Pro
R-6 + CB-5; D-3 -
R-IO
Met
Glu
D-3
Arg
CB-5;
Ser
-
D-2
I
-
Cys
R-5
CB-5
Phe
Ile
-
Tyr
Thr
Pro
Ala
R-17
Thr
Leu
Arp
Val
Ala
Val
CB-15
Phe.
Gly
R-15His
Leu
R-18
Gln
Lys
510 Pro
COVALENT
STRUCTURE
OF LIVER
MICROSOMAL
MONOOXYGENASE
FORM
2
105
21)). The latter gave better recoveries, but gave less efficient resolution of peptides than the larger column. Tryptic cleavage of succinylated form 2 protein was carried out with about 6 nmol of alkylated protein. All of the expected peptides, except R-12, were isolated from this digest by HPLC. Peptide R-12 (residues 291 to 389) eluted in the void volume of the gel filtration column, in partially MATERIALS AND METHODS purified form, but was lost upon HPLC fractionation. The NH2 terminus of peptide R-16 was blocked, but digestion Materials. Detergents, enzyme substrates, cofactors, and chemicals, unless stated otherwise, were obtained from Sigma. [l-‘4C]iodoacetamide with pyroglutamyl aminopeptidase removed the blocking was from New England Nuclear. DEAE-cellulose (DE-52) was from group, and eight cycles of Edman degradation established Whatman Chemical Separations, and hydroxyapatite-agarose (HA-Ulits sequence. The HPLC isolation profiles of peptides tragel) was a product of IBF Biotechnics. TFA, HFBA, CNBr, GnCl, used for the proof of this sequence are shown in Figs. and urea were obtained from Pierce Chemical. Solvents for HPLC and gel filtrations were from Burdick and Jackson. Trypsin, a-chymotrypsin, S-l to S-6. and pepsin were obtained from Worthington. Stuphylococc~saureus Cleavage of form 2 at methionyl residues gave another protease, endoprotease Asp-N, and pyroglutamyl amino peptidase were set of peptides (Figs. S-7 to S-13). At residue 278, both obtained from Boehringer-Mannheim. Liver microsomal flavin-conMet and Arg were observed and at residue 404 both Val taining monooxygenase form 2 was isolated from adult New Zealand and Met were observed. Arg was the predominant residue male rabbits as previously described (2, 15). at 278 and Val at 404. Fragments obtained by CNBr and Methods. Routine amino acid analyses were carried out with 6 N HCl in the gas phase at 150°C for 1 h (16) using a Beckman Model 6300 tryptic digestion were further cleaved by endopeptidase amino acid analyzer equipped with a ninhydrin detector. The COOHAsp-N (Figs. S-14 to S-20). Cleavage at the NH2 side at terminal peptides of form 2 protein were also hydrolyzed for 12, 24, and both Asp and Glu residues was observed with this enzyme. 92 h with 6 N HCl in evacuated sealed tubes at 107°C. Acid cleavage at the Asp-Pro bond in peptide CB-14 proPeptide mixtures, unless stated otherwise, were first separated using a 2.5 X lO@cm or a 1.5 X loo-cm column of LH60 Sephadex equilibrated vided the necessary overlap for peptides CB-14 and Rwith formic acidethanol (3:7) as the solvent as previously described (6). 15. Cleavage of Trp residues in CB-14 by CNBr was esThe peptide mixtures from the gel filtration column were further resolved sential to establish the critical overlap for peptides R-16 by using reverse phase HPLC. The methodology has been detailed in (16). and R-17 (Fig. S-17). Reverse phase columns used for separations include Vydac C4 (15 Tryptic cleavage of the native protein (3 nmol) followed X 0.46 cm) and Waters Associates C18, PBondapac (30 X 0.39 cm). Peptide mixtures were dissolved in 88% formic acid prior to the injection. by carboxymethylation, gel filtration of the digest, and The type of solvent used in these separations is indicated in the legend HPLC analysis of individual fractions gave essentially all of each chromatogram. of the expected tryptic fragments that were larger than Sequence analysis of peptides was carried out on an Applied Biosysfive residues (Fig. S-21). Only peptides T-6, -18, -27, -30, terns Model 470A, with a pulsed liquid-phase solvent delivery system, equipped with a Model 120A PTH analyzer according to manufacturers and -42 were essential for the completion of the sequence instructions. Solid phase sequencing was performed with a Milligen/ of form 2 (Table SI and Figs. S-22 to S-25). The results Biosearch 6600 ProSequencer system as described in the miniprint secof automated sequence analysis of peptides used to deduce tion. Monooxygenase activity was measured by monitoring methimazolethe complete primary structure of form 2 are given in dependent NADPH oxidation as described in (2). Table SI.
2 monooxygenase and compare its structure to form 1 and the cDNA-derived sequences of the enzymes present in the liver and lung microsomes. The covalent structure of form 2 enzyme is the first report of an amino acid sequence of a hepatic FM0 isoform.
RESULTS
Sequence of form 2 monooxygenase. The general strategy of sequence analysis of form 2 enzyme involved the reduction and carboxyamidomethylation of the protein followed by chemical and enzymatic cleavages. Resolution of the digests was achieved by fractionation on an LH-60 column followed by HPLC of individual fractions (7). Two sizes of LH-60 columns were used (2.5 X 100 cm (Fig. S-l) and 1.5 X 95 cm (Figs. S-l, 8, and
Determination of sequence of the COOH terminus. Peptides corresponding to the COOH terminus of form 2 protein were isolated from tryptic digests of succinylated protein and of the intact form. The HPLC elution profile and the amino acid composition of these peptides were identical (Figs. S-3 and S-25). A peptide (CB15) with an HPLC elution profile similar to peptides R-18 and T-42, but having three additional residues, was isolated from CNBr digests of form 2 (Fig. S-8). Conventional automated sequence analysis of each of these pep-
FIG. 1. Covalent structure of liver microsomal flavin-containing monooxygenase form 2. Peptides obtained by tryptic cleavage of intact or succinylated protein are denoted by (T-) and (R-), respectively. Peptides obtained by CNBr cleavage are denoted by (CB-). Subcleavage of peptides by endoprotease Asp-N or Stuphylococcusaureus protease is denoted by (D-) and (E-), respectively. Prefix (DP-) marks peptides obtained by cleavage with formic acid, and (W-) denotes peptides obtained by cleavage at tryptophanyl bonds by CNBr. Residues identified by automated sequence analysis are indicated by a solid line. The covalent structure of peptide R-18 was determined by solid phase sequencing methodology. The sequence of residues 1 to 32 and 222 to 263 was reported previously (2). Residues 278 and 404 were replaced by methionyl residues in digests of some form 2 preparations. Residues 253 and 262 in this sequence are correctly reported as Trp and Lys rather than Met and Trp as erroneously reported earlier (2).
106
JURIS
OZOLS
l*Ala 2 Gly L Ala
Lys Lys Lys
Arg Lys Lys
Val Val Val
Ala Ala Ala
Ile Ile Val
Val lle Ile
Gly Gly Gly
Ala Ala Ala
10 Glv Yal Gly Ile Gly Val
Ser Ser Ser
Gly Gly Gly
Leu Leu Leu
Ala Ala Ile
Ser Ser Ser
Ile lle Leu
Lys Ara Lys
20 Ser Cys Leu Ser Cys Leu CYS Cys Val
Glu Glu Glu
Arg Met Arg
Ser Ser Thr
Asp Asp Glu
Asp Asp Asp
Leu Ile lle
Gly Gly Gly
Cly Gly Gly
Leu Leu Leu
40 Trp Arq Trp Lys Trp Arg
Phe Phe Phe
Thr Ser Lys
Glu Asp Glu
His His Asn
Val Ala Val
Glu Glu Glu
Glu Glu Asp
Clv Gly Gly
Ser Ser Thr
Cys Ser Ser
Lys Lys Lys
Glu Glu Glu
!let Net llet
Ser Met Ser
Cys Cys Cys
Tyr Phe Phe
Ser Pro Ser
7u Asp Phe Pro Asp Phe Pro Asp Phe Pro
Phe Phe Met
Pro Pro Pro
Glu Pro Glu
Asp Asn Asp
Tyr Phe Phe
Pro Pro Pro
Lys Thr Arp
Met Thr
Ile
Tyr Ala Pne Ala Phe Ala
Asp Arq Lys
Arg Glu Lys
Phe Asp Lys Asn Phe Asp
Leu Leu Leu
100 Leu Leu Leu
Lys Lys Lys
Glu Tyr Tyr
lle lle Ile
Gln Gln Gln
Phe Lys Phe Lys Phe Gln
Thr Thr Thr
Ser Thr Ser
Gly Gly Gly
Gln Gln Gin
Trp Trp Trp
Lys Tyr Glu
Val Val Val
Val Ala Val
Thr Thr Thr
Leu Cys Gln
130 His Arg Ser
Glu Asn Asn
Gly Gly Ser
Lys Lys Lys
Gln Lys Gln
Glu Glu Gln
Ser Thr Ser
Asn Tyr Leu
Pro Pro Pro
His Asn Asn
Leu Leu Ile
Pro Pro Pro
Leu Lys Leu
Gly Asp Lys
Cvs Ser Ser
Phe Phe Phe
160 Pro Pro Pro
Gly Gly Gly
lle Leu Ile
Lys Lys Glu
Thr His Lys
Phe Phe Phe
Phe Lys Phe Lys Leu Glu
Asp Gly Gly
Lys Lys Lys
Arg Arg Arg
Val Val
Ile
Leu Leu Leu
Val Val Val
Val lle Ile
190 Gly Gly Gly
Ilet Leu lle
Gly Gly Gly
Asn Asn Asn
SW Ser Ser
Gly Gly Ala
Leu
lle Ile
Ser Ser Ser
Thr Ser Thr
Thr Arg Arg
Gly Ser Lys
Gly Gly Gly
Ala Ser Ser
Trp Trp Trp
Val Val Val
220 lle Met Met
Ser Ser Ser
Arg Arg Arg
Val Val lle
Phe Phe Met
Ile Leu Leu
Arg Lys Arg
Asn Asn Asn
Ser Asn Val
Leu Leu Leu
Pro Pro Pro
Thr Thr Arg
Pro Ala Met
250 Ile lle lle
Val Ser Val
Thr Asp Lys
Trp Trp Trp
Val Met Ala
Pro Pro Pro
Lys Leu Glu
Asp Asn Asn
Arg Gly Lys
lle Thr Tyr
Gln Leu Leu
Met Arg Met
Lys Lys Lys
280 Glu Glu Glu
Pro Pro Pro
Val Val Val
Ser Am Arg
Ile Val Val
Lys Lys Lys
Glu Glu Glu
Val Phe Leu
Lys Lys Thr
Glu Glu Glu
Asn Ser Phe Thr Ser Ala
310 Val Glu Ala
Val Thr lle
Phe Gly Ser Ala Phe Glu
Thr Thr Thr
Gly Gly Gly
Tyr Tyr Tyr
Thr Gly Thr
Phe Ala Tyr Ala Phe Ala
Phe Tyr Phe
Pro Pro Pro
340 Ser Leu Phe Leu Phe Leu
Asp Asp Glu
Glu Asp Glu
Tvr Trn Leu I,.
Tvc --Ser I,-
Ser Ser Ser
Leu lle lle
Tyr Tyr Tyr
Lys Gln Gln
Ser Ser Ser
Val Val Val
Val Phe lle
Ser Thr Thr
60 Asn Asn Asn
Asn Asn Asn
60 Tyr Val Asn Met Phe Leu
Pro His His
Asn Asn Asn
Ser Ser Ser
Gln Lys Lys
Phe Leu Leu Gln Leu Leu
Asp Glu Glu
Tyr Tyr Tyr
90 Leu lle Phe
Thr Leu Thr
Val Val Val
110 Phe Ser lle
Ser Ser Ser
lie Ile Val
Thr Lys Lys
Lys Lys Lys
Glu His Ara
Gln Pro Pro
Asp Asp Asp
Phe Asn Phe Ser Phe Ala
120 Val Val Ser
Ala Ala Ala
lie Val Val
140 Phe Asp Phe Asp Phe Asp
Ala Ala Ala
Val Val Val
Wet Met Met
Val lle Val
Cys Cys Cys
Thr Ser Ser
Gly Gly Gly
Phe His His
Leu His His
150 Thr Val lle
Lys Lys Lys
Gly Gly Gly
Gln Lys Gln
Tyr Ser Tyr
170 Phe Phe Phe
His Arq His
Ser Gln Ser
Arg Arg Are
Gln Glu Gln
Tyr Tyr Tyr
Lys Lys Lys
His Glu His
Pro Pro Pro
Asp Gly Ala
180 lle lle Gly
Thr Glu Ser
Asp Asp Asp
lle lle lle
Ala Ala Ala
200 Val Thr Val
Glu Glu Glu
Ala Leu Leu
Ser Ser Ser
His His Lys
Val Thr Lys
Ala Ala Ala
Lys Glu Ala
Lys Gln Gln
Val Val Val
210 Phe Val Tyr
Phe Asp Trp Asp Ser Glu
Ser Gly Asp Gly ASD Gly
Tyr Tyr Tyr
Pro Pro Pro
230 Tro Trp Trp
Asp Asp Asp
Met Met Met
Val Leu Val
Phe Tyr Phe
Thr Val His
Thr Thr Thr
Arg Arg Arg
Phe Phe Phe
240 Gln Asn Gln Thr Set- Ser
Leu Trp Met
Val Tyr Met
Ala Val Glu
Lys Lys Gin
Lys Gln Gln
Met Met flet
260 Asn Asn Asn
Ser Ala Arg
Trn Lys Trp
Phe Phe Phe
Asn Lys Asn
His His His
Ala Glu Glu
Asn Asn Asn
Tyr Tyr Tyr
Gly Ser Gly
Leu Asn Asp Phe Asn Asp Leu Asn Asp
Glu Asp Asp
Leu Leu Leu
Pro Pro Pro
Gly Ala Ser
Arrl Arg Arg
lle lle lle
Ile Leu Leu
Thr Cys Tyr
Gly Gly Gly
Lys Thr Thr
Val Val lle
Phe Ser Lys
lle lle Val
Arg Lys Lys
Asn lle Asp
Ala His Phe Glu Glv ---
Asn Asp ---
Thr Gly ---
Pro Thr Thr
320 Ser Val Val
Glu Glu Phe Glu Glu Glu
Pro Ala
lle
lle lle
Val Ser Val
lle Val lle
Val
ASD
Asp Asp Aso
Phe Ala Phe Ala Phe Ala
Val Ile Leu
Val
Val Ser
Glu Glu Glu
Asp Asn Asp
Gly Asn Asn
Gln Lys Met
Ala Val Val
Ser Thr Ser
Leu Leu Leu
Tyr Phe Tyr
Lys Lys Lys
Tyr Gly Tyr
Leu lle Phe
Pro Pro Pro
Thr Thr Thr
Gly Thr Val
Glu Aso Glu
Thr Leu Leu
Val
Asn His
Arg Lys Arg
Lys Met Asn
290
lle Val
Lys Lys Lys
lle
lle Val
360
Ile lle Met
Phe Phe Phe
Pro Pro Pro
Gln Gln Gln
Ala Ala Ala
Arg Arc Arg
390 Tyr Tro Trp
Glu Gly Glu
Asn Thr Asn
Lys Lys Arg
His Leu lle
Asn Lys Ala
Ala Val Ala
Lys Lys Lys
Pro Leu Pro
450 Asn Asn Asp
Val Thr Leu
400 Lys Cys Thr Cys Ser
Leu Leu Leu
Pro Pro Pro
Pro Pro Ser
Thr Val Lys
Val Asp Thr
Met Met Met
lle Met Met
Lvs Asn Ala
410 Glu Aso Asp
lle
lle
Glu Glu Lys
Ivz Glu __ Gln
Ala T hr Lys
430 Leu lle Leu
Asp Asp Asp
440 Asp Glu Glu
Leu Leu Leu
Leu Ala Ala
Thr Ser Leu
Ser Phe Glu
Ile lle lle
Asn Gly Gly
Leu Leu Leu
Ala Ala Ala
Leu Leu Val
Thr Glu Lys
Met Val Leu
Phe Phe Tyr
Phe Gly Phe Gly Phe Gly
Pro Pro Pro
470 Tyr Cys Cys
Ser Ser Asn
Pro Pro Ser
Tyr Tyr Tyr
Gln Gln Gln
Phe Phe Tyr
Arg Arg Arg
Leu Leu Leu
Thr Val Val
Gly Gly Gly
Met Thr Leu Thr Phe Thr
Gln Gln Gln
Trp Trp Lys
Asp Asp Gln
Am Arq Arq
Thr Ser lle
Phe Leu Leu
Lys Lys Lys
500 Val Pro Pro
Thr Met Leu
Lys Lys Lys
Thr Thr Thr
Arq Arg Arq
Ile Ala Thr
Val Val Leu
Gln Gly Lys
Glu His Ala
Ser Leu Ser
Leu Ala Leu
Ala Leu lle Ala Phe Ala
Leu Val Leu
Val Leu Val
Ser Leu Leu
Val lle Ala
530 Phe Leu Ala Ala Phe Lcu
lle Phe Val Leu Phe Gln
Leu Val Leu
(Glu,Ser). Phe Gln Trp
Phe
-,-
lle
,.
Gln Gln Gln
Leu Ile Leu
Phe Ser Pro Trp Val Ser
Ser Leu Phe
Leu Gly Thr Phe Leu Thr Leu Phe Lys
Asp Asp Asp
Pro Pro Pro
460 Leu Arg Lys
Gly Gly Gly
Lys Lys Gln
Trp Trp Trp
Lys Pro Glu
Gly Gly Gly
Ala Ala Ala
Arq Arg Arg
Asn Gln Asn
Ala Ala Ala
490 lle lle Ile
Ser Lys Asn
Pro Phe Pro Ala Phe Pro
Glu Leu Val
Ser Phe Ser
Leu Ser Phe
Leu Pro Leu
Lys Glu Leu
Leu Leu Lys
520 Phe Ala Trp Leu Phe Leu
Ala Thr Thr
Val Leu Gly
Asp Asp Asn
Tyr Tyr Tyr
Cys Lys Glu
Ile lle
lle
Thr Asn Asp
Tyr Tyr Tyr
lle Met Leu
Pro Pro Arg 330
350 Ser Ser Ser
270 Leu Leu Leu 300
Ser Ala Ser
Lys Lys Lys
I.,-
50 Are Ala Arg Ala Arg Ala
Gly ,ly G Gly
Phe Ile Phe
L..”
30 Phe Phe Phe
lull __" ___ IN Leu
Gln Gln Arg
lcllc Tvz I YC- --_, Glu Ser
Cys Cys Cys
Pm._ SW Pro
Val Ala Thr
V.,
Thr Thr Thr
110 L._ Ivc ..,.. _., Val,. Gin 11 e Gln
Thr Ala Ala
r.1~ Thr Phn_ T,lv - , LeL I Phe Gly
Pro Pro Pro
,-,, cc” Leu Leu
CT..Th.. l,?,, “0” *,. rl” ,111 L-G” Pro Thr Met Ala Ser Thr Phe Ala
!llC
Lys Glu Glu
C,$, Y’J Gly Gly
I,,* LJ> Lys Lys
“‘J
Leu Leu Leu
,,,, IIC Ile Leu
I.... LIZ” Cl" “ill Leu Glu Leu Glu
r.11, ~0
Gly Gly Gly
380 Met Ala Ile
,a:“13 Gln Gln
Gly Gly Gly
Glu Glu Glu
370 \,., .“I Val Cys
"1_ n,a Pro Pro
Val Val Val
Glu Glu Asp
420
lle
480 Pro Pro Pro 510 Ser Gln Ser
FIG. 2. Comparison of amino acid sequences of rabbit liver microsomal forms 1 and 2, and the rabbit lung flavin-containing monooxygenases. The sequence of the lung enzyme was derived from the cDNA structure (5). The NH, terminus of form 1 is acetylated (3). The NHz-terminal methionyl residue has been deleted from the lung enzyme. In form 1, residues 252 and 253 are correctly reported as Thr and Trp rather than Ser and Thr as erroneouslv reported earlier (2).
“-\TT.
r
I,““XLl!,l~l
-.lm
“m-T,nmTT--
blJx”LlU~l3
-r.
ur
T
r.,nn
LIVl!,K
lull = ““,ROSOMAL
tides was unsuccessful. After cycle 14, the yield of the PTH derivative was below the limit of detection. Presumably, their extreme hydrophobicity led to a washout from the reaction cell of the sequencer. Coupling peptide R-18 or T-42 to an arylamine membrane (Sequelon-AA TM) using (1-ethyl-3-dimethylaminopropyl) carbodiimide as a coupling agent and sequencing the membrane with the MiliGen/Biosearch 6600 sequencing system overcame the washout problem and provided definitive assignment of all the residues in these peptides (Fig. S-I). The proof for the sequence of residues 1 to 32 and 222 to 263 was reported previously (2). DISCUSSION
The amino acid sequence of FM0 form 2 is summarized in Fig. 1. The protein is a polypeptide consisting of 533 residues. Peptides obtained from CNBr and tryptic digestion were characterized extensively. Most residues were confirmed by at least two sequencer runs. Unlike form 1, the carboxyamidomethylated form 2 protein was refractory toward S. aureus protease digestion. Except for the Glu residues at positions 322 and 348, only partial cleavages occurred at other glutamyl residues, resulting in a digest that was too complex to be resolved. Particularly useful fragmentation was obtained by the cleavage of tryptophanyl residues by excess CNBr in the presence of heptafluorobutyric acid (8). Cleavage of peptide CB-14 by this method established the essential overlap for peptides T-15, -16, and -17. The cleavage of Trp residues by CNBr is very simple to use and does not require reagents that are sensitive to storage or require the addition of difficult to remove phenol or indole derivatives. This cleavage procedure involves only a simple evaporation step to remove reactants and products prior to HPLC separation or sequence analysis. The structure of the COOH-terminal peptide could not be achieved without the automated micro-solid-phase sequencing methodology. The conventional sequencing of these peptides, immobilized on a Polybrene-coated fiberglass matrix, was not successful for more than 10 cycles (Table S-I). Presumably, the peptide was lost during the extraction steps of the automated Edman degradation procedure. In contrast, solid-phase sequence analysis of this peptide was quite successful in establishing its complete sequence (Table S-I). Three Asn-X-Ser/Thr sequences (at Asn 60, 267, and 274), which are glycosylated in microsomal proteins having such sequences oriented toward the luminal side of the membrane (9), are not glycosylated in form 2 monooxygenase. Similarly, the Asn-X-Ser sequences in liver microsomal cytochromes bf, (Asn 20 and 105) are not glycosylated, since they do not extend into the luminal side of the membrane, but are oriented toward the cytosolic side or are partially buried in the lipid bilayer (10). The form 2 enzyme has several sites that are particularly sensitive toward proteolysis. In some preparations a form with a slightly lower molecular weight than form
MONOOXYGENASE
FORM
2
107
2, but with 20 NH,-terminal residues identical to form 2, was identified by SDS-PAGE, electroblotting, and sequence analysis. Presumably, this form results from cleavage of about 30 residues from the COOH terminus. Another form, having a subunit molecular weight of approximately 25,000, with 20 NH,-terminal residues identical to residues 279 to 299 was also identified by SDSPAGE and electroblotting of partially purified form 2 preparations. These findings imply that these sites may exist as a surface segment in the tertiary structure of this FM0 and may be sites for protein degradation in uiuo. Of interest is that at residue 278, both Arg and Met were observed, but Arg was the predominant residue. Whether a site having an amino acid substitution results in a conformation that is more susceptible to proteases than the site containing the predominant residue remains to be determined. The comparison of the amino acid sequences of forms 1 and 2 is shown in Fig. 2. Residues at positions 425 and 427 are deleted in form 2, and an extra residue is inserted at position 535. The two forms share 52% identical residues. Form 2 has five additional basic and five additional acidic residues, so that the net charge difference between the two forms is conserved. While clusters of short segments of nonidentical residues are scattered throughout the entire sequence, two larger stretches of nonidentical residues are evident at residues 308 to 321 and the membrane-binding domain. Recently, sequences derived from cDNA structure coding for FMOs expressed in hog liver (4) and rabbit lung and liver have also appeared (5). The cDNA sequence of the liver enzyme, except for 10 amino acid assignments, is identical to that of form 1 (Table S-II). Six of the variant residues occur in the positions corresponding to a specific enzymatic or chemical cleavage site. The identity between the hog liver and the rabbit liver form 1 is about 90%. The lung enzyme, however, shows 55% identity to the form 1 sequence and 54% identity with the form 2 sequence (Fig. 2). When forms 1 and 2 are compared with the lung enzyme, only 42% of the residues are identical among the three forms. The proposed FAD and NADP pyrophosphate-binding domains around residues 9 and 190 are conserved in all three forms. Sequence comparison of all three rabbit FM0 forms identifies a third segment containing variable residues (residues 408 to 421). Whether these segments are responsible for the individual substrate specificities of these enzymes or represent “variable” domains of the monooxygenase molecules remains to be determined. The two hepatic FM0 sequences are unusual in that form 1 from rabbit (3) and hog (18) liver have an N-acetyl Ala at the NH, terminus, whereas the form 2 isozyme starts with a free glycyl residue. The NH2 terminus of the cDNA-derived lung FM0 is methionine, and the nature of the NH2 terminus of the expressed lung enzyme is not known (5). It has been proposed that N-acetylation may affect the biological function and/or protein stability, and
108
JURIS
that lack of acetylation can directly affect protein structure making it more unstable to denaturing conditions or more susceptible to proteolytic modification (20). The observation that form 2 FM0 is less stable toward storage than the N-acetylated form 1 is in agreement with this hypothesis. Furthermore, the observation that form 2 has several sites that are sensitive toward proteolysis supports the N-acetylation/stabilization hypothesis. While the cDNA sequence of the hog and rabbit lung enzymes indicate a methionyl residue at the NH2 terminus and the absence of a signal peptide sequence (4, 5), it remains to be determined whether the form 2 gene encodes a leader sequence. The second prominent feature of the monooxygenase sequences is the complete absence of polar residues at their COOH termini. Other microsomal membrane proteins sequenced to date terminate with a short stretch of negatively or positively charged residues. The significance of this structural feature for the degradation or the topology of this group of proteins remains to be elucidated. Recently, Hlavica et al. (11) reported that polyclonal antibodies to a synthetic 14-amino-acid peptide, residues 466 to 479 of form 1, cross-reacted with liver microsomes and with the partially purified liver enzyme, but did not recognize the protein in intact lung microsomes or the purified lung enzyme. This implies that the critical epitope may be exposed to the surface of the FM0 molecule when in the membrane-bound state in rho. Failure of this antibody to recognize the lung enzyme may be due to the amino acid sequence differences between the liver and the lung forms. Residues 466 to 479 of form 1 show only 57% identity when compared with the corresponding region of the lung form, whereas this segment in forms 1 and 2 shares 86% identical residues. These results are in agreement with studies that report that antibodies directed against the intact rabbit lung enzyme failed to demonstrate cross-reactivity with the hepatic enzyme from rabbits (12,13). The most intriguing questions regarding the microsomal FMOs cannot be answered by this study, since this is only the third reported sequence for this group of proteins. The recent report, that a FM0 form related to form 2 enzyme can be isolated from liver microsomes of guinea pigs (6), clearly indicates that form 2 enzyme may be more prevalent than previously suspected. Notwithstanding, certain conclusions concerning the biology of microsomal FMOs may be postulated from this study. Since the ammo acid sequence of rabbit liver FM0 form 1 is some 90% identical to that of form 1 from hog liver (4, 5), the amino acid sequence of form 1 in other mammalian species will be very much identical. From past sequence comparison studies, such a conclusion is now well supported. More importantly, since form 1 from other species is structurally very identical, the results of the extensive studies on substrate specificity, reported with the hog liver form (1)) can now be extrapolated to form 1 in other species. The sequence of rabbit lung enzyme is only 55% similar to the rabbit liver form 1. Rabbit lung
OZOLS
FM0 differs significantly in the substrate specificity from the hog liver form 1 (1, 12-14). For example, unlike liver form 1, the rabbit lung enzyme cannot distinguish primary from secondary or tertiary amines (1). On the other hand, many of the better substrates for liver form 1 (e.g., chlorpromazine, imipramine, benzphetamine) show no activity with the lung enzymes (21). Contrary to the hog and rabbit liver form 1, the amino acid sequence of rabbit liver form 2 is only 54% identical to that of rabbit form 1 and shows only 55% identity to the rabbit lung enzyme. The catalytic properties of form 2 FM0 will be elucidated in our next study. ACKNOWLEDGMENTS I am grateful to George Korza for his outstanding technical assistance and to Dr. Vijay Kumar for his skillful isolation and solubilization of microsomal membranes. I also thank Dr. David W. Andrews (MilliGen/ Biosearch, Burlington, MA) for performing the solid phase sequencing and Bridget A. Clancy-Tenan for her expert typing of the paper. The many helpful discussions with Drs. Craig Gerard (Dept. of Medicine, Beth Israel Hospital, Boston, MA) and F. S. Heinemann (Dept. of Pathology, Hoag Memorial Hospital Presbyterian, Newport Beach, CA) are greatly acknowledged.
REFERENCES 1. Ziegler, D. M. (1988) in Drug Metabolism Reviews (DiCarlo, F. J., Ed.), Vol. 19, pp. l-32, Dekker, New York. 2. Ozols, J. (1989) Rio&em. Biophys. Res. Commun. 163,49-55. 3. Ozols, J. (1990) J. BioZ. Chem. 265, 10,289-10,299. 4. Gasser, R., Tynes, R. E., Lawton, M. P., Korsmeyer, K. K., Ziegler, D. M., and Philpot, R. M. (1990) Biochemistry 29, 119-124. 5. Lawton, M. P., Gasser, R., Tynes, R. E., and Philpot, R. M. (1990) J. Biol. Chem. 265, 5855-5861. 6. Yamada, H., Yuno, K., Oguri, K., and Yoshimura, H. (1990) Arch. Biochem. Biophys. 280,305-312. 7. Heinemann, F. S., and Ozols, J. (1984) J. Biol. Chem. 259, 797804. 8. Ozols, J., and Gerard, C. (1977) J. Biol. Chem. 252, 5986-5989. 9. Ozols, J. (1989) J. Biol. Chm. 264, 12,533-12,545. 10. Ozols, J. (1989) Biochim. Biophys. Actu 997, 121-130. 11. Hlavica, P., Kellermann, J., Henschen, A., Mann, K.-H., and KunzelMulas, U. (1990) Biol. Chem. Hoppe-Seyler 371.521-526. 12. Williams, D. E., Ziegler, D. M., Hordin, D. J., Hale, S. E., and Masters, B. S. S. (1984) Biochem. Biophys. Res. Commun. 125, 116122. 13. Williams, D. E., Hale, S. E., Muerhoff, A. S., and Masters, B. S. S. (1984) Mol. Pharmucol. 28,381-390. 14. Tynes, R. E., and Philpot, R. M. (1987) Mol. Phurmacol. 31, 569574. 15. Ozols, J. (1990) in Methods in Enzymology (Deutscher, M. P., Ed.), Vol. 182, pp. 225-235, Academic Press, San Diego. 16. Ozols, J. (1990) in Methods in Enzymology (Deutscher, M. P., Ed.), Vol. 182, pp. 587-601, Academic Press, San Diego. 17. Ozols, J., Heinemann, F. S., and Gerard, C. (1980) in Methods in Peptide and Protein Sequence Analysis, (Birr, C., Ed.), pp. 417429, Elsevier/North-Holland, Amsterdam. 18. Guan, S., Falick, A. M., and Cashman, J. R. (1990) Bin&m. Biophys. Res. Commun. 170,937-943. 19. Matsudara, P. (1987) J. Biol. Chem. 262, 10,034-10,038. 20. Jornvall, H. (1975) J. Theor. Biol. 55, l-12. 21. Nagata, T., Williams, D. E., and Ziegler, D. M. (1990) Chem. Res. Toxicol. 3, 372-376.
MINIPRINT
SUPPLEMENTAL
MATERIAL
FLAVIN-CONTAINING
SUPPLEMENT
TO MULTIPLE FORMS OF LIVER MICROSOMAL MONOOXYGENASES: COMPLETE COVALENT STRUCI-URE OF FORM 2.
by
Juis Ozols m - Peptides were attached to aryl amine membranes (Sequelon- AA; MilliCenlBiosearch, Division of Millipore Corporation), using procedures supplied by the manufacturer. The peptide was dissolved in neat TFA and applied to an aryl amine disk placed on a sheet of Mylar resting on a heating block at 550. After drying, the disk was wetted wth 5 pl of a 0.1 M solution of MES (2.N-morpholino)ethanesulfonic acid, containing 15% acetonitrile. and 10 mg/ml of EDC (1.ethyl-313. dimethylaminopropyl])carbodimide, pH 5.0. MES and EDC were obtained from Sigma. The coupling reaction was allowed to proceed at room temperature until the disk was dry (about 20 min). Membrane containing the coupled peptide was transferred to the reaction chamber of a Milligen/Biosearch 6600 ProSequencer sequencing system. Automated Edman degradations were performed using the standard 36 min protocol developed for this system by the manufacturer. FIX-amino acid derivatives were identified using an on-line HPLC system consisting of a MilliGen/Biosearch low dispersion pump, a SequeTag PTH analysis column (30 x 0.39 cm), and a Waters 490 multi-wavelength UV-VIS detector. F’TH amino acids were identified bv their characteristic elution positions and their UV absorption at 269 nm. The dehydro de&atives of wine and thnx&e were identified by their absorption at 313 nm. Data were collected and analwed usine the MilliGen/Biosearch Pm-Maxima chromatography package. Chromato~ams we& optimally aligned, using alignment algorithms. and then submxted to give a difference trace for successive sequencer cycles.
MINUTES Fig. S-2. HPLC of fraction 55. from column described in Rg. S-l, on a (I5 x 0.4 cm) C4 column. Solvent A was 0.1% TFA and Solvent B was 0.1% TFA in 75% acetonimle. A linear gradient from 0% to 100% of Solvent B in 70 min was performed at a flow rate of 1.0 ml/min.
Reduction. Carboxvami~ Large scale (IO nmol) chemical modifications of form 2 protein were performed as described in (3). Reduction and alkylation of peptides on a microscale level was performed by adding to the dry peptide the following solutions: 60 ~1 of 8 M urea, 10 ~1 of 2 M ammonium bicarbonate and 5 ~1 of 50 mM DlT. After 15 min incubation at 58, the mixture was cooled to room temperature, and 5 ~1 of 100 mM iodoacetamide was added. After 15 min, at room temperature, the mixture was lyophylized, dissolved in 88% formic acid and subjected to HFLC. -tic Cleavw Tryptic, chymoayptic and S. m protease digestions were carried out with I:25 w/w of the appropriate enzyme in 2 M urea. containing 50 mM ammomum bicarbonate, pH 8. The digestions were performed at room temperature for 18 h. Peptide cleavage with Asp-N protease was performed in 2 M urea containing 50 mM sodium phosphate buffer, pH 8.0, for I8 h at 370. Prior to the enzymatic cleavages protein or peptides were dissolved in 8 M urea. The urea concentration was then reduced to 2 M by the simultaneous addition of the appropriate buffer and the enzyme.
MINUTES Fie. S-3. HPLC of fraction 48 from column described in Fig. S-l. pe>ormed as described in Fig. S-2.
The HPLC was
Pvroelutamvl Aminooeutidase Treatment of Peotide R-16 -Two mg of lyophylized enzyme (0.5 units/80 mg of lyophylized material) was dissolved in 200 pl of IO0 mM sodium phosphate, pH 8.0, containing 5% glycerol, 10 mM EDTA and 0.5 mM DTT. At 50 pl aliquot of this solution was added to 1 nmol of lyophilized peptide. The tube was flushed wth N2 and incubated at room temperature for some I6 h. The mixture was dried, dissolved in 88% formic acid and subjected to HPLC. Chemicw For cleavage at methionyl residues, protein or peptide was dissolved in 70% formic acid to give about a 1% (w/v) protein solution, and a lO@fold molar excess of CNBr over methionine residues. After 18 h at room temperature, the mixture was partly dried under nitrogen in a fume hood, diluted IO-fold with water and lyophylized. For cleavage at methionyl and tryptophanyl residues, the dried peptide was dissolved in 0.2 ml each of neat heptafluorobutyric acid and 88% formic acid. After addition of about 250 mg of solid CNBr, the mixture was incubated at mom temperature for 18 h (8). The reagent and solvent were removed with a stream of nitrogen (in hood), diluted IO-fold with water and lyophylized. Cleavage at Asp-Pro bonds was performed by incubating the peptide with 88% formic acid at 370 for 72 h.
10
20
30
40
50
MINUTES Fig. S-4. HPLC of fraction 68 from column described in Fig. S-1. The HPLC was performed as described in Fig. S-2.
Electroblotting and sequence analysis of the blotted proteins were performed as described by Matsudara (19). Partially cleaved form 2 preparations were identified in the effluent fractions of HA-Agamse column upon elution with 300 mM KPi containing 0.1% NP-40. 20% glycerol, I mM EDTA and 0.1 mM D’lT (2.14).
FRACTION (6 ml/tube) I,lg. S-l. Gel filtration of tryptic digest of 6 nmol of carboxyamidomethylated and succinylated form 2 protein. The digest was applied to a column of LHa60 Sephadex (2.5 x IO0 cm) equilibrated wth formic acid/ethanol (3:7). Six ml fractions were collected at a flow rate of 10 ml/hour. The solid bar Indicates the distribution of a particular peptide in the fractions collected.
to
20
30
40
50
60
MINUTES Fig. S-5. HPLC of fraction 56 from column described in Fig. S-I performed as described in Fig. S-2.
The HPLC was
109
110
JURIS
IO
20
30
40
50
60
70 10
MINUTES Fig. S-6. Isolation of Peptide R-13 from fraction 40 of gel filtration column described in Fig. S-l.
FRACTION
OZOLS
20
30 40 MINUTES
50
60
Fig. S-10. HPLC of fraction 17 of Fig. S-7 using HPLC conditions as described in Fig s-2.
( 3 ml /tube)
Fig. S-7. Gel filtration of CNBr digest of carboxyamidomethylated form 2 protein. Approximately 4 moo1 of the protein was digested and applied to a column of LH60 Sephadex (1.5 x 95 cm) equilibrated with formic acid/ethanol (3:7). Three ml fractions were collected at a flow rate of 10 ml/hour. The dot indicates the kaction from which the indicated peptide was isolated.
IO
20
30
40
MINUTES kg. S-l 1. HPLC of fraction 32 of Fig. S-7 using conditions as described in Fig. S-2.
1
-I
IO
20
30
40
50
60
MINUTES Fig. S-8. HPLC of fraction 25 of Fig. S-7 using HPLC conditions as described in hg. S2.
MINUTES Fig. S-12. HPLC of fraction 29 of Fig. S-7. HPLC conditions are described in Fig. S-2
MINUTES Fig. S-9. HPLC of fraction 20 of Fig. S-7 using HPLC conditions as described m Fig. S2.
MINUTES
Fig. S-13. HPLC of fraction 22 of Fig. S-7 using HPLC conditions as in Fig. S-2
COVALENT
STRUCTURE
OF LIVER
MICROSOMAL
MONOOXYGENASE
FORM
I
111
2
9-10 R-IO. E-2
-0
R 2
\
:;.i IO
20
30
40
MINUTES MINUTES I,lg S-14. HPLC resolution ~~mdlt~ons as in Fig. S-2.
I 0 0 M a
of Asp-N
protease
digest
CB-4
and 5 u>lilg
20
30
I
40
MINUTES
1.1g. S-l.5 Srsnhvlwxcus
Isolation w
of PeptIde CB-9,10;E-6. protease and the digest
IO
20
I,lg. S-18. Isolation of Peptide R-lO;E-2. m protease. The dtgest was resolved
The cleavage was performed by HPLC as in Fig. S-2.
wth
Stanhvloccu\
I 1, O c a
iI! 10
of Peptides
The parent peptlde resolvd as 1” Fig S-2
30
40
was digested
~1111
I
! 10
20
---l-
30
40
50
MINUTES
F!g S-19. Isolation of Peptide R-lZ;D-8. 20 of Fig. S-l was digested with Asp-N F1g. s-2.
Partially purified PeptIde R-12, from fraction protease and the digest resolved by HPLC ii\ m
50
MINUTES 10
I-lg. S 16. Cleavage of Asp-Pro bond in Peptide CB-14. The peptlde (m 88% formic aad) was incubated for 72 h at 370. The HPLC of the digest was perfomxd as in Fig So 2
Fig. S-20. HPLC of $taDhvlowaxs digest was pefomwd as in Fig. S-2.
I IO
20
30
40
MINUTES I+g. S-17. HPLC rsolation of Peptides CB-14;W-4 and CB-14;W-5. Cleavage of l’r[~ re\ldues in CB-I4 was performed by CNBr as described under “Expertmental Sectmn” HI’LC of the digest was performed as in Fig. S-2.
20
30
40
MINUTES
15
m
digest
20 FRACTION
of Peptlde
25 (3
R-l?.
30
The resolution
ot tk
35
ml/tube)
kg. S-21. Gel filtration of tryptlc digest of form 2 protein. About three nmol of d!gewd protein was carboxymethylated and the reaction mixture applied to n column of LHhO Sephadex (I .5 x 95 cm) equihbrated with formic acid/ethanol (3:7) Three ml fractmn\ were collected at a flow rate of 10 ml/hour.
112
JURIS
10
20 30 MINUTES
OZOLS
40
Fig. S-22. HPLC of fraction 24 of Ftg. S-21 using conditions as m Fig. S-2.
-
Fig. S-24. HPLC of fraction 27 of Fig. S-21 using conditions as in Fig. S-2.
02
I
P 4” 01
4
IO
I 20
r
I 30
40
h IO
I
I
1
1
1
II
20
30
40
50
60
70
MINUTES
MINUTES
Fig. S-23. HPLC of fraction 30 of Fig. S-21. Peptlde R-16 denotes nyptic peptide, residues 486 tc~494.
Tsblp DOS. 19 ii' 21 22 23 24 25 2G 27 2s 29 3c 31 32 33 34 3.5 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54
SI.
Sequence Data for Few 2 llonoxyqenase Peptidfa [cycle), yield (pmol)
seq. Ser cvs Lk" GlU Glu Gly LeU Glu Pt-0 Thr CVS Pie Glu "let Ser ,\sp As17 Ile Cl; Gly LelJ Tr?t LYS Phe Ser PSll 'iii Pla SlU Gl" Gly h-g Pla ser !le T‘/ r
POS. R-2 (1) (2) (3) (4) (5) (6)
4:, 22 ?J 25 27 16
17; P (9) (10) ill! (12) (131 (14) (15) (16) (171
:: 11 5.7 6.5 7.6 7.1 4.3 1.9 3.2 3.5
R-3 (1) (2) (3) (4)
9.1 3.5 7.9 8.0
Fig. S-25. HPLC of fraction 22 of Fig. S-21 using conditions as in Fig. S-2.
& (4)
(9) (101 (11) (12) (13) (14) (15) (lh) (17) (131 (19) (20) (21)
;; li
G.2 7.0 3.1 6.2 2.4 4.9 4.0 4.2 3.7 2.0 2.7 1.6 2.3
seq.
55 36 57 3& 59
Gin Ser Val Phe Thr
63 64 65
LYS Glu Islet
71 72 73
78 7" x0 ill 42 83 84 85 X6 7 91: 8'1 90 91
Peptides
(5)
7.3
I:] (8) (9) (10) (11) (12)
::," 4.9 3.1 1.G 0.3 0.4
I;;] (15) (16)
;:6' o.:, 1.1
T-6 (1) (2)
Phe Pro Pi?e
Pro Asn Asn Net Vi * iisn Ser Lys Leu 61n Gil! Tyr Ile Thr
m-4 (1)
34
1:;": 1 [g;;; (hj 79 (7)10! (~)!20
(9) 97 (10)
23
?II 38
ii] (5)
(6)
",: 31 2'
iii (9)
5: 19
ii:; (12) (13; (l&j
:3 14 '1 9.8
liS', (17) (18) (19)
::4 3.4 2.3 0.5
i;:;
A::
COVALENT
STRUCTURE Table
SI.
OF LIVER
MICROSOMAL
Sequence Data for Form 2 ~lononyqenase Peptidea (cycle), yield (pmol)
PUS.
Seq.
Pas.
seq.
92 13 94 95 36 97 33 39 1X 101 102 103 104 10s 136 107 108 109 110 111 112 113 114 115 116 117 118 119 120 121 122 123 124 125 126 127 12R 129 130 131 132 133 134 135 136 137 131! 139 140 141 142 143 144 145 146 147 148 143 150 151 152 153 154 155 156 157 158 159 160 161 162 lb3 16U 165 166 167 168 169 170 171 172 173 174 175 176
Thr Phe Ala An3 Glu LYS Asn LeU LfZU LYS TY~ Ile Gin Phe Lys Thr Leu '!a1 Ser Ser Ile LYS LYS tlis Pro ASP Phe Ser Val Thr G'Y Gin T!"D TY~ Val Ala Thr W Arq ASl. Gly iys LVS Glu Thr Ala va1 Phe ASD Al; Vdl Met
177 178 179 180 181 182 1 X?
GlU Pt-0 Gl .v Ile Phe i.ys Glv
184 185 ! 86 167 188 1x9 190 191 192 193 134 195 196 !Y7 198 199 LUti 2Lil 202 203 204 205 206 2d7 200 209 210 211 212 213 214 215 216 217 '1X 219 220 221 222 223 224 225 226 227 223 229 230 231 232 233 234 235 236 737 233 239 240 241 243 242
LYS AV 'Val Leu Vai Ile Gly LW GUY Asn Ser GUY Gl u
R-4 (1) (2) (3) (4) (5)
21 24 252 38 42
1;; (3) (9) (10) (11) ;;;]
;i 26 20 24 22 ;;
(14) (15) (16) (17) I;;{
20 11 13 12 ;;
(20) (21) ;;;I
6.2 4.3 33::
(24)
1.2
CS-4,D-2
i:i E Iii k 12;::i
(7) (3) (9) (10) i;:i (13) (14) (15) (16) (17)
R-5 i:j
:;
(7) (8) (9) (101 illj (12) (13)
33
5.5 1.1 4.3 4.8
::; 3.2 0.2 1.5 0.8 0.2
24
30 25 24 21 2u
I’e cys
1;;; ;;
Ser GUY ris HlS Val Tvr Pro ASn LeU Pl-0 LYS ASP Ser Phe Pro Gly LeU LYS His Phe LYS GUY LYS Ser Phe ArrJ Gln At-3 GlU TY~ LYS
(16)
7.7
,D-2 :: 29 21 17 15 11 3.0 8.7 6.9 6.3 5.8 1.9 4.5 1.5 2.1 0.6
R-6 (1)
33
(3)
25
(2) 23 CB-5,D-3
MONOOXYGENASE
244 245 246 247 248 249 250 25' 252 253 254 255 256 257 258
261
FORM
2
Peptides
(4) (5) (6) (7)
28 17 21 13
R-7
(4)
23
(7)
15
(10) (11) (12)
14 12 10
tzii: i:i1y
ASP
Ile 01 a Thr Glu Leu Ser
Hi 5 Thr Ala Glu I;ll. Val Vzl Ile
CB-5,D-7 iii 143; (5)
i:: 7.9
(6j 2.5 iii 2
Ser
SfZr Arg SPIGly SetTOP Val Met ier Arq va1 TOP AS0 ATI1 tily TY~ PI-o Trp Asp llet LelJ TY~ Val TtIr Arii Phe Gltl Thr Phe LW Lys ASl. ASll Leu Pl.0 Thr Ala Ilf Ser ASP Trn Tt-0 TY~ Val LYS Gln Het ASn Ala
;o"
T-l& (1) (2) (3)
2.5 7.0 1.5
(9) (10) (11)
0.3 5.7 3.3
R-9
(1) 15 i:i 2;
(4)
!5i
7.3 9.3
:;l 2 (8) 3.75.8
CB-7 (1) (2)
R-10 (11137 (2)11@
J'J 17
iii 7:
(5) (6) (7)
10 18 17
(8) 10 i;",'lZ 151 ,~I 89 ~~ i:i (3) (9) 10) 11) 12) 13) 14) !5) 16) I71 18) 19) !O) !l) !2) !3) '4)
:: 63 45 38 23 18 16 5.0 14 7.1 7.0 13 11 5.9 6.3 9.4 6.1 3.3
CR-8
(9) (10) ill) (12)
7.3 6.5 5.7
113
114
JURIS Table
SI.
Sequence
Data
Peptidea POS.
seq.
ZuL 263 264 265 266 267 263 269 270 271 272 273 274 275 276 277 278 279 280 231 282 283 284 205 286 287 283 239 290 291 292 293 294 295 296 297 290 239 300 301 302 303 304 305 306 307 303 309 310
LyS Phe LYS Hi 5 GlU Asn TY~ Ser LIX, Plet Pro Leu Asn Gly Thr Leu Arg LYS Glc PI-0 Val Pile ASll PSp 4sp LW pro Ala RI-Cl Ile Leu CYS GUY Thr Val SetIle LYS Pro PSll Val LYS Glu Phe LYS GlU Phe Thr Gl i,
311 312 313 314 315 316 317 318 319 320 321 322 323 324 325 326 327 328 329 330 331 332 333 334 335 336 337 338 339 340 341 342 343 344 345
Thr SetAla Ile Phe GlU ASP Gly Thr Val Phe Glu Ala Ile ASP Ser Val Ile Ph!? Ala Thr Gly TY~ Gly TY~ Ala TY~ Pro Phe Leu ASP ASP ser
(25)
12.0
R-lO,E-2 (1) 38 (21 47
(23)
R-lZ,D-8 I:] :: (3)
14
1%
8.7 8.2
1;;;
2:
i;ti
i:;
I;:] (12) (33)
4":: 3.7 3.1
1:;; (36) (37)
i:: 1.0 0.2
!;;I ;:;: I;;\
;:;
35 i;:j (23)
::: 3.7
T-27 (1) (2) (3) (4)
53 65 49 45
(5) (6) (7) (8) (9) (10) (11) (12) (13) (14) (15) (16) (17) (18) (19)
40 20 43 37 38 28 26 31 20 28 25 17 20 18 11
OZOLS
for Fov 2 llonoxyqenase (cycle), yield (pmol) POS.
Seq.
34G 347 34d 349 350 351 352 353 354 355 356 357 358 359
LYS Ser GlU Asn Asn Lys 'ial Thr LeU Phe LYS GUY Ile Phe
361 362 363 364 365 36b 367 368 369 370 371 372 373 374 375 376 377 378 379 330 381 382 383 3x4 385 386 3x7 383 33Y 390 391 392 393 394
Pt-0 Gln LeU GlU Lys PI-0 Thr Yet Ala Val Ile GUY LW Val Gln Ser Let. Gly Ala Ala Ile Pro Thr Thr ASP L?U Cln Ala Arg Trp Ala Ala Gln Val
3% 3% 397 398 399 400 401 402 403 404 405 406 407 408 409 410 411 412 413 414 415 416 417 418 419 420 421 422 423 424 425 426 427 4211 429
I'e LYS GUY Thr CYS Thr $
(6)
55
1;; (9) 110)
:," 55 60
(24) (25) (26)
1.1 0.4 0.6
CD-11
(9)
40
ii:; (12)
:i' 52
Pro Val LYS ;%&
I;:] (15)
:: 18+llet
j;;i
;;
;:A
I;:] (20) (21) (22)
;: 10 7.2 4.2
ASP Il!? His Glu Lys Met GUY Thr LYS LeU LYS Thr Phe Gl Y iy; Tl-D Gl; Thr Ile Gln
Peptides
R-13;E-2 i:]l! 13) (4) (5)
3
CB-12 (1) 3l
(2) 20
(3) (4) (5)
19 8.6 15
90 51 50 (5) 38 161 30
(10)
12
'(;:j (13) (14)
:: 30 23
R-13;E-3
COVALENT
430
Thr
(4)
431 432 433 A34 4?5 436 437 43:: 33Y 440 ‘44 1 442 443 444 445 445 447 43c 449 450 451 45; 453 454 455 45b 457 455 439 460 461 462 463 464 465 466 467 468 465 470 471 372 473 474 475 47h 471 473 479 480 481
Asp
(5)
Tyr IliZ
Rsn i7r :.1et
Ih) (71 (8) (9) (10)
Asr1 Cl!) GL (12) Lelc &la SetFhe :1e Gly !,a1 LYS I eu 4sn Ile PI-0 :rp LEU Phe Lt?lJ :hr As ,I PI"0 ill-9 I.CU Ala Lfll Gl li l'al Dhe Php Gly PTO ryi ser Pm lYr Clll Uhe Arq LCU Val Gly fro Gly L '/ s Tl-1'
(13j
STRUCTURE
OF LIVER
MONOOXYGENASE
FORM
2
115
50 6X 60 63 51 45 38 23 17 13
CR-14 (I)117 (21124 (3)11X (4j136
I:] ii (7) 84 (8) 78 (9) (10)
92 57
1;;; (13) (14)
63 sl 68 56
1-l;;
41;
(17) (If) (19)
41 37 10
;;y; (22) (23)
:;
1;@P-2 71 63 127 I3Y 125 95 95 102 I26 95 42 40 16 32 26 23 24 13 21 15 11 7
7.1 10.0
‘1-15 (11101 (2) 84 (3) 57
'In all succinylated methvlCystelne or
MICROSOMAL
peptides Lys carboxvmethylcvstcine.
Table
Residue
Position 19 9% 102 115 125 278 339 405 454 456
1s identified
487 483 GSY 490 411 492 493 444 495 496 497 4YC 499 500 501 502 x3 504 505 506 5d7 sub 509 510 511 512 513 514 515 516 517 51 ;I 519 520 ii1 522 522 124 525 5Zf, 527 523 x:li 533 531 i>3: 533
ds 5%Lvs;
SII. Di:ferences llonooxyoenase (3)
Between and the
Al a 1le LCU Thr Gill -rp !Ai[! ,4rq set. LfU I.YS PK ?1et ! ys TiltAt-q illa va1 Cl L, 111s Leu 61 II LVS PK ila LelJ Phe SL'r Pro GlU Leu Tl-1' LElJ LelJ Al a Ile I1 a ':a 1 Lfll LfU !le A! a ii 1 3 ildl LEU \'?I1 Phe irl
slkylated
i4) /5i
16 14
(6: 4 (7) 7 (JJ 2
:!!-5 is 16 11 l? :1 7
k-17 ilj34 (2) 41 (3) 32
R-l:i('ol.Pharei (1: 2 !2! 7
(6) !7! (Cl 19) (10) (11) (12) (13) (141
25 17 23 12 2.4 4.1 4.3 6.0 3.5
peptides,
the Prlnlar‘y Structure cDNP Encoded Seq~~cnc~ PTII (3)
CD-14 (1) (2) !3! ,4: j5i (6)
I!!
::
iojj (7) !:i)
;7 IL1 2.6
--4L (1 !X!l :2;162 !3)100 (4) 31 (5) 6:’ (6) 41 (7) 15 (51 19 (9) 17 Ii? (121 (13) (14) (151
Cys
1s identified
of For'!? 1 (5)
rls cdrboxyarnrlo-
lo'3 4 ?.'I 3.1 l.i, 0 5