The primary structure of mouse saposin

The primary structure of mouse saposin

BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS Vo1.184, No. 3,1992 Pages 1266-1272 May 15,1992 THE PRIMARY STRUCTURE OF MOUSE SAPOSIN Masahik...

395KB Sizes 18 Downloads 84 Views

BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS

Vo1.184, No. 3,1992

Pages 1266-1272

May 15,1992

THE PRIMARY STRUCTURE OF MOUSE SAPOSIN Masahiko Tsuda I'* , Takeshi Sakiyama I, Hideya Endo 2 1 and Teruo Kitagawa IDepartment of Pediatrics, Nihon University, 8, l-Chome, Kandasurugadai,

School of Medicine,

Chiyoda-ku, Tokyo I01, Japan

2Medical Research Institute, Nihon University, 30-1, Kamimachi, Ohyaguchi,

Itabashi-ku,

School of Medicine,

Tokyo 173, Japan

Received March I, 1992

SUMMARY: The primary structure of mouse sphingolipid activator protein (saposin) was determined by cDNA sequencing. The amino acid sequence predicted by the cDNA sequence revealed that mouse saposin was highly homologous to human saposin and also to rat sertoli cell glycoprotein. Mouse saposin also has four functional domains, which are structurally similar to each other, and each domain has cysteines, prolines, and a potential glycosylation site at an almost identical position. An amino acid comparison between human and mouse saposins revealed that the similarity was approximately 70%, and human saposin lacks thirty-one amino acids between domains C and D. Heterogeneities of mRNA were found in both the coding and noncoding regions. © 1992 Academic Press, Inc.

Saposin proteins

is

that

a is

family known

of to

small,

heat

activate

stable,

a number

and

highly

of lysosomal

glycosylated

hydrolases

(i).

Saposin B, originally designated as SAP-I, is also a sulfatase activator, and the genetic deficiency of this protein results in clinical features similar to those

observed

originally

in

called

metachromatic SAP-2,

and

the

leukodystrophy deficiency

clinical features of Gaucher's disease (4, 5). and it was revealed

that four different

and

the

possibility

3).

this

protein

of

processing therapies

(6, 7).

for

it is essential to use an excellent animal model.

technology

has

it

*To whom correspondence

possible

to create

should be addressed.

0006-291X/92 $4.00 Copyright © 1992 by A~'aclemic Press, Inc. All rights of reproduction in any form reserved.

1266

leads

C

was

to the

Human saposin has been cloned,

disorders,

made

Saposin

saposins were derived from the same

large precursor and generated by proteolytic pathophysiology

of

(2,

mouse

models

To study the

lysosomal

storage

Recent molecular by

the

transgenic

Vol. 184, No. 3, 1992

BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS

method and gene targeting. mouse model,

As the first step in creating a saposin-deficient

we initiated

the isolation of mouse saposin cDNA, and here we

report the primary structure of mouse saposin precursor, presaposin.

MATERIALS AND METHODS Isolation of Mouse Saposin cDNA A mouse liver eDNA library (purchased from Stratagene, La Jolla, CA) was screened by plaque hybridization. Human saposin cDNA obtained from ATCC was used as a probe. Human saposin cDNA was labeled with [32p]-dCTP (NEN, Boston, MA) by means of the random priming method using a commercially available kit (Takara, Tokyo, Japan). Positive clones were purified, and in vivo excised to the plasmid bluescript (Stratagene, La Jolla, CA). Inserts were digested with EcoRl (Takara, Tokyo, Japan) and sized on a 1% agarose gel. A mouse brain cDNA library (Clontech, Palo Alto, CA) was also screened to identify the 5'-end nucleotide sequence. Mouse saposin cDNA isolated from the liver library described above was used as a probe. Nucleotide Sequence Determination Several clones were characterized by their nucleotide sequence and restriction enzyme map. Restriction enzymes were purchased from Takara, Tokyo, Japan. Insert DNA was digested with several restriction enzymes and each fragment was further subcloned into the sequence plasmid pG~M-4Z (Promega, Madison, WI) and directly subjected to nueleotide sequencing by the dideoxynucleotide chain termination method (8), using an alkaline-denaturated plasmid as a template (9). Nucleotide sequencing was performed with commercially available kits (USB, Cleveland OH, or Pharmacia, UPSAIA, Sweeden). Computer Analysis Computer analyses for nucleotide and amino acid comparison were performed with the software program GENETYX (SDC, Tokyo). RESULTS AND DISCUSSION The mouse liver cDNA library was screened by hybridization with a probe specific

to the

plaques result

were

initially

indicated

saposin mRNA.

C-terminus

that

of human

obtained

saposin.

from

approximately

the 0.005%

Approximately

1 X of

i0

6

phages

the mRNA

All positive clones were plaque-purified,

in

fifty positive screened. the

This

liver

was

in vivo excised to

the plasmid bluescript, and then the inserts were sized on a 1% agarose gel. The longest clone designated as MSAP-24 was sequenced.

The sequence analysis

of MSAP-24 revealed a close similarity to human saposin cDNA as well as rat sertoli cell glycopr0tein.

However, the 5'-end of this clone was rearranged.

MSAP-24 was then used as a probe to screen a mouse brain cDNA library, and a longer

clone

analysis.

designated

as

MSAP-4

was

obtained

and

Several other clones were also sequenced,

mRNA were observed.

subjected

to

sequence

and heterogeneities

of

The nucleotide sequence and deduced amino acid sequence

1267

Vol. 184, No. 3, 1992

BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS

18 20 30 40 50 00 70 BO 90 GCGCCATGTACG~CTCGCCC.TCTTCGCCAGCCTTCTG~C~ACCGCTCTGACCA~CCCTGTCCAA~ACCCGAA~ACATGC~CTGGG~G~T MetTy~A~aL~A~aLeuPheA~a~erLe~LeuA~aThrA~aLeuThrSerP~Va1~nAspPr~LysThrCys~e~G~yG~ySer ]OO 110 120 130 140 150 , 180 178 180 CAG•AGTGCTGTGCAGA•ATGTGAA•ACGG•GGTGGACTGT••GG•CGT•AAGCACTG•CAGCAGA•GG•CTG•AGCAAGCCCA•AGCGA

A•aVa•LeuCysArgAspVa•LysThrA•aVa•AspCy•G••A•aVa•Lys•isCy•G•n••n•etV••Trp•e•LysP••ThrA•aL•s 180 200 210 228 230 240 250 288 270 AATCCCT•CCTT•CGACATAT•CAAAAC•GT•GTCACCGAAGCT•G•AACTTGCTGAAAGA•AAT•CTACGCAGGAGGA•ATCC•TCA•T

~erLeuPr~CysAsp~eCy~L~sThrVa~Va~ThrG~uA~a~yA~nLeuLeuLy~AspA~nA~aThrG~nG~uG~u~eLeu~isT~r 280 290 300 310 320 330 340 560 380 ACCT••AGAAGACCTGTGAGTGGATTCATGACTCCA•CCTG•C•GCCTCGTGCAAGGAGGTGGTTGACTCT•ACCTGCCTGTCATCCTGG

LeuG•uLysTh••y•G•uTrp••e•isAsp•erSerLeuSerA•a•erCysL••G•uVa•Va•Asp•erTy•LeuPr•Va•••eLeuAsp 370

380

380

400

410

420

439

440

450

ACATGATTAAGGGCGA•ATGAGCAACCCTGGGGAA•T•T•CTCT•CGCTCAACCTCTGCCAGTCCCTTCAGGAGTACTTG•CC•A•CAAA ~t1~eLy~G~yG~MetS~rAsnPr~G~yG~Va~ys~rA~aLeuAsnLeuCysG~nSerLeuG~nG~uTyrLeuA~aG~uG~nAsn 460 470 460 490 500 510 820 530 540 AC•AGAAAC•GCTTGAGTCCAACAAGATCCC•GA•GT•GACA•G•CCCGTGTGGTT•CCCCCTTCA••TCCAACA•CCC•CTCCT•CTGT

G•nLysG•nLe•G•uSerAsnL•s1•ePr•G•uVa•A•p••tA•aArgVa•Va•A•aPr•PheMet•erA•n••ePr•LeuLeuLeuTyr 550

880

570

580

590

800

810

820

830

ACCCTCAGGATCACCCCCGCAGCCAGCCCCAACCTAAGGCTAACGAGGACGT•TGC•AGGACTGTA••AA•C•GGTGTCTGATGTCCAGA P••G•nAsp•isP••Ar••erG•nPr•G•nP••LysA•aAsnG•uAspVa•CysG•nAspCysMetLysLeuVa•SerAspVa•G•n•hr 840 888 890 670 885 890 700 710 720 CTGCTGTGAAGACCAACTC••GCTTTAT•CA•GGCTTCGT•GACCACGTGAAG•AGGATTGTGACCGCTTGGGGCCAGGC•T•TCTGACA

A•aVa•L•sThrAsnSerSerPhe1•e••nG•yPheVa•AspH•s•a•L•s••uAspCysAspArgLeuG•yP••G••Va••erAsp•1e 730 740 750 780 770 780 790 800 810 TAT•CAAGAACTACGT•GACCA•TATTCCGAGGTCT•TGT•CA•AT•TTGAT•CACAT•CAG•ATCA•CAACCCAAG•AAATCTG•GTGC CysLysAsnTyrVa•AspG•nTyrSe•G•uVa•Cy•Va•••nMetLeuMet•i•MetG•nAspG•nG•nPr•Lys••u1•eCysVa•Leu 820 530 840 850 860 870 880 890 800 TG••TGG•TTCTGTAATGAGGTCAAGAGAGTGCCAATGAAGACTCTGGTCCCT•CCA•CGA•ACCATTAAGAACATCCTCCCTGCCCT•G

A~aG~yPheCysAsnG~uVa~LysAr~a~Pr~MetLysThrLeuVa~Pr~A~aThrG1uThr1~eLysAsn~eLeuPr~A~aLeu~u 910

820

930

940

850

980

970

980

890

AGATGATGGAC•CCTATGA•CA•AATCTGGTCCAG•CCCACAATGTGATTTTATGC•A•ACCTGTCA•TTT•T•AT•AATAAGTTTTCTG Met•etAspP••TyrG•aG•nAsnLeuV••••nA•a••sAsnVa•••eLeuCysG•nTh•CysG•nPheVa••etAsnLysPheSerG•u 1000 1010 1020 1050 1040 1050 1080 1070 1080 AGCTGA•TGTCAATAATGC•ACT•AGGAGCTCCTAGTTAAA•GTTTGAGCAACGCATGCGGA•TGCTCCCC•ATCCTGCCAGAACCAAGT

Leu••eVa•AsnAsnA•aTh•••u••uLeuLeuVa•Ly•••yLeuSerAsnA•aCysG•yVa•LeuP••A•pPr•A•a•rgThrLysCys 1090 II00 III0 1120 1130 1140 1150 I158 1170 GCCAGGAGGTG•TG••AACATTTGG•CCCTCCCT•TT••ACATCTTTATCCATGAGGTAAACCCCA•CTCTCT•T•CGGTGTGATCGGCC

G~nG~u~a~Va~G~yThrPheG~yPr~erLeu~euAsp~1ePhe~1e~sG~uVa~AsnPr~SerSerLeuCysG~yVa~1~eG~Leu 1180 I190 1200 1210 1220 1230 1240 1250 1260 TCT••GCTGCCCGCCCGGA•TTG•TGGAGGCACTTGAGCAGC•TGCGCCAGCCATTGTATCT•CACTGCTCAAA•AGCCCACACCGCCAA CysA~aA~aA~Pr~uLeuVa~G~uA~aLeu~uG1nP~A~aPr~A~a~eVa~Ser~aL~uLeu~ysG~uPr~ThrPr~Pr~Lys 1270 1280 1290 1300 1510 1320 1330 1340 1550 A•CAGCCCG•ACAG•CCAA•CA•TCGGCATTGCCCGCCCATGTGCCTCCTCAGA••AATGGT•GGTTCTGTGAGGT•TGCAAGAAACTGG

G••Pr•A•a••nPr•LysG•nSerA•aLeuPr•A•aHis•a•Pr•Pr•••nLy•Asn••yG•yPheCysG•uVa•CysL•sL•sLeuVa• 1360

1370

1380

1390

1400

1410

1420

1430

1440

TCCTCTATTTG•AACATA•C•TG•AGAAAAA•A•CACCAAG•AG•AAATCCTGGCCGCACTTGAGAAG••CTGCAGCTTCCTGCCAGACC LeuTyrLeu~uHisAsnLeuG~uLysAsn~erTh~LysG~uG~u~eLeuA~aA~aLeuG~aLysG~yCy~SerPheLeuPr~AspPr~ 1450

1480

1470

1480

1490

1500

1510

1520

1530

CTTACCA•AAGCAGTGCGAT•ACTTT•T••CT•A•TATGAGCC•TTGCTATT••AGATCCTCGTG•AA•TGATGGATC•TGGATTTGTGT TyrG~nLysG~nCy~AspAspPheVa~A~aG~uTyPG~uP~LeuLeuLeuG~u~eLeuVa~G~uVa~MetAspPr~yPheVa~Cys 1840

1550

1580

1570

1590

1590

1800

1510

1620

GCT•GAAAATTGGA•T•TGCCCTTCTGCCTATAAG•T••T•CTG•GAAC•GAGAAGTGTGT•TGGG•C•CTA•CTACTGGTGTCAG•ACA SerLys~eG~yVa~Cy~Pr~SerA~aTy~L~sL~uLeuL~uG~yT~rG~uLysCys~TrpG~yPr~erTyrTrpCysG1nAsnMet 1830 1640 1850 1880 1670 1880 1890 1700 1710 T••AGACTGCCGC••GAT•CAATGCT•TCGATCATTGCAAA•GCCATGTGTG•AACTAGTTTCCCAGCT•CAGAA•TCACCTACTTGTGG GluThrAlaAlaArgCysAsnAlaValAspHlsCysLysArgHisValTrpAsn 1720

1730

1740

1750

1780

1770

1780

1780

1800

GTCTA~G~TAATGAACACATAGATCTATTTGACTTAATAAGTAGGAACCCCCTTTGCCCTTCCCCCATCTC~TCTCCCTTACTGTAGCAT 1810

1820

1830

1850

1840

1860

1870

1890

1890

TTCTGTCATGTAAGAGGTGCTGACA•CCACTTCCGTGTCCCCTTTCTGCTCGAAGGATGA•GATACCTT•GGACATCGCTC•CCGGCTGC 1800 1910 1820 1930 1940 1950 1960 1970 1980 CCTTTTCACCCACCTGCTG•A•G••••••GT•A•CCAGAGGGCA•GAGC•TTTTCTGAGCCCTTTCTTGGTGTGTGGGGGATCTAT••CC 1990

2000

2010

2020

2030

2040

2050

2080

2070

ATCTCCTAC•ATGAGGG•GCTACCCAGCTTCCT•T••TACCAAG•A•TTAT••T•GAT•ATTAGAAGCACAGAAT•ATCAGGCC•T•AGA 2080

2090

2100

2110

2120

2130

2140

2150

2150

~CGAT~GAATG~C~ATTGTCATAGCACAGAGATTTCAGAAGCA~CTGCAG~TGGCTTGCTTGGGATGTTGCTGTCCCTGGGTCA~CCTTC 2170 2190 2190 2200 2210 2220 2230 2240 2250 CATTCTGCTTTCCTGTCTTCCCGTC•GCCTTGTTGGGGTTCTGTGGGGTAGGGTGGGGAG•GGAAACTTGTGAATGTAACTTGCCT•TGC 2280 2270 2280 2290 2300 2310 2320 2530 2340 ~GT~ACGTTCACGTGGGCCTGGTCTTTTGTGTGTGAGG~CCTTGACCGT~TG~CCTCTGCCTGGACTGT~T~GGGTCCT~CACGGCTT 2550 2560 2370 2380 2390 2400 2410 2420 2430 ••CCACCAC•TGTA•CTCTTGTT•ACCTGCCTGTTCACCTCATGAGTGAAGC•T•TGCC•G•CAG•G•GCCATGAACTGA•G•GTCTCTG 2440

2480

2480

2470

2480

2490

2500

2510

2520

T•TA•AGTAGAAGCTTCCTGTGC•TCC•GTTG•CAGGAGACAGC•TGTGCAGTTAAATGGAC•TAGATTTT•TTTTGCACTAAAGT•TCT 2530

2540

2550

2580

GTGACTTAATAAAGTTCTGTTAACCAACAGAA~AAAAAAAAAAAAA

Fig. i. Nucleotide sequence of the longest mouse saposin cDNA and deduced amino acid sequence. The putative polyadenylation signal is underlined. The amino acids are designated by three-letter codes.

1268

VOI. 184, No. 3, 1992

BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS

bIYALFLLASL LGAALAGPVL GLKECTRGSA VWCQNVLTAS DCGAVKHCLQ

HUNAN RAT NOUSE

****A-F*** ,AT**TS**Q DL~T*SG*** *L*RD*K**V **G*****Q* TVWNKPTVF~ LPCDICKDVV TAAGDMLKDN ATEEEILVYL EKTCDWLPLP M**S***A*]* *******T** *E**NL**** *******H** ****A*IHDS N**S***A*[* *******T** *E**NL**** **Q****H** ****E*IHDS NNSASCKEIV DSYLPVILDI IKGEMSRPGE VCSALNLCES LQK~LAELNH

QKQLESNKIP ELDMTEVVAP FMANIPLLLY PQDGPRSKPQ PKDN~GDVCQD *R***~*-** *V*LAR**** **S******* ***R***Q** **A*[E***** *K**-***** *V-MAR**** **S******* ***H***Q** **A*[E***** CIQMVTDIQT AVRTNSTFVQ ALVEHVKEEC DRLGPGMADI CKNYISQYSE *MKL*T**** ******S*** G*~D****D* ***~**VS** ****VD**** *NKL*S*V** **K***S*I* GF~D****D* ******VB** ****VD**** IAIQNNMHMQ PKEICALVGF CDEV~EMPMQ TLVPAKVASK NVIPALELVE V.V******* *****VM*** ****[*RV**R *****TE*I~ *IL*****TD VCV**L**** *****VLA** ~N**]*RV**K *****TETI~ *IL***~NMD PIKKHEVPAK SDVYCEVCEL LVKEVTKLID NNKTEKEILD AFDKMCSKLP ~YEQDVIQ*Q NVIF*Q**Q* VMRKLSE**I **A**ELLIK GLS*A**L** *YEQNLVQ*H NVIL*QT*QF VMNKFSE*~I **A**ELLVK GLSNA*GV** KSLSEECQEV VDTYGSSILS VLLEEVSPEL VCSMLHpCSG] TR ........ APA*TK**** LV*F*P*L*D **MH**N*NF L*GVISL**~ NPNLVGTLEQ DPARTK**** ~G*F*P*L*D IFIH**N*SS L*GVIGL*AA] RPELVEALEQ ....................... LPALTVH VTQPKDC~FC EVCKKLVGYL PAAAIVSALP KEPAPPKQPE EPKQS**RA* *PPQ*N*I**~ *******I** PAPAIVSALL KEPTPPKQPA QPKQS**PA* *PPQ*N~I~** *~**~**L~*

DNNLEKNSTK QEILAALEKG CSFLPDPYQK QCDQFVAEYE PVLIEILVEV

NDPSFVCLKI GACPS~HKPL LGTEKCIYGP SYWCQNTETA ARCNAVEHCK *~***~S** ~V*~J*Y*L* ******VW** G****~S*** *Q**~*D*** ~**G***S** *V***I*Y*L* ******VW** G*****M*** *Q****D*** RHVWN

Fig. 2. Comparison of the amino acid sequence of mouse saposin with those of human saposin and rat sertoli cell glycoprotein. Identical residues are identified by an asterisk. Each missing amino acid is indicated by a dash.

of m o u s e human

saposin

saposin,

is s h o w n

mouse

Figure

2.

acids.

Although

3'-end,

a mouse has

acid

sequence

70%. and

There

The rat

the

these

the

are

a

single

a human

and

i, rat

large

eDNA has

and

the

sertoli open

two

cell

and

glycoprotein

sequence

further

identical.

acid

study

support

of

frame

an approximate

90%

saposin

the e v i d e n c e

It is i n t e r e s t i n g

1269

that

eDNA

has

sertoli

that m i c e

3'

and

in

amino at

the

and a m i n o

both approximately

between

the

shown 557

The n u c l e o t i d e were

between

(AATAAA)

comparison

to

is

encoding

signals

sequence

extended

mouse

comparison

glycoprotein

signal.

sequence

revealed

similarity

cell

acid

polyadenylation

to the k n o w n h u m a n amino

amino

reading

o n l y one p o l y a d e n y l a t i o n

expression

results

saposin

is

nucleotide

nucleotide

Although

saposin,

similarities

sertoli

in F i g u r e

mouse

similarity,

noncoding not

cell

saposin

been

region.

performed,

glycoprotein

rats h a v e

and

and

a thirty-one

Vol. 184, No. 3, 1 9 9 2

BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS

Saposin A B C D

- S L P C D I C K T W T - - E A G N L L K D N A T - Q E E I LHY-- L E K T C E W I H D S S L S EDV-CQDCMKLVSDVQTA--VKTNSSFIQGFVD--HVKEDCDRL-GPGVS VI L - C Q T C Q F V M N K F - - S E L I V N N A T - - E E L L - V K G L S N A C G V L P D P A - S - G F - C E V C K K L V L Y L E H - N L E K - N S T - K E E I L-- A A L E K G C S F L P D P ---

Consensus

- - F - C Q - C K K V V - - - E - - N L - K - N A T - - E E I L .... L E K - C - - L P D P - L S

Saposin A B C D

AS .... CKEVVDSYLPVILD-MIKGE-MSNPGEVCSALNLCQS .... DICKNYVDQYSEVCVQMLM---HM-QPKEICVLAGFCNE T .... KCQEVVGTFGPSLLDIFI--HEV-NPSSLCGVIGLCAA

consensus

......

--YQKQCDDFVAEYEPLLLEILV--EVM-DPGFVCSKIGVCPS CKEVVD-Y-PVLLDILI--E-M-NP-EVCS-IGLC-S

Fig. 3. Multiple alignment of the four functional domains. A consensus sequence is given where two or more identical amino acids align. Closed triangles represent residues which align in all four domains, while the dash indicated alignment of the four potential N-glycosylation sites in mouse saposin.

amino acid insertion between domains C and D. homologous among those species.

Domains A and D were highly

On the other hand, domains B and C were less

homologous. Molecular analyses of the sulfatide activator deficiency and the variants of Gaucher's identified.

disease

have

of

saposin

substitution. activator

and

several mutations

have

been

For sulfatide activator protein deficiency, three mutations have

been identified so far. codon

been performed,

B,

Kretz et al. found a C to T transition in the 23rd

resulting

in

a

threonine

to

isoleucine

amino

acid

This base change eliminates the glycosylation signal in this

protein

(i0).

Zhang

et

al.

found

a

33-base

insertion

between

nucleotides 777 and 778 (numbered from the A of the ATG initiation codon)(ll). Most recently, Holtschmidt et al. found a G to C transversion that eliminated a cysteine residue in the 47th codon of saposin B (12). Gaucher's disease,

a G to T transversion resulting

in the

In a variant

of

substitution of

phenylalanine for cysteine in the 72nd codon of saposin C was identified (13). All of these residues found mutated in these disorders were conserved among humans, mice, and rats, which indicates the significance of these residues. Figure 3 shows the multiple alignment of four junctional domains. alignment

position of six cysteine

potential

glycosylation

site

in

residues,

each

domain

one was

proline

A complete

residue,

observed.

A

and

one

consensus

sequence for two or more identical amino acids occurring in the same position of the alignment suggested that the saposin gene arose by gene duplication.

1270

Vol. 184, No. 3, 1 9 9 2

BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS

mouse + 9

256 M H M Q D Q q P K E ATG CAC ATG CAG GAT CAG CAA CCC AAG GAA

mouse + 6

ATG CAC ATG --- GAT CAG CAA CCC AAG GAA

mouse

ATG CAC ATG ......... CAA CCC AAG GAA

s

5

AATAAAGTTCTTGTTAACCAACAGAAAAAAAA

mouse + 3

AATAAAGTTCTTGTTAACCAAC--AAAAAAAA

mouse

AATAAAGTTCTTGTTAACC .....AAAAAAAA

mouse

+

Fig. 4. The heterogeneities found in mouse saposin mRNA. A: In the coding region, a 9-base insertion (CAGGATCAG) or a 6-base insertion (GATCAG) was found. The number above Met indicates the amino acid number counting from initiation methionine. B: In the noncoding region, a 5-base insertion (AACAG) or a 3-base insertion (AAC) was found just before the poly(A)tail. Poly(A) attaching signals are underlined.

Sequence

analysis

heterogeneitles Besides

the

contained region. it was there

several

mRNA,

additional

there

stretch

These heterogeneities at exactly

were

either

disclose

the

were of

the same position three

or

five

both the coding and noneoding mouse

revealed

eDNA

should

mechanism

of

model of saposin-deficiency

at

either

presence

of

regions of the mRNA (Fig. 4).

least nine

the

two or

additional six

bases

mRNAs

which

the

coding

in

were also found in human saposln eDNA (12), and

poly (A) track in some clones.

The

clones

both in the coding and noneoding

shortest

an

of

and

sequence.

additional

base

The significanees

In the noncoding insertions

just

region,

before

of these heterogenelties

the in

regions have yet to be determined. facilitate

gene

studies

duplication

as

on well

developed by the molecular

genomic as

organization

providing

an

to

animal

approach.

ACKNOWLEDGMENTS This work was supported in part by Grants in Aid for the Scientific Research on Priority Areas, Molecular Basis of Single-gene Disorders from the Ministry of Education, Science and Culture, Japan.

REFERENCES i. O'Brien, J.S. and Kishimoto, Y. (1991) FASEB J. 5, 301-308 2. Stevens, R.L., Fluharty, A.L., Kihara, H., Kaback, M.M., Shapiro, L.J., Marsh, B., Snadhoff, K. and Fischer, G. (1981) Am. J. Hum. Genet. 33, 900-906 3. Inui, K., Emmett, M. and Wenger D.A. (1983) Proc. Natl. Acad. Sci. USA 80, 3074-3077 1271

Vol. 184, No. 3, 1992

BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS

4. Christomanou, H., Aignesberger, A. and Linke, R.P. (1986) Biol. Chem. Hoppe-Seyler 367, 879-890 5. Kleinschmidt, T., Christomanou, H. and Braunitzer, G. (1987) Biol. Chem. Hoppe-Seyler 368, 1489-1502 6. O'Brien, J.S., Kretz, K.A., Dewji, N.N., Wenger, D.A., Esch, F. and Fluharty, A.L. (1988) Science 241, 1098-1101 7. Nakano, T., Sandhoff, K., Stumper, J., Christomanou, H. and Suzuki, K. (1989) J. Biochem. 105, 152-154 8. Sanger, F., Nicklen, S. and Coulson, A.R. (1977) Proc. Natl. Acad. Sci. USA 74, 5463-5467 9. Hattori, M. and Sasaki, Y. (1986) Anal. Biochem. 152, 232-238 I0. Kretz, M. and Carson, G.S., Morimoto, S., Kishimoto, Y., Fluharty, A.V. and O'Brien, J.S. (1990) Proc. Natl. Acad. Sci. OSA 87, 2541-2544 ii. Zhang, X., Rafi, M., DeGala, G. and Wenger, D.A. (1990) Proc. Natl. Acad. Sci. USA 87, 1426-1430 12. Holtschmidt, H., Sandhoff, K., Kwon, H.Y., Harzer, K., Nakano, T. and Suzuki, K. (1991) J. Biol. Chem. 266, 7556-7560 13. Schnabel, D., Schroder, M. and Sandhoff, K. (1991) FEBS Lett. 284, 57-59

1272