Cloning and characterization of a member of the rat multidrug resistance (mdr) gene family

Cloning and characterization of a member of the rat multidrug resistance (mdr) gene family

Gene. 106 (1991) 229-236 ic) 1991 Elsevier GENE Science Publishers B.V. All rights reserved. 229 0378-l 119/91/$03.50 06069 of a member of the...

901KB Sizes 21 Downloads 82 Views

Gene. 106 (1991) 229-236 ic) 1991 Elsevier

GENE

Science

Publishers

B.V. All rights reserved.

229

0378-l 119/91/$03.50

06069

of a member of the rat multidrug resistance (mdr) gene family

Cloning and characterization (P-glycoprotein;

recombinant

Jeffrey A. Silverman,

DNA;

PCR;

rodents;

mouse and human

genes; gt vector;

aflatoxin)

Hannu Raunio, Timothy W. Gant and Snorri S. Thorgeirsson

Laboratory of Experimental Carcinogenesis, National Cancer Institute, National Institutes of Health, Bethesda, MD 20892 (U.S.A.) Received by J. Piatigorsky: 19 May 1991 Revised/Accepted: 23 May/l0 June 1991 Received at publishers: 30 July 1991

SUMMARY

The rat mdr gene family [genes encoding P-glycoprotein (Pgp)] was characterized and the complete sequence of a rat mdr cDNA was determined based on seven independent cDNA clones that correspond to the same gene. The longest of these clones contains a 4.3-kb insert which represents a full-length rat mdr cDNA. The longest open reading frame of this sequence is 3933 bp; the first ATG is at 103 bp, making the deduced protein 1277 amino acids long (141 kDa). This correlates well with previously identified Pgp. The sequence of this gene has a very high, > 900/6, degree of identity to the mouse mdrlh gene (also known as the mdrl gene) therefore, we designate it the rat mdrlb gene. Transcription of this gene begins at a single start point 151 nucleotides upstream from the start codon. We show here that the rat gene family is comprised of three members, which is consistent with previous data on other rodent species.

INTRODUCTION

The mdr gene family is comprised of two members in humans, and three members in hamsters and mice (Endicott and Ling, 1989; Ng et al., 1989; Van der Bliek and Borst, 1989). Complete cDNA clones of the human and mouse mdr genes and partial clones of the hamster mdr genes have been isolated and characterized (Chen et al., 1986; Endicott et al., 1987; Gros et al., 1986a,b; 1988; Van der Bliek et al., 1988). The sequences of these genes are

Correspondence to: Dr. J.A. Silverman, Bethesda,

Tel. (301)496-0731; Abbreviations:

Fax (301)496-0734.

aa, amino acid(s); 2-AAF,

base pair(s); cDNA, encoding cytochrome

transcription

Pgp,

2-N-acetylamino-fluorene;

bp,

DNA complementary to RNA; c.vcP450, gene(s) P45Os; DMSO, dimethylsulfoxide; kb, kilobase

or 1000 bp; mdr, gene(s) deoxyribonucleotide; reaction;

Bldg. 37, Rm. 3C25, NCI, NIH,

MD 20892 (U.S.A)

encoding

Pgp; nt, nucleotide(s);

ORF, open reading

P-glycoprotein(s)

start point(s).

oligo, oligo-

frame; PCR, polymerase

(alternative

abbreviation:

Mdr);

chain rsp,

highly conserved which suggests a critical role for their protein products in normal cellular physiology. Analysis of these sequences indicates that Pgp is comprised of two very similar halves connected by a less well conserved linker region (Chen et al., 1986). Each half of the protein contains six putative membrane-spanning regions and one nt-binding site. Although the function of nt binding in drug resistance is unknown, its integrity is critical to Pgp function; metabolic inhibitors such as azide and vanadate decrease drug resistance (Horio et al., 1988; Van der Bliek and Borst, 1989). Further, mutation of one or both of these sites destroys the ability of the protein to pump out drugs (Azzaria et al., 1989). In addition to the high degree of sequence identity between mdr genes, there is also a significant amount of sequence homology between the mdr genes and other transport proteins. Noteworthy among these proteins are bacterial transport proteins such as HlyB, MalK, HisP and OppD (Chen et al., 1986). A structural similarity to the recently isolated cystic fibrosis transmembrane conductance regulator gene (CFTR) and Hum1 and Hum2 genes, which may be important in antigen presentation in cells, has

230 also been described (Deverson et al., 1990; Riordan et al., 1989; Spies et al., 1990). Although the coding regions of the mdr genes are highly conserved between species, there are significant sequence differences particularly in the flanking regulatory regions (Hsu et al., 1990; Raymond and Gros, 1990; Ueda et al., 1987b). In our previous investigations we have demonstrated a functional regulation of the rat mdr gene(s) (Burt and Thorgeirsson, 1988; Gant et al.. 1991; Thorgcirsson et al., 1987). Increased r& expression was observed in rat livers following surgical partial hepatectomy and after administration of carcinogens. Further, exposure of isolated rat hepatocytes in primary culture to xcnobiotics results in increased levels of I~M/TmRNA and Pgp. To further investigate the specific mechanisms of regulation of thcnrdrgenes by xenobiotics, it was necessary to isolate and characterize members of the rat mdr gene family. In this investigation we describe the cloning and characterization of a member of the rat mdr gene family. This rat gene has a high degree of homology to the mouse mdrlh gene (also known as mdrl ; see below for a discussion of mdr gene family nomenclature) therefore, we designated it the rat rndrlh gene. We further demonstrate that the rat rlldr gene family is comprised of three members, as are the other rodent species examined to date.

RESULTS

AND

DISCUSSION

(a) Isolation and characterization of a rat mdr cDNA We initially screened a 1.gtlO phagc cDNA library that was constructed using mRNA isolated from the liver of a rat which was previously treated with aflatoxin Bl. Three clones, RDR2b13, 14, 15, were first identified which hybridized to the human pH DRSA plasmid. The largest of these clones. RDR-14, was approximately 3.2 kb (Fig. 1). The inserts of these clones were partially sequenced and found to correspond to the same gene. We tentatively identified this rat gene by searching the GenBank database with the FASTA program and using the Besttit program for an optimal alignment (Devereux et al., 1984). The highest degree of similarity was to the 3’ end of the human MDRl gene (data not shown). To obtain clones which contained the 5’ end of the gene we rescreened the cDNA library with a rat mdr probe corresponding to this region of the gene. This probe, 830 (Fig. l), was obtained by PCR amplification of a rat cDNA pool that was constructed with poly(A) + mRNA isolated from primary rat hepatocytes which were treated with cycloheximide. We have previously shown that treatment of rat hepatocytes in vitro with cycloheximide increases the expression of mdr RNA (Gant et al., 1991). This cDNA pool was PCR-amplified to obtain a rat DNA fragment

CLONES RDR15 RDR2bl3 RDR14 RDRB4 RDRBl ATG

Rat m&lb 3 I

I

PROBES

I-l

I

(No.)

Rat

830

3594

-

pHDR5A

Human Fig. I. Structure

of nrdr cDNA

clones and probes.

Five cDNA

corresponding

to the same rat mdr gene were isolated

cDNA

Also shown

library.

aligned

clones

from a rat liver

to the clones is a schematic

of the

full-length rat m&/b gene; the ATG start codon and TGA stop codon are marked. Three probes used in the library screening and Southern-blot analysis fragments, described

are aligned PCR

below the gene. Probes

amplified

from

rat

RNA

830 and 3594 are DNA (830) or DNA

in sections a and b. The human MDRI plasmid,

a gift from M.M. Gottesman

and has been previously

(3594)

pHDRSA, described

as was

(Chen

et al., 1986). B, Bg/II; P, Psri; S. SncI.

corresponding to nt position 830-2 182 in the human A4DRl gene which is farther upstream than any of the isolated rat cDNA clones. Screening of the cDNA library with this probe resulted in the identification of four additional clones. Upon subsequent sequence analysis, two of these clones were identified as the same gene as the previously isolated clones (Fig. 1). The largest of these new clones, RDR-Bl, was approx. 4.3 kb long (Fig. 1). The sequence of RDR-Bl is presented in Fig. 2. This clone contains 4254 nt with the longest ORF of 3933 nt. The first ATG in this ORF is 103 nt from the beginning of this clone. The assignment of this position as the first aa in the encoded protein is based on a comparison to the mouse and human mdrl genes. It is noteworthy that, unlike the mouse mdrl gene, there is no purine 3 nt upstream from this site to correspond with the eukaryotic initiator consensus sequence of Kozak (1987); there is, however, a G at position + 4 which has also been indicated to be important for eukaryotic initiation and may compensate for the lack of a purinc at nt position -3. There are multiple stop codons in both alternate reading frames such that no peptide over 75 aa would be translated. Starting at the ATG at nt position 103, the ORF would contain 3831 nt encoding a protein containing 1277 aa (141236 Da). The first stop codon, TGA, encountered in this reading frame is at position 3832; it is followed by a 321-bp noncoding region.

GCTCCCATCT TCGAGGCTCA GCTCMCTCA

GAGCTACTTC TTCCAAATTC

-!EF CTTAACGGM L N G

GAGCAGACM R A 0

CTGTGCATGG

EEG

6

AAAAGAGTAA AAAGGAGAAG GAGAAGAMC K K S K K E K E K K

CTGCTGTTGG CATATTCGGG ATGTTTCGCT P A Y G I F G N F R

ATGCAGATTG GCTTGACAAG Y AD Y LDK

CTCTGGGAAC TCTCGCTGCT ATCATCCACG A L G T L A A 1 I H

GAACCCTGCT

TCGGATACAT GACAGATAGT TTTACCCCM F G Y N T D S F T P

GCAGAGACCC

CGAGCGATTA R Al

CTAATCMAG T N Q

CCGTCAGCGA CACGAGTCTG GAGGAGGACA T V S D EED T S L

TGGCCATGTA HAHY

TGCCTACTAT AYY

TTGGTGCCGG

I

126

GTTGCCTACA V A Y

TCCAGGTTTC ACTTTGGTGC CTGGCAGCTG 10 v s L Y c L A A

GGAGACAAAT 6 R 0

TTTTCCATGC F F HA

CATCATGAAT CAGGAGATAG IM N 9 E I

GCTGGTTTGA CGTGAATGAC G Y F D V N D

498 166

GClGGGGAGC A G E

TCAACACCCG TCTCACAGAT GACGTCTCCA L N T R L T D D" S

AAATTAATGA CGGAATTGGT GACAAACTTG K I N D G I G D K L

GAATGTTCTT TCAGTCCATA ACGACATTTT G N F F T T F 0 S 1

CAGCCGGTTT TATAATAGGA S A G F I I G

618 206

TTTATAAGTG F I s

GTTGGAAGCT AACCCTTGTA ATTTTGGCCG G Y K L T L V IL A

TCAGCCCTCT TATTGGGTTG 1CATCTGCCA Y s P L I G L S 5 A

TGTGGGCAAA "YAK

GGTACTGACT TCATTTACTA V L T S F T

ATAAGGAACT CCAGGCTTAT N K E L 0 A Y

738 246

GCGAAAGCTG A K A

GAGCAGTTGC CGAAGAAGTC TTAGCAGCCA G A V A E E V L A A

TCAGMCTGT I R T

GATTGCGTTT CGAGGACAAA I A F G G 9

AGMGGMCT K K E

TGAMGGTAC MTAAMATT N K N E R Y

TAGMGMGC L E E

858 286

GGCATAAAGA

MGCCATCAC

TTGCCTACCT

GTTGGTCTAT

CACTGGCATT

CTGGTATGGG

TCCTCTCAA,, TGAATATTCT

L

c

n

GIK

G

9

ATAATTGATA I

TGAMTCAAC S E IN

ID

AGTACACATA S T H

GGCCMCATT

KAIT

ATTGGACAAG 1

GAACTTCTCA MGATGGGCA K N F 5 K" G

18

TACATCTTGG CGGACTTCGC GAAGGAAACC CGGAGTGTTA CGTGAGGTCC TGATGGAGTT TGAAGAGGX

A

N

TCCATAGGTA 1

SIG

G

I

T

A

L

Y

TCCCCTCCTG L

P

L

ATGCTGGTGT L

H

L

v

ACACMGATT AGGCAGAAGT I H K I R 0 K

V

L

L

v

GCGTCTTATG Y

AS

Y

A

L

A

L

F

U

Y

TACACGGGCA Y

T

G

ACCTCCTTGG G

T

S

L

5

II 0

IG

V

A

L

S

GCATTCTGAC P

H

s

258 0

TGTGCTCATC G

v

L

N

E

Y

86 378

TAAMGAGTT K R V

A

978 5

326

TGCTTACCGT CTTCTTCTCT ATTTTATTGG V L T V F F S I L L

GGACTTTCAG TATTGGACAT TTAGCCCCAA G T F S I G H L A P

ACATAGAAGC CTTTGCAAAT GCAAGAGGGG N I E A F A N AR G

CAGCCTATGA AATCTTCAAG A A Y E I F K

1098 366

ATGAGCCAAG

AGGGACACM

ATTTGGMTT

ACCCATCACG

V

1218 406

N

C

CATCGACAGC

P

S

MGATCTTGA K IL

AGGGCCTCM KG L

N

GAGGGCGAGG E Gt

TCAGTATTGA V S ID

ATTCGCTATG IK Y (IGpACAF

ID

TTCTCMCCA s

F

S

T

KG

H

ATAATG6G.U S

G

L

E

TMAAATGTT F

K

N

TACTTCAACT V

Y

F

N

Y

P

S

AAGTGAAGTT R

S

E

CGACCCCATA D P I

1338 446

TCAATGTGAG GTATCTGCGG GAMTCATTG IN V R Y L R E I I

GGGTGGTGAG GVVS

TCAGGAACCC QEP

GTGCTGTTTG VLF

CCACCACGAT ATT,

TGCCGAAPAC A E N

1458 486

GCCGAGAAAA CGTCACCATG GATGAGATAG V T ,, G R E N 0 E I

AGAAAGCTGT CMGGMGCC E K A V K E

MTGCCTATG N A Y

ACTTCATCAT GAAACTGCCC CACAAATTTG D F I " H K F K L I'

ACACCCTGGT TGGTGAGAGA 0 T L V G E R

1578 526

TysGG;GGGACAGAAACAG QKQ

AGGATCGCCA RIA

TTGCCCGGGC CCTGGTCCGC AACCCCMGA IA A A L v If N P K

TCCTTTTGTT GGATGAGGCC ACGTCAGCCT I L L L DE A T S A

TGGACACAGA AAGCGAAGCC L D T E S E A

1698 566

CCGCTCTGGA TMGGCTAGA GMGGCCGGA AA L D K A R E G R

CCACCATTGT GATAGCTCAC CGCTTGTCTA T T I V IA H R L S

CAGTGCGCAA TGCTGACGTC ATTGCTGGTT T V R N A D V IA G

TTGATGGTGG TGTCATTGTG F 0 G G v I v

1818 606

ATCATGAAGA N HtE

GCTCATGMA GAGAAGGGCA L" K E KG

TTTACTTCM IY F

K

ACTTGTCATG ACACAGACTA L v n T 0 T

GAGGAAATGA MTTGAACCA GGAMTMTG R G N E IE P G N N

CTTATGMTC A Y E

S

D

1938 646

ACTGGTGCCT T G A

CTGAGIIGAC TTCAGAAGM TCAA4ATCTC S E L T S E E 5 K 5

CTTTMTAAG P L I

R

GAGATCMTT CGCAGAAGTA R S I R R S

TCCACAGAAG ACAAGACCAG GAGAGAAGAC I H R R ERR 0 D 0

TTAGTTCGM L S S

K

AGAGGATGTG E D V

20% 686

GATGAAGAIG D E 0

TGCCTAIGGT TTCCTTTTGG v P H v SFU

AGCTAAATAT KLNI

TAGTGAATGG CCCTATTTAG S E U P Y L

TTGlGGGTGl ACTTTGlGCT GTTATAAATG V V G V L C A V IN

GGTGCATACA G C ID

ACCAGTGTTT P V F

2178 726

GCCATAGTGT AlV

TTTCAMGAT F S K

GTMCTTGTT C N L

F

TTCCCTTCTC TTTCTGGTCA s L L F L V

TGGGMTGAT H G HI

TTCTTTTGTT 5 F v

2298 766

1CTICAMlC V F K

S

CAIGCTGCGA CAGGATATAA H L R Q D I

GCTGGTTTGA S Y F"

TGACCATAAA D H K

2418 606

a

GAGCAAGGAA E

9

6

I

9

D

ATCAGGACCA 1 R T

CAGATCCTM QIL

TGTAGGGGTT TTTTCMGAG V G V F S R

ACGTACTTCl T Y F

TTCAAGGCTT CACATTTGGC AAAGCTGGAG F Q G F T F G K A G

AAYF"";"

G;TC~Tr,TTAC~~;

"":"";&A"," CTTCTMTGT A S N

V

2538 846

ClCATTGTCT L IV

TGGGTGGAAT TATTGAAATG AAACTGTTGT L G G I I E H K L L

CTGGTCAAGC CTTGMGGAC S G 9 A L K D

2658 686

CTTCCGCACT GTTGTCTCTT F R T v v s

TGACTCGG6A GCAGAAGTTT GAAACTATGT E T H L T R E P K F

ATGCCCAGAG CTTGCAGATA Y A 9 5 L 9 I

2778 926

CTTCACCCAG

ATTTTTCCTA TGCTGCTTGT Y F S Y A AC

GTGCCTACTT GAYL

2898

TAGAGATCTC LEIS

CMTTWAAA A I E

R

ATGCTTTGM N A L

CCATACAGM P

Y

K

GAAAGCACAC GTCTTTGGGA K A H V F G

TCACCTTCGC 1TFA

S

ACCTTGGCAC AGGMTTATC N L G T G I I

E

N

0

GGCTTGCTGT AGTTACCCAG AATGTAGCAA N V A R LA V V T Q

MGAAAGAGC

GCTACAGAAG A T E

CCAAAGTGAC

ATGGGCTCCA N G S

TAAAGGGGCT K GA

TACTTGTAGT MTTAIACCA L L v v I IP

TGGGAAGATC GKI

CAACGGMTT QRN

AGATCCTCAC CMGCGACTC CGATACAIGG R Y N E I L T K R L

TCTTAGTCTA TGGCTGGCAG CTTACACTTT v L v Y L T L G U 0

K

A

ACGACGACCA TGAAACCAM D D 0 H tTK

llAlCCTTAG L 5 L

K

MCAGTGGCT NSG

N

AGAGGCTCTA QRLY

G

CCTGGTTGGC LVG

IN

CAGCTGCTGC QLL

V

CGGACAGGAC

0

CACMCTGTC TTY

V

AGACGGTAGC DTVA

P

GTGGGMAAG CGKS

GTGGTTCACG

CCTGMGGTG MGAGCGGGC KSG L K V

ACCAGACAGT K

F

T

9

GCCATGATTT A H I

TTCCGGTTCG FRF

GGTGGCACGA V AR

966

GMCTCATGA E L H

CGTTTGAAAA TGTTATGTTG GTATTTTCTG T F E N v n L V F 5

CTGTTGTCTT TGGTGCCATG GCAGCAGGGA A V V F G A H A A G

ATACCAGTTC ATTCGCTCCT GACTACGCGA N T S S F A P D Y A

AGGCCAAAGT CTCGGCATCC K A K V 5 A 5

3018 1006

CACATCATTG H I 1

GGATCATTGA GMAATCCCC GAGATIGACA K I P E ID G I I E

GCTACAGCAC GGAGGGCTTG MGCCTMTT K P N S Y S T E G L

GGTTAGMGG U L E

AAATGTGAAA TTTAATGGAG N V K F N G

TCAAGTTCAA CTATCCCACC Y P T V K F N

3138 1046

CGACCCAACA R P N

TCCCAGTGCT TCAGGGACTG AGCTTCGAGG I P v L S F E 0 G L

TGAAGAAGGG VKKG

GCAGACGCTC DTL

GCAGCAGTGG GSSG

CTGCGGGAAG CGK

TCCAGCTGCT VPLL

3258 1086

TACMCCCCA Y N P

TGGCTGGMC NAG

AAATAAAGCA ETKD

ACTCMTGTC CAGTGCGTCC L N V PC V

GCCGAGAACA A E N

TCGCCTACGG AGACMCAGC CGTGTCGTGT I A Y G 0 N S R V V

AGAGTGGGAG R V G

ACAMGGGAC DKGT

TCAGCTGTCG QLS

ACGGAGAGTG T E S

AAAAGGTCGT EKVV

CCAGGAAGCG PEA

AACGGCCAGG

AGCACGGTW; STY

CGAGCGCTTC E R F

GCGCACTGGG CATTGTGTCC CAGGAGCCCA I v 5 R A L G D E P

TCCTGTTTGA CTGCAGCATC I L F D c s 1

,126

CTCATGAGGA GATCGTGAGG GCCGCCAGGG 5 H E E 1 V R A AR

AGGCCMCAT E AN

CCACCAGTTC ATCGACTCAC IDS 1 HP F

TGCCTGAGAA ATACMCACC Y N T L P E K

3498 1166

GGCGGGCAGA GGQ

AGCAGCGCAT KQRI

CGCCATCGCG CGCGCCCTCG AI A R A L

TCAGACAGCC TCACATCTTA CTTCTGGATG H I L L L D V R q P

AAGCGACATC AGCTCTGWIT E AT 5 AL 0

3618 1206

CTGGACMAG LDK

CCAGGGAAGG AREG

CCGCACCTGC GTTGTGATCG R T C v v I

CGCACCGCCT GTCCACCATC CAGAACGCAG A H R L S T I P N A

ACTTGATCGT GGTGATTCAG D L I V V I9

37.38 1246

TCAAGGAGCA UXCACCCAC V K E H GTH

CAGCAGCTGC DQL

TGGCCCAGAA AGGCATCTAT TTCTCGATGG G I Y F S H L A Q K

TTCAGGCTGG AGCAMGCGC V Q A G AKR

TCATGAGCTG S-

GGAGTATTTG

AGGTGCTAAG

3858 1277

TATTTCTAAT

ATTGGTGTTC

AAACATGGCA

CGTAACCAM

GTTAAAAGGT

TAAAAGCACT

GTTAMGGTA

ATTTCATCM

GACG%AAGC

CTTCAGAGAC

TTCATPATTA

AATGAACCGA

3978

,UTTG&UM

AAAATCATTA

AACAGGGCCA

CATTTTTTM

TTGTATTATG

TGATTCMGA

GAACATATAG

TTTTTTTAAA

AAGAAATGTG

TAGTTTTGTT

TCAGTTTTTT

TAATTTCTAC

4098

CCTATTCCCT

TAAATGATCA

TAMGGCTGT

AMMGCACT

ATTTTTTTGC

GGC

N

Fig. 2. Nucleotide

G

9

sequence

T

AGTGTTTCTA GATGGCAAAG V F L D G K

CGCCTGGTGG RLV

G

of the rat mdrlb

cDNA

and aa sequence.

231

138 46

4151

The nt sequence

is numbered

in a 5’-to-3’

orientation

starting

at the ATG start

codon and ending at the first in-frame stop codon (bold underline). The deduced aa sequence is shown below the nt sequence. The aa symbols are aligned with the third nt of each codon. The aa corresponding to the nt-binding site motif A and ATP-binding active transporter consensus sequence are underlined. This sequence has been submitted to the GenBank database under accession No. M62425.

232 TABLE

I

Comparison

of the rat, mouse,

hamster

and human

mdr/MDR gene sequences

Human

Genes”

Mouse

MDRl

Hamster

MDR2

mdrla

mdrl b

mdr2

mdrla

79.1

71.5

82.5

90.4

69.6

78.1

80.4

75.1

47.7

76.8

83.4

77.9

82.0

100.0

100.0

nldrlb

Yb nt homology” Rat mdrlh Hamster Hamster

mdrlb’ ipgp2J mdrla’ (pgpl)

83.5

64.2

89.3

17.9

65.0

Mouse mdr2

71.1

86.1

71.1

69.4

100.0

Mouse mdrlb (mdrl)

78.7

70.6

84.2

100.0

Mouse mdrla (mdr3) Human MDR2 (MDR3)

82.2

71.6

100.0

14.9

100.0

“ Alternative h Comparison PCGene accession

gene designations of the rat, mouse,

package numbers

are in parentheses. hamster

(Intelligenetics, of the sequences

and human

mdr/MDR gene sequences.

Palo Alto, CA). The complete used for this comparison

sequences

The y0 identity

of these genes, coding

are as follows: hamster

was calculated and noncoding,

mdrla, Ml7897 (Endicott

using the NALIGN et al., 1987); hamster

(Endicott et al., 1987); human MDRl, Ml4758 (Chen et al., 1986); humanMDR2, M23234 (Van der Bliek et al., 1988); 1990); mouse mdrlb Ml4757 (Gros et al., 1986a). and mouse mdr2, JO3398 (Gros et al., 1988). L Note that the hamster

mdrla and mdrlb sequences

program

was used for this comparison,

of the The

mdrlb, Ml7896

mouse mdrla, M33581 (Hsu et al.,

are incomplete.

is an imperfect polyadenylation signal, AATTAAA, at position 3964. It is unlikely, however, that this represents the authentic signal which is probably not contained in this clone. The predicted size of the rat Pgp is in good agreement with previously described Pgps from other species (Endicott and Ling, 1989). Comparison of this rat mdr gene with the other known mdr sequences reveals a high degree of sequence conservation. The homology of this clone to the other mammalian mdr genes is shown in Table I. The entire available sequences of these genes, coding and noncoding regions, was used for this comparison (Chen et al., 1986; Devault and Gros, 1990; Endicott et al., 1987; Gros et al., 1986a; 1988; Van der Bliek et al., 1988). This table lists also the alternate names by which certain mdr genes are known. These genes were originally named as they were isolated. There

However, a functional nomenclature system has recently been proposed which divides these genes into those that are associated with drug resistance, class 1 genes, and those not yet associated with drug resistance, class 2 genes (Scotto et al., 1986). This nomenclature system also facilitates cross species comparison. In mice and hamsters two genes, mdrla and mdrlb, have been associated with drug resistance which corresponds to a single gene in humans. All three species also have a highly homologous gene, mdr2, which has yet to be ascribed a function in drug resistance. It is possible that the protein product of these class-2 genes transports yet unidentified substrates in a similar fashion to Pgp. The highest degree of similarity of the rat RDRBI cDNA is to the mouse mdrlb gene (also referred to as the mdrl

gene) with an overall identity of 90.4 y0 (Table I). Thus, we have designated the clone presented here the rat mdrlb gene. Clearly, the mdr genes are highly conserved between species but as indicated in Table I the rodent genes have a higher degree of homology to each other than to the human gene. This indicates sequence divergence since the speciation of rodents from primates. The least identical region of the rat mdrl b gene from the mouse mdrl b gene is in the 3’-untranslated region with a homology of 69:; (data not shown). The predicted protein of this rat mdr gene contains the important structural features identified in the Mdr proteins from other species (Endicott and Ling, 1989; Gottesman and Pastan, 1988). The bilateral symmetry of the molecule with each half containing several membrane spanning regions and one nt-binding region is evident in the predicted protein of this gene. A hydropathic plot for the rat Mdr 1b protein demonstrates a near-identical plot to that of the mouse Mdrlb (Fig. 3) and human MDRl proteins (data not shown). The presence of six membrane-spanning domains indicates that these proteins are integral membrane proteins. Such a localization has been previously confirmed using epitope-specific antibodies to various domains of Pgp (Yoshimura et al., 1989). The sequences surrounding the nt-binding regions are particularly conserved between species and are characteristic of the ATPbinding cassette family of active transport proteins. This has led to the hypothesis that the Pgps are members of this superfamily of membrane transport proteins which now includes over 30 members (Hyde et al., 1990). Members of this superfamily of transport proteins are

233

1

-5e-,,,,,,,,, 1

Fig. 3. The hydropathy computer

program

are negative

400

ZBB

,,,,,,,,, 208

bat3

,I,,,,,,, 480

which implements

along the ordinate.

The numbers

,,,,I,,,, 888

The hydropathy

the analysis method of Kyte and Doolittle along the abscissa

represent

found in both prokaryotes and eukaryotes and are thought to be important in ATP-dependent transport of a diverse range of compounds. These proteins include the bacterial transporters HlyB, which is involved in the movement of hemolysin across the bacterial membrane, and MalK and HisP, which are important in metabolite transport. In yeast, the STE6 gene is important in the transport of mating factor a and is a member of this family (McGrath and Varshavsky, 1989). The recently isolated cystic fibrosis transmembrane conductance regulator gene and the HAM1 and HAM2 genes, which may be important in the antigen recognition system of the MHC locus are also members of this gene superfamily (Deverson et al., 1990; Riordan et al., 1989; Trowsdale et al., 1990). The high degree of evolutionary conservation of this sequence motif indicates an important

iem3

,,,,,,,/, 608

plot for the rat (A) and mouse (B) Mdrl b proteins.

package

808

m38

III,,, leaa

plots were obtained (1982). Hydrophobic

the aa positions

aa

,I,,,Tr lzaa

aa

using the SOAP program

in the PCGENE

regions are positive and hydrophilic

regions

in the proteins.

role in normal cellular physiology and possibly common ancestral gene (see section c).

indicates

a

(b) Mapping of the transcription start point (tsp) Primer extension analysis was used to map the distance of the tsp from the start of translation. A 25mer oligo corresponding to nt positions 17-41 relative to the ATG start codon was used as a primer for the reverse transcription. A 192-nt extension product was observed in RNA extracted from isolated hepatocytes maintained in primary culture (Fig. 4, lane 1). We also observed an identical size extension fragment in RNA isolated from hepatocytes treated with 2-AAF and from the livers of rats previously treated with that agent (lanes 2 and 3, respectively). The greater intensity of the band in the drug-treated

234

1

23

1

GATC

2

3

4

5

6

151 nt-

Fig. 5. Southernblot

analysts

DNA was digested

with BarnHI

lowing electrophoresis Magnagraph Fig. 4. Primer extension

analysis to determine

was hybridized

to a “P-labeled

myeloblastosis

virus

reaction urea

gel and

(DMSO 2-AAF

products

treated

ladder (G,A,T,C)

oligo probe and transcribed transcriptase

were electrophoresed

then

treated)

reverse visualized

hepatocytes;

the tsp. Total RNA (50 ng) (Archer

et al., 3985). The

in an So, polyacrylamide/7

by autoradiography. 2,2-AAF

with avian

treated

Lanes:

M

rat liver. A dideoxy

was used to measure

the size of the extension

3

-

2

1)N.A. Gcnonuc

(lanes 1-3) or EcoRI (lanes 4-6). Fnlin OX?,, agarosc gels the DNA was transferred to

membranes

(Micron

Separations.

to a single exon wdr probe,

Inc.. Westboro, 3594, described

in

section E. Lanes: I and 4, rat DNA; 2 and 5, mouse DNA; 3 and 6, human DNA. .4 1-kb ladder margin.

(BRL, Gaithersburg,

MD) is marked

on the right

1. control

(20 PM) hepatocytes;

(five days at 20 mg/kg)

nylon

MA) and hybridized

of rat, mouse and human

-

3.

sequencing products.

samples indicates that this agent increased the expression of the mdrlb gene in the liver. These data support our previous observation of enhanced mdr expression with this agent (Burt and Thorgeirsson, 1988; Gant et al., 1991). These data further show that the tsp is 15 1 nt 5’ to the start codon. No difference in the tsp was observed between the RNA isolated from the rat liver and the RNA from the isolated hepatocytes. Thus, the rat 5’-untranslated region is similar in length to that observed for the mouse mdvl b gene (Raymond and Gros, 1989). Also indicated by these data is that the rat gene, like the mouse mdrlb gene, has a unique t.rp. This is in contrast to the human MDRl and murine

mdrlo genes for which two tsp have been described in drug-resistant and normal cell lines (Hsu et al., 1990): Ueda et al., 1987a,b). Isolation and characterization of a genomic clone containing the upstream sequences of this rat gene will permit us to further characterize this regulatory region.

(c) Identification of the members of the rat mdv gene family Previous investigations have demonstrated that the mdr gene family consists of two genes in humans and other primates and three genes in mice and hamsters, however, no data are yet available on the rat gene family (Chin et al.. 1989; Ng et al., 1989). To identify the number of genes in rats, genomic DNA was isolated from rat liver, digested with the indicated restriction endonuclease and examined

235 by Southern-blot analysis. DNAs were also analyzed analysis. obtained

Human and mouse genomic to serve as controls for the

The probe used was a 200-bp rat gene fragment by PCR that corresponds to exon 28 based on the

human and mouse mdrl genes (Fig. 1) (Chen et al., 1990; Raymond and Gros, 1989). Upon digestion with each BamHI and EcoRI we observed three restriction fragments in the rat DNA samples (Fig. 5, lanes 1 and 4). Similar results were obtained with XbaI and Hind111 (data not shown). We also observed three restriction fragments in the mouse (lanes 2 and 5) and two fragments in the human DNA samples (lanes 3, and 6). This confirms previous investigations which examined the number of genes in mice and humans (Chin et al., 1989; Ng et al., 1989). Together these data suggest that these rodent species have three members of the mdr gene family whereas primates have two. The presence of three genes in rats, mice and hamsters, while primates only have two, indicates that the third gene may have arisen after the divergence of rodents from primates. Of the two rodent genes associated with drug resistance the mdrl a gene has been suggested to be more closely associated with the human MDRI gene (Chin et al., 1989). The close homology of the sequence of mdrlb to that of mdrlu further indicates that this third gene may have resulted from a later gene duplication. Previous work by us and others demonstrates that mdr genes are co-regulated with other genes involved in the protection of cells by xenobiotics, e.g., members of the cycP4.50 gene family (Burt and Thorgeirsson, 1988; Fairchild et al., 1987). For example, exposure of isolated hepatocytes to 2-AAF or 3-methylcholanthrene causes a significant increase in the expression of mdrlb and cycP450-IA mRNAs and in Pgp levels (Gant et al., 1991). This increased expression is, at least in part, mediated by increased transcription of the mdr gene(s) (T.W.G., J.A.S., S.S.T., in preparation). The isolation and characterization of the rat mdr gene presented here and the yet to be cloned other members of the rat gene family will permit further investigation of the regulatory mechanisms of these genes in this important model system. (d) Conclusions (1) A cDNA encoding a complete rat mdr gene has been isolated and characterized. This gene has a high degree of sequence identity to the mouse mdrlb gene, thus it was designated the rat mdrlb gene. (2) The tsp for this gene is 15 1 nt upstream from the start codon. (3) The rat mdr gene family has three members. This is similar to the mouse and hamster gene families but in contrast to the human gene family which has two members.

ACKNOWLEDGEMENTS

We

acknowledge

Dr.

P.A.

Marino

and

Dr.

M.M.

Gottesman for their comments on this manuscript and A.D. Olson for his assistance with the computer analyses.

REFERENCES Archer,

T.K., Tam, S.P., Deugau,

K.V. and Deeley, R.G.: Apolipoprotein

CII mRNA levels in primate hepatocarcinoma

cell

liver. Induction

line.

HepG2.

by estrogen

J. Biol.

in the human

Chem.

260 (1985)

1676-1681. Azzaria,

M., Shurr, E. and Gros, P.: Discrete mutations

predicted

to confer multidrug Burt,

resistance.

R.K. and Thorgeirsson,

resistance

of MDR-1 multidrug

P-450 genes in rat liver by xenobiotics.

J.

Inst. 80 (1988) 1381-1386.

Chen, C., Chin, J.E., Ueda, K., Clark, D.P., Pastan, and Roninson,

LB.: Internal

duplication

proteins

cells. Cell 47 (1986) 381-389.

Chen, C., Clark, D., Ueda, Roninson, LB.: Genomic (MDRI)

I., Gottesman,

and homology

in the mdrl (P-glycoprotein)

transport resistant

resistance

in the

Mol. Cell. Biol. 9 (1989) 5289-5297.

S.S.: Coinduction

and cytochrome

Natl. Cancer

introduced

sites of the m&l gene abolish its ability

nucleotide-binding

K., Pastan, organization

M.M.

with bacterial

gene from multidrug-

I., Gottesman, M.M. and of the human multidrug

gene and origin of P-glycoproteins.

J. Biol. Chem.

265 (1990) 506-514. Chin, J.E., SoBir, R., Noonan,

K.E., Choi, K. and Roninson,

I.B.: Struc-

of the human MDR (P-glycoprotein)

ture and expression

gene family.

Mol. Cell. Biol. 9 (1989) 3808-3820. Devault,

A. and Gros,

confer multidrug

of the mouse mdr gene family

P.: Two members

resistance

with overlapping

but distinct drug specili-

cities. Mol. Cell. Biol. 10 (1990) 1652-1663. Devereux,

J., Haeberli,

sequence

P. and Smithies,

analysis

programs

0.: A comprehensive

for the VAX. Nucleic

Acids

set of Res. 12

(1984) 387-395. Deverson, E.V., Gow, I.R., Coadwell, J., Monaco, J.J., Butcher, G.W. and Howard, J.C.: MHC class II region encoding proteins related to the multidrug

resistance

family of transmembrane

transporters.

Nature

348 (1990) 738-741. Endicott, J.A. and Ling, V.: The biochemistry of P-glycoprotein mediated multidrug resistance. Annu. Rev. Biochem. 58 (1989) 137-171. Endicott,

J.A., Juranka,

P.F., Sarangi,

and Ling, V.: Simultaneous in drug-sensitive

F., Gerlach,

expression

Chinese hamster

J.H., Deuchars,

of two P-glycoprotein

K.L. genes

ovary cells. Mol. Cell. Biol. 7 (1987)

4075-4081. Fairchild,

CR.,

Ivy, S.P., Rushmore,

T., Lee, G., Koo, P., Goldsmith,

M.E., Myers, C., Farber, E. and Cowan, K.H.: Carcinogen-induced mdr overexpression is associated with xenobiotic resistance in rat preneoplastic liver nodules and hepatocellular Nat]. Acad. Sci. USA 84 (1987) 7701-7705. Cant,

T.W., Silverman,

and Thorgeirsson,

Carcin. Gottesman,

Proc.

J.A., Bisgaard,

H.C., Burt, R.K., Marino,

P.A.

S.S.: Regulation

of 2-acetylaminofluorene

and

3-methylcholanthrene family expression

carcinomas.

induction in primary

ofmdr and cytochrome

hepatocyte

cultures

P45OIA gene

and rat liver. Mol.

(1991) in press. M.M. and Pastan,

I.: The multidrug

transporter,

edged sword. J. Biol. Chem. 263 (1988) 12163-12166. Gros, P., Croop, J. and Housman, D.: Mammalian multidrug gene: complete rial transport

cDNA proteins.

sequence

indicates

Cell 47 (1986a)

strong homology 371-380.

a double resistance to bacte-

236 Gros, P., Neriah, Y.B., Croop, J.M. and Housman, D.E.: Isolation and expression of a complementary DNA that confers multidrug resistance. Nature

323 (1986b) 728-731.

Cell. Biol. 8 (1988) 2770-2778. Horio, M., Gottesman, M.M. and Pastan, I.: ATP-dependent of vinblastine in vesicles from human multidrug-resistant

transport cells. Proc.

Hsu, S.I.-H., Cohen, D., Kirschner, L.S., Lothstein, L., Hartstein, M. and Horwitz, S.B.: Structural analysis of the mouse mdrla (P-glycoprotein) promoter

reveals

the basis for differential

transcript

hetero-

5774.2 cells. Mol. Cell. Biol. 10 (1990)

3596-3606.

fibrosis,

model ofATP-binding

multidrug

resistance

proteins

and bacterial

associated transport.

with cystic Nature

346

(1990) 362-365.

Kyte, J. and Doolittle, thic character

R.F.: A simple method

of a protein.

for displaying

the hydropa-

1224-1232. M. and

correlation cation

D., Blanck,

G., Mellins,

complex class II region controlling the class 1 antigen pathway. Nature 348 (1990) 744-747.

Thorgeirsson,

S.S., Huber,

Gottesman,

B.E., Sorrell.

M.M.: Expression

S., Fojo, A.T., Pastan,

of the multidrug-resistance

and regenerating

preI. and

gene in

rat liver. Science 236 (1987)

1120-l 122. Trowsdale,

J., Hanson,

I., Mockridge,

Kelly, A.: Sequences related

encoded

I., Beck, S., Townsend,

in the class II region

to the ABC superfamily

Ueda, K., Clark, Pastan, cloning

of transporters.

A. and

of the MHC

Nature

D.P., Chen, C., Roninson,

348 (1990)

P.: Mammalian

of exon organization

of an ancestral

multidrug-resistance

with structural

gene. Proc. Natl. Acad.

domains

gene: and dupli-

Sci. USA 86 (1989)

Raymond, M. and Gros, P.: Cell-specific activity of &-acting regulatory elements in the promoter of the mouse multidrug resistance gene mdrl. Mol. Cell. Biol. 10 (1990) 6036-6040. J.R., Rommens,

J.M., Kerem,

B.-S.. Alon, N., Rozmahel,

I.B., Gottesman,

1.: The human multidrug resistance 1 (mdrl) and transcription initiation. J. Biol. Chem.

Ueda, K., Pastan, promotor

I. and Gottesman,

region of the human

Van der Bliek, A.M. and Borst,

M.M. and gene. cDNA 262 (1987a)

M.M.: Isolation multidrug

quence

of mdr3 cDNA

ofthe

(P-glycoprotein)

17432-17436.

P.: Multidrug

resistance.

Adv. Cancer

P.M., Schneider,

encoding

a human

C. and Borst,

P-glycoprotein.

P.: SeGene 71

(1988) 401-411. Yoshimura,

A., Kuwazuru,

Y., Sumizawa,

Uda, T. and Akiyama,

S.: Cytoplasmic

structure of the multidrug transporter, with sequence-specific antibodies. 16282-16291.

R.,

and sequence

resistance

Res. 52 (1989) 165-203. Van der Bliek. A.M., Kooiman,

Gros,

6488-6492.

Riordan,

patibility sentation

S., Arnold,

R.: A gene in the human major histocom-

gene. J. Biol. Chem. 262 (1987b)

340 (1989) 400-404.

Ng, W.F., Zastawamy, R.L., Veinoe-Dreot, L. and Ling, V.: Identification of members of the P-glycoprotein multigene family. Mol. Cell. Biol. 9 (1989)

M., Bahram,

and expresin mammalian

505-508.

J. Mol. Biol. 157 (1982) 105-132.

McGrath, J.P. and Varshavsky, A.: The yeast STE6 gene encodes a homologue of the mammalian multidrug resistance P-glycoprotein.

Raymond,

Spies, T., Bresnahan,

resistance

741-743.

Kozak, M.: An analysis of 5’noncoding sequences from 699 vertebrate messenger RNAs. Nucleic Acids Res. 15 (1987) 8125-8148.

Nature

sion of genes associated with multidrug cells. Science 232 (1986) 751-755.

of

of complemen-

tary DNA. Science 245 (1989) 1066-1073. Scotto, K.W., Biedler, J.L. and Melera, P.W.: Amplification

hepatocarcinogenesis

Hyde, S.C., Emsley, P., Hartshorn, M.J., Mimmack, M.M., Gileadi, U., Pearce, S.R., Gallagher, M.P., Gill, D.R., Hubbard, R.E. and Higgins, CF.: Structural

J., Lok, S., Plavsic, N., Chou, J.-L., Drumm,

M.C., Collins, F.S. and Tsui, L.-C.: Identification

E., Pious, D. and DeMars,

Sci. USA 85 (1988) 3580-3584.

geneity in multidrug-resistant

Z., Zielenski,

M.L., Iannuzzi,

the cystic fibrosis gene: cloning and characterization

Gros, P., Raymond, M., Bell, J. and Housman, D.: Cloning and characterization of a second member of the mouse tndr gene family. Mol.

Natl. Acad.

Grzelczak,

T., Ichikawa, orientation

M., Ikeda,

S.,

and two-domain

P-glycoprotein, demonstrated J. Biol. Chem. 264 (1989)