HMGA proteins: multifaceted players in nuclear function

572KB Sizes 0 Downloads 56 Views

Report

PDF Reader
Full Text

J. Zlatanova and S.H. Leuba (Eds.) Chromatin Structure and Dynamics: State-of-the-Art ß 2004 Elsevier B.V. All rights reserved DOI: 10.1016/S0167-7306(03)39007-6

CHAPTER 7

HMGA proteins: multifaceted players in nuclear function Raymond Reeves1 and Dale Edberg Washington State University, Biochemistry and Biophysics, School of Molecular Biosciences, Pullman, WA 99164-4660, USA 1 Tel.: 509-335-1948; E-mail: [email protected]

1. Introduction In contrast to the well established biological functions of the histone proteins, until recently our understanding of the roles played by the ‘‘high mobility group’’ (HMG) of nonhistone chromatin proteins in nuclear processes has been meager but tantalizing. Fortunately for one group of these proteins, the HMGA family, this situation has now dramatically changed. The reasons for this recent tidal shift in perception are many and reﬂect the realization by many workers that the nucleus consists of more than just DNA, histones and various enzymes. It also contains several classes of nonhistone proteins that participate in multiple functions ranging from serving as structural components of nuclear architecture to participating as ancillary players in such processes as transcription, replication, and DNA repair. The HMG proteins are the most abundant of these chromatin proteins and the HMGA subfamily is perhaps the best understood in terms of the multiple roles these proteins play in the nucleus. Members of the HMGA group of proteins, and the genes that code for them, possess a unique constellation of biochemical, biophysical, and biological attributes that enables them to participate in a diverse variety of activities not normally accessible to more specialized components of the nucleus. Foremost among these distinguishing characteristics are their remarkable degree of intrinsic ﬂexibility and their ability to undergo extensive and complex patterns of in vivo biochemical modiﬁcations in response to external and internal stimuli. Although much is now known about these remarkable proteins, only future research will reveal the full extent, and nature, of involvement of the HMGA proteins in nuclear and cellular functions.

2. Biological functions of HMGA proteins Attesting to the current state of interest, the HMGA family of proteins (formerly known as the HMGI (Y) family [1]), and the genes coding for them, has been the subject of numerous recent reviews [2–13]. The HMGA proteins are coded for by two diﬀerent genes: the HMGA1 gene, whose alternatively spliced mRNA

156 transcripts give rise primarily to the HMGA1a (a.k.a., HMG-I) and HMGA1b (a.k.a., HMG-Y) proteins, and the HMGA2 gene whose primary product is the HMGA2 (a.k.a., HMGI-C) protein. The HMGA proteins participate in, or are targets of, a wide variety of normal and pathological biological events. For example, HMGA proteins are the down-stream targets of a number of external and internal signal transductions pathways that aﬀect both the types and levels of secondary biochemical modiﬁcations on the proteins and, as a consequence, regulate their substrate binding characteristics and their biological functions. Furthermore, by acting as ‘‘architectural transcription factors’’ the HMGA proteins participate in both the positive and negative regulation of a large number of eukaryotic and viral genes and are also thought to participate in such processes as DNA replication, ampliﬁcation, and repair. Evidence further suggests that they are also involved in regulating cell proliferation, diﬀerentiation and apoptotic cell death. The HMGA genes are bona ﬁde oncogenes and their induced over-expression in cells promotes both cancerous transformation and metastatic progression. Elevated levels of HMGA proteins are among the most consistent biochemical features of naturally occurring human tumors with the protein concentrations being a diagnostic marker for increasingly malignant and metastatic cancers. A likely explanation for why over-expression of HMGA proteins is found in so many diﬀerent types of human cancers comes from recent experiments that demonstrate that expression of the HMGA1 gene is under negative transcriptional regulation by certain tumor-suppressing proteins and is also exquisitely sensitive to positive regulation by exposure of cells to numerous oncogenic growth factors as well as tumor promoting chemicals. The ability to participate in such varied biological processes, and to respond to so many diﬀerent external and internal signaling events, has led to the HMGA genes and proteins being referred to as ‘‘hubs’’ of nuclear function [12]. A cardinal position of the HMGA proteins in normal nuclear activity is supported by the fact that homozygous knockouts of the Hmga1 gene in mice results in embryonic lethality [13] and homozygous knockouts of the Hmga2 gene results in the diminutive pygmy (or ‘‘mini-mouse’’) phenotype in mice [14]. The HMGA genes and proteins possess a number of distinguishing features that contribute to their ability to play vital roles in nuclear metabolism. For example, the HMGA1 gene has multiple promoters [15] that are regulated by diﬀerent signal transduction pathways [16–18] and are responsive to a wide variety of external stimuli (reviewed in Ref. [12]). Furthermore, both the HMGA1 [15,19,20] and HMGA2 [21,22] genes produce a number of diﬀerent isoform HMGA proteins as a result of alternative splicing of precursor mRNAs. As free molecules, the HMGA proteins are quite ﬂexible with little intrinsic structure [12,23–26] but undergo disordered-to-ordered transitions upon binding DNA substrates [23,25]. The highly conserved DNA-binding regions of the HMGA proteins, the so-called AT-hook motifs, not only preferentially bind to the minor groove of AT-rich sequences [23], but also interact with non-B-form DNA structures such as four-way-junctions [27] and the distorted forms of DNA located at speciﬁc regions on the surface of

157 isolated nucleosome core particles [28]. The HMGA proteins also possess the ability to speciﬁcally interact with many other protein partners. They have, for example, been demonstrated to physically associate with at least 20 diﬀerent transcription factors via localized peptide regions that speciﬁcally interact with restricted areas of the transcription factors (reviewed in Ref. [12]). This combination of characteristics enables the HMGA proteins to choreograph the transcriptional activation of a number of inducible genes by participating in the formation of stereo-speciﬁc multiprotein–DNA complexes called ‘‘enhanceosomes’’ [29] on their promoter/ enhancer regions. Another distinctive feature of the HMGA proteins is that they are among the most highly modiﬁed proteins in the cell nucleus being subject to complex patterns of in vivo phosphorylation, acetylation, and methylation. These secondary biochemical modiﬁcations are generally reversible and, in some cases, cell cycle dependent. In other cases the modiﬁcations are a consequence of the HMGA proteins being the down-stream targets of signal transduction pathways that are activated by extra-cellular stimuli (reviewed in Refs. [10,12,30,31]). The intricate patterns of in vivo modiﬁcation found on the HMGA proteins have been demonstrated to not only aﬀect their interactions with various molecular substrates but also to inﬂuence their biological functions [32–34]. Analogous to the modiﬁcations found on histone proteins in vivo [35], it has recently been suggested that the patterns of modiﬁcations found on the HMGA proteins function as a biochemical ‘‘code’’ that regulates, or coordinates, the many diﬀerent biological activities of the HMGA proteins in living cells [10].

3. HMGA proteins: flexible players in a structured world Unlike most other proteins, members of the HMGA protein family are characterized by having little, if any, detectable secondary structure while free in solution [12,23–26] with individual proteins exhibiting greater than 75% random coil characteristics when analyzed by either circular dichroism or nuclear magnetic resonance (NMR) spectrometry [10,12]. Nevertheless, when bound to substrates such as DNA or other proteins, subdomains of the HMGA proteins undergo disordered-to-ordered transitions assuming deﬁned conformations [23,25]. This structural transition has been most convincingly demonstrated by NMR studies of co-complexes of the HMGA1 protein with an AT-rich synthetic oligonucleotide substrate [25]. As illustrated in Fig. 1, the highly conserved palindromic core peptide sequence of the DNA-binding domain of the HMGA proteins, Pro–Arg–Gly–Arg–Pro, is disordered prior to substrate binding (panel A) but assumes a planar, crescentshaped conﬁguration (the AT-hook motif [23]; panels B and C) which, when bound to the minor groove of AT-rich sequences (panels D and E), associates with about 5 bp of DNA, or about half a turn of B-form DNA (panel D). The peptide backbone on either side of an AT-hook bound to DNA retains a great deal of plasticity (Fig. 1E). Each HMGA protein has three independent AT-hook motifs

158

Fig. 1. Schematic diagrams based on the solution NMR structure of a complex of the second AT-hook motif of the human HMGA1a protein bound to the minor groove of an AT-rich synthetic duplex DNA fragment [25]. Various projected views of either the AT-hook peptide, or a co-complex of the peptide with DNA are shown (see text for details). Modiﬁed from Ref. [7].

that are separated by highly ﬂexible peptide sequences. This arrangement and ﬂexibility allows the AT-hooks of an individual protein to associate with the minor groove of either long, contiguous stretches of AT-rich sequence (> 15 bp) or bind to three shorter stretches (4–7 bp) of sequence that are separated from each other by variable distances [23,36]. The intrinsic ﬂexibility of HMGA proteins enables the AT-hooks of a single protein to not only bind simultaneously to AT-rich stretches on diﬀerent DNA molecules, thereby forming a peptide bridge between separate DNA substrates [25], but to also bind to quite variable arrangements of AT-rich binding sites on a single DNA molecule. These unique binding capabilities facilitate a variety of biological functions including the regulation of gene transcription initiation. As the diagram in Fig. 2 illustrates, the promoter regions of most genes regulated by HMGA proteins contain variably arranged stretches of AT-rich sequence that have been postulated to represent a sort of ‘‘bar code’’ that is ‘‘read’’ by the binding of the AT-hooks of the HMGA proteins during the process of transcription activation [10]. In the majority of cases, as shown by the proximal promoter regions of the human IL-2, IL-2R, IL-4, iNOS, IFN- , and E-selectin genes, the unique patterns of AT-rich binding sites occur within about 300 bp 50 -upstream of the transcription start site. In some instances, however, as illustrated by the mouse TNF- promoter in Fig. 2, the HMGA binding sites can occur at much greater

159

Fig. 2. The pattern of AT-rich, HMGA protein bindings sites (shown as black boxes; not to scale) in gene promoter regions form a unique ‘‘bar code’’ potentially involved with gene-speciﬁc transcriptional regulation ([10]; see text for details). Gene promoter sequence references: huIL-2, human interleukin-2 [38]; huIL-2R, human IL-2 receptor alpha subunit [60]; huIFN- , human interferon beta [120]; huE-Selectin, human E-selectin [120]; huIL-4, human IL-4 ([121]; GenBank Accession No. M23442); hu iNOS, human inducible nitric oxide synthase ([122]; GenBank Accession No. AF045478); muTNF- , murine tumor necrosis factor beta (a.k.a., lymphotoxin) [123]. Abbreviations: ARRE-1, -2, antigen regulated response elements-1 and -2; NFIL-2, -2B, nuclear factor interleukin-2 and -2B; CD28RE, CD28 response element; PRRII, positive regulatory region-II; PRDI-IV, positive regulatory domains I–IV.

distances 50 upstream of the start site or can even be located 30 downstream of the start site within intronic sequences [37]. Two additional features also contribute further combinatorial complexity to the postulated gene-speciﬁc ‘‘bar code’’ recognized by HMGA proteins. First, as illustrated by the arrows located beneath the human IL-2R promoter in Fig. 2, the HMGA proteins have been demonstrated to bind to AT-rich DNA sequences in an orientation-, or direction-speciﬁc manner [25,60]. And, second, the minor groove binding of the HMG proteins on gene promoters usually over-lap, or are near quite to, the major groove binding sites for transcription factors that interact physically with HMGA proteins during induction of gene transcription [10,12]. Together, the pattern and directionality of substrate binding, combined with speciﬁc interactions of the HMGA proteins with DNA, chromatin and other protein substrates, constitutes a collection of ‘‘determinants’’ that potentially allows these proteins to uniquely recognize and regulate individual gene promoter/enhancer regions among the immense number of other AT-rich binding sites present in the eukaryotic genome [60].

160 Even though the AT-hook peptide has no detectable secondary structure prior to substrate binding, there are inherent features of its conserved palindromic amino acid sequence that allow it to undergo a distinctive type of disordered-toordered transition. NMR studies have demonstrated that, as originally predicted [23], the proline residues of all three of the AT-hook peptides exist in a transconﬁguration while the protein is free in solution [24,25]. The trans-conﬁguration of the prolines restricts the ﬂexibility of the peptide back bone on either side of a freely mobile central glycine residue (Fig. 1, panels A–C) and, thereby, predisposes the AT-hook peptide to adopt dynamic, turn-like conﬁgurations in solution [24]. When these ﬂexible, but somewhat restrained, peptides turns encounter AT-rich stretches they are apparently ‘‘trapped’’ after assuming an energetically favorable planar, convex conﬁguration that makes optimal molecular contacts with both the sides and bottom of the narrow minor groove and stabilizing ionic contacts with the phosphodiester backbones of the DNA. Major contributors to the speciﬁcity of AT-DNA binding are the side chains of the arginine residues which orient parallel to the minor groove and extend toward the central axis of the DNA (Fig. 1C), thereby allowing their guanidino groups to make hydrogen bond contacts with the O2 position of thymines (Fig. 1D). Owing to the hydrophobic interactions between the inward projecting arginine side chains and the adenine bases, the AT-hook binds in only one direction in the minor groove. The critical importance of the prolines in the conserved AT-hook motif in facilitating both the initial DNA contacts and the subsequent conformational changes in the peptide backbone is attested to by the fact that if these residues are either changed to other amino acids, or if their position in the peptide is altered, the resulting mutant peptides will no longer preferentially bind to AT-rich DNA sequences [23]. Proteins containing such mutations act as in vivo dominant negative competitors for HMGA function when introduced into mammalian cells [38]. Their extreme degree of intrinsic ﬂexibility, combined with their ability to undergo substrate-induced conformational changes, sets the HMGA proteins apart from most other highly structured nuclear proteins and plays a critical role in enabling them to participate in a wide variety of biological processes. However, the importance of intrinsically disordered regions in proteins, and transitions from disordered-to-ordered structures, is now becoming widely recognized as a signiﬁcant and general feature of many diﬀerent biological systems [10,39–41]. Labile transitions between disordered and ordered conﬁgurations of the HMGA proteins, most likely mediated by reversible secondary biochemical modiﬁcations (see below), are likely to regulate the formation of functional HMGA complexes in cells and, thereby, control the biological activity of these proteins in vivo.

4. HMGA biochemical modifications: a labile regulatory code Over the last several years a substantial body of evidence has accumulated indicating the types and patterns of secondary biochemical modiﬁcations present on histones [42,43], transcriptional co-activators [44] and the HMGA proteins [33]

161 modulate their binding to DNA, to other proteins and to protein–DNA complexes. These modiﬁcations are often reversible and are employed by cells to precisely regulate the biological activity of proteins. The HMGA proteins are among the most highly modiﬁed proteins in the mammalian nucleus exhibiting complex patterns of phosphorylations, acetylations, methylations and possibly other covalent adductions [33,45]. These secondary biochemical modiﬁcations are both cell cycle-dependent and responsive to various environmental stimuli that activate speciﬁc signal transductions pathways (reviewed in Refs. [10,12]). Many years ago it was demonstrated that the HMGA proteins undergo cell cycle-dependent phosphorylations as a result of cdc 2 kinase activity in the G2/M phase of the cycle and that such modiﬁcations markedly reduce the aﬃnity of binding of the proteins to AT-rich DNA in vitro [32]. More recent studies have shown that HMGA proteins are also the downstream targets of a number of signal transduction pathways whose activation results in phosphorylation of speciﬁc amino acid residues distributed throughout the length of the proteins. In mammalian cells, the in vivo signaling pathways that activate casein kinase 2 (CK-2; [46–49]) and protein kinase C (PKC; [33]) result in phosphorylation of HMGA proteins within 15–30 min of their stimulation. The HMGA1 homolog protein of the insect Chironomous has also been shown to be phosphoryated in vivo by stimulation of the mitogen-activated protein (MAP) kinase signaling pathway [50]. Interestingly, agents that activate signaling pathways leading to programmed cell death (apoptosis) also aﬀect the phosphorylation state of HMGA proteins. Sgarra et al. [51] demonstrated, for example, that treatment of cells with drugs (etoposide, camptothecin) or viruses (herpes simplex virus type-I) that induce apoptosis also induce hyper-phosphorylation and mono-methylation of the HMGA1a protein soon after exposure to these agents followed a few hours later by a marked de-phosphorylation of the proteins. Since these hyper- and de-phosphorylation events occurred on the majority of the HMGA1a proteins in the cell, the authors propose that the modiﬁcations are causally connected to the global changes in chromatin structure that occur during the early and later phases of apoptotic cell death. Recent advances in mass spectrometry (MS) technology have provided researchers with an unparalleled ability to identify the types and patterns of secondary biochemical modiﬁcations found on proteins in living cells. Matrixassisted laser desorption/ionization-MS (MALDI-MS) analyses have shown, for example, that HMGA proteins in vivo are simultaneously subject to complex patterns of phosphorylation, acetylation and methylation and that, within the same cell type, diﬀerent isoforms of these proteins can exhibit quite diﬀerent modiﬁcation patterns [33]. Furthermore, these in vivo modiﬁcations have been demonstrated to markedly alter the binding aﬃnity of HMGA proteins for both DNA and chromatin substrates in vitro [33]. Nevertheless, due to their number and complexity, it has been diﬃcult to determine the actual biological function(s) played by these biochemical modiﬁcations in living cells. The use of MALDI-MS analysis alone to study in vivo protein modiﬁcations has several limitations, especially when it comes to identifying the speciﬁc amino acid

162 residues in the HMGA proteins that are modiﬁed. To overcome these shortcomings, we employed a strategy in which MALDI-MS is combined with tandem mass spectrometry (MS/MS) analysis to speciﬁcally identify both the types and sites of modiﬁcations found on HMGA proteins in vivo [52]. This experimental approach is outlined in Fig. 3. The HMGA and other acid-soluble proteins are ﬁrst isolated from cells and puriﬁed to >90% homogeneity by reverse-phase high performance liquid chromatography (RP-HPLC) employing standard techniques [53]. Enzymatic digests of the RP-HPLC puriﬁed proteins are then assessed by

Fig. 3. Strategy for analyzing the patterns of native secondary biochemical modiﬁcations found on HMGA proteins in living cells using mass spectrometry techniques. The upper left side of the ﬁgure shows steps of a standard protocol for determining both the amino acid sequence and sites of biochemical modiﬁcation of native HMGA proteins isolated from cells. The upper right side of the ﬁgure shows the proﬁle of a reverse-phase HPLC chromatogram of acid soluble proteins isolated from living cells, the initial fractionation step for isolating in vivo modiﬁed HMGA proteins. The lower left side of the ﬁgure shows an example of a restricted region of a MALDI/MS spectrum of a HMGA1 peptide digest containing the same peptide fragment with varying degrees of in vivo secondary biochemical modiﬁcations. Peaks: a, unmodiﬁed peptide; 1, the di-methylated peptide; 2, the tri-phosphorylated peptide; and, 3, the tetra-phosphorylated plus di-methylated peptide. The table on the lower right hand side shows the sequence and types of modiﬁcations present on the peptides shown in the chromatograph. See text for details.

163 MALDI-MS to determine the types and extent of modiﬁcations found on diﬀerent peptides fragments. These digests are also analyzed by ion trap MS/MS to directly obtain the sequence, types and sites of speciﬁc amino acid modiﬁcations present on individual peptides. These MS analytical techniques are very rapid, extremely accurate and require only small amounts of protein to obtain peptide sequences and amino acid modiﬁcation information [54]. As an example, the restricted region of a MALDI/MS chromatograph illustrated in the lower left side of Fig. 3 shows peaks corresponding to the same peptide fragment with either no modiﬁcations (labeled ‘‘a’’) or containing the various types of secondary modiﬁcations (peaks 1–3) listed in the table on the lower right side of the ﬁgure. Figure 4 shows some of the sites and types of in vivo modiﬁcations found by MALDI/MS on the HMGA1a protein isolated from human MCF7 mammary epithelial cells [33]. The sequence of the HMGA1a protein is shown in the center of the ﬁgure and shaded boxes indicate the three AT-hook motifs (I, II, and III) in the HMGA1a protein and the clear box indicates the 11 internal amino acid residues that are deleted from the HMGA1b protein as a consequence of alternative mRNA splicing [15]. The types of secondary modiﬁcations found on the various amino acid residues are shown above the diagram and, where known, the enzymes thought to be responsible for these modiﬁcations (e.g., cdc-2, PKC, CK-2, etc.) are indicated above the sequence [10,33]. The lines below the sequence indicate the regions of the HMGA1 proteins that have been demonstrated to interact physically with other transcription factors [12]. It is important to note that in vivo the most highly modiﬁed part of the HMGA1 proteins is located between the second (II) and third (III) AT-hooks and corresponds to the region of the protein that has the most identiﬁed interacting protein partners [12]. The concurrence of numerous in vivo sites of reversible biochemical modiﬁcations with those of direct physical association with other proteins suggests that this region of the HMGA1 proteins is important for regulation of their biological function(s) in cells. Support for the suggestion that biochemical modiﬁcations regulate the biological function of HMGA1 proteins in cells comes from experiments that demonstrate isolated native HMGA proteins exhibit markedly diﬀerent aﬃnities and speciﬁcities, compared to unmodiﬁed recombinant proteins, for binding to various DNA and nucleosomal substrates in vitro [33]. This point is illustrated by the results of the in vitro electrophoretic mobility shift assays (EMSAs) shown in Fig. 5. Panel A shows the proﬁle of bands obtained when increasing concentrations of unmodiﬁed recombinant HMGA1a protein are bound to a radio-labeled DNA substrate containing multiple (>7) AT-rich binding sites for the protein whereas panel B illustrates the results obtained when identical concentrations of in vivo modiﬁed protein are added to the DNA probe. It is obvious from these results that the modiﬁed HMGA1a proteins bind to the DNA probe with considerably less aﬃnity than does the unmodiﬁed recombinant protein. Likewise, as shown in panels C and D, a similar marked reduction is observed in the aﬃnity of binding of in vivo modiﬁed HMGA1b proteins, compared to unmodiﬁed proteins, to isolated nucleosome core particles. Additional evidence that the biochemical modiﬁcations

164

Fig. 4. Diagram of the human HMGA1 protein showing, previously identiﬁed sites of in vivo biochemical modiﬁcations [10], sites of modiﬁcation conﬁrmed by mass spectrometry and the regions of the proteins identiﬁed as minimal areas required for speciﬁc interactions with other proteins. The upper line illustrates the full-length HMGA1a protein and the rectangular boxes (shaded) indicate the positions of the three AT-hook DNA-binding domains (I, II, III). The elliptical box (clear) indicates the position where an 11 amino acid residue deletion occurs that gives rise to the HMGA1b isoform protein as a result of alternative mRNA splicing. The amino acid sequence and numbering of the HMGA1a protein are shown in the middle of the diagram. The sites of in vivo modiﬁcations of the HMGA1 protein that have been conﬁrmed by MALDI-MS and MS/MS are depicted by black boxes with white lettering or white lettering within the three AT-hooks. Serine and threonine residues are the sites of labile phosphorylation whereas variable acetylation occurs exclusively on lysine residues. The existence of methyl groups on HMGA1 proteins has previously been reported in the literature; however, the speciﬁc sites of such modiﬁcations have not yet been identiﬁed. Enzymes that are known to modify HMGA1 are indicated between the sequence and the diagram of the protein. The lines below the amino acid sequence show the areas of the protein that have been identiﬁed as the minimal required for speciﬁc protein–protein interactions with other transcription factors. The amino acid residues involved in these protein–protein interactions are indicated by numbers following the colons. The original sources demonstrating these physical protein interactions are: NF-B p50/p65 heterodimer, ATF-2/c-Jun heterodimer and IRF-1, Yie et al. [113]; NF-Y, Currie [114]; SRF (serum response factor), Chin et al. [115]; NF-B p50 homodimer, Zhang and Verdine [116]; Tst-1/Oct-6, Leger et al. [117]; HIPK2, Pierantoni et al. [118]. Figure modiﬁed and updated from Ref. [12].

165

166

Fig. 5. Secondary in vivo biochemical modiﬁcations of HMGA proteins reduce the binding aﬃnity of HMGA proteins for both free AT-rich DNA substrates (shown on left side of ﬁgure) and random sequence nucleosome core particles (right hand side of ﬁgure). Electrophoretic mobility shift assays (EMSAs) using radio-labeled free DNA or isolated nucleosome substrates were reacted with either unmodiﬁed recombinant human HMGA1 proteins (upper half of ﬁgure) or with native HMGA proteins isolated from cells containing complex patterns of secondary biochemical modiﬁcations (lower half of ﬁgure). See text for further details.

found on HMGA proteins may serve speciﬁc regulatory functions in cells comes from experiments by Munshi et al. [34] who investigated the role of acetylation of speciﬁc amino acid residues on the inducible regulation of the human interferon- (IFN- ) gene. These workers demonstrated that acetylation of the HMGA1a protein at residue Lys71 by the P/CAF acetyltransferase facilitates transcriptional activation of the IFN- by promoting formation of an enhanceosome on the gene’s promoter region in cells infected with viruses. In contrast, acetylation of the nearby residue Lys65 by a diﬀerent acetyltransferase enzyme, CBP, was shown to turn oﬀ transcription of the IFN- gene by promoting destabilization and disassembly of a previously assembled enhanceosome. The cumulative data, therefore, support not only a role for secondary modiﬁcations in regulating the biological function of HMGA proteins but also suggest that the complex patterns of such modiﬁcations provide the cell with mechanisms for exerting exquisitely ﬁne control over their in vivo activities.

5. HMGA proteins, AT-hooks and chromatin remodeling As well as being accessory regulators of gene transcription, HMGA proteins are also integral components of chromatin and are thought to be involved with controlling the mechanics of chromosome structure, function, and dynamics (reviewed in Refs. 2,10,30). In contrast to these global inﬂuences on chromosome architecture, a second, much more restricted, eﬀect of HMGA proteins on altering

167

Fig. 6. Puriﬁed recombinant HMGA proteins bind to four regions of DNA on random sequence nucleosome core particles. Panel A: The results of EMSA gel assays in which increasing concentrations of either puriﬁed nonhistone HMGN2 (a.k.a., HMG-17, which binds to two sites on nucleosome core particles) or recombinant human HMGA1a protein were bound nucleosome core particles isolated from chicken erythrocytes [57]. Panel B: Two diﬀerent views of the nucleosome taken from the X-ray structure of Luger et al. [119] showing the sites of binding of HMGA proteins (dashed circles) determined by DNA foot-printing analyses and other techniques (see text for details).

localized nucleosome structure and function has also been proposed. Indeed, one of the ﬁrst biological activities suggested for the HMGA proteins (originally referred to as proteins) was to induce positioning of nucleosomes on the AT-rich -satellite DNA sequences of chromosomes in monkey cells [55,56]. It was later discovered, however, that the highly repetitive -satellite DNA sequences are capable of positioning nucleosomes in vitro independent of HMGA proteins. Nevertheless, as shown in Fig. 6, HMGA proteins are among only a few known transcription factors that can bind directly to DNA on the surface of nucleosome core particles [57]. Panel A (see also Fig. 5C) shows the results of EMSA analyses that indicate HMGA proteins form four discrete complexes when bound to random sequence core particles isolated from chicken erythrocytes [57], whereas another well characterized nuclear protein, HMGN2 (a.k.a., HMG-17), forms only two complexes [58]. As illustrated by the schematic diagram in panel B, DNA footprinting and other analyses have demonstrated that these four sites are located at the entrance and exits of DNA from the nucleosome and at the junctions of the over- and under-wound regions of DNA ﬂanking either side of the dyad axis of the core particle [57,59]. In addition to these four sites, HMGA proteins are also able to bind to AT-rich stretches located on the surface of nucleosomes that have been reconstituted in vitro using core histones and cloned fragments of DNA of deﬁned nucleotide sequence [59,60]. Protein domain-swap experiments have, furthermore, demonstrated that it is the AT-hook regions of the HMGA proteins that are responsible for nucleosome core particle binding [28]. Importantly, HMGA binding to either random sequence or deﬁned sequence core particles has been shown to induce localized changes in the rotational setting of DNA on

168 the surface of nucleosome [59], thus mediating a restricted form of chromatin remodeling. The HMGA proteins have been proposed to participate in the localized chromatin remodeling events that accompany transcriptional activation of the promoters of certain inducible gene such as those coding for the human cytokine interleukin-2 (IL-2) [38,61] and the regulatory subunit of its receptor, IL-2R [60,62]. For example, it has been demonstrated that a nucleosome is positioned over the important PRRII regulatory sequence in the promoter of the human IL-2R gene (see Fig. 2) in unstimulated lymphoid cells and that this core particle undergoes a ‘‘remodeling’’ process during transcriptional activation of the gene in stimulated cells [60]. Importantly, additional experiments demonstrated that it is possible to reconstituted a positioned nucleosome at this same position over the PRRII element on an isolated fragment of the IL-2R promoter DNA in vitro and, most remarkably, that the HMGA1 protein binds to this reconstituted core particle with a direction-speciﬁc polarity. This directional binding of the HMGA1 protein has been proposed to impart a stereo-speciﬁcity to the positioned nucleosome and thus ‘‘tag’’ or uniquely identify it for subsequent disruption by ATPdependent chromatin remodeling complexes during the process of transcriptional activation of the IL-2R promoter [60]. The ability of AT-hook peptide motifs to bind to, and induce localized changes in the structure of, nucleosome core particles is not restricted to HMGA proteins. Signiﬁcantly, it has also recently been discovered that other proteins that contain AT-hook motifs are essential components of the multi-protein, ATP-dependent chromatin remodeling complexes or ‘‘machines’’ (CRMs) found in yeast, Drosophila and mammalian cells. For example, the Swi2p/Snf2p protein, which is the ATPase component of the SWI chromatin remodeling complex in yeast, contains two AT-hook motifs [63] and the ISWI ATPase component of the Drosophila chromatin remodeling complex NURF (nucleosome remodeling factor) contains a single AT-hook peptide [64]. Likewise, studies have demonstrated that the AT-hooks present in chromatin remodeling proteins are critically important for the biological activity of CRM complexes. For instance, the Rsc1 and Rsc2 subunits of the RSC (remodeling the structure of chromatin) complex in the yeast S. cerevisiae each contain a single AT-hook motif that, when mutated or deleted, destroys the chromatin remodeling activity of the RSC complex and results in cell lethality [65]. Similarly, mammalian SWI/SNF-like CRM complexes contain one or the other of two essential and closely related ATPases, known as brm/SNF2 (also called BAF) and BRG-1/SNF2 (also called PBAF) [66,67], each of which contains a single AT-hook motif in their C-terminal region. Yaniv and his colleagues [68] have demonstrated that when the AT-hook is deleted from brm/SNF, the CRM complex looses its in vivo functional chromatin remodeling activity and also can no longer bind to chromatin substrates. And, ﬁnally, Xiao et al. [69] discovered that the N-terminal end of the largest subunit of the Drosophila NURF complex, NURF301, contains two AT-hook peptide motifs and an acidic region that have high sequence similarity to the mammalian HMGA proteins. Intriguingly, the amino acid sequence of the N-terminal end of NURF301 more

169 closely resembles that of HMGA proteins than do the AT-hook containing domains of Rsc1, Rsc2, brm/SNF or any of the other CRM proteins. These workers also demonstrated that the only subunits of the NURF complex required for the induction of nucleosome sliding (i.e., remodeling) in an in vitro model system are NURF301 and the ISWI ATPase protein that also contains a single AT-hook peptide motif. Quite importantly, Xiao et al. [69] went on to show that the N-terminal end of the NURF301 is the region of the protein responsible for binding to nucleosome core particles in vitro and that when the two AT-hooks of this region are deleted, the ability of the truncated protein to both bind (‘‘tether’’) to core particles and induce sliding is inhibited [69]. This cumulative in vivo and in vitro evidence thus strongly supports an active role for the AT-hook motifs found in various CRMs in both nucleosome binding and ATP-dependent sliding/remodeling. Several mechanistic explanations have been advanced for explaining how AT-hook peptides in CRMs might be involved with nucleosome remodeling processes including the attractive suggestion that by selectively binding to distorted regions of DNA on core particles they induce, via ATP-driven reactions, dynamic localized rotational changes in DNA structure that are propagated in a screw-like manner to induce nucleosome translational sliding [12,69]. When considered in the context of activation of the human IL-2R gene discussed above, this information has also led to a proposal that the directional binding of HMGA1 proteins (via their AT-hook motifs) to the nucleosome positioned on the PRRII promoter region in unstimulated T-cells likely acts as a ‘‘marker’’ or ‘‘placeholder’’ for binding by the AT-hooks of CRM proteins during the subsequent ATP-dependent disruption of the core particle that occurs during transcriptional activation of the gene in vivo [10,60]. Considerable experimental support for this model has recently been obtained employing chromatin immunoprecipitation (ChIP) assays that demonstrated that in vivo the HMGA protein is bound to the positioned nucleosome on the IL-2R promoter in resting T lymphocytes and that, within 30 min of following cell stimulation, the HMGA1 protein dissociates from the nucleosome. In contrast, parallel ChIP assays showed that BRG-1, a subunit of the human SWI/SNF complex, is not bound to the positioned nucleosome in unstimulated lymphocytes but becomes associated with the core particle immediately after cell stimulation (within 5–15 min) at about the same moment that the HMGA1 protein is dissociating from the nucleosome, a time during which chromatin remodeling and transcriptional activation of the IL-2R gene is occurring [70]. Further experimental support for a functional cooperation, or active interplay, between the HMGA proteins and the AT-hook-containing CRM proteins during the process of chromatin remodeling comes from the work of Lomvardas and Thanos [71]. These workers demonstrated that an in vitro system composed of only puriﬁed recombinant HMGA1, SWI/SNF and TBP (TATA-binding protein) proteins is capable of eﬃciently inducing ATPdependent nucleosome sliding/remodeling. It is likely that such cooperative interactions between diﬀerent AT-hook containing proteins are common and that many more examples of HMGA proteins being intimately, and actively,

170 involved in chromatin remodeling processes mediated by ATP-requiring CRM activities will be found.

6. HMGA proteins as potential drug targets Given their central role in such a variety of normal and pathological processes [12], the HMGA genes and proteins are attractive potential targets for the development of therapeutic drugs. The experimental strategies for development of such drugs fall into several categories. Some of the more promising approaches are to develop: (i) drugs that lower the eﬀective concentration of HMGA proteins in cells; (ii) drugs that non-speciﬁcally compete with the AT-hooks of the proteins for binding to substrates; (iii) drugs that block speciﬁc binding of HMGA proteins to gene promoter regions; (iv) drugs that either speciﬁcally inactivate HMGA proteins or selective cross-link them to DNA in vivo. Many of these strategies have, with diﬀering degrees of success, already been investigated while others remain to be explored. 6.1. Methods to lower the cellular concentrations of HMGA proteins There are several reports demonstrating that lowering the endogenous levels of HMGA proteins in cells by the introduction of either anti-sense or dominantnegative expression vector constructs results in the reversal or amelioration of certain pathologic conditions. These include the demonstration that anti-sense eukaryotic cell expression vectors can: (i) inhibit neoplastic transformation of normal rat thyroid cells infected with retroviruses [72]; (ii) suppress the growth rate of cancerous cells and decrease their ability for anchorage-independent growth [73]; and (iii) preferentially induce apoptotic cell death in anaplastic human thyroid carcinoma cells but not in normal thyroid cells [74]. On the other hand, there are also reports demonstrating that reduction of cellular levels of HMGA proteins interferes with normal cellular processes such as inhibition of the inducible expression of the unique-sequence genes coding for the human interferon- [75] and interleukin-2 [76,77] proteins. These examples give some encouragement to the notion that modulation of endogenous HMGA protein levels might be a fruitful target for drug development. However, the anti-sense and dominant negative studies reported so far have involved delivery of expression vectors to cells, either as transfected DNAs or via viral infections, processes that are inherently ineﬃcient and therefore of limited general use. Alternative approaches of employing synthetic oligonucleotide-based strategies [80,81], such as the use of stable, anti-sense synthetic oligonucleotides [78] or anti-sense peptide-nucleic acids [79], in transfection experiments to inhibit HMGA protein expression seem feasible in principal but have not yet reported in the literature. Nevertheless, these procedures also suﬀer from similar limitations to those outlined for eukaryotic cell expression vectors. In order to overcome these shortcomings, there are a variety of other promising strategies (reviewed in

171 Refs. [82,83]) that can potentially be used to develop drugs that modulate the levels or functions of HMGA proteins in cells. 6.2. Drugs that non-specifically compete with AT-hooks peptides for DNA-binding As illustrated in Fig. 1, the highly conserved Pro–Arg–Gly–Arg–Pro peptide core region of the AT-hook DNA-binding domains assumes, following DNA-binding, a planar, crescent-shaped structure similar in conformation to the pyrrole antibiotics netropsin and distamycin A and to the ﬂuorescent dye Hoechst 33258, all of which also reversibly bind to the minor groove of AT-rich sequences with high selectivity. In fact, the structural similarity between the AT-hook DNA-binding peptide motif and Hoechst 33258 is the basis for an extremely sensitive ﬂuorescence competition assay employed to quantitative determine the aﬃnity of binding of HMGA proteins to AT-rich DNA substrates in vitro [23]. Distamycin A, and its chemical derivatives, are highly cytotoxic and exhibit antiviral [84] and anti-cancer [85–87] activities. Distamycin A, and other minor groove binding drugs appear to exert their biological eﬀects by interfering with cellular gene expression patterns through the alteration or disruption of DNA-binding by transcription factors [86] and, as a consequence, inhibiting the initiation step of transcription [87]. Interestingly, early in vitro protein–drug–DNA-binding studies by Wegner and his colleagues [88] led to the proposal that the cytotoxic eﬀects of drugs that selectively bind to the minor groove of AT-rich stretches of DNA (such as distamycin A, netropsin and berenil) are likely to be a consequence of competitive displacement of HMGA proteins from their in vivo DNA-binding sites. In vivo support for displacement of HMGA proteins by such minor groove binding drugs also comes from studies by Radic et al. [89] who demonstrated that both Hoechst 33258 and distamycin A directly compete with HMGA proteins for binding to AT-rich satellite DNA sequences in mouse cells causing chromosome decondensation, particularly in heterochromatic regions. Similar cytological eﬀects have recently been observed in human cells treated with these drugs [90]. The observed in vivo eﬀects on chromosome structure of drugs that selectively bind to the minor groove of AT-rich sequences agree quite well with the predictions of models of global gene activation originally advanced by Laemmli and his colleagues [91,92]. These workers proposed that, during the early stages of cellular diﬀerentiation when developmental changes in chromatin domains are occurring, HMGA proteins out-compete inhibitory proteins (such as histone H1) for binding to AT-rich DNA sequences, called ‘‘scaﬀold attachment sites’’ (or SARs), that are distributed along the backbone of metaphase chromosomes and, as a consequence, ‘‘open up’’ selected regions of chromatin for active gene transcription [91,92]. Similarly, the results of minor groove-binding drug studies are consistent with the observation that artiﬁcially created ‘‘multiple AT-hook’’ (MATH) proteins that contain numerous AT-hook motifs separated by ﬂexible peptide linkers have the ability to both condense chromatin and inhibit chromosome assembly when added to in vitro extracts of oocytes of the amphibian Xenopus laevis [93].

172 MATH proteins have also been shown to regulate the in vivo transcription of endogenous host cell genes in diﬀerentiated adult tissues when transgenes coding for these proteins were introduced into the laraval stages of the insect Drosophila [94]. A major problem with using drugs such as netropsin, berenil or derivatives of distamycin as potential therapeutic agents, however, is their generalized binding to the minor groove of most AT-rich sequences. These promiscuous interactions result in non-speciﬁc toxicity of the drugs for all types of cells thus greatly limiting their use as selective anti-viral or anti-tumor agents. 6.3. Drugs that block binding of HMGA proteins to specific gene promoters One alternative that is beginning to be explored to overcome such generalized drug toxicity problems is to create membrane-permeable synthetic molecules that target only speciﬁc gene promoters that are naturally regulated by the HMGA proteins. Based on the known DNA-binding characteristics of the HMGA proteins and assuming, as previously discussed, that the promoter region of each HMGAregulated gene has a unique ‘‘bar code’’ of AT-rich binding sites (Fig. 2), it should be possible to create synthetic drugs that will selectively recognize portions of this ‘‘bar code’’. Such promoter-speciﬁc drugs would be expected to competitively inhibiting the binding of endogenous HMGA proteins to their natural target sites in this promoter and, thereby, reduce or eliminate the expression of only this particular gene in living cells. A number of experimental advances support the feasibility of this ‘‘designer’’ drug-based approach to inhibiting expression of speciﬁc HMGA-regulated genes. Principal among these has been the successful design and synthesis of small, minor groove binding molecules, called lexitropsins, that contain polymers of N-methylimidazole (Im) and N-methylpyrrole (Py) amino acids and which bind with high aﬃnity to predetermined DNA sequences (reviewed in Refs. [83,95–97]). Importantly, the structural basis and pairing rules have been developed to design rationally pyrrole–imidazole (Py–Im) polyamide dimers [98] that bind speciﬁcally to the minor groove of either AT- [99] or GC-sequences [100] with aﬃnities and speciﬁcities comparable to native DNA-binding proteins. Such Py–Im polyamide molecules have been demonstrated to speciﬁcally inhibit the transcription of viral and genomic genes in living cells [101] by selectively binding to only certain promoter sequences and, thereby, interfering with both the binding of TATA-box binding protein (TBP) and basal transcription by RNA polymerase II [102]. The synthesis of a Py–Im polyamide molecule that speciﬁcally recognizes a short stretch of AT-rich sequence that constitutes part of the distinctive ‘‘bar code’’ of an individual gene promoter is certainly possible. However, targeting of a lexitropsin molecule to only a single site in a complex promoter is unlikely to provide the necessary degree sequence recognition speciﬁcity required for gene speciﬁc regulation in eukaryotic cells. Thus, at a minimum, the challenge for speciﬁc gene regulation will be to create chimeric molecules with Py–Im moieties that bind to

173 speciﬁc (but diﬀerent) stretches of AT-DNA and which are separated from each other by a peptide linker of such length that the dimeric drug binds to only the appropriately spaced and directionally oriented AT-stretches present in a given target promoter. Encouragingly, experiments attesting to the feasibility of using such bipartite drugs as reagents for modulating gene expression in living organisms have recently been performed in the insect Drosophila. In an elegant series of experiments, Janssen et al. [103] synthesized a series of dimeric oligopyrrole drugs containing internal ﬂexible peptide linkers of varying length that exhibited high binding aﬃnity for large and bipartite AT-rich DNA tracks with the various drugs in the series speciﬁcally binding to diﬀerent AT-rich satellite DNA sequences in Drosophila. When these drugs were fed to larvae they signiﬁcantly modulated normal developmental gene expression patterns and caused both gain- and loss-of-function of phenotypes in adult ﬂies. For example, one of the polyamide drugs suppressed position-eﬀect variegation (PEV) of the white-mottled locus (a consequence of increased gene expression) whereas another drug mediated homeotic transformations (caused by loss of gene function) exclusively in the brown-dominant locus [103]. These are remarkable biological results and if analogous phenotypic eﬀects can be obtained by the targeting of bipartite Py–Im polymide drugs to the promoters of speciﬁc HMGA-regulated genes in mammalian cells, the path could be opened for new types of therapeutic treatments for a wide range of pathological conditions ranging from viral infections and immune disorders to the formation of atherosclerotic plaques. Of course, the success of such a drug development strategy is dependent on identifying the promoters of those genes that are the direct targets of HMGA protein regulation in cells exhibiting various pathological conditions, a task that has already been initiated [73,104].

6.4. Drugs that specifically inactivate or cross-link HMGA proteins in vivo Drugs that recognize, and selectively interact with structural or other features of the HMGA protein themselves could potentially be used as therapeutic reagents to eliminate, for example, aberrant tumor cells that are constitutively over-express high levels of HMGA proteins. The power of combinatorial chemistry (reviewed in Ref. [105]) could, in principal, be employed to select for synthetic drugs that speciﬁcally interact with the unbound, unstructured forms of the HMGA proteins present in cells and prevent them from performing their endogenous biological functions. Likewise, cell permeable reagents that selectively react with structural features of the AT-hook peptide motif when it is directly bound to the minor groove of DNA (Fig. 1) might also prove to be an eﬃcacious way of inactivating the HMGA proteins in vivo. In this connection, the drug FR900482 (4-formyl6,9-dohydroxy-14-oxa-1,11-diazatetracyclo [7.4.1.02,7,010,12]-tetradeca-2,4,5-triene-8-yl methyl carbamate [106]), and its chemical derivative FK317 (11-acetyl-8-carbamoyloxymethyl-4-formyl-6-methoxy-14-oxa-1,11-diazatetracyclo [7.4.1.02,7,010,2]tetradeca-2,4,6-trien-9-yl acetate [107]) are two examples of new anti-cancer drugs

174

Fig. 7. Diagram of the structure of the FR900482 and FK317 pro-drugs and the proposed scheme of their reductive activation via a miosene-like intermediate inside living cells (see the text for further details). Figure modiﬁed from Ref. [108].

that have recently been demonstrated to cross-link HMGA proteins to the minor groove of DNA in living cells [108,109]. As illustrated in Fig. 7, both FR900482 and FK317 are reductively activated inside cells through a scheme that involves the thiol-mediated two-electron reduction of the N–O bond in the presence of trace Fe(II) generating a transient ketone which rapidly cyclizes to a carbinolamine derivative, followed by expulsion of water to produce the reactive electrophilic mitosene derivative [110,111]. This requirement for reductive activation is thought to be the mechanism responsible for the preferential targeting of these pro-drugs to tumorgenic cells that, in general, exhibit a more anaerobic metabolism than normal cells. The presence, or lack, of reductive activation leads to diﬀerent active compounds derived from the FR900482 and FK317 drugs in normal and cancerous cells. In tumorgenic cells both drugs are deacetylated, oxidized and then reduced to create a 4-alcohol derivative that has cytotoxic properties. In contrast, in normal cells FK90082 and FK317 are deacetylated and oxidized to form a 4-carboxylic acid, which is not cytotoxic [112]. While both the FR900482 and FK317 drugs, which diﬀer by only a single methyl group in cells after they have been reductively activated, are highly toxic to tumor cells, the two drugs have other quite diﬀerent secondary biological eﬀects. One of the major diﬀerences between the drugs is that FR900482 induces a pathological condition in treated patients known as vascular leak syndrome (VLS) where as the K317 drug does not. As a consequence, FR900482 has recently been withdrawn from clinical studies whereas FK317 is currently in phase II clinical trials for cancer treatment in Japan. One possible explanation for the diﬀerence in the ability of the two drugs to induce VLS could be diﬀerences in their abilities to up-regulate expression of the IL-2 and IL-2R genes in lymphoid cells [109].

175 Although FR900482 and FK317 are not speciﬁc cross-linkers of HMGA proteins to DNA in living cells (i.e., they also cross-link other minor groove binding proteins in vivo), they are the ﬁrst drugs demonstrated to speciﬁcally cross-link HMGA proteins to the minor groove of DNA in vivo and therefore present a major technical advance for studies aimed at examining the cellular dynamics of binding of these proteins to speciﬁc sequences of DNA in living cells [70]. And, importantly, the FK317 drug also potentially serves as a model for designing future therapeutic hybrid FK317-polyamide lexitropsin drugs that target speciﬁc AT-rich HMGA binding for selective killing of particular cell types. These future hybrid drugs are envisioned to consist of a cell permeable, reductively activated pro-drug (e.g., FK317) that is connected by a ﬂexible linker to a bipartite AT-sequence recognizing Py–Im polyamide lexitropsin that speciﬁcally recognizes the promoters of speciﬁc HMGA-regulated genes. In principal, these hybrid drugs could be used to speciﬁcally treat individual pathological conditions in ways that are not currently possible. Additionally, such drugs could ﬁnd use as new reagents to investigation the biological function(s) of HMGA proteins in normal processes such as embryogenesis and cell diﬀerentiation.

7. Conclusions A constellation of at least three characteristics distinguishes the HMGA proteins from almost all other cellular proteins: (i) the AT-hook DNA-binding motif that recognizes the structure, rather than the nucleotide sequence, of substrates; (ii) an unusually high degree of intrinsic ﬂexibility that allows the proteins to undergo disordered-to-ordered structural transitions as part of their biological function; and (iii) complex patterns of secondary modiﬁcations that appear to function as a biochemical ‘‘code’’ to precisely regulate the biological activity of the protein in vivo. Structural simplicity and ﬂexibility, combined with very sophisticated regulatory control mechanisms, are the biophysical and biochemical traits that allow the HMGA proteins to function as either ‘‘generalists’’ or as ‘‘specialists’’ in so many diﬀerent nuclear activities. In this review we have brieﬂy discussed how these features enable the proteins to participate in such diverse processes as chromosome dynamics during the cell cycle, transcriptional regulation of genes and both global and localized chromatin remodeling events. Their central role in nuclear metabolism makes the HMGA proteins attractive targets for therapeutic interventions to treat several diﬀerent types of pathologies. A number of current and potential approaches appear to be promising in this area. The key problem for developing such useful therapeutics, however, is to create drugs with the requisite speciﬁcity to target only certain functions of the HMGA proteins inside cells. Unfortunately, the same characteristics that allow the HMGA proteins to perform multifaceted tasks within the nucleus are precisely those that present the most challenge in creating eﬀective and speciﬁc anti-HMGA drugs. Nevertheless, important strides have been made in our understanding of the

176 structure and function of the HMGA proteins and thus a solid foundation has been laid for future advances in this area.

Abbreviations ARRE-1, -2 AT-hook bp CD28RE CK-2 CD ChIP CK2 CRM EMSA HMG HMGA (a.k.a., HMGI/Y) HMGB (a.k.a., HMG-1 and -2) HMGN (a.k.a., HMG-14 and -17) hu IFN- IL-2 IL-2R IL-4 iNOS ISWI MALDI/MS MAP kinase MATH mRNA mu MS MS/MS NFIL-2 NMR NURF PEV PKC PRD-I to -IV PRRII Py–Im

antigen regulated response elements-1 and -2 DNA-binding domain peptide of HMGA proteins base pair CD28 response element casein kinase 2 circular dichroism chromatin immunoprecipitation assay casein kinase 2 chromatin remodeling machine electrophoretic mobility shift assay ‘high mobility group’ nonhistone chromatin proteins the ‘AT-hook’ containing family of HMG proteins the ‘‘B box’’ containing family of HMG proteins the ‘Nucleosome-binding’ family of HMG proteins human interferon beta interleukin-2 alpha subunit of the IL-2 receptor interleukin-4 inducible nitric oxide synthase imitation SWI remodeling complex matrix-assisted laser desorption ionization/mass spectrometry mitogen activated protein kinase synthetic multi-AT hook proteins messenger ribonucleic acid murine mass spectrometry tandem mass spectrometry–mass spectrometry nuclear factor interleukin-2 nuclear magnetic resonance spectrometry nucleosome remodeling factor position eﬀect variegation protein kinase C positive regulatory domains I to IV positive regulatory region II pyrrole–imidazole polyamides

177 SRF SWI/SNF TCR TNF- SAR TPA

serum response factor chromatin remodeling complexes the T cell receptor -chain tumor necrosis factor- scaﬀold attachment region phorbol ester 12-O-tetradecanoylphorol-13-acetate

References 1. 2. 3. 4. 5. 6. 7. 8. 9. 10. 11. 12. 13. 14. 15. 16. 17. 18. 19. 20. 21. 22. 23. 24. 25. 26. 27. 28. 29. 30. 31. 32.

Bustin, M. (2001) Trends Biochem. Sci. 26, 152–153. Bustin, M. and Reeves, R. (1996) Prog. Nucleic Acid Res. Mol. Biol. 54, 35–100. Goodwin, G. (1998) Int. J. Biochem. Cell Biol. 30, 761–766. Bustin, M. (1999) Mol. Cell Biol. 19, 5237–5246. Jansen, E., Petit, M.R., Schoenmakers, E.F., Ayoubi, T.A., and van de Ven, W.J. (1999) Gene Ther. Molec. Biol. 3, 387–395. Tallini, G. and Dal Cin, P. (1999) Adv. Anat. Pathol. 6, 237–246. Reeves, R. (2000) Environ. Health Perspect. 108, 803–809. Bianchi, M.E. and Beltrame, M. (2000) EMBO Rep. 1, 109–114. Wisniewski, J.R. and Schwanbeck, R. (2000) Int. J. Mol. Med. 6, 409–419. Reeves, R. and Beckerbauer, L. (2001) Biochim. Biophys. Acta 1519, 13–29. Liu, F., Chau, K.Y., Arlotta, P., and Ono, S.J. (2001) Immunol. Res. 24, 13–29. Reeves, R. (2001) Gene 277, 63–81. Fedele, M., Battista, S., Manﬁoletti, G., Croce, C.M., Giancotti, V., and Fusco, A. (2001) Carcinogenesis 22, 1583–1591. Ashar, H.R., Fejzo, M.S., Tkachenko, A., Zhou, X., Fletcher, J.A., Weremowicz, S., Morton, C.C., and Chada, K. (1995) Cell 82, 57–65. Friedmann, M., Holth, L.T., Zoghbi, H.Y., and Reeves, R. (1993) Nucleic Acids Res. 21, 4259–4267. Ogram, S.A. and Reeves, R. (1995) J. Biol. Chem. 270, 14235–14242. Holth, L.T., Thorlacius, A.E., and Reeves, R. (1997) DNA Cell Biol. 16, 1299–1309. Cmarik, J.L., Li, Y., Ogram, S.A., Min, H., Reeves, R., and Colburn, N.H. (1998) Oncogene 16, 3387–3396. Johnson, K.R., Lehn, D.A., and Reeves, R. (1989) Mol. Cell Biol. 9, 2114–2123. Nagpal, S., Ghosn, C., DiSepio, D., Molina, Y., Sutter, M., Klein, E.S., and Chandraratna, R.A. (1999) J. Biol. Chem. 274, 22563–22568. Hauke, S., Rippe, V., and Bullerdiek, J. (2001) Genes Chromosomes Cancer 30, 302–304. Kurose, K., Mine, N., Iida, A., Nagai, H., Harada, H., Araki, T., and Emi, M. (2001) Genes Chromosomes Cancer 30, 212–217. Reeves, R. and Nissen, M.S. (1990) J. Biol. Chem. 265, 8573–8582. Evans, J.N., Zajicek, J., Nissen, M.S., Munske, G., Smith, V., and Reeves, R. (1995) Int. J. Pept. Protein Res. 45, 554–560. Huth, J.R., Bewley, C.A., Nissen, M.S., Evans, J.N., Reeves, R., Gronenborn, A.M., and Clore, G.M. (1997) Nat. Struct. Biol. 4, 657–665. Schwanbeck, R., Gymnopoulos, M., Petry, I., Piekielko, A., Szewczuk, Z., Heyduk, T., Zechel, K., and Wisniewski, J.R. (2001) J. Biol. Chem. 267, 26012–26021. Hill, D.A., Pedulla, M.L., and Reeves, R. (1999) Nucleic Acids Res. 27, 2135–2144. Banks, G.C., Mohr, B., and Reeves, R. (1999) J. Biol. Chem. 274, 16536–16544. Merika, M. and Thanos, D. (2001) Curr. Opin. Genet. Dev. 11, 205–208. Reeves, R. (1992) Curr. Opin. Cell Biol. 4, 413–423. Reeves, R. and Beckerbauer, L. (2002) Prog. Cell Cycle Res. 5, 279–286. Reeves, R., Langan, T.A., and Nissen, M.S. (1991) Proc. Natl. Acad. Sci. USA 88, 1671–1675.

178 33. 34. 35. 36. 37. 38. 39. 40. 41.

42. 43. 44. 45. 46. 47. 48. 49. 50. 51. 52. 53. 54. 55. 56. 57. 58. 59. 60. 61. 62. 63. 64. 65. 66. 67. 68. 69. 70. 71. 72.

Banks, G.C., Li, Y., and Reeves, R. (2000) Biochemistry 39, 8333–8346. Munshi, N., Merika, M., Yie, J., Senger, K., Chen, G., and Thanos, D. (1998) Mol. Cell 2, 457–467. Strahl, B.D. and Allis, C.D. (2000) Nature 403, 41–45. Maher, J.F. and Nathans, D. (1996) Proc. Natl. Acad. Sci. USA 93, 6716–6720. Kim, H.-P., Kelly, J., and Leonard, W.J. (2001) Immunity 15, 159–172. Himes, S.R., Reeves, R., Attema, J., Nissen, M., Li, Y., and Shannon, M.F. (2000) J. Immunol. 164, 3157–3168. Plaxco, K.W. and Gross, M. (1997) Nature 386, 657, 659. Wright, P.E. and Dyson, H.J. (1999) J. Mol. Biol. 293, 321–331. Dunker, A.K., Lawson, D., Brown, C.J., Romero, P., Oh, J., Oldﬁeld, C.J., Campen, A.M., Ratliﬀ, C.M., Hipps, K.W., Ausio, J., Nissen, M.S., Reeves, R., Kang, C.H., Kissinger, C.R., Bailey, R.W., Griswold, M.D., Chiu, W., Garner, E.C., and Obradovic, Z. (2001) J. Mol. Graph. Model. 19, 1–65. Cheung, P., Allis, C.D., and Sassone-Corsi, P. (2000) Cell 103, 263–271. Jenuwein, T. and Allis, C.D. (2001) Science 293, 1074–1080. Gamble, M.J. and Freedman, L.P. (2002) Trends Biochem. Sci. 27, 165–167. Elton, T.S. (1986) Puriﬁcation and Characterization of the High Mobility Group Nonhistone Chromatin Proteins. Ph.D. Thesis, pp. 1–134. Washington State University, Pullman, WA 99164. Palvimo, J. and Linnala-Kankkunen, A. (1989) FEBS Lett. 257, 101–104. Ferranti, P., Malorni, A., Marino, G., Pucci, P., Goodwin, G.H., Manﬁoletti, G., and Giancotti, V. (1992) J. Biol. Chem. 267, 22486–22489. Wang, D.Z., Ray, P., and Boothby, M. (1995) J. Biol. Chem. 270, 22924–22932. Wang, D., Zamorano, J., Keegan, A.D., and Boothby, M. (1997) J. Biol. Chem. 272, 25083–25090. Schwanbeck, R. and Wisniewski, J.R. (1997) J. Biol. Chem. 272, 27476–27483. Diana, F., Sgarra, R., Manﬁoletti, G., Rustighi, A., Poletto, D., Sciortino, M.T., Mastino, A., and Giancotti, V. (2001) J. Biol. Chem. 276, 11354–11361. Edberg, D.D., Adkins, J., Springer, D., and Reeves, R. (2002) Unpublished data. Reeves, R. and Nissen, M.S. (1999) Methods Enzymol. 304, 155–187. Kinter, M. and Sherman, N.E. (2000) Protein Sequenceing and Identiﬁcation Using Tandem Mass Spectrometry. Wiley-Interscience, New York. Wu, K., Strauss, F., and Varshavsky, A. (1983) J. Mol. Biol. 170, 93–117. Strauss, F. and Varshavsky, A. (1984) Cell 37, 889–901. Reeves, R. and Nissen, M.S. (1993) J. Biol. Chem. 268, 21137–21146. Paton, A.E., Wilkinson-Singley, W., and Olins, D.E. (1983) J. Biol. Chem. 258, 13221–13229. Reeves, R. and Wolﬀe, A.P. (1996) Biochemistry 35, 5063–5074. Reeves, R., Leonard, W.J., and Nissen, M.S. (2000) Mol. Cell. Biol. 20, 4666–4679. Himes, S.R., Coles, L.S., Reeves, R., and Shannon, M.F. (1996) Immunity 5, 479–489. John, S., Reeves, R., Lin, J.X., Child, R., Leiden, J.M., Thompson, C.B., and Leoni, L. (1995) Mol. Cell. Biol. 15, 1786–1796. Laurent, B.C., Treich, I., and Carlson, M. (1993) Genes Dev. 7, 583–591. Tsukiyama, T., Daniel, C., Tamkun, J., and Wu, C. (1995) Cell 83, 1021–1026. Cairns, B.R., Schlichter, A., Erdjument-Bromage, H., Tempst, P., Kornberg, R.D., and Winston, F. (1999) Mol. Cell 4, 715–723. Wang, W., Cot’e, J., Xue, Y., Zhou, S., Khavari, P.A., Biggar, S.R., Muchardt, C., Kalpana, G.V., Goﬀ, S.P., Yaniv, M., Workman, J.L., and Crabtree, G.R. (1996) EMBO J. 15, 5370–5382. Vignali, M., Hassan, A.H., Neely, K.E., and Workman, J.L. (2000) Mol. Cell. Biol. 20, 1899–1910. Bourachot, B., Yaniv, M., and Muchardt, C. (1999) Mol. Cell. Biol. 19, 3931–3939. Xiao, H., Sandaltzopoulos, R., Wang, H., Hamiche, A., Ranallo, R., Lee, K., Fu, D., and Wu, C. (2001) Mol. Cell 8, 531–543. Beckerbauer, L. (2002) Role of the HMGA1 protein in transcriptional activation of the IL-2R gene. Ph.D. Thesis, pp. 1–187. Washington State University, Pullman, WA, 99164. Lomvardas, S. and Thanos, D. (2001) Cell 106, 685–696. Berlingieri, M.T., Manﬁoletti, G., Santoro, M., Bandiera, A., Visconti, R., Giancotti, V., and Fusco, A. (1995) Mol. Cell. Biol. 15, 1545–1553.

179 73. Reeves, R., Edberg, D.D., and Li, Y. (2001) Mol. Cell. Biol. 21, 575–594. 74. Scala, S., Portella, G., Fedele, M., Chiappetta, G., and Fusco, A. (2000) Proc. Natl. Acad. Sci. USA 97, 4256–4261. 75. Thanos, D. and Maniatis, T. (1992) Cell 71, 777–789. 76. Himes, S.R., Coles, L.S., Reeves, R., and Shannon, M.F. (1996) Immunity 5, 479–489. 77. Himes, S.R., Reeves, R., Attema, J., Nissen, M., Li, Y., and Shannon, M.F. (2000) J. Immunol. 164, 3157–3168. 78. Toulmen, J.J., Di Primo, C., and Moreau, S. (2001) Prog. Nucl. Acid Res. Mol. Biol. 69, 1–46. 79. Pooga, M., Land, T., Bartfai, T., and Langel, U. (2001) Biomol. Eng. 17, 183–192. 80. Giovannangeli, C. and Helene, C. (1997) Antisense Nucleic Acid Drug Dev. 7, 413–421. 81. Helene, C., Giovannangeli, C., Guieysse-Peugeot, A.L., and Praseuth, D. (1997) Ciba Found. Symp. 209, 94–102. 82. Jen, K.Y. and Gewirtz, A.M. (2000) Stem Cells 18, 307–319. 83. Gottesfeld, J.M., Turner, J.M., and Dervan, P.B. (2000) Gene Expr. 9, 77–91. 84. Howard, O.M., Oppenheim, J.J., Hollingshead, M.G., Covey, J.M., Bigelow, J., McCormack, J.J., Buckheit, R.W.J., Clanton, D.J., Turpin, J.A., and Rice, W.G. (1998) J. Med. Chem. 41, 2184–2193. 85. Cozzi, P. and Mongelli, N. (1998) Curr. Pharm. Des. 4, 181–201. 86. Broggini, M. and D’Incalci, M. (1994) Anticancer Drug Des. 9, 373–387. 87. Possati, L., Campioni, D., Sola, F., Leone, L., Ferrante, L., Trabanelli, C., Ciomei, M., Montesi, M., Rocchetti, R., Talevi, S., Bompadre, S., Caputo, A., Barbanti-Brodano, G., and Corallini, A. (1999) Clin. Exp. Metastasis 17, 575–582. 88. Wegner, M. and Grummt, F. (1990) Biochem. Biophys. Res. Comm. 166, 1110–1117. 89. Radic, M.Z., Saghbini, M., Elton, T.S., Reeves, R., and Hamkalo, B.A. (1992) Chromosoma 101, 602–608. 90. Turner, P.R. and Denny, W.A. (1996) Mutat. Res. 355, 141–169. 91. Zhao, K., Kas, E., Gonzalez, E., and Laemmli, U.K. (1993) EMBO J. 12, 3237–3247. 92. Kas, E., Poljak, L., Adachi, Y., and Laemmli, U.K. (1993) EMBO J. 12, 115–126. 93. Strick, R. and Laemmli, U.K. (1995) Cell 83, 1137–1148. 94. Girard, F., Bello, B., Laemmli, U.K., and Gehring, W.J. (1998) EMBO J. 17, 2079–2085. 95. Lown, J.W. (1994) J. Mol. Recognit. 7, 79–88. 96. Walker, W.L., Kopka, M.L., and Goodsell, D.S. (1997) Biopolymers 44, 323–334. 97. Wemmer, D.E. and Dervan, P.B. (1997) Curr. Opin. Struct. Biol. 7, 355–361. 98. Dervan, P.B. and Burli, R.W. (1999) Curr. Opin. Chem. Biol. 3, 688–693. 99. Kielkopf, C.L., White, S., Szewczyk, J.W., Turner, J.M., Baird, E.E., Dervan, P.B., and Rees, D.C. (1998) Science 282, 111–116. 100. Geierstanger, B.H., Mrksich, M., Dervan, P.B., and Wemmer, D.E. (1994) Science, 646–650. 101. Gottesfeld, J.M., Neely, L., Trauger, J.W., Baird, E.E., and Dervan, P.B. (1997) Nature 387, 202–205. 102. Ehley, J.A., Melander, C., Herman, D., Baird, E.E., Ferguson, H.A., Goodrich, J.A., Dervan, P.B., and Gottesfeld, J.M. (2002) Mol. Cell. Biol. 22, 1723–1733. 103. Janssen, S., Cuvier, O., Muller, M., and Laemmli, U.K. (2000) Mol. Cell 6, 1013–1024. 104. Henderson, A., Bunce, M., Siddon, N., Reeves, R., and Tremethick, D.J. (2000) J. Virol. 74, 10523–10534. 105. Hall, D.G., Manku, S., and Wang, F. (2001) J. Comb. Chem. 3, 125–150. 106. Iwami, M., Kiyoto, S., Terano, H., Kohsaka, M., Aoki, H., and Imanaka, H. (1987) J. Antibiot. (Tokyo) 40, 589–593. 107. Kiyoto, S., Shibata, T., Yamashita, M., Komori, T., Okuhara, M., Terano, H., Kohsaka, M., Aoki, H., and Imanaka, H. (1987) J. Antibiot. (Tokyo) 40, 594–599. 108. Beckerbauer, L., Tepe, J.J., Cullison, J., Reeves, R., and Williams, R.M. (2000) Chem. Biol. 7, 805–812. 109. Beckerbauer, L., Tepe, J.J., Eastman, R.A., Mixter, P., Williams, R.M., and Reeves, R. (2002) Chem. Biol. 9, 1–20.

180 110. Paz, M.M. and Hopkins, P.B. (1997) Tetrahedron Lett. 38, 343–346. 111. Paz, M.M. and Hopkins, P.B. (1997) J. Am. Chem. Soc. 119, 5999–6005. 112. Naoe, Y., Inami, M., Kawamura, I., Nishigaki, F., Tsujimoto, S., Matsumoto, S., Manda, T., and Shimomura, K. (1998, June) Jpn. J. Cancer Res., 666–672. 113. Yie, J., Liang, S., Merika, M., and Thanos, D. (1997) Mol. Cell Biol. 17, 3649–3662. 114. Currie, R.A. (1997) J. Biol. Chem. 272, 30880–30888. 115. Chin, M.T., Pellacani, A., Wang, H., Lin, S.S., Jain, M.K., Perrella, M.A., and Lee, M.E. (1998) J. Biol. Chem. 273, 9755–9760. 116. Zhang, X.M. and Verdine, G.L. (1999) J. Biol. Chem. 274, 20235–20243. 117. Leger, H., Sock, E., Renner, K., Grummt, F., and Wegner, M. (1995) Mol. Cell. Biol. 15, 3738–3747. 118. Pierantoni, G.M., Fedele, M., Pentimalli, F., Benvenuto, G., Pero, R., Viglietto, G., Santoro, M., Chiariotti, L., and Fusco, A. (2001) Oncogene 20, 6132–6141. 119. Luger, K., Mader, A.W., Richmond, R.K., Sargent, D.F., and Richmond, T.J. (1997) Nature 389, 251–260. 120. Whitley, M.Z., Thanos, D., Read, M.A., Maniatis, T., and Collins, T. (1994) Mol. Cell Biol. 14, 6464–6475. 121. Chuvpilo, S., Schomberg, C., Gerwig, R., Heinﬂing, A., Reeves, R., Grummt, F., and Serﬂing, E. (1993) Nucl. Acids Res. 21, 5694–5704. 122. Perrella, M.A., Pellacani, A., Wiesel, P., Chin, M.T., Foster, L.C., Ibanez, M., Hsieh, C.M., Reeves, R., Yet, S.F., and Lee, M.E. (1999) J. Biol. Chem. 274, 9045–9052. 123. Fashena, S.J., Reeves, R., and Ruddle, N.H. (1992) Mol. Cell Biol. 12, 894–903.

HMGA proteins: multifaceted players in nuclear function

HMGA proteins: multifaceted players in nuclear function

Recommend Documents