Journal Pre-proof Folding perspectives of an intrinsically disordered transactivation domain and its single mutation breaking the folding propensity
Nitin Sharma, Alexander V. Fonin, Olesya G. Shpironok, Sergey A. Silonov, Konstantin K. Turoverov, Vladimir N. Uversky, Irina M. Kuznetsova, Rajanish Giri PII:
S0141-8130(19)34479-4
DOI:
https://doi.org/10.1016/j.ijbiomac.2019.11.111
Reference:
BIOMAC 13892
To appear in:
International Journal of Biological Macromolecules
Received date:
14 June 2019
Revised date:
1 November 2019
Accepted date:
12 November 2019
Please cite this article as: N. Sharma, A.V. Fonin, O.G. Shpironok, et al., Folding perspectives of an intrinsically disordered transactivation domain and its single mutation breaking the folding propensity, International Journal of Biological Macromolecules(2018), https://doi.org/10.1016/j.ijbiomac.2019.11.111
This is a PDF file of an article that has undergone enhancements after acceptance, such as the addition of a cover page and metadata, and formatting for readability, but it is not yet the definitive version of record. This version will undergo additional copyediting, typesetting and review before it is published in its final form, but we are providing this version to give early visibility of the article. Please note that, during the production process, errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
© 2018 Published by Elsevier.
Journal Pre-proof
Folding perspectives of an intrinsically disordered transactivation domain and its single mutation breaking the folding propensity
Nitin Sharma,1,# Alexander V. Fonin,2,# Olesya G. Shpironok,2 Sergey A. Silonov,2 Konstantin K. Turoverov,2,3 Vladimir N. Uversky,4,5,* Irina M. Kuznetsova,2,* and Rajanish
1
ro
of
Giri1,6,*
School of Basic Sciences, Indian Institute of Technology Mandi, Himachal Pradesh 175005,
Laboratory of structural dynamics, stability and folding of proteins, Institute of Cytology,
re
2
-p
India
3
lP
Russian Academy of Sciences, St. Petersburg, Russia
Peter the Great St. Petersburg Polytechnic University, Department of Biophysics,
Department of Molecular Medicine and USF Health Byrd Alzheimer's Research Institute,
ur
4
na
Polytechnicheskaya av. 29, St. Petersburg, Russia
5
Jo
Morsani College of Medicine, University of South Florida, Tampa, FL, USA Laboratory of New Methods in Biology, Institute for Biological Instrumentation of the Russian
Academy of Sciences, Pushchino, Moscow region 142290, Russia 6
BioX Centre, Indian Institute of Technology Mandi, VPO Kamand, 175005, India
*Corresponding authors: VNU, E-mail:
[email protected]; Phone: 1-813-9745816; Fax: 1-813-974-7357; IMK, E-mail:
[email protected]; Phone: +7 812 2971957; Fax: +7 812 2970341; RG, E-mail:
[email protected]; Phone: +91-1905-267134; Fax: 01905-267138 1
Journal Pre-proof
ABSTRACT Transcriptional regulation is a critical facet of cellular development controlled by numerous transcription factors, among which are E-proteins (E2A, HEB, and E2-2) that play important roles in lymphopoiesis. For example, primary hematopoietic cells immortalisation is promoted by interaction of the conserved PCET motif consisting of the Leu-X-X-Leu-Leu (LXXLL) and
of
Leu-Asp-Phe-Ser (LDFS) sequences of the transactivation domains (AD1) of E-proteins with the KIX domain of CBP/p300 transcriptional co-activators. Earlier, it was shown that the LXXLL
ro
motif is essential for the PCET-KIX interaction driven by the PCET helical transition. In this
-p
study, we analyzed the dehydration-driven gain of helicity in the conserved region (residues 11-
re
28) of the AD1 domain of E-protein. Particularly, we showed that AD1 structure was
lP
dramatically affected by alcohols, but was insensitive to changes in pH or the presence of osmolytes sarcosine and taurine, or high polyethylene glycol (PEG) concentrations and DOPC
na
Liposomes. These structure-forming effects of solvents were almost completely absent in the
ur
case of L21P AD1 mutant characterized by weakened interaction with KIX. This indicates that KIX interaction-induced AD1 ordering is driven by PCET motif dehydration. The L21P
Jo
mutation-caused loss of molecular recognition function of AD1 is due to the mutation-induced disruption of the AD1 helical propensity.
Keywords: Intrinsically Disordered Proteins, transactivation domain, PCET Motif
1. INTRODUCTION Non-random chromosomal translocations are known to cause several malignancies. These translocations either result in production of novel chimeric factors, which stimulate uncontrolled 2
Journal Pre-proof cell growth and inhibit differentiation or lead to overexpression of endogenous genes [1]. One such translocation occurs between chromosome 1 and 19 (t (1; 19)), which fuses E2A gene (one of the E-protein-encoding genes) and the PBX1 gene encoding for the homeodomain protein PBX1, implicated in induction of acute lymphoblastic leukemia (ALL) [2]. In approximately 412% of pediatric ALL cases, t (1; 19) leads to the expression of oncogenic E2A-PBX1 fusion protein [3]. The E-proteins (E2A, HEB, and E2-2) are a family of class I basic helix-loop-helix
of
(bHLH) mammalian transcription factors that play essential roles in lymphopoiesis [2]. Each
ro
member of this family contains a C-terminal domain (CTD) for protein dimerization and
-p
recognition of the E-box DNA sequence CANNTG and two N-terminal activation domains AD1
re
and AD2 that function in recruitment of co-activators and co-repressors [4]. An important conserved region known as PCET motif of AD1 domain consists of two overlapping conserved
lP
protein recognition motifs Leu-xx-Leu-Leu (LXXLL) and Leu-Asp-Phe-Ser (LDFS) sequences.
na
The LXXLL motif found in AD1 is an illustrative member of the highly conserved ϕ-x-x-ϕ-ϕ motif (where ϕ represents a bulky hydrophobic residue and x corresponds to any amino acid) that
ur
mediates interactions between transcription factors and transcriptional co-regulators [5]. On its
Jo
turn, the conserved LDFS motif, which is found in the vertebrate class I HLH proteins and in a yeast HLH protein, stimulates transcription by direct recruitment of the SAGA histone acetyltransferase complex [6]. Previous reports have identified the role of PCET motif in interactions with transcriptional activators, where this conserved motif was shown to bind to a KIX domain of the members of the CBP/p300 transcriptional co-activator family [2]. This CBP/p300 family includes two closely related transcriptional co-activating proteins, CREBbinding protein (CBP or CREBBP) and E1A binding protein p300 (p300 or EP300) [7]. Both members of the CBP/p300 family are characterized by similar structures, containing five protein
3
Journal Pre-proof interaction domains, such as the nuclear receptor interaction domain (RID), the KIX domain (CREB and MYB interaction domain), the cysteine/histidine regions (TAZ1/CH1 and TAZ2/CH3), and the interferon response binding domain (IBiD) [7]. Studies have also identified this PCET motif as a target of the eTAFH domain of transcriptional corepressor ETO and leukemogenic fusion protein AML1-ETO [8,9]. These interactions prevent the binding of KIX domain to the same site and lead to the silencing of E-proteins.
of
The molecular mechanism through which E2A-PBX1 plays a role in leukaemia induction is
ro
unclear. In literature, two potential mechanisms are consonant: recruitment of co-regulators to
-p
the target gene loci bound to E2A-PBX1 homeodomain or withdrawal of co-regulators away
re
from the wild-type E-proteins. These mechanisms, in turn, reveal a critical role of the AD1
lP
domain of E2A-PBX1 in the recruitment of transcriptional co-regulators. Therefore, AD1 seems to be indispensable for E2A-PBX1 mediated oncogenesis [2].
na
The role of the conserved region of AD1 domain of E-proteins (E2A and E2-2) in transcription
ur
activation of yeast and mammalian cells was demonstrated [10], where it was observed that the AD1 conserved putative helix region (PCET motif) is crucial for transactivation properties of E-
Jo
proteins. The AD1 domain (residues 1-99) has reported being structured and characterised by the distinctive helical properties, but the synthetic peptide (residues 9-31) representing conserved region (PCET motif) was unstructured in aqueous solution and adopted a helical conformation with the increasing concentration of 2,2,2-trifluoroethanol (TFE) [10]. The analysis of the role of the interaction between E2A-PBX1 (1-483) and KIX domain of CBP/p300 in proliferation of primary hematopoietic cells revealed that this interaction with KIX domain requires helical AD1 and AD2 domains [11]. This study highlighted the functional cooperativity between the two domains and presented the importance of this interaction in the immortalisation of primary bone 4
Journal Pre-proof marrow cells retrovirally transduced with E2A-PBX1 [11]. As conserved PCET motif region of AD1 is unstructured, its interaction with KIX domain was elucidated to show that after interaction with KIX, AD1 conserved domain undergoes helical transition [12]. But the substitution of L20 in LXXLL motif has shown to impair PCET interaction with KIX and also inhibited cell immortalisation by E2A-PBX1 [12]. E2A binding requirements to the KIX domain was functionally mapped and compared with the c-Myb binding to KIX. This analysis revealed
of
that the E2A interaction was less sensitive to the disruption of hydrophobic pocket and groove of
ro
KIX, suggesting different mode of KIX recognition by these activation domains and underscored
-p
the need of biophysical and structural characterisation of the E2A-KIX interaction further [12].
re
The importance of LXXLL motif for the interactions of E-proteins with KIX and oncogenesis was also reported [2]. The LXXLL residues of PCET motif are essential for both binding KIX
lP
and bone marrow immortalisation by E2A-PBX1. However, despite the abundance of data about
na
the function of both E-proteins and CBP/p300, how the interaction between E2A-PBX1 and CBP/p300 might contribute to leukaemia induction is still unclear. Therefore, there is a need for
ur
detailed structural studies to find the molecular mechanisms behind the abolishment of leukemia
Jo
induction due to the single L20 mutations in the AD1 region of E2A-PBX1. Intrinsically disordered proteins (IDPs) and IDP regions (IDPRs) are unable to form stable structures, but still, exhibit important biological activities [13–20]. Their flexibility and structural instability are due to the specific amino acid sequence [17,21]. The penetrance of IDPs in viruses and the use of molecular recognition features by these viral IDPs have allowed them to fold upon interaction with the folding partners to perform various functions with small number of proteins [22,23]. The studies of IDPs involvement in pathogenic disease have proven the importance of the development of new disorder based therapeutic strategies [22–28]. Structured proteins can be
5
Journal Pre-proof grouped according to their secondary and tertiary structures but IDPs and IDPRs are difficult to partition into structural groups [29–31]. The conserved PCET motif of the AD1 domain of Eproteins is considered to be IDPR, which folds upon binding to KIX domain of CBP. Binding of IDPs to their physiological partners, such as proteins and nucleic acids, is often coupled with structuration and molecular recognition features, a phenomenon known as “folding upon binding” or “coupled folding and binding” [29,32–35]. Coupled folding and binding is further
of
divided into two most commonly discussed mechanisms, that is, conformational selection (i.e.,
ro
folding before binding) and induced fit (i.e., folding after binding) [36,37]. However, there are
-p
also more complex mechanisms, e.g., comprising conformational selection combined with
re
induced fit. The cellular conditions of molecular crowding (where the concentration of biological macromolecules ranges from 80 to 400 mg/ml) occupy 5–40% of cellular volume [38–41]. This
lP
creates a unique crowded medium with restricted amounts of free water [40,42–46] are also an
na
essential factor, which may play a role in structural transitions and functionality of IDPs [47,48]. It is known that intracellular environment contains high concentration of biological
ur
macromolecules that can be created not only by high-molecular substances (proteins, nucleic and
Jo
ribonucleic acids), but also by low-molecular compounds. In particular, under extreme conditions, cells of a number of organisms contain high concentrations (up to 2.5 M) of osmolytes [49–52]. Osmolytes are small organic substances, the main function of which is to equalize the hydrostatic pressure between the inside of cell and the extracellular space. These low-molecular compounds accumulate in the cell in response to stress (osmotic, temperature, caused by changes in pH, salt concentration) and are able to increase the stability and functional activity of native proteins and promote folding of unfolded proteins. Therefore, osmolytes are often considered as ‘chemical chaperones’ [50,53–58].
6
Journal Pre-proof Here, we have elucidated the structural properties of the conserved PCET motif (residues 11-28) of the E-protein HEB (reference from structure PDB: 2KWF) to evaluate the effects of pH, temperature, different solvents, osmolytes, and macromolecular crowding conditions. We have found the gain of helical content in the peptide corresponding to the wild type PCET motif in the presence of alcohols, such as ethanol, 2,2,2-trifluoroethanol (TFE), hexafluoroisopropanol (HFIP) and the osmolyte trimethylamine N-oxide (TMAO). Also, we have observed that the
of
introduction of the proline (known as a helix breaker) at the 21st residue (L21P mutation) has
ro
disrupted the helical propensity of the peptide. Curiously, the previous analysis revealed that the
-p
L20A mutant of the analogous E2A-PCET peptide showed no detectable binding to KIX domain
re
[2]. It is known that although proline serves as a helix breaker, alanine is known to display the highest propensity for helical structure among the amino acids. Therefore, these observations
lP
indicate that the mechanisms for disruption of KIX binding by L21P mutation in HEB PCET
na
motif and by the L20A mutation in the E2A-PCET peptide are different. In other words, our analysis provided mechanistic foundation needed for better understanding of the roles of
ur
substitutions of the conserved leucine residue at 20th position of E-proteins in disrupting the
Jo
transactivation and molecular recognition functions of E-proteins and in ultimate blocking of the leukaemia induction by E2A-PBX1.
2. MATERIAL AND METHODS Many chemicals were purchased from Sigma-Aldrich. The peptide comprising 19 amino acid residues (11-28) of the wild type PCET motif of AD1 domain of HEB (NH2GSGTDKELSDLLDFSAMFS-COOH) and mutant type (L21P) was chemically synthesized by ThermoFisher Scientific. The wild type peptide sequence was equivalent to the peptide used for obtaining the NMR solution structure of KIX domain bound to PCET (HEB) motif of E-protein 7
Journal Pre-proof activation domain 1 (AD1) with PDB: 2KWF. Lyophilised peptide was dissolved in buffer (10 mM potassium phosphate, 50 mM sodium sulphate, pH 7.0) at concentration of 1 mg/ml. Lipids were purchased from Avanti Polar Lipids (Alabaster, AL, USA). Computational analysis: The intrinsic disorder predisposition of human E-protein HEB, also known as transcription factor 12 (TCF-12), class B basic helix-loop-helix protein 20 (bHLHb20), DNA-binding protein HTF4, E-box-binding protein, and transcription factor HTF-4 (UniProt ID:
of
Q99081) was evaluated by a set of commonly used disorder predictors, such as PONDR® VLXT
ro
[59], PONDR® VSL2 [60], PONDR® VL3 [60,61]}, PONDR® FIT [62], IUPred_short, and
-p
IUPred_long [63–65]. Justification for the selection of these tools is provided in our previous
re
publications (e.g., [66,67]). We also generated a mean per-residue intrinsic disorder profile for
lP
vitronectin by averaging the outputs of these six per-residue disorder predictors. The use of consensus or mean disorder propensity is motivated by empirical observations that this approach
na
usually increases the predictive performance compared to the use of a single predictor [68–70]. The outputs of the evaluation of the per-residue disorder propensity by these tools are
ur
represented as the real numbers between 1 (ideal prediction of disorder) and 0 (ideal prediction
Jo
of order). A threshold of ≥0.5 was used to identify disordered residues and regions in a query protein, whereas residues/regions characterized by the disorder scores ranging from 0.2 to 0.5 are classified as flexible. Liposome Preparation: Dioleoyl-phosphatidyl choline (DOPC) was used for preparation of neutral liposomes. Liposomes were prepared in 20mM sodium phosphate buffer pH 7.4 by 5 freeze thaw cycles in liq. N2 and water at 60oC. This is followed by the extrusion method using mini Avanti extruder with 100 nm polycarbonate filters by Avanti standard protocol [71]. The
8
Journal Pre-proof size and uniformity of prepared liposomes was confirmed by Dynamic Light Scattering (DLS) using Zetasizer Nano ZS (Malvern Instruments Ltd., UK). CD spectroscopy: CD spectroscopy was used to assess the secondary structure composition of the AD1 domain of E-protein (HEB) in various solvents (such as water and aqueous mixtures of ethanol, TFE, and HFIP), osmolytes (such as sarcosine, taurine and TMAO), neutral liposomes such as DOPC, as well as in the presence of high concentration of crowding agent, PEG. Far-UV
of
CD spectra were measured between 190 and 240 nm at a protein concentration of 20 µM using a
ro
JASCO-815 spectropolarimeter equipped with a thermostatically controlled cell holder.
-p
Measurements were conducted at 25°C using a quartz cuvette with the 1 path-length mm.
re
Spectra were recorded at a scan speed of 50 nm/min and a bandwidth of 1 nm. The spectra were
lP
averaged over three scans to eliminate signal noise. The normalisation of data was performed by subtracting the baseline recorded for the buffer from values obtained for protein samples under
na
similar conditions.
ur
Phase diagram analysis: The “phase diagram” method (also known as method of spectral diagrams, or parametric dependencies) was first suggested by Dr. Burstein in 1971 for the
Jo
analysis of fluorescence data [72], and later, a generalized form was developed for any pair of extensive independent spectral parameters [73]. The indispensable property of this method of the phase diagrams is to develop the plot of I1 versus I2, where I1 and I2 are the spectral intensities measured on wavelengths 1 and 2 under distinct experimental conditions for a protein undergoing structural transformations. Spectral intensity can be described by any twocomponent system by a simple relationship: I()=1 I1() + 2 I2()
(1)
9
Journal Pre-proof 1+2= 1, where I1() and I2() are the spectral intensities corresponding to the first and second components, whereas 1 and 2 are relative contents of these components in a system. Excluding
1 (1=1-2), the equation (1) could be rewritten as: I()=(1- 2) I1() + 2 I2() = I1() + 2(I2()-I1())
(2)
of
Here, 2 may be determined from the spectral intensity measurements at two different
ro
wavelengths, 1 and 2. In fact:
lP
and
(4)
re
I() = I1() + 2 (I2() - I1())
(3)
-p
I() = I1() + 2 (I2() - I1())
𝐼(𝜆2 )−𝐼1 (𝜆2 ) (𝜆2 ) 2 (𝜆2 )−𝐼
𝛼2 = 𝐼
na
1
(5)
ur
The relationship between I() and I() may be determined by the substitution of 2 in (3) from
Jo
(5):
𝐼(𝜆2 )−𝐼1 (𝜆2 ) (𝐼2 (𝜆1 ) 2 (𝜆2 )−𝐼1 (𝜆2 )
𝐼(𝜆1 ) = 𝐼1 (𝜆1 ) + 𝐼
𝐼 (𝜆 )−𝐼 (𝜆 )
− 𝐼1 (𝜆1 )) = 𝐼1 (𝜆1 ) − 𝐼2 (𝜆1 )−𝐼1 (𝜆1 ) 𝐼1 (𝜆2 ) + 2
𝐼2 (𝜆1 )−𝐼1 (𝜆1 ) 𝐼(𝜆2 ) 𝐼2 (𝜆2 )−𝐼1 (𝜆2 )
2
1
2
(6)
or: 𝐼(𝜆1 ) = 𝑎 + 𝑏𝐼(𝜆2 ),
(7)
where 𝐼 (𝜆 )−𝐼 (𝜆 )
𝑎 = 𝐼1 (𝜆1 ) − 𝐼2 (𝜆1 )−𝐼1 (𝜆1 ) 𝐼1 (𝜆2 ) and 𝑏 = 2
2
1
2
𝐼2 (𝜆1 )−𝐼1 (𝜆1 ) . 𝐼2 (𝜆2 )−𝐼1 (𝜆2 )
10
(8)
Journal Pre-proof In relation to protein unfolding experiments, the relationship (7) predicts that the dependence 𝐼(𝜆1 ) = 𝑓(𝐼(𝜆2 )) will be linear, if changes in protein environment result in the all-or-none transition between two different conformations. Contrarily, the non-linearity of this function imitates the sequential behavior of structural transformations. Furthermore, each linear portion of the 𝐼(𝜆1 ) = 𝑓(𝐼(𝜆2 )) dependence describes the individual all-or-none transition. In principle, 1
of
and 2 are the arbitrary wavelengths of the spectrum [73]. Solution-state NMR Spectroscopy: All NMR experiments were performed on a Bruker
H] NOESY (Nuclear Overhauser Enhancement spectroscopy) experiments were carried out for
-p
1
ro
Advance III 500 MHz spectrometer equipped with the cryogenic triple-resonance probe. 2D [1H-
re
protein resonance assignment and collected 2048 data points in f2 and 1024 in f1 on 1 mM
lP
samples of wild type and mutant E2A peptides in phosphate buffer, pH 7. Solvent suppression was achieved by pre-saturation of the water signal for samples in aqueous-D2O (5%). NOESY
na
data were collected with a mixing time of 250 ms to derive 1H–1H distance constraints. A total of
ur
128 (NOESY) transients were acquired and all 2D spectra were recorded with a spectral width of 7,000 Hz. Bruker AG 3.5 and Topspin 2.1 software was used for acquisition, Fourier
Jo
transformation, and processing of time-domain data. Computer Aided Resonance Assignment (CARA 1.9.2 alpha) software was used for backbone and side chain resonance analysis.
3. RESULTS To place the intrinsic disorder predisposition of the PCET motif within the context of the full length human HEB protein and to evaluate how this propensity is affected by the L21P mutation, we subjected the amino acid sequences of the wild type and mutant proteins to the
11
Journal Pre-proof multiparametric computational analysis utilizing several commonly used per-residue disorder predictors. Results are shown in Figure 1. Figure 1A illustrates that the wild type human HEB is predicted as mostly disordered protein containing several relatively short ordered regions. The presence of such short regions of order embedded within the long IDPRs represents a hallmark feature of many IDPs and reflects their ability to adopt structure upon interaction with a binding partner. In fact, such segments are commonly classified as molecular recognition
of
features (MoRFs), which are relatively short segments in IDPs that are capable of undergoing
ro
disorder to order transitions upon binding to their partners [29,74–80]. In its bound state, the
[29,74,75,80,81]. Because of specific sequence features, MoRFs are
re
irregular structure
-p
ordered (or “frozen”) MoRF can contain elements of classic secondary structure or possess
predictable with rather high accuracy [29,77,78]. Using one of such tools, ANCHOR [78],
lP
human HEB was predicted to contain 20 MoRFs (residues 5-31, 41-52, 82-108, 114-143, 153-
na
162, 171-196, 219-230, 246-256, 265-277, 291-298, 302-312, 315-353, 388-422, 424-490, 498-519, 546-554, 588-609, 614-632, 641-647, and 676-682). In other words, more than 60%
ur
of the residues of this protein might take part in disorder-based interaction with various
Jo
partners. Figure 1B focuses on the first 50 residues of the wild type human HEB, a region containing the PCET motif. It is seen that similar to the full-length protein, the N-terminal tail of human HEB is mostly disordered. Importantly, the PCET motif (shown as a gray shaded are) is located within the first MoRF region, conforming the intrinsic propensity of this motif for binding to specific partners. Finally, Figure 1C represents the first 50 residues of the mutated HEB and clearly shows that the L21P mutation noticeably increases the intrinsic disorder propensity of PCET.
12
Journal Pre-proof To elucidate the secondary structure of the PCET motif of the AD1 domain of HEB (a homologous sequence of E2A-PBX1) in solution, we carried out far-UV circular dichroism (CD) experiments at pH 7.0 and 25°C. The far-UV CD spectrum of the conserved region (residues 11-28) of AD1 domain of E-protein HEB reported in Figure 2A shows absence of ordered secondary structure, confirming that PCET motif of AD1 domain of E-proteins is an IDPR. Furthermore, Figure 2A shows that the L21P mutation has a minimal effect on the
of
structure of this peptide.
ro
In order to determine which factors may cause the ordering of the PCET structure in its bound
-p
form, we have studied the structure of this peptide in various solvents. First, we investigated the
re
spectral characteristics of PCET peptide in solutions with different pH. Results are summarized
lP
in Figure 2B illustrates that the structure of PCET motif was practically the same in solutions with acidic, neutral, and alkaline pH, indicating the independence of PCET conformation on
na
environmental pH.
ur
Next, we looked at the effect of temperature on the secondary structure of the wild type and L21P mutant PCET peptides. Figure 3 summarises the results of these studies and shows that
Jo
similar to many other IDPs [13], these peptides possess so-called “turned out” response to heat [82–84]. Figures 3A and 3B show that at low temperatures, both peptides possessed far-UV CD spectra typical of an unfolded peptide chain. As the temperature increased, the shape of spectra changed, reflecting the temperature-induced formation of transient structure content as documented in other IDPs representing the redistribution of statistical coils [85]. Note that these temperature-induced changes in secondary structure of these peptides were completely reversible. This temperature-induced changes of the wild type and L21P mutant PCET peptides is further illustrated by Figures 3C and 3D, which represent dependences of []222 and []198 of 13
Journal Pre-proof both peptides on temperature. Finally, Figures 3E and 3F shows that temperature-induced hydrodynamic changes in the structures of these peptides can be described in terms of response for temperature sensitivity generally correlated to the temperature induced backbone conformations in polypropylene II and -helix as reported by English and co-workers, since the corresponding phase diagrams represent mostly linear dependences [86]. The secondary structure analysis of CD spectra using K2D3 software had shown the inertness of both wild type and L21P
of
peptide. The structure-forming effects of the elevated temperatures may be attributed to the
ro
stronger hydrophobic interaction at higher temperatures, which may be the major driving force
-p
for transient conformational changes.
re
Membrane–protein interfaces are known to stabilize inter-residue hydrogen bond networks of
lP
proteins by the exclusion of the bulk solvent and reducing the effective dielectric constant [87]. Alcohols, by decreasing the dielectric constant around the peptide/protein, favor the formation
na
of inter-residue hydrogen bonds, which stabilize secondary structure elements, such as α-
ur
helices [88]. Based on this analogy, alcohols such as TFE or HFIP are referred to as membrane-mimetic solvents for proteins. Based on these premises, we analyzed the effects of
Jo
different alcohols (such as ethanol, TFE, and HFIP) on secondary structure of the wild type and L21P mutant of the PCET peptide. Results of these studies are shown in Figures 4-6 and clearly illustrate that the addition of any of the alcohols caused the noticeable increase in the ordered secondary structure content manifested by a decrease in the minimum at 198 nm accompanied by an increase in negative mean residual ellipticity around 222 nm. Importantly, these structural changes observed by CD were completely reversible and were independent of protein concentration (at least in the range of 0.1–2.5 mg/ml), indicating that the increase in structure of wild type PCET domain induced by alcohols represents an intramolecular process
14
Journal Pre-proof and not self-association. Since pronounced helical structure was induced by all three alcohols in the wild type PCET peptide, one can conclude that this peptide possesses profound helical propensity that can be stabilized by alcohol-induced dehydration of protein causing the formation of intramolecular hydrogen bonds. The helical propensity of wild type HEB PCET peptide in aqueous TFE solutions has been quantified by the analysis of induced secondary structure elements from CD spectra using K2D3 software (as shown in supplementary Figure
of
S1). The total helical propensity of PCET motif was found to be ~77% at a TFE concentration
ro
of 90% (Supplementary Table T1). This is in agreement with the previous studies on the
-p
modulation of secondary structure of peptides derived from globular proteins by alcohols,
re
where it was shown that the helical peptides have a significant intrinsic tendency to adopt their
lP
native helical structure in aqueous alcohol solutions [89]. On the contrary, similar analysis conducted for the L21P mutant form in presence of ethanol
na
and fluoroalcohols revealed that the increase in the concentration of all three alcohols had a minor effect on the structural properties of this peptide (see plots B, D, and F in Figures 4-6).
ur
The analysis of percent helicity of L21P from CD spectra using K2D3 has shown very minor
Jo
changes at higher concentrations of TFE (as shown in Supplementary Figure S2). These observations indicate that the intrinsic helical propensity of the PCET peptide is dramatically distorted by the L21P mutation. We have also analyzed the conformational changes in WT peptide in the presence of neutral liposomes (DOPC) using far-UV CD spectroscopy. However, we have not found any significant effects of DOPC in the far-UV CD spectra of the peptide up to a concentration of 5 mM (see in Figure 7). At higher concentrations, DOPC led to the noticeable increase in the high-tension (HT) voltage, resulting in the dramatic increase in the spectral noise.
15
Journal Pre-proof Biological macromolecules are known to function in a cell under conditions of macromolecular crowding characterized by limited free volume. It is believed that this macromolecular crowding might limit the number of conformations that the polypeptide chain can adopt, which, in turn, can cause both the folding of the polypeptide chain into a more compact structure, and the misfolding of the proteins and their aggregation. The macromolecular crowding conditions in vitro are usually modeled using the highly concentrated solutions of inert polymers of different
of
molecular weights, such as polyethylene glycol (PEG), Dextran, Ficoll, etc. Due to the mutual
ro
impermeability of the dissolved macromolecules under such conditions, the steric interactions
-p
between the biological object under study and the solute molecules increase substantially. We
re
investigated the effect of different concentrations of PEGs with different molecular weights on the secondary structure of PCET peptide. Results are summarised in Figure 8, which clearly
lP
shows that neither the concentration nor the molecular mass of PEG used in this study had a
na
significant effect on the peptide structure. This indicates that the decrease in the accessible volume does not cause noticeable ordering of the PCET peptide.
ur
Next, we analyzed the effects of various osmolytes (sarcosine, taurine, and TMAO) on the
Jo
structure of the PCET peptide and its L21P mutant. This analysis revealed that although structure of this domain was not affected by sarcosine and taurine, the increase in the TMAO concentration promoted a significant ordering of the wild type PCET (Figure 9). TMAO is known as one of the stronger osmolytes that can induce folding of various proteins [90]. It is believed that the effect of TMAO on the structure and stability of proteins is determined by the osmophobic effect; i.e., the preferential exclusion of water molecules from the near-surface layer of the protein [91–93]. On the contrary, none of the osmolytes had noticeable effects on the highly disordered structure of the L21P mutant (data not shown).
16
Journal Pre-proof The 2D NMR NOESY spectra of the wild type and mutant E2A peptides obtained in aqueous solution (5% D2O) did not indicate the presence of secondary structure. However, in the presence of 20% HFIP/water mixtures, the increase in the number and the distribution of new correlations indicated that significant secondary structure had been induced. The overlay of the 2D NMR 1H1
H NOESY spectra measured for the wild-type peptide in the aqueous and 20% HFIP solution
was used to qualitatively identify changes in the content of secondary structure elements in the
of
peptide after the addition of helix-stabilizing alcohol (HFIP) as shown in Figure 10. Similarly,
ro
the overlay of the 2D NMR 1H-1H NOESY spectra measured for the mutant peptide L21P in the
-p
aqueous and 20% HFIP solutions was plotted as shown in Figure 11 to identify the induction of
re
helical structure, which was not prominently visible in mutant peptide. The analysis of NH-NH connectivity regions (Figures 10C and 11A) has shown the decreased HFIP-induced helicity in
lP
the mutant PCET peptide as compared to the wild type peptide. Also, the low NH-H cross-
na
peaks (as shown in Figures 10C and 11C) have represented the decrease in intramolecular Hbonding in L21P peptide in comparison with the wild type PCET motif. These results correlate
ur
very well with the Circular Dichroism data obtained for these peptides in the presence of helix-
Jo
inducing solvent conditions. Hence, the mutant (L20P) is capable of breaking the helix propensity of PCET motif of E proteins and thereby can decrease the efficiency of the E2A-KIX interactions.
4. DISCUSSION A highly conserved 17 residues N-terminal region (PCET motif) of transactivation domain AD1 is present in E-proteins. It consists of two overlapping protein recognition sites, the LXXLL
17
Journal Pre-proof (representative of the ϕxxϕϕ motif) and LDFS motifs, identified to be responsible for interaction with transcriptional activators. Previous reports have indicated the specific interactions between KIX domain (residues 589-685) of CBP/p300 and PCET motif (residues 11-28) of the AD1 domain of E-proteins are related to the regulation of B cell lymphopoiesis and are essential for leukemia induction by E2A-PBX1. In structural studies on the PCETHEB bound to KIX domain of CBP, a set of residues at the KIX/PCET interface important for interactions has been
of
established. These included the hydrophobic residues L17, L20, and L21 from the LXXLL motif
ro
and the F23 from the overlapping LDFS site. The L21 residue has been found to be associated
-p
with the hydrophobic pocket on KIX surface formed by I611, I660, L664, and K667 residues.
re
Analysis of the primary structure of the AD1-PCET motif indicates a high probability of the α-
lP
helical structure formation, which would suggest the presence of a high content of α-helical regions in the PCET structure [10]. However, computational analysis of the full-length HEB
na
protein by a set of commonly used disorder predictors suggested that this protein is expected to be mostly disordered. Furthermore, the PCET motif, being generally disordered, is located within
ur
the MoRF region, which supposes to fold into an ordered conformation at interaction with
Jo
binding partner. In agreement with the results of these computational studies, the far-UV CD spectrum and the 2D NMR 1H-1H NOESY spectra of the peptide corresponding to the PCET motif in aqueous solutions have all characteristics of highly disordered polypeptide chain, and that there is practically no α-helical structure of this region from the transactivation domain of HEB peptide. These data are in accordance with the results of previous studies of the PCET motif using 1H-15N HSQC experiments, where this peptide was shown to display poor dispersion of the HN resonances, thereby indicating its intrinsically disordered nature [2]. Since PCET motif has a predominant α-helical structure in its complex with KIX domain, this observation suggests
18
Journal Pre-proof that PCET acts as a molecular recognition feature (MoRF), which is region of intrinsically disordered protein that is unstructured in the unbound form but can gain folded conformation at interaction with binding partners [29]. Analysis of the structure of the KIX complexed with AD1 (PDB ID: 2kwf) indicated that in its bound state, the PCET motif of the AD1 domain is characterized by low solvent accessibility. Therefore, based on all these observations, it can be hypothesized that the acquisition of the α-helical structure by PCET upon its binding to KIX is
of
driven by the displacement of water from the close environment of the PCET motif.
ro
To test this idea, we investigated the effect of various solvent conditions on structural properties
-p
of the peptide corresponding to the wild PCET motif, as well as to the L21P mutant form of this
re
region. It is known that the replacement of leucine residue at the junction of the Leu-x-x-Leu-
lP
Leu and Leu-Asp-Phe-Ser motifs of PCET weakens the interaction of E2A-PBX1 with the KIX domain and affects induction of bone marrow leukemia in mice in vitro [12]. Our analyses
na
revealed that the environmental conditions causing dehydration of the polypeptide chain (such as alcohols and osmolyte TMAO) were able, in a cosolute concentration-dependent manner, to
ur
induce noticeable structural transformation of the wild type PCET peptide reflective of the
Jo
pronounced α-helical structure formation. However, neither change in pH, or addition of sarcosine and taurine osmolytes, nor mimicking conditions of macromolecular crowding by inclusion to the solution of high concentrations of polyethylene glycol (PEG) of different molecular masses had comparable structure-forming effects on the wild type PCET peptide. On the other hand, the structure of the L21P PCET mutant was much less sensitive to changes in solutions, and this mutant peptide preserved mostly disordered structure in the presence of alcohols and TMAO.
19
Journal Pre-proof Therefore, our analyses revealed that among various solvent conditions, noticeable structural changes in the wild type PCET peptide were induced by ethanol, TFE, and HFIP, which are cosolvents that stabilize the α-helical structure in peptides and proteins with intrinsic helical propensity, and by the osmolyte TMAO that contributes to the dehydration of proteins. The observed transition of the mostly disordered peptide to a conformation with pronounced α-helical structure in alcohol solutions and TMAO suggested that it is the dehydration of this peptide that
of
contributes to the ordering of its structure. Also, the inertness of the L21P mutant of the PCET
ro
peptide to structural changes in presence of alcohols and TMAO indicated that this amino acid
-p
substitution was sufficient to dramatically diminish the intrinsic helical propensity of this region,
ACKNOWLEDGEMENTS
lP
re
probably due to the fact that proline can serve as a helix breaker.
na
This work was supported by grants from the Department of Science and Technology (DST)
ur
India, Grant number: INT/RUS/RFBR/P-255 (to R.G.) and from Russian Foundation for Basic
Jo
Research, Grant number: 17-54-45169 IND (to I.M.K.) and INT/RUS/RFBR/P-255 (R.G.). We thank Dr. Neel Sarovar Bhavesh, Group Leader at the International Center for Genetic Engineering and Biotechnology (ICGEB), for providing access to the NMR spectrometer at ICGEB New Delhi, which is procured and supported by the Department of Biotechnology (DBT), Government of India and ICGEB core funds.
AUTHOR’S CONTRIBUTION
20
Journal Pre-proof RG: conception and design. RG and IMK: study supervision. NS, AVF, and SAS: performed experiment with wild type peptide. VNU: conducted computational analyses. NS: performed experiments of L21P mutant of peptide. NS, AVF, VNU, IMK, and RG: data analysis and writing of the manuscript. OGS, SAS, and KKT: data analysis and review of manuscript.
of
COMPETING INTERESTS
ro
The authors declare no competing interests and do not include any financial and non-financial
re
-p
interests.
S.E. Aspland, H.H. Bendall, C. Murre, The role of E2A-PBX1 in leukemogenesis,
na
[1]
lP
REFERENCES
C.M. Denis, S. Chitayat, M.J. Plevin, F. Wang, P. Thompson, S. Liu, H.L. Spencer, M.
Jo
[2]
ur
Oncogene. 20 (2001) 5708–5717. doi:10.1038/sj.onc.1204592.
Ikura, D.P. LeBrun, S.P. Smith, Structural basis of CBP/p300 recruitment in leukemia induction by E2A-PBX1., Blood. 120 (2012) 3968–77. doi:10.1182/blood-2012-02411397. [3]
E. De Braekeleer, A. Basinko, N. Douet-Guilbert, F. Morel, M.-J. Le Bris, C. Berthou, P. Morice, C. Férec, M. De Braekeleer, Cytogenetics in pre-B and B-cell acute lymphoblastic leukemia: a study of 208 patients diagnosed between 1981 and 2008., Cancer Genet. Cytogenet. 200 (2010) 8–15. doi:10.1016/j.cancergencyto.2010.03.004.
21
Journal Pre-proof [4]
D.P. Lebrun, E2A basic helix-loop-helix transcription factors in humna leukemia, Front. Biosci. 8 (2003) 206–222. doi:10.1093/nar/24.14.2813.
[5]
M.J. Plevin, M.M. Mills, M. Ikura, The LxxLL motif: A multifunctional binding sequence in transcriptional regulation, Trends Biochem. Sci. 30 (2005) 66–69. doi:10.1016/j.tibs.2004.12.001. M.E. Massari, P.A. Grant, M.G. Pray-Grant, S.L. Berger, J.L. Workman, C. Murre, A
of
[6]
conserved motif present in a class of helix-loop-helix proteins activates transcription by
ro
direct recruitment of the SAGA complex, Mol. Cell. 4 (1999) 63–73. doi:10.1016/S1097-
N. Vo, R.H. Goodman, CREB-binding Protein and p300 in Transcriptional Regulation, J.
re
[7]
-p
2765(00)80188-4.
S. Park, W. Chen, T. Cierpicki, M. Tonelli, X. Cai, N.A. Speck, J.H. Bushweller,
na
[8]
lP
Biol. Chem. 276 (2001) 13505–13508. doi:10.1074/jbc.R000025200.
Structure of the AML1-ETO eTAFH domain-HEB peptide complex and its contribution to
J. Zhang, M. Kalkum, S. Yamamura, B.T. Chait, R.G. Roeder, E protein silencing by the
Jo
[9]
ur
AML1-ETO activity, Blood. 113 (2009) 3558–3567. doi:10.1182/blood-2008-06-161307.
leukemogenic AML1-ETO fusion protein, Science (80-. ). 305 (2004) 1286–1289. doi:10.1126/science.1097937. [10] M.E. Massari, P.A. Jennings, C. Murre, The AD1 transactivation domain of E2A contains a highly conserved helix which is required for its activity in both Saccharomyces cerevisiae and mammalian cells., Mol. Cell. Biol. 16 (2015) 121–129. doi:10.1128/mcb.16.1.121. [11] R. Bayly, L. Chuen, R.A. Currie, B.D. Hyndman, R. Casselman, G.A. Blobel, D.P. 22
Journal Pre-proof LeBrun, E2A-PBX1 interacts directly with the KIX domain of CBP/p300 in the induction of proliferation in primary hematopoietic cells., J. Biol. Chem. 279 (2004) 55362–71. doi:10.1074/jbc.M408654200. [12] R. Bayly, T. Murase, B.D. Hyndman, R. Savage, S. Nurmohamed, K. Munro, R. Casselman, S.P. Smith, D.P. LeBrun, Critical role for a single leucine residue in leukemia induction by E2A-PBX1., Mol. Cell. Biol. 26 (2006) 6442–52. doi:10.1128/MCB.02025-
of
05.
ro
[13] A.K. Dunker, E. Garner, S. Guilliot, P. Romero, K. Albrecht, J. Hart, Z. Obradovic, C.
-p
Kissinger, J.E. Villafranca, Protein disorder and the evolution of molecular recognition:
re
theory, predictions and observations., Pac. Symp. Biocomput. (1998) 473–84.
lP
http://www.ncbi.nlm.nih.gov/pubmed/9697205.
[14] A.K. Dunker, J.D. Lawson, C.J. Brown, R.M. Williams, P. Romero, J.S. Oh, C.J. Oldfield,
na
A.M. Campen, C.M. Ratliff, K.W. Hipps, J. Ausio, M.S. Nissen, R. Reeves, C. Kang, C.R.
ur
Kissinger, R.W. Bailey, M.D. Griswold, W. Chiu, E.C. Garner, Z. Obradovic, Intrinsically
Jo
disordered protein, J. Mol. Graph. Model. 19 (2001) 26–59. doi:10.1016/S10933263(00)00138-8.
[15] P. Tompa, Intrinsically unstructured proteins., Trends Biochem. Sci. 27 (2002) 527–33. doi:10.1016/S0968-0004(02)02169-2. [16] V.N. Uversky, What does it mean to be natively unfolded?, Eur. J. Biochem. 269 (2002) 2–12. doi:10.1046/j.0014-2956.2001.02649.x. [17] V.N. Uversky, A.K. Dunker, Understanding protein non-folding, Biochim. Biophys. Acta - Proteins Proteomics. 1804 (2010) 1231–1264. doi:10.1016/j.bbapap.2010.01.017.
23
Journal Pre-proof [18] V.N. Uversky, A.K. Dunker, The case for intrinsically disordered proteins playing contributory roles in molecular recognition without a stable 3D structure., F1000 Biol. Rep. 5 (2013) 1. doi:10.3410/B5-1. [19] V.N. Uversky, J.R. Gillespie, A.L. Fink, Why are “natively unfolded” proteins unstructured under physiologic conditions?, Proteins Struct. Funct. Genet. 41 (2000) 415–
of
427. doi:10.1002/1097-0134(20001115)41:3<415::AID-PROT130>3.0.CO;2-7. [20] P.E. Wright, H.J. Dyson, Intrinsically unstructured proteins: re-assessing the protein
ro
structure-function paradigm, J. Mol. Biol. 293 (1999) 321–331.
-p
doi:10.1006/jmbi.1999.3110.
re
[21] P. Radivojac, L.M. Iakoucheva, C.J. Oldfield, Z. Obradovic, V.N. Uversky, A.K. Dunker,
lP
Intrinsic Disorder and Functional Proteomics, Biophys. J. 92 (2007) 1439–1456.
na
doi:10.1529/biophysj.106.094045.
[22] R. Giri, D. Kumar, N. Sharma, V.N. Uversky, Intrinsically Disordered Side of the Zika
ur
Virus Proteome, Front. Cell. Infect. Microbiol. 6 (2016) 144.
Jo
doi:10.3389/fcimb.2016.00144. [23] P.M. Mishra, V.N. Uversky, R. Giri, Molecular Recognition Features in Zika Virus Proteome, J. Mol. Biol. (2017). doi:10.1016/j.jmb.2017.10.018. [24] D. Kumar, N. Sharma, R. Giri, Therapeutic Interventions of Cancers Using Intrinsically Disordered Proteins as Drug Targets: c-Myc as Model System, Cancer Inform. 16 (2017) 1176935117699408. doi:10.1177/1176935117699408. [25] N. Sharma, A. Murali, S.K. Singh, R. Giri, Epigallocatechin gallate, an active green tea compound inhibits the Zika virus entry into host cells via binding the envelope protein, 24
Journal Pre-proof Int. J. Biol. Macromol. 104 (2017) 1046–1054. doi:10.1016/j.ijbiomac.2017.06.105. [26] A. Singh, A. Kumar, R. Yadav, V.N. Uversky, R. Giri, Deciphering the dark proteome of Chikungunya virus, Sci. Rep. 8 (2018) 5822. doi:10.1038/s41598-018-23969-0. [27] N. Garg, P. Kumar, K. Gadhave, R. Giri, The dark proteome of cancer: Intrinsic disorderedness and functionality of HIF-1α along with its interacting proteins, in: Prog.
of
Mol. Biol. Transl. Sci., 2019: pp. 371–403. doi:10.1016/bs.pmbts.2019.05.006.
ro
[28] M. Aarthy, D. Kumar, R. Giri, S.K. Singh, E7 oncoprotein of human papillomavirus: Structural dynamics and inhibitor screening study., Gene. 658 (2018) 159–177.
-p
doi:10.1016/j.gene.2018.03.026.
re
[29] C.J. Oldfield, Y. Cheng, M.S. Cortese, P. Romero, V.N. Uversky, A.K. Dunker, Coupled
lP
Folding and Binding with α-Helix-Forming Molecular Recognition Elements †,
na
Biochemistry. 44 (2005) 12454–12470. doi:10.1021/bi050736e. [30] V.N. Uversky, A decade and a half of protein intrinsic disorder: Biology still waits for
ur
physics, Protein Sci. 22 (2013) 693–724. doi:10.1002/pro.2261.
Jo
[31] V.N. Uversky, Unusual biophysics of intrinsically disordered proteins, Biochim. Biophys. Acta - Proteins Proteomics. 1834 (2013) 932–951. doi:10.1016/j.bbapap.2012.12.008. [32] S. Gianni, A. Morrone, R. Giri, M. Brunori, Biochemical and Biophysical Research Communications A folding-after-binding mechanism describes the recognition between the transactivation domain of c-Myb and the KIX domain of the CREB-binding protein, Biochem. Biophys. Res. Commun. 428 (2012) 205–209. doi:10.1016/j.bbrc.2012.09.112. [33] C.J. Oldfield, A.K. Dunker, Intrinsically Disordered Proteins and Intrinsically Disordered Protein Regions, Annu. Rev. Biochem. 83 (2014) 553–584. doi:10.1146/annurev25
Journal Pre-proof biochem-072711-164947. [34] A. Toto, C. Camilloni, R. Giri, M. Brunori, M. Vendruscolo, S. Gianni, Molecular Recognition by Templated Folding of an Intrinsically Disordered Protein, Sci. Rep. 6 (2016) 21994. doi:10.1038/srep21994. [35] A. Toto, R. Giri, M. Brunori, S. Gianni, The mechanism of binding of the KIX domain to
of
the mixed lineage leukemia protein and its allosteric role in the recognition of c-Myb,
ro
Protein Sci. 23 (2014) 962–969. doi:10.1002/pro.2480.
[36] R. Giri, A. Morrone, A. Toto, M. Brunori, S. Gianni, Structure of the transition state for
-p
the binding of c-Myb and KIX highlights an unexpected order for a disordered system,
re
Proc. Natl. Acad. Sci. 110 (2013) 14942–14947. doi:10.1073/pnas.1307337110.
lP
[37] S.L. Shammas, M.D. Crabtree, L. Dahal, B.I.M. Wicky, J. Clarke, Insights into Coupled
na
Folding and Binding Mechanisms from Kinetic Studies., J. Biol. Chem. 291 (2016) 6689– 95. doi:10.1074/jbc.R115.692715.
ur
[38] G. Rivas, F. Ferrone, J. Herzfeld, Life in a crowded world., EMBO Rep. 5 (2004) 23–7.
Jo
doi:10.1038/sj.embor.7400056. [39] B. van den Berg, R.J. Ellis, C.M. Dobson, Effects of macromolecular crowding on protein folding and aggregation., EMBO J. 18 (1999) 6927–33. doi:10.1093/emboj/18.24.6927. [40] S.B. Zimmerman, S.O. Trach, Estimation of macromolecule concentrations and excluded volume effects for the cytoplasm of Escherichia coli, J. Mol. Biol. 222 (1991) 599–620. doi:10.1016/0022-2836(91)90499-V. [41] R.J. Ellis, A.P. Minton, Join the crowd, Nature. 425 (2003) 27–28. doi:10.1038/425027a. [42] R.J. Ellis, Macromolecular crowding: Obvious but underappreciated, Trends Biochem. 26
Journal Pre-proof Sci. 26 (2001) 597–604. doi:10.1016/S0968-0004(01)01938-7. [43] A.B. Fulton, How crowded is the cytoplasm?, Cell. 30 (1982) 345–347. doi:10.1016/00928674(82)90231-8. [44] A.P. Minton, Influence of excluded volume upon macromolecular structure and associations in ‘crowded” media, Curr. Opin. Biotechnol. 8 (1997) 65–69.
of
doi:10.1016/S0958-1669(97)80159-0.
ro
[45] A.P. Minton, Protein folding: Thickening the broth, Curr. Biol. 10 (2000) R97–R99.
-p
doi:10.1016/S0960-9822(00)00301-8.
[46] S.B. Zimmerman, A.P. Minton, Macromolecular Crowding: Biochemical, Biophysical,
re
and Physiological Consequences, Annu. Rev. Biophys. Biomol. Struct. 22 (1993) 27–65.
lP
doi:10.1146/annurev.bb.22.060193.000331.
na
[47] I. Kuznetsova, B. Zaslavsky, L. Breydo, K. Turoverov, V. Uversky, Beyond the Excluded Volume Effects: Mechanistic Complexity of the Crowded Milieu, Molecules. 20 (2015)
ur
1377–1409. doi:10.3390/molecules20011377.
Jo
[48] I.M. Kuznetsova, K.K. Turoverov, V.N. Uversky, What macromolecular crowding can do to a protein., Int. J. Mol. Sci. 15 (2014) 23090–140. doi:10.3390/ijms151223090. [49] N.A. Chebotareva, Effect of molecular crowding on the enzymes of glycogenolysis, Biochem. 72 (2007) 1478–1490. doi:10.1134/S0006297907130056. [50] Z. Ignatova, L.M. Gierasch, Effects of Osmolytes on Protein Folding and Aggregation in Cells, in: Methods Enzymol., 2007: pp. 355–372. doi:10.1016/S0076-6879(07)28021-8. [51] R.E. MacMillen, A.K. Lee, Australian desert mice: Independence of exogenous water, Science (80-. ). 158 (1967) 383–385. doi:10.1126/science.158.3799.383. 27
Journal Pre-proof [52] P.H. Yancey, Organic osmolytes as compatible, metabolic and counteracting cytoprotectants in high osmolarity and other stresses, J. Exp. Biol. 208 (2005) 2819–2830. doi:10.1242/jeb.01730. [53] P. Attri, P. Venkatesu, M.-J. Lee, Influence of Osmolytes and Denaturants on the Structure and Enzyme Activity of α-Chymotrypsin, J. Phys. Chem. B. 114 (2010) 1471–
of
1478. doi:10.1021/jp9092332. [54] N.A. Chebotareva, B.I. Kurganov, S.E. Harding, D.J. Winzor, Effect of osmolytes on the
ro
interaction of flavin adenine dinucleotide with muscle glycogen phosphorylase b.,
-p
Biophys. Chem. 113 (2005) 61–6. doi:10.1016/j.bpc.2004.07.040.
re
[55] N.-Y. Fang, J. Lee, S.-J. Yin, W. Wang, Z.-J. Wang, J.-M. Yang, G.-Y. Qian, Y.-X. Si,
lP
Y.-D. Park, Effects of osmolytes on arginine kinase from Euphausia superba: A study on thermal denaturation and aggregation, Process Biochem. 49 (2014) 936–947.
na
doi:10.1016/J.PROCBIO.2014.03.019.
ur
[56] J.K. Kaushik, R. Bhat, Why is trehalose an exceptional protein stabilizer? An analysis of
Jo
the thermal stability of proteins in the presence of the compatible osmolyte trehalose., J. Biol. Chem. 278 (2003) 26458–65. doi:10.1074/jbc.M300815200. [57] E.P. O’Brien, G. Ziv, G. Haran, B.R. Brooks, D. Thirumalai, Effects of denaturants and osmolytes on proteins are accurately predicted by the molecular transfer model., Proc. Natl. Acad. Sci. U. S. A. 105 (2008) 13403–8. doi:10.1073/pnas.0802113105. [58] L.R. Singh, N.K. Poddar, T.A. Dar, S. Rahman, R. Kumar, F. Ahmad, Forty years of research on osmolyte-induced protein folding and stability, J. Iran. Chem. Soc. 8 (2011) 1–23. doi:10.1007/BF03246197.
28
Journal Pre-proof [59] P. Romero, Z. Obradovic, X. Li, E.C. Garner, C.J. Brown, A.K. Dunker, Sequence complexity of disordered protein., Proteins. 42 (2001) 38–48. [60] K. Peng, P. Radivojac, S. Vucetic, A.K. Dunker, Z. Obradovic, Length-dependent prediction of protein in intrinsic disorder, BMC Bioinformatics. 7 (2006) 208. doi:10.1186/1471-2105-7-208.
of
[61] Z. Obradovic, K. Peng, S. Vucetic, P. Radivojac, C.J. Brown, A.K. Dunker, Predicting Intrinsic Disorder From Amino Acid Sequence, in: Proteins Struct. Funct. Genet., 2003:
ro
pp. 566–572. doi:10.1002/prot.10532.
-p
[62] B. Xue, R.L. Dunbrack, R.W. Williams, A.K. Dunker, V.N. Uversky, PONDR-FIT: A
re
meta-predictor of intrinsically disordered amino acids, Biochim. Biophys. Acta - Proteins
lP
Proteomics. 1804 (2010) 996–1010. doi:10.1016/j.bbapap.2010.01.011.
na
[63] Z. Dosztányi, V. Csizmok, P. Tompa, I. Simon, IUPred: Web server for the prediction of intrinsically unstructured regions of proteins based on estimated energy content,
ur
Bioinformatics. 21 (2005) 3433–3434. doi:10.1093/bioinformatics/bti541.
Jo
[64] Z. Dosztányi, V. Csizmók, P. Tompa, I. Simon, The pairwise energy content estimated from amino acid composition discriminates between folded and intrinsically unstructured proteins, J. Mol. Biol. 347 (2005) 827–839. doi:10.1016/j.jmb.2005.01.071. [65] B. Mészáros, G. Erdös, Z. Dosztányi, IUPred2A: Context-dependent prediction of protein disorder as a function of redox state and protein binding, Nucleic Acids Res. 46 (2018) W329–W337. doi:10.1093/nar/gky384. [66] S.E. Permyakov, A.A. Vologzhannikova, P.A. Khorn, M.P. Shevelyova, A.S. Kazakov, V.I. Emelyanenko, A.I. Denesyuk, K. Denessiouk, V.N. Uversky, E.A. Permyakov, 29
Journal Pre-proof Comprehensive analysis of the roles of ‘black’ and ‘gray’ clusters in structure and function of rat β-parvalbumin, Cell Calcium. 75 (2018) 64–78. doi:10.1016/j.ceca.2018.08.005. [67] E.I. Deryusheva, A.I. Denesyuk, K. Denessiouk, V.N. Uversky, S.E. Permyakov, E.A. Permyakov, On the relationship between the conserved ‘black’ and ‘gray’ structural clusters and intrinsic disorder in parvalbumins, Int. J. Biol. Macromol. 120 (2018) 1055–
of
1062. doi:10.1016/j.ijbiomac.2018.08.183.
ro
[68] I. Walsh, M. Giollo, T. Di Domenico, C. Ferrari, O. Zimmermann, S.C.E. Tosatto,
-p
Comprehensive large-scale assessment of intrinsic protein disorder, Bioinformatics. 31
re
(2015) 201–208. doi:10.1093/bioinformatics/btu625.
lP
[69] X. Fan, L. Kurgan, Accurate prediction of disorder in protein chains with a comprehensive and empirically designed consensus, J. Biomol. Struct. Dyn. 32 (2014) 448–464.
na
doi:10.1080/07391102.2013.775969.
ur
[70] Z. PENG, L. KURGAN, ON THE COMPLEMENTARITY OF THE CONSENSUS-
Jo
BASED DISORDER PREDICTION, in: Biocomput. 2012, WORLD SCIENTIFIC, 2011: pp. 176–187. doi:10.1142/9789814366496_0017. [71] T. Support, Preparing Large, Unilamellar Vesicles by Extrusion (LUVET) | Avanti Polar Lipids, (2018) 10–11. https://avantilipids.com/tech-support/liposome-preparation/luvet (accessed November 1, 2019). [72] E. A. Burstein, Intrinsic protein fluorescence: origin and applications, In Advances in Science and Technology, Series Biophysics 7, VINITI: Moscow, Russia, (1976). [73] I.M. Kuznetsova, K.K. Turoverov, V.N. Uversky, Use of the phase diagram method to 30
Journal Pre-proof analyze the protein unfolding-refolding reactions: Fishing out the “invisible” intermediates, J. Proteome Res. 3 (2004) 485–494. doi:10.1021/pr034094y. [74] A. Mohan, C.J. Oldfield, P. Radivojac, V. Vacic, M.S. Cortese, A.K. Dunker, V.N. Uversky, Analysis of Molecular Recognition Features (MoRFs), J. Mol. Biol. 362 (2006) 1043–1059. doi:10.1016/j.jmb.2006.07.087.
of
[75] V. Vacic, C.J. Oldfield, A. Mohan, P. Radivojac, M.S. Cortese, V.N. Uversky, A.K. Dunker, Characterization of molecular recognition features, MoRFs, and their binding
ro
partners, J. Proteome Res. 6 (2007) 2351–2366. doi:10.1021/pr0701411.
-p
[76] Y. Cheng, C.J. Oldfield, J. Meng, P. Romero, V.N. Uversky, A.K. Dunker, Mining α-
re
helix-forming molecular recognition features with cross species sequence alignments,
lP
Biochemistry. 46 (2007) 13468–13477. doi:10.1021/bi7012273.
na
[77] B. Mészáros, I. Simon, Z. Dosztányi, Prediction of protein binding regions in disordered proteins, PLoS Comput. Biol. 5 (2009) e1000376. doi:10.1371/journal.pcbi.1000376.
ur
[78] Z. Dosztanyi, B. Meszaros, I. Simon, ANCHOR: web server for predicting protein binding
Jo
regions in disordered proteins, Bioinformatics. 25 (2009) 2745–2746. doi:10.1093/bioinformatics/btp518. [79] F.M. Disfani, W.L. Hsu, M.J. Mizianty, C.J. Oldfield, B. Xue, A. Keith Dunker, V.N. Uversky, L. Kurgan, MoRFpred, a computational tool for sequence-based prediction and characterization of short disorder-to-order transitioning binding regions in proteins, Bioinformatics. 28 (2012) i75-83. doi:10.1093/bioinformatics/bts209. [80] R. van der Lee, M. Buljan, B. Lang, R.J. Weatheritt, G.W. Daughdrill, A.K. Dunker, M. Fuxreiter, J. Gough, J. Gsponer, D.T. Jones, P.M. Kim, R.W. Kriwacki, C.J. Oldfield, R. 31
Journal Pre-proof V Pappu, P. Tompa, V.N. Uversky, P.E. Wright, M.M. Babu, Classification of Intrinsically Disordered Regions and Proteins, Chem. Rev. 114 (2014) 6589–6631. doi:10.1021/cr400525m. [81] V.N. Uversky, C.J. Oldfield, A.K. Dunker, Showing your ID: Intrinsic disorder as an ID for recognition, regulation and cell signaling, J. Mol. Recognit. 18 (2005) 343–384.
of
doi:10.1002/jmr.747. [82] T.D. Kim, H.J. Ryu, H. Il Cho, C.H. Yang, J. Kim, Thermal behavior of proteins: Heat-
-p
(2000) 14839–14846. doi:10.1021/bi001441y.
ro
resistant proteins and their heat-induced secondary structural changes, Biochemistry. 39
re
[83] D.E. Timm, P. Vissavajjhala, A.H. Ross, K.E. Neet, Spectroscopic and chemical studies
lP
of the interaction between nerve growth factor (NGF) and the extracellular domain of the low affinity NGF receptor, Protein Sci. 1 (1992) 1023–1031.
na
doi:10.1002/pro.5560010808.
ur
[84] V.N. Uversky, J. Li, A.L. Fink, Evidence for a Partially Folded Intermediate in α-
Jo
Synuclein Fibril Formation, J. Biol. Chem. 276 (2001) 10737–10744. doi:10.1074/jbc.M010907200. [85] M. Kjaergaard, A.B. Nørholm, R. Hendus-Altenburger, S.F. Pedersen, F.M. Poulsen, B.B. Kragelund, Temperature-dependent structural changes in intrinsically disordered proteins: Formation of α-helices or loss of polyproline II?, Protein Sci. 19 (2010) 1555–1564. doi:10.1002/pro.435. [86] L.R. English, A. Tischer, A.K. Demeler, B. Demeler, S.T. Whitten, Sequence Reversal Prevents Chain Collapse and Yields Heat-Sensitive Intrinsic Disorder, Biophys. J. 115
32
Journal Pre-proof (2018) 328–340. doi:10.1016/j.bpj.2018.06.006. [87] * D. E. Otzen, and P. Sehgal, L.W. Nesgaard, Alternative Membrane Protein Conformations in Alcohols†, (2007). doi:10.1021/BI700091R. [88] D. Roccatano, G. Colombo, M. Fioroni, A.E. Mark, Mechanism by which 2,2,2trifluoroethanol/water mixtures stabilize secondary-structure formation in peptides: a
of
molecular dynamics study., Proc. Natl. Acad. Sci. U. S. A. 99 (2002) 12179–12184.
ro
doi:10.1073/pnas.182199699.
[89] M. Ramírez-Alvarado, L. Serrano, F.J. Blanco, Conformational analysis of peptides
-p
corresponding to all the secondary structure elements of protein L B1 domain: secondary
re
structure propensities are not conserved in proteins with the same fold., Protein Sci. 6
lP
(1997) 162–74. doi:10.1002/pro.5560060119.
na
[90] I. Baskakov, D.W. Bolen, Forcing thermodynamically unfolded proteins to fold., J. Biol. Chem. 273 (1998) 4831–4. doi:10.1074/JBC.273.9.4831.
ur
[91] D. Aioanei, S. Lv, I. Tessari, A. Rampioni, L. Bubacco, H. Li, B. Samorì, M. Brucale,
Jo
Single-Molecule-Level Evidence for the Osmophobic Effect, Angew. Chemie Int. Ed. 50 (2011) 4394–4397. doi:10.1002/anie.201006714. [92] D. Aioanei, I. Tessari, L. Bubacco, B. Samorì, M. Brucale, Observing the osmophobic effect in action at the single molecule level, Proteins Struct. Funct. Bioinforma. 79 (2011) 2214–2223. doi:10.1002/prot.23045. [93] D.. Bolen, I. V Baskakov, The osmophobic effect: natural selection of a thermodynamic force in protein folding 1 1Edited by D. Draper, J. Mol. Biol. 310 (2001) 955–963. doi:10.1006/jmbi.2001.4819. 33
Journal Pre-proof
FIGURE LEGENDS Figure 1. Intrinsic disorder propensity of human HEB protein and its PCET motif. A. Disorder propensity of the full-length wild type human HEB protein (UniProt ID: Q99081). B. Zoomed-in view of the disorder profile of the N-terminal region (first 50 residues) of the wild
of
type human HEB protein. C. Zoomed-in view of the disorder profile of the N-terminal region
ro
(first 50 residues) of the L21P mutant form of human HEB protein. Disorder propensity was
-p
studied by PONDR® VLXT (black curves), PONDR® VL3 (red curves), PONDR® VSL2 (green curves), PONDR® FIT (pink curves), IUPred_short (yellow curves), and IUPred_long (blue
re
curves). Access to the per-residue disorder predictors of PONDR® family and to the UniProt2A
lP
disorder predictor was provided by the DisProt database (http://original.disprot.org/) and UniProt
na
platform (https://iupred2a.elte.hu/), respectively. Light pink shadow around PONDR® FIT curve shows error distribution. Mean disorder predisposition was calculated by averaging of all
ur
predictor-specific per-residue disorder profiles (bold, dashed, dark cyan curve). Light blue
Jo
shadow around the mean disorder curve shows the distribution of standard deviations. Gray shaded area in each plot represents position of the PCET motif.
Figure 2. Far UV-CD spectra of the peptides and effect of pH. A. Far UV-CD spectra of the peptide corresponding to the PCET motif of the AD1 domain of HEB E-protein. Data for the wild type (WT) and L21P mutant are shown by black and red lines, respectively. Measurements were conducted in buffer containing 10 mM potassium phosphate and 50 mM sodium sulphate, pH 7.4 at 25oC. B. Far UV-CD spectra of the wild type PECT 34
Journal Pre-proof peptide measured in solutions with different pH.
Figure 3. Effects of temperature on far-UV-CD spectra of the peptides. Effects of temperature on the wild type PCET peptide (A, C, and E) and its L21P mutant form (B, C, and F). Plots A and B represent far UV-CD spectra measured at temperatures changing
of
from 4 to 90oC at an interval of 5oC. Plots C and D show the temperature courses of the
ro
ellipticity at 198 nm (black) and 222 nm (red) with increasing temperature to 90oC. Plots E and F
-p
represent phase diagrams ([]222 vs. []198 dependences) to analyze the mechanism of changes in
lP
re
structural propensity of these peptides under increasing temperature.
Figure 4. Effects of ethanol on secondary structure of the wild type PCET peptide and its
na
L21P mutant.
ur
Effects of ethanol on the wild type PCET peptide (A, C, and E) and its L21P mutant (B, C, and F). Plots A and B represent far UV-CD spectra measured at ethanol concentration ranging from
Jo
0% to 90% at an interval of 5%. Plots C and D show the dependences of the ellipticity at 198 nm (black) and 222 nm (red) on increasing ethanol concentration to 90%. Plots E and F represent phase diagrams ([]222 vs. []198 dependences) to analyze the mechanism of changes in structural propensity of these peptides under increasing ethanol concentration.
Figure 5. Effects of TFE on secondary structure of the peptides.
35
Journal Pre-proof Effects of TFE on the wild type PCET peptide (A, C, and E) and its L21P mutant form (B, C, and F). Plots A and B represent far UV-CD spectra measured at TFE concentration ranging from 0% to 90% at an interval of 5%. Plots C and D show the dependences of the ellipticity at 198 nm (black) and 222 nm (red) on increasing TFE concentration to 90%. Plots E and F represent phase diagrams ([]222 vs. []198 dependences) to analyze the mechanism of changes in structural
ro
of
propensity of these peptides under increasing TFE concentration.
-p
Figure 6. Effects of HFIP on secondary structure of the peptides. Effects of HFIP on the wild type PCET peptide (A, C, and E) and its L21P mutant form (B, C,
re
and F). Plots A and B represent far UV-CD spectra measured at HFIP concentration ranging
lP
from 0% to 90% at an interval of 5%. Plots C and D show the dependences of the ellipticity at
na
198 nm (black) and 222 nm (red) on increasing HFIP concentration to 90%. Plots E and F represent phase diagrams ([]222 vs. []198 dependences) to analyze the mechanism of changes in
Jo
ur
structural propensity of these peptides under increasing HFIP concentration.
Figure 7. Effect of neutral liposomes (DOPC) at a concentration of 1mM, 2.5mM and 5mM on secondary structure of WT (HEB) peptide.
Figure 8. Effect of molecular crowder on secondary structure of peptides. A. Far-UV CD spectra of the PCET peptide in solutions containing different concentrations of PEG with the molecular weight of 600 Da. B. Far-UV CD spectra of the PCET peptide in solutions containing different concentration of PEG with the molecular weight of 4,000 Da. C. 36
Journal Pre-proof Far-UV CD spectra of the PCET peptide in solutions containing different concentration of PEG with the molecular weight of 12,000 Da. D. Far-UV CD spectra of the PCET peptide in solutions containing PEG of different molecular weights at concentration of 200 mg/ml.
Figure 9. Effect of osmolytes on far-UV-CD spectra of the peptides.
of
A. Far-UV CD spectra of the PCET peptide in the presence of different concentrations of
ro
sarcosine. B. Far-UV CD spectra of the PCET peptide in the presence of different concentrations of taurine. C. Far-UV CD spectra of the PCET peptide in the presence of different concentrations
-p
of TMAO. D. Dependences of the ellipticity at 222 nm in far UV CD spectra of the PCET
lP
re
peptide on TMAO concentration.
na
Figure 10. 2D NMR 1H-1H NOESY spectra of the wild type PCET peptide.
ur
The figure is showing the effect of HFIP on secondary structure elements of PCET peptide. The overlay of wild type PCET peptide in aqueous condition (Black) and 20% HFIP aqueous mixture
Jo
(Red) using qualitative analysis (Negative peaks are shown in green). A. Fingerprint NH-H region of 2D NMR 1H-1H NOESY spectrum acquired at 500 MHz B. Fingerprint NH-H( or ) region of 2D NMR 1H-1H NOESY spectrum C. Fingerprint NH-NH connectivity region of 2D NMR 1H-1H NOESY spectrum D. Fingerprint H-H connectivity region of 2D NMR 1H-1H NOESY spectrum E. Fingerprint of H- H region of 2D NMR 1H-1H NOESY spectrum of PCET wild-type peptide. F. Overlay of 2D NOESY wide range spectrum of PCET wild type peptide in water (black) and 20% HFIP (red) aqueous mixture.
37
Journal Pre-proof Figure 11. 2D NMR 1H-1H NOESY spectra of the mutant PCET peptide. The effect of HFIP on secondary structure elements of mutant PCET peptide has been shown by the overlay of mutant PCET peptide in aqueous condition (Black) and 20% HFIP aqueous mixture (Red) using qualitative analysis. A. Connectivity of NH-H region of 2D NMR 1H-1H NOESY spectrum has been shown to differentiate the secondary structure elements B. Overlay of NH-H( or ) region of 2D NMR 1H-1H NOESY spectrum C. Overlay of NH-NH correlations
of
of 2D NMR 1H-1H NOESY spectrum D. Overlay of H-H connectivity region of 2D NMR 1HH NOESY spectrum of mutant PCET peptide E. Fingerprint of H- H region of 2D NMR 1H-
1
H NOESY spectrum of mutant peptide. F. Overlay of 2D NOESY wide range spectrum of
-p
ro
1
Jo
ur
na
lP
re
PCET mutant peptide in water (black) and 20% HFIP (red) aqueous mixture.
38
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9
Figure 10
Figure 11