The lac repressor hinge helix in context: The effect of the DNA binding domain and symmetry

The lac repressor hinge helix in context: The effect of the DNA binding domain and symmetry

BBA - General Subjects 1864 (2020) 129538 Contents lists available at ScienceDirect BBA - General Subjects journal homepage: www.elsevier.com/locate...

855KB Sizes 2 Downloads 44 Views

BBA - General Subjects 1864 (2020) 129538

Contents lists available at ScienceDirect

BBA - General Subjects journal homepage: www.elsevier.com/locate/bbagen

The lac repressor hinge helix in context: The effect of the DNA binding domain and symmetry Danielle Seckforta,b, Gillian C. Lynchb, B. Montgomery Pettitta,b,

T



a

Structural and Computational Biology and Molecular Biophysics, Baylor College of Medicine, Houston 77030, TX, USA Department of Biochemistry and Molecular Biology, Sealy Center for Structural Biology and Molecular Biophysics, University of Texas Medical Branch, Galveston 77555, TX, USA

b

A R T I C LE I N FO

A B S T R A C T

Keywords: Protein LacI Disordered proteins MD simulations Metadynamics Disorder to order transition Salt stability

The Lac system of genes has been an important model system in understanding gene regulation. When the dimer lac repressor protein binds to the correct DNA sequence, the hinge region of the protein goes through a disorder to order transition. The hinge region is disordered when binding to nonoperator sequences. This region of the protein must pay a conformational entropic penalty to order when it is bound to operator DNA. Structural studies show that this region is flexible. Previous simulations showed that this region is disordered when free in solution without the DNA binding domain present. Our simulations corroborate that this region is extremely flexible in solution, but we find that the presence of the DNA binding domain proximal to the hinge helix and salt make the ordered conformation more favorable even without DNA present.

1. Introduction The lac system has been a model of gene regulation since 1952 [1–12]. The lac repressor regulates the expression of a set of genes involved in lactose metabolism in the bacterium Escherichia coli (E. coli). When lactose is either no longer in the system or no longer the favorable source of energy in the system, a binding signal modulates the affinity of the lac repressor protein for the operator. The repressor then binds to the operator region of the operon and the gene's expression is suppressed [13]. The lac repressor is a protein of 360 amino acids that assembles into a homotetramer. The three-dimensional structure of the lac repressor provides important atomic level clues as to how the repressor performs its biological role. The structure of fragments of the DNA-binding domain (DBD) [14] and regulatory domain (RD) when bound to inducer have been determined [15–17]. The structure of the DBD showed that the repressor contained the helix-turn-helix (HTH) motif that is common in other proteins that bind specifically to DNA. These early studies were only able to characterize the pieces of the protein and were unable to do so for the full protein where the RD and DBD were linked by the hinge helix (HH) region. M. Lewis et al. were able to obtain the full three-dimensional structure bound to operator-like DNA in 1996 using x-ray crystallography to obtain an image of the alpha carbons of the full molecule

with resolution 2.7 Å (PDB 1LBG) [17]. Work continued on the repressor protein bound to the operator using other methods to get better resolution [18–20]. Previous experiments had determined there must be a structural change to the protein when no longer bound to operator DNA [3,21,22], but characterizing this through crystallography or NMR was difficult. Kaptein et al. made a single mutation to the HH, V52C, and were able to solve the structure of the head piece structure (HP62), the DBD + HH, bound to operator and non-operator DNA using NMR spectroscopy (Fig. 1) [23,24]. This mutation showed no sign of altering the affinity of the protein for the operator site or change to the binding, it simply made the protein fluctuate less. This sequence change allowed a view of the structure bound to non-operator DNA and showed that it was disordered. Proteins and protein domains with a less well-defined structure are referred to as intrinsically disordered proteins (IDP) or intrinsically disordered regions (IDR). These types of proteins were first characterized in the mid-1990s and have been extensively studied [25–31]. The secondary structures of many proteins are disordered under physiological conditions, but they can adopt specific structures in a particular functional state [26]. This type of folding is often coupled with binding to the protein's target [32]. The HH of the lac repressor protein is one of these IDRs. Although a disordered structure has been determined for the hinge helix when bound to non-operator DNA, there is still very little

⁎ Corresponding author at: Department of Biochemistry and Molecular Biology, Sealy Center for Structural Biology and Molecular Biophysics, University of Texas Medical Branch, Galveston 77555, TX, USA. E-mail address: [email protected] (B.M. Pettitt).

https://doi.org/10.1016/j.bbagen.2020.129538 Received 14 October 2019; Received in revised form 13 January 2020; Accepted 14 January 2020 Available online 17 January 2020 0304-4165/ © 2020 Elsevier B.V. All rights reserved.

BBA - General Subjects 1864 (2020) 129538

D. Seckfort, et al.

Fig. 1. A rendering of PDB 1L1M(A) and 1OSL (B) using VMD showing the lac repressor protein's conformation when bound to operator and non-operator DNA respectively. The HH is shown in red and the DBD is shown in blue.

surface as a function of a few selected collective variables (CVs). We tested a number of different collective variables including RMSD from the crystal structure, radius of gyration, end-to-end distance (E2E), and helical content. Each of these CVs has desirable and undesirable characteristics [42]. We concluded RMSD from the crystal structure was an adequate compromise and allowed other variations via reweighting (below). We used the RMSD of residues 50–58 from each monomer from the starting structure of each respective simulation. The systems were run using NAMD 2.1038 and the CHARMM36 force field [37,43]. The VMD package [38] provided some analysis and visualization. To begin metadynamics for systems 1 and 2, a Gaussian height of 1.0 kcal/mol was used. The systems were able to traverse the entire range of the CV at least 5 times at the initial Gaussian height. The height was subsequently reduced according to the schedule shown in Table 1 in order to get a more accurate and smooth final potential of mean force (PMF) [38,44,45]. For each subsequent height the systems transited the entire range of the CV multiple times, at least 1 trip/10 ns, until the results showed regularity allowing for convergence. Every 50 ps we collected an intermediate output PMF file. Once all runs were complete, all PMFs from each Gaussian height were averaged using the block method (4 blocks/ ns). Then every four points along the RMSD were averaged. The variance from the average PMF for each height is shown in Supporting Information Fig. S1. System 1 was sampled for a total of 300 ns while system 2 was sampled for a total of 340 ns. The systems were analyzed using VMD with plugins from PLUMED [46], NAMD [38], and METAGUI [47,48]. Analysis using METAGUI 3 was used to cluster the metadynamics data based on RMSD. Once clustered PLUMED was used to calculate the torsion angles of residues 51–58, and E2E distances of the HHs (N-Asn50 to O-Gly58) of each frame in those clusters. Using the 16 torsion angles of each frame, the number of consecutive helical residues (−90 < φ < −35 and − 70 < ψ < −15) [49] was calculated. The metadynamics calculation was then reweighted to E2E distance using the original PMF and the populations of each variable per RMSD cluster. The reweighting followed the usual procedure for a biased distribution.

information on the protein when free in solution. Part of this is due to limitations of current experimental techniques to characterize disordered proteins in solution. Simulations have been performed on the hinge-helix peptide in solution [33] and confirmed the tendency towards disorder of this region of the protein. Here we use computational simulation techniques to investigate the structure and conformational free energy of this system in solution when the DBD is present. To investigate the effect on the protein conformation of the ion atmosphere around DNA, which is known to substantially increase the salt concentration over bulk, we consider up to 0.4 M salt [34]. 2. Methods We performed metadynamics simulations starting from minimized crystal coordinates in order to compute the free energy surface for the hinge-helix with the full DNA-binding domain as a function of a set of collective variables (CVs). We chose to sample the root mean square deviation (RMSD) from the starting coordinates as our original CV and subsequently reweight the sampling to other CVs for further analysis. The studies were performed in two different ionic concentration conditions, 0.0 M and 0.4 M. 2.1. System preparation To compare the differences between the ordered and disordered states of the HH region, two metadynamics simulations were performed. We monitored the simulations to ensure that each was traversing the entire range of the collective variable. For both simulations the starting structure was the V52C mutant of the full homodimer DBD + HH (residues 1–62) [23]. Many previous experiments and simulations have been performed using either the PurR-GCG construct or the V52C mutant to aid dimer formation [33,35]. In order to compare most closely with previous simulations [33], both systems used the PDB structure 1L1M [23]. This structure is a dimer consisting of the first 62 N-terminal residues of each monomer, crystalized to operator DNA. To prepare the structures for simulation the DNA was removed from the starting crystal structure. Then hydrogen atoms were added using PSFGen in Visual Molecular Dynamics (VMD) [36]. The peptides were then solvated in explicit CHARMM TIP3P [37] water molecules. Then ions were randomly added to the two systems in order to match desired concentrations, 0.4 M and 0 M (neutral). The high ion system had a total of 52,159 atoms while the low ion system had 52,667 atoms. All systems were minimized for 21,000 steps then equilibrated for 640 ps in an NVT ensemble at 300 K. A weakly coupled Langevin thermostat [38] was used to maintain constant temperature. Electrostatic interactions were computed using the particle-mesh Ewald algorithm (PME) [39]. The PME grid spacing was 1.0 Å.

Table 1 The change in average ion density versus RMSD for each of the HH amino acids based on line of least squares fit from the first and last point of RMSD, 0.058 and 9.911 Å respectively.

2.2. Metadynamics variables The free energy landscape representing the HH unfolding pathway in the presence of the DBD was investigated using metadynamics [40,41], an enhanced sampling method to calculate the free-energy 2

Amino acid

Starting avg. ion count

Ending avg. ion count

Change in Avg. Ion Count

Asn 50 Arg 51 Cys 52 Ala 53 Gln 54 Gln 55 Leu 56 Ala 57 Gly 58

0.2023 0.4352 0.0833 0.0459 0.1521 0.2918 0.0474 0.0281 0.0858

0.0880 0.1850 0.0526 0.0794 0.1687 0.1578 0.1272 0.1887 0.2010

−0.1143 −0.2502 −0.0307 +0.0335 +0.0166 −0.1399 +0.0798 +0.1606 +0.1152

BBA - General Subjects 1864 (2020) 129538

D. Seckfort, et al.

Fig. 2. (A,B), Graphs of the potential of mean force during the metadynamics simulations based on the collective variable of root mean squared deviation in high ion and low ion solutions respectively. In the low ion solution the difference in energy between the starting structure and the most energetically favorable structure is over 60 kcal/mol as shown in the insert of A. The larger graph of A shows the PMF on the same scale as the high ion system of B. n

hi∗ =

∑ j = 0 hi (x j ) e−βV (xj ) n ∑ j=0

Q (x j

) e−βV (xj )

determining the dynamics and thermodynamics of the isolated HH [33]. Looking at the low ion concentration system in the current study, there are only 4 ions in the ~50,000 atom system and at no time are they within 5 Å of the HH residues (50–58). In the high ion system, many statistically significant ionic interactions with the HH occur and we will focus our analysis there. We considered proximal ion density versus RMSD. The ion density within 4 or 5 Å of all HH residues is relatively featureless and flat along the RMSD (slopes of −0.007 ions/Å). The density does not change depending on the conformation (RMSD) of the HH region. When ion density around individual residues is investigated, some trends can be seen. The ion densities within 5 Å of three of the residues (Cys52, Ala53, Gln54) remain fairly constant and do not change depending on the RMSD. (Table 1) These three residues are known to mostly interact with the other HH monomer by making protein-protein contacts [19,50]. For the polar residues Asn50, Arg51, and Gln55 there is a greater concentration of ions within 5 Å of the residue when the HH has a lower RMSD, which is when the HH is more helical (Table 1). The concentration for these 3 residues steadily increases as the RMSD decreases. The charged arginine at residue 51 is expected to have a high density of ions surrounding it due to its positive charge, but it is interesting to see how the folding of the HH region affects the result. The asparagine at position 50 has the lowest slope and densities of the 3 residues; its increase in ion density is probably related to its proximity to arginine 51. The third residue with an inverse relationship between ion density and RMSD is glutamine 55. This molecule is facing away from the DNA binding surface and interacting with the core domain [19,50]. When the protein is in the helical conformation, but without DNA or the core domain present, residue 55 is facing out into solution. The three end residues of the HH (Table 1) show a positive correlation, the greater the RMSD the greater the ion density surrounding them. As the RMSD increases, the HH becomes less tightly coiled. This allows the ends of the HH to extend farther into the solution, allowing more ions to be near, due to not having as much steric hindrance from the rest of the surrounding protein. At these higher RMSDs, residues 56–58 also are better hydrated having more (Supplemental Fig. S2) solution accessible surface.

(1)

where V(xj) is the value of the original PMF at an RMSD of xj, hi(xj) is the number of frames with the new E2E CV at value i in RMSD cluster xj, Q(xj) is the bin size of the cluster at RMSD xj and hi* is the reweighted value for the new CV value for i. 3. Results 3.1. Metadynamics PMF analysis The results from the metadynamics simulations, shown in Fig. 2(A,B), display a large difference between the two systems in different ion concentrations. These PMFs were calculated by averaging the PMFs of the simulation steps after achieving the smallest Gaussian height, 0.3 kcal/mol. Considering the system without ions, the free energy difference between the starting structure and the lower region of the basin is around 60 kcal/mol. The wide basin from ~3–9 Å is rough and in the range of about 0–5 kcal/mol. Comparing points less than an Angstrom apart shows there are large jumps of 2–3 kcal/mol instead of a smooth basin with a single minimum. This makes it more likely for the protein to stay in a single local conformation for an extended period of time. There are quite a few local minima that are similar, less than 1 kcal/mol difference, along this basin. The PMF of the system with high ion concentration is quantitatively different. The free energy difference between the structure with both monomers being fully helical and the minimum is 12 kcal/mol, rather than over 60 kcal/mol. But at ~0.5 Å, when one of the helices has started to partially unwind, the free energy has already dropped by about 3.5 kcal/mol. Near an RMSD of ~1.2 Å when one helical turn has fully unwound or both are partially unwound, the structure is another 4 kcal/mol more favorable. These two plateaus indicate meta-stable structures. The free energy surface has a minimum from 5 to 5.5 Å where both helices are unwound but not stretched out. However, the entire bottom of the basin from 2 to 9 Å is within a 4 kcal/mol difference and relatively smooth with only one large feature. This indicates that a transition between these many states would be accessible on the few ns timescale.

3.3. Reweighted E2E results To further investigate the conformational free energy surface for this system, the PMFs for both systems were reweighted based on endto-end (E2E) distance, as described in Eq. 1. The distances considered are from the first heavy atom of residue 50 to the last heavy atom of residue 58 for both monomers. Thus, we chose the nitrogen of Asn50 to

3.2. Ion concentrations The RMSD PMFs demonstrated that there were clear differences between the two systems. In a previous study, ion concentration was found to be significant in 3

BBA - General Subjects 1864 (2020) 129538

D. Seckfort, et al.

Fig. 3. A,B – Free energy versus end to end distances of the two monomers for the high ion and low ion systems respectively.

conformation when free in solution, with or without salt, regardless of the presence of the DBD. The free energy required to make the transition to the helical state is however considerably lowered when the DBD is present and ions are added to the system. This is mechanistically consistent with the understanding that the area surrounding DNA is highly saturated with ions [34] and so would promote helical propensity. Thus, when the protein gets closer to any DNA, an environment with a higher ion concentration, the cost of helix formation is lowered. The amount of free energy needed by the operator sequence to secure that change is then reasonable in comparison to the HH alone in solution [33]. We note that all simulation studies show a dependence on force fields and recent studies have shown issues when considering unstructured domains with force fields designed for folded proteins [55]. Previous studies [56] [57,58] have shown that an increase in ionic strength shifts the equilibrium towards helical stability in alanine rich peptides. They found that concentrations below 0.15 M had a linear effect while higher concentrations followed a more exponential dependence. We saw the effect ion concentration has on charged or polar residues, specifically the importance of Arg51 and Gln55. According to previous studies [19,50] Asn50, Arg51, and Gln55 all associate with the core domain while bound to operator DNA. This means that when the HH is helical and the core domain is not present, they are facing out towards the solution and able to interact with the most ions. Future studies will be required to study whether this trend of high ion-protein interaction still holds when the core domain is present or if the proteinprotein interactions out compete them. Residues 52 and 53 are consistently surrounded by the two lowest ion concentrations, this correlates with both having strong monomer-monomer interactions [50]. Residue 52 was specifically chosen as the one to be mutated (V52C) because it was the only one that had no direct interactions with DNA or the core domain [35]. It only interacts with the other HH. The PMFs reweighted based on end-to-end distances gave additional insight into the 3-dimensional structure of the protein when free in solution. Although the RMSD PMFs gave markedly different numbers between the ordered and disordered structures, the reweighted E2E PMFs of the two systems are quite similar to each other. This points towards possible favored structural characteristics in solution even with a broad conformational distribution. Despite the environments being different, without DNA present these two systems both favored structures with similar end-to-end lengths. Further investigations into what specific characteristics other than this are favored should be pursued. More information on the structure of the protein in solution could help our understanding of the protein as a whole.

the oxygen on Gly58. In crystal form, the length of both of these distances is 14 Å. In our starting structure, those distances are 14.5 Å and 11.5 Å, denoted as (14.5, 11.5), for monomer 1 and 2 respectively. Since these two monomers are independent and have nonsymmetric conformations, we consider the re-weighted energetic profile as the E2E distance of monomer 1 versus the E2E distance of monomer 2 versus the free energy of the system. Fig. 3 shows the minima for both systems are near the helical the starting structure. For the system in high ion concentration the minimum is seen at (14.75, 11.25), while the system in low ion concentration has minimum at (14.25, 11.25). (Fig. 3, A and B) When looking at the structures of the protein with E2E distances near these minima, ± .25 Å, the majority of the structures are helical, similar to the starting structure, but some can be found with an RMSD up to 3.07 Å. The re-weighted PMF in terms of end-to-end distances of the helices gives a different perspective from the RMSD from the crystal structure. While neither CV may be optimal both give insight into the collection of the most energetically favorable structures. We find a persistence of a lack of symmetry despite the sequence identity. Both systems favor asymmetric conformations with monomers exploring different lengths and thus different structures. Monomer 1 is usually longer than monomer 2 in Fig. 3. Conformations where the opposite is true are less favorable. 4. Discussion and conclusions The trait of coupled folding and binding is common in protein-DNA recognition [51–54]. LacI repressor has been an often studied model of this sort of recognition pathway [11,18]. This pathway has many complexities that contribute to its function as reflected in the interprotein interactions, protein-solution interactions, and protein-DNA interactions. To better understand the details of the protein conformational energetic changes, we have looked at the mechanism of the range of protein conformations without the added influence of DNA to determine the protein's propensity for the required shape. In this work, we have presented a quantitative analysis of the structural transition of the LacI HH, in the presence of the DBD, from helical to disordered conformation in two different solution conditions. The high salt solution is a model for the counterion-condensed atmosphere the protein would be in when binding to DNA. Previously it has been noted that the protein should be disordered in solution, without considering its free energy profile to transition from this state [3,21,22]. We found that the HH prefers to be in a disordered 4

BBA - General Subjects 1864 (2020) 129538

D. Seckfort, et al.

Our results show a clear difference between the free energy landscape in low and high ion concentrations. This gives a better idea of the free energetic changes as the protein moves through solution to recognize DNA. These results build upon Spronk's work [59] that showed the dimer to be necessary for helical formation. When comparing to our previous work [33] of the single HH alone in solution, adding the rest of the DBD complex significantly lowers the free energy penalty between the ordered and disordered conformations when in a solution similar to that surrounding DNA. The physical principles that emerge from this study add to our understanding of IDPs, specifically intrinsically disordered regions that change conformation upon binding DNA.

[15] A.M. Friedman, T.O. Fischmann, T.A. Steitz, Crystal structure of lac repressor core tetramer and its implications for DNA looping, Science 268 (1995) 1721–1727. [16] S.C. Harrison, A structural taxonomy of DNA-binding domains, Nature 353 (1991) 715–719. [17] M. Lewis, G. Chang, N.C. Horton, M.A. Kercher, H.C. Pace, M.A. Schumacher, R.G. Brennan, P. Lu, Crystal structure of the lactose operon repressor and its complexes with DNA and inducer, Science 271 (1996) 1247–1254. [18] C.E. Bell, M. Lewis, The lac repressor: a second generation of structural and functional studies, Curr. Opin. Struct. Biol. 11 (2001) 19–25. [19] C.E. Bell, M. Lewis, A closer view of the conformation of the lac repressor bound to operator, Nat. Struct. Biol. 7 (2000) 209–214. [20] C.E. Bell, M. Lewis, Crystallographic analysis of lac repressor bound to natural operator O1, J. Mol. Biol. 312 (2001) 921–926. [21] W. Gilbert, A. Maxam, The nucleotide sequence of the lac operator, Proc. Natl. Acad. Sci. U. S. A. 70 (1973) 3581–3584. [22] P. DeHaseth, C. Gross, R. Burgess, M. Record, Measurement of binding constants for protein-DNA interactions by DNA-cellulose chromatography, Biochemistry 16 (1977) 4777–4783. [23] C.G. Kalodimos, A.M. Bonvin, R.K. Salinas, R. Wechselberger, R. Boelens, R. Kaptein, Plasticity in protein-DNA recognition: lac repressor interacts with its natural operator 01 through alternative conformations of its DNA-binding domain, EMBO J. 21 (2002) 2866–2876. [24] C.G. Kalodimos, N. Biris, A.M. Bonvin, M.M. Levandoski, M. Guennuegues, R. Boelens, R. Kaptein, Structure and flexibility adaptation in nonspecific and specific protein-DNA complexes, Science 305 (2004) 386–389. [25] A.K. Dunker, C.J. Brown, J.D. Lawson, L.M. Iakoucheva, Z. Obradovic, Intrinsic disorder and protein function, Biochemistry 41 (2002) 6573–6582. [26] A.K. Dunker, J.D. Lawson, C.J. Brown, R.M. Williams, P. Romero, J.S. Oh, C.J. Oldfield, A.M. Campen, C.M. Ratliff, K.W. Hipps, J. Ausio, M.S. Nissen, R. Reeves, C. Kang, C.R. Kissinger, R.W. Bailey, M.D. Griswold, W. Chiu, E.C. Garner, Z. Obradovic, Intrinsically disordered protein, J. Mol. Graph. Model. 19 (2001) 26–59. [27] H.J. Dyson, P.E. Wright, Intrinsically unstructured proteins and their functions, Nat. Rev. Mol. Cell Biol. 6 (2005) 197–208. [28] W.L. Hsu, C. Oldfield, J. Meng, F. Huang, B. Xue, V.N. Uversky, P. Romero, A.K. Dunker, Intrinsic protein disorder and protein-protein interactions, Pac. Symp. Biocomput. (2012) 116–127. [29] C.J. Oldfield, B. Xue, Y.Y. Van, E.L. Ulrich, J.L. Markley, A.K. Dunker, V.N. Uversky, Utilization of protein intrinsic disorder knowledge in structural proteomics, Biochim. Biophys. Acta 1834 (2013) 487–498. [30] V.N. Uversky, C.J. Oldfield, A.K. Dunker, Showing your ID: intrinsic disorder as an ID for recognition, regulation and cell signaling, J. Mol. Recognit. 18 (2005) 343–384. [31] P. Tompa, Intrinsically unstructured proteins, Trends Biochem. Sci. 27 (2002) 527–533. [32] R. Rohs, X. Jin, S.M. West, R. Joshi, B. Honig, R.S. Mann, Origins of specificity in protein-DNA recognition, Annu. Rev. Biochem. 79 (2010) 233–269. [33] D. Seckfort, B. Montgomery Pettitt, Price of disorder in the lac repressor hinge helix, Biopolymers 110 (2019) e23239. [34] C.-L. Lai, C. Chen, S.-C. Ou, M. Prentiss, B.M. Pettitt, Interactions between DNA double helices, Phys. Rev. E (2020) in press. [35] L. Swint-Kruse, K.S. Matthews, P.E. Smith, B.M. Pettitt, Comparison of simulated and experimentally determined dynamics for a variant of the Lacl DNA-binding domain, Nlac-P, Biophys. J. 74 (1998) 413–421. [36] W. Humphrey, A. Dalke, K. Schulten, VMD: visual molecular dynamics, J. Mol. Graph. 14 (33–38) (1996) 27–38. [37] A.D. MacKerell, D. Bashford, M. Bellott, R.L. Dunbrack, J.D. Evanseck, M.J. Field, S. Fischer, J. Gao, H. Guo, S. Ha, D. Joseph-McCarthy, L. Kuchnir, K. Kuczera, F.T. Lau, C. Mattos, S. Michnick, T. Ngo, D.T. Nguyen, B. Prodhom, W.E. Reiher, B. Roux, M. Schlenkrich, J.C. Smith, R. Stote, J. Straub, M. Watanabe, J. Wiorkiewicz-Kuczera, D. Yin, M. Karplus, All-atom empirical potential for molecular modeling and dynamics studies of proteins, J. Phys. Chem. B 102 (1998) 3586–3616. [38] J.C. Phillips, R. Braun, W. Wang, J. Gumbart, E. Tajkhorshid, E. Villa, C. Chipot, R.D. Skeel, L. Kale, K. Schulten, Scalable molecular dynamics with NAMD, J. Comput. Chem. 26 (2005) 1781–1802. [39] T. Darden, D. York, L. Pedersen, Particle mesh Ewald: an N·log(N) method for Ewald sums in large systems, J. Chem. Phys. 98 (1993) 10089–10092. [40] A. Barducci, M. Bonomi, M. Parrinello, Metadynamics, Wires Comput. Mol. Sci. 1 (2011) 826–843. [41] A. Barducci, G. Bussi, M. Parrinello, Well-tempered metadynamics: a smoothly converging and tunable free-energy method, Phys. Rev. Lett. 100 (2008) 020603. [42] G. Fiorin, M.L. Klein, J. Hénin, Using collective variables to drive molecular dynamics simulations, Mol. Phys. 111 (2013) 3345–3362. [43] J. Huang, A.D. MacKerell Jr., CHARMM36 all-atom additive protein force field: validation based on comparison to NMR data, J. Comput. Chem. 34 (2013) 2135–2145. [44] Q. Bai, Y. Shen, N. Jin, H. Liu, X. Yao, Molecular modeling study on the dynamical structural features of human smoothened receptor and binding mechanism of antagonist LY2940680 by metadynamics simulation and free energy calculation, Biochim. Biophys. Acta 1840 (2014) 2128–2138. [45] N. Liu, M. Duan, M. Yang, Structural properties of human IAPP dimer in membrane environment studied by all-atom molecular dynamics simulations, Sci. Rep. 7 (2017) 7915. [46] M. Bonomi, D. Branduardi, G. Bussi, C. Camilloni, D. Provasi, P. Raiteri, D. Donadio, F. Marinelli, F. Pietrucci, R.A. Broglia, M. Parrinello, PLUMED: a portable plugin for

Declaration of Competing Interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper. Acknowledgements The authors thank Dr. Cheng Zhang for many helpful discussions. The Sealy Center for Structural Biology scientific computing staff is acknowledged for computational support. We gratefully acknowledge the Robert A. Welch Foundation (H-013), and the National Institutes of Health (GM-037657) for partial support of this work. This work used the Extreme Science and Engineering Discovery Environment (XSEDE), which is supported by National Science Foundation grant number ACI1548562. The authors also acknowledge the Texas Advanced Computing Center (TACC) at The University of Texas at Austin for providing HPC resources that have contributed to the research results reported within this article. URL: http://www. tacc.utexas.edu Appendix A. Supplementary data Supplementary data to this article can be found online at https:// doi.org/10.1016/j.bbagen.2020.129538. References [1] J. Monod, From enzymatic adaptation to allosteric transitions, Science 154 (1966) 475–483. [2] J. Monod, M. Cohn, La Biosynthègse Induite Des enzymes (Adaptation Enzymatique), Advances in Enzymology, 1952, pp. 67–119. [3] P. DeHaseth, T. Lohman, M. Record, Nonspecific interaction of lac repressor with DNA: an association reaction driven by counterion release, Biochemistry 16 (1977) 4783–4790. [4] D. Frank, R. Saecker, J. Bond, M. Capp, O. Tsodikov, S. Melcher, M. Levandoski, M. Record, Thermodynamics of the interactions of lac repressor with variants of the symmetric lac operator: effects of converting a consensus site to a non-specific site 1, J. Mol. Biol. 267 (1997) 1186–1206. [5] M.D. Barkley, P.A. Lewis, G.E. Sullivan, Ion effects on the lac repressor-operator equilibrium, Biochemistry 20 (1981) 3842–3851. [6] R.B. Winter, P.H.V. Hippel, Diffusion-driven mechanisms of protein translocation on nucleic acids. 2. The Escherichia coli lac repressor-operator interaction: equilibrium measurements, Biochemistry 20 (1981) 6948–6960. [7] H. Nick, K. Arndt, F. Boschelli, M.A. Jarema, M. Lillis, J. Sadler, M. Caruthers, P. Lu, lac repressor-lac operator interaction: NMR observations, Proc. Natl. Acad. Sci. U. S. A. 79 (1982) 218–222. [8] A. Ozarowski, J.K. Barry, K.S. Matthews, A.H. Maki, Ligand-induced conformational changes in lactose repressor: a phosphorescence and ODMR study of single-tryptophan mutants, Biochemistry 38 (1999) 6715–6722. [9] A.M. Khoury, H.S. Nick, P. Lu, In vivo interaction of Escherichia coli lac repressor N-terminal fragments with the lac operator, J. Mol. Biol. 219 (1991) 623–634. [10] R.T. Sauer, Lac repressor at last, Structure 4 (1996) 219–222. [11] F. Jacob, J. Monod, Genetic regulatory mechanisms in the synthesis of proteins, J. Mol. Biol. 3 (1961) 318–356. [12] C.J. Wilson, H. Zhan, L. Swint-Kruse, K.S. Matthews, The lactose repressor system: paradigms for regulation, allosteric behavior and protein folding, Cell. Mol. Life Sci.: 64 (2007) 3–16. [13] J. Parker, Lac Operon, in: S. Brenner, J.H. Miller (Eds.), Encyclopedia of Genetics, Academic Press, Cambridge, MA, 2001, p. 1070. [14] R. Kaptein, E.R. Zuiderweg, R.M. Scheek, R. Boelens, W.F. van Gunsteren, A protein structure from nuclear magnetic resonance data, lac repressor headpiece, J. Mol. Biol. 182 (1985) 179–182.

5

BBA - General Subjects 1864 (2020) 129538

D. Seckfort, et al.

[47]

[48]

[49] [50]

[51] [52]

[53] J.M. Rogers, C.T. Wong, J. Clarke, Coupled folding and binding of the disordered protein PUMA does not require particular residual structure, J. Am. Chem. Soc. 136 (2014) 5197–5200. [54] L. Staby, C. O’Shea, M. Willemoes, F. Theisen, B.B. Kragelund, K. Skriver, Eukaryotic transcription factors: paradigms of protein intrinsic disorder, Biochem. J. 474 (2017) 2509–2532. [55] J.A. Drake, B.M. Pettitt, Force field-dependent solution properties of glycine oligomers, J. Comput. Chem. 36 (2015) 1275–1285. [56] J.M. Scholtz, E.J. York, J.M. Stewart, R.L. Baldwin, A neutral, water-soluble, .Alpha.-helical peptide: the effect of ionic strength on the helix-coil equilibrium, J. Am. Chem. Soc. 113 (1991) 5102–5104. [57] K. Xiong, E.K. Asciutto, J.D. Madura, S.A. Asher, Salt dependence of an α-helical peptide folding energy landscapes, Biochemistry 48 (2009) 10818–10826. [58] E. Lacroix, A.R. Viguera, L. Serrano, Elucidating the folding problem of α-helices: local motifs, long-range electrostatics, ionic-strength dependence and prediction of NMR parameters11Edited by A. R. Fersht, J. Mol. Biol. 284 (1998) 173–191. [59] C.A. Spronk, M. Slijper, J.H. van Boom, R. Kaptein, R. Boelens, Formation of the hinge helix in the lac repressor is induced upon binding to the lac operator, Nat. Struct. Biol. 3 (1996) 916–919.

free-energy calculations with molecular dynamics, Comput. Phys. Commun. 180 (2009) 1961–1972. X. Biarnés, F. Pietrucci, F. Marinelli, A. Laio, METAGUI. A VMD interface for analyzing metadynamics and molecular dynamics simulations, Comput. Phys. Commun. 183 (2012) 203–211. T. Giorgino, A. Laio, A. Rodriguez, METAGUI 3: a graphical user interface for choosing the collective variables in molecular dynamics simulations, Comput. Phys. Commun. 217 (2017) 204–209. U. Langel, B. Cravatt, A. Graslund, N. von Heijne, M. Zorko, T. Land, S. Niessen, Introduction to Peptides and Proteins, 1st ed., CRC Press, 2009. H. Zhan, L. Swint-Kruse, K.S. Matthews, Extrinsic interactions dominate helical propensity in coupled binding and folding of the lactose repressor protein hinge helix, Biochemistry 45 (2006) 5896–5906. V.P. Jain, R.S. Tu, Coupled folding and specific binding: fishing for amphiphilicity, Int. J. Mol. Sci. 12 (2011) 1431–1450. C.L. Moody, V. Tretyachenko-Ladokhina, T.M. Laue, D.F. Senear, M.J. Cocco, Multiple conformations of the cytidine repressor DNA-binding domain coalesce to one upon recognition of a specific DNA surface, Biochemistry 50 (2011) 6622–6632.

6