|
|
||||||||
1 Department of Bioengineering,
2 Department of Biochemistry and Cell Biology, and
3 W.M. Keck Center for Computational Biology, Rice University, Houston, Texas 77005, USA
4 Graduate Program of Structural and Computational Biology and Molecular Biophysics and
5 Verna and Marrs McLean Department of Biochemistry and Molecular Biology, Baylor College of Medicine, Houston, Texas 77030, USA
Reprint requests to: Jianpeng Ma, 1 Baylor Plaza, BCM-125, Baylor College of Medicine, Houston, TX 77030, USA; e-mail: jpma{at}bcm.tmc.edu; fax: (713) 796-9438.
(RECEIVED May 5, 2003; FINAL REVISION August 3, 2003; ACCEPTED August 4, 2003)
6 These authors made equal contributions to this work. ![]()
Article and publication are at http://www.proteinscience.org/cgi/doi/10.1110/ps.03188303.
7 The DNA-binding domain was intact on the protein used to create crystals for 1LBH
[PDB]
; however, this region did not exhibit an electron density, presumably due to high mobility in the absence of DNA (Lewis et al. 1996). ![]()
8 The crystal structure with PDB code 1TLF
[PDB]
was generated from LacI protein that was proteolytically treated to remove the DNA-binding domain (Friedman et al. 1995). Even though the 1LBH
[PDB]
structure does not have any electron density for this domain (Lewis et al. 1996), its physical presence might affect small but important changes in the structure that could influence the TMD of the core domain. Further, the protein used to create 1TLF
[PDB]
differs at position 109 (alanine) from repressor used to create 1LBH
[PDB]
and 1EFA
[PDB]
(threonine 109). Thus, the TMD requirement for identical atoms precludes the use of 1TLF
[PDB]
without further modeling, which would introduce additional computational error into the trajectory. ![]()
| Abstract |
|---|
|
|
|---|
-stacking intermediate. Pathway 3 extends from the rear of the inducer-binding pocket, across a hydrogen-bond network at the bottom of the pocket, and transverses the monomermonomer interface via changes in H74 and H74rsquo;. In general, intermediates detected in this study are not apparent in the crystal structures. Observations from the simulations are in good agreement with biochemical data and provide a spatial and sequential framework for interpreting existing genetic data. Keywords: Gene regulation; allosteric mechanism; structural flexibility; conformational transition pathway; targeted molecular dynamics
| Introduction |
|---|
|
|
|---|
In both states, the architecture of LacI can be divided into five structural units (Friedman et al. 1995; Lewis et al. 1996; Bell and Lewis 2000). First, the N-terminal DNA-binding domain (amino acids 150) comprises a helix-turn-helix motif that interacts with the major groove of the DNA. Two N termini of a dimer are required for a complete DNA-binding site. Next in the primary sequence is the hinge helix (amino acids 5161), which binds to the center of the operator within the minor groove. The hinge region appears to be disordered in the absence of DNA (Ha et al. 1989; Lewis et al. 1996; Spronk et al. 1996, 1999a,b; Kalodimos et al. 2001, 2002a,b). The hinge helix connects the DNA-binding domain to the core domain (amino acids 62330), which comprises the interconnected N- and C-subdomains (Fig. 1A
). The inducer-binding pocket is located between these subdomains. Finally, the C-terminal regions (amino acids 331360) associate into a four-helical bundle to facilitate tetramerization of LacI (not shown in Fig. 1A
).
|
-nitrophenyl-ß, D-fucoside [ONPF] induced, LacIIPTG inducer) led Lewis and colleagues (Lewis et al. 1996; Bell and Lewis 2000) to postulate that the allosteric transition in LacI occurs when inducer binding shifts the N-subdomains, both with respect to each other and to the C-subdomains. As a result of this structural shift, the N-subdomain monomermonomer interface is significantly altered. These motions in turn displace the hinge helices from the minor groove of the operator, disrupting specific protein-DNA contacts and increasing the dynamic mobility of the N-terminal DNA-binding domains (Lewis et al. 1996; Spronk et al. 1996, 1999a,b; Swint-Kruse et al. 1998; Kalodimos et al. 2001, 2002a,b). In this study, the time course and detailed interactions along the allosteric pathway in the LacI core are simulatedfrom the DNA/anti-inducerbound structure (Protein Data Bank [PDB] code 1EFA
[PDB]
; Bell and Lewis 2000) to the inducer-bound structure (PDB code 1LBH
[PDB]
; Lewis et al. 1996)by using the method of targeted molecular dynamics (TMD; Schlitter et al. 1993; Ma and Karplus 1997; Ma et al. 2000, 2002). The results of this study culminate in the construction of an integrated framework for the allosteric transition pathway of LacI that is consistent with the biochemical, genetic, and structural data accumulated over the past four decades. | Results and Discussion |
|---|
|
|
|---|
|
TMD does require that the starting and ending structures have identical sets of atoms. Therefore, the present simulations include the only domain in common between the two structuresthe LacI core domains of a dimer. Neither the DNA-binding domains nor the operator DNA is present in the induced structure. The small molecule ligands differ in the two structures (IPTG and ONPF) and the DNA-bound structure was generated from LacI protein that lacked the C-terminal tetramerization domain. Thus, these features were not included in the simulation. Analysis of the CHARMM trajectories (Brooks et al. 1983) was performed by aligning the stationary C-subdomains (residues 162290 and 321329) followed by (1) inspecting the trajectory file computationally and visually with Quanta (www.msi.com), and (2) using the Contacts of Structural Units (CSU) software program (Sobolev et al. 1999) to compare equilibrated structures and those from representative time points during the trajectory (Fig. 2B
).
When LacI binds inducer, the induction signal might be expected to propagate through the structure in a spatially and temporally ordered manner. For these TMD simulations, which cannot force inducer into the binding pocket, space/time ordering may or may not be the same as for the actual protein (Fig. 2
). However, the changes observed for LacI in the present simulations occur in a logical sequence. During the trajectory, a striking observation is that the conformational changes undergone by each monomer are not identical. This phenomenon is intriguing. Experimentally, the LacI dimeroperator DNA complex binds two molecules of inducer cooperatively, and the LacI allostery may be described equally well by the Monod-Wyman-Changeaux (MWC) and Koshland models (OGorman et al. 1980; Daly and Matthews 1986). Further, the LacI K84L mutant binds IPTG with slowed and biphasic kinetics (Chang et al. 1993); this mutation may have slowed the conformational transition so that the asymmetry is experimentally observable (see Interpretation of Experimental Data).
We have actively explored the origin of the asymmetric conformational change. An early feature of the trajectory is an asymmetric change in hydrogen-bond formation between the side chains of D149 and S193 (see Pathway 1, below). Residues that directly contact IPTG are underlined in the text, tables, and figures (Table 1
). The region around D149 is essentially identical for all three monomers of the crystallographic asymmetric cell of the DNA-bound structure (Bell and Lewis 2000). However, D149 interactions in the structures of intact inducer-bound LacI and the proteolytic fragment of inducer-bound core demonstrate some differences in distances and interaction partners (Friedman et al. 1995; Lewis et al. 1996). During equilibration at the beginning of the simulation, all three structures undergo asymmetric changes between two monomers of a dimer around position 149. Multiple equilibrations demonstrate a wide variety of configurations adopted by D149 and its partners, especially S193. Both the asymmetry and the conformational changes observed during the simulation are repeatable over seven TMD trajectories (repeatability is illustrated in Fig. 3
). In the simulations, the conformational change originates on a "trigger monomer," then propagates through various noncovalent interactions to the second "response," monomer (Fig. 2
). In this article, we discriminate between the two monomers by adding a prime () symbol to residues and regions on the trigger monomer.
|
|
-stacking between H74 and H74. The third pathway extends from F161, at the rear of the inducer-binding pocket on the trigger monomer, through the bottom of this pocket, and across the monomermonomer interface to the response monomer via the H74H74
-stacking. The resulting conformational changes reposition the N-subdomains so that they can no longer form the contacts to the DNA-binding domain presumed requisite for high-affinity binding (Bell and Lewis 2000; Swint-Kruse et al. 2002).
Origin of asymmetry: Repeatability of equilibration and TMD trajectory
We find the asymmetry observed in the pathways to be a compelling result of the simulation. The simulated asymmetry has a number of possible origins, ranging from experimentally relevant to random occurrence during TMD.
We first examined the crystal structures to determine whether these starting structures gave rise to the observed behavior. The structure of the DNA-bound LacI (1EFA [PDB] ) is symmetric for all three monomers present in the crystallographic unit cell (Bell and Lewis 2000). However, all eight available monomers in the inducer-bound statefour each from 1LBH [PDB] and 1TLF [PDB] have subtle differences in the region around D149 (Friedman et al. 1995; Lewis et al. 1996). A potential cause for the observed asymmetry is different occupancies of the inducer-binding sites, but this source is unlikely at the high IPTG concentrations used (Friedman et al. 1995; Lewis et al. 1996). Finally, the fact that both 1TLF [PDB] and 1LBH [PDB] are variable around amino acid 149 indicates that crystal packing artifacts are an unlikely source of the observed asymmetry.
To ascertain whether asymmetry was a random occurrence during the simulation, the TMD trajectory was carried out five times by using the same equilibrated start and target structures but varying the random number seed in the simulation. In each trajectory, the same monomer always emerged as the trigger monomer. When a different equilibrated target structure was used to generate two additional trajectories, the opposite monomer functioned as the trigger in both cases. Therefore, although some asymmetry is present in the crystal structures, asymmetry of the simulated conformational change appears to be dictated during equilibration.
Energy equilibration/minimization algorithms are used before virtually every molecular dynamics simulation to relieve stresses inherent to the experimental structure or those that result from modeling (e.g., the addition of solvent molecules or point charges). Equilibration usually does not result in "meaningful" conformational changes, but the structures of the current simulation are subjected to experimentally relevant forces. Prior to equilibration, the DNA-bound N termini, which make cross-domain interactions with the core N-subdomain, were removed from the starting structure; anti-inducer ONPF was also deleted from this structure; and the inducer IPTG was removed from the target structure. Therefore, both structures were in the unliganded state for 10 psec of computer time while they were equilibrated. Experimentally, the allosteric constant for unbound protein is
1 when LacI dimer function can be described by a MWC model, which indicates that unliganded LacI is equally likely to adopt either the repressed or induced conformation (OGorman et al. 1980; Daly et al. 1986). Equilibration provides opportunity for the now unliganded structures to initiate relevant changes toward the alternate conformation.
Indeed, several residues that are asymmetrically involved in the subsequent trajectory of the LacI conformational change are affected during equilibration. The first step in the trajectory involves changes at residue 149 of the trigger monomer (see Fig. 2
, pathway 1). Interestingly, the region around D149especially S193 and R197is affected during equilibration of both the DNA- and inducer-bound structures. Note that two of these residues contact the inducer/anti-inducer ligand (Table 1
), and thus, changes in this region during equilibration are consistent with the structural manipulations required to execute TMD. Although 1TLF
[PDB]
could not be used as a target structure, we also subjected this structure to ligand removal, solvation, and equilibration and saw changes similar to those in 1LBH
[PDB]
(Table 2
).
|
Freeing the S193 side chain of the DNA-bound structure allows this residue to form stronger interactions with D149 as the simulation proceeds toward the inducer-bound conformation. In 1EFA [PDB] , these two residues form only one bond between the backbone N of 193 and the side chain of 149. Upon ligand removal and equilibration, these residues form any of several combinations of six different hydrogen bonds between backbone/side chain of 193 and the side chain of 149, with one to four bonds per monomer. In 1LBH [PDB] , the S193 side chain forms two bonds to the D149 side chain per monomer. The structure of the proteolytic fragment, 1TLF [PDB] , has one of these bonds. During equilibration of either inducer-bound structure, these residues form combinations of zero to three hydrogen bonds in variety of side chain/side chain, side chain/backbone, and backbone/side chain combinations.
The D149-S193 hydrogen bonds exhibit no obvious patterns that correlate with the asymmetry of the TMD. However, the total surface area in contact between these two residues should correlate with the total strength of the interactions. These values are presented in Table 2
and provide insight into the origin of TMD asymmetry: When a trajectory used equilibration 1 for the DNA-bound conformation and equilibration 1 for the inducer-bound target, monomer B served as the trigger monomer. Consistent with this, monomer B of inducer-bound equilibration 1 has much larger contact surface between D149 and S193 than does monomer A. Further, when equilibration 8 was substituted for the inducer-bound target, monomer A emerged as the trigger monomer. In this case, the monomers of the target had similar contact surface (Table 2
), and thus, the moderate difference in start structures appears to dominate: Monomer A of the starting structure had less contact surfaceand thus may be easier to interruptthan is monomer B. This correlation between contact surface of D149/S193 and trigger monomer will be tested in future studies.
Finally, a backbone hydrogen bond is frequently lost between ß-strands C and D (D149-O:L128-NH; Table 2
) during equilibration of the target monomer. Intriguingly, this change can occur on either monomer, but the distance is always longer on one monomer within a dimer. This pattern persisted during equilibration of the proteolytic fragment 1TLF
[PDB]
. Further, the monomer with the broken bond functioned as the trigger monomer in the TMD trajectories. Because the bonds are formed in both the DNA-bound and inducer-bound crystal structures, this feature of the equilibrated structures may approximate a feature of apo-LacI that allows the protein to switch between conformations.
The TMD trajectory was repeated seven times (illustrated in Fig. 3
), and the features of all trajectories were highly similar. For the purpose of clarity, the main body of this article uses the time points of the first trajectory and is summarized in Figure 2
. The reverse trajectoryfrom the inducer- to DNA-bound conformationwas also simulated. Although a detailed examination is beyond the scope of this article, the key features of the process are near-mirror images of the data presented for the forward transition.
Pathway 1
In pathway 1, changes in the inducer-binding pocket of the trigger monomer propagate through ß-strands C and D to a hydrophobic cluster in the N-subdomain, then to the monomermonomer interface, and finally result in significant changes at K84 (Fig. 2B
). Pathway 1 only occurs on the trigger monomer, and we assume that this simulates the changes that occur when the trigger monomer binds the first inducer molecule. Toward the end of pathway 1, the response monomer undergoes changes so that the two monomers adopt similar configurations in the final structure.
As mentioned above, changes in D149 appear critical to propagating the message from the inducer-binding pocket to the N-subdomain. D149 lies at a key junction: It contacts IPTG and is located between the base of ß-strand D and at the start of a flexible loop (residues 149-156;Fig. 1A
). Flexibility in this loop allows the side chain of D149 to move
4 Å, thus forming the hydrogen bond with S193' (at 90 psec; Fig. 4A
, arrow 1). Note that the distance between D149 and S193 of the response monomer remains constant throughout the simulation (Fig. 4B
). Next, at 100 psec, the side chain of D149 forms another hydrogen bond with the backbone -NH of F161 (Fig. 4C
, arrow 3). Finally, the backbone oxygen of D149 forms a hydrogen bond to the side chain of S193' 15 psec later (Fig. 4A
, arrow 2). Note that most of the motions in the flexible loop stabilize after 110 psec (Fig. 4E,G)
, most likely due to the two D149 hydrogen bonds, which essentially lock the flexible loop in place.
|
The changing hydrogen-bond distances between strands C and D endow greater mobility to the region of the N-subdomain that is closest to the monomermonomer interface. A cluster of hydrophobic residues in this region contract at
90 psec (Fig. 1B
, arrows). This apparently leads to a similar change in the response monomer
110 psec (Fig. 2
), resulting in a more closely packed monomermonomer interface.
Interface rearrangement: The intersection of pathways 1 and 2
When pathway 1 changes are completed on the trigger monomer structure, the LacI dimer is poised to make changes that affect both the cross-subunit (trigger monomer to response monomer) and cross-domain (core to DNA binding) interfaces. This arrangement provides opportunity to transmit the allosteric message from the inducer-binding site of one monomer (presumably the trigger monomer) to (1) the inducer-binding site of the second monomer, so that the second inducer molecule binds with higher affinity than the first, and (2) the DNA-binding site, so that it adopts the conformation with low affinity for the operator sequence.
As stated above, changing hydrophobic interactions in N-subdomain alter the interactions across the monomermonomer N-subdomain interface (Fig. 1A,2B). This entire region plays a key role in determining the conformational state of LacI (Lewis et al. 1996; Bell and Lewis 2000; Bell et al. 2001), but the electrostatic interactions of the lysine residues (K84 and K84) are particularly important. In the initial (repressed) state, the side chains of residue 84 are located in the middle of this interface, in plane with but interrupting many polar groups capable of forming hydrogen bonds (Fig. 5A
). The side chains of K84 and K84 point toward each other, and their conformations appear to be anchored by an intervening anion (Bell and Lewis 2000). For the purposes of this study, we have assumed this to be a chloride anion (see Materials and Methods). Above and below the lysine plane are two layers of hydrophobic side chains, including members of the N-subdomain hydrophobic groups mentioned above. In addition, an extensive network of polar and ionic interactions forms the monomermonomer interface of the DNA-bound state (Fig. 5A
).
|
The shift in the N-subdomain interface is more or less concerted. The interatomic distance between the backbones of V96 and V96 decreases nearly linearly throughout the transition, whereas the distance between the side chains of K84 and K84 increases (Fig. 3C
). The simulation indicates the following pathway: After contraction of the N-subdomain hydrophobic cluster on the trigger monomer, the -NH3+ groups of the side chains of both K84s slowly dissociate from a sandwiched position between the main-chain carbonyl groups of V94 and V96 (90 psec). These moieties then move to a transient position between the main-chain carbonyl of V94 and the side chain of S97. The displacement of the lysine side chains allows secondary structural contacts to form across the interface between the main-chain amino and carbonyl groups of V96 and V96. Finally, the second N-subdomain hydrophobic cluster on the response monomer contracts. The side chain of K84 soon detaches from V94 and S97, and moves outward to briefly contact the main-chain carbonyl of M98 before engaging the two acidic residues (170 psec), D88 and E100, in a chargecharge interaction. Interestingly, during the transition, the hydrocarbon stem of the K84 side chain maintains a close hydrophobic contact with the side chain of M98, even though the ionic moiety of K84 flips in the opposite direction (same for K84). The K84 motions are essentially symmetric. The only difference in the two monomers is that K84 and E100 do not interact in the equilibrated target structure.
Water molecules also participate in the N-subdomain transition. In the DNA-bound state, the side chains of K84 and K84 and the chloride anion are partially exposed to the solvent, as proposed by Bell et al. (2001). However, the water molecules are squeezed out of the interface completely upon the formation of the inducer-bound conformation due to the extensive hydrophobic interactions. The chloride anion escapes to the solvent before the final transformation to the inducer-bound state. The complete departure of water molecules and the chloride ion from the interface occurs approximately in the middle of the transition.
As the N-subdomains move closer to form a continuous 12-stranded ß-sheet, N113 and Q117 also undergo dramatic changes. Both N113 and Q117 are located on the top surface of the N-subdomain and interact with the DNA-binding domain in the repressed structure (Fig. 1A
). In the DNA-bound form, the side chains of Q117 and Q117 do not contact each other, but in the induced state, their side chains form a hydrogen bond. In the simulation, this event occurs at 130 psec. The side chain of N113 lies
5 Å from the side chain of S93' (same for N113' and S93) in the repressed conformation. During the simulation, the distances between these side chains decrease to 3 Å to form new hydrogen bonds (beginning at 100 psec), further tightening the monomermonomer interface. Thus, these changes may interrupt the cross-domain interface interactions requisite for high-affinity DNA-binding (Bell and Lewis 2000; Swint-Kruse et al. 2002).
Pathway 2
The second pathway involves interactions across the N-subdomain monomermonomer interface after changes in the K84 pair. Three significant pairs of residues change: K84K84, H74H74, and Q78Q78. The K84 interactions link pathway 1 to pathways 2 and 3, and the H74 interactions connect pathway 2 to 3 (see pathway 3; Fig. 2B
).
H74 and H74 are located at the bottom of helices 5 and 5' in the N-subdomains, very close to the inducer-binding pockets (Fig. 1A
), and undergo extensive changes during the course of the transition. Initially, H74 forms intra-subunit hydrophobic contacts with F293, T276, and the hydrophobic region of Q78, whereas H74 forms similar contacts within the partner monomer. Because of the loss of the K84K84 interaction (90 psec), the distance between the core N-subdomains also decreases. This distance eventually decreases enough (110 psec) to allow H74 and H74 to engage in stable parallel
-stacking across the interface (Fig. 6A,B
, arrow 1). Histidine rings must be within 4.5 Å of each other and have similar dihedral angles for parallel
-stacking to occur (McGaughey et al. 1998). This
-stacking was not present in either crystal structure and was only detected by the TMD simulation. At 170 psec, the histidines simultaneously rotate 180°, then reform their
-stacking interaction (Fig. 6B
, arrow 2). The
-stacking ceases at 200 psec (Fig. 6B
, arrow 3) and adopts the final equilibrated inducer-bound structure. These
-stacking interactions appear to be important for transmitting signals across the N-subdomain interface. This
-stacking and the concomitant flip were present in every TMD trajectory and are shown in Figure 6
for trajectories 1, 5, and 7.
|
In summary, three monomermonomer interface interactions of the DNA-bound state are disrupted (K84K84, Q78L71, and Q78L71), but a stable
-stacking interaction is transiently gained as the simulation progresses from the DNA-bound to the inducer-bound structure.
Core pivot
The gross clamping motion of the N-subdomains relative to the fixed C-subdomains must logically involve the three sets of residues that covalently interconnect the N- and C-subdomains: (1) amino acids 161164, connecting strand E of the N-subdomain and helix 8 of the C-subdomain; (2) residues 290293, joining helix 13 of the N-subdomain and strand J of the C-subdomain; and (3) positions 318322, linking strand K of the N-subdomain to L of the C-subdomain (Fig. 1A
). We denote these sets of connecting residues at the N- and C-core junction as the core pivot (Figs. 1A, 7A![]()
). Similar regions in other proteins are often referred to as hinge, but we have chosen this alternate description to avoid confusion with the LacI hinge helix at residues 5161.
|
and
angles of the backbone of three conformations of D-allose-binding protein (ALBP; a structural homolog of the LacI core domain) to determine the mechanism by which coupled torsional changes of the backbone produced large conformational changes in the region analogous to the core pivot of LacI. Plots of the
and
angles for the residues of the core pivot were constructed over the entire 210-psec trajectory (Fig. 7B
/
angle for each data set was also evaluated for the complete 210-psec simulation (Supplemental Material). Examination of
/
versus time identified those data sets with a standard deviation >9° that adopted at least two distinct conformations at some point in the trajectory. The
angles of P320, P320, V321, and V321 were exceptions; each had a standard deviation >9° due to a flip of the pucker in the proline ring. The data illustrate that the highest degree of mobility resides in the backbones of residues F161, H163', E164, Q291, L318, L319, V321, and S322 in the trigger monomer, and F161, E164, K290, Q291, D292, L318, L319, V321, and S322 in the response monomer (Supplemental Material). The majority of backbone rearrangements occur after 90 psec and contribute to pathway 3.
In addition, many side chains in the core pivot form a hydrophobic cluster located at the back of the inducer-binding pocket. On both monomers, most of these hydrophobic side chains make van der Waals contact with F161 (Fig. 7A
). In the equivalent region of ALBP, Magnusson et al. (2002) theorized that water molecules acted as structural "ball bearings" to facilitate the subdomain conformational change. Similarly, Mowbray and Bjorkman (1999) studied conformational changes in ribose-binding protein (RBP) and discovered that water molecules play an important role in its hinge region (equivalent to the core pivot in LacI), but they noted that LacI likely cannot use a similar mechanism due to hydrogen-bonding differences involving residues F161 and S162. We propose that LacI may use the inherent hydrophobicity of the core pivot, especially F161, to facilitate the conformational changeliterally "greasing" the transition machinery.
Pathway 3
The third pathway extends from the back of the inducer-binding pocket across the monomermonomer interface via the H74H74
-stacking (Fig. 1A
). This pathway appears to propagate the additional changes needed for cooperative inducer-binding on the second monomer. After the loss of the K84K84 interaction (90 psec), both N-subdomains clamp inward, thereby exerting bending forces at their respective core pivots. Accordingly, substantial backbone rearrangements take place in the core pivots to accommodate the conformational changes at the interface. As a possible consequence of these core hinge backbone motions, the side chain of F293 moves
2 Å closer to the side chain of F161 at 110 psec; F293 and F161 do the same (Fig. 8
). The
angle of F161 also has a noticeable change at 110 psec (Fig. 7B
).
|
The other termini of the hydrogen-bond networks are positioned near the entrance to the inducer-binding pockets at the N-subdomain interface (Q248 and Q248; Fig.1A
). This region of the protein is extensively solvated in both conformations and has significant hydrogen bonding. As the simulation progresses (155 psec), the side chain of Q248 flips away from the inducer-binding pocket and interacts with the N
2 atom of H74, momentarily destabilizing the
-stacking interaction across the interface between H74 and H74. In addition, a single water molecule forms new hydrogen bonds with the side chain oxygen of N246 and the backbone oxygen of L73. Consequently, this Q248 flip may be due to a combination of three factors: (1) stabilization of the monomermonomer interface after the onset of the H74-H74
-stacking (110 psec), (2) closure of the N-subdomain interface at ß-strands B and B (140170 psec), and (3) the presence of a lone water molecule, which preserves the continuity of the hydrogen bonding between N246 and L73, allowing Q248 to break its contacts with these two residues. The transient hydrogen bonds mentioned above are only detected via the use of TMD.
The disruption created by the Q248 flip allows the side chains of H74 and H74 to change conformation, rotating 180° (170 psec), after which they regain their
-stacking interaction (Fig. 6B
, arrow 2). In turn, this rotation causes the backbones of H74 and H74 to flip, momentarily disrupting the hydrogen bonding between the side chain oxygen of Q248 and the backbone -NH of H74. Finally, the side chain of Q248 rotates 180° (185 psec), forming a hydrogen bond with both the backbone oxygen of L73 and the side chain oxygen of N246. The side chain-NH of N246 is hydrogen-bonded to one of the side chain oxygens of D274, thereby continuing the hydrogen-bond network across the interface. Hence, movement generated on one monomer (the flip by Q248) passes across the interface between the two monomers (by means of the histidine
-stacking), and is felt by the second monomer (rotation of Q248). Communication between the monomers across this interface at this position could contribute to cooperativity of inducer binding.
However, the Q248 flip is the only feature that is not consistently present in repeated trajectories. Thus, changes at Q248 cannot be always responsible for the change in conformation of the H74H74
-stacking. In the remaining six trajectories, the
-stack flip appears to be mediated by water molecules present in the interface. The flip occurs only after the N-subdomain interface completely closes (denoted by the stabilization of the interatomic distance between the backbones of V96 and V96; see Fig. 3B
).
Interpretation of experimental data
Both phenotypic and biochemical studies demonstrate that a number of residues implicated by the TMD simulation are important to LacI function. The simulated allosteric transition pathway provides a spatial and temporal framework for interpreting some of these data.
Mutations of nearly every residue of lac repressor have been studied phenotypically to detect whether a mutant affects loss of either repression or induction (Suckow et al. 1996). Lack of repression could be due to improper folding, assembly defects, or changes in DNA-binding affinity, whereas loss of induction could result from changes in either inducer binding or propagation of the allosteric signal to the DNA-binding domain. In the intact protein (with residues 2329 individually mutated), mutations at
56% of the sites alter function (Suckow et al. 1996). Here,
75% of residues that participate in the simulated pathways are affected by mutation, a result indicating that TMD identified functionally relevant positions. In addition, residues identified in this study comprise 60% of all positions that generate inducer insensitivity (is) positions. Because another 20% of is residues are located in the immobile C-subdomain, TMD identified the majority of residues with this phenotype (Suckow et al. 1996).
Of necessity, biochemical analysis of purified repressor protein is limited to a smaller number of mutant proteins. However, these studies provide a more detailed picture of mutational effects on function. Several interesting residues are summarized below, whereas other residues that participate in the conformational change are summarized in Table 3
.
|
Mutations of the N-subdomain interface also affect allostery. Apolar substitutions at K84 yield a phenotype that is unresponsive to inducer (Suckow et al. 1996), although K84L can still bind IPTG with wild-type affinity, albeit with significantly slower and biphasic kinetic rate constants (Chang et al. 1993). In wild-type repressor, the highly charged lysine side chains may destabilize the otherwise hydrophobic monomermonomer interface. When the lysines are mutated to hydrophobic residues, the interface is highly stabilized (Nichols and Matthews 1997), presumably by enhanced hydrophobic interactions, especially from contacts similar to those made by the hydrocarbon stem of K84 with the side chain of M98. The enhanced stability of the interface may prevent the monomers from effectively sliding against one another during the transition and, therefore, reduce the inducibility of the protein. The recent crystal structure of the K84L mutant provides direct evidence for modified hydrophobic packing interactions within this interface (Bell et al. 2001). We speculate that stabilization of the K84L interface slows the asymmetric conformational change predicted by TMD so that it may be detected experimentally as biphasic kinetics.
Mutations of D88, which anchors K84 in the wild-type induced conformation, also decrease inducibility of the repressor (Chang et al. 1994; Suckow et al. 1996). Mutating D88 would deprive the K84 cation of a negative anchor in the induced state. In particular, the D88K substitution (Chang et al. 1994) could repel the K84 pair and disfavor the inducer-bound conformation. During equilibration of the inducer-bound structure (1LBH [PDB] ), K84 tends to form electrostatic interactions only with D88 and not with E100. Interestingly, mutations of E100 have no effect on inducibility (Suckow et al. 1996). Structurally, the differential importance of D88 and E100 may be due to their distinct structural environments. D88 lies on a rigid helix (helix 5), whereas E100 lies on a flexible loop (Bell et al. 2001). Greater mobility of E100 could compromise the strength of its interactions with K84. This possibility is further supported by the fact that although K84 could form an interaction with E100 during one equilibration, K84 did not interact with E100. This preference of K84 for D88 was also observed in most of the target structure (inducer-bound) equilibrations.
Substitutions at other positions may influence the interactions delineated above. For example, the observed decrease in inducibility for the A110K mutation (Müller-Hartmann and Müller-Hill 1996) could derive from the lysine side chain at position 110 forming an ionic interaction with D88, thus destabilizing the interaction between K84 and D88.
As noted previously, H74 appears to be integral to signal transmission across the subunit interface. Early examination of the crystal structure led to the hypothesis that forming an H74D278 ion pair in the induced conformation is critical to the conformational change (Lewis et al. 1996). However, Barry and Matthews (1999) demonstrated that although many mutations of H74 and D278 affect the allosteric conformational change, these effects were independent of ionic charge. The observed
-stacking of H74H74 provides a mechanism that explains the biochemical results (Barry and Matthews 1999). In addition, H74W and H74F mutants increase apparent DNA-binding affinity by changing the allosteric constant to favor the state with high DNA-binding affinity (Barry and Matthews 1999). Replacing histidine with these large aromatic residues likely stabilizes the DNA-bound structure of the protein in which H74 and H74 form intra-subunit hydrophobic contacts. Furthermore, these large aromatic residues are not likely to engage in
-stacking that appears to facilitate induction.
Finally, F161, located in the core pivot region, is in a highly hydrophobic environment and may be key to the mobility of the N-subdomain. Consistent with this observation, F161 mutants demonstrate that changing this residue can affect either repression or induction (Suckow et al. 1996). Mutations at this position may "lock" the protein in one conformation or the other. All members of the core pivot hydrophobic group, except L295 and P320, are phenotypically affected by mutations (Table 3
; Suckow et al. 1996). In contrast, of the remaining polar core pivot residues, mutation only affected Q291 and D292 (Table 3
; Suckow et al. 1996). These differential effects may be the result of side chain versus backbone involvement for each of the core pivot residues in the allosteric transition. As pointed out earlier, the side chains of the core pivot hydrophobic group cluster together around F161 (Fig. 7B
). In contrast, the side chains of S162, H163, E164, K290, L318, and S322 all point outward and away from the protein interior and interact extensively with solvent during the simulation, indicating a possible explanation for minimal effects of mutations at these residues.
Additional structural insight may be obtained by comparing structures of homologous proteins. As noted earlier, the hydrophobic region of the core pivot may function like the water molecule "ball-bearings" of ALBP and RBP (Mowbray and Bjorkman 1999; Magnusson et al. 2002). Conformational changes are only known for one other related repressor, Escherichia coli purine repressor (PurR). In PurR, the "allosteric switch loop" is analogous to the LacI flexible loop, and PurR mutations of W147 (homologous to LacI V150) show that this residue plays an intimate part in the allosteric response of the protein (Huffman et al. 2002). The side chain of PurR W147 undergoes significant conformational changes in the transition between the two regulatory states, apparently stabilizing two distinct conformations of the allosteric switch loop (Huffman et al. 2002). The TMD results for LacI reveal a different mechanical role: Changes in side chains are more subtle (e.g., hydrogen bonding at D149), whereas mobility in this loop propagates the inducer-binding signal from the binding pocket to the monomermonomer interface that lies below the DNA-binding site.
Finally, a new computational technique that uses orthologous and paralogous proteins to identify specificity-determining residues in proteins has been applied to the LacI/PurR family (Mirny and Gelfand 2002). By using this method, a dozen specificity-determining residues were identified (note that the bold residues below are implicated by TMD in the allosteric transition). Four of these residues are in the DNA-binding domain (Y17, Q18, V52, and A57), six are in the inducer-binding pocket (N125, D149, V150, F161, W220, and Q248), and two "false positives" lie further from the DNA and ligand binding sites (R101 and Q117). Interestingly, TMD indicates that the specificity false positive (Q117) plays a role in transmitting the allosteric signal.
Conclusion
This article reports the allosteric conformational pathway of LacI predicted by targeted molecular dynamics simulations. In this method, the effect of the ligand binding was achieved by using the inducer-bound protein conformation as the target of the simulated conformational transition from the DNA/anti-inducer-bound form. The C-subdomains do not undergo significant motions during the transition and anchor the N-subdomains. Therefore, our discussion focuses primarily on the changes of the extensive interactions both within and across the two N-subdomains.
The simulated trajectories indicate that the allosteric signal originates asymmetrically in the inducer-binding site of one (trigger) monomer and propagates to the other (response) monomer through various noncovalent interactions of three interconnected pathways. Asymmetry originates in the interactions involved with D149 during equilibration of the target structure. We believe this reflects experimentally relevant changes between the bound crystal structure and the unliganded equilibration structure. Neither D149 nor S193 has been targeted for biochemical studies of substitutions at these sites. Therefore, this work predicts results for new projects in vitro (biochemical characterization of mutants at 149 and 193) and in silico (determining whether the contact surface between the two residues dictates the trigger monomer).
Both the asymmetry and conformational changes observed in the trajectory are repeatable, and either monomer may function as the trigger. The K84K84 interaction connects pathway 1 to pathways 2 and 3, whereas the H74H74
-stacking links pathways 2 and 3. The conformational changes observed reposition the N-subdomains so that they no longer make the contacts with the DNA-binding domain that appear to be requisite for high affinity binding. The detection of the intermediate features of the allosteric transition pathway, such as the H74H74
-stacking interaction, demonstrates the power of molecular dynamics simulations. Such intermediates are not apparent in the available crystal structures. Overall, the results from the simulated trajectories are in agreement with a wide range of experimental biochemical and genetic data, and provide a spatial and temporal framework for interpreting these existing data and for designing new experiments. Targeted molecular dynamics provides an extremely useful tool to assess intermediate states along allosteric or conformational transition pathways. The asymmetric interactions and the three pathways delineated by this approach offer new insights into the atomic-level functional mechanism of LacI.
| Materials and methods |
|---|
|
|
|---|
The coordinates of a certain conformation j are given by vector xj = (x1j,x2j,x3j...x3Nj), here N is the number of atoms, where x1, x2, and x3 represent the components of a Cartesian coordinate system for the first atom, and so forth. The distance,
, between two conformations j and k can be calculated as
.
is then used in the time-dependent constraint as a directing control parameter:
![]() |
where x represents the current conformation in the simulation, and xT the target conformation (Schlitter et al. 1993). Consequently, decreasing
forces the system to undertake the conformational transition toward the target.
is decreased after each time step (
t) by the following equation:
![]() |