Protein Science
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Ma, B.
Right arrow Articles by Nussinov, R.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Ma, B.
Right arrow Articles by Nussinov, R.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?
Protein Science (2003), 12:1882-1893.
Copyright © 2003 The Protein Society

Energy landscape and dynamics of the ß-hairpin G peptide and its isomers: Topology and sequences

Buyong Ma1 and Ruth Nussinov1,2

1 Basic Research Program, SAIC-Frederick, Inc., Laboratory of Experimental and Computational Biology, National Cancer Institute at Frederick, Frederick, Maryland 21702, USA
2 Sackler Institute of Molecular Medicine, Department of Human Genetics and Molecular Medicine, Sackler School of Medicine, Tel Aviv University, Tel Aviv 69978, Israel

Reprint requests to: Ruth Nussinov, NCI-FCRF Building 469, Room 151, Frederick, MD 21702; e-mail: ruthn{at}ncifcrf.gov; fax: (301) 846-5598.

(RECEIVED February 13, 2003; FINAL REVISION June 13, 2003; ACCEPTED June 19, 2003)

Article and publication are at http://www.proteinscience.org/cgi/doi/10.1110/ps.0306103.


    Abstract
 TOP
 Abstract
 Introduction
 Results and Discussion
 Materials and methods
 References
 
We have investigated free energy landscape [MM/PBSA + normal modes entropy] of permutations in the G peptide (41–56) from the protein G B1 domain by studying six isomers corresponding to moving the hydrophobic cluster along the ß-strands (toward the turn: T1, AGEWTYDDKTFTVTET; T2, GEDTWDYATFTVTKTE; T3, GEDDWTYATFTVTKTE; toward the end: E1, WTYDDAGETKTFTVT; E2, WEYTGDDATKTETFTV; E3, WTYEGDDATKTETFTV). The free energy terms include molecular mechanics energy, Poisson-Boltzmann electrostatic solvation energy, surface area solvation energy, and conformational entropy estimated by using normal mode analysis. From the wild type to T1, then T3, and finally T2, we see a progressively changing energy landscape, toward a less stable ß-hairpin structure. Moving the hydrophobic cluster outside toward the end region causes a greater change in the energy landscape. {alpha}-Helical instead of a ß-hairpin structure was the most stable form for the E2 isomer. However, no matter how much the sequence changes, for all variants studied, ideal "native" ß-hairpin topologies remain as minima (regardless of whether global or local) in the energy landscape. In general, we find that the energy landscape is dependent on the hydrophobic cluster topology and on the sequence. Our present study indicates that the key is the relative conformational energies of the different conformations. Changes in the sequence strongly modulate the relative stabilities of topologically similar regions in the energy landscape, rather than redefine the topology space. This finding is consistent with a population redistribution in the process of protein folding. The limited variation of topological space, compared with the number of possible sequence changes, may relate to the observation that the number of known protein folds are far less than the sequential allowance.

Keywords: Protein folding; side-chain interaction; hydrogen bonding; hydrophobic interactions; ß-peptide; ß-hairpin


    Introduction
 TOP
 Abstract
 Introduction
 Results and Discussion
 Materials and methods
 References
 
According to Anfinsen’s hypothesis, all the information of the protein three-dimensional structure is encoded in the primary sequence of the polypeptide. On the other hand, the protein three-dimensional topology shows an amazing tolerance to sequence variation. Many proteins share the same fold, even without significant sequence homology. For this reason, there has been considerable interest in studies of the changes of protein folds with the sequence.

Protein topology and stability may define the space of allowed sequences (Koehl and Levitt 2002). Hence, sequence evolution is under structural constraints (Parisi and Echave 2001). The question is, then, how does the change in the sequence affect the protein structure.

ß-Hairpin structures may constitute suitable minimal systems for in-depth studies of fundamental issues in protein folding. In particular, Muñoz et al. (1997, 1998) have shown that a 16-residue ß-hairpin peptide captures the essential features of the folding of large proteins. The peptide folds rapidly and cooperatively into a conformation with a defined secondary structure and a well-packed hydrophobic cluster of aromatic side chains. The peptide they used (GEWTYDDATKTFTVTE, residues 41–56 from the protein G B1 domain, named G peptide below following the method of Honda et al. 2000) has a similar conformation in the isolated state (Blanco et al. 1994) compared with that observed in the protein (Gronenborn et al. 1991).

A particularly interesting feature of the G peptide is the topology of a four-residue hydrophobic cluster (W43, Y45, F52, and V54). Numerous experimental and theoretical studies indicate that this hydrophobic cluster is crucial for the stability of the G peptide. There are two points concerning the hydrophobic cluster: (1) how many residues are needed to form stable hydrophobic cluster in the G peptide, and (2) what are the effects of shifting the hydrophobic cluster (toward the turn or toward the terminus) in the ß-hairpin structure. Both lead to an intriguing sequence–topology relationship.

Previous studies have addressed some aspects of the sequence–topology relationship of the G peptide. Computationally, coarse-grained (Kolinski et al. 1999; Guo et al. 2000; Klimov and Thirumalai 2000) and atomistic (Dinner et al. 1999; Pande and Rokhsar 1999; Roccatano et al. 1999; Bryant et al. 2000; Ma and Nussinov 2000; Garcia and Sanbonmatsu 2001; Lee and Shin 2001; Zagrovic et al. 2001; Zhou et al. 2001; Zhou and Linhananta 2002) models were used to study the folding/unfolding and the energy landscape of the peptide. The G peptide also serves as a reference for ß-hairpin design and for mutational studies (Kobayashi et al. 2000; Cochran et al. 2001; Espinosa et al. 2001). From molecular dynamics simulations, Ma and Nussinov (2000) proposed that three out of the four hydrophobic residues are sufficient to form a stable hydrophobic cluster in the ß-hairpin structure. Experimental mutational studies confirmed that V54A, and possibly W43A, mutants still fold as a ß-hairpin. However, either a Y45A or a F52A mutation strongly destabilizes the ß-hairpin structure (Kobayashi et al. 2000).

By using a statistical mechanical model, Muñoz et al. (1998) first characterized two variants of the original ß-hairpin: moving the hydrophobic cluster one residue closer to the center of the peptide (W44, Y46, F51, V53) and one residue closer to the ends (W42, Y44, F53, V55). The results indicated that the ß-hairpin is stabilized and the folding rate is enhanced if the hydrophobic cluster is moved closer to the turn and opposite if moved toward the ends. Kolinski et al. (1999) also simulated the two variants (with a Monte Carlo lattice model) and found similar trends. Klimov and Thirumalai (2000) tested other possibilities of shifting the hydrophobic cluster two residues toward the turn and two residues closer to the ends. They also found that a shift changes the folding rate and cooperativity of ß-hairpin formation. At least two experimental studies have also been devoted to the moving of the hydrophobic cluster along the ß-hairpin strands (Santiveri et al. 2000; Espinosa et al. 2001). Santiveri et al. (2000) found that the bending rigidity of the turns alters the side-chain interactions. Based on studies of a series of isomers of a 20-residue peptide (derived from the G peptide), Espinosa et al. (2001) concluded that the separation between the loop segment and the hydrophobic cluster strongly influences the ß-hairpin size and stability, with a smaller separation leading to a greater stability.

To further understand the structure–sequence relationship of the G-related peptides, we investigated the energy landscapes and dynamics of the ß-hairpins, changing the size and position of the hydrophobic cluster. The peptides investigated (Fig. 1Go) include three isomers corresponding to moving the hydrophobic cluster toward the inside, that is, toward the turn region (T1, AGEWTYDDKTFTVTET; T2, GEDTWDYATFTVTKTE; T3, GEDDWTYATFTVTKTE), and three isomers shift the hydrophobic cluster outside, toward the end region (E1, EWTYDDAGETKTFTVT; E2, WEYTGDDATKTETFTV; E3, WTYEGDDATKTETFTV). For each peptide isomer, we generate the folded structures (ß-hairpin and {alpha}-helix), partially folded and random conformations by using high temperature molecular dynamics simulation. The free energy surfaces are constructed by using the free energies of the generated conformers. The free energy terms include molecular mechanics energy, Poisson-Boltzmann electrostatic solvation energy, surface area solvation energy, and conformational entropy estimated by using normal mode analysis.



View larger version (42K):
[in this window]
[in a new window]
 
Figure 1. The sequence, ideal topology, and secondary structure predictions for the G peptide and its related isomers.

 
By examining the changes of the free energy landscape with the sequence alterations, we can investigate the topology–sequence relationship for the G peptide and its related isomers. We find that the energy landscape is dependent not only on the hydrophobic cluster topology but also strongly on the sequence.


    Results and Discussion
 TOP
 Abstract
 Introduction
 Results and Discussion
 Materials and methods
 References
 
Sequences selection and secondary structure predictions
Shifting the hydrophobic cluster along the ß-hairpin strands not only modulates the target ß-hairpin topology but also changes the amino acid sequence of the peptide. There are many possible sequence permutations. Kolinski et al. (1999) also simulated two variants: S1, GWTYEDDATKTTFTVE, and S2, GEDWTYDATKFTVTTE. In the S1 permutation, the hydrophobic cluster is moved outside, toward the end region, while keeping most of the turn (DDATKT) unchanged. In the S2 permutation, the hydrophobic cluster is moved inside, keeping a turn of DATK. Klimov and Thirumalai (2000) tested other possibilities of shifting the hydrophobic cluster two residues toward the turn (M1, GEDTWDYATFTVTKTE) and two residues closer to the ends (M2, WEYTGDDATKTETFTV). In the M1, AT are kept as turn residues, and in M2, DDATKT are still in the turn region, as in S1.

In the present study, we adopt the Klimov and Thirumalai’s permutation (Klimov and Thirumalai 2000; M1 is T2, and M2 is E2; Fig. 1Go) and select other permutations to ensure that we have both similarity and diversity. Our goal is to carry out a meaningful comparison between changes in sequence and topology. Our T1 and T2 are similar to Kolinski’s (1999) S1 and S2, however, with a lesser turn conservation. T3 has more sequence conservation than does T2, with YTW and FTV moving together inside the turn region. E3 has the highest sequence similarity to the wild type, with YTW and FTV moving together outside toward the end, keeping the turn region of DDATKT unchanged.

Changes in the sequence can alter the secondary structure preference of the peptide to different extents. By using the PSIPRED protein structure prediction server (http://bioinf.cs.ucl.ac.uk/psipred; Jones 1999), we tested the secondary structure preference changes upon shifting the hydrophobic cluster along the ß-hairpin strands. For the wild-type G-peptide sequence, the server correctly predicted one of the two ß-strands, with the sequence GEWTYDDAT to be random and KTFTV to be a ß-structure. When shifting the hydrophobic cluster one residue closer to the turn, the resulting peptide (T1) has a similar secondary structure preference as the wild type. However, when shifting two residues closer to the turn (T2 and T3), additional residues are predicted to be in an extended structure, with a turn region. Note that in the T2 and T3 permutations, the turn residues have mostly changed. For the shifting of the hydrophobic cluster one residue closer to the end (E1), the predicted structural preference is again similar to the wild type. For the larger moving steps (two residues closer to the end, E2 and E3), all sequences are predicted to be random (Fig. 1Go), even for the E3 sequence, which has the highest sequence identity compared with the wild-type G peptide.

We also used the PHD server (Rost and Sander 1993). Because the query sequences are too short (16 residues) for the PHD algorithm, we added two XX to our query sequences (e.g., for the native peptide we seek XXGEWTYDDATKTFTVTEXX). The predicted secondary structures are also reported in Figure 1Go. In general, the PHD algorithm predicted more extended structures for the isomers.

The secondary structure predictions only indicate that the permuted sequences may still be able to form ß-strands. However, because the turn propensity is not addressed, their ß-hairpin propensities are unknown from the secondary structure predictions. In general, the stability of a ß-hairpin structure depends on three factors: the turn propensity, the stability of the hydrophobic cluster, and the interactions between backbone hydrogen bonding, dictated by strand propensities. In all permutations, the hydrophobic core appears closely packed. However, the turn and strand propensities change: The T1 and E1 permutations are the most likely to keep the ß-hairpin structure, because their strand propensity is still preferred and the turn region is predicted to be a coiled structure. E2 and E3 also appear to have a good probability to be in a ß-hairpin conformation due to the high conservation of the turn sequences. However, from the PSIPRED prediction, their strand propensities may have changed too much (Fig. 1Go). As for the T2 and T3 sequences, their hydrophobic cores are very close to the turn and are expected to stiffen the turn region. Indeed, both PHD and PSIPRED predict continued extended structures around the turn region, which is unfavorable for a ß-hairpin structure.

The size and position of the HP cluster change the energy landscape
Experiments and computational studies (Veitshans et al. 1997) show that much of the complexity seen in the folding of proteins is captured in peptides with well-defined low-energy structures. Characterization of the folding free energy surface of small peptides may provide an insight into peptide stabilities and folding pathways. Bursulaya and Brooks (1999 Bursulaya and Brooks (2000) examined the potential of mean force (PMF) surface of the three-stranded ß-sheet protein Betanova. They found that the overall shape and location of the global minimum are relatively less sensitive to the solvation models (explicit and implicit GB models) used. In the present study, we explored the free energy landscapes for the wild type and its variants.

Initially, we tested three types of reaction coordinates to characterize the free energy surface of the wild type peptide. Two obvious reaction coordinates are the intrastrand hydrogen bonds and the fraction of the native contacts. In the present study, we use a measure to follow the closeness of the hydrophobic clusters, the HP cluster index (see Materials and Methods). Our concerns are largely the stability of the hydrophobic cluster, how this stability affects the overall peptide topology, and what are the sequence effects. The question is which combination to use: HP cluster with native contacts or with intrastrand hydrogen bonds as reaction coordinates.

As may be seen in Figure 2Go, the free energy surface shows a well-defined global minimum with the HP cluster index and the native contact fraction (Fig. 2A,BGo), centered near a native contact fraction of 0.5 to 1 and the HP index around zero (the smaller the HP index, the more compact the HP cluster). When using the HP index and the number of hydrogen bonds to characterize the free energy surface, we see an L-shaped valley (Fig. 2C,DGo), favoring a more compact HP cluster and, to a lesser extent, hydrogen bonds. The L-shaped distribution is very similar to the potential energy surface previously characterized by Pande et al. (Pande and Rokhsar 1999; Zagrovic et al. 2001) for the wild-type peptide. The free energy surface characterized by the native contact fraction and the hydrogen bonds (Fig. 2E,FGo) is much less defined compared with the native contact fraction and the HP cluster index.



View larger version (57K):
[in this window]
[in a new window]
 
Figure 2. Three dimensional and contour map for the energy landscape of the native G peptide.

 
Experimentally, the folding of the G peptide has negative activation energy (Muñoz et al. 1997). The energy landscape with the HP cluster index and the native contact fraction (Fig. 2AGo) also shows that the folding has no apparent enthalpic barrier. However, there is a possible entropy barrier from the extended state, with a large hydrophobic cluster separation and an absence of native contacts. Note that the energy landscape with the HP cluster index and a native contact fraction forms a rugged funnel for the wild-type peptide. The folding funnel concept (Bryngelson and Wolynes 1989; Frauenfelder et al. 1991; Baldwin 1994; Bozko and Brooks 1995; Karplus et al. 1995; Onuchic et al. 1995; Wolynes et al. 1995; Dill and Chan 1997; Lazaridis and Karplus 1997; Gruebele and Wolynes 1998; Dill 1999) indicates that the most realistic model of two-state proteins is a minimally frustrated heteropolymer with a rugged funnel-like landscape biased toward the native structure (Shea et al. 1999). In constructing an energy landscape, a smooth surface would be preferred, especially when one wants to fold a protein ab initio. In the current study, we are mostly interested in the change of the energy landscape with the sequences. Therefore, we need reaction coordinates that can provide a well-defined fold location and with the changes progressively moving toward the folded region. In this sense, the combination of the HP cluster index and the native contact fraction is ideal for this purpose. For the rest of the isomers studied, we mainly use the HP cluster index and the native contact fraction to characterize the free energy surface.

A comparison of Figure 3AGo (for the T1) with Figure 2AGo shows that when shifting the hydrophobic cluster one residue inside toward the turn region (T1, AGEWTYDDKTFTVTET), the free energy landscape does not change too much. For both the wild type and the T1 isomer, the narrow valley corresponds to the ß-hairpin conformations. As may also be seen in Table 1Go, ß-hairpin structures dominate the top 10 most stable conformers for the wild type and the T1 isomer. These results are consistent with our expectation from secondary structure analysis.



View larger version (69K):
[in this window]
[in a new window]
 
Figure 3. Three-dimensional and contour map for the energy landscape of the G-peptide isomers.

 

View this table:
[in this window]
[in a new window]
 
Table 1. Top rank conformers for the GB1 peptide isomers
 
Continuing to shift the hydrophobic cluster inside, toward the turn region (T2, GEDTWDYATFTVTKTE; T3, GEDDWTYATFTVTKTE), changes the free energy landscape further (Fig. 3B,CGo), with a few isolated valleys corresponding to larger hydrophobic residue separations. No native (ß-hairpin) contacts appear. For the T2 sequence, the energy landscape is diffusive. Indeed, a fully extended structure (Fig. 4BGo; Table 1Go) was identified as the most stable conformer for the T2 isomer. For the T3 isomer, a partially folded {alpha}-helix (at the N-terminal portion; data not shown) is ranked number three (Table 1Go). Note that for the T2 and T3 isomers, the hydrophobic residues are close in sequence (at the turn); therefore, the HP cluster indices are relatively smaller than those of the wild-type G peptide and of the T1 isomer. Still, all three isomers (T1, T2, and T3) share similar landscape features. From the wild type to T1, then T3, and finally T2, we see a progressively changing energy landscape toward less stable ß-hairpin structures. This behavior is due to the change in the sequence reflected in the turn topology and in the position of the HP cluster. The prediction that the T2 should be less stable than the wild type is in accord with the conclusions of Klimov and Thirumalai (2000).



View larger version (24K):
[in this window]
[in a new window]
 
Figure 4. Ribbon structure of top ranking conformers (Table 1Go) for the G-peptide isomers.

 
Shifting the hydrophobic cluster outside toward the end region makes the larger separation of the hydrophobic cluster more energetically accessible. Even just shifting by one residue toward the end region (E1, EWTYDDAGETKTFTVT; Figs. 1C, 3DGoGo) ranks two random conformers in the top three conformations. Still, there is a deep valley centered around the ideal ß-hairpin structure and the most stable conformer is a hairpin structure with a long loop (Table 1Go; Fig. 4DGo). Moving the hydrophobic cluster to the very end (E2, WEYTGDDATKTETFTV; E3, WTYEGDDATKTETFTV) also leads to an interesting free energy landscape (Fig. 3E,FGo). For the E2 isomer, a completely folded {alpha}-helix structure was the most stable conformer (Fig. 4EGo). There is only a small valley corresponding to the ideal ß-hairpin structure. Even though the ß-hairpin structure appears in the top 10 conformers, the top-ranking conformers are dominated by extended and random structures (Table 1Go).

The change in the HP cluster size also provides insight into its role in stabilizing the ß-hairpin structure. We investigated the energy landscape of three GB1 peptide variants, replacing W43 by Gly, replacing W43 and V54 by Gly, and mutating Y44 and F53 to Gly (Fig. 5Go). In our previous study (Ma and Nussinov 2000), we named the three variants P4, P7, and P8, respectively. The P4 peptide still has a similar landscape as the wild type. However, when an additional hydrophobic residue is mutated out (P7; Fig. 5BGo), conformations with large hydrophobic separations (Y–F for P7; and W–V for P8) become energetically accessible.



View larger version (47K):
[in this window]
[in a new window]
 
Figure 5. Three-dimensional and contour map for the energy landscape of G-peptide mutants P4 (A), P7(B), and P8 (C).

 
Force field parameters (Zhou et al. 2001; Yeh and Hummer 2002) and limitations of finite time scale calculations may affect the relative stabilities of ß-hairpin sequences. Nevertheless, the qualitative prediction that for some of the sequences the stability of the ß-hairpin is compromised can be tested by experiment. Clearly, the topology of the energy landscape is less sensitive to the force field than are specific conformations. In the study of the PMF surface of a three-stranded ß-sheet protein Betanova, Bursulaya and Brooks (1999 Betanova, Bursulaya and Brooks (2000) concluded that the overall shape and location of the global minimum are relatively less sensitive to the solvation models (explicit and implicit GB models) used.

Correlation of the free energy landscape and peptide conformation dynamics
Molecular dynamics simulations of the wild-type G peptide have been reported extensively in the literature (Dinner et al. 1999; Kolinski et al. 1999; Pande and Rokhsar 1999; Roccatano et al. 1999; Bryant et al. 2000; Guo et al. 2000; Klimov and Thirumalai 2000; Ma and Nussinov 2000; Garcia and Sanbonmatsu 2001; Lee and Shin 2001; Zagrovic et al. 2001), as well as the simulations of P4, P7, and P8 peptides (Ma and Nussinov 2000). In the present study, we have also conducted MD simulations for the six isomers (T1, T2, T3, E1, E2, and E3). For each isomer, two independent MD simulations are run with slightly different ideal ß-hairpin starting geometries.

The trajectories of the hydrophobic cluster index are reported in Figure 6Go. In general, shifting the HP cluster by just one residue toward the inside (T1) or outside (E1) can still keep a stable ß-hairpin structure, whereas no stable ß-hairpin structure is observed in the course of the simulations for T2, T3, E2, and E3. Such conformational dynamics are consistent with what we can expect from their free energy landscapes. The energy landscape for T2 (Fig. 3BGo) shows a broad distribution of low-lying conformers. For the E2 isomer, there is only a small valley for the compact HP cluster, with most low-lying conformers located in other regions (Fig. 3FGo). In the MD simulations, the starting ß-hairpin structures for both the T2 and E2 isomers quickly changed into random states, which also show a large separation between the hydrophobic residues (Fig. 6Go, green lines).



View larger version (56K):
[in this window]
[in a new window]
 
Figure 6. Trajectories of hydrophobic cluster index from molecular dynamics simulations.

 
As mentioned above, a fully folded {alpha}-helix was found to be the lowest lying conformer for the E2 isomer. We also simulated the stability of the {alpha}-helical E2 isomer. At the 278 K (1.4 nsec) and, subsequently, the 300 K (1 nsec) simulations, the central portion of the E2 isomer keeps the {alpha}-helical structure well, whereas both the N- and C-terminal residues fluctuate.

The hydrophobic clusters of the T3 and E3 isomers are relatively more compact than those of the T2 and E2 isomers. The sole difference between the T2 and T3 conformers (and between the E2 and E3 conformers) is that T3 and E3 retain slightly more of the wild-type character (i.e., WTY matches FTV in T3 and E3) than do T2 and E2 (Fig. 1Go). However, as may be seen in Table 1Go and Figure 3Go, the absolute free energies of these isomers are all much higher than that of the wild-type peptide. These elevated free energy surfaces lead to unstable ß-hairpin structures in the T3 and E3 isomers.

In molecular dynamics simulations, kinetic barriers often trap specific conformations. This may be the case for our simulations of the E1 and P8 peptides. Examination of the energy landscape for the E1 isomer (Fig. 3DGo) indicates large conformational fluctuation. However, the MD simulation shows a consistent ß-hairpin structure and a well-defined hydrophobic cluster (Fig. 6BGo, blue lines). In our previous simulations of the P4, P7, and P8 peptides (Ma and Nussinov 2000), we found that P4 and P8 are consistently stable, whereas P7 showed an unfolding and refolding transition. Experimental evidence indicated that P4 might be stable in solution (experiment W43A mutation; see Kobayashi et al. 2000; P4 is the W43G mutant), whereas P8 should be random in solution. Examination of the energy landscapes for the P4, P7, and P8 (Fig. 5Go) peptides indicates that P4 (with a funnel-like landscape similar to the wild type) is stable in solution, whereas P8 may be random, consistent with experimental studies. Apparently, the previous simulations of P8 were caught in a kinetic trap, and the additional simulations with multiple trajectories may have revealed its true stability.

Conclusions
Through a study of two peptides with similar topologies (three-stranded antiparallel ß-sheet) and a low sequence identity (15%), Ferrara and Caflish (2001) have shown that the overall shape of the free-energy surface is defined primarily by the native-state topology, whereas the sequence determines the statistically predominant order of the events. In our present study, we have revealed the gradual changes of the free energy surface with the size and position of the hydrophobic cluster for the G-peptide variants.

Shifting the HP cluster along the ß-strands introduces sequence changes as well. As for the six isomers studied, two (T1 and E1) share high sequence identity with the wild-type G peptide (87.5%). Although shifting the HP cluster one residue inside toward the turn region (T1) preserves the characteristics of the energy landscape the most, the landscape changes dramatically when the HP cluster shifts one residue outside toward the end region (E1). Isomers T2, T3, E2, and E3 share little sequence identity with the G peptide. In terms of topology, all these sequences can form a ß-hairpin structure with a variable position of the HP cluster with respect to the turn and end regions. Indeed, the energy landscape for T2, T3, and E3 are also topologically similar to the wild-type G peptide (Fig. 3Go), consistent with the recently increasing general agreement regarding the role of native topology. However, we also see the critical effects of the sequence, best illustrated by the energy landscape of the E2 isomer. Six central residues (D46–T51) in the E2 and E3 isomers are identical to those of the turn region of the G peptide, and E2 and E3 isomers only differ by an exchange of the residue positions of E42 and T44. Yet, the energy landscape for E2 is unique and strongly prefers an {alpha}-helical conformation instead of the dominant ß-hairpin of the related isomers.

So then, how to understand the changes in the energy landscape? Our present study indicates that the key is the relative conformational energies of the different conformations. That is, changes in the sequence modulate the relative stabilities of the different topologies rather than redefine the topology space. In this regard, our current simulations yield two important observations. First, for all variants studied, ideal native ß-hairpin topologies remain as minima (regardless of whether global or local) in the energy landscape. Even for the most different landscape of the E2 isomer, the ideal ß-hairpin region has not vanished. This is especially interesting when we note that the free energies of all isomers are higher than that of the wild type (Table 1Go). This indicates that the entire energy landscape shifts while still preserving the topology space. Second, when the hydrophobic cluster size changes (Fig. 5Go), the near ideal ß-hairpin energy region does not change too much, whereas the other regions change dramatically. Previously, we have demonstrated the stabilizing role of the HP cluster for the G peptide. Here we further notice that the force keeping the hydrophobic residues together is not because they are attractive, that is, leading to deeper minima. Rather, it acts through destabilizing conformations with larger separations of the hydrophobic residues. The Z scores for the alternative sequences are perhaps less than that for the wild type. Clearly, this type of behavior is expected given the nature of hydrophobic interactions. Nevertheless, we see a gradual modification of the energy landscape by sequence and topology.

The recent work of Zhou et al. (2001) does not show evidence for helical intermediates. However, two other MD simulations have indicated that the native G peptide already has a tendency to form an {alpha}-helical structure. By using a replica-exchange approach that combines MD trajectories with a temperature exchange Monte Carlo process, Garcia and Sanbonmatshu (2001) explored the energy landscape of the G peptide and located many helical structures. Semi-helical intermediates are also found in the folding pathway of the G peptide (Zagrovic et al. 2001). Therefore, the strong helical preference of the E2 isomer is simply a modification of the existing energy landscape.

Thus, the sequence changes do not lead to a complete redefinition of the topological space. Instead, they modulate the existing energy landscape, consistent with a population redistribution in the process of protein folding and binding (Ma et al. 2001, 2002). The ß-sheet structure reflects an interplay of many weak interactions (Searle 2001). A small perturbation may cause a large population shift. Consistent with the experimental shifting of a hydrophobic cluster along the ß-strands by Gellman and his colleagues (Espinosa et al. 2001), our energy landscape study and MD simulations also show that a smaller separation between the loop and the HP cluster tends to retain the wild-type energy landscape and thus has a greater stability. However, it appears that the native G-peptide sequence is optimized by nature. Any perturbation shifting the HP cluster leads to a higher energy landscape and lower stability compared with that of the native sequence.

The limited variation of the topological space compared with the number of possible sequence changes, may relate to the observation that the number of known protein folds are far less than the sequential allowance. Recently, Dokholyan et al. (2002) attempted to relate the limited protein folds to the early stage of evolution. Our study indicates a physically based explanation that might not be entirely new: Given a peptide/protein, the topology space is mostly defined by the Ramachandran plot, which defines the basic pool of conformers. For a long sequence, the topology may be extended through additional backbone and side-chain interactions. However, such interactions define the characteristic topologies, with only a limited number being energetically preferred.


    Materials and methods
 TOP
 Abstract
 Introduction
 Results and Discussion
 Materials and methods
 References
 
For each peptide isomer, we generate the folded structures (ß-hairpin and {alpha}-helix), partially folded, and random conformations. The wild G-peptide ß-hairpin structure is taken from the protein structure (Protein Data Bank: 1gb1 [PDB] ). The ß-hairpin structures for the permutated sequences are generated from the wild G peptide by mutating the residues. The {alpha}-helix structures are generated using Insight II modeling package (Accelrys). The partially folded and random structures are generated from molecular dynamics simulations starting from both the ß-hairpin and {alpha}-helical structures. The conformations near the "native" states are taken from MD simulations with explicit waters. The random states are generated with high-temperature MD simulations in the gas phase (with distance-dependent dielectric constant 4*r). The conformations are then clustered by the native contact fraction and the hydrophobic (HP) cluster index. The native contacts are defined from the ideal ß-hairpin structure illustrated in Figure 1Go, with a C{alpha} distance cutoff of 5 Å. The HP cluster index is a measure of the overall contact between the four hydrophobic residues, and is defined by


The distances Rs are as follows: R1, between C{gamma}s of F and Y; R2, between C{gamma}s of F and W; R3, between C{gamma} of W and C{gamma} of V; and R4, between C{gamma} of Y and C{gamma} of V. The superscript o indicates the distances in the native states. Therefore, the smaller the HP index, the more compact the hydrophobic cluster.

The free energy landscapes are constructed by evaluating the free energies for individual conformers. The selected conformers are first subjected to energy minimization with a gradient less than 0.001 kcal/mole/Å with distance-dependent dielectric constant 4*r. Based on the minimized structure, the conformational free energies are evaluated using the formula


where the Egas is the gas phase potential energy, Eelectrostatics is the electrostatic contributions to the solvation energy, Esurface is the cavity and exposed surface effect on the solvation energy, and GGvibration is solute vibrational free energy.

The vibrational free energy for a given conformation is


All calculations are carried out at room temperature (298 K).

The vibrational contribution of the enthalpic and entropic components of the free energy are obtained in the standard way:



where, N is the total number of atoms in the peptide, {rho}i is the vibrational frequency, h is Plank’s constant, K is the Boltzmann constant, and T is the temperature.

Estimation of the entropy contribution to the free energy surface is still a challenge. The G peptide is only marginally stable at most temperatures (Muñoz et al 1997). Thus, the conformational entropy could make a significant contribution to the free energy. Using vibrational entropy to estimate the conformational entropy is a good approximation (Wang et al. 2001). Vibrational energy and entropy are molecular properties and should be included in the energy evaluation. Further, flexible molecules have high conformational entropy due to the high number of energetically accessible states. At the same time, flexible molecules also have more low-energy vibrational states and therefore high vibrational entropy. In folding/unfolding of a peptide, by using a vibrational entropy estimation, we predicted cold denaturation of an {alpha}-helix (Ma et al. 2000). The prediction of this phenomenon was subsequently confirmed by experiments (Kinnear et al. 2002). Even though the entropy estimation may yield some uncertainty in the ordering of the conformations in Table 1Go, the ranking of the native ß-hairpin structure as the most stable conformation for the wild-type G peptide provides confidence in the free energy evaluation of the system studied.

The Egas is evaluated with the CFF91 force field (Maple et al. 1998). The electrostatic solvation energy is obtained by solving the Poisson-Boltzmann equation by using the program Delphi (Gilson et al. 1988; Jayaram et al. 1989). Reaction field energy was used as the electrostatic solvation energy. The atomic radii and partial charges are taken from the CFF91 force field. The dielectric constants used are one for the solute and 80 for the water solvent. Grid spacing is 0.4 Å. The surface effects are calculated (Ma and Nussinov 1999) with Esurface = 0.05(ASAnonpolar - ASApolar). The computation of the polar and nonpolar ASA (accessible surface area) used the method described previously (Tsai and Nussinov 1997).

Molecular dynamics simulations were performed in the canonical ensemble (NVT) with cubic periodic boundary condition by using the program Discover 2.98. The system consisted of a peptide solvated with 1330 water molecules (for the native peptide; the number of water molecules slightly differs for different mutants) placed in a cubic box (with dimensions 35 x 35 x 35 Å3). The effective water density in the solvation box is 1.006 g/cm3. The overall charge of the system is -3. All atoms of the system were considered explicitly, and their interactions were computed by using the CFF91 force field (Maple et al. 1998). The time step in the MD simulations is 1 fsec, and snapshots from the trajectories were saved every 1 psec. The native peptide has the most folded conformation ~273 K, and the midpoint of the unfolding transition is 297 K. In our simulations, the temperature used is 278 K.


    Acknowledgments
 
We thank Dr. Jacob Maizel for encouragement and for helpful discussions. In particular, we would like to thank Dr. D. Thirumalai for suggesting a study of the location of the hydrophobic cluster in ß-hairpins and its effects on hairpin stability. We further thank him for discussions, encouragement, and critical reading of the manuscript. The computation times are provided by the National Cancer Institute Frederick Advanced Biomedical Supercomputing Center. The research of R.N. in Israel has been supported in part by the Center of Excellence in Geometric Computing and its Applications funded by the Israel Science Foundation (administered by the Israel Academy of Sciences), by the Adams Brain Center, and by internal grants of Tel Aviv University. This project has been funded in whole or in part with federal funds from the National Cancer Institute, NIH, under contract number NO1-CO-12400. The content of this publication does not necessarily reflect the view or policies of the Department of Health and Human Services, nor does mention of trade names, commercial products, or organization imply endorsement by the U.S. Government.

The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.


    References
 TOP
 Abstract
 Introduction
 Results and Discussion
 Materials and methods
 References
 
Baldwin, R.L. 1994. Matching speed and stability. Nature 369: 183–184.[CrossRef][Medline]

Blanco, F., Rivas, G., and Serrano, L. 1994. A short linear peptide that folds into a native stable ß-hairpin in aqueous solution. Struct. Biol. 1: 584–590.

Bozko, E.M. and Brooks III, C.L. 1995. First principles calculation of the folding free energy for a three helix bundle protein. Science 269: 393–396.[Abstract/Free Full Text]

Bryant, Z., Pande, V.S., and Rokhsar, D.S. 2000. Mechanical unfolding of a ß-hairpin using molecular dynamics. Biophys. J. 78: 584–589.[Abstract/Free Full Text]

Bryngelson, J.D. and Wolynes, P.G. 1989. Intermediates and barrier crossing in a random energy model (with applications to protein folding). J. Phys. Chem. 93: 6902–6915.[CrossRef]

Bursulaya, B.D. and Brooks, C.L. 1999. Folding free energy surface of three-stranded ß-sheet protein. J. Am. Chem. Soc. 121: 9947–9951.[CrossRef]

———. 2000. Comparative study of the folding free energy landscape of a three-stranded ß-sheet protein with explicit and implicit solvent models. J. Phys. Chem. 104: 12378–12383.

Cochran, A.G., Skelton, N.J., and Starovasnik, M.A. 2001. Tryptophan zippers: Stable, monomeric ß-hairpins. Proc. Natl. Acad. Sci. 98: 5578–5583.[Abstract/Free Full Text]

Dill, K.A. 1999. Polymer principles and protein folding. Protein Sci. 8: 1166–1180.[Abstract]

Dill, K.A. and Chan, H.S. 1997. From Levinthal to pathways to funnels. Nat. Struct. Biol. 4: 10–19.[CrossRef][Medline]

Dinner, A.R., Lazaridis, T., and Karplus, M. 1999. Understanding ß-hairpin formation. Proc. Natl. Acad. Sci. 96: 9068–9073.[Abstract/Free Full Text]

Dokholyan, N.V., Shakhnovich, B., and Shakhnovich, E.I. 2002. Expanding protein universe and its origin from the biological big bang. Proc. Natl. Acad. Sci. 99: 5578.

Espinosa, J.F., Munoz, V., and Gellman, S.H. 2001. Interplay between hydrophobic cluster and loop propensity in ß-hairpin formation. J. Mol. Biol. 306: 397–402.[CrossRef][Medline]

Ferrara, P. and Caflisch, A. 2001. Native topology or specific interactions: What is more important for protein folding? J. Mol. Biol. 306: 837–850.[CrossRef][Medline]

Frauenfelder, H., Sligar, S.G., and Wolynes, P.G. 1991. The energy landscapes and motions of proteins. Science 254: 1598–1603.[Abstract/Free Full Text]

Garcia, A.E. and Sanbonmatsu, K.Y. 2001. Exploring the energy landscape of a ß hairpin in explicit solvent. Proteins 42: 345–354.[CrossRef][Medline]

Gilson, M.K., Sharp, K.A., and Honig, B.H. 1988. Calculating the electrostatic potential of molecules in solution: Method and error assessment. J. Comput. Chem. 9: 327–335.[CrossRef]

Gronenborn, A.M., Filpula, D.R., Essig, N.Z., Achari, A., Whitlow, M., Wingfield, P.T., and Clore, G.M. 1991. A novel, highly stable fold of the immunoglobulin binding domain of streptococcal protein G. Science 253: 657.[Abstract/Free Full Text]

Gruebele, M. and Wolynes, P. 1998. Satisfying turns in folding transitions. Nat. Struct. Biol. 5: 662–665.[CrossRef][Medline]

Guo, C., Levine, H., and Kessler, D.A. 2000. How does a ß-hairpin fold/unfold? Competition between topology and heterogeneity in a solvable model. Proc. Natl. Acad. Sci. 97: 10775–10779.[Abstract/Free Full Text]

Honda, S., Kobayashi, N., and Munekata, E. 2000. Thermodynamics of a ß-hairpin structure: Evidence for cooperative formation of folding nucleus. J. Mol. Biol. 295: 269–278.[CrossRef][Medline]

Jayaram, B., Sharp, K.A., and Honig, B.H. 1989. The electrostatic potential of B-DNA. Biopolymers 28: 975–993.[CrossRef][Medline]

Jones, D.T. 1999. Protein secondary structure prediction based on position-specific scoring matrices. J. Mol. Biol. 292: 195–202.[CrossRef][Medline]

Karplus, M., Sali, A., and Shakhnovitch, E. 1995. Kinetics of protein folding. Nature 373: 664–665.[CrossRef][Medline]

Kinnear, B.S, Hartings, M.R, and Jarrold, M.F. 2002. The energy landscape of unsolvated peptides: Helix formation and cold denaturation in Ac-A4G7A4 + H+. J. Am. Chem. Soc. 124: 4422–4431.[CrossRef][Medline]

Klimov, D.K. and Thirumalai, D. 2000. Mechanisms and kinetics of ß-hairpin formation. Proc. Natl. Acad. Sci. 97: 2544–2549.[Abstract/Free Full Text]

Kobayashi, N., Honda, S., Yoshii, H., and Munekata, E. 2000. Role of side-chains in the cooperative ß-hairpin folding of the short C-terminal fragment derived from streptococcal protein G. Biochemistry 39: 6564–6571.[CrossRef][Medline]

Koehl, P. and Levitt, M. 2002. Protein topology and stability define the space of allowed sequences. Proc. Natl. Acad. Sci. 99: 1280–1285.[Abstract/Free Full Text]

Kolinski, A., Ilkowski, B., and Skolnick, J. 1999. Dynamics and thermodynamics of ß-hairpin assembly: Insights from various simulation techniques. Biophys. J. 77: 2942–2952.[Abstract/Free Full Text]

Lazaridis, T. and Karplus, M. 1997. "New view" of protein folding reconciled with the old through multiple unfolding simulations. Science 278: 1928–1931.[Abstract/Free Full Text]

Lee, J. and Shin, S. 2001. Understanding ß-hairpin formation by molecular dynamics simulations of unfolding. Biophys. J. 81: 2507–2516.[Abstract/Free Full Text]

Ma, B. and Nussinov, R. 1999. Explicit and implicit water simulations of a ß-hairpin peptide. Proteins 37: 73–87.[Medline]

———. 2000. Molecular dynamics simulations of a ß-hairpin fragment of protein G: Balance between side-chain and backbone forces. J. Mol. Biol. 296: 1091–1104.[CrossRef][Medline]

Ma, B., Tsai, C.-J., and Nussinov, R. 2000. Protein folding energy landscape and vibrational entropy: A systematic study. Biophys. J. 79: 2739–2753.[Abstract/Free Full Text]

Ma, B., Wolfson, H.J., and Nussinov, R. 2001. Protein functional epitopes: Hot spots, dynamics and combinatorial libraries. Curr. Opin. Struct. Biol. 11: 364–369.[CrossRef][Medline]

Ma, B., Shatsky, M., Wolfson, H.J., and Nussinov, R. 2002. Multiple diverse ligands binding at a single protein site: A matter of pre-existing populations. Protein Sci. 11: 184–197.[Abstract/Free Full Text]

Maple, J.R., Hwang, M.J., Jalkanen, K.J., Stockfisch, T.P., and Hagler, A.T. 1998. Derivation of class II force fields: V quantum force field for amides, peptides, and related compound. J. Comput. Chem. 19: 430–458.[CrossRef]

Muñoz, V., Thompson, P.A., Hofrichter, J.A., and Eaton, W.A. 1997. Folding dynamics and mechanism of ß-hairpin formation. Nature 390: 196–199.[CrossRef][Medline]

Muñoz, V., Henry, E.R., Hofrichter, J.A., and Eaton, W.A. 1998. A statistical mechanical model for ß-hairpin kinetics. Proc. Natl. Acad. Sci. 95: 5872–5879.[Abstract/Free Full Text]

Onuchic, J.N., Wolynes, P.G., Luthey-Schulten, Z., and Socci, N.D. 1995. Towards an outline of the topography of a realistic protein folding funnel. Proc. Natl. Acad. Sci. 92: 3626–3630.[Abstract/Free Full Text]

Pande, V.S. and Rokhsar, D.S. 1999. Molecular dynamics simulations of unfolding and refolding of ß hairpin fragments of protein G. Proc. Natl. Acad. Sci. 96: 9062–9067.[Abstract/Free Full Text]

Parisi, G. and Echave, J. 2001. Structural constraints and emergence of sequence patterns in protein evolution. Mol. Biol. Evol. 18: 750–756.[Abstract/Free Full Text]

Roccatano, D., Amadei, A., Nola, A.D., and Berendsen, H.J.C. 1999. A molecular dynamics study of the 41–56 ß-hairpin from B1 domain of protein G. Protein Sci. 8: 2130–2143.[Abstract]

Rost, B. and Sander, C. 1993. Prediction of protein secondary structure at better than 70% accuracy. J. Mol. Biol. 232: 584–599.[CrossRef][Medline]

Santiveri, C.M., Rico, M., and Jimenez, M.A. 2000. Position effect of cross-strand side-chain interaction on ß-hairpin formation. Protein Sci. 9: 2151–2160.[Abstract]

Searle, M.S. 2001. Peptide models of protein ß-sheet: Design, folding and insights into stabilizing weak interactions. J. Chem. Soc. Perkin Trans. 2: 1011–1020.

Shea, J.E., Onuchic, J.N., and Brooks III, C.L. 1999. Exploring the origins of topological frustration: design of a minimally frustrated model of fragment B of protein A. Proc. Natl. Acad. Sci. 96: 12512–12517.[Abstract/Free Full Text]

Tsai, C.-J. and Nussinov, R. 1997. Hydrophobic folding units derived from dissimilar monomer structures and their interactions. Protein Sci. 6: 24–42.[Abstract]

Veitshans, T., Klimov, D., and Thirumalai, D. 1997. Protein folding kinetics: Timescales, pathways and energy landscapes in terms of sequence-dependent properties. Fold Des. 2: 1–22.[CrossRef][Medline]

Wang, J., Morin, P., Wang, W., and Kollman, P.A. 2001. Use of MM-PBSA in reproducing the binding Freee Energies to HIV-1 RT of TIBO derivatives and predicting the binding mode to HIV-1 RT of efavirenz by docking and MM-PBSA. J. Am. Chem. Soc. 123: 5521–5230.

Wolynes, P.G., Onuchic, J.N., and Thirumalai, D. 1995. Navigating the folding routes. Science 267: 1619–1620.[Free Full Text]

Yeh, I.C. and Hummer, G. 2002. Peptide loop-closure kinetics from microsecond molecular dynamics simulations in explicit solvent. J. Am. Chem. Soc. 124: 6563–6568.[CrossRef][Medline]

Zagrovic, B., Sorin, E.J., and Pande, V. 2001. ß-Hairpin folding simulations in atomistic detail using an implicit solvent model. J. Mol. Biol. 313: 151–169.[CrossRef][Medline]

Zhou, R., Berne, B.J., and Germain, R. 2001. The free energy landscape for ß hairpin folding in explicit water. Proc. Natl. Acad. Sci. 98: 14931–14936.[Abstract/Free Full Text]

Zhou, Y. and Linhananta, A. 2002. Role of hydrophilic and hydrophobic contacts in folding of the second ß-hairpin fragment of protein G: Molecular dynamics simulation studies of an all-atom model. Proteins 47: 154–162.[CrossRef][Medline]


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?


This article has been cited by other articles:


Home page
Biophys. JHome page
B. Ma and R. Nussinov
The Stability of Monomeric Intermediates Controls Amyloid Formation: A{beta}25-35 and its N27Q Mutant
Biophys. J., May 15, 2006; 90(10): 3365 - 3374.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
D. Du, Y. Zhu, C.-Y. Huang, and F. Gai
Understanding the key factors that control the rate of {beta}-hairpin folding
PNAS, November 9, 2004; 101(45): 15915 - 15920.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Ma, B.
Right arrow Articles by Nussinov, R.
Right arrow Search for Related Content