|
|
||||||||
1 Department of Chemical Engineering, University of California, Berkeley, California 94720-1462, USA
2 Chemical Sciences Division and
3 Earth Sciences Division, Lawrence Berkeley National Laboratory, Berkeley, California 94720, USA
Reprint requests to: Clayton J. Radke, Department of Chemical Engineering, University of California, Berkeley, CA 94720-1462, USA; e-mail: radke{at}cchem.berkeley.edu; fax: (510) 642-4778.
| Abstract |
|---|
|
|
|---|
. Changes in the interaction energies strongly affect many protein properties. We present an optimized energy parameter set for best representing realistic behavior typical for many proteins (fast folding and high cooperativity for single chains). Our optimal parameters feature a much weaker hydrophobicity contrast and mean attraction than does the original interaction scale. The proposed interaction scale is designed for calculating the behavior of proteins in bulk and at interfaces as a function of solvent characteristics, as well as protein size and sequence. Keywords: Lattice simulation; interaction energies; protein folding; aggregation; solvation
Article and publication are at http://www.proteinscience.org/cgi/doi/10.1110/ps.03198204.
| Introduction |
|---|
|
|
|---|
The biological function of a protein is coupled strongly to its conformation or tertiary structure. For native proteins, this conformation is believed to correspond to the global minimum of free energy (Asherie et al. 1998; Fink 1998; Harrison et al. 1999). The order of free-energy levels of different conformations can change in the presence of other molecules (macromolecules such as chaperones or amyloids, or small molecules such as urea or sugars) and in the presence of interfaces (Srebnik et al. 1998; Anderson et al. 2000). A change in conformation usually restrains the biological function of a protein and may lead to precipitation (Asherie et al. 1998). Conformation change is relevant for the production (Georgiou and Bowden 1990), formulation, and application (Costantino et al. 1995) of protein-containing drugs and nutrition supplements. Aggregation also plays a crucial role in several diseases, including Alzheimers (Mitraki and King 1989; Wetzel 1994, 1996; Janicke 1995; Silow and Oliveberg 1997; Fink 1998) and bovine spongiform encephalopathy (BSE)/scrapie (Prusiner 1991; Cohen et al. 1994; Prusiner and DeArmond 1994). Despite the importance of protein aggregation, much remains to be learned.
In the vast majority of previous folding and aggregation studies, one of three interaction-energy models is used: First, in the Go model (1983), two amino acid beads interact with energy -
if they are in contact in the native state; otherwise, the interaction energy is zero. Many studies (Du et al. 1999; Pande and Rokhsar 1999; Bratko and Blanch 2001, 2003; Cheung et al. 2002), among others, adopt the Go model to study protein folding and aggregation in bulk water. In the Go model, each bead is unique and attracts only those beads that are neighbors in the native state. For a lattice protein of N beads, the Go model is an N-letter model with a very high specificity. In a Go-model protein, more different amino acid beads can appear than in nature. Second, the HP model contains two different kinds of amino acid beads: hydrophobic and polar (Lau and Dill 1989). In the original HP model, only the HH attraction is different from zero. In later studies, nonzero HP and PP interactions have also been investigated (Li et al. 1996; Gupta et al. 1998; Hirst 1999; Giugliarelli et al. 2000; Crippen and Chhajer 2002). The HP model belongs to the class of two-letter models because it uses two different kinds of beads. In the HP model, no different strengths of hydrophobicity and no specific interactions between certain pairs of amino acids exist. Third, the more complex Miyazawa-Jernigan model (MJ; Miyazawa and Jernigan 1985) is a 20-letter model used, for example, by Tiana et al. (1998), Broglia et al. (1998), Anderson et al. (2000), and Mirny et al. (1998). It has the advantage of providing experimentally determined interaction energies for all 20 amino acids, allowing for different degrees of hydrophobicity and polarity and specific amino acid interactions. For example, the self-interaction of charged amino acids is very weak, but oppositely charged amino acids attract each other quite strongly. Our interaction-energy model is an extension of the MJ scheme. Miyazawa and Jernigan later (1996) refined their interaction-energy set based on a larger amalgam of experimental data. Interaction energies for most amino acids changed only slightly. We adopt the original interaction set in this work because the new MJ potentials contain a nonpairwise-additive packing term that makes simulation computationally demanding and that makes isolation of energy-scale effects more difficult.
| The MJ interaction model |
|---|
|
|
|---|
Miyazawa and Jernigan (1985) made several approximations in the process of deriving their interaction energies from X-raystructure contact frequencies. As a result, the average amino acidsolvent contact energy is about twice as large as almost all of those from previous estimates of hydrophobic energies.
Because of the unrealistic difference in hydrophobic energies and because of other results (see below) reported by several investigators, we reconsider the MJ interaction model. Our re-examination is motivated by the following: First, aggregation studies with the MJ energy scale (Broglia et al. 1998) for two 36-mers in water showed that irreversible aggregation occurs for all amino acid sequences studied whenever two strongly attractive sites of the stable cores of the chains come into contact during the folding process. However, in nature, this irreversible aggregation seems to be the exception rather than the rule for proteins under physiological conditions (West et al. 1999). Second, adsorption studies of a 27-mer at an oilwater interface (Anderson et al. 2000) showed reasonable physical behavior of a single chain in the bulk (reversible folding) and at the interface (irreversible adsorption), when a hydrophilic amino acid (histidine) was chosen to represent water and a hydrophobic amino acid (glycine) represented oil. However, by using these same assignments for "water" and "oil," our present simulations with two identical chains exhibit irreversible aggregation in the bulk and a very weak, reversible aggregation at the inferface. Because this calculated result is essentially the opposite of what we would expect from experimental data (Cascão Pereira et al. 2002, 2003), we want to examine the effect of changes in the pertinent energy parameters. Third, Giugliarelli et al. (2000) performed two-dimensional (2D) lattice MC simulations with a two-letter HP model to study the effect of interaction energies on the formation of compact states (states with minimal surface) or incompact native states for single chains and for bulk aggregation. These investigators found three classes of protein-like sequences: nonaggregating, aggregating in the native state (crystallizing), and prion-like aggregates. For a given amino acid sequence or composition, the type of behavior depends strongly on the interaction energies. Fourth, Gutin et al. (1995) studied the effect of mean attraction in three dimensions, not of a contrast parameter. Our effort focuses on the roles of a mean attraction and of the degree of contrast among solventhydrophilic interactions and solventhydrophobic interactions. Fifth and finally, Booth et al. (1997) found experimentally that lysozyme, which is known not to form any aggregates in vivo, does aggregate in vitro under appropriate conditions. The literature indicates that almost any protein can form aggregates under some conditions (Booth et al. 1997), such as the temperature, solvent pH, and salt concentration, or the presence of co- or antisolvents. These solvent characteristics can, in principle, be modeled with lattice simulations by the use of effective amino acidsolvent energy parameters.
| A new interaction model |
|---|
|
|
|---|
To elucidate the basis of our interaction-energy model, we recall some features of lattice simulations and the associated characteristic energies. Consider the exchange of the bond between a pair of two identical amino acid sites (Ai)2 and the bond between two solvent molecules, S, to form a bond between the amino acid site and the solvent AiS:
![]() |
The energy difference for this exchange reaction is obtained from the energies of the pertinent species: 2Ei0 - E00 - Eii (for i = j, see equation 5a in Miyazawa and Jernigan 1985). Ei0 is the absolute interaction energy between amino acid i and the solvent, E00 is the absolute interaction energy between two solvent sites, and Eii is the absolute interaction energy between two amino acids of the same type i. One half of this reaction energy is known as exchange energy,
i. Miyazawa and Jernigan present relative energies denoted by a small e instead of absolute energies (denoted by capital E). Clearly, the definition of the exchange energy holds for the relative as well as for the absolute energies. Because e00 is set to zero by the rescaling transformation in the original MJ energy scale, as well as in our energy scale,
i = ei0 -
eii. Therefore, it is the exchange energies
i, and not the amino acidsolvent interaction energies, ei0, that are the distinctive energies determining whether the amino acidsolvent interaction is effectively attractive (
i < 0) or repulsive (
i > 0). Nevertheless, we still present the amino acid~solvent interaction energies ei0 because they are the quantities that are used in the simulations.
We propose a new expression for the interaction energy ei0 to model the solvent properties,
![]() | (1) |
or equivalently:
![]() | (2) |
where eii and all other interaction energies eij are taken from Table 5 of Miyazawa and Jernigan (1985). Two parameters arise in equations 1
and 2
. The parameter
determines how well the solvent likes amino acids in general. Strongly negative values promote unraveled forms of the protein, whereas strongly positive values stabilize compact forms. Cs is a contrast parameter. If it is close to zero, the solvent solvates hydrophobic amino acids approximately as well (or as poorly) as hydrophilic amino acids, because in this case
. A positive Cs value means that the solvent prefers hydrophilic acids over hydrophobic ones (i.e.,
i is relatively more negative for the hydrophilic residues). As Cs is raised toward unity, the solvent more strongly prefers the hydrophilic acids relative to the hydrophobic acids. Consequently, the contrast between the hydrophilic and hydrophobic sites is magnified. With Cs < 0, oil-like solvents can be modeled, because in this case, contacts between the solvent and hydrophobic amino acids are preferred relative to the hydrophilic amino acids. When Cs = 1 and
, the original MJ model is recovered.
To avoid the use of explicit amino acid solvent interaction energies, thereby fascilitating fast simulations, the matrix transformation described by Miyazawa and Jernigan (1985) can be applied easily:
![]() | (3) |
where
denotes the new relative interaction energies for the implicit-solvent simulations. However, if simulations with more than one solvent are performed (e.g., to study proteins at a fluid phase interface), the solvents can no longer be considered implicitly. Because our simulation code is designed to perform later such interface simulations, we use explicit amino acidsolvent interaction energies here. In multiple-solvent simulations, three kinds of solvent parameters are necessary: Csi and
are single-solvent parameters, and e0i0j is the binary solventsolvent interaction energy for solvents i and j. All solventsolvent self-interaction energies e0i0i can be set to zero.
The sign of
does not reveal whether the amino acidsolvent interaction is attractive or repulsive until Cs is considered as well. Therefore, we eliminate
in favor of the perhaps physically more meaningful parameter:
![]() | (4) |
or
![]() | (5) |
where the sum runs over the 20 types of amino acids; that is, n = 20.
When
is negative, amino acids interact on average attractively with the solvent. When
is positive, they repel the solvent and effectively attract each other. Because it is more convenient to think in terms of amino acid interactions than in terms of amino acidsolvent interactions in the rest of this article, we call
the "effective amino acidamino acid mean attraction" or briefly "mean attraction." Although based on exchange energies, a positive
means an effective amino acid attraction in this context.
Equation 5
is substituted into equations 1
and 2
to eliminate
:
![]() | (6) |
or
![]() | (7) |
To illustrate the meaning of the two energy-scale parameters Cs and
, Figure 1
graphs the exchange energy
i versus the amino acid~amino acid self-interaction energy eii. Results are shown for values of Cs and
that we find to be typical for relatively fast folding interaction scales of our 27-mer protein. For comparison, data for the original MJ model are also shown as closed circles. Open circles and diamonds show the effect of Cs when
is approximately constant (-
Cs is the slope). As the slope becomes more negative, there are increasing differences in hydrophobicity among the 20 amino acids. Open circles and squares demonstrate different values of
with the same
. A high-lying data set means a strong mean attraction and features compact states, whereas a low-lying data set with less mean attraction promotes noncompact states. Figure 1
also shows that the amino acids are not evenly spaced along the
i energy scale but rather fall into groups of different polarity, as they do in the eii scale of the MJ model. This relative polarity grouping of the MJ scale is also present in our interaction energy model. In the MJ scale (filled circles), the contrast parameter (Cs = 1) and the mean attraction
are much stronger than those in the parameter sets with fastest folding and highest cooperativity properties. Only 19 symbols per data set can be seen in the figure because glutamine and proline have exactly the same self-interaction energy.
|
T, that dictates this transition. A small number for
T/Tm, where Tm is the midpoint temperature, means that the structure change from the unfolded to folded state takes place in a narrow temperature range. Therefore, the transition is sharp and the cooperativity is high when
T/Tm is small (
T/Tm and Tm are defined quantitatively in Materials and Methods). Thus, to compare the model 27-mer protein with actual proteins, we examine the effect of our energy-parameter space (i.e., Cs and
) on folding speed and cooperativity. | Results |
|---|
|
|
|---|
with respect to folding speed and cooperativity. The mean attraction necessary for a strong cooperativity is slightly higher than that for fastest refolding.
|
|
on folding speed and cooperativity, whereas precise values for these parameters are given in Table 1
T/Tm, by the size of diamonds. Small symbols denote high folding speed and strong cooperativity and are, therefore, desireable. For each given contrast parameter, Cs, we find an optimal value of
for the folding speed and a slightly higher optimum value for the cooperativity. The reason for this is as follows: If the mean attraction is not too strong
, some of the hydrophilic amino acids interact more favorably with water than with any amino acid. When there are only a few such amino acids and these beads are not in terminal positions, the maximum compact state (Fig. 2
. Here some of the terminal beads are completely immersed in water. The free-energy barriers between the fully folded and the unfolded states are very low at the midpoint temperature, and folding is very fast. Under these conditions, significant cooling is required to condense the chain to the fully folded state; in some cases, even a very low temperature cannot achieve condensation. Table 2
, the chain is basically folded, whereas simulations with a weaker mean attraction show much less native structure. The opposite is true for a temperature above the mean temperature: The chain unfolds well when the mean attraction is weak and maintains many contacts when the mean attraction is large. These two effects reflect the decreased cooperativity for too weak or too strong mean attractions.
|
|
|
, there is always a high energy barrier for a transition from one relatively compact state to another one because, in between, there is a less compact transition state that limits refolding speed. Also, the unfolded state is more compact than that for a weak mean attraction; the system must be heated substantially to unfold completely. These two effects decrease the cooperativity. For a strong mean attraction, the number of total contacts at the midpoint temperature is substantially higher than for a weak mean attraction, whereas the number of native contacts is always 14, by definition. This means that more nonnative contacts are formed in the case of a high mean attraction, or in other words, in this case, the interactions are less specific.
Table 1
and Figure 3
show that there is a single optimal value of Cs for fast folding and strong cooperativity. It is ~0.2. If Cs is larger, transition states with an exposed hydrophobic amino acid are too unfavorable for fast folding. Conversely, because there is too little energy difference between hydrophilic residueswater and hydrophobic residueswater interactions, small Cs values provide only a weak force to drive hydrophobic amino acids into the core of the folded protein.
Figures 5
and 6
illustrate the difference between a simulation with a strong
and a weak
mean attraction by plotting the number of native contacts, Nnat, versus the number of MC steps. Both simulations are performed at Tm with the contrast parameter that features fastest folding (Cs = 0.2). If the mean attraction is moderately strong (cf. Fig. 5
), the maximum compact state is the most stable state. This can be seen from the high density of states with 28 native contacts. The folded state is well separated from the unfolded state in terms of native contacts, and the cooperativity is high. If the mean attraction is weak, as in Figure 6
, the range of native contacts for the unfolded state (zero to 12 native contacts) is closer to the number of contacts for the folded state (16 to 28 native contacts) and the fully folded state (Nnat = 28) is less populated than a semifolded state with 23 to 26 native contacts.
|
|
; for the other two mean attractions shown, the state with Nnat = 28 has the lowest energy. The state with 23 native contacts can be realized in about twice as many ways as the one with 28 contacts, which causes a negative contribution of the entropy to the free energy of the 23-native-contacts state. These two effects lead to the differences in free energy between the Nnat = 23 state and the Nnat = 28 state that we can observe in Figure 7
|
23 instead of Nnat = 28 as definition of the native state, the free energy of the folded state is obviously reduced, but the results do not change qualitatively.
Figure 7
also contains the energy profile of one chain that is 90% folded and one that is 90% unfolded for the medium mean attraction. In the high-temperature simulation, T = 1.81Tm, the free energy has a minimum at two native contacts, whereas >10 are rarely observed. In the low-temperature simulation, T = 0.76Tm, however, a rather large range of native contacts can still be seen, and the minimum at Nnat = 28 is not as pronounced as one might expect. Note that when a line ends, the corresponding values of the order parameter have not been sampled during the simulation. This means that the free energies of these states are very high but cannot be determined without applying biasing techniques.
| Discussion |
|---|
|
|
|---|
, that modify the MJ amino acidsolvent interaction energies and permit us to study the effect of the solvent on protein folding. In the parameter space studied (selected combinations of 0.05
Cs
1.0 and
) and with 27-mers, we discover four regimes corresponding to different protein behavior. Figure 8
space and their approximate boundaries. Protein simulations using parameters from regions I and IV show a behavior that is not typical for real proteins under native conditions. Region I reflects slow folding and weak cooperativity. The original MJ interaction-energy matrix lies in region I. In region IV, the most compact state is highly unstable, which results in slow folding and a low cooperativity. There is a set of medium compact states with similar energies, in which essentially no foldingunfolding event is observed. Energy parameters characteristic of regions II and III can be used to study actual proteins, because they exhibit the most realistic behavior and allow the shortest simulation times. In region II, the 3 x 3 x 3 cube is the lowest energy state, and it exhibits the highest cooperativity of all interaction models and fast folding. Therefore, region II corresponds to proteins having a compact native state that is well defined for all amino acids. Many real proteins show some flexibity even in the native state; parts of the chain cannot be resolved in X-ray crystal structures (Crippen and Chhajer 2002). Such proteins can be modeled with parameters from region III; they show the fastest folding and also a relatively high cooperativity. In our study, the reference state for folding rate and cooperativity is the compact cube. If this reference state is changed to a noncompact state, as recently suggested by Crippen and Chhajer (2002), proteins with parameters from region II will likely show even "better" behavior in the sense of fast folding and high cooperativity.
|
, but they keep the original MJ contrast unchanged. These investigators studied the folding of a 36-mer at ~0.9 Tm for a relatively strong and a relatively weak mean attraction. Gutin et al. found the formation of a compact globule as a first step for the strong-mean-attraction case, followed by a first-order transition to the native state. Similar to our results, during their simulations, the number of total contacts is much higher than the number of native contacts. For a weak mean attraction, most of the states that are formed during the folding process are native contacts, and the formation of a compact state happens at the same time as the formation of the native state. In contrast to our results, Gutin et al. found the mean folding time similar for the strong and the weak mean attraction. Apparently, these investigators used a lower temperature than that studied here. Also they determined the folding time of an unfolded chain to a folded one, whereas we measure the refolding time between an unfolded and a folded state when both are in equilibrium. Our simulations indicate that the two possible design goals (fast folding and high cooperativity) for proteins are correlated. Therefore, if the system is optimized for one of the goals, the other one is rather close to its optimal value. This holds especially for the contrast parameter, Cs. For 27-mers, fastest folding requires a slightly lower mean attraction than does high cooperativity, whereas a mean attraction between these two optima remains very good for fast folding and high cooperativity. In many studies (Shakhnovich and Gutin 1993; Shakhnovich 1994; Dinner et al. 1996; Li et al. 1996; Tiana et al. 1998; Harrison et al. 1999), the importance of a generally high stability of the native state has been emphasized. Recently, this issue has been discussed with respect to interaction potential and sequence (Giugliarelli et al. 2000; Crippen and Chhajer 2002). In our study, the amino acid sequences of our lattice proteins for a given interaction parameter set are optimized for stability of a given native conformation. However, investigation of a series of interaction scales reveals that this interaction set should not be optimized for stability of the native state. Rather an interaction set that generates only marginally stable native states best features fast folding and high cooperativity, in agreement with experimental findings that proteins are not solely optimized for stability (Gruebele 1999).
Finally, we offer brief prospective remarks based on our work. First, the emphasis of this work is on the optimization of the amino acidsolvent interaction energies for native-proteinlike behavior. Nevertheless, we assert that the results can be used to study sequence-related effects of protein aggregation. In doing so, energy-scale-related aggregation effects can be separated from sequence-related aggregation effects. This may aid in further understanding of the aggregation process. Second, the behavior of chains in water as the solvent is now modeled more realisticaly, and extension to other solvents is straightforward. In addition, the effect of changing solvent characteristics on protein folding can be studied. However, relations between our parameters and quantities such as pH, salt concentration, and the presence of co- or antisolvents remain to be established.
| Materials and methods |
|---|
|
|
|---|
The system energy is defined as the sum of the interaction energies over all pairs of adjacent sites on the lattice, but which are not neighbors on a chain. Accordingly, the system energy is described by the Ising-like Hamiltonian:
![]() | (8) |
L is the size of the lattice in lattice units and e(i,j,k),(l,m,n)is the interaction energy between an amino acid or solvent bead on site (i, j, k) and one on site (l, m, n). During the simulation, MC moves are attempted as shown in Figure 9
. Frequency of moves is set by randomly choosing a site of the chain with equal probability for all sites. If the chosen bead is an end site, we perform an end-bend move. If it is one before then end site, we perform an invert-bend move. Otherwise an invert-bend or crankshaft move is performed with 50% probability for each move. If we try to move a chain bead to a site occupied by another chain bead, the move is immediately rejected because e(i,j,k),(i,j,k) =
. If the move is to a site occupied by a solvent bead, we try to exchange the protein bead with the solvent bead on this site. Then, the move is accepted or rejected depending on the energy change, E, associated with the move in accordance with the Metropolis criterion:
|
![]() | (9) |
![]() | (10) |
where kB, the Boltzmann constant, has the value of 1 MJ-energy unit/1 associated temperature unit. MJ energies are based on contact frequencies, and hence, they are only relative energies. To our knowledge, no conversion to conventional energy units is available. Therefore, for our purpose here, temperature can be expressed in any arbitrary unit.
Sequence and initial conformation
To design a lattice protein, it is first necessary to choose a conformation for the native state, that is, the path the chain takes in the 3 x 3 x 3 cube for a 27-mer. We undertook preliminary studies with a few different native conformations and then settled on the one shown in Figure 2
for all simulations with 27-mers. The types of the 27 amino acid beads for a chain are chosen to reflect a composition corresponding to a typical real protein. Then the sequence is optimized by simulated annealing, meaning that sites on the chain in the native state are randomly exchanged with the Metropolis algorithm. During the run, the temperature is reduced eventually to find the global energy minimum. The sequence corresponding to this minimum is the most stable sequence for the given native conformation and interaction energy scale. To ensure that the global minimum is found, the run is then repeated, starting from the result of the previous run until no further improvement in stability is found.
A change in the mean attraction
has no effect on the energy difference between different sequences for the same structure, but the contrast parameter (Cs) does. Therefore, the annealing process is repeated every time Cs is changed, but not when
is altered. Table 3
shows all sequences used in our simulations.
|
Data analysis
Contacts
During the simulation, we compute and record different types of contacts in addition to the energy. The number of total contacts Ntot is the number of contacts between the beads of one chain and all other amino acid beads on any chain, omitting contacts between the two neighbors that are consecutive on the same chain (one neighbor for a terminal bead). A contact between two beads exists if the beads are nearest neighbors on the lattice. The number of native contacts, Nnat, refers to those of the intramolecular contacts that are also present in the native state. Nnat provides a commonly used measure of the similarity of a given structure to the native structure. The number of total contacts gives information on how compact a protein is under given conditions, similar to the radius of gyration, but with many contacts diagnosing a compact molecule.
Midpoint temperature
Protein-like polymers exhibit a folding transition: They are unfolded at high temperatures and folded at low temperatures. The midpoint temperature Tm is defined as the temperature where, on average, half of the native contacts are present during a simulation. For a 27-mer with a 3 x 3 x 3 cube as native state having 28 native contacts, this means that Tm is the temperature at which Nnat = 14.
Cooperativity
Several definitions of cooperativity are discussed by Crippen and Chhajer (2002). We use the simplest of those for which the cooperativity,
T/Tm, is calculated from the temperatures where, on average, we observe 0.9, 0.5, and 0.1 times the maximum number of native contacts:
![]() | (11) |
Again, a low value of
T/Tm corresponds to a high cooperativity and vice versa.
Folding time
Because of its importance for proteins, we also determine the folding time in our simulations. During the simulation, we record when a chain changes from a completely folded state to one with <40% of its native contacts present, or vice versa. The average number of MC time steps for one folding/unfolding cycle is designated as the mean folding time tf.
Blocking algorithm
We analyze the results of our simulations with a blocking method described by Flyvbjerg and Petersen (1989). This method permits reliable estimates for statistical errors if the simulation is ergodic, that is, if it samples all relevant states sufficiently. The blocking algorithm also helps us to recognize when our chains are in a glassy state.
Free energy
In a MC simulation, it is not possible to calculate absolute free energies. However, it is posssible to calculate free energy differences between states that can be observed in MC simulations by counting the numbers of occurences of the states (Frenkel and Smit 2002). We use the number of native contacts, Nnat, as an order parameter to create a histogram for each chain in a simulation, and we count how often the chain is in each state corresponding to the order parameter. After the simulation, the Landau free energy F(A) of state A relative to the fully folded state can be calculated from the number of times the chain is found in the state, Z(A), and from the number of times the chain is found in the fully folded state (Frenkel and Smit 2002).
![]() | (12) |
| Acknowledgments |
|---|
The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.
| References |
|---|
|
|
|---|
. 1996. Improved design of stable and fast-folding model proteins. Fold. Des. 1: 221230.[CrossRef][Medline]
Anderson, R.E., Pande, V.S., and Radke, C.J. 2000. Dynamic lattice Monte Carlo simulation of a model protein at an oil/water interface. J. Chem. Fys. 112: 91679185.
Asherie, N., Pande, J., Lomakin, A., Ogun, O., Hanson, S.R.A., Smith, J.B., and Benedek, G.B. 1998. Oligomerization and phase separation in globular protein solutions. Biophys. Chem. 75: 213227.[CrossRef][Medline]
Booth, D.R., Sunde, M., Belotti, V., Robinson, V., Hutchinson, C.V., Fraser, P.E., Hawkins, P., Dobson, C.M., Raiford, S.E., and Blake, C.C. 1997. Instability unfolding and aggregation of human lysozyme variants underlying amyloid fibrilogenesis. Nature 385: 787793.[CrossRef][Medline]
Boyd, R.H. and Phillips, P.J. 1993. The science of polymer molecules. Cambridge Universty Press, Cambridge, UK.
Bratko, D. and Blanch, H.W. 2001. Competition between protein folding and aggregation: A three-dimensional lattice-model simulation. J. Chem. Phys. 114: 561569.[CrossRef]
. 2003. Effect of secondary structure on protein aggregation: A replica exchange simulation study. J. Chem. Phys. 118: 51855194.[CrossRef]
Broglia, R.A., Tiana, G., Pasquali, S., Roman, H.E., and Vigezzi, E. 1998. Folding and aggregation of designed proteins. Proc. Natl. Acad. Sci. 95: 1293012933.
Cascão Pereira, L.G., Hickel, A., Radke, C.J., and Blanch, H.W. 2002. Kinetic model for enzyme interfacial activity and stability: pa-hydroxynitrile lyase at the diisopropyl ether/water interface. Bioeng. Biotech. 78: 595605.[CrossRef]
Cascão Pereira, L.G., Theodoly, O., Blanch, H.W., and Radke, C.J. 2003. Dilatational rheology of BSA conformers at the air/water interface. Langmuir 19: 23492356.[CrossRef]
Castells, V., Yang, S., and Van Tassel, P.R. 2002. Surface-induced conformational changes in lattice model proteins by Monte Carlo simulation. Phys. Rev. E 65: 031912031918.[CrossRef]
Cheung, M.S., Garcia, A.E., and Onuchic, J.N. 2002. Protein folding mediated by solvation: Water expulsion and formation of the hydrophobic core occur after the structural collapse. Proc. Natl. Acad. Sci. 99: 685690.
Cohen, F.E., Pan, K.-M., Huang, Z., Baldwin, M., Fletterick, R., and Prusiner, S.B. 1994. Structural clues to prion replication. Science 264: 530.
Costantino, H.R., Langer, R., and Klibanov, A.M. 1995. Aggregation of lyophilized pharmaceutical protein, recombinant human albumin: Effect of moisture and stabilization by experiments. Biotechnology 13: 493496.[CrossRef][Medline]
Crippen, G.M. and Chhajer, M. 2002. Lattice models of protein folding permitting disordered native states. J. Chem. Phys. 116: 22612268.[CrossRef]
Dinner, A.R., Sali, A., and Karplus, M. 1996. The folding mechanism of larger model proteins: Role of native structure. Proc. Natl. Acad. Sci. 93: 83568361.
Du, R., Pande, V.S., Grosberg, A.Y., Tanaka, T., and Shakhnovich, E. 1999. On the role of conformational geometry in protein folding. J. Chem. Phys. 111: 1037510380.[CrossRef]
Fink, A.L. 1998. Protein aggregation: Folding aggregates, inclusion bodies and amyloid. Fold. Des. 3: R9R23.[CrossRef][Medline]
Flyvbjerg, H. and Petersen, H.G. 1989. Error estimates on averages of correlated data. J. Chem. Phys. 91: 461466.[CrossRef]
Frenkel, D. and Smit, B. 2002. Understanding molecular simulation, Chapter 7. Academic Press, New York.
Georgiou, G. and Bowden, G.A. 1990. Inclusion body formation and the recovery of aggregated recombinant protein. In Recombinant DNA technology and applications (eds. A. Prokop et al.), pp. 333356. McGraw Hill, New York.
Giugliarelli, G., Micheletti, C., Banavar, J.R., and Maritan, A. 2000. Compactness, aggregation, and prionlike behavior of protein: A lattice model study. J. Chem. Phys. 113: 50725077.[CrossRef]
Go, N. 1983. Theoretical studies of protein folding. Annu. Rev. Biophys. Bioeng. 12: 183.[CrossRef][Medline]
Gruebele, M. 1999. The fast protein folding problem. Annu. Rev. Phys. Chem. 50: 485516.[CrossRef][Medline]
Gupta, P., Hall, C.K., and Voegler, A.C. 1998. Effect of denaturant and protein concentrations upon protein refolding and aggregation: A simple lattice model. Protein Sci. 7: 26422652.[Abstract]
Gutin, A.M., Abkevich, V.I., and Shakhnovich, E.I. 1995. Is burst hydrophobic collapse necessary for protein folding? Biochemistry 34: 30663076.[CrossRef][Medline]
Harrison, P.M., Chan, H.S., Prusiner, S.B., and Cohen, F.E. 1999. Thermodynamics of model prions and its implications for the problem of prion protein folding. J. Mol. Biol. 286: 593.[CrossRef][Medline]
Hirst, J.D. 1999. The evolutionary landscape of fundamental model proteins. Protein Eng. 12: 721726.
Janicke, R. 1995. Folding and association versus misfolding and aggregation of proteins. Philos. Trans. R. Soc. Lond. B 348