|
|
||||||||
Department of Chemical Sciences, Tata Institute of Fundamental Research (TIFR), Colaba, Mumbai, India, 400005
Reprint requests to: Rajamanickam Murugan, Department of Chemical Sciences, Tata Institute of Fundamental Research, Homi Bhabha Road, Colaba, Mumbai, India, 400005; e-mail: muruga{at}tifr.res.in; fax: 91-22-2280-4610.
| Abstract |
|---|
|
|
|---|
RT ln 2. This result agrees well with earlier studies and indicates that the bias is mainly due to hydrophobic interaction. Further investigations have shown that the landscape funnel could be experimentally mapped onto a two-dimensional space formed by denaturant concentration and the connectedness of configurational domains. The theoretical value of the depth-of-folding funnel in terms of denaturant concentration has been calculated for a model protein (P450cam), which agrees well with the experimental value. Using our model, it is also possible to explain the turnover nature of heat-capacity change upon unfolding of proteins and the existence of enthalpy and entropy convergence temperatures during unfolding without any strict assumptions as proposed in earlier studies. Keywords: Protein dynamics; energy landscape; folding funnel; convergence temperatures
Article and publication are at http://www.proteinscience.org/cgi/doi/10.1110/ps.03347504.
| Introduction |
|---|
|
|
|---|
Because the configurational space of a macromolecule is astronomically large, finding the global minimum just by a random search will not be possible unless there is a kind of energy bias toward the minimum, which forces us to assume a funnel-shaped landscape to explain the experimental observations. For example, the observed folding time and the folding rate of a protein cannot be explained unless we introduce an energy bias toward the native configuration. Moreover, extensive protein-folding simulations have confirmed this fact. But the main drawback of such simulations is that the energy landscape of a real protein is correlated and rugged and therefore far away from these theoretical models. Unfortunately, the usual thermodynamic and kinetics studies on proteins will tell us only about the overall free-energy change of folding/unfolding transitions and the average height of the transition-state ensemble (TSE), but we cannot get any information about the folding funnel and its nature owing to the fact that the TSE lies far above the funnel. The depth of the funnel is an important parameter, which decides the stability and foldability of a particular protein. Suppose that the folding funnel is shallow, then the proteins native conformation would be unstable, heterogeneous, and prone to conformational fluctuations even though its overall folding free energy is very high. From the depth of the folding funnel it is also possible to discriminate a well-folding protein from other proteins. In this article, we present a theory and an experimental methodology to map the protein free-energy landscape onto a two-dimensional space from which we can easily estimate the depth of the folding funnel, which is not possible in standard folding/unfolding studies. First, we present the formal theoretical principles, and then we present the experimental method with one example.
Theoretical concepts
The native configuration of a protein, which is denoted as n, lies at the bottom of the landscape funnel (Fig. 1
), which is, of course, very narrow and deep enough to stabilize the native configuration. Here we say that the connectedness of the native form with rest of the configurational space, which is denoted as u, is almost zero. When we apply free-energy perturbations via temperature or denaturants, the native configuration will get probabilistically dispersed into the nonnative domain according to Boltzmanns distribution. Therefore, here we say that the connectedness of the native state (n) configuration with the rest of the configurational space (u) is increasing. This increasing trend will hold until the energy perturbation reaches the bottleneck of the folding funnel, after which the connectedness will start to decrease, which is caused by the fact that inside the funnel, the molecules can explore only a small configurational area and therefore get strongly connected, whereas once the molecule comes out of the funnel, it can explore a large area of landscape and therefore be weakly connected. Here we have complete information only about the native state (n), whereas the nonnative states (u) are not defined or heterogeneous. Therefore, the connectedness of n with u is meaningful but not vice versa. Now let us define this concept slightly more rigorously.
|
u, n
u =
, x
{n, u}. Suppose that if the native form n contains C correct contacts (in other words, native contacts), then the nonnative form u can have a maximum of C - 1 correct contacts. Because the configurational space is extraordinarily large, for the purpose of analysis, we can assume that x is continuous and the corresponding Langevin equation can be written as:
![]() | (1) |
Here F(x) denotes the potential of mean force (pmf) acting on the protein molecule [i.e., F(x) = -dx
(x), where
(x) =
G(x) is the folding free energy], m is its effective mass,
denotes the internal friction coefficient, and
(t) is delta-correlated Gaussian noise, which satisfies the fluctuation dissipation theorem as:
![]() |
The corresponding Fokker-Plank Equation (FPE) for the probability of finding the protein molecule at position x in time t can be given as (i.e., the Smoluchowski equation):
![]() | (2) |
Here T is the absolute temperature (in kelvins) and k is the Boltzmann constant. Because we are interested in the stationary solution [i.e., at
tP(x, t) = 0], that can be given as follows:
![]() | (3) |
Here
![]() |
is the normalization constant. Now the stationary probability of finding the native form n is
![]() |
and the connectedness of the native form n with rest of the configurational space u can be easily given as follows:
![]() | (4) |
Proof: Given that n
u and {n, u} =
, the following equality is true, which proves equation 4
:
![]() |
When we unfold the protein by denaturants such as urea and guanidine hydrochloride, the Linear Energy Model (LEM) predicts the following relation:
![]() | (5) |
Here,
(
is the folding free energy (the potential of mean force) acting on the native protein, md (in kilocalories per mole per molar) denotes the denaturant m value, and D (in moles per liter) denotes the denaturants activity. From equations 3
and 5
, the probability Pst(n, D) of observing the native configuration n at the denaturant concentration of D is given by
![]() |
where N is the normalization constant. Using the boundary conditions Pst(n, 0) (therefore,
and Pst(n,
), it can be shown that:
![]() | (6) |
From equations 4
and 6
, the connectedness (denoted as
') of the native configuration with the rest of the configurational space in the presence of denaturant can be given as:
![]() | (7) |
From the relation
DP(n
u, D) = 0, we can easily show that the denaturant concentration at which the maximum of connectedness occurs is equal to Dmax = RT ln 2/md. One also should note that at Dmax, Pst(n) = Pst(u) = 0.5, which is known as a stochastic separatrix; that is, the probabilities of existence of the protein molecule in the folded form and the unfolded form are equal. The stochastic separatrix is an abstract point that decides whether a protein molecule is in folded form [Pst(n) > 0.5] or in unfolded form [Pst(u) > 0.5], which otherwise can be viewed as the bottleneck of the folding funnel. There exists one more point that is different from Dmax called the midpoint denaturant concentration, at which the population of the folded form is equal to the population of the unfolded form, that is, Dmid =
G0H2O/md, and therefore
(n) = 0 in equation 5
. Here one should note that from the Dmax, which is obtained from the usual equilibrium unfolding experiments, it is not possible to derive any information about the folding funnel and the energy landscape. But using the aforementioned formalism, we show in the following sections that it is possible to estimate the energy bias toward the native configuration as well as the depth of the folding funnel.
Estimation of energy bias toward native configuration
It is known from earlier studies on denaturant-mediated protein unfolding that (Murugan 2003):
![]() | (8) |
where h0 denotes the activity of water (55.5 M),
denotes the number of bound water molecules on protein (i.e., interaction potential of water with protein), and
is the number of water molecules interacting with the denaturant. Therefore, by putting the value of md in Dmax, we obtain Dmax = h0 ln 2/
. Because Dmax is inversely proportional to waterprotein interactions (i.e.,
), it is an indirect measure of hydrophobic interactions within protein, that is, Dmax
hydrophobic interactions. Comparing this with equation 5
, we obtain the approximate energy bias per correct contact as Ebias = md x Dmax
RT ln 2, which is very close to the earlier suggested value (Zwanzig et al. 1992) of few RTs per native contact and also indicates that it is mainly contributed by hydrophobic interactions! Here we should note that n and u could be differentiated by a unit number of correct contacts.
Estimation of depth of folding funnel
Now, finding the depth of the proteins landscape funnel is a simple procedure. We just give incremental free-energy perturbations to the native state (n) by applying denaturants or temperature and measure the connectedness of n with rest of the configurational space u as a function of this perturbation by some means. Here we have used the sample-size autocorrelation method as described in Materials and Methods. From our prediction, the depth of the folding funnel is simply the amount of free-energy perturbation at which connectedness attains maximum. Because the free-energy perturbation can be measured in terms of the amount of perturbing agents such as urea, we express the connectedness in terms of urea concentration, where the maximum connectedness occurs at the concentration of Dmax (moles per liter). In the next section, we generalize the concept to thermal free-energy perturbations.
Existence of enthalpy and entropy convergence temperatures
The aforementioned model can be easily extended to temperature-mediated unfolding too. The potential of mean force of the native state can be expressed as a function of temperature using the following expression:
![]() | (9) |
![]() |
![]() |
Here the subscript n denotes the corresponding parameter values at T = Tn, which is the temperature (T = Tn) at which the protein exists completely in native form,
S,
H denotes the entropic and enthalpic changes caused by the change in temperature, and
C is the corresponding change in heat capacity of the protein during unfolding. We should note that
T(n, Tn) =
Hn - Tn
Sn and the corresponding probability of observing the native conformation (n) at temperature T is P(n, T) = N x e-
T(n, T)/RT (from equation 3
), where N is the usual normalization constant. Using the initial condition, N can be shown to be N = e
T(n, Tn)/RT . And thus the connectedness of the native configuration n with the rest of the configurational space u at a temperature T can be given as:
![]() | (10) |
where,
![]() |
Now it is very easy to show [by solving the equation
TP(n
u, T) = 0 using MAPLE 7] that the function has two maxima at:
![]() | (11) |
![]() | (12) |
Here
![]() |
where y = LambertW(x) is the solution of the equation y exp(y) = x. We are generally interested in the maximum Tn > T (here it is Tm, 2), whose approximate value can be calculated as follows: Given that (1/Tn) >> 1 (Tn is generally close to room temperature, i.e., ~298 K) and R/
CpNU << 1, by neglecting those terms in the expression of Tm, 2, we get:
![]() | (13) |
The numerical value of LambertW(-1/e) can be obtained from MAPLE 7. One also should note that P(n
u, T) is a turnover function only when
CpNU < 0. Although the expressions for change in enthalpy and entropy due to temperature have the forms as:
![]() |
where
![]() |
is the heat capacity change due to unfolding, detailed calorimetric studies showed that it was not valid in the case of protein unfolding because of the fact that
CpNU itself was a function of temperature. Moreover, upon unfolding, the protein generally would not be only in native and unfolded forms but exist in a continuum of states. Because n and u are two different sets of configurations of the same molecule, earlier studies (Zhou et al. 1999) showed that it was necessary to introduce a measure called the weighting factor (i.e.,
CpNU = fUCpU - fNCpN < 0, where fN and fU are the corresponding fractions of native and unfolded forms) as a correction. In our model, we propose that the correct weighting factor should be the connectedness of configurational domains [i.e., P(n
u, T)], because the change in heat capacity upon unfolding is directly proportional to the connectedness of configurational domains. Moreover, weighting by fraction of folded/unfolded population does not carry any meaning because there is a possibility of occurrence of protein molecules with partially folded configuration, which cannot be accounted for either in the folded fraction or in the unfolded fraction. In other words, it is not valid in the vicinity of the stochastic-separatrix. Because the connectedness has a turnover behavior, it is easy to conclude that the function
CpNU x P(n
u, T) also should be a turnover function, which is the usual observation in calorimetric studies on protein unfolding.
Explaining the existence of the enthalpy and entropy convergence temperatures (i.e., the temperature, T* = 110°C, at which common enthalpy,
H*, and entropy values,
S*, occur) is still under debate, although many possible explanations based on balance between hydrophobic and hydrophilic interaction of water with proteins have been proposed (Privalov 1979, 1996, 1997; Baldwin 1986; Lee 1991; Baldwin and Muller 1992; Fu and Freire 1992; Ragone and Colonna 1994) so far. Using our model, we were able to predict such convergence temperatures under certain conditions as follows: Using our weighted expression for the change in heat capacity, the possible correct equation for enthalpy and entropy change with temperature can be given as:
![]() | (14) |
![]() | (15) |
Because our studies showed that Tm, 2 [this is the point at which P(n
u, T) becomes maximum] depends only on Tn, for a set of proteins with similar
CpNU, it is easy to verify that although there is a small difference in the Tn values of the proteins, there exists a convergence temperature T* at which common
H* and
S* values can occur beyond the temperature Tm, 2, which is the usual observation. This is because the tail region (T > Tm, 2) of the function P(n
u, T) contributes much less to the integrals given by equations 11
and 12
. In the following section, we see how can one measure the depth of the folding funnel of a model protein, in terms of denaturant concentration experimentally.
| Materials and methods |
|---|
|
|
|---|
392 = 102 mM-1 cm-1; Gunsalus and Wagner 1978). A protein concentration of ~3 µM was used. Experiments were conducted at room temperature (298 K). The unfolding kinetic experiments were done using Hi-Tech SF61MX stopped flow spectrometer. Here a fixed N of 2000 data points (i.e., absorbance at the Soret peak of 392 nm) and a total time (T = N
t) of data collection of 200 msec (i.e.,
t = 100 µsec) were used. From these collected data, the corresponding sample-size autocorrelation functions were constructed using equation A6
' values. | Results and Discussion |
|---|
|
|
|---|
' obtained by fitting the sample-size autocorrelation function constructed accordingly to equation A5
|
|
| Appendix |
|---|
|
|
|---|
T is
, which is the connectedness parameter. Assuming equal initial probability, the birthdeath master equation becomes (because the configurational space is extraordinarily large, we can neglect the reverse probability terms):
![]() | (A1) |
Equation A1
can be simply solved by using the generating function
![]() |
with initial condition
![]() |
which is due to the fact that P(x,0) = 1/M to give:
![]() | (A2) |
Now the variance of x can be given as:
![]() | (A3) |
Any kinetic data involving biomacromolecules like protein and DNA are the sum [S(t), where t is time] of reactive [e.g., foldingunfolding transitions that involve changes in energy state, f(t)], nonreactive [local dynamics, h(t)], and instrumental noise components [e(t)], that is, S(t) = f(t) + h(t) + e(t), whose sample-size autocorrelation with a delay of
sec can be easily obtained by assuming e(t) as an additive Gaussian noise with
e> = 0,
e2
1/n for n
30, where n is the sample size,
ei, ei +
> = 0,
e, f> = 0, and
e, h> =
h, f> = 0, as follows:
![]() | (A4) |
Given that
hi, hi + 

0 = 0,
hi, hi
= (
/A) x Var{x(t)} = (
/A) x (
+
t) =
' +
't, where
is the corresponding spectroscopic conversion factor (e.g., molecular extinction coefficient) and A is Avagadros number,
![]() | (A5) |
where
t is the time difference between two consecutive data points, which is constant, t = (
t)n and
=
'
t. From equation A5
, we can easily conclude that as
' increases from zero, G(
0, n) exhibits the turnover behavior (Fig. 4
). The experimental G(
0, n) can be constructed from the kinetic data (in our case, unfolding of cytochrome P450cam by urea) using the following relation:
|
![]() | (A6) |
Where,
![]() |
Here gi is the i-th data point from the experiment and
0 < n < (N -
0), where N is the total number of data points collected. Now, by fitting equation A5
to the function constructed from equation A6
by a standard nonlinear method, we can easily obtain the parameter
'.
| Acknowledgments |
|---|
The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.
| References |
|---|
|
|
|---|
Baldwin, R.L. and Muller, N. 1992. Relation between the convergence temperatures T*h and T*s in protein unfolding. Proc. Natl. Acad. Sci. 89: 71107113.
Boczko, E.M. and Brookes, C.L. 1995. First-principles calculation of the folding free energy of a three-helix bundle protein. Science 269: 393.
Bryngelson, J.D., Onuchic, J.N., Socci, N.D., and Wolynes, P.G. 1995. Funnels, pathways, and the energy landscape of protein folding: A synthesis. Proteins 21: 167.[CrossRef][Medline]
Dill, K.A., Fiebig, K.M., and Chan, H.S. 1993. Cooperativity in protein-folding kinetics. Proc. Natl. Acad. Sci. 90: 19421946.
Fu, L. and Freire, E. 1992. On the origin of the enthalpy and entropy convergence temperatures in protein folding. Proc. Natl. Acad. Sci. 89: 93359338.
Gunsalus, I.C. and Wagner, G.C. 1978. Bacterial P-450cam methylene monooxygenase components: Cytochrome m, putidaredoxin, and putidaredoxin reductase. Methods Enzymol. 52: 166188.[Medline]
Lee, B. 1991. Isoenthalpic and isoentropic temperatures and the thermodynamics of protein denaturation. Proc. Natl. Acad. Sci. 88: 51545158.
Leopold, P.E., Montalk, M., and Onuchic, J.N. 1992. Protein folding funnels: A kinetic approach to the sequencestructure relationship. Proc. Natl. Acad. Sci. 89: 8721.
Murugan, R. 2002. Sample size autocorrelation analysis of kinetic data: Resolving reaction path heterogeneity. J. Chem. Phys. 117: 63726376.[CrossRef]
. 2003. Competitive model on denaturant mediated protein unfolding. Biophys. J. 84: 770774.
Onuchic, J.N., Wolynes, P.G., Luthey-Schulten, Z., and Socci, N.D. 1995. Toward an outline of the topography of a realistic protein-folding funnel. Proc. Natl. Acad. Sci. 92: 3626.
Privalov, P.L. 1979. Stability of proteins: Small globular proteins. Adv. Protein Chem. 33: 167241.[Medline]
. 1996. Intermediate states in protein folding. J. Mol. Biol. 258: 707725.[CrossRef][Medline]
. 1997. Thermodynamics of protein folding. J. Chem. Thermodynamics 29: 447474.[CrossRef]
Ragone, R. and Colonna, G. 1994. Enthalpyentropy balance and convergence. J. Biol. Chem. 269: 40474049.
Unger, B.P., Gunsalus, I.C., and Sligar, S.G. 1986. Nucleotide sequence of the Pseudomonas putida cytochrome P-450cam gene and its expression in Escherichia coli. J. Biol. Chem. 261: 11581163.
Zhou, Y., Hall, C.K., and Karplus, M. 1999. The calorimetric criterion for a two-state process revisited. Protein Sci. 8: 10641074.[Abstract]
Zwanzig, R., Szabo, A., and Bagchi, B. 1992. Levinthals Paradox. Proc. Natl. Acad. Sci. 89: 2022.
![]()
CiteULike
Connotea
Del.icio.us
Digg
Reddit
Technorati What's this?
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |