|
|
||||||||
dziej1,2
1 Baker Laboratory of Chemistry and Chemical Biology, Cornell University, Ithaca, New York 14853-1301, USA
2 Faculty of Chemistry, University of Gdansk, 80-952 Gdansk, Poland
Reprint requests to: Harold A. Scheraga, Baker Laboratory of Chemistry and Chemical Biology, Cornell University, Ithaca, NY 14853-1301, USA; e-mail: has5{at}cornell.edu; fax: (607) 254-4700.
(RECEIVED May 24, 2004; FINAL REVISION June 24, 2004; ACCEPTED June 28, 2004)
| Abstract |
|---|
|
|
|---|
and the dihedral angles
for the rotation of peptide groups about the C
C
virtual-bond axes. For the side-chains that do not interact favorably with calcium, simple excluded-volume potentials were introduced. The parameters of the potentials were obtained from ab initio quantum mechanical calculations of model systems at the Restricted Hartree-Fock (RHF) level with the 631G(d,p) basis set. The energy surfaces of pairs consisting of Ca2+-acetate, Ca2+-propionate, Ca2+-acetamide, Ca2+-propionamide, and Ca2+-N-methylacetamide systems (modeling the Ca2+-Asp, Ca2+-Glu, Ca2+-Asn, Ca2+-Gln, and Ca2+-peptide group interactions) at different distances and orientations were calculated. For each pair, the restricted free energy (RFE) surfaces were calculated by numerical integration over the degrees of freedom lost when switching from the all-atom model to the united-residue model. Finally, the analytical expressions for each pair were fitted to the RFE surfaces. This force-field was able to distinguish the EF-hand motif from all potential binding sites in the crystal structures of bovine
-lactalbumin, whiting parvalbumin, calbindin D9K, and apo-calbindin D9K. Keywords: protein structure prediction; united-residue force-field; calcium cation binding; EF-hand motif
Article and publication are at http://www.proteinscience.org/cgi/doi/10.1110/ps.04878904.
| Introduction |
|---|
|
|
|---|
It is not clear whether metal ions help proteins to fold or whether they are incorporated into the proteins after they have already formed their three-dimensional structure. Although there have been numerous attempts to design calcium-binding sites in proteins (Yang et al. 2002, 2003), there has been very little effort to locate the most probable binding site in a protein. Locating potential binding sites in a mean-field model is still more difficult because, in most such models, atomic details of the side-chains are eliminated and the side-chains are represented by spheres or ellipsoids. However, charges and the orientation of the side-chain are very important in the process of metal binding. Here, we present an effort to reproduce the correct charge and geometry distribution of the side-chains in a united-residue model. Such a model can help in incorporating metal ions efficiently in the folding process, especially because most metal-binding proteins are large multichain subunits, and it is currently difficult to simulate their folding by using an all-atom potential. To try to fold these proteins with a united-residue force-field that can include cations in the folding process might shed light on the roles that cations play in folding proteins. We start with the calcium cation because, in contrast to transition-metal cations, it forms complexes of predominantly electrostatic type, which are easier to model compared with covalent bonding interactions as occur with cations such as zinc. Because we treat the system at the mean-field level, we consider only the total energetic effect of replacing water molecules in the coordination sphere of a calcium cation and not the detailed kinetics of this process.
The calcium cation shows a greater tendency to form complexes with negatively charged carboxylate groups than with water molecules (Katz et al. 1996). It binds directly to the side-chains of the hydrophilic residues and to the carbonyl groups of the backbone peptide groups as opposed to indirect binding via a metal-bound water molecule (Dudev and Lim 2003; Dudev et al. 2003). The observed total coordination number of calcium in proteins varies from six to eight, with a coordination radius of 1.00 Å and 1.12 Å, respectively (Jernigan et al. 1994; Dudev and Lim 2003; Dudev et al. 2003).
Calcium binding sites can be classified into three categories (Einspahr and Bugg 1984; McPhalen et al. 1991; Pidcock and Moore 2001; Dudev and Lim 2003). Most calcium binding proteins possess a highly conserved structure, the EF-hand motif, that selectively binds calcium (Forsen and Koerdel 1996). This structure corresponds to a short continuous segment of the protein that provides groups for binding (the classical EF-hand motif). A second structure is the same as an EF-hand motif except that one of the groups comes from a distant site. A third structure corresponds to binding sites in which the groups come from different segments in the protein. The EF-hand motif is the most common of them all (Kretsinger and Nockolds 1973). The classical EF-hand motif is a 12-residue Ca2+-binding loop between two helices, forming a conserved helix-loop-helix structure (Kretsinger and Nockolds 1973; Dudev and Lim 2003). This motif contains Asp, Glu, Asn, and Gln residues that, along with the peptide groups, provide sites for calcium binding. In many calcium-binding proteins, Ca2+ binding induces a conformational change in the EF-hand motif, leading to the activation or inactivation of target proteins. In the S100 superfamily, for example, to which calbindin D9K, calmodulin, and troponin C belong, this conformational change is described as a change from the <<closed>> conformational state in the absence of Ca2+ to the <<open&Rt conformational state in its presence (Yap et al. 1999).
| Results |
|---|
|
|
|---|
-lactalbumin, whiting parvalbumin, and calbindin D9K.
|
-lactalbumin (Protein Data Bank [PDB] access code 1F6S
[PDB]
; Chrysina et al. 2000) is an
+
hexamer, each monomer containing 121 residues. Every monomer has one EF-hand motif starting at Asp78 and ending at Asp88. The binding pocket contains only Asp residues. The position of the calcium cation in the native structure and the position found by a systematic search of the optimum binding site and subsequent minimization of the calcium-chain energy (with the polypeptide chain fixed in the native structure; see Materials and Methods) are shown in Figure 1
|
-helical dimeric protein. Each of the monomers has 108 residues and contains two EF-hand motifs. One of the EF-hand motifs starts at Asp53 and ends at Glu62. The other EF-hand motif starts at Asp92 and ends at Glu101. The positions of the two calcium cations in the experimental structure and the positions found by a systematic search of the optimum binding sites and subsequent energy minimization (with the polypeptide chain fixed in the native structure) are shown in Figure 2
|
-helical monomeric protein of 75 residues. This protein is very well studied, and there are many good crystal and NMR structures of the calcium-bound form, the apo-form, and structures with other ions such as magnesium and lanthanium in the PDB. It contains two EF-hand motifs. The first one starts at Glu17 and ends at Glu27. The second one starts at Asn54 and ends at Glu65. The second motif has more interacting residues, and therefore, it is energetically favored by calcium and other cations even though the pockets are comparable in size. The positions of the two calcium cations in the experimental structure and the positions found by systematic search of the optimum binding sites and subsequent energy minimization (with the polypeptide chain fixed in the native structure) are shown in Figure 3
|
To carry out test 1, we used the PDB structure of apo-calbindin D9K (PDB access code 1CLB [PDB] ; Skelton et al. 1995). The apo form examined is a mutant of the wild type, in which Phe43 is replaced by Gly. It appears that this mutation does not affect the potential calcium-binding regions, because the mutated residue is not at any of the two EF-hands and it is not a residue that interacts favorably with calcium. The main difference between calbindin D9K and apo-calbindin D9K is in the EF-hand regions. In apo-calbindin D9K, the EF-hands are positioned toward the center of the protein. This form of the EF-hand motif is also known as the <<closed>> form. However, when calciums bind to the EF-hands, they push the EF-hands to the outside and the EF-hands adopt the <<open>> form (Yap et al. 1999). The opening of the EF-hands upon calcium binding is thought to be responsible for the biological activity of the protein (Yap et al. 1999).
The total backbone
-carbon (C
) RMSD between the PDB structures of calbindin D9K and apo-calbindin D9K (residues 175) is 2.11 Å. The C
RMSDs between the first set of residues (1727) and the second set of residues (5465) in the two EF-hand motifs of the experimental structure of calbindin D9K and apo-calbindin D9K are 1.13 Å and 1.79 Å, respectively. The crystal structure of apo-calbindin D9K had to be relaxed by minimization of the UNRES energy because it had some side-chain clashes when it was converted from the PDB geometry to the UNRES geometry. However, the relaxation procedure increased the RMSD between the relaxed structure of apo-calbindin D9K and the structure of calbindin D9K in the crystal. The increase in RMSD was due mainly to the introduction of three residues to the helix in the region of the first EF-hand motif of apo-calbindin D9K. After the relaxation, the overall C
RMSD between these structures of apo-calbindin D9K and calbindin D9K was 3.70 Å, and the RMSDs between the first and the second EF-hand motifs were 2.70 Å and 1.82 Å, respectively.
After the introduction of calcium cations into the relaxed apo-structure and minimization of the UNRES energy of the chain, including the positions of the cations, the total C
RMSD of the final structure from the structure of calbindin D9K in the crystal was 3.21 Å as opposed to 3.70 Å before energy minimization. The RMSDs between the first and the second EF-hand motifs were 2.21 Å and 1.84 Å, respectively, as opposed to 2.70 Å and 1.82 Å of the relaxed structure before calcium placement. The RMSD between the position of calcium in the final calculated structure and in the experimental structure of calbindin D9K in the crystal was 4.39 Å in the first pocket and 7.44 Å in the second pocket. Therefore, introduction of calcium cations and subsequent minimization of the energy of the complex reduced the C
RMSD between the relaxed apo-form and the native calcium-bound form, especially in the first EF-hand motif, which was distorted by the relaxation procedure. The superposition of the crystal structure of the calcium-bound form (black), the crystal structure of the apo form (red), and the final force-field structure (blue) are shown in shown in Figure 4
. It can be seen in the figure that the calcium-bound structure and the final force-field structure are much more similar in the EF-hand motif regions than are the apo structure and the final force-field structure. This shows that our calcium force-field is capable of converting the EF-hand motifs from the <<closed>> form in the apo structure to the <<open>> form in the calcium-bound structure.
|
The overall C
RMSD between this lowest-energy structure and the crystal structure of apo-calbindin D9K was 6.3 Å. The RMSDs between the first and the second EF-hand motifs were 4.17 Å and 3.31 Å, respectively. In this structure, the EF-hand motifs were completely distorted. They did not posses any extended structure, which is the characteristic of the EF-hand motif. Rather, they possessed a significant amount of helical content.
To assess whether one calcium cation is capable of correcting one of the distorted EF-hand motifs, one calcium cation was introduced to this structure. The chain, as well as the position of the calcium cation, was energy-minimized. The final structure obtained from the calcium force-field had an overall C
RMSD of 5.89 Å from the crystal structure of calbindin D9K. The RMSDs of the first and second EF-hand motif were 3.91 Å and 3.65 Å, respectively. This structure is shown in Figure 5A
. It can be seen that although both the overall RMSD, as well as those of the EF-hand motifs, are lower in the final structure than in the initial structure, the calcium cation was not able to correct the distorted EF-hands. It did not enter into any of the EF-hand motifs but stayed in a region between the two distorted regions in the core of the structure.
|
Initially, we thought that only one calcium cation by itself might not be able to correct the distorted EF-hands. However, after introducing the second calcium cation, the total C
RMSD of the final structure increased to 6.19 Å, and the RMSDs between the first and the second EF-hands became 3.95 Å and 4.45 Å, respectively. Compared with the original structure, this structure is farther from the crystal structure of calbindin D9K. This structure is shown in Figure 5B
. The two calcium cations are in the core of the protein, and they are surrounded by the interacting residues of the EF-hands and the two adjacent helices. They interact with Glu17 and Glu27 of the first EF-hand motif and Glu48, Asp54, Asp58, Glu60, and Asn67 of the second EF-hand motif. Further, the helices on the sides of the EF-hands have shifted so that they provide as many interacting residues to the two calcium cations as possible. This shift of the helices is the reason for the increase in the RMSD and the deviation between this structure and the crystal structure of calbindin D9K.
| Discussion |
|---|
|
|
|---|
However, the force-field is not able to correct the topology of a distorted binding site, as can be seen in Figure 5
, A and B. The most probable reason for this is that the UNRES force-field is not perfect in reproducing loop geometry; it tends to assign some secondary structure to loops. As can be seen in Figure 5B
, the helices from both sides of the loops are arranged to surround the calcium cations by the greatest number of interacting residues. This means that it is energetically more favorable for this force-field to reposition an entire helix than to open a helix turn. However, the calcium-binding constant to
-lactalbumin is of the order of 107, so that the free energy of binding is ~10 kcal/mole (Permyakov et al. 2001); this is enough to unfold two or three turns of a helix.
The possible reasons for the distortion of the geometry of the calcium coordination sphere (i.e., the location of the calcium with respect to the interacting residues in the binding site) obtained with the force-field derived in this work are: (1) The parameterization was not refined by reproducing the geometry of high-resolution crystal structures of calcium-bound peptides; and (2) the multibody terms to express the calcium-interaction in the potential-energy function were absent. Both issues are now being investigated in our laboratory.
| Materials and methods |
|---|
|
|
|---|
-carbon atoms, (C
s), linked by virtual bonds with attached united side-chains (SCs), and united peptide groups (ps). Each united peptide group is located in the middle of two consecutive
-carbon (C
) atoms (Fig. 6
|
-carbon atoms are geometric points. All virtual-bond lengths (i.e., C
C
and C
SC) are fixed; the distance between the neighboring C
s is 3.8 Å, corresponding to trans peptide groups, whereas the side-chain angles (
SC and
SC), the virtual-bond angles (
s), and dihedral angles (
s) can vary. In this work, we introduce calcium cations. For side chains that bind the calcium cation specifically (Asp, Glu, Asn, and Gln), the calcium cation interacts with the carboxyl and carbonyl groups in the side-chain and not with the center of mass of the side-chain. However, UNRES ellipsoids do not contain the details of the side-chains. Therefore, the calcium-interaction site is assumed to lie on the C
SC axis and to be shifted from the SC center of mass by the distance
SC; This new center is called SC'. Therefore, the location of a calcium cation with respect to a favorable amino acid residue i is described by its distance dSC'i from SC' and the Ca2+Ci&agr;SCi angle (
Ca2+) as shown in Figure 6
SC axis by the angle
Ca2+, which, together with dSC'i and
Ca2+, defines its location with respect to the polypeptide chain. It should be noted that the energy of interaction of the SC site with the calcium does not depend on
Ca2+, because the corresponding expression has cylindrical symmetry with respect to the C
SC axis. For the remaining side-chain types, calcium is assumed to interact with the center of mass of the ellipsoid. For the peptide groups of the backbone, calcium is assumed to interact with the center of mass of the peptide. UNRES is a physics-based force-field, which is derived as a Restricted Free Energy (RFE) function of a polypeptide chain. The RFE is defined as the free energy of a given coarse-grain conformation obtained by integrating the Boltzmann factor of the all-atom (i.e., the polypeptide chain-plus-solvent) energy over the degrees of freedom that are neglected in the united-residue model (Liwo et al. 1998, 1999b, 2001). The complete UNRES potential-energy function, which also includes the protein-calcium cation interaction terms that are introduced in this work, is expressed by equation 1:
![]() | (1) |
The terms USCiSCj correspond to the mean free energy of hydrophobic (hydrophilic) interactions between the side-chains. These terms implicitly contain the contributions from the interactions of the side-chain with the solvent. The terms USCipj correspond to the excluded-volume potential of the side chainpeptide group interactions. The terms Upipj represent the energy of average electrostatic interactions between backbone peptide groups. The terms Utor and Utord are the torsional and the double-torsional potentials, respectively, for rotation about a given virtual bond or two consecutive virtual bonds. The terms Ub and Urot are the virtual-anglebending and side chainrotamer potentials. The terms Ucorr (m) correspond to the correlations (of order m) between peptide-group electrostatic and backbone-local interactions. The last five terms pertain to the calciumpolypeptide chain interactions, which are described in the next section.
The terms USCiSCj, Ub, and Urot were parameterized (Liwo et al. 1997a, b) from the distribution and correlation functions determined from the PDB. Utor, Utord, and Ucorr were based on the cumulant expansion of the RFE of polypeptide chains (Liwo et al. 1998, 1999b, 2001) and parameterized from the RFEs of model systems obtained by high-level MP2/631G(d,p) ab initio calculations of model system (Oldziej et al. 2003; Liwo et al. 2004). Finally, the ws are weights of the various energy terms, obtained by optimization of the total potential-energy function to obtain a funnel-like energy landscape of benchmark proteins (Liwo et al. 2002).
Analytical expressions for energy of interaction of calcium cations with polypeptide chains
Binding of the calcium cation to the carboxyl and carbonyl groups is mainly electrostatic in nature. Therefore, to derive analytical expressions for the RFE of interaction of the calcium cation with the Asp, Glu, Asn, and Gln side-chains and with the peptide groups, we assumed that the charge distribution of each of these interaction sites (and, consequently, the interaction energy with the calcium cation) is expressed by a multipole series, which provided simplified analytical expressions for the interaction energy. Subsequently, by averaging these expressions over the secondary degrees of freedom (i.e., the rotation angle
of the Ca2+ cation about the C
-SC axis shown in Fig. 6
), we derived approximate analytical expressions for the mean-field interaction energies of the calcium cation with its interaction sites.
The Asp and Glu side-chains bear a net charge of -1e and also possess a quadripole moment, which is necessary to describe the angular dependence of the calcium-side-chain interaction energy. For charged systems, choosing the origin of the reference system in the charge center implies that the dipole moment is zero. However, because it is not know a priori what choice of the origin of the reference system provides the analytical expression best fitting the numerically calculated RFE surfaces, even for charged side-chains, we introduced a dipole moment parallel to the rotation axis. The energy of interaction of the calcium cation with the point charge and with the dipole moment does not depend on the rotation angle
; therefore, it is necessary to average only the calciumquadripole interaction energy (Fig. 7
). In this figure, for clarity we have shown only the quadruple moment placed at SC'. However, in our derivations, there is also a dipole moment along the Z-axis and a point charge located at SC'.
|
The magnitude of the quadripole moment is given by equation 2:
![]() | (2) |
The energy of interaction of the quadripole placed at SC' with a point charge of magnitude q' positioned at coordinates x, y, and z is given by equation 3
![]() | (3) |
where dSC' = x2 + y2 + z2 and is the dielectric constant. Transforming equation 3 from Cartesian coordinates to internal coordinates dSC',
,
such that
![]() | (4) |
we obtain equation 5.
![]() | (5) |
The energy averaged over the rotation angle
is expressed by equation 6:
![]() | (6) |
The energy of interaction of a point dipole (with dipole moment p) with a point charge q' (positioned at a distance dSC' from the dipole and angle
with respect to the dipole axis) is expressed by equation 7:
![]() | (7) |
To represent the point charge energies, we use the expression for the Coulombic chargecharge interaction energy and finally a Lennard-Jones term to prevent the collapse of the calcium cation on the respective side-chain. Because the distance
SCi is much smaller than dSC'i and dCa2+ in Figure 6
, we assume that
in Figure 7
is approximately equal to
Ca2+ in Figure 6
for simplicity. Taking all this into account, we obtain equation 8 for the energy of interaction of a calcium cation with an aspartate or glutamate side-chain, UAGCa2+.
![]() | (8) |
where X denotes the Asp or the Glu side-chain,
XCa2+ and r0 XCa2+ are the parameters of the Lennard-Jones potential, wc,X is the parameter of the Coulomb term, wdip,X is the parameter of the dipole term, and wquad1,X and wquad2,X are the parameters of the quadripole term, respectively.
The expressions for the interaction energy between the Asn and Gln side-chains and calcium (UAGNCa2+) or the peptide groups and calcium (UpCa2+) are similar. Because these groups are neutral, there is no chargecharge interaction term:
![]() | (9) |
where X denotes the Asn or the Gln side-chain or the peptide group, and the meaning of the symbols is the same as described by the text under equation 8.
The constants of the Lennard-Jones potential
and r0 were estimated from the calcium cation parameters of the AMBER force-field (Pearlman et al. 1995). Determination of other parameters of equations 8 and 9 is described in "Parameterization of the Expressions for CalciumPolypeptide Interaction Energy."
The terms USCiCa2+ correspond to the interaction between the calcium cation and any side-chains other than Asp, Glu, Asn, and Gln introduced to avoid clashes of the calcium cation with the side-chains. They are expressed by a Lennard-Jones potential, as given by equation 10.
![]() | (10) |
where
![]() | (11) |
![]() | (12) |
where r0Ca2+ = 3.4725 Å (Mayo et al. 1990), the values of
SCi were obtained from (Liwo et al. 1997a), and the values of
SCiCa2+ were all arbitrarily set at 0.2 kcal/mole.
Finally, UCa2+Ca2+ corresponds to the repulsive interaction between two calcium cations, and consists of an excluded-volume Lennard-Jones term and a Coulomb term in equation 13:
![]() | (13) |
where r0Ca2+ is defined in the text under equation 11,
Ca2+ = 0.05 kcal/mole (Mayo et al. 1990), k = 332 is the conversion factor to express the energy in kilocalories per mole if the distance and charge are expressed in angströms and electron charge units, respectively, qCa2+ = 2e is the charge of the calcium cation, and
is the dielectric constant of water.
Parameterization of the expressions for calciumpolypeptide interaction energy
As in our recent work on the parameterization of the torsional and correlation terms of the UNRES force-field (Oldziej et al. 2003; Liwo et al. 2004), the parameters of the calciumpolypeptide interaction terms were obtained from quantum mechanical calculations on model compounds, followed by fitting the resulting RFE hypersurfaces to the analytical expressions.
The energy landscapes for each of the different interacting residues were obtained by ab initio quantum mechanical calculations at the Restricted Hartree-Fock (RHF) level with the 631 G(d,p) basis set. The program GAMESS (Schmidt et al. 1993) was used in all calculations. The systems studied were Ca2+AcO, Ca2+PrO, Ca2+AcNH2, Ca2+PrNH2, and Ca2+AcNHMe (where Ac and Pr denote the acetyl and propionyl group, respectively) to model the interactions of the calcium cation with the ionized aspartate and glutamate side-chains, the asparagine and glutamine side-chains, and the peptide group, respectively. The valence geometry of the side-chain sites was not optimized. The bond lengths and angles were taken from ECEPP/3 geometry (Némethy et al. 1992), and the peptide groups were assumed to be planar. The systems are shown in Figure 8
, A through E.
|
SC axis,
Ca2+, the angle of rotation of the Ca2+ cation about the C
SC axis,
Ca2+, and the angle
is the dihedral angle about a bond as shown in Figure 8, A through D
d ≤ 7 Å with a 1 Å increment, 0°
Ca2+
180° with a 15° increment, 0°
Ca2+
180° with a 30° increment, and 0°
360° with a 30° increment. For Ca2+AcNHMe, unlike other systems, AcNHMe is assumed to be rigid and the energy was evaluated on the following grid: 2Å
d
6 Å with a 1 Å increment, 0°
Ca2+
180° with a 15° increment, and 0°
Ca2+
180° with a 30° increment (Fig. 8E
Ca2+ were calculated by integrating out the remaining degrees of freedom, as given by equation 14:
![]() | (14) |
where n is the number of dihedral angles
in a system,
denotes the vector of the dihedral angles
, and
= 0.1 (or, in other words, scaling down the energies) to account for the screening of the calcium cation by the solvent and, thereby, decreasing the interaction energy calculated in vacuo. Given the dielectric constant of water (78), the effective value of
at 298 K should be 1.69/78.4
0.02. However, we consider short distances here between the calcium cation and the interacting residues, and the effective dielectric constant is smaller; therefore,
should take a value between 0.02 and 1.69, selected here as 0.1. Integration was carried out numerically.
The approximate analytical expressions for calciumside-chain and calciumpeptide interaction energy were fitted to the RFE calculated numerically for each calciumside-chain and calciumpeptide system by means of a nonlinear least-squares Marquardt method (Marquardt 1963). The resulting parameters are shown in Table 1
. To take the effect of hydration into account, all the weights of the chargecharge (wc,X), chargedipole (wdip,X), and chargequadripole (wquad1,X) interaction terms in equations 8 and 9 were divided by the dielectric constant of water (i.e.,
= 78).
The weights wAGCa2+, wAGNCa2+, wSCCa2+, wpCa2+, and wCa2+Ca2+ of the energy terms in equation 1 were set arbitrarily so that the calciumpolypeptide chain energy term is about one-fourth of the polypeptide chain energy as represented by the UNRES terms of equation 1. This led to the best performance after the force-field of equation 1 was tested on the crystal structure of a number of calcium-binding proteins, and the placement of the calcium in the binding site was monitored. The weights for each of the energy terms of equation 1 are provided in Table 2
.
|
Ca2+ (see below). Subsequently, in each case, the energy of the system was minimized with respect to the calcium coordinates (keeping the polypeptide chain fixed). The conformation with the lowest energy was selected as the one with the calcium bound to the optimal calcium-binding site. To determine the distributions of dSC' and
Ca2+, we used the crystal structures of calcium-bound proteins from the PDB. The list of the proteins is available in the electronic version under the supplementary materials. We obtained the following mean and the variance for the distance distribution and angle distribution: <dSC'> = 4.5 Å and [<dSC'2> <dSC'>2]1/2 = 1.3 Å, whereas <
Ca2+> = 80° and [<
2Ca2+> <
Ca2+>2]1/2 = 41°, respectively. For the apo- and the UNRES-predicted structure, the procedure for searching the optimal binding site was similar to that described above except that all degrees of freedom (i.e., those of the calcium cation and those of the polypeptide chain) were subjected to energy minimization. This allowed the EF-hand motif to adopt the <<open form>> conformation upon binding of calcium.
| Electronic supplemental material |
|---|
|
|
|---|
| Footnotes |
|---|
3 Present address: Schrodinger, Inc., Portland, OR 97201, USA ![]()
| Acknowledgments |
|---|
The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.
| References |
|---|
|
|
|---|
Chrysina, E.D., Brew, K., and Acharya, K.R. 2000. Crystal structures of Apo-and Holobovine
-lactalbumin at 2.2 Å resolution reveal an effect of calcium on inter-lope interactions. J. Biol. Chem. 275: 3702137029.
Declercq, J.P., Evrard, C., Lamzin, V., and Parello, J. 1999. Crystal structure of the EF-hand parvalbumin at atomic resolution (0.91 Å) and at low temperature (100 K). Evidence for conformational multistates within the hydrophobic core. Protein Sci. 8: 21942204.[Abstract]
Dudev, T. and Lim, C. 2003. Principles governing Mg, Ca, and Zn binding and selectivity in proteins. Chem. Rev. 103: 773787.[CrossRef][Medline]
Dudev, T., Lin, Y., Dudev, M., and Lim, C. 2003. First-second shell interaction in metal binding sites in proteins: A PDB survey and DFT/CDM calculations. J. Am. Chem. Soc. 125: 31683180.[CrossRef][Medline]
Einspahr, H. and Bugg, C.E. 1984. Metal ions in biological systems, Vol. 17 (ed. H. Sigel), pp. 5197. Dekker, Basel, Switzerland.
Forsen, S. and Koerdel, J. 1996. Biomolecular structure and dynamics: Experiment and theory. J. Pharmaceut. Biomed. 14: 233246.[CrossRef]
Frausto da Silva, J.J.R. and Williams, R.J.P. 1991. The biological chemistry of the elements: The inorganic chemistry of life. Oxford University Press, New York.
Hanna, R. and Doudna, J. 2000. Metal ions in ribozyme folding and catalysis. Curr. Opin. Chem. Biol 4: 166170.[CrossRef][Medline]
Jernigan, R., Raghunathan, G., and Bahar, I. 1994. Characterization of interactions and metal-ion binding-sites in proteins. Curr. Opin. Struct. Biol. 4: 256263.[CrossRef]
Katz, A.K., Glusker, J.P., Beebe, S.A., and Bock, C.W. 1996. Calcium ion coordination: A comparison with that of beryllium, magnesium and zinc. J. Am. Chem. Soc. 118: 57525763.[CrossRef]
Kretsinger, R.H. and Nockolds, C.E. 1973. Carp muscle calcium-binding protein, II: Structure determination and general description. J. Biol. Chem. 248: 33133326.
Lee, J., Liwo, A., and Scheraga, H.A. 1999. Energy based de novo protein folding by conformational space annealing and an off-lattice united-residue force-field: Application to the 1055 fragment of staphylococcal protein A and to apo-calbindin D9K. Proc. Natl. Acad. Sci. 96: 20252030.
Lippard, S.J. and Berg, J.M. 1994. Principles of bioinorganic chemistry. University Science Books, Mill Valley, CA.
Liwo, A., Oldziej, S., Pincus, M.R., Wawak, R.J., Rackovsky, S., and Scheraga, H.A. 1997a. A united-residue force field for off-lattice protein-structure simulations, I: Functional forms and parameters of long-range side-chain interaction potentials from protein crystal data. J. Comput. Chem. 18: 849873.[CrossRef]
Liwo, A., Pincus, M.R., Wawak, R.J., Rackovsky, S., Oldziej, S., and Scheraga, H.A. 1997b. A united-residue force field for off-lattice protein-structure simulations, II: Parameterization of short-range interactions and determination of weights of energy terms by Z-score optimization. J. Comput. Chem.18: 874887.[CrossRef]
Liwo, A., Kazmierkiewicz, R., Czaplewski, C., Groth, M., Oldziej, S., Wawak, R.J., Rackovsky, S., Pincus, M.R., and Scheraga, H.A. 1998. United-residue force field for off-lattice protein-structure simulations, III: Origin of backbone hydrogen-bonding cooperativity in united-residue potentials. J. Comput. Chem. 19: 259276.[CrossRef]
Liwo, A., Lee, J., Ripoll, D.R., Pillardy, J., and Scheraga, H.A. 1999a. Protein structure prediction by global optimization of a potential energy function. Proc. Natl. Acad. Sci. 96: 54825485.
Liwo, A., Pillardy, J., Kazmierkiewicz, R., Wawak, R.J., Groth, M., Czaplewski, C., Oldziej, S., and Scheraga, H.A. 1999b. Prediction of protein structure using knowledge-based off-lattice united-residue force field and global optimization methods. Theor. Chem. Accounts 101: 1620.
Liwo, A., Czaplewski, C., Pillardy, J., and Scheraga, H.A. 2001. Cumulant-based expressions for the multibody terms for the correlation between local and electrostatic interactions in the united-residue force field. J. Chem.Phys. 115: 23232347.
Liwo, A., Arlukowicz, P., Czaplewski, C., Oldziej, S., Pillardy, J., and Scheraga, H.A. 2002. A method for optimizing potential-energy functions by a hierarchical design of the potential-energy landscape: Application to the UNRES force field. Proc. Natl. Acad. Sci. 99: 19371942.
Liwo, A., Oldziej, S., Czaplewski, C., Kozlowska, U., and Scheraga, H.A. 2004. Parameterization of backbone-electrostatic and multibody contributions to the UNRES force field for protein-structure prediction from ab initio energy surfaces of model systems. J. Phys. Chem. B 108: 94219438.[CrossRef]
Marquardt, D.W. 1963. An algorithm for least-square estimation of nonlinear parameters. J. Soc. Indust. Appl. Math. 11: 431441.[CrossRef]
Mayo, S.A., Olafson, B.D., and Goddard, W.A. 1990. DREIDING: A generic force-field for molecular simulations. J. Phys. Chem. 94: 88978909.[CrossRef]
McPhalen, C.A., Strynadka, N.C., and James, M.N. 1991. Calcium binding site in proteins: A structural perspective. Adv. Prot. Chem. 42: 77144.[Medline]
Némethy, G., Gibson, K.D., Palmer, K.A., Yoon, C.N., Paterlini, G., Zagari, A., Rumsey, S., and Scheraga, H.A. 1992. Energy parameters in polypeptides, 10: Improved geometrical parameters and nonbonded interactions for use in ECEPP/3 algorithm with application to proline-containing peptides. J. Phys. Chem. 96: 64726484.[CrossRef]
Oldziej, S., Kozlowska, U., Liwo, A., and Scheraga, H.A. 2003. Determination of the potentials of mean force for rotation about C
C
virtual bonds in polypeptides from the ab initio energy surfaces of terminally-blocked glycine, alanine, and proline. J. Phys. Chem. A 107: 80358046.[CrossRef]
Pearlman, D.A., Case, D.A., Caldwell, J.W., Ross, W.S., Cheatham, T.E., De-Bolt, S., Ferguson, D., Seibel, G., and Kollman, P. 1995. <<AMBER>>: A package of computer programs for applying molecular mechanics, normal mode analysis, molecular dynamics and free energy calculations to simulate the structural and energetic properties of molecules. Comp. Phys. Commun.91: 142.
Permyakov, S.E., Uversky, V.N., Veprintsev, D.B., Cherskaya, A.M., Brooks, C.L., Permyakov, E.A., and Berliner, L.J. 2001. Mutating aspartate in the calcium-binding site of
-lactalbumin: Effects on the protein stability and cation binding. Prot. Eng. 14: 785789.
Pidcock, E. and Moore, G.R. 2001. Structural characteristics of protein binding sites for calcium and lanthanide ions. J. Biol. Inorg. Chem. 6: 479489.[CrossRef][Medline]
Schmidt, M.W., Baldridge, K.K., Boatz, J.A., Elbert, S.T., Gordon, M.S., Jensen, J.H., Koseki, S., Matsunaga, N., Nguyen, K.A., Su, S., et al. 1993. General atomic and molecular electronic structure system. J. Comput. Chem. 14: 13471363.[CrossRef]
Skelton, N.J., Koerdel, J., and Chazin, W.J. 1995. Determination of the solution structure of apo calbindin D9K by NMR spectroscopy. J. Mol. Biol. 249: 441462.[CrossRef][Medline]
Svensson, L.A., Thulin, E., and Forsen, S. 1992. Proline cistrans isomers in calbindin D9K, observed by x-ray crystallography. J. Mol. Biol 223: 601606.[CrossRef][Medline]
Yamashita, M.M., Wesson, L., Eisenman, G., and Eisenberg, D. 1990. Where metal ions bind in proteins. Proc. Natl. Acad. Sci. 87: 56485652.
Yang, W., Lee, H.W., Hellinga, H., and Yang, J.J. 2002. Structural analysis, identification, and design of calcium-binding sites in proteins. Proteins 47: 344356.[CrossRef][Medline]