Protein Science
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Ho, B. K.
Right arrow Articles by Brasseur, R.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Ho, B. K.
Right arrow Articles by Brasseur, R.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?
Protein Science (2003), 12:2508-2522.
Copyright © 2003 The Protein Society

Revisiting the Ramachandran plot: Hard-sphere repulsion, electrostatics, and H-bonding in the {alpha}-helix

Bosco K. Ho1, Annick Thomas2 and Robert Brasseur1

1 Centre de Biophysique Moléculaire Numérique (CBMN), B-5030 Gembloux, Belgium
2 Institut National de la Santé et de la Recherche Médicale (INSERM), 75013 Paris, France

Reprint requests to: Bosco K. Ho, Centre de Biophysique Moléculaire Numérique (CBMN), 2 Passage des déportés, B-5030 Gembloux, Belgium; e-mail: ho.b{at}fsagx.ac.be; fax: +32-81-622-522.

(RECEIVED June 2, 2003; FINAL REVISION July 14, 2003; ACCEPTED July 16, 2003)

Article and publication are at http://www.proteinscience.org/cgi/doi/10.1110/ps.03235203.


    Abstract
 TOP
 Abstract
 Introduction
 Materials and methods
 Results and Discussion
 References
 
What determines the shape of the allowed regions in the Ramachandran plot? Although Ramachandran explained these regions in terms of 1–4 hard-sphere repulsions, there are discrepancies with the data where, in particular, the {alpha}R, {alpha}L, and ß-strand regions are diagonal. The {alpha}R-region also varies along the {alpha}-helix where it is constrained at the center and the amino terminus but diffuse at the carboxyl terminus. By analyzing a high-resolution database of protein structures, we find that certain 1–4 hard-sphere repulsions in the standard steric map of Ramachandran do not affect the statistical distributions. By ignoring these steric clashes (N···Hi+1 and Oi-1···C), we identify a revised set of steric clashes (Cß···O, Oi-1···Ni+1, Cß···Ni+1, Oi-1···Cß, and Oi-1···O) that produce a better match with the data. We also find that the strictly forbidden region in the Ramachandran plot is excluded by multiple steric clashes, whereas the outlier region is excluded by only one significant steric clash. However, steric clashes alone do not account for the diagonal regions. Using electrostatics to analyze the conformational dependence of specific interatomic interactions, we find that the diagonal shape of the {alpha}R and {alpha}L-regions also depends on the optimization of the N···Hi+1 and Oi-1···C interactions, and the diagonal ß-strand region is due to the alignment of the CO and NH dipoles. Finally, we reproduce the variation of the Ramachandran plot along the {alpha}-helix in a simple model that uses only H-bonding constraints. This allows us to rationalize the difference between the amino terminus and the carboxyl terminus of the {alpha}-helix in terms of backbone entropy.

Keywords: Ramachandran plot; {alpha}-helix; hard-sphere model; H-bonds


    Introduction
 TOP
 Abstract
 Introduction
 Materials and methods
 Results and Discussion
 References
 
In 1963, Ramachandran et al. introduced the {varphi}{xi} angles (Fig. 1AGo) as a parameterization of the protein backbone. The plot of these angles, the Ramachandran plot, has become a standard tool used in determining protein structure (Morris et al. 1992; Kleywegt and Jones 1996) and in defining secondary structure (Chou and Fasman 1974; Muñoz and Serrano 1994). Using an analysis of local hard-sphere repulsions between atoms that are at least third neighbors (1–4 interactions), Ramachandran et al. (1963) constructed a steric map of the Ramachandran plot that predicted the commonly allowed regions: the {alpha}R, {alpha}L, and ß-regions. This steric map (Fig. 1BGo) has become the standard interpretation of the Ramachandran plot (Richardson 1981) where Mandel et al. (1977) identified the specific steric clashes that define the boundaries of the standard steric map.



View larger version (30K):
[in this window]
[in a new window]
 
Figure 1. Schematic of the {varphi}{psi} angles. (A) The schematic of the alanine dipeptide that represents the protein backbone parameterized by the {varphi} = C-N-C-C{alpha}-C and {psi} = N-C{alpha}-C-N dihedral angles. (B) The original Ramachandran steric map (Ramachandran et al. 1963) where the specific hard-sphere repulsions (dashed line) identified by Mandel et al. (1977) define the allowed regions (gray): {alpha}L, {alpha}R, and ß regions.

 
However, there are differences with the data. Using a high-resolution (<1.8 Å) database of structures with a sample size of nearly 100,000 residues (Lovell et al. 2003), we can see differences between the observed Ramachandran plot (Fig. 2AGo) and the standard steric map (see Fig. 1BGo). The {alpha}R and {alpha}L regions are diagonal (Garnier and Robson 1990; Hovmöller et al. 2002). The ß-region partitions into two diagonal lobes: the ß-strand region (left) and the polyproline II region (right; Kleywegt and Jones 1996; Hovmöller et al. 2002). There also exists sparsely populated regions that are forbidden in the standard steric map such as the {gamma} and {gamma} regions (Milner-White 1990), the type II turn region (Sibanda and Thornton 1985), and the pre-Pro region (Macarthur and Thornton 1991).



View larger version (42K):
[in this window]
[in a new window]
 
Figure 2. Ramachandran plots. (A) All residues excluding Pro, Gly, and pre-Pro; (B) residues in the center of the {alpha}-helix, which are more constrained than for all residues; (C) the Ncap residue; and (D) the Ccap residue in the {alpha}-helix, which are scattered throughout the entire allowed region.

 
Various studies have refined the calculation of the Ramachandran plot by using Lennard-Jones potentials and electrostatics (for a review, see Ramachandran and Sasisekharan 1968). Nevertheless, electrostatics fail to adequately reproduce the Ramacahandran plot (Lovell et al. 2003). In particular, the origin of the diagonal shape of the {alpha}R, {alpha}L, and ß-strand regions is not well understood. Furthermore, Hu et al. (2003) showed that typical molecular mechanics (MM) force fields generate unrealistic Ramachandran plots. In contrast, they modeled the alanine dipeptide using quantum mechanics (QM), which they placed in an explicit solvent modeled with MM. They reproduced the observed Ramachandran plot, showing that the Ramachandran plot arises from local backbone interactions.

Is there a simple way to account for the boundaries of the observed Ramachandran plot? To this end, we have analyzed the statistical distributions of the interatomic distances parameterized by the {varphi}{xi} angles. We found that certain 1–4 steric clashes in the standard steric map have no discernible effect on the statistical distributions. By ignoring these clashes, we can analyze the contributions of the remaining steric clashes. We thus obtain a revised steric map that produces a better match to the observed Ramachandran plot.

However, steric clashes do not account for the diagonal shape of the {alpha}R-region. The standard steric map predicted a smaller {alpha}R-region (see Fig. 1BGo) than the observed {alpha}R-region (Fig. 2AGo). However, the predicted {alpha}R-region is also elongated horizontally into regions where there is no observed density. Another problem is why the Ramachandran plot of residues in {alpha}-helices is constrained to the lower half of the general {alpha}R-region (Fig. 2BGo). It is often stated (Karplus 1996) that the {alpha}R-region consists of two discrete regions: the helical {alpha}R-region and the {delta}R-region. In this study, we attempt to clarify the relationship between the general {alpha}R-region and the helical {alpha}R-region.

Given that the strong diagonal shape of the observed {alpha}R-region has been reproduced by QM calculations (Hu et al. 2003), the shape of the {alpha}R-region must be due to local backbone interactions. Lovell et al. (2003) argued that the diagonal {alpha}R-region is due to the disfavoring of the conformations near (-150°, -60°) where the H and Hi+1 atoms are close together. However, we find that crowded H and Hi+1 atoms are also found in favored conformations of the {alpha}R-region, for example (-110°, 0°). As the crowding of H atoms produces different results in different parts of the Ramachandran plot, something else must induce the diagonal shape of the {alpha}R-region.

We use electrostatics to analyze the conformational dependence in the Ramachandran plot of specific interatomic interactions. We find that various dipole–dipole interactions, when combined with the revised steric map, conformationally induce diagonal {alpha}R, {alpha}L, and ß-strand regions. Although, in general, electrostatics cannot account for the Ramachandran plot (Lovell et al. 2003), the conformational dependence of individual interatomic interactions in the Ramachandran plot cannot differ greatly between electrostatics and QM. After all, only atoms with opposite partial charges attract and like charges repel. However, as the strength of individual interactions can vary greatly in the QM calculation, the electrostatic approximation fails when all the individual minima are summed together.

Recent studies have found that the shape of the helical {alpha}R-region varies depending on the position of the residue in the {alpha}-helix. In the central residues and in the amino terminus, the helical {alpha}R-region is constrained to the lower half of the general {alpha}R-region. However, Petukhov et al. (2002) found that the Ramachandran plot at the carboxyl terminus is much more diffuse than the rest of the {alpha}-helix. This flexibility in the carboxyl terminus has also been observed in peptide studies (Miick et al. 1993). In simulations, there is an asymmetry between the amino terminus and the carboxyl terminus in both folding (Sung 1994; Voegler-Smith and Hall 2001) and unfolding (Soman et al. 1991) studies. The origin of this asymmetry has not yet been resolved.

Ramachandran and Sasisekharan (1968) showed that H-bonding constraints induce the constrained helical {alpha}R-region. They analyzed {alpha}-helices where all residues were parameterized with the same {varphi}{xi} angles. They identified the {varphi}{xi} angles where d(Oi···Hi+4) ~2.0 Å for all CO···HN H-bonds along the {alpha}-helix. These {varphi}{xi} angles correspond to the constrained helical {alpha}R-region in central helical residues. However, as the analysis of Ramachandran and Sasisekharan (1968) used {alpha}-helices that had identical {varphi}{xi} angles, this only accounts for central helical residues. What then causes the differences between the amino terminus and the carboxyl terminus? We first analyzed the Ramachandran plots along different positions of the {alpha}-helix in the structural database. Then, using an extension of the model of Ramachandran and Sasisekharan (1968), we studied the constraints of the backbone H-bonding along the {alpha}-helix. As our model reproduced the observed variation along the {alpha}-helix, we can use backbone H-bonding to explain the observed differences between the amino terminus and the carboxyl terminus of the {alpha}-helix.


    Materials and methods
 TOP
 Abstract
 Introduction
 Materials and methods
 Results and Discussion
 References
 
Data set
We used the data set of 500 nonhomologous proteins (Lovell et al. 2003) from the PDB (Bernstein et al. 1977) with resolution better than 1.8 Å. In this data set, all hydrogen atoms have been projected from the backbone and optimized. Due to their specialized Ramachandran plots, we excluded Gly, Pro, and pre-Pro residues (MacArthur and Thornton 1991) from our analysis. In the steric clash analysis, we used the van der Waals (vdW) radii given by the Richardson lab (Word et al. 1999) (H{alpha} = 1.17 Å, H = 1.00 Å, C = 1.65 Å, C{alpha} and Cß = 1.75 Å, O = 1.40 Å, and N = 1.55 Å). We used DSSP (Kabsch and Sander 1983) to define {alpha}-helical residues.

Local conformations of the {varphi}{xi} map
To calculate the ideal curves of the interatomic distances as a function of the {varphi}-{xi} angles, we modeled the alanine dipeptide (see Fig. 1AGo). Covalent bond lengths and angles were fixed to standard Engh and Huber (1991) values, which only allows the {varphi}{xi} angles to vary. The {varphi}{xi} angles of the central residue were incremented in 5° steps and the corresponding distance parameters were calculated. Then, we generated the energy map of the Ramachandran plot by calculating, for each value of {varphi}{xi}, the energy of various interatomic interactions. We used two types of interactions: partial charge electrostatics


and Lennard-Jones 12–6 potentials


where the parameters were taken from CHARMM22 (MacKerell Jr. et al. 1998).

Model of {alpha}-helix
We modeled the {alpha}-helix with a chain of 7 Ala residues. Covalent bond lengths and angles were fixed to standard Engh and Huber (1991) values where the {varphi}{xi} angles are the only degrees of freedom. As the {varphi}{xi} angles of the Ncap and Ccap do not affect the geometry of the H-bonds within the {alpha}-helix, they were ignored.

The simplest requirement to form CO···HN H-bonds is that d(O···H) ~2.0 Å. Thus, to impose a given CO···HN H-bond, we used a harmonic distance constraint to minimize the O···H distance:


The minimum of this constraint is zero when d[O···H] = 2.0 Å. We also used ECO···HN to measure the deviation from the ideal CO···HN H-bond geometry when the given conformation cannot form the CO···HN H-bond. To avoid steric clashes, we applied Lennard-Jones 12–6 potentials:


where the parameters were taken from CHARMM22 (MacKerrell Jr. et al. 1998).

To analyze the H-bonding constraints in the amino terminal residues (N1, N2, and N3; red in Fig. 8BGo, below), we fixed the {varphi}{xi} angles of N4, N5, and N6 to the average helical values (-63°, -42°), which assumes that the {alpha}-helix from N4 to the carboxy-terminal is fixed in the {alpha}-helical conformation. We then minimized the energy function:



View larger version (22K):
[in this window]
[in a new window]
 
Figure 8. H-bonding in the amino terminus and carboxyl terminus of the {alpha}-helix. (A) Carboxyl terminus showing the carboxy-terminal residues (red) and the H-bonds (red) used in the model. (B) The amino terminus showing the amino-terminal residues (red). (C) The schematic of the allowed region in C1 residue (solid red), which is due to steric constraints (black), electrostatics (red outline), and formation of H-bonds that bring the two H atoms together (blue; see also A). (D) The schematic of the H-bonding constraints on the N1 residue (see also B).

 

where the first term refers to the Lennard-Jones potential, which models the steric clashes, and the second term refers to the harmonic potentials that minimizes the CONc···HNN4, CON1···HNN5, and CON2···HNN6 H-bonds (red in Fig. 8BGo, below).

  1. For N1, we divided the Ramachandran plot into a grid of points separated by 5° intervals. For each grid point, we used Powell minimization (Press et al. 1986) to minimize E by varying the {varphi}{xi} angles of the N2 and N3 residues. We repeated the process for all grid points of N1 to generate an energy profile of N1.
  2. For the grid points of N2, we allow the {varphi}{xi} angles of N1 and N3 to vary.
  3. For the grid points of N3, we allow the {varphi}{xi} angles of N1 and N2 to vary.

To analyze the H-bonding constraints in the carboxy-terminal residues (C1, C2, and C3; red in Fig. 8AGo, below), we fixed the {varphi}{xi} angles of C4, C5, and C6 to the average helical values of (-63°, -42°), which assumes that the {alpha}-helix from C4 to the amino terminus is fixed in {alpha}-helical conformation. In the energy minimization, we modeled the COC4···HNCc, COC5···HNC1, and COC6···HNC2 H-bonds (red in Fig. 8AGo, below).

  1. For the grid points of C3, we allow the {varphi}{xi} angles of C2 and C1 to vary.
  2. For the grid points of C2, we allow the {varphi}{xi} angles of C1 and C3 to vary.
  3. For the grid points of C1, we allow the {varphi}{xi} angles of C2 and C3 to vary.


    Results and Discussion
 TOP
 Abstract
 Introduction
 Materials and methods
 Results and Discussion
 References
 
Because the database of protein structures contains a large number of residues (97,368), we can compare the statistical distributions directly to the ideal geometry of the protein backbone. The local interatomic distances that are directly parameterized by the {varphi}{xi} angles can be divided into three categories: {varphi} dependent, {xi} dependent, and {varphi}-{xi} codependent distances. In Table 1Go, we list the parameters of these interatomic distances. By comparing the value of the 5% minimum (5th percentile band) with the vdW diameter, we can see which atoms are in contact and can interact. We focus on the steric clashes of the standard steric map (Ramachandran and Sasisekharan 1968). As described in Mandel et al. (1977), they are: Oi-1···C and Oi-1···Cß, which restricts {varphi}; N···Hi+1 and Cß···Hi+1, which restricts {xi}; and O···Hi+1, H···Hi+1 and Oi-1···O, which shaves off the corners of the allowed regions (see Fig. 1BGo).


View this table:
[in this window]
[in a new window]
 
Table 1. Range of the interatomic distances [Å] parameterized by the {varphi}{xi} angles
 
The {varphi} dependent and {xi} dependent steric constraints
The {varphi}-dependent distances
We first consider the restrictions from the standard steric map that restrict {varphi}: Oi-1···Cß and Oi-1···C (see Fig. 1BGo). To evaluate the effect of each steric clash on the observed distribution, we can make two comparisons. First, we can compare the {varphi} frequency distributions to the ideal curve. The idea is that if a hard-sphere repulsion restricts {varphi}, then, in regions of {varphi} where the ideal curve is below the vdW diameter, the {varphi} frequency distribution should drop correspondingly. Distributions that are found below the vdW radius indicates a steric overlap that could be due to some kind of interaction. For example, Ho and Curmi (2002) showed that in the allowed regions of {varphi} in ß-sheet residues, there is an Oi-1···H{alpha} nonbonded electrostatic interaction where most of the observed values are found below the vdW diameter (Fig. 3AGo). We plot the observed frequency distribution of {varphi} at the bottom of Figure 3Go. For the ideal curves of both d(Oi-1···Cß) versus {varphi} (Fig. 3DGo) and d(Oi-1···C) versus {varphi} (Fig. 3BGo), we see that as the interatomic distance decreases below the vdW diameter, the {varphi} frequency distribution drops correspondingly. This is consistent with the Oi-1···Cß and Oi-1···C steric clashes restricting the {varphi} angle.



View larger version (19K):
[in this window]
[in a new window]
 
Figure 3. Distributions of interatomic distances [Å] parameterized by {varphi} [°]. The ideal curves (gray) are calculated using Engh and Huber (1991) geometry. The vdW diameters (dashed line) are taken from Word et al. (1999). (A) Oi-1···H{alpha} versus {varphi}; (B) Oi-1···C versus {varphi}; (C) Ci-1···Cß versus {varphi}; and (D) Oi-1···Cß versus {varphi}. The {varphi} frequency distribution is shown at the bottom of D.

 
Second, we can compare the observed distributions against the ideal curves based on standard geometry (see Materials and Methods). Deviation of the observed distribution from the ideal curve indicates possible steric strain. The observed distributions of d(Oi-1···C) versus {varphi} (Fig. 3BGo) and d(Oi-1···Cß) versus {varphi} (Fig. 3DGo) fit the ideal curves well, showing that there are no significant deviations from standard geometry.

The {xi}-dependent distances
In the standard steric map, it is the N···Hi+1 steric clash that restricts {xi} in the region 0° < {xi} < 90° (see Fig. 1BGo). Comparing the ideal curve of d(N···Hi+1) versus {xi} to the {xi} frequency distribution (bottom of Fig. 4Go), we see that there is no corresponding drop in the {xi} frequency distribution as d(N···Hi+1) descends below its vdW diameter (Fig. 4CGo). The N···Hi+1 steric clash has no effect on the {xi} angle. Furthermore, the observed distribution of d(N···Hi+1) versus {xi} is distorted from the ideal curve for the region where d(N···Hi+1) is below the vdW diameter. Karplus (1996) has shown that this deviation accommodates the close approach of the N···Hi+1 interaction. On the other hand, we find that the ideal curve of d(Cß···O) versus {xi} corresponds quite well to the variation of the {xi} frequency distribution (Fig. 4DGo). This suggests that in the region 0° < {xi} < 90°, we can ignore the effects of the N···Hi+1 steric clash and instead, use the Cß···O steric clash. Indeed, given that the N···Hi+1 interaction deviates from the ideal geometry, the position of the Hi+1 atom is somewhat flexible.



View larger version (19K):
[in this window]
[in a new window]
 
Figure 4. Distributions of interatomic distances [Å] parameterized by {psi} [°]. The ideal curves (gray) are calculated using Engh and Huber (1991) geometry. The vdW diameter (dashed line) are taken from Word et al. (1999). (A) Cß···Hi+1 versus {xi}; (B) Cß···Ni+1 versus {xi}; (C) N···Hi+1 versus {xi}; and (D) Cß···O versus {xi}. The {xi} frequency distribution is shown at the bottom of D.

 
In the standard steric map, it is the Cß···Hi+1 steric clash that restricts {xi} in the region -180° < {xi} < -50° (see Fig. 1BGo). In the comparison of the {xi} frequency distribution (bottom of Fig. 4Go) to the ideal curve of d(Cß···Hi+1) versus {xi}, the Cß···Hi+1 steric clash appears to restrict {xi} (Fig. 4AGo). However, the observed 5% minimum value of d(Cß···Hi+1) is 0.21 Å higher than the vdW diameter (Table 1Go), suggesting that the Cß···Hi+1 steric clash is not responsible for the restriction on {xi}. Is there any other interaction that could be responsible? The ideal curve of d(Cß···Ni+1) versus {xi} also corresponds to the drop-off in the {xi} distribution (Fig. 4BGo). However, the observed 5% minimum value of Cß···Ni+1 is below the vdW diameter (Table 1Go), which is a clear steric contact. Hence, we should ignore the Cß···Hi+1 steric clash and replace it with the Cß···Ni+1 steric clash. Furthermore, as the H atom is more flexible than the other backbone atoms and the H atom has a negligible vdW interaction, we expect that the Cß···Hi+1 interaction will be soft and not behave as a hard steric clash.

The {varphi}{xi} codependent distances
However, if we look at interatomic distances as a function only of {varphi}, or as a function only of {xi}, then we will miss steric clashes that are {varphi}{xi} codependent. For example, in the standard steric map, the Oi-1···C steric clash excludes the middle of the Ramachandran plot, resulting in vertical boundaries in the {alpha}, {alpha}L, and ß regions (see Fig. 1BGo). However, these vertical boundaries are not found in the observed distribution, where the corresponding boundaries are diagonal (see Fig. 2AGo). Because the {varphi}{xi} codependent steric clashes induce diagonal boundaries, if we ignore the Oi-1···C steric clash, then we can identify the steric clashes that induce diagonal boundaries (Fig. 5AGo).



View larger version (37K):
[in this window]
[in a new window]
 
Figure 5. Revised steric map. (A) The steric clashes (dashed blue lines) that best match the data. d(Oi-1···O) = 2.7Å, d(Oi-1···Ni+1) = 2.7 Å, and d(H···Hi+1) = 1.6 Å. (B) Schematic of the revised steric map showing steric restrictions (dashed blue lines) and sterically allowed regions (dark blue). The revised steric map gives diagonal boundaries for the {alpha}R, {alpha}L, and ß regions and defines a more realistic upper boundary for the {alpha}R-region. Diagonal {alpha}R and {alpha}L regions (red region) from the dipole–dipole analysis (Fig. 7GGo) are defined mainly by the attractive Oi-1···C and N···Hi+1 interactions (red lines). The diagonal ß-strand region (yellow) is induced by aligning the CO···HN dipole–dipole interaction. Regions that are only excluded by a single steric clash (light blue) accounts for the outlier region in Lovell et al. (2003).

 
To make the comparison with the data, we generate all the contour plots of constant distance for the {varphi}{xi} codependent interactions. We show these contour plots in Figure 6Go mainly as a reference. We then define the steric boundaries of each contour plot by considering the regions where the distances are smaller than the corresponding vdW diameter (Table 1Go). In Figure 5AGo, we identify the steric clashes that best match the diagonal boundaries of the observed distribution. These diagonal boundaries exclude a region in the Ramachandran plot that runs down the middle of the plot. This excluded region can be divided into two. The first region, excluded by the Oi-1···O steric clash, consists of both the upper-central and lower-central regions, which are symmetric due to the inversion symmetry found in all the contour maps (Fig. 6Go). The second region, excluded by the Oi-1···Ni+1 steric clash, is in the center of the Ramachandran plot.



View larger version (60K):
[in this window]
[in a new window]
 
Figure 6. Contour plots of the {varphi}{xi} codependent interactions. The contours of constant distance [Å] are shown as functions of the {varphi}{xi} angles [°]. These interactions can be grouped in terms of dipole–dipole interactions in the alanine dipeptide (see Fig. 1AGo) where the contour plots within each group are geometrically similar. In terms of the COi-1···CO dipole–dipole interaction, they are (A) Oi-1···O, (B) Ci-1···O. In terms of the NH···NHi+1 interaction, they are (C) H···Hi+1 and (D) H···Ni+1. In terms of the COi-1···NHi+1 interaction, they are (E) Oi-1···Ni+1, (F) Oi-1···Hi+1, (G) Ci-1···Hi+1, and (H) Ci-1···Ni+1. In terms of the CO···HN interaction, the only {varphi}{xi} codependent distance is (I) O···H. All contour plots possess a twofold inversion symmetry through the point {varphi} = {xi} = 0°. The sterically excluded regions are defined as the regions where the interatomic distance is smaller than the corresponding vdW diameter (see Table 1Go).

 
The revised steric map of the Ramachandran plot
From the analysis above, we find that the N···Hi+1 steric clash does not affect the frequency distributions of {xi} and that ignoring the Oi-1···C steric clash results in well-defined diagonal boundaries in the Ramachandran plot. Thus, we obtain a revised set of steric clashes where (1) the Oi-1···Cß steric clash restricts {varphi}; (2) the Cß···O and Cß···Ni+1 steric clashes restrict {xi}; and (3) the Oi-1···O and O···Ni+1 steric clashes restrict {varphi}{xi}. Compared to the standard steric map (see Fig. 1BGo), the revised steric map (dark blue regions in Fig. 5BGo) matches the data better. The revised steric map gives a better upper bound to the {alpha}R-region and defines diagonal boundaries in the ß, {alpha}R, and {alpha}L region (Fig. 5AGo).

In their analysis of the {varphi}{xi} distribution, Lovell et al. (2003) defined regions in the observed Ramachandran plot in terms of favored (98%), outlier (between 98% and 99.5%), and strictly forbidden regions. In the outlier region, observed conformations are rare but nevertheless allowed if there exists a compensating interaction. The outlier regions include the plateau region below the {alpha}R region, the region between the {alpha}R and ß-regions, and a sinuous, sparsely populated stripe at {varphi} ~70° (Fig. 5AGo). The outlier region includes the rare type II‘ turn, {gamma} and {gamma}‘ conformations. In contrast, conformations near {varphi} = 0° are strictly forbidden.

What is the difference between the outlier and strictly forbidden regions? We find that (1) the strictly forbidden region corresponds to the region excluded by the Oi-1···Cß, Oi-1···O, and O···Ni+1 steric clashes; and (2) the outlier region is excluded by the Cß···O and Cß···Ni+1 steric clashes. Although we have identified only a single steric clash that is induced by diagonal boundaries in Figure 5BGo, some of the boundaries are in fact induced by multiple steric clashes. The existence of multiple hard steric clashes accounts for the difference between the strictly forbidden and outlier regions. The multiple steric clashes exist because we can group the {varphi}{xi} codependent distances in terms of the dipole–dipole interactions in the alanine dipeptide (see Fig. 1AGo). The contour plots that belong to each dipole–dipole interaction are geometrically similar (Fig. 6Go).

In the strictly forbidden region of the Ramachandran plot (white in Fig. 5BGo), both the Oi-1···O (Fig. 6AGo) and Ci-1···O (Fig. 6BGo) steric clashes exclude the same {varphi}{xi} region. We find that all the interatomic interactions that are grouped within the COi-1···NHi+1 interaction dipole–dipole interactions (Oi-1···Ni+1, Oi-1···Hi+1, Ci-1···Hi+1, and Ci-1···Ni+1) exclude the same central region in the Ramachandran plot (Fig. 6EGo–H). We also find that both the Oi-1···Cß steric clash (see Fig. 3DGo) and Ci-1···Cß steric clash (see Fig. 3CGo) exclude the same region of {varphi} where the Ci-1···Cß interaction is in a particularly serious steric overlap. This steric overlap could be an indication that the vdW radius of C (Word et al. 1999) is overestimated or that the electron shell of C is not entirely spherical.

In contrast, the outlier region corresponds to the region restricted by a single steric clash (light blue in Fig. 5BGo). For the region 0° < {xi} < 90°, only the Cß···O steric clash restricts {xi} (see Fig. 4DGo). It is not reinforced by N···Hi+1 (see Fig. 4CGo) as the N···Hi+1 interaction is not a hard steric clash. In the other region -180° < {xi} < -50°, as Cß···Hi+1 is probably not a hard steric clash (see Fig. 4AGo), only the Cß···Ni+1 steric clash (Fig. 4BGo) restricts {xi}.

Local electrostatic interactions in the Ramachandran plot
However, not all the features of the observed Ramachandran plot can be explained by local steric clashes. In this section, we focus on the diagonal shapes of the {alpha}R, {alpha}L, and ß-strand region. In previous studies, the {gamma} and {gamma}‘ regions were explained in terms of a C7 H-bond (Milner-White 1990). The polyproline II region within the ß-region was explained in terms of both a favorable COi-1···CO interaction (Maccallum et al. 1995) and as the most entropically favored conformation (Pappu and Rose 2002). Ho and Curmi (2002) showed that restrictions due to hydrogen bonds in ß-sheet formation induce a diagonal ß-strand region. However, the diagonal shape of the ß-strand region is also induced for residues not in ß-sheets. Therefore, the diagonal ß-strand region must also arise from local backbone interactions.

Lovell et al. (2003) argued that the diagonal {alpha}R-region is due to the disfavoring of the conformations near (-150°, -60°) (Fig. 5AGo), where the H and Hi+1 atoms are close together. They postulated that the crowding of the H atoms is disfavored because this prevents the formation of one H-bond with the solvent. However, comparing the contour distance plot of H···Hi+1 (Fig. 6CGo) with the observed {alpha}R-region (see Fig. 2AGo), we can see that favored conformations in the observed plot, such as (-110°, 0°), also has crowded H and Hi+1 atoms. As the crowding of H atoms produces different results in different parts of the Ramachandran plot, something else must induce the diagonal shape of the {alpha}R-region.

Following Maccallum et al. (1995), we analyze the electrostatic interactions of the alanine peptide in terms of the dipole–dipole interactions: the COi-1···CO, NH···NHi+1, COi-1···NHi+1, and CO···NH interactions. The difference with the study of Maccallum et al. (1995) is that in our calculation, we have included the Lennard-Jones potentials of our revised set of steric clashes (Fig. 7AGo).



View larger version (43K):
[in this window]
[in a new window]
 
Figure 7. Contour plots of the dipole–dipole interactions [kcal/mole] as a function of the {varphi}{xi} angles [°]. Energy plots of (A) the Lennard-Jones 12–6 potentials of the revised set of steric clashes; (B) all electrostatic interactions; the individual dipole–dipole interactions of (C) COi-1···CO; (D) NH···NHi+1; (E) CO···NH; and (F) COi-1···NHi+1. (G) The combination of the COi-1···CO, NH···NHi+1 and CO···NH dipole–dipole interactions produces clear diagonal minima in the {alpha}R, {alpha}L, and ß regions.

 
The combined electrostatic map (Fig. 7BGo) does not produce a minimum in the {alpha}R-region. However, when considered individually, we find that, of the four dipole–dipole interactions, the COi-1···CO (Fig. 7CGo), NH···NHi+1 (Fig. 7DGo), and CO···NH (Fig. 7EGo) interactions induce diagonal shapes in the {alpha}R and {alpha}L regions. Consequently, the energy map that combines the COi-1···CO, NH···NHi+1, and CO···NH interactions (Fig. 7GGo) produces well-defined diagonal minima in the {alpha}R and {alpha}L regions. In the backbone conformation of these regions (the diagram in Fig. 1AGo corresponds to such a conformation), (1) the COi-1 dipole points toward the CO dipole such that Oi-1 is in contact with C; (2) the NHi+1 dipole points toward the NH dipole such that the N atom is in contact with the Hi+1 atom; and (3) the CO and NH groups are aligned in an antiparallel conformation such that O is as far away from H as possible. A simple description of this conformation is that the Oi-1···C and N···Hi+1 attraction are simultaneously optimized. Optimizing the Oi-1···C interaction will restrict |{varphi}| < 100°, and optimizing the N···Hi+1 interaction will restrict |{varphi}| < 80° (see Fig. 5BGo). The optimization of the N··· Hi+1 interaction in the {alpha}R-region was also observed by Karplus (1996).

Maccallum et al. (1995) showed that the polyproline II region corresponds to a minimum in the electrostatic COi-1···CO interaction. We can see this in Figure 7CGo. Similarly, we find that the diagonal ß-strand region can also be explained in terms of an electrostatic dipole–dipole interaction. A diagonal minimum of the CO···NH is induced (Fig. 7EGo), which corresponds to the observed ß-strand region (see Fig. 5AGo). In this minimum, the CO and NH groups in the backbone are essentially aligned and co-planar. This CO···HN electrostatic minimum is so deep that the diagonal ß-strand region is still found in the combined electrostatic interaction (Fig. 7BGo).

Although it has been shown that the COi-1···NHi+1 interaction induces the {gamma} and {gamma}‘ region (Milner-White 1990), the electrostatic approximation of the COi-1···NHi+1 interaction does not induce a minimum in the {gamma} region (Fig. 7FGo). However, it does induce a weak minimum in the {gamma}‘ region. Compared to the QM calculcations (Hu et al. 2003), the electrostatic approximation of the COi-1···NHi+1 interaction is poor, which is probably the reason why the combined electrostatic map (Fig. 7BGo) does not give the diagonal {alpha}R-region.

Ramachandran plots of the {alpha}-helix
Although the Ramachandran plot of residues in {alpha}-helices is found within the {alpha}R-region (Ramachandran and Sasisekharan 1968), there are subtle but significant differences. The Ramachandran plot of residues in the center of the {alpha}-helix is smaller than the {alpha}R-region and the Ramachandran plot varies at different positions of the {alpha}-helix termini (Petukhov et al. 2002). We use the Richardson and Richardson terminology (1988) to describe the different positions of the {alpha}-helical residues. The residues at the amino terminus are labeled Ncap-N1-N2-N3-N4–··· (Fig. 8BGo) where the amino-terminal residues (N1, N2, N3) only contribute CO groups to H-bonds. The residues at the carboxyl terminus are labeled ···C4-C3-C2-C1-Cap (Fig. 8AGo) where the carboxy-terminal residues (C1, C2, C3) only contribute NH groups to H-bonds. Ccap and Ncap are boundary residues, which are not considered part of the {alpha}-helix.

Here, we plot the Ramachandran plots of the {alpha}-helical residues: Ncap (see Fig. 2CGo), N1, N2, N3 (Fig. 9Go), central (see Fig. 2BGo), C3, C2, C1 (Fig. 10Go), and Ccap (see Fig. 2DGo). The statistical parameters of these distributions are listed in Table 2Go. There appear to be no systematic restraints on the capping residues as the Ramachandran plot of the Ncap and Ccap residues are found all over the Ramachandran plot (see Fig. 2C,DGo). This is understandable given the plurality of capping interactions in the {alpha}-helix (for review, see Aurora and Rose 1998). The central (see Fig. 2BGo), N1, N2, N3 (Fig. 9Go), and C3 (bottom of Fig. 10Go) residues all have similar Ramachandran plots, which are constrained to the lower half of the general {alpha}-region of the Ramachandran plot. The C2 residue (center of Fig. 10Go) is slightly more diffuse than the central residues, whereas the C1 residue (top of Fig. 10Go) is identical to the general {alpha}R-region.



View larger version (53K):
[in this window]
[in a new window]
 
Figure 9. The Ramachandran plot of the amino-terminal residues. The left column gives the observed distribution. The right column gives the energy map of the H-bonding constraints and Lennard-Jones potential. The Ramachandran plot has been truncated for clarity.

 


View larger version (51K):
[in this window]
[in a new window]
 
Figure 10. The Ramachandran plot of the carboxyl-terminal residues. The left column gives the observed distribution. The right column gives the energy map of the H-bonding constraints and Lennard-Jones potential. The Ramachandran plot has been truncated for clarity.

 

View this table:
[in this window]
[in a new window]
 
Table 2. Parameters of the {varphi}{xi} distributions in the {alpha}-helix
 
We also examined the Ramachandran plots of N1, N2, N3, C3, C2, and C1 for different amino acids but did not find any significant differences between the amino acids. This is consistent with previous studies (Chakrabarti and Pal 2001; Lovell et al. 2003), which found that the contours of the Ramachandran plot are relatively stable although the frequencies of occurrence differ for the different amino acids. Given that the contours for each {alpha}-helical positions are the same for different amino acids, the shape of the contours must be due to backbone interactions.

H-bonds in the {alpha}-helix
What kind of backbone interactions can induce different constraints along the {alpha}-helix? The obvious interaction is the backbone H-bond. To analyze the H-bonding constraints, we extend the analysis of Ramachandran and Sasisekharan (1968) where, instead of modeling identical {varphi}{xi} angles along the {alpha}-helix, we treat the {varphi}{xi} angles of different residues independently (see Materials and Methods). We modeled the amino terminus by allowing the {varphi}{xi} angles of the N1, N2, and N3 residues to vary independently to form the first three CO···HN H-bonds (red in Fig. 8BGo). Similarly, we model the carboxyl terminus by allowing the {varphi}{xi} angles of the C1, C2, and C3 residues (red in Fig. 8AGo) to vary independently to form the last three CO···HN H-bonds (red in Fig. 8AGo). This induces different restrictions on the N1, N2, N3, C3, C2, and C1 residues.

To model the CO···HN H-bonds, we use harmonic distance constraints (see Materials and Methods). Although we also considered electrostatics and Lennard-Jones potentials, we found that the harmonic distance constraint was sufficient to induce well-formed H-bonds and that using electrostatics to align the CO and NH dipoles did not make a significant difference. Furthermore, the harmonic distance constraint easily converged to a unique solution. We also imposed Lennard-Jones 12–6 potentials of the revised set of steric clashes to avoid local steric clashes. Subsequently, we obtain energy maps of the Ramachandran plot that show regions where the H-bonds are allowed to form and where there are no significant steric clashes. The restricted regions are reproduced for N1, N2, N3 (Fig. 9Go) and C3 (bottom of Fig. 10Go). A more diffuse region is obtained for C2 (center of Fig. 10Go) and a very diffuse region is obtained for C1 (top of Fig. 10Go). H-bonding constraints thus explain the variation in the Ramachandran plots along the {alpha}-helix.

How can we understand the big difference between the Ramachandran plots of the N1 (bottom of Fig. 9Go) and C1 (top of Fig. 10Go) residues? The H-bonding constraints can be understood as the problem of simultaneously forming two neighboring CO···HN H-bonds in the {alpha}-helix. When these two CO···HN H-bonds are formed, they will be parallel and close together. The N1 residue is found between the CONc···HNN4 and CON1···HNN5 H-bonds. Forming these two H-bonds simultaneously will minimize the ONc···ON1 distance (colored blue in Fig. 8BGo). Consequently, from the contour plot of d(Oi-1···O) versus {varphi}{xi} (see Fig. 6AGo), we extract the region d(Oi-1···O) < 3.00 Å. This produces an allowed region (blue in Fig. 8DGo) that encompasses the allowed N1 residue Ramachandran plot (red in Fig. 8DGo). If we also eliminate the region with local steric clashes (black in Fig. 8DGo), then we obtain the constrained region corresponding to the N1 residue.

In the carboxyl terminus, the C1 residue sits between the COC5···HNC1 and COC4···HNCc H-bonds. Forming these two H-bonds will minimize the HC1···HCc distance. Hence, from the contour plot of d(H···Hi+1) versus {varphi}{xi} (see Fig. 6CGo), we extract the region d(H···Hi+1) < 3.00 Å. This produces an allowed region (blue outline in Fig. 8CGo) that encompasses the allowed region of C1 (red in Fig. 8CGo). However, unlike the N1 residue, the local steric clashes in the C1 residue (black in Fig. 8CGo) do not eliminate any part of the {alpha}R-region, resulting in the larger C1 Ramachandran plot.

Conclusion
Interactions that determine the Ramachandran plot
We have analyzed the statistical distributions of the protein backbone and find that certain 1–4 interactions in the standard steric map can be ignored (N···Hi+1, Oi-1···C, and Cß···Hi+1). This allows us to identify a revised steric map (Cß···O, Oi-1···Ni+1, Cß···Ni+1, Oi-1···Cß, and Oi-1···O) that matches the observed Ramachandran plot better than the standard steric map (see Fig. 5AGo). We also find that the rare, but allowed, outlier region in the Lovell et al. (2003) study can be defined as the regions that are only restricted by a single steric clash. In the strictly forbidden regions, the backbone geometry brings more than one pair of atoms into a steric clash. Our analysis follows the hard-sphere model pioneered by Ramachandran et al. (1963) and supports the view of Baldwin and Rose (1999) that, to quote Richards (1977), ". . . the use of the hard-sphere model has a venerable history and an enviable record in explaining a variety of different observable properties." For simple models of the protein, the revised steric map represents an efficient way to improve the match with the data. Furthermore, the revised steric map consists of steric clashes between heavy atoms, which should be useful for models that ignore H atoms. Indeed, we find that the H. . .Hi+1 steric clash in the standard steric map (see Fig. 1BGo) has no significant effects on the revised Ramachandran plot (see Fig. 5BGo).

However, other features of the Ramachandran plot must be explained in terms of electrostatic interatomic interactions. The ß-strand region corresponds to conformations where the CO and NH dipoles are aligned, which optimizes the dipole–dipole interaction (yellow region in Fig. 5BGo). The diagonal shape of the {alpha}R and {alpha}L regions depends on the optimization of the N···Hi+1 and Oi-1···C interactions (red region in Fig. 5BGo). The N···Hi+1 and Oi-1···C interactions are also found to have no steric effect on the statistical {varphi}{xi} distributions. Although these electrostatic interactions should only be viewed as useful approximations, we can use these results to understand the QM calculation (Hu et al. 2003). The effect of applying QM is to induce a strong N···Hi+1 and Oi-1···C attraction that neutralizes the hard-sphere repulsion. Consequently, diagonal {alpha}R and {alpha}L regions are induced.

Along the {alpha}-helix
We have also shown that the variation in the Ramachandran plots along the {alpha}-helix is induced by backbone H-bonding constraints. This severely restricts the residues in the middle and amino terminus of the {alpha}-helix but not in the carboxyl terminus. The larger size of the Ramachandran plot in C1 (Fig. 8CGo) compared to N1 (Fig. 8DGo) can be interpreted as a larger backbone entropy in the carboxyl terminus than in the amino terminus. This would make the carboxyl terminus more flexible than the amino terminus, which has been experimentally observed (Miick et al. 1993). In simulations of the folding of {alpha}-helices, H-bond formation proceeds faster in the N to C direction than in the opposite C to N direction (Sung 1994; Voegler-Smith and Hall 2001). Because the backbone entropy of the carboxyl terminus is larger, the change in free-energy required to form the carboxyl terminus [{Delta}G = {Delta}HH-bond - T(Scoil - Shelix)] is smaller, and hence it is more probable for the {alpha}-helix to form in the N to C direction. Other simulations find that {alpha}-helix unfolding proceeds faster in the opposite C to N direction (Soman et al. 1991). The smaller backbone entropy in the amino terminus makes it more likely for H-bonds to break at the amino terminus, which corresponds to unfolding in the C to N direction.


    Acknowledgments
 
B.K.H. was supported by a postdoctoral grant from the Fonds National de la Recherche Scientifique (FNRS), Belgium. R.B. is director of research of FNRS. A.T. is director of research of INSERM.

The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.


    References
 TOP
 Abstract
 Introduction
 Materials and methods
 Results and Discussion
 References
 
Aurora, R. and Rose, G.D. 1998. Helix capping. Protein Sci. 7: 21–38.[Abstract]

Baldwin, R.L. and Rose, G.D. 1999. Is protein folding hierarchic? I. Local structure and peptide folding. Trends Biochem. Sci. 24: 26–33.[CrossRef][Medline]

Bernstein, F.C., Koetzle, T.F., Williams, G.J.B., Meyer Jr., E.E., Brice, M.D., Rodgers, J.R., Kennard, O., Shimanouchi, T., and Tasumi, M. 1977. The Protein Data Bank: A computer-based archival file for macromolecular structures. J. Mol. Biol. 112: 535–542.[Medline]

Chakrabarti, P. and Pal, D. 2001. The interrelationships of side-chain and main-chain conformations in proteins. Prog. Biophys. Mol. Biol. 76: 1–102.[CrossRef][Medline]

Chou, P.Y. and Fasman, G.D. 1974. Conformational parameters for amino acids in helical, ß-sheet, and random coil regions calculated from proteins. Biochemistry 13: 222–245.[CrossRef][Medline]

Engh, R. and Huber, R. 1991. Accurate and angle parameters for x-ray protein structure refinement. Acta Crystallogr. A 47: 392.[CrossRef]

Garnier, J. and Robson, B. 1990. The GOR method for predicting