|
|
||||||||
T.C. Jenkins Department of Biophysics, Johns Hopkins University, Baltimore, Maryland 21218, USA
(RECEIVED March 23, 2007; FINAL REVISION May 13, 2007; ACCEPTED May 14, 2007)
| Abstract |
|---|
|
|
|---|
-helices and
-strands, interconnected by reverse turns and longer loops. Most short turns can be classified readily into a limited repertoire of discrete backbone conformations, but the physical–chemical determinants of these distinct conformational basins remain an open question. We investigated this question by exhaustive analysis of all backbone conformations accessible to short chain segments bracketed by either an
-helix or a
-strand (i.e.,
-segment-
,
-segment-
,
-segment-
, and
-segment-
) in a nine-state model. We find that each of these four secondary structure environments imposes its own unique steric and hydrogen-bonding constraints on the intervening segment, resulting in a limited repertoire of conformations. In greater detail, an exhaustive set of conformations was generated for short backbone segments having reverse-turn chain topology and bracketed between elements of secondary structure. This set was filtered, and only clash-free, hydrogen-bond–satisfied conformers having reverse-turn topology were retained. The filtered set includes authentic turn conformations, observed in proteins of known structure, but little else. In particular, over 99% of the alternative conformations failed to satisfy at least one criterion and were excluded from the filtered set. Furthermore, almost all of the remaining alternative conformations have close tolerances that would be too tight to accommodate side chains longer than a single
-carbon. These results provide a molecular explanation for the observation that reverse turns between elements of regular secondary can be classified into a small number of discrete conformations. Keywords: protein structure/folding; computational analysis of protein structure; reverse turns
| Introduction |
|---|
|
|
|---|
-helices and
-strands (Levitt and Chothia 1976). These iso-directional elements are interconnected by short turns and longer loops (Rose and Seltzer 1977) that usually reverse the overall chain direction (Rose et al. 1985). The backbone torsion angles (
,
angles) of most turn conformations are found to lie within a limited set of discrete basins, and accordingly work in this area has focused largely on
,
-based turn classification (Chou and Fasman 1977; Richardson 1981; Rose et al. 1985; Sibanda and Thornton 1991; Ring et al. 1992; Efimov 1993; Donate et al. 1996; Wojcik et al. 1999; Engel and DeGrado 2005; Lahr et al. 2005). The very fact that turns are compatible with such classification prompted us to investigate the physical-chemical origin of this phenomenon.
Our analysis focuses on short turns bracketed between
-helices or
-strands, resulting in four distinct microenvironments:
-turn-
,
-turn-
,
-turn-
, and
-turn-
. Classic
-turns are four residues long (i to i + 3), but only the central residues (i + 1, i + 2) affect turn conformation (Venkatachalam 1968). Upon analyzing the central residues in three-, four-, and five-residue turns, we find that their backbone conformations are particular to each of the four microenvironments. This realization suggests that the limited repertoire of backbone conformations observed in turns is conditioned by local restrictions that are secondary-structure specific.
Earlier studies emphasized the paramount importance of sterics and hydrogen bonding in restricting accessible conformational space (Srinivasan and Rose 1999; Fitzkee and Rose 2004, 2005). If these same factors are the primary determinants of turn conformations, then authentic turns should be clash-free and hydrogen-bond–satisfied; conversely, other conceivable backbone conformations should fail at least one of these criteria.
We tested this proposition using simulations: Conformations of short-chain segments embedded in the four microenvironments were sampled exhaustively and then filtered for clash-free, hydrogen-bond–satisfied structures having reverse-turn chain topology. Survivors include authentic turns observed in solved protein structures, but almost all other conformers (>99%) are filtered away, and most remainders can be rationalized. These results confirm the hypothesis that the conformational biases of short turns between elements of repetitive secondary structure are established via local restrictions imposed by sterics and hydrogen bonding.
| Results |
|---|
|
|
|---|
-helix or
-strand) having crossing angles between 110° and 180° (Fig. 1). All segments satisfying these criteria were culled from the Protein Coil Library (http://www.roselab.jhu.edu/) (Fitzkee et al. 2005b), a repository of contiguous protein fragments that are neither
-helix nor
-strand, excised from a large data set of solved, nonredundant structures (Wang and Dunbrack Jr. 2003). Coil-library fragments are typically short (Fig. 1), and a substantial fraction of the one-, two-, and three-residue fragments correspond to the central residues in turns.
|
-turn-
,
-turn-
,
-turn-
, and
-turn-
), and each class was further subdivided into turns having one, two, or three central residues. For each of these 12 categories, the observed backbone torsion angles (
,
angles) were plotted as contour maps (Fig. 2).
|
-turns (e.g., between
-strands; Fig. 2K) and many noncanonical turns. Representative
,
values for all 28 turn types are listed in Table 1. While some observed structures are not included within the distinct conformational clusters highlighted in Figure 2, these outliers are sparsely distributed across the Ramachandran map (Supplemental Fig. 1).
|
,
values for turns within their observed secondary-structure microenvironment (Table 1) were tested against each of the other three microenvironments. Each cross-matched case gave rise to violations of (1) turn-like chain topology (i.e., crossing angles between the bracketing secondary structure elements between 110° and 180°), (2) clash-free sterics, and/or (3) hydrogen-bond satisfaction. Illustrative examples of these three types of cross-matched violations are given next.
atoms (Fig. 3B, right).
atom and cannot form a hydrogen bond to either the protein or the solvent (Fig. 3C, right).
|
Authentic turn conformations satisfy steric, hydrogen-bonding, and topological restrictions
All 28 authentic turn types were assessed in simulations by varying the backbone torsion angles of turn residues by 10° from their representative
,
values (Table 1), with bracketing secondary structures held fixed as rigid units. Chain topology was assessed by the mean crossing angle, averaged over the ensemble of simulated structures. Steric restrictions and hydrogen-bond satisfaction were evaluated using the acceptance ratio from these simulations. An acceptance ratio of 1.0 indicates that all simulated structures are clash-free and fully hydrogen bonded. Indeed, all simulated structures were found to have high acceptance ratios (>0.8) with crossing angles that effectively reversed the overall chain direction (>120°; Fig. 4A).
|
,
map (Hovmøller et al. 2002); each state is a circular region of radius 10°. Identical ranges (i.e., a 10° radius around a central value) were used for both alternative and authentic turns to normalize comparison of their resulting acceptance ratios. Using these nine states, conformations between pairs of secondary-structure elements were sampled exhaustively for one-, two-, and three-residue fragments (with 9, 92, and 93 states, respectively). Crossing angles and acceptance ratios were then calculated from the simulated ensembles after first eliminating any authentic turns (see details in Materials and Methods).
|
It is apparent that almost all alternative conformations have high bin numbers, corresponding to low crossing angles and/or low acceptance ratios (shown as unfilled histogram bars in Fig. 4B). Only 14 of the 2215 total alternative conformations fall into bin 1 or 2, i.e., more than 99% are outside the observed variation of authentic turns. Most of these 14 false positives have C
atoms that are too close to other atoms for all but an alanine or a glycine residue (Fig. 5; Table 2), and their acceptance ratios would have been reduced significantly had bulky side chains been included.
|
|
| Discussion |
|---|
|
|
|---|
-helix (Pauling et al. 1951) and
-sheet (Pauling and Corey 1951)—were based solely on local hydrogen-bond satisfaction, sterics, and peptide bond geometry. Later, Venkatachalam predicted the existence of
-turn structures using identical principles (Venkatachalam 1968). Together, these several categories account for approximately three-quarters of protein structure, on average (Fitzkee et al. 2005a,b). The remaining quarter includes other recognizable categories, such as the one-, two-, and three-residue turns described in Figure 2. These turns have greater conformational heterogeneity than helix and sheet and might therefore be expected to involve additional determinants, more than just local hydrogen-bond satisfaction, sterics, and chain geometry. But our results suggest otherwise.
Reverse turns include both strict
-turns (Venkatachalam 1968) and more moderate bends (Rose et al. 1985) and loops (Leszczynski and Rose 1986). The peptide main chain changes its overall direction at these reverse-turn sites, and their abundance is the reason why globular proteins are, in fact, globular. Much, but not all, of the earlier literature has focused on strict
-turns that link strands of
-sheet (Richardson 1985; Wilmot and Thornton 1988; Sibanda and Thornton 1991; Gunasekaran et al. 1997; Ramirez-Alvarado et al. 1997; Chung et al. 1998). Efimov's modeling studies (Efimov 1993, 1994) encompass a more general definition of turns, including those at helix ends (Brunet et al. 1993). Helices often terminate in recognizable capping motifs (Aurora and Rose 1998) that both stabilize the helix and launch the main chain in a different direction, and consequently helix caps (Presta and Rose 1988; Richardson and Richardson 1988) can be an integral component of helix-adjacent turns (Lahr et al. 2005). Recently, Engel and DeGrado (2005) undertook a comprehensive analysis of links in helical hairpins and their populations (Fig. 6 and Table 4 in Engel and DeGrado 2005) resemble ours (Fig. 2; Table 1) (their analysis extends to nine-residue links, whereas ours is limited to three residues). Shortle has pointed out that the densely populated regions in
,
space can be subdivided into discrete bins (Shortle 2003), consistent with subpopulations that are favored by the local structural environment. However, unlike most of these earlier studies, our present focus is on the molecular origin of turn conformations and on why only a few discrete populations are observed.
The physical-chemical determinants analyzed here are completely general in that they are properties of the protein backbone and therefore applicable to all residues (Rose et al. 2006). These determinants are also extensive. Even for a dipeptide, steric considerations limit accessible conformational space (Ramachandran and Sasisekharan 1968), and additional limitations are imposed in these congested turn conformations, where the chain folds back on itself (Fitzkee and Rose 2004). Further, polar groups must find either intra- or intermolecular hydrogen-bond partners because the energetic cost of an unsatisfied backbone polar group is high (Baldwin 2003). Accordingly, almost all backbone hydrogen bonds in X-ray structures of folded proteins are either satisfied (Fleming and Rose 2005) or can be minimized into satisfied conformations without substantial structural changes (Panasik et al. 2005).
Contouring the data, as in Figure 2, reveals the major populations by suppressing outliers. It is possible that such outliers are instances of rare but authentic turns, e.g., strained conformations (Herzberg and Moult 1991; Hodel et al. 1994). Upon excluding those false positives with obvious structural anomalies, four conformations with crossing angles and acceptance ratios resembling those of authentic turns remain in Table 2; these conformers are not highly populated in proteins. Another possibility is that some or all of the outliers are spurious conformations that legitimately belong to one of the major populations listed in Table 1, akin to the near-turns described in Panasik Jr. et al. (2005). Clearly, further work is needed to distinguish between these possibilities. Other interesting details are also beyond the scope of this current study, such as the differences in basin sizes and populations in Figure 2, and here, too, further analysis is needed, taking into account factors that were neglected initially, such as secondary-structure packing (Chothia et al. 1981) and electrostatic effects (Dasgupta et al. 2004). These issues notwithstanding, it is compelling that Pauling's three simple criteria—steric exclusion, hydrogen-bond satisfaction, and chain geometry—can largely account for the conformational diversity observed in short turns that interconnect elements of protein secondary structure.
| Materials and Methods |
|---|
|
|
|---|
-helix nor
-strand were extracted from the Protein Coil Library (Fitzkee et al. 2005b) and sorted by length. These fragments were derived from the PISCES list (Wang and Dunbrack Jr. 2003) of structures with resolution of 2.0 Å or better and refinement factors of 0.25 or better. The list was supplemented by one-residue fragments, which are not included in the Protein Coil Library (http://www.roselab.jhu.edu/coil). Care was taken to avoid double-counting longer fragments as multiple shorter fragments.
In the context of their host proteins, the one-, two-, and three-residue fragments used in this study are flanked by secondary-structure elements of differing lengths. To control for length variability when computing crossing angles, all secondary-structure elements were set to be eight residues in length. Specifically, the two adjoining residues at either end of a given fragment were extended by an additional six residues; canonical backbone torsion angles were assigned to these six-residue extensions (
,
= –62°, –42° for
-helix or
,
= –120°, 135° for
-strand). Crossing angles were computed by taking the principal moments of inertia of each secondary structure element, calculated from its C
coordinates; the moment with the smallest eigenvalue is the vector of least-squares best-fit to the long axis of that element (Rose and Seltzer 1977). The crossing angle between two elements was then computed as the scalar angle between their two best-fit vectors, each pointed in the N- to C-terminal direction to avoid sign ambiguities.
In simulations, the backbone torsions of turn residues were varied at random within 10° of their origin, with bracketing secondary-structure elements held rigid. Exceptions were made in the case of some one-residue fragments where clear conformational preferences observed in authentic turns affect the fragment-adjacent residue, as noted in Table 1. All residues were modeled as alanine except those with
> 0°, which were modeled as glycine.
Average crossing angles and acceptance ratios were based on 100 simulated structures. Standard bond lengths, bond angles, atomic radii, and hydrogen-bond criteria were used, as described elsewhere (Fitzkee and Rose 2004, 2005). Atomic radii were scaled to 90% of their standard values to compensate for any simulation artifacts caused by using rigid models of secondary structure (Creamer and Rose 1994).
The
,
origins of the nine states in Scheme 1 are (–160°,160°), (–80°,160°), (–160°,80°), (–80°,80°), (60°,30°), (–90°,0°), (90°,0°), (–60°,–30°), and (60°,–150°). Each state is a circular region of the Ramachandran map within 10° of these central values. Conformations were sampled exhaustively by varying the backbone through all possible combinations of states within every secondary-structure microenvironment (helix–helix, helix–strand, strand–helix, and strand–strand). In each of these four microenvironments, one-, two-, and three-residue fragments can visit 9, 81, and 729 possible states, respectively. However, conformers within 30° of any authentic turn in Figure 2 were disallowed, eliminating 64 conformers. In addition, conformers that simply extend an adjacent secondary-structure element were also disallowed. Specifically, states 1 and 3 were disallowed when immediately adjacent to a
-strand, and state 8 was disallowed when immediately adjacent to an
-helix; 997 additional conformations were eliminated by these criteria.
| Footnotes |
|---|
Reprint requests to: George D. Rose, T.C. Jenkins Department of Biophysics, Johns Hopkins University, 3400 N. Charles Street, Baltimore, MD 21218, USA; e-mail: grose{at}jhu.edu; fax: (410) 516-4118.
Article and publication are at http://www.proteinscience.org/cgi/doi/10.1110/ps.072898507.
| Acknowledgments |
|---|
|
|
|---|
| References |
|---|
|
|
|---|
Baldwin, R.L. 2003. In search of the energetic role of peptide hydrogen bonds. J. Biol. Chem. 278: 17581–17588.
Brunet, A.P., Huang, E.S., Huffine, M.E., Loeb, J.E., Weltman, R.J., and Hecht, M.H. 1993. The role of turns in the structure of an
-helical protein. Nature 364: 355–358.[CrossRef][Medline]
Chothia, C., Levitt, M., and Richardson, D. 1981. Helix to helix packing in proteins. J. Mol. Biol. 145: 215–250.[CrossRef][Medline]
Chou, P.Y. and Fasman, G.D. 1977.
-Turns in proteins. J. Mol. Biol. 115: 135–175.[CrossRef][Medline]
Chung, Y.J., Christianson, L.A., Stanger, H.E., Powell, D.R., and Gellman, S.H. 1998. A
-peptide reverse turn that promotes hairpin formation. J. Am. Chem. Soc. 120: 10555–10556.[CrossRef]
Creamer, T.P. and Rose, G.D. 1994.
-Helix-forming propensities in peptides and proteins. Proteins 19: 85–97.[CrossRef][Medline]
Dasgupta, B., Pal, L., Basu, G., and Chakrabarti, P. 2004. Expanded turn conformations: Characterization and sequence–structure correspondence in
-turns with implications in helix folding. Proteins 55: 305–315.[CrossRef][Medline]
Donate, L.E., Rufino, S.D., Canard, L.H., and Blundell, T.L. 1996. Conformational analysis and clustering of short and medium size loops connecting regular secondary structures: A database for modeling and prediction. Protein Sci. 5: 2600–2616.[Abstract]
Efimov, A.V. 1993. Standard structures in proteins. Prog. Biophys. Mol. Biol. 60: 201–239.[CrossRef][Medline]
Efimov, A.V. 1994. Common structural motifs in small proteins and domains. FEBS Lett. 355: 213–219.[CrossRef][Medline]
Engel, D.E. and DeGrado, W.F. 2005.
–
Linking motifs and interhelical orientations. Proteins 61: 325–337.[CrossRef][Medline]
Fitzkee, N.C. and Rose, G.D. 2004. Steric restrictions in protein folding: An
-helix cannot be followed by a contiguous
-strand. Protein Sci. 13: 633–639.
Fitzkee, N.C. and Rose, G.D. 2005. Sterics and solvation winnow accessible conformational space for unfolded proteins. J. Mol. Biol. 353: 873–887.[CrossRef][Medline]
Fitzkee, N.C., Fleming, P.J., Gong, H., Panasik Jr, N., Street, T.O., and Rose, G.D. 2005a. Are proteins made from a limited parts list? Trends Biochem. Sci. 30: 73–80.[CrossRef][Medline]
Fitzkee, N.C., Fleming, P.J., and Rose, G.D. 2005b. The Protein Coil Library: A structural database of nonhelix, nonstrand fragments derived from the PDB. Proteins 58: 852–854.[CrossRef][Medline]
Fleming, P.J. and Rose, G.D. 2005. Do all backbone polar groups in proteins form hydrogen bonds? Protein Sci. 14: 1911–1917.
Gunasekaran, K., Ramakrishnan, C., and Balaram, P. 1997.
-Hairpins in proteins revisited: Lessons for de novo design. Protein Eng. 10: 1131–1141.
Herzberg, O. and Moult, J. 1991. Analysis of the steric strain in the polypeptide backbone of protein molecules. Proteins 11: 223–229.[CrossRef][Medline]
Hodel, A., Kautz, R.A., Adelman, D.M., and Fox, R.O. 1994. The importance of anchorage in determining a strained protein loop conformation. Protein Sci. 3: 549–556.[Abstract]
Hovmøller, S., Zhou, T., and Ohlson, T. 2002. Conformations of amino acids in proteins. Acta Crystallogr. D Biol. Crystallogr. 58: 768–776.[CrossRef][Medline]
Lahr, S.J., Engel, D.E., Stayrook, S.E., Maglio, O., North, B., Geremia, S., Lombardi, A., and DeGrado, W.F. 2005. Analysis and design of turns in
-helical hairpins. J. Mol. Biol. 346: 1441–1454.[CrossRef][Medline]
Leszczynski, J.F. and Rose, G.D. 1986. Loops in globular proteins: A novel category of secondary structure. Science 234: 849–855.
Levitt, M. and Chothia, C. 1976. Structural patterns in globular proteins. Nature 261: 552–558.[CrossRef][Medline]
Panasik Jr, N., Fleming, P.J., and Rose, G.D. 2005. Hydrogen-bonded turns in proteins: The case for a recount. Protein Sci. 14: 2910–2914.
Pauling, L. and Corey, R.B. 1951. The pleated sheet, a new layer configuration of polypeptide chains. Proc. Natl. Acad. Sci. 37: 251–256.
Pauling, L., Corey, R.B., and Branson, H.R. 1951. The structure of proteins; two hydrogen-bonded helical configurations of the polypeptide chain. Proc. Natl. Acad. Sci. 37: 205–211.
Presta, L.G. and Rose, G.D. 1988. Helix signals in proteins. Science 240: 1632–1641.
Ramachandran, G.N. and Sasisekharan, V. 1968. Conformation of polypeptides and proteins. Adv. Protein Chem. 23: 283–437.[Medline]
Ramirez-Alvarado, M., Blanco, F.J., Niemann, H., and Serrano, L. 1997. Role of
-turn residues in
-hairpin formation and stability in designed peptides. J. Mol. Biol. 273: 898–912.[CrossRef][Medline]
Richardson, J.S. 1981. The anatomy and taxonomy of protein structure. Adv. Protein Chem. 34: 167–339.[Medline]
Richardson, J.S. 1985. A new twist for hairpin turns. Nature 316: 102–103.
Richardson, J.S. and Richardson, D.C. 1988. Amino acid preferences for specific locations at the ends of
-helices. Science 240: 1648–1652.
Ring, C.S., Kneller, D.G., Langridge, R., and Cohen, F.E. 1992. Taxonomy and conformational analysis of loops in proteins. J. Mol. Biol. 224: 685–699.[CrossRef][Medline]
Rose, G.D. and Seltzer, J.P. 1977. A new algorithm for finding the peptide chain turns in a globular protein. J. Mol. Biol. 113: 153–164.[CrossRef][Medline]
Rose, G.D., Gierasch, L.M., and Smith, J.A. 1985. Turns in peptides and proteins. Adv. Protein Chem. 37: 1–109.[Medline]
Rose, G.D., Fleming, P.J., Banavar, J.R., and Maritan, A. 2006. A backbone-based theory of protein folding. Proc. Natl. Acad. Sci. 103: 16623–16633.
Shortle, D. 2003. Propensities, probabilities, and the Boltzmann hypothesis. Protein Sci. 12: 1298–1302.
Sibanda, B.L. and Thornton, J.M. 1991. Conformation of
hairpins in protein structures: Classification and diversity in homologous structures. Methods Enzymol. 202: 59–82.[Medline]
Srinivasan, R. and Rose, G.D. 1999. A physical basis for protein secondary structure. Proc. Natl. Acad. Sci. 96: 14258–14263.
Venkatachalam, C.M. 1968. Stereochemical criteria for polypeptides and proteins. V. Conformation of a system of three linked peptide units. Biopolymers 6: 1425–1436.[CrossRef][Medline]
Wang, G. and Dunbrack Jr, R.L. 2003. PISCES: A protein sequence culling server. Bioinformatics 19: 1589–1591.
Wilmot, C.M. and Thornton, J.M. 1988. Analysis and prediction of the different types of
-turn in proteins. J. Mol. Biol. 203: 221–232.[CrossRef][Medline]
Wojcik, J., Mornon, J.P., and Chomilier, J. 1999. New efficient statistical sequence-dependent structure prediction of short- to medium-sized protein loops based on an exhaustive loop classification. J. Mol. Biol. 289: 1469–1490.[CrossRef][Medline]
![]()
CiteULike
Connotea
Del.icio.us
Digg
Reddit
Technorati What's this?
This article has been cited by other articles:
![]() |
L. L. Perskie, T. O. Street, and G. D. Rose Structures, basins, and energies: A deconstruction of the Protein Coil Library Protein Sci., July 1, 2008; 17(7): 1151 - 1161. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |