|
|
||||||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
-helix cannot be followed by a contiguous
-strand
T.C. Jenkins Department of Biophysics, Johns Hopkins University, Baltimore, Maryland 21218, USA
Reprint requests to: George D. Rose, T.C. Jenkins Department of Biophysics, Johns Hopkins University, 3400 N. Charles St., Baltimore, MD 21218, USA; e-mail: grose{at}jhu.edu; fax: (410) 516-4118.
(RECEIVED October 31, 2003; FINAL REVISION December 2, 2003; ACCEPTED December 3, 2003)
| Abstract |
|---|
|
|
|---|
-helical segment followed by a
-strand segment must be connected by an intervening linker. This restriction was validated both by simulations and by seeking violations within proteins of known structure. In fact, no violations were found within an extensive database of high-resolution X-ray structures. Natures exclusion of
-
hybrid segments, fashioned from an
-helix adjoined to a
-strand, is built into proteins at the covalent level. This straightforward conformational constraint has far-reaching consequences in organizing unfolded proteins and limiting the number of possible protein domains. Keywords: protein folding; unfolded protein; protein secondary structure; Ramachandran plot
Article published online ahead of print. Article and publication date are at http://www.proteinscience.org/cgi/doi/10.1110/ps.03503304.
| Introduction |
|---|
|
|
|---|
"For chemically bonded atoms the distribution is not spherically symmetric nor are the properties of such at oms isotropic. In spite of all this, the use of the hard sphere model has a venerable history and an enviable record in explaining a variety of different observable properties (Richards 1977)."
Arguably, the most important application of the hard sphere model in biochemistry is the now famous
,
-plot for a dipeptide, developed by Sasisekharan, Ramakrishnan, and Ramachandran (Ramachandran et al. 1963; Ramachandran and Sasisekharan 1968). This simple idea has been applied to nucleic acids as well (Murthy et al. 1999). In proteins, the hard sphere model identifies two major populated regions for an alanine dipeptide; backbone dihedral angles in these regions resemble those of an
-helix or a
-strand. Despite their remarkable structural diversity, protein molecules have mainchain conformations that lie almost entirely within these two regions.
In this paper, we explore additional steric constraints on polypeptide chains beyond the dipeptide level. We find that an
-helix cannot be followed by a
-strand without an intervening linker. This restriction is a consequence of unavoidable collisions between backbone atoms, and it derives experimental support from the paucity of exceptions among high-resolution protein structures (Berman et al. 2000).
We use several conventions for describing regions of the
,
-map. Figure 1
shows a
,
-distribution of
-helix (green),
-strand (red), and polyproline II helix (blue), determined from structures in the Protein Data Bank (PDB; Berman et al. 2000); the more populated the region, the darker the color. Throughout this paper,
refers to the region given by -135.0°
-105.0° and 120.0°
150.0°, and PII refers to -80.0°
-55.0° and 130.0°
155.0°. We define both a relaxed helical region
', where -75.0°
-45.0° and -60.0°
-30.0°, and a strict helical region
, as a circle of radius 7.0° centered about
= -63.0° and
= -45.0°. Finally,
represents the entirety of sterically accessible
,
-space for the alanine dipeptide.
|
| An experimental observation raises a question |
|---|
|
|
|---|
-helix followed by a
-strand are rarely observed in the PDB. Instead, helices and strands are interconnected by a transition regiona turn, a loop, or some other linker. Only seven occurrences of three or more
' residues followed by a single
residue were found in a representative set of PDB structures (Hobohm and Sander 1994), but a pattern consisting of three or more
' residues followed by a non-
' residue was detected 37,563 times in this data set. Why is the direct transition from
to
so rare? | Materials and methods |
|---|
|
|
|---|
,
-distributions by subdividing
,
-space into a 5° by 5° grid; each of the 72 by 72 grid squares corresponds to a bin. These bins were ranked according to the number of
,
-pairs they contain and then grouped into three categories representing the top 33%, 66%, and 90% of the data. The three groups are plotted in Figure 1
Mining the PDB
In all cases, the chains selected from the PDB were that subset of PDBSelect (Hobohm and Sander 1994) structures determined by X-ray diffraction. In all, 1455 chains at the 25% aligned sequence identity level and 5378 chains at the 90% level were included.
Idealized secondary structures
Secondary structure was assigned using PROSS (Srinivasan and Rose 1999), a method based solely on
,
-angles. Unlike the more familiar DSSP (Kabsch and Sander 1983), PROSS does not include hydrogen bonding in its assignment criteria. Distributions of
,
-values were grouped into three secondary structure categories:
-helix,
-strand, and polyproline II (PII; Fig. 1A
). The helical region was further subdivided into 1° grid squares (Fig. 1B
). Idealized ranges for
,
, and PII were then defined, guided by those bins that represent the top 33% of the data. The
' region is a relaxed definition of
, similar in size to
and PII. These definitions agree well with textbook classifications of secondary structure (Creighton 1984).
Simulations
Monte Carlo simulations of polyalanyl peptides were performed to determine how steric factors influence chain conformation. Polyalanine was chosen as a model for the peptide backbone. Simulated peptides had lengths ranging from 9 to 12 residues. Hydrogen atoms were not included. For each simulation, the
,
-distributions of all sterically allowed conformers were collected so as to accumulate up to 5000 clash-free structures from a maximum of five million attempts.
Generation of sterically allowed structures was accomplished by sampling backbone torsion angles at random from
,
, PII, or
, as appropriate.
-Torsions were varied at random in the range [-175.0°, -185.0°] and assigned in conjunction with backbone torsions. Each attempt was checked for collisions; if none were found, the conformer was accepted and its torsion angles were retained. Otherwise, the conformer was rejected. Rejected structures with a single atomic collision occur at the boundaries between regions; these were cataloged by
,
-angle and collision type for use in assembling a collision map. Structures with multiple atomic collisions are not localized at boundary regions and were ignored.
Hard sphere atomic radii from Word et al. (1999) are among the most conservative in the literature and were adopted for this study (Table 1
). These radii were further scaled by a factor of 0.95, ensuring that the observed collisions are not methodological artifacts. The overall robustness of our results was tested extensively by determining the degree to which steric restrictions persist as radii diminish (described below). The collisions identified in this study do not include those with hydrogen atoms. Inclusion of hydrogens would have enlarged the effective radii and, consequently, imposed further restrictions on available conformational space.
|
| Results |
|---|
|
|
|---|
-helical peptide:
4-
-
4
-residue (the guest) in the middle of an eight-residue polyalanyl peptide that was constrained to be helical (the host). In every case, sterically disallowed patterns identified in simulations were validated against X-ray-elucidated structures by searching for exceptions.
The resulting distribution of the guest residue (Fig. 2
) plots 5000 allowed structures (from 418,213 attempts). Both raw (Fig. 2A
) and binned (Fig. 2B
) data exhibit a Y-shaped plot for the guest residue. A comparison of the two figures (Fig. 2A,B
) shows that the binning method captures the distribution successfully. Notably, the Y-shape encompasses
-helix, but both
-strand and PII are excluded. This conclusion is highlighted in Figure 2B
by superimposing the 66% contour for
-strand from Figure 1A
on the binned simulation data. In short, a single
-guest residue cannot avoid a steric collision in an
-helical host.
|
-residue (Oi-3). The distribution of points for this collision (Fig. 3
|
i+3 (in red) which is responsible for exclusion of the PII region. These two atoms are brought into juxtaposition when the guest
-residue, at i, samples the relevant region in
,
-space.
Two different methods were used to test the robustness of these results. First, the atomic radii were reduced well beyond any plausible van der Waals limit by successively decrementing the scaling factor from 0.95 to 0.92, 0.90, 0.88, and 0.85 (Fig. 4AD
). Reduction of the atomic radii results in expansion of the Y-shaped boundaries of the guest residue. However, substantial strand exclusion survives even the most extreme reduction. Similar behavior was also observed when the
-region was expanded to a radius of 14° or 21° (Fig. 4E,F
). Again, the Y-shaped boundary expands, yet persists. Thus, steric exclusion of a
-residue in a sequence of
-residues is a robust finding, not an artifact of our helix definition or hard sphere radii.
|
were eliminated. Simulations were then performed as before, except that the definition of
was varied for each helical residue, using a radius of 7.0° centered about its experimentally determined
,
-value:
6-
-
5. With a radial scaling factor of 0.95, all but three helices were found to be sterically incompatible with
-values for the central residue. The
-region was largely, but not entirely, excluded in these three exceptions as well (Fig. 5
,
-values of flanking residues were well outside the high-confidence
-region (Fig. 1B
|
|
-residue cannot follow three or more consecutive
-residues, a testable hypothesis using the PDB. In the list of 5378 chains with sequence identity of 90% or less, a series of three or more
-residues was found 19,062 times; none was followed by a
-residue. However, as mentioned earlier, seven exceptions were found when
' is used instead of
(Table 3
-torsions. The final four cases may be legitimate, albeit marginal, exceptions. One of the four cases involves a single
-residue that follows the
'-residues; the other three cases involve type III turns and do not represent an intermixing of helix and strand.
|
-helix to
-strand is disallowed: A single
-residue adjoined to a helical peptide results in a steric collision (Fig. 6
-residue. Therefore, an N-terminal to C-terminal transition from helix to strand must pass through at least one "buffer" residue from the turn region. This finding rationalizes the familiar observation that many
-helices terminate in a 310 helix (Richardson 1981), a progression that both satisfies helix capping requirements (Aurora and Rose 1998) and facilitates the transition from helix to strand, turn, or loop.
|
| Discussion |
|---|
|
|
|---|
-CO-NH-C
HR-CO-NH-C
, which, has two degrees of backbone freedom, like a dipeptide). Similar ideas about the importance of sterics as an organizing force in proteins were also implicit in space-filling models (Koltun 1965), developed during this same era. Such ideas have been validated repeatedly in proteins (Berman et al. 2000) and are now invoked routinely when assessing the quality of experimentally determined structures (Laskowski et al. 1993). Today, the restrictions that sterics impose on the conformation of a dipeptide are widely accepted. Yet, hard sphere models have played a comparatively small role in both protein structure prediction and analysis of the unfolded state. Why?
The perceived problem is one of scale. If each
,
-pair is independent of its neighbors (Flory 1969), then conformational space grows exponentially, despite dipeptide restrictions. Accordingly, the conformations accessible to a peptide backboneeven a short onecan quickly overwhelm constraints imposed by dipeptide sterics. This view is often invoked by alluding to the "Levinthal paradox" (Levinthal 1969): How does a protein find its unique native conformation among the more-than-astronomical number of conformational possibilities? For Levinthal, this conundrum was a demonstration, not a paradox, indicating that additional conformational constraints must exist. But what additional constraints might have been overlooked in this well cultivated field?
Earlier work using explicit counting showed that the size of conformational space is smaller than previously believed (Pappu et al. 2000) because local steric interactions exert influence beyond the dipeptide, winnowing the number of accessible conformations. Here, we focused specifically on steric restrictions in the
-helix.
Unfolded proteins
It has been proposed that the coil librarydefined as the set of all nonhelical, nonstrand structures in the PDBcan be used to model the unfolded state of proteins (Swindells et al. 1995; Avbelj and Baldwin 2003). Therefore, our steric rules, which were validated against the PDB, including the coil library, would also hold for this model of the unfolded state. Plausibly so, because van der Waals repulsive forces will be unaffected by whether or not the protein is folded or unfolded.
Repulsive forces can have an organizing influence on the folding reaction, N(ative)
U(nfolded), and related order-disorder transitions, such as helix-coil theory (Zimm and Bragg 1959; Van Holde et al. 1998), where each residue is characterized by initiation and propagation constants. For example, any "coil" conformation with
,
-angles in the
-region would exert a cooperative influence on the helix-coil equilibrium, making it harder to initiate a helix from the coil state by constricting the size of conformational space accessible to a residue that follows a helix nucleation site. Conversely, once nucleated it would also be harder to melt a helix, because the helical conformer would inhibit introduction of a central coil residue with
,
-angles in either the
- or PII-regions.
The number of protein domains
Our analysis of short polyalanyl chains demonstrates that a
-conformer cannot be introduced into an
-helix without an accompanying steric clash. This restriction maintains the structural homogeneity of
-helices by excluding heterogeneous conformers consisting of a turn of
-helix followed by one or more
-residues. Exclusion of folds in which there is an immediate transition from helix to strand eliminates many conceivable protein domains.
In particular, when a protein folds, backbone polar groups removed from solvent will participate in compensatory intramolecular hydrogen bonds. To do so, they form segments of
-helix or strands of
-sheet, the only regular, repeating hydrogen-bonded protein structures that are sterically available (Aurora et al. 1997). Proteins are largely supramolecular complexes of helices and strands (Levitt and Chothia 1976), and their intramolecular recognition and self-assembly are facilitated by the sterically imposed elimination of
-
hybrids.
| Acknowledgments |
|---|
The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.
| References |
|---|
|
|
|---|
Aurora, R., Creamer, T.P., Srinivasan, R., and Rose, G.D. 1997. Local interactions in protein folding: Lessons from the
-helix. J. Biol. Chem. 272: 14131416.
Avbelj, F. and Baldwin, R.L. 2003. Role of backbone solvation and electrostatics in generating preferred peptide backbone conformations: Distributions of
. Proc. Natl. Acad. Sci. 100: 57425747.
Berman, H.M., Westbrook, J., Feng, Z., Gilliland, G., Bhat, T.N., Weissig, H., Shindyalov, I.N., and Bourne, P.E. 2000. The Protein Data Bank. Nucleic Acids Res. 28: 235242.
Creighton, T.E. 1984. Proteins: Structures and molecular principles, p. 515. H. Freeman, New York.
Eriksson, A.E., Baase, W.A., Zhang, X.J., Heinz, D.W., Blaber, M., Baldwin, E.P., and Matthews, B.W. 1992. Response of a protein structure to cavity-creating mutations and its relation to the hydrophobic effect. Science 255: 178183.
Flory, P.J. 1969. Statistical mechanics of chain molecules, pp. 250255. Wiley, New York.
Hobohm, U. and Sander, C. 1994. Enlarged representative set of protein structures. Protein Sci. 3: 522524.[Abstract]
Kabsch, W. and Sander, C. 1983. Dictionary of protein secondary structure: Pattern recognition of hydrogen-bonded and geometrical features. Biopolymers 22: 25772637.[CrossRef][Medline]
Koltun, W.L. 1965. Precision space-filling atomic models. Biopolymers 3: 665679.[CrossRef][Medline]
Laskowski, R.A., MacArthur, M.W., Moss, D.S., and Thornton, J.M. 1993. PROCHECK: A program to check the stereochemical quality of protein structures. J. Appl. Cryst. 26: 283291.
Lee, B. and Richards, F.M. 1971. The interpretation of protein structures: Estimation of static accessibility. J. Mol. Biol. 55: 379400.[CrossRef][Medline]
Levinthal, C. 1969. In Mössbauer spectroscopy in biological systems (eds. P. Debrunner et al.), pp. 2224. University of Illinois Press, Urbana, IL.
Levitt, M. and Chothia, C. 1976. Structural patterns in globular proteins. Nature 261: 552558.[CrossRef][Medline]
Murthy, V.L., Srinivasan, R., Draper, D.E., and Rose, G.D. 1999. A complete conformational map for RNA. J. Mol. Biol. 291: 313327.[CrossRef][Medline]
Pappu, R.V., Srinivasan, R., and Rose, G.D. 2000. The flory isolated-pair hypothesis is not valid for polypeptide chains: Implications for protein folding. Proc. Natl. Acad. Sci. 97: 1256512570.
Ramachandran, G.N. and Sasisekharan, V. 1968. Conformation of polypeptides and proteins. Adv. Prot. Chem. 23: 283438.[Medline]
Ramachandran, G.N., Ramakrishnan, C., and Sasisekharan, V. 1963. Stereochemistry of polypeptide chain configurations. J. Mol. Biol. 7: 9599.[Medline]
Richards, F.M. 1977. Areas, volumes, packing, and protein structure. Ann. Rev. Biophys. Bioeng. 6: 151176.[CrossRef][Medline]
. 1979. Packing defects, cavities, volume fluctuations, and access to the interior of proteins. Including some general comments on surface area and protein structure. Carlsberg Res. Commun. 44: 4763.
Richardson, J.S. 1981. The anatomy and taxonomy of protein structure. Adv. Prot. Chem. 34: 168340.
Srinivasan, R. and Rose, G.D. 1999. A physical basis for protein secondary structure. Proc. Natl. Acad. Sci. 96: 1425814263.
Swindells, M.B., MacArthur, M.W., and Thornton, J.M. 1995. Intrinsic
,
propensities of amino acids, derived from the coil regions of known structures. Nat. Struct. Biol. 2: 596603.[CrossRef][Medline]
Van Holde, K.E., Johnson, W.C., and Ho, P.S. 1998. Principles of physical biochemistry, pp. 156162. Prentice Hall, Upper Saddle River, NJ.
Word, J.M., Lovell, S.C., LaBean, T.H., Taylor, H.C., Zalis, M.E., Presley, B.K., Richardson, J.S., and Richardson, D.C. 1999. Visualizing and quantifying molecular goodness-of-fit: Small-probe contact dots with explicit hydrogen atoms. J. Mol. Biol. 285: 17111733.[CrossRef][Medline]
Zimm, B.H. and Bragg, J.K. 1959. Theory of the phase transition between helix and random coil in polypeptide chains. J. Chem. Phys. 31: 526535.[CrossRef]
![]()
CiteULike
Connotea
Del.icio.us
Digg
Reddit
Technorati What's this?
This article has been cited by other articles:
![]() |
L. L. Perskie, T. O. Street, and G. D. Rose Structures, basins, and energies: A deconstruction of the Protein Coil Library Protein Sci., July 1, 2008; 17(7): 1151 - 1161. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Gong and G. D. Rose Assessing the solvent-dependent surface area of unfolded proteins using an ensemble model PNAS, March 4, 2008; 105(9): 3321 - 3326. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. O. Street, N. C. Fitzkee, L. L. Perskie, and G. D. Rose Physical-chemical determinants of turn conformations in globular proteins Protein Sci., August 1, 2007; 16(8): 1720 - 1727. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. D. Rose, P. J. Fleming, J. R. Banavar, and A. Maritan A backbone-based theory of protein folding PNAS, November 7, 2006; 103(45): 16623 - 16633. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Gong, P. J. Fleming, and G. D. Rose From The Cover: Building native protein conformation from highly approximate backbone torsion angles PNAS, November 8, 2005; 102(45): 16227 - 16232. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. V. Kuzmenkina, C. D. Heyes, and G. U. Nienhaus Single-molecule Forster resonance energy transfer study of protein dynamics under denaturing conditions PNAS, October 25, 2005; 102(43): 15471 - 15476. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. C. Fitzkee and G. D. Rose Reassessing random-coil statistics in unfolded proteins PNAS, August 24, 2004; 101(34): 12497 - 12502. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |