|
|
||||||||
Department of Biochemistry and Molecular Biology and Center for Biological Modeling, Michigan State University, East Lansing, Michigan 48824, USA
Reprint requests to: Leslie A. Kuhn, Michigan State University, 502C Biochemistry, East Lansing, MI 48824, USA; e-mail: kuhn{at}agua.bch.msu.edu; fax: 517-353-9334.
(RECEIVED November 11, 2002; FINAL REVISION January 31, 2003; ACCEPTED February 17, 2003)
Article and publication are at http://www.proteinscience.org/cgi/doi/10.1110/ps.0238803.
| Abstract |
|---|
|
|
|---|
-helices and between
-helices and ß-sheets in 1316 diverse, nonredundant protein structures. Finite-length vectors were fit to the
-carbon atoms in each of the helices and strands, and the packing angle between the vectors,
, was determined at the closest point of approach within each helixhelix or helixsheet pair. Helixsheet interactions were found in 391 of the proteins, and the distributions of
values were calculated for all the helixsheet and helixhelix interactions. The packing angle preferences for helixhelix interactions are similar to those previously observed. However, analysis of helixstrand packing preferences uncovered a remarkable tendency for helices to align antiparallel to parallel regions of ß-sheets, independent of the topological constraints or prevalence of ß-
-ß motifs in the proteins. This packing angle preference is significantly diminished in helix interactions involving mixed and antiparallel ß-sheets, suggesting a role for helixsheet dipole alignment in guiding supersecondary structure formation in protein folding. This knowledge of preferred packing angles can be used to guide the engineering of stable protein modules. Keywords: Protein folding; secondary structure packing; folding cores; helix and sheet dipoles; protein design; diffusioncollision model
| Introduction |
|---|
|
|
|---|
-helix packs against several strands of the C-terminal ß-sheet to form an intermediate structure along the folding pathway. Analysis of protein folding for a series of other proteins, by hydrogen-exchange nuclear magnetic resonance spectroscopy (NMR) and computational approaches, indicates that the packing of two secondary structures forms a folding core that builds up to the native structure (Hespenheide et al. 2002; Li and Woodward 1999). Assuming the framework model is one valid scenario for protein folding, we can ask whether secondary structures prefer to adopt specific geometries when they coalesce. In one of the earliest studies on secondary structure packing, Chothia et al. (1981) analyzed 50 helixhelix packing interactions from 10 protein structures. The results led them to propose the "ridges into grooves" model, in which helix pairs adopt specific geometries allowing the side chains to interdigitate. Since that time, advances in computer technology have allowed for not only an invaluable increase in the number of protein crystal structures, but also the development of algorithms that allow the removal of statistical bias in the Protein Data Bank (PDB; Berman et al. 2000) toward protein families with many occurrences. More recent studies have expanded the analysis of helixhelix packing interactions to a dataset of 1776 interactions from 757 protein structures with <30% sequence identity and better than 2.4 Å resolution (Walther et al. 1998).
In studies of secondary structure packing, the secondary structures are often represented by best-fit lines through the C
coordinates of the residues. A sphericalpolar coordinate system is then used to measure packing geometries. For a pair of interacting structures, the geometry can be described by a single dihedral angle, referred to as
, formed by each structure and the line of closest approach between them (Fig. 1
). Observed distributions of
packing angles for helixhelix interactions initially exhibited distinct peaks (Walther et al. 1996). However, Bowie (1997), with further developments by Walther et al. (1998), demonstrated that the expected uniform random distribution of
is actually biased toward angles near 90°. When this geometric bias was taken into account, the observed peaks in the helixhelix
angle distribution were significantly attenuated.
|
angle for a helix packing against a sheet should be near 0°, indicating that only small angles allowed for complementary packing of the helix side chains within the surface created by a ß-sheet. This observation of helixstrand axial alignment was further supported by work published by Cohen et al. (1982) a few years later. A theoretical study in which low energy helixsheet conformations were predicted also agreed that a helixstrand packing
angle near 0° was a favorable interaction (Chou et al. 1985). Their analysis of 163 helixsheet packing interactions observed from proteins of known structure showed a predominant peak near 0°. In all of these studies, the packing angles were measured by approximating inherently twisted ß-sheets as a plane. Also, the
angle was measured in the range -90°
90°; therefore, the N-terminal to C-terminal orientation of the secondary structures was not taken into account.
In this work, further analysis of helixhelix and helixstrand packing interactions is presented. The distribution of helixhelix
angles is found to be very similar to the data presented by Walther et al. (1998). For examining helixsheet interactions, the strands in the sheet are categorized according to five possible orientations, depending on the direction of the strand relative to its neighbors (parallel versus antiparallel). The observed distribution of
packing angles is then presented, with geometric bias taken into account, for each of the five cases. The
packing angle for both helixhelix and helixsheet interactions is measured over the range -180°
180° to observe any correlation between parallel/antiparallel packing and
angle. A unique coordinate transformation is used to measure the
packing angle, so that an inherently twisted ß-sheet is not approximated as a plane. The results indicate a strong preference for a helix to pack antiparallel (with
near ±180°) to a sheet composed of parallel strands. This preference is not dependent on the topological constraints imposed by short loops in ß-
-ß motifs (Sternberg and Thornton 1976), as the
angle distributions do not change significantly when ß-
-ß motifs are excluded from the dataset.
| Results |
|---|
|
|
|---|
packing angles in our dataset is shown in Figure 2
near 0) and antiparallel (
near +/- 180) helixhelix alignments are preferred.
|
packing angle distribution as a function of strand orientation
packing angles for each strand orientation are presented in Figure 3A
values. A value of 1.0 indicates that the range of observed
angles occurred just as often as expected. Values <1.0 represent unfavored
angles, and values >1.0 indicate preferred packing angles.
angles in the range -90° <
< 90° represent an interaction in which the N-terminal to C-terminal direction of the helix is parallel to the direction of the strand. For
< -90° or
> 90°, the helix is packed antiparallel to the strand.
|
|
distribution by disallowing parallel packing interactions. To account for this, we analyzed a subset of the interactions shown in Figure 3A
angle did not arise from connectivity constraints. The
distributions for this set are shown in Figure 3B
Type -1 and -2 strand orientation distributions in both Figure 3A
and 3B
show some preference for parallel packing when the neighboring strands are antiparallel. Also, there is a preference to pack at angles near -30° and 150°, and to avoid packing at angles near -100° and +70°, similar to helixhelix packing. The top three panels in Figure 3
show that for parallel (type 0, 1, and 2) strand orientations, there is an increasing preference with increase in sheet parallelicity for the helix to pack antiparallel to the strand.
angles >90° are favored, whereas angles between -90° and +90° are disfavored. Type 2 strand orientations exhibit the strongest preference, with antiparallel packing strongly preferred, and almost no packing interactions observed in which the helix is oriented parallel to the strand. Figure 4
shows an ideal type 2 antiparallel helixstrand interaction present in the protein IIB cellobiose from Escherichia coli (PDB code: 1iib
[PDB]
; van Montfort et al. 1997). The yellow arrows, representing ß-strands, point in the N- to C-terminal direction. The N- to C-terminal direction of the helix is from the upper right to the lower left. The strand determined to be interacting most closely with the helix is the second from the left, and this strand is parallel to both its neighbors. The
angle for this particular interaction is 120°.
|
packing angle as a function of local sheet twist
. This was essentially a random scatter plot for all five strand orientations, showing no clear patterns. The correlation coefficients between local sheet twist and
angles were also small and variable in sign (Table 1| Discussion |
|---|
|
|
|---|
packing angles (Fig. 2
angles, with peaks corresponding to both parallel and antiparallel orientations and with few helices packed perpendicular to each other. Analyzing orientations in helixsheet interactions is more complex, because an individual sheet can consist of all parallel, all antiparallel, or mixed parallel and antiparallel strands. This diversity in hydrogen bonding pattern, along with varying amino acid composition, also leads to nonplanarity being the rule, rather than the exception, for ß-sheets. One hypothesis tested is that, as the twist of a sheet increasingly deviates from planarity, steric interactions between the helix and strands adjacent to the interacting strand would force the helix to turn, causing a stronger preference in
packing angle. However, no significant correlation was observed between local sheet twist and
packing angle. Thus, the observed preference for antiparallel alignment of helices with parallel regions of sheets could be due to dipole interactions between the helix and sheet, or local side-chain interactions being more important than sheet twist in defining the helix orientation. Side chains vary significantly in their interaction properties, and may also be flexible. Thus, an analysis of the role of side-chain interactions in helixsheet packing, similar to that reported for helixhelix packing interfaces (Walther et al. 1996), is warranted.
In the distributions of
angle for each strand orientation shown in Figure 3
, the orientations showing the strongest
angle preference are those in which the interacting strand is parallel to its neighbors. In these cases, strand orientations 1 and 2, the helix prefers to pack antiparallel to the strand, at an angle near 160°. One possible explanation for this preference is the presence of a net dipole arising from the hydrogen bonding pattern in parallel strands. The hydrogen bonds between antiparallel strands are nearly perpendicular to the protein backbone, and a negligible net dipole moment is produced. In these helixstrand interactions, the dipole would not be expected to play a role, and we observe no strong preference in
angles. However, hydrogen bonds between parallel strands make an ~20° angle with respect to the N-terminal to C-terminal direction of the protein backbone, leading to net dipole moment of about 1.15 Debyes (Hol et al. 1981). It was previously observed that there is a favorable electrostatic interaction energy between dipoles of helices and sheets in proteins consisting entirely of parallel sheets, indicating antiparallel packing of the structures would be favored (Hol et al. 1981). This trend for strong antiparallel preference in helix packing is observed in our study of helixsheet interactions found in all the 1316 proteins.
It recently has been emphasized that the proteins comprising the PDB contain a significant number of repeated supersecondary structure motifs (Salem et al. 1999). Of particular importance to our study is the prevalence of the ß-
-ß supersecondary structure (Sternberg and Thornton 1976). This motif consists of two strands, hydrogen bonded in parallel, and connected by a stretch of protein that contains at least one helix. Often the loop region between one of the strands and the helix is short (
7 residues), which forces the helix to pack antiparallel to the strand, as the loop region is not long enough to allow a parallel helixstrand interaction. An analysis of helixstrand interactions in which potential ß-
-ß motifs were excluded (Fig. 3B
) yielded remarkably similar
angle distributions relative to the entire dataset (Fig. 3A
). This similarity indicates that the preference for helices to pack antiparallel to parallel regions of sheets does not result from conformational restrictions imposed by short loops during folding.
In conclusion, the preferred packing angles found for helixhelix interactions support the results of earlier studies, whereas we uncover a significant preference for antiparallel packing between helices and strands that is strongly dependent on the degree of parallel content in the sheet. In addition to indicating a role for helixsheet dipole interactions in guiding and stabilizing protein supersecondary structure formation, these preferred angles for helixsheet packing can be useful guides for protein design.
| Materials and methods |
|---|
|
|
|---|
Representing secondary structures as vectors
The residues forming regular secondary structure in each PDB file were identified according to the HELIX and SHEET records. To include only significant interactions, we required helices to have at least seven residues, corresponding to two complete turns of a regular
-helix. Strands were required to have at least three residues for proper fitting of a vector to the C
coordinates. Assuming a rise/residue of 1.5 Å for helices and 3.5 Å for strands, the minimal structures allowed for helices and sheets both have an axial length of 10.5 Å. For each sheet identified, the closest distance between neighboring strands was measured, and any sheet that had a closest interstrand distance >5.0 Å was visually checked to see that the strands were listed in the proper topological order. Errors in strand order within a PDB file were fixed manually.
The
-carbon positions of each residue in a helix and strand were used to compute the best-fit line through a given structure by using a parametric least squares algorithm (Christopher et al. 1996). Because an individual strand can deviate severely from linearity, the degree to which each strand bowed was also computed as: Bow = ||m|| / ||d||, where d is a vector between the first C
and the last C
in the strand, and m is a perpendicular vector from d to the C
in the middle of the strand. If the strand contained an even number of residues, the average position of the middle two C
s was used to compute m.
Identifying a pair of interacting secondary structures
Each helix in a protein was represented in 3D by a finite axial vector h, and each strand, or second helix in the case of helixhelix interactions, was represented by a vector s, as shown graphically in Figure 1
. The Euclidean distance between the midpoints of h and s was defined as MD. The closest point of approach between h and vector s was computed using equations described by Chothia et al. (1981). The quantity CP1 was defined as the percentage along the length of h, relative to its start, to reach the point closest to the vector s. Similarly, CP2 defines the percentage along vector s to reach the point closest to vector h. If both CP1 and CP2 were between 0.0 and 1.0, then the line of closest approach, represented by vector L (of length CD), would intersect the axis of both secondary structures.
A helix was defined as interacting with a strand if the following criteria were met:
20.0 Å (a rough screen).
13.0 Å (ensuring two structures interact closely).
CP1, CP2
0.99 (ensuring that the closest point of interaction is within the secondary structure itself).
13.0 Å; CDk
13.0 Å, where j and k are the two closest strands to s (ensuring the helix is interacting with the face of the sheet, not along the edge).
0.25 (ensuring that the sheet in the vicinity of a helixstrand interaction is not excessively bowed, because of ß-bulges or nonstandard
,
angles; Salemme 1983).
Assigning local strand orientation
The orientation of a strand relative to its hydrogen bonded neighbor(s) was determined using the "sense" field assigned to columns 3940 of the SHEET record in a PDB file. The orientation value of strand i was computed as the sense of strand i plus the sense of strand i + 1. For example, if the second strand in a sheet is parallel to the first one, it has a sense value of 1. If the third strand in the sheet is parallel to the second strand, it also has a sense value of 1. The orientation value of the second strand is the sum of these two values, 1 + 1 = 2. Table 1
lists the five possible orientation values that can occur for a strand in a sheet, ranging from -2, most antiparallel, to +2, most parallel.
Measuring the
packing angle and local sheet twist
Because the vector of closest approach, L, is perpendicular to both h and s, the packing geometry between the two structures can be defined by a single dihedral angle,
. This angle is measured between s and the projection of h into the plane defined by s and s x L (Fig. 1
).
The local sheet twist was measured to determine the extent to which the helixsheet packing angle depends on steric interactions with the groove formed by the sheet, due to the twisting of its strands. For a helixstrand interaction, the orthogonal vectors s and L were used to describe a plane, W, that is locally perpendicular to the surface of the sheet. The vectors representing the two neighboring strands to s were then projected onto W. The sheet twist local to strand s was then computed as the average angle between strand s and the projection of each neighboring strand onto W. The sign of the twist angle corresponds to the expected increase or decrease in
angle due to steric effects as local sheet twist increases.
Helixhelix
packing angles
The distribution of helixhelix packing angles was measured to allow comparison to previous studies of secondary structure packing and to assess any changes in the distribution due to using a larger database of interactions. Identification and analysis of helixhelix packing were performed as described earlier for helixsheet pairs, except that a second helix took the place of the strand. Bias in the distribution of observed helixhelix packing angles was removed as described previously (Walther et al. 1998).
Removing topologically constrained interactions from the dataset
To create a subset of the data in which topologically constrained interactions were removed, the order of the secondary structures along the primary structure of each protein was determined. The subset was created by excluding any helixstrand pair if the loop connecting them did not contain another secondary structure.
Normalizing the distributions of helixsheet
angles
A geometric bias proportional to sin
, which arises because of finite length vector representations of the secondary structures, was taken into account as described previously (Walther et al. 1998). However, only a sin
correction was used, rather than sin2
. A second sin
bias, due to the inequality of the solid angles arising from equal sampling of
in a sphericalpolar distribution (Bowie 1997), for helixhelix pairs does not occur for helixsheet interactions. This is because close packing requires that the helices lie roughly in a plane, rather than a spherical section, relative to the sheet.
| Acknowledgments |
|---|
The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.
| References |
|---|
|
|
|---|
Berman, H.M., Westbrook, J., Feng, Z., Gilliland, G., Bhat, T.N., Weissig, H., Shindyalov, I.N., and Bourne, P.E. 2000. The Protein Data Bank. Nucleic Acids Res. 28: 235242.
Bowie, J.U. 1997. Helix packing angle preferences. Nat. Struct. Biol. 4: 915917.[CrossRef][Medline]
Bryngelson, J.D., Onuchic, J.N., Socci, N.D., and Wolynes, P.G. 1995. Funnels, pathways, and the energy landscape of protein folding: A synthesis. Proteins 21: 167195.[CrossRef][Medline]
Chothia, C., Levitt, M., and Richardson, D. 1981. Helix to helix packing in proteins. J. Mol. Biol. 145: 215250.[CrossRef][Medline]
Chou, K.-C., Nemethy, G., Rumsey, S., Tuttle, R.W., and Scheraga, H.A. 1985. Interactions between an
helix and a ß sheet. J. Mol. Biol. 186: 591609.[CrossRef][Medline]
Christopher, J.A., Swanson, R., and Baldwin, T.O. 1996. Algorithms for finding the axis of a helix: Fast rotational and parametric least-squares method. Comput. Chem. 20: 339345.[CrossRef][Medline]
Cohen, F.E., Sternberg, M.J.E., and Taylor, W.R. 1982. Analysis and prediction of the packing of
-helices against a beta-sheet in the tertiary structure of globular proteins. J. Mol. Biol. 156: 821862.[CrossRef][Medline]
Hespenheide, B.M., Rader, A.J., Thorpe, M.F., and Kuhn, L.A. 2002. Identifying protein folding cores from the evolution of flexible regions during unfolding. J. Mol. Graph. Model. 21: 195207.[CrossRef][Medline]
Hobohm, U., Scharf, M., and Schneider, R. 1993. Selection of representative protein data sets. Protein Sci. 1: 409417.
Hol, W.G.J., Halie, L.M., and Sander, C. 1981. Dipoles of the
-helix and ß-sheet: Their role in protein folding. Nature 294: 532536.[CrossRef][Medline]
Honig, B. 1999. Protein folding: From the Levinthal paradox to structure prediction. J. Mol. Biol. 293: 283293.[CrossRef][Medline]
Janin, J. and Chothia, C. 1980. Packing of
-helices onto ß-pleated sheets and the anatomy of
/ß proteins. J. Mol. Biol. 143: 95128.[CrossRef][Medline]
Karplus, M. and Weaver, D.L. 1994. Protein folding dynamics: The diffusion-collision model and experimental data. Protein Sci. 3: 650668.[Abstract]
Kippen, A., Sancho, J., and Fersht, A.R. 1994. Folding of barnase in parts. Biochemistry 33: 37783786.[CrossRef][Medline]
Li, R. and Woodward, C. 1999. The hydrogen exchange core and protein folding. Protein Sci. 8: 15711591.[Abstract]
Perrett, S., Clarke, J., Hounslow, A.M., and Fersht, A.R. 1995. Relationship between equilibrium amide proton exchange behavior and the folding pathway of barnase. Biochemistry 34: 92889298.[CrossRef][Medline]
Reddy, B.V.B. and Blundell, T.L. 1993. Packing of secondary structural elements in proteins. Analysis and prediction of inter-helix distances. J. Mol. Biol. 233: 446479.
Salem, G.M., Hutchinson, E.G., Orengo, C.A., and Thornton, J.M. 1999. Correlation of observed fold frequency with the occurrence of local structural motifs. J. Mol. Biol. 287: 969981.[CrossRef][Medline]
Salemme, F.R. 1983. Structural properties of protein ß-sheets. Prog. Biophys. Mol. Biol. 42: 95133.[CrossRef][Medline]
Sternberg, M.J. and Thornton, J.M. 1976. On the conformation of proteins: The handedness of the ß-strand-
-helix-ß-strand unit. J. Mol. Biol. 105: 367382.[CrossRef][Medline]
Udgaonkar, J.B. and Baldwin, R.L. 1988. NMR evidence for an early framework intermediate on the folding pathway of ribonuclease A. Nature 335: 694699.[CrossRef][Medline]
van Montfort, R.L., Pijning, T., Kalk, K.H., Reizer, J., Saier Jr., M.H., Thunnissen, M.M., Robillard, G.T., and Dijkstra, B.W. 1997. The structure of an energy-coupling protein from bacteria, IIB cellobiose, reveals similarity to eukaryotic protein tyrosine phosphatases. Structure 5: 217225.[Medline]
Walther, D., Eisenhaber, F., and Argos, P. 1996. Principles of helix-helix packing in proteins: The helical lattice superposition model. J. Mol. Biol. 255: 536553.[CrossRef][Medline]
Walther, D., Springer, C., and Cohen, F.E. 1998. Helix-helix packing angle preferences for finite helix axes. Proteins 33: 457459.[CrossRef][Medline]
![]()
CiteULike
Connotea
Del.icio.us
Digg
Reddit
Technorati What's this?
This article has been cited by other articles:
![]() |
S. Jiang and I. A. Vakser Shorter side chains optimize helix-helix packing Protein Sci., May 1, 2004; 13(5): 1426 - 1429. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |