|
|
||||||||
1 Department of Biochemistry, Weill Medical College of Cornell University, New York, New York 10021, USA
2 Bioinformatics Laboratory, Department of Applied Mathematics and Statistics, State University of New York at Stony Brook, Stony Brook, New York 11794-3600, USA
Reprint requests to: Ilya A. Vakser, Bioinformatics Laboratory, Department of Applied Mathematics and Statistics, State University of New York at Stony Brook, Stony Brook, NY 11794-3600, USA; e-mail: vakser{at}ams.sunysb.edu; fax: (631) 632-8490.
(RECEIVED January 24, 2003; ACCEPTED April 25, 2003)
Article and publication are at http://www.proteinscience.org/cgi/doi/10.1110/ps.0304503.
| Abstract |
|---|
|
|
|---|
, and
ß). Interestingly, the relative contribution of the steric match in different types of pairs was correlated with the number of such pairs in known protein structures. This effect may indicate an evolutionary pressure to select tightly packed elements of secondary structure to maximize the packing of the entire structure. The overall conclusion is that the steric match plays an essential role in the packing of secondary structure elements. The results are important for better understanding of principles of protein structure and may facilitate development of better methods for protein structure prediction. Keywords: Docking; protein modeling; molecular recognition; protein folding; structural bioinformatics
| Introduction |
|---|
|
|
|---|
The interaction of secondary structure elements in protein structures may be formulated in terms of docking. The docking is traditionally considered to be a problem of matching two separate molecules. The main difference in matching secondary structure elements and matching separate molecules is in the constraints imposed by the environment. Earlier studies explored the applicability of docking to secondary structure packing (Ausiello et al. 1997; Yue and Dill 2000; Vakser and Jiang 2002). A multiplicity of physicochemical factors obviously plays a role in the packing of secondary structure elements in proteins and in the formation of protein complexes. However, the well-known tight packing of structural elements indicates the importance of the geometric fit. The steric complementarity in protein interactions has been studied extensively (for a review, see Halperin et al. 2002). Obviously, the role of the steric complementarity in the interaction of secondary structure elements deserves similar attention.
Earlier studies of this subject were primarily focused on helixhelix packing (see Richmond and Richards 1978; Cohen et al. 1979; Chothia et al. 1981; Murzin and Finkelstein 1988; Reddy and Blundell 1993; Walther et al. 1996). One of the reasons was the limited number of high-quality crystal structures. Another reason probably was a traditional biochemical view on interactions of secondary structure elements that largely neglected the geometric complementarity as an important factor (with the exception of helixhelix interactions). The examples may be found in basic biochemistry texts, which describe the packing of ß-strands as determined solely by hydrogen bonding. Earlier studies addressed the interaxial distances in
, ßß, and
ß packing with implications to comparative modeling of proteins (Nagarajaram et al. 1999; Reddy et al. 1999). However, a systematic study of the role of geometric complementarity in the packing of secondary structure elements is still lacking. In this article, we present such a study. A docking algorithm based on geometric complementarity is applied to a comprehensive database of secondary structure elements derived from the Protein Data Bank (PDB). The results show that the steric fit plays an important role in the interaction of the secondary structure elements.
| Docking of secondary structure |
|---|
|
|
|---|
Secondary structure elements
The following procedure was used to select the pairs of interacting secondary structure elements. For each protein chain, the Dssp procedure (Kabsch and Sander 1983) was used to identify the secondary structure elements:
-helix, ß-strand, or a loop. The minimal number of residues was required to be four in a ß-strand and three in a loop. Helices containing less than six residues were excluded to guarantee the presence of at least two turns. A pair of secondary structure elements was considered interacting if three or more residues on each side were in contact. The residues were considered to be in contact if their Cß - Cß distance (C
for Gly) was
6 Å. Only pairs with a residue contact ratio (number of residues in contact divided by the total number of residues in the pair)
30% were selected for this study. Table 1
shows the number of pairs of secondary structure elements in contact.
|
| Results and Discussion |
|---|
|
|
|---|
-helix
-helix, ß-strandß-strand,
-helixß-strand, and looploop (Fig. 1
|
2.5 Å were considered correct. If there were more than one correct match for a pair, the lowest-energy match was picked as the final result for that pair. The numbers in the parentheses in Table 1
The docking results are shown in Figure 2
. The percentage of pairs with a correct match at a given energy rank is plotted against the energy rank. The results show that the percentage of the correct matches is strongly correlated with the energy values. The percentage grows dramatically for the lowest-energy matches (to a lesser extent for
ß pairs). Later, we show that a part of this trend is based on our selection protocol (selection of a single lowest-energy match out several correct matches per pair). However, the observed trend is far stronger that the one introduced by this bias in the selection of matches. Therefore, because our energy is based purely on surface complementarity, this pronounced trend indicates the importance of the surface match in the interaction of secondary structure elements.
|
40% for
,
80% for ßß,
40% for
ß, and
50% for looploop. The rest of the pairs were assigned to the small-interface subset. The criteria for the large and the small interfaces were chosen to make the sets approximately equally populated. The results in Figure 2
Significance of the predictions: Comparison with random models
To prove the significance of the steric complementarity revealed by the results in Figure 2
, we compared the actual docking results with would be random ones. A detailed study of random models in molecular recognition was published earlier (Tovchigrechko and Vakser 2001). In the present study, we use two models: (1) random docking and (2) random energy rank.
Model 1: Random docking
In this model, the secondary structure elements are approximated by cylinders (
-helices and ß-strands) and spheres (loops, due to their irregular shapes). The model is illustrated in Figure 3
. To eliminate no-contact cases, we set the interaxial distance for docked pairs at 10 Å (Reddy and Blundell 1993; Nagarajaram et al. 1999; Reddy et al. 1999). The correct match region was defined as
2.5 Å from the crystal structure position (approximately in accordance with our definition of the correct match in the actual docking). Such a shift is equivalent to ~30° change in orientation, out of possible 360° (Fig. 3A
). To avoid no-contact cases, an up-and-down translation along the axes of the 10-Å cylinders (the minimum length of the secondary structure elements in this study) is possible to a maximum of 10 Å. If the correct match area is
2.5 Å in each direction, the chance of hitting the correct area is one out of four. The correct match corresponds to the ~30° spin around the axis of the second helix and to the ~30° angle between the axes of the helices. Combining these four coordinates, we obtain 0.02% as the upper limit for the probability of the correct match in this random docking model. This probability of a random success can be used to estimate the statistical significance of the actual docking results.
|
Model 2: Random energy rank
If a docking run for a pair of secondary structure elements resulted in more than one correct match within 1000 lowest-energy solutions, only one match (the one with the lowest energy) was included in the statistics in Figure 2
. This selection criterion should have resulted in somewhat inflated percentages of the lowest-energy matches. To reveal the extent of this artifact, we randomized the energy rank in the lists of 1000 lowest-energy matches. These new distributions of random-rank matches were compared with the actual distributions (Fig. 4
). The actual percentages of the lowest-energy matches (ranks 120) were significantly larger than the random ones, with the exception of
ß pairs. A possible reason for the absence of the trend that goes beyond the random level in Model 2, in the case of
ß packing, may be the small number of pairs (see Table 1
). However, one also has to consider a possibility that the small number of
ß pairs in known protein structures may be the result of their loose packing, which would contribute unfavorably to a tight packing of protein structures. It is important to point out that these considerations do not affect the ability of GRAMM to correctly dock and identify the low-energy matches based on geometric complementarity alone, including the
ß pairs.
|
packing. The lowest probabilities are for the
ß packing. As shown above, the results for the
ß pairs are within the random range in Model 2. However, they are still significantly higher than the random range in Model 1. If the probability of a single correct match is 0.02% as in Model 1, then the probability of having at least one correct match among the top N matches randomly is 1 - [1 - 0.0002]N
0.02% x N. Thus, for the top 10 matches, the probability is 0.2%; for the top 20 matches, 0.4%; for the top 50, 1%, and for the top 100 matches, 2%. Interestingly, the probabilities of correct predictions for different types of pairs (ßß, looploop,
,
ß) are strongly correlated with the number of pairs available in each case (Table 1
|
-helix
-helix, ß-strandß-strand,
-helixß-strand, and looploop) were taken from a nonredundant database of known protein structures. The docking results revealed a significant percentage of correctly predicted packing configurations. The significance of the results was verified by comparison with random docking models. Different types of pairs of secondary structure elements showed different degrees of steric complementarity (from high to low: ßß, looploop,
,
ß). Interestingly, the relative contribution of the steric match in different types of pairs was correlated with the number of such pairs in known protein structures. This effect may indicate an evolutionary pressure to select tightly packed elements of secondary structure to maximize the packing of the entire structure. The overall conclusion is that the steric match plays an essential role in the packing of secondary structure elements. The results are important for better understanding of principles of protein structure and may facilitate development of better methods for protein structure prediction.
| Acknowledgments |
|---|
The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.
| References |
|---|
|
|
|---|
Baker, D. and Lim, W.A. 2002. Folding and binding: From folding towards function. Curr. Opin. Struct. Biol. 12: 1113.[CrossRef]
Chothia, C., Levitt, M., and Richardson, D. 1981. Helix to helix packing in proteins. J. Mol. Biol. 145: 215250.[CrossRef][Medline]
Cohen, F.E., Richmond, T.J., and Richards, F.M. 1979. Protein folding: Evaluation of some simple rules for the assembly of helices into tertiary structures with myoglobin as an example. J. Mol. Biol. 132: 275288.[CrossRef][Medline]
Glaser, F., Steinberg, D., Vakser, I.A., and Ben-Tal, N. 2001. Residue frequencies and pairing preferences at proteinprotein interfaces. Proteins 43: 89102.[CrossRef][Medline]
Halperin, I., Ma, B., Wolfson, H., and Nussinov, R. 2002. Principles of docking: An overview of search algorithms and a guide to scoring functions. Proteins 47: 409443.[CrossRef][Medline]
Hobohm, U. and Sander, C. 1992. Selection of a representative set of structures from the Brookhaven Protein Data Bank. Protein Sci. 1: 409417.[Abstract]
. 1994. Enlarged representative set of protein structures. Protein Sci. 3: 522524.[Abstract]
Hubbard, S.J. and Argos, P. 1994. Cavities and packing at protein interfaces. Protein Sci. 3: 21942206.[Abstract]
Kabsch, W. and Sander, C. 1983. Dictionary of protein secondary structure: Pattern recognition of hydrogen-bonded and geometrical features. Biopolymers 22: 25772637.[CrossRef][Medline]
Katchalski-Katzir, E., Shariv, I., Eisenstein, M., Friesem, A.A., Aflalo, C., and Vakser, I.A. 1992. Molecular surface recognition: Determination of geometric fit between proteins and their ligands by correlation techniques. Proc. Natl. Acad. Sci. 89: 21952199.
Keskin, O., Bahar, I., Badretdinov, A.Y., Ptitsyn, O.B., and Jernigan, R.L. 1998. Empirical solvent-mediated potentials hold for both intra-molecular and inter-molecular inter-residue interactions. Protein Sci. 7: 25782586.[Abstract]
Murzin, A.G. and Finkelstein, A.V. 1988. General architecture of the
-helical globule. J. Mol. Biol. 204: 749769.[CrossRef][Medline]
Nagarajaram, H.A., Reddy, B.V.B., and Blundell, T.L. 1999. Analysis and prediction of inter-strand packing distances between ß-sheets of globular proteins. Protein Eng. 12: 10551062.
Ponder, J.W. and Richards, F.M. 1987. Internal packing and protein structural classes. In Cold Spring Harbor symposia on quantitative biology, Vol. LII, pp. 421428. Cold Spring Harbor Laboratory, Cold Spring Harbor, NY.
Reddy, B.V.B. and Blundell, T.L. 1993. Packing of secondary structure elements in proteins: Analysis and prediction of inter-helix distances. J. Mol. Biol. 233: 464479.[CrossRef][Medline]
Reddy, B.V.B., Nagarajaram, H.A., and Blundell, T.L. 1999. Analysis of interactive packing of secondary structural elements in
/ß units in proteins. Protein Sci. 8: 573586.[Abstract]
Richmond, T.J. and Richards, F.M. 1978. Packing of
-helices: Geometrical constraints and contact areas. J. Mol. Biol. 119: 537555.[CrossRef][Medline]
Shoemaker, B.A., Portman, J.J., and Wolynes, P.G. 2000. Speeding molecular recognition by using the folding funnel: The fly-casting mechanism. Proc. Natl. Acad. Sci. 97: 88688873.
Tovchigrechko, A. and Vakser, I.A. 2001. How common is the funnel-like energy landscape in proteinprotein interactions? Protein Sci. 10: 15721583.
Tsai, C.-J., Lin, S.L., Wolfson, H.J., and Nussinov, R. 1996. Proteinprotein interfaces: Architectures and interactions in proteinprotein interfaces and in protein cores: Their similarities and differences. Crit. Rev. Biochem. Mol. Biol. 31: 127152.[Medline]
Vajda, S., Sippl, M., and Novotny, J. 1997. Empirical potentials and functions for protein folding and binding. Curr. Opin. Struct. Biol. 7: 222228.[CrossRef][Medline]
Vakser, I.A. 1995. Protein docking for low-resolution structures. Protein Eng. 8: 371377.
. 1997. Evaluation of GRAMM low-resolution docking methodology on the hemagglutinin-antibody complex. Proteins 1(Suppl): 226230.
Vakser, I.A. and Aflalo, C. 1994. Hydrophobic docking: A proposed enhancement to molecular recognition techniques. Proteins 20: 320329.[CrossRef][Medline]
Vakser, I.A. and Jiang, S. 2002. Strategies for modeling the interactions of the transmembrane helices of G proteincoupled receptors by geometric complementarity using the GRAMM computer algorithm. Methods Enzymol. 343: 313328.[Medline]
Walther, D., Eisenhaber, F., and Argos, P. 1996. Principles of helixhelix packing in proteins: The helical lattice superposition model. J. Mol. Biol. 255: 536553.[CrossRef][Medline]
Yue, K. and Dill, K.A. 2000. Constraint-based assembly of tertiary protein structures from secondary structure elements. Protein Sci. 9: 19351946.[Abstract]
![]()
CiteULike
Connotea
Del.icio.us
Digg
Reddit
Technorati What's this?
This article has been cited by other articles:
![]() |
T.-W. Nam, H. I. Jung, Y. J. An, Y.-H. Park, S. H. Lee, Y.-J. Seok, and S.-S. Cha Analyses of Mlc-IIBGlc interaction and a plausible molecular mechanism of Mlc inactivation by membrane sequestration PNAS, March 11, 2008; 105(10): 3751 - 3756. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Guharoy and P. Chakrabarti Secondary structure based analysis and classification of biological interfaces: identification of binding motifs in protein protein interactions Bioinformatics, August 1, 2007; 23(15): 1909 - 1918. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. D. Gaffin, K. Gokulan, J. C. Sacchettini, T. E. Hewett, R. Klevitsky, J. Robbins, V. Sarin, D. C. Zawieja, G. A. Meininger, and M. Muthuchamy Changes in end-to-end interactions of tropomyosin affect mouse cardiac muscle dynamics Am J Physiol Heart Circ Physiol, August 1, 2006; 291(2): H552 - H563. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Tovchigrechko and I. A. Vakser GRAMM-X public web server for protein-protein docking. Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W310 - W314. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Jiang and I. A. Vakser Shorter side chains optimize helix-helix packing Protein Sci., May 1, 2004; 13(5): 1426 - 1429. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |