|
|
||||||||
Laboratories of Molecular Biophysics, Pels Family Center for Biochemistry and Structural Biology, The Rockefeller University, New York, New York 10021, USA
Reprint requests to: Andrej Sali, 1230 York Avenue, New York, NY 10021, USA; e-mail: sali{at}rockefeller.edu; fax: (212) 327-7540.
A protein structure model generally needs to be evaluated to assess whether or not it has the correct fold. To improve fold assessment, four types of a residue-level statistical potential were optimized, including distance-dependent, contact,
/
dihedral angle, and accessible surface statistical potentials. Approximately 10,000 test models with the correct and incorrect folds were built by automated comparative modeling of protein sequences of known structure. The criterion used to discriminate between the correct and incorrect models was the Z-score of the model energy. The performance of a Z-score was determined as a function of many variables in the derivation and use of the corresponding statistical potential. The performance was measured by the fractions of the correctly and incorrectly assessed test models. The most discriminating combination of any one of the four tested potentials is the sum of the normalized distance-dependent and accessible surface potentials. The distance-dependent potential that is optimal for assessing models of all sizes uses both C
and Cß atoms as interaction centers, distinguishes between all 20 standard residue types, has the distance range of 30 Å, and is derived and used by taking into account the sequence separation of the interacting atom pairs. The terms for the sequentially local interactions are significantly less informative than those for the sequentially nonlocal interactions. The accessible surface potential that is optimal for assessing models of all sizes uses Cß atoms as interaction centers and distinguishes between all 20 standard residue types. The performance of the tested statistical potentials is not likely to improve significantly with an increase in the number of known protein structures used in their derivation. The parameters of fold assessment whose optimal values vary significantly with model size include the size of the known protein structures used to derive the potential and the distance range of the accessible surface potential. Fold assessment by statistical potentials is most difficult for the very small models. This difficulty presents a challenge to fold assessment in large-scale comparative modeling, which produces many small and incomplete models. The results described in this study provide a basis for an optimal use of statistical potentials in fold assessment.
Keywords: Model evaluation; comparative modeling; fold assignment; fold assessment; statistical potentials; large scale protein structure modeling
![]()
CiteULike
Connotea
Del.icio.us
Digg
Reddit
Technorati What's this?
This article has been cited by other articles:
![]() |
F. P. Davis, D. T. Barkan, N. Eswar, J. H. McKerrow, and A. Sali Host pathogen protein interactions predicted by comparative modeling Protein Sci., December 1, 2007; 16(12): 2585 - 2596. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Melo and A. Sali Fold assessment for comparative protein structure modeling Protein Sci., November 1, 2007; 16(11): 2412 - 2426. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. E. Fitzgerald, A. K. Jha, A. Colubri, T. R. Sosnick, and K. F. Freed Reduced Cbeta statistical potentials can outperform all-atom potentials in decoy identification Protein Sci., October 1, 2007; 16(10): 2123 - 2139. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. C. Choi, A. Hobolth, D. M. Robinson, H. Kishino, and J. L. Thorne Quantifying the Impact of Protein Tertiary Structure on Molecular Evolution Mol. Biol. Evol., August 1, 2007; 24(8): 1769 - 1782. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Ferrada and F. Melo Nonbonded terms extrapolated from nonlocal knowledge-based energy functions improve error detection in near-native protein structure models Protein Sci., July 1, 2007; 16(7): 1410 - 1421. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Wu, M. Lu, M. Chen, J. Li, and J. Ma OPUS-Ca: A knowledge-based potential function requiring only C{alpha} positions Protein Sci., July 1, 2007; 16(7): 1449 - 1463. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Cheng, J. Pei, and L. Lai A Free-Rotating and Self-Avoiding Chain Model for Deriving Statistical Potentials Based on Protein Structures Biophys. J., June 1, 2007; 92(11): 3868 - 3877. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. B. S. Sidhu, Q. Sun, L. J. Nkrumah, M. W. Dunne, J. C. Sacchettini, and D. A. Fidock In Vitro Efficacy, Resistance Selection, and Structural Modeling Studies Implicate the Malarial Parasite Apicoplast as the Target of Azithromycin J. Biol. Chem., January 26, 2007; 282(4): 2494 - 2504. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Martinez-Rodriguez, M. Andujar-Sanchez, J. L. Neira, J. M. Clemente-Jimenez, V. Jara-Perez, F. Rodriguez-Vico, and F. J. Las Heras-Vazquez Site-directed mutagenesis indicates an important role of cysteines 76 and 181 in the catalysis of hydantoin racemase from Sinorhizobium meliloti Protein Sci., December 1, 2006; 15(12): 2729 - 2738. [Abstract] [Full Text] [PDF] |
||||
![]() |
M.-y. Shen and A. Sali Statistical potential for assessment and prediction of protein structures. Protein Sci., November 1, 2006; 15(11): 2507 - 2524. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Eramian, M.-y. Shen, D. Devos, F. Melo, A. Sali, and M. A. Marti-Renom A composite score for predicting errors in protein structure models Protein Sci., July 1, 2006; 15(7): 1653 - 1666. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Dehouck, D. Gilis, and M. Rooman A New Generation of Statistical Potentials for Proteins Biophys. J., June 1, 2006; 90(11): 4010 - 4017. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. P. Davis, H. Braberg, M.-Y. Shen, U. Pieper, A. Sali, and M.S. Madhusudhan Protein complex compositions predicted by structural similarity Nucleic Acids Res., May 31, 2006; 34(10): 2943 - 2952. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Devos, S. Dokudovskaya, R. Williams, F. Alber, N. Eswar, B. T. Chait, M. P. Rout, and A. Sali Simple fold composition and modular architecture of the nuclear pore complex PNAS, February 14, 2006; 103(7): 2172 - 2177. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Karchin, M. Diekhans, L. Kelly, D. J. Thomas, U. Pieper, N. Eswar, D. Haussler, and A. Sali LS-SNP: large-scale annotation of coding non-synonymous SNPs based on multiple information sources Bioinformatics, June 15, 2005; 21(12): 2814 - 2820. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. C. Opazo, R. E. Palma, F. Melo, and E. P. Lessa Adaptive Evolution of the Insulin Gene in Caviomorph Rodents Mol. Biol. Evol., May 1, 2005; 22(5): 1290 - 1298. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Dehouck, D. Gilis, and M. Rooman Database-Derived Potentials Dependent on Protein Size for In Silico Folding and Design Biophys. J., July 1, 2004; 87(1): 171 - 181. [Abstract] [Full Text] [PDF] |
||||
![]() |
U. Pieper, N. Eswar, H. Braberg, M. S. Madhusudhan, F. P. Davis, A. C. Stuart, N. Mirkovic, A. Rossi, M. A. Marti-Renom, A. Fiser, et al. MODBASE, a database of annotated comparative protein structure models, and associated resources Nucleic Acids Res., January 1, 2004; 32(90001): D217 - 222. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Kim, D. Xu, J.-t. Guo, K. Ellrott, and Y. Xu PROSPECT II: protein structure prediction program for genome-scale applications Protein Eng. Des. Sel., September 1, 2003; 16(9): 641 - 650. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. John and A. Sali Comparative protein structure modeling by iterative alignment, model building and model assessment Nucleic Acids Res., July 15, 2003; 31(14): 3982 - 3992. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Eswar, B. John, N. Mirkovic, A. Fiser, V. A. Ilyin, U. Pieper, A. C. Stuart, M. A. Marti-Renom, M. S. Madhusudhan, B. Yerkovich, et al. Tools for comparative protein structure modeling and analysis Nucleic Acids Res., July 1, 2003; 31(13): 3375 - 3380. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Wallner and A. Elofsson Can correct protein models be identified? Protein Sci., May 1, 2003; 12(5): 1073 - 1086. [Abstract] [Full Text] [PDF] |
||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |