|
|
||||||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1 Graduate Group in Biophysics, University of California at San Francisco, San Francisco, California 94158, USA
2 Departments of Biopharmaceutical Sciences and Pharmaceutical Chemistry, and California Institute for Quantitative Biomedical Research, University of California at San Francisco, San Francisco, California 94158, USA
3 Departamento de Genética Molecular y Microbiología, Facultad de Ciencias Biológicas, Pontificia Universidad Católica de Chile, Alameda 340, Santiago, Chile
(RECEIVED January 13, 2006; FINAL REVISION March 20, 2006; ACCEPTED March 30, 2006)
Reliable prediction of model accuracy is an important unsolved problem in protein structure modeling. To address this problem, we studied 24 individual assessment scores, including physics-based energy functions, statistical potentials, and machine learningbased scoring functions. Individual scores were also used to construct
85,000 composite scoring functions using support vector machine (SVM) regression. The scores were tested for their abilities to identify the most native-like models from a set of 6000 comparative models of 20 representative protein structures. Each of the 20 targets was modeled using a template of <30% sequence identity, corresponding to challenging comparative modeling cases. The best SVM score outperformed all individual scores by decreasing the average RMSD difference between the model identified as the best of the set and the model with the lowest RMSD (
RMSD) from 0.63 Å to 0.45 Å, while having a higher Pearson correlation coefficient to RMSD (r = 0.87) than any other tested score. The most accurate score is based on a combination of the DOPE non-hydrogen atom statistical potential; surface, contact, and combined statistical potentials from MODPIPE; and two PSIPRED/DSSP scores. It was implemented in the SVMod program, which can now be applied to select the final model in various modeling problems, including fold assignment, targettemplate alignment, and loop modeling.
Keywords: model assessment; comparative modeling; fold assignment; statistical potentials; support vector machine; protein structure prediction
Reprint requests to: Marc A. Marti-Renom and Andrej Sali, California Institute for Quantitative Biomedical Research, QB3 at Mission Bay, Suite 503B, University of California at San Francisco, 1700 4th Street, San Francisco, CA 94158, USA; e-mail: marcius{at}salilab.org or sali{at}salilab.org; fax: (415) 514-4231.
Article published online ahead of print. Article and publication date are at http://www.proteinscience.org/cgi/doi/10.1110/ps.062095806.
This article has been cited by other articles:
![]() |
D. Eramian, N. Eswar, M.-Y. Shen, and A. Sali How well can the accuracy of comparative protein structure models be predicted? Protein Sci., November 1, 2008; 17(11): 1881 - 1893. [Abstract] [Full Text] [PDF] |
||||
![]() |
U. Pieper, N. Eswar, B. M. Webb, D. Eramian, L. Kelly, D. T. Barkan, H. Carter, P. Mankoo, R. Karchin, M. A. Marti-Renom, et al. MODBASE, a database of annotated comparative protein structure models and associated resources Nucleic Acids Res., October 23, 2008; (2008) gkn791v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Yang and Y. Zhou Ab initio folding of terminal segments with secondary structures reveals the fine difference between two closely related all-atom statistical energy functions Protein Sci., July 1, 2008; 17(7): 1212 - 1219. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. J. McGuffin The ModFOLD server for the quality assessment of protein structural models Bioinformatics, February 15, 2008; 24(4): 586 - 587. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. P. Davis, D. T. Barkan, N. Eswar, J. H. McKerrow, and A. Sali Host pathogen protein interactions predicted by comparative modeling Protein Sci., December 1, 2007; 16(12): 2585 - 2596. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. Melo and A. Sali Fold assessment for comparative protein structure modeling Protein Sci., November 1, 2007; 16(11): 2412 - 2426. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. E. Fitzgerald, A. K. Jha, A. Colubri, T. R. Sosnick, and K. F. Freed Reduced Cbeta statistical potentials can outperform all-atom potentials in decoy identification Protein Sci., October 1, 2007; 16(10): 2123 - 2139. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Fasnacht, J. Zhu, and B. Honig Local quality assessment in homology models using statistical potentials and support vector machines Protein Sci., August 1, 2007; 16(8): 1557 - 1568. [Abstract] [Full Text] [PDF] |
||||
![]() |
M.-y. Shen and A. Sali Statistical potential for assessment and prediction of protein structures. Protein Sci., November 1, 2006; 15(11): 2507 - 2524. [Abstract] [Full Text] [PDF] |
||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH |