Protein Science CSH PROT
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


Protein Science (2004), 13:773-785. Published by Cold Spring Harbor Laboratory Press. Copyright © 2004 The Protein Society
This Article
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Sierk, M. L.
Right arrow Articles by Pearson, W. R.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Sierk, M. L.
Right arrow Articles by Pearson, W. R.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?

Sensitivity and selectivity in protein structure comparison

Michael L. Sierk and William R. Pearson

1 Department of Biochemistry and Molecular Genetics, University of Virginia Health System, Charlottesville, Virginia 22908, USA

(RECEIVED July 23, 2003; FINAL REVISION November 26, 2003; ACCEPTED November 28, 2003)



Abstract

Seven protein structure comparison methods and two sequence comparison programs were evaluated on their ability to detect either protein homologs or domains with the same topology (fold) as defined by the CATH structure database. The structure alignment programs Dali, Structal, Combinatorial Extension (CE), VAST, and Matras were tested along with SGM and PRIDE, which calculate a structural distance between two domains without aligning them. We also tested two sequence alignment programs, SSEARCH and PSI-BLAST. Depending upon the level of selectivity and error model, structure alignment programs can detect roughly twice as many homologous domains in CATH as sequence alignment programs. Dali finds the most homologs, 321–533 of 1120 possible true positives (28.7%–45.7%), at an error rate of 0.1 errors per query (EPQ), whereas PSI-BLAST finds 365 true positives (32.6%), regardless of the error model. At an EPQ of 1.0, Dali finds 42%–70% of possible homologs, whereas Matras finds 49%–57%; PSI-BLAST finds 36.9%. However, Dali achieves >84% coverage before the first error for half of the families tested. Dali and PSI-BLAST find 9.2% and 5.2%, respectively, of the 7056 possible topology pairs at an EPQ of 0.1 and 19.5, and 5.9% at an EPQ of 1.0. Most statistical significance estimates reported by the structural alignment programs overestimate the significance of an alignment by orders of magnitude when compared with the actual distribution of errors. These results help quantify the statistical distinction between analogous and homologous structures, and provide a benchmark for structure comparison statistics.

Keywords: structure alignment; database search; statistical significance; CATH database


Reprint requests to: William R. Pearson, Department of Biochemistry and Molecular Genetics, University of Virginia Health System, P.O. Box 800733, Charlottesville, VA 22908, USA; e-mail: wrp{at}virginia.edu; fax: (434) 924-5069.

Article and publication are at http://www.proteinscience.org/cgi/doi/10.1110/ps.03328504.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?


This article has been cited by other articles:


Home page
BioinformaticsHome page
L. Holm, S. Kaariainen, P. Rosenstrom, and A. Schenkel
Searching protein structure databases with DaliLite v.3
Bioinformatics, December 1, 2008; 24(23): 2780 - 2781.
[Abstract] [Full Text] [PDF]


Home page
Protein Sci.Home page
A. Zen, V. Carnevale, A. M. Lesk, and C. Micheletti
Correspondences between low-energy modes in enzymes: Dynamics-based alignment of enzymatic functional families
Protein Sci., May 1, 2008; 17(5): 918 - 929.
[Abstract] [Full Text] [PDF]


Home page
Protein Eng Des SelHome page
A. Pandini, G. Mauri, A. Bordogna, and L. Bonati
Detecting similarities among distant homologous proteins by comparison of domain flexibilities
Protein Eng. Des. Sel., June 30, 2007; (2007) gzm021v2.
[Abstract] [Full Text] [PDF]


Home page
Protein Sci.Home page
Y. Chen and G. M. Crippen
A novel approach to structural alignment using realistic structural and environmental information
Protein Sci., December 1, 2005; 14(12): 2935 - 2946.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
M. Fladvad, M. Bellanda, A. P. Fernandes, S. Mammi, A. Vlamis-Gardikas, A. Holmgren, and M. Sunnerhagen
Molecular Mapping of Functionalities in the Solution Structure of Reduced Grx4, a Monothiol Glutaredoxin from Escherichia coli
J. Biol. Chem., July 1, 2005; 280(26): 24553 - 24561.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
Y. Zhang and J. Skolnick
TM-align: a protein structure alignment algorithm based on the TM-score
Nucleic Acids Res., April 22, 2005; 33(7): 2302 - 2309.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
M. G. Kann, P. A. Thiessen, A. R. Panchenko, A. A. Schaffer, S. F. Altschul, and S. H. Bryant
A structure-based method for protein sequence alignment
Bioinformatics, April 15, 2005; 21(8): 1451 - 1456.
[Abstract] [Full Text] [PDF]


Home page
Protein Sci.Home page
Y. Ye and A. Godzik
Database searching by flexible protein structure alignment
Protein Sci., July 1, 2004; 13(7): 1841 - 1850.
[Abstract] [Full Text] [PDF]




HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
Copyright © 2004 by The Protein Society.