Protein Science
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


This Article
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Aloy, P.
Right arrow Articles by Russell, R. B.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Aloy, P.
Right arrow Articles by Russell, R. B.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?
Protein Science (2002), 11:1101-1116.
Copyright © 2002 The Protein Society

Structural similarity to link sequence space: New potential superfamilies and implications for structural genomics

Patrick Aloy1, Baldomero Oliva2, Enrique Querol2, Francesc X. Aviles2 and Robert B. Russell1,3

1 EMBL, Meyerhofstrasse, 1. D-69117, Heidelberg, Germany
2 Institut de Biotecnologia i Biomedicina, Universitat Autonoma de Barcelona, Bellaterra 08193, Barcelona, Spain

The current pace of structural biology now means that protein three-dimensional structure can be known before protein function, making methods for assigning homology via structure comparison of growing importance. Previous research has suggested that sequence similarity after structure-based alignment is one of the best discriminators of homology and often functional similarity. Here, we exploit this observation, together with a merger of protein structure and sequence databases, to predict distant homologous relationships. We use the Structural Classification of Proteins (SCOP) database to link sequence alignments from the SMART and Pfam databases. We thus provide new alignments that could not be constructed easily in the absence of known three-dimensional structures. We then extend the method of Murzin (1993b) to assign statistical significance to sequence identities found after structural alignment and thus suggest the best link between diverse sequence families. We find that several distantly related protein sequence families can be linked with confidence, showing the approach to be a means for inferring homologous relationships and thus possible functions when proteins are of known structure but of unknown function. The analysis also finds several new potential superfamilies, where inspection of the associated alignments and superimpositions reveals conservation of unusual structural features or co-location of conserved amino acids and bound substrates. We discuss implications for Structural Genomics initiatives and for improvements to sequence comparison methods.

Keywords: Protein structure; sequence; function; homology; structural genomics

Abbreviations: 3D, three dimensional • Ig, immunoglobulin • RMSD, root mean square deviation • PDB, Protein Data Bank • ATP, adenosine triphosphate • SCOP, structural classification of proteins • NCBI, National Center for Biotechnology Information • URL, universal resource locator


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?


This article has been cited by other articles:


Home page
Protein Sci.Home page
P. K. Shah, P. Aloy, P. Bork, and R. B. Russell
Structural similarity to bridge sequence space: Finding new families on the bridges
Protein Sci., May 1, 2005; 14(5): 1305 - 1314.
[Abstract] [Full Text] [PDF]


Home page
Protein Sci.Home page
R. I. Sadreyev, D. Baker, and N. V. Grishin
Profile-profile comparisons by COMPASS predict intricate homologies between protein families
Protein Sci., October 1, 2003; 12(10): 2262 - 2272.
[Abstract] [Full Text] [PDF]




HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
Copyright © 2002 by The Protein Society.