Protein Science
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


This Article
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Geourjon, C.
Right arrow Articles by Deléage, G.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Geourjon, C.
Right arrow Articles by Deléage, G.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?
Protein Science (2001), 10:788-797.
Copyright © 2001 The Protein Society

Identification of related proteins with weak sequence identity using secondary structure information

Christophe Geourjon, Christophe Combet, Christophe Blanchet and Gilbert Deléage

Pôle BioInformatique Lyonnais, Institut de Biologie et Chimie des Protéines, Centre National de la Recherche Scientifique, UMR 5086, 69 367 Lyon CEDEX 07, France

Reprint requests to: Dr. C. Geourjon, Pôle BioInformatique Lyonnais, IBCP-CNRS UMR 5086, 7 Passage du Vercors, 69 367 Lyon CEDEX 07, France; e-mail c.geourjon{at}ibcp.fr; fax 3304-72722601.

Molecular modeling of proteins is confronted with the problem of finding homologous proteins, especially when few identities remain after the process of molecular evolution. Using even the most recent methods based on sequence identity detection, structural relationships are still difficult to establish with high reliability. As protein structures are more conserved than sequences, we investigated the possibility of using protein secondary structure comparison (observed or predicted structures) to discriminate between related and unrelated proteins sequences in the range of 10%–30% sequence identity. Pairwise comparison of secondary structures have been measured using the structural overlap (Sov) parameter. In this article, we show that if the secondary structures likeness is >50%, most of the pairs are structurally related. Taking into account the secondary structures of proteins that have been detected by BLAST, FASTA, or SSEARCH in the noisy region (with high E value), we show that distantly related protein sequences (even with <20% identity) can be still identified. This strategy can be used to identify three-dimensional templates in homology modeling by finding unexpected related proteins and to select proteins for experimental investigation in a structural genomic approach, as well as for genome annotation.

Keywords: Protein; molecular modeling; sequence; databank; alignment; structure prediction; secondary structure; Web server


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?


This article has been cited by other articles:


Home page
Brief BioinformHome page
P. Fariselli, I. Rossi, E. Capriotti, and R. Casadio
The WWWH of remote homolog detection: The state of the art
Brief Bioinform, March 1, 2007; 8(2): 78 - 87.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
A. Aouacheria, C. Geourjon, N. Aghajari, V. Navratil, G. Deleage, C. Lethias, and J.-Y. Exposito
Insights into Early Extracellular Matrix Evolution: Spongin Short Chain Collagen-Related Proteins Are Homologous to Basement Membrane Type IV Collagens and Form a Novel Family Widely Distributed in Invertebrates
Mol. Biol. Evol., December 1, 2006; 23(12): 2288 - 2302.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
D. Soulat, J.-M. Jault, B. Duclos, C. Geourjon, A. J. Cozzone, and C. Grangeasse
Staphylococcus aureus Operates Protein-tyrosine Phosphorylation through a Specific Mechanism
J. Biol. Chem., May 19, 2006; 281(20): 14048 - 14056.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
Y. Nozaki and M. Bellgard
Statistical evaluation and comparison of a pairwise alignment algorithm that a priori assigns the number of gaps rather than employing gap penalties
Bioinformatics, April 15, 2005; 21(8): 1421 - 1428.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
M. Srinivasan, D. Lu, R. Eri, D. D. Brand, A. Haque, and J. S. Blum
CD80 Binding Polyproline Helical Peptide Inhibits T Cell Activation
J. Biol. Chem., March 18, 2005; 280(11): 10149 - 10155.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
S. Aubourg, V. Brunaud, C. Bruyere, M. Cock, R. Cooke, A. Cottet, A. Couloux, P. Dehais, G. Deleage, A. Duclert, et al.
GeneFarm, structural and functional annotation of Arabidopsis gene and protein families by a network of experts
Nucleic Acids Res., January 1, 2005; 33(suppl_1): D641 - D646.
[Abstract] [Full Text] [PDF]


Home page
Protein Eng Des SelHome page
K. Uehara, T. Kawabata, and N. Go
Filtering remote homologues using predicted structural information
Protein Eng. Des. Sel., July 1, 2004; 17(7): 565 - 570.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
M. Collin and V. A. Fischetti
A Novel Secreted Endoglycosidase from Enterococcus faecalis with Activity on Human Immunoglobulin G and Ribonuclease B
J. Biol. Chem., May 21, 2004; 279(21): 22558 - 22570.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
G. Perriere, C. Combet, S. Penel, C. Blanchet, J. Thioulouse, C. Geourjon, J. Grassot, C. Charavay, M. Gouy, L. Duret, et al.
Integrated databanks access and sequence/structure analysis services at the PBIL
Nucleic Acids Res., July 1, 2003; 31(13): 3393 - 3399.
[Abstract] [Full Text] [PDF]




HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
Copyright © 2001 by The Protein Society.