|
|
||||||||
Pôle BioInformatique Lyonnais, Institut de Biologie et Chimie des Protéines, Centre National de la Recherche Scientifique, UMR 5086, 69 367 Lyon CEDEX 07, France
Reprint requests to: Dr. C. Geourjon, Pôle BioInformatique Lyonnais, IBCP-CNRS UMR 5086, 7 Passage du Vercors, 69 367 Lyon CEDEX 07, France; e-mail c.geourjon{at}ibcp.fr; fax 3304-72722601.
Molecular modeling of proteins is confronted with the problem of finding homologous proteins, especially when few identities remain after the process of molecular evolution. Using even the most recent methods based on sequence identity detection, structural relationships are still difficult to establish with high reliability. As protein structures are more conserved than sequences, we investigated the possibility of using protein secondary structure comparison (observed or predicted structures) to discriminate between related and unrelated proteins sequences in the range of 10%30% sequence identity. Pairwise comparison of secondary structures have been measured using the structural overlap (Sov) parameter. In this article, we show that if the secondary structures likeness is >50%, most of the pairs are structurally related. Taking into account the secondary structures of proteins that have been detected by BLAST, FASTA, or SSEARCH in the noisy region (with high E value), we show that distantly related protein sequences (even with <20% identity) can be still identified. This strategy can be used to identify three-dimensional templates in homology modeling by finding unexpected related proteins and to select proteins for experimental investigation in a structural genomic approach, as well as for genome annotation.
Keywords: Protein; molecular modeling; sequence; databank; alignment; structure prediction; secondary structure; Web server
![]()
CiteULike
Connotea
Del.icio.us
Digg
Reddit
Technorati What's this?
This article has been cited by other articles:
![]() |
P. Fariselli, I. Rossi, E. Capriotti, and R. Casadio The WWWH of remote homolog detection: The state of the art Brief Bioinform, March 1, 2007; 8(2): 78 - 87. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Aouacheria, C. Geourjon, N. Aghajari, V. Navratil, G. Deleage, C. Lethias, and J.-Y. Exposito Insights into Early Extracellular Matrix Evolution: Spongin Short Chain Collagen-Related Proteins Are Homologous to Basement Membrane Type IV Collagens and Form a Novel Family Widely Distributed in Invertebrates Mol. Biol. Evol., December 1, 2006; 23(12): 2288 - 2302. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Soulat, J.-M. Jault, B. Duclos, C. Geourjon, A. J. Cozzone, and C. Grangeasse Staphylococcus aureus Operates Protein-tyrosine Phosphorylation through a Specific Mechanism J. Biol. Chem., May 19, 2006; 281(20): 14048 - 14056. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Nozaki and M. Bellgard Statistical evaluation and comparison of a pairwise alignment algorithm that a priori assigns the number of gaps rather than employing gap penalties Bioinformatics, April 15, 2005; 21(8): 1421 - 1428. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Srinivasan, D. Lu, R. Eri, D. D. Brand, A. Haque, and J. S. Blum CD80 Binding Polyproline Helical Peptide Inhibits T Cell Activation J. Biol. Chem., March 18, 2005; 280(11): 10149 - 10155. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Aubourg, V. Brunaud, C. Bruyere, M. Cock, R. Cooke, A. Cottet, A. Couloux, P. Dehais, G. Deleage, A. Duclert, et al. GeneFarm, structural and functional annotation of Arabidopsis gene and protein families by a network of experts Nucleic Acids Res., January 1, 2005; 33(suppl_1): D641 - D646. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Uehara, T. Kawabata, and N. Go Filtering remote homologues using predicted structural information Protein Eng. Des. Sel., July 1, 2004; 17(7): 565 - 570. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Collin and V. A. Fischetti A Novel Secreted Endoglycosidase from Enterococcus faecalis with Activity on Human Immunoglobulin G and Ribonuclease B J. Biol. Chem., May 21, 2004; 279(21): 22558 - 22570. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Perriere, C. Combet, S. Penel, C. Blanchet, J. Thioulouse, C. Geourjon, J. Grassot, C. Charavay, M. Gouy, L. Duret, et al. Integrated databanks access and sequence/structure analysis services at the PBIL Nucleic Acids Res., July 1, 2003; 31(13): 3393 - 3399. [Abstract] [Full Text] [PDF] |
||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |