|
|
||||||||
1 Biophysics Program and
2 Department of Biochemistry, Stanford University, Stanford, California 94305-5307, USA
Reprint requests to: Douglas Brutlag, B400 Beckman Center, Department of Biochemistry, Stanford University School of Medicine, Stanford, CA 94305-5307, USA; e-mail: brutlag{at}stanford.edu; fax: (650) 723-6783.
We report an unsupervised structural motif discovery algorithm, FoldMiner, which is able to detect global and local motifs in a database of proteins without the need for multiple structure or sequence alignments and without relying on prior classification of proteins into families. Motifs, which are discovered from pairwise superpositions of a query structure to a database of targets, are described probabilistically in terms of the conservation of each secondary structure elements position and are used to improve detection of distant structural relationships. During each iteration of the algorithm, the motif is defined from the current set of homologs and is used both to recruit additional homologous structures and to discard false positives. FoldMiner thus achieves high specificity and sensitivity by distinguishing between homologous and nonhomologous structures by the regions of the query to which they align. We find that when two proteins of the same fold are aligned, highly conserved secondary structure elements in one protein tend to align to highly conserved elements in the second protein, suggesting that FoldMiner consistently identifies the same motif in members of a fold. Structural alignments are performed by an improved superposition algorithm, LOCK 2, which detects distant structural relationships by placing increased emphasis on the alignment of secondary structure elements. LOCK 2 obeys several properties essential in automated analysis of protein structure: It is symmetric, its alignments of secondary structure elements are transitive, its alignments of residues display a high degree of transitivity, and its scoring system is empirically found to behave as a metric.
Keywords: structural motif discovery; core fold; structural superposition; structural alignment; structural similarity score; statistical significance score; expectation
![]()
CiteULike
Connotea
Del.icio.us
Digg
Reddit
Technorati What's this?
This article has been cited by other articles:
![]() |
S. Shi, Y. Zhong, I. Majumdar, S. Sri Krishna, and N. V. Grishin Searching for three-dimensional secondary structural patterns in proteins with ProSMoS Bioinformatics, June 1, 2007; 23(11): 1331 - 1338. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Ebert and D. Brutlag Development and validation of a consistency based multiple structure alignment algorithm Bioinformatics, May 1, 2006; 22(9): 1080 - 1087. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Chen and G. M. Crippen A novel approach to structural alignment using realistic structural and environmental information Protein Sci., December 1, 2005; 14(12): 2935 - 2946. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Cheng and N. V. Grishin DOM-fold: A structure with crossing loops found in DmpA, ornithine acetyltransferase, and molybdenum cofactor-binding domain Protein Sci., July 1, 2005; 14(7): 1902 - 1910. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. G. Kann, P. A. Thiessen, A. R. Panchenko, A. A. Schaffer, S. F. Altschul, and S. H. Bryant A structure-based method for protein sequence alignment Bioinformatics, April 15, 2005; 21(8): 1451 - 1456. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Shapiro and D. Brutlag FoldMiner and LOCK 2: protein structure comparison and motif discovery on the web Nucleic Acids Res., July 1, 2004; 32(suppl_2): W536 - W541. [Abstract] [Full Text] [PDF] |
||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |