|
|
||||||||
Protein Science, Vol 7, Issue 2 233-242, Copyright © 1998 by Cold Spring Harbor Laboratory Press
ARTICLE |
S. JONES, M. STEWART, A. MICHIE, M. B. SWINDELLS, C. ORENGO and J. M. THORNTON
Biomolecular Structure and Modelling Unit, Department of Biochemistry and Molecular Biology, University College, Gower Street, London, WC1E 6BT, United Kingdom
A consensus approach for the assignment of structural domains in proteins is presented. The approach combines a number of previously published algorithms, and takes advantage of the elevated accuracy obtained when assignments from the individual algorithms are in agreement. The consensus approach is tested on a data set of 55 protein chains, for which domain assignments from four automated methods were known, and for which crystallographers assignments had been reported in the literature. Accuracy was found to increase in this test from 72% using individual algorithms to 100% when all four methods were in agreement. However a consensus prediction using all four methods was only possible for 52% of the dataset. The consensus approach (using three publicly available domain assignment algorithms (PUU, DETECTIVE, DOMAK)) was then used to make domain assignments for a data set of 787 protein chains from the Protein Data Bank. Analysis of the assignments showed 55.7% of assignments could be made automatically, and of these, 13.5% were multi-domain proteins. Of the remaining 44.3% that could not be assigned by the consensus procedure 90.4% had their domain boundaries assigned correctly by at least one of the algorithms. Once identified, these domains were analyzed for trends in their size and secondary structure class. In addition, the discontinuity of each domain along the protein chain was considered.
This article has been cited by other articles:
![]() |
S. Wong and M. A. Ragan MACHOS: Markov clusters of homologous subsequences Bioinformatics, July 1, 2008; 24(13): i77 - i85. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Koczyk and I. N. Berezovsky Domain Hierarchy and closed Loops (DHcL): a server for exploring hierarchy of protein domain structure Nucleic Acids Res., July 1, 2008; 36(suppl_2): W239 - W245. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. N.I. Pang, K. Lin, M. A. Wouters, J. Heringa, and R. A. George Identifying foldable regions in protein sequence from the hydrophobic signal Nucleic Acids Res., February 2, 2008; 36(2): 578 - 588. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Vandevenne, P. Filee, N. Scarafone, B. Cloes, G. Gaspard, N. Yilmaz, M. Dumoulin, J.-M. Francois, J.-M. Frere, and M. Galleni The Bacillus licheniformis BlaP beta-lactamase as a model protein scaffold to study the insertion of protein fragments Protein Sci., October 1, 2007; 16(10): 2260 - 2271. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Zhou, B. Xue, and Y. Zhou DDOMAIN: Dividing structures into domains using a normalized domain-domain interaction profile Protein Sci., May 1, 2007; 16(5): 947 - 955. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. E. Gewehr and R. Zimmer SSEP-Domain: protein domain prediction by alignment of secondary structure elements and profiles Bioinformatics, January 15, 2006; 22(2): 181 - 187. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Ostermeier Engineering allosteric protein switches by domain insertion Protein Eng. Des. Sel., August 1, 2005; 18(8): 359 - 364. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. K. Saini and D. Fischer Meta-DP: domain prediction meta-server Bioinformatics, June 15, 2005; 21(12): 2917 - 2920. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Wang, M. Hsieh, and W.-H. Li A General Tendency for Conservation of Protein Length Across Eukaryotic Kingdoms Mol. Biol. Evol., January 1, 2005; 22(1): 142 - 147. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Shibagaki and A. R. Grossman Probing the Function of STAS Domains of the Arabidopsis Sulfate Transporters J. Biol. Chem., July 16, 2004; 279(29): 30791 - 30799. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Liu and B. Rost Sequence-based prediction of protein domains Nucleic Acids Res., July 7, 2004; 32(12): 3522 - 3530. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Day, D. A.C. Beck, R. S. Armen, and V. Daggett A consensus view of fold space: Combining SCOP, CATH, and the Dali Domain Dictionary Protein Sci., October 1, 2003; 12(10): 2150 - 2160. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. V. Galzitskaya and B. S. Melnik Prediction of protein domain boundaries from sequence alone Protein Sci., April 1, 2003; 12(4): 696 - 701. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. N. Berezovsky Discrete structure of van der Waals domains in globular proteins Protein Eng. Des. Sel., March 1, 2003; 16(3): 161 - 167. [Abstract] [Full Text] [PDF] |
||||
![]() |
J.-t. Guo, D. Xu, D. Kim, and Y. Xu Improving the performance of DomainParser for structural domain partition using neural network Nucleic Acids Res., February 1, 2003; 31(3): 944 - 952. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Scalley-Kim, P. Minard, and D. Baker Low free energy cost of very long loop insertions in proteins Protein Sci., February 1, 2003; 12(2): 197 - 206. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. M. G. Pearl, C. F. Bennett, J. E. Bray, A. P. Harrison, N. Martin, A. Shepherd, I. Sillitoe, J. Thornton, and C. A. Orengo The CATH database: an extended protein family resource for structural and functional genomics Nucleic Acids Res., January 1, 2003; 31(1): 452 - 455. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. L. Marsden, L. J. McGuffin, and D. T. Jones Rapid protein domain assignment from amino acid sequence using predicted secondary structure Protein Sci., December 1, 2002; 11(12): 2814 - 2824. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. M.G. Pearl, D. Lee, J. E. Bray, D. W.A. Buchan, A. J. Shepherd, and C. A. Orengo The CATH extended protein-family database: Providing structural annotations for genome sequences Protein Sci., February 1, 2002; 11(2): 233 - 244. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Nagano, C. T. Porter, and J. M. Thornton The ({beta}{alpha})8 glycosidases: sequence and structure analyses suggest distant evolutionary relationships Protein Eng. Des. Sel., November 1, 2001; 14(11): 845 - 855. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. M. G. Pearl, N. Martin, J. E. Bray, D. W. A. Buchan, A. P. Harrison, D. Lee, G. A. Reeves, A. J. Shepherd, I. Sillitoe, A. E. Todd, et al. A rapid classification protocol for the CATH Domain Database to support structural genomics Nucleic Acids Res., January 1, 2001; 29(1): 223 - 227. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Jones, A. Marin, and J. M.Thornton Protein domain interfaces: characterization and comparison with oligomeric protein interfaces Protein Eng. Des. Sel., February 1, 2000; 13(2): 77 - 82. [Abstract] [Full Text] [PDF] |
||||
![]() |
F. M. G. Pearl, D. Lee, J. E. Bray, I. Sillitoe, A. E. Todd, A. P. Harrison, J. M. Thornton, and C. A. Orengo Assigning genomic sequences to CATH Nucleic Acids Res., January 1, 2000; 28(1): 277 - 282. [Abstract] [Full Text] [PDF] |
||||
![]() |
A.E. Todd, C.A. Orengo, and J.M. Thornton DOMPLOT: a program to generate schematic diagrams of the structural domain organization within proteins, annotated by ligand contacts Protein Eng. Des. Sel., May 1, 1999; 12(5): 375 - 379. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. R. Taylor Protein structural domain identification Protein Eng. Des. Sel., March 1, 1999; 12(3): 203 - 216. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Collinet, M. Herve, F. Pecorari, P. Minard, O. Eder, and M. Desmadril Functionally Accepted Insertions of Proteins within Protein Domains J. Biol. Chem., June 2, 2000; 275(23): 17428 - 17433. [Abstract] [Full Text] [PDF] |
||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |