|
|
||||||||
1 Bioinformatics Unit, Department of Computer Science, University College London, London WC1E 6BT, UK
2 Institute of Cancer Genetics and Pharmacogenomics, Department of Biological Sciences, Brunel University, Uxbridge, Middlesex UB8 3PH, UK
Reprint requests to: David T. Jones, Bioinformatics Unit, Department of Computer Science, University College London, Gower Street, London WC1E 6BT, UK; e-mail: d.jones{at}cs.ucl.ac.uk; fax: +44 20 7387 1397.
The elucidation of the domain content of a given protein sequence in the absence of determined structure or significant sequence homology to known domains is an important problem in structural biology. Here we address how successfully the delineation of continuous domains can be accomplished in the absence of sequence homology using simple baseline methods, an existing prediction algorithm (Domain Guess by Size), and a newly developed method (DomSSEA). The study was undertaken with a view to measuring the usefulness of these prediction methods in terms of their application to fully automatic domain assignment. Thus, the sensitivity of each domain assignment method was measured by calculating the number of correctly assigned top scoring predictions. We have implemented a new continuous domain identification method using the alignment of predicted secondary structures of target sequences against observed secondary structures of chains with known domain boundaries as assigned by Class Architecture Topology Homology (CATH). Taking top predictions only, the success rate of the method in correctly assigning domain number to the representative chain set is 73.3%. The top prediction for domain number and location of domain boundaries was correct for 24% of the multidomain set (±20 residues). These results have been put into context in relation to the results obtained from the other prediction methods assessed.
Keywords: Domains; secondary structure; protein folding; sequence analysis; structure prediction
![]()
CiteULike
Connotea
Del.icio.us
Digg
Reddit
Technorati What's this?
This article has been cited by other articles:
![]() |
C. E. Elliott, Harjono, and B. J. Howlett Mutation of a Gene in the Fungus Leptosphaeria maculans Allows Increased Frequency of Penetration of Stomatal Apertures of Arabidopsis thaliana Mol Plant, May 9, 2008; (2008) ssn014v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. N.I. Pang, K. Lin, M. A. Wouters, J. Heringa, and R. A. George Identifying foldable regions in protein sequence from the hydrophobic signal Nucleic Acids Res., February 2, 2008; 36(2): 578 - 588. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Cheng DOMAC: an accurate, hybrid protein domain prediction server Nucleic Acids Res., July 13, 2007; 35(suppl_2): W354 - W356. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Levitt Growth of novel protein structural data PNAS, February 27, 2007; 104(9): 3183 - 3188. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Xu, J. Zhang, L. Wang, J. Zhou, H. Huang, J. Wu, Y. Zhong, and Y. Shi Solution structure of Urm1 and its implications for the origin of protein modifiers PNAS, August 1, 2006; 103(31): 11625 - 11630. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Chen, W. Wang, S. Ling, C. Jia, and F. Wang KemaDom: a web server for domain prediction using kernel machine with local context. Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W158 - W163. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. E. Gewehr and R. Zimmer SSEP-Domain: protein domain prediction by alignment of secondary structure elements and profiles Bioinformatics, January 15, 2006; 22(2): 181 - 187. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Bryson, L. J. McGuffin, R. L. Marsden, J. J. Ward, J. S. Sodhi, and D. T. Jones Protein structure prediction servers at University College London Nucleic Acids Res., July 1, 2005; 33(suppl_2): W36 - W38. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Jaroszewski, L. Rychlewski, Z. Li, W. Li, and A. Godzik FFAS03: a server for profile-profile sequence alignments Nucleic Acids Res., July 1, 2005; 33(suppl_2): W284 - W288. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. K. Saini and D. Fischer Meta-DP: domain prediction meta-server Bioinformatics, June 15, 2005; 21(12): 2917 - 2920. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Bae, B. K. Mallick, and C. G. Elsik Prediction of protein interdomain linker regions by a hidden Markov model Bioinformatics, May 15, 2005; 21(10): 2264 - 2270. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Liu and B. Rost Sequence-based prediction of protein domains Nucleic Acids Res., July 7, 2004; 32(12): 3522 - 3530. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. R. Smyth, M. K. Mrozkiewicz, W. J. McGrath, P. Listwan, and B. Kobe Crystal structures of fusion proteins with large-affinity tags Protein Sci., July 1, 2003; 12(7): 1313 - 1322. [Abstract] [Full Text] [PDF] |
||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |