Protein Science Sheba protein
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


This Article
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Marsden, R. L.
Right arrow Articles by Jones, D. T.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Marsden, R. L.
Right arrow Articles by Jones, D. T.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?
Protein Science (2002), 11:2814-2824.
Copyright © 2002 The Protein Society

Rapid protein domain assignment from amino acid sequence using predicted secondary structure

Russell L. Marsden1, Liam J. McGuffin2 and David T. Jones1

1 Bioinformatics Unit, Department of Computer Science, University College London, London WC1E 6BT, UK
2 Institute of Cancer Genetics and Pharmacogenomics, Department of Biological Sciences, Brunel University, Uxbridge, Middlesex UB8 3PH, UK

Reprint requests to: David T. Jones, Bioinformatics Unit, Department of Computer Science, University College London, Gower Street, London WC1E 6BT, UK; e-mail: d.jones{at}cs.ucl.ac.uk; fax: +44 20 7387 1397.

The elucidation of the domain content of a given protein sequence in the absence of determined structure or significant sequence homology to known domains is an important problem in structural biology. Here we address how successfully the delineation of continuous domains can be accomplished in the absence of sequence homology using simple baseline methods, an existing prediction algorithm (Domain Guess by Size), and a newly developed method (DomSSEA). The study was undertaken with a view to measuring the usefulness of these prediction methods in terms of their application to fully automatic domain assignment. Thus, the sensitivity of each domain assignment method was measured by calculating the number of correctly assigned top scoring predictions. We have implemented a new continuous domain identification method using the alignment of predicted secondary structures of target sequences against observed secondary structures of chains with known domain boundaries as assigned by Class Architecture Topology Homology (CATH). Taking top predictions only, the success rate of the method in correctly assigning domain number to the representative chain set is 73.3%. The top prediction for domain number and location of domain boundaries was correct for 24% of the multidomain set (±20 residues). These results have been put into context in relation to the results obtained from the other prediction methods assessed.

Keywords: Domains; secondary structure; protein folding; sequence analysis; structure prediction


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?


This article has been cited by other articles:


Home page
Mol PlantHome page
C. E. Elliott, Harjono, and B. J. Howlett
Mutation of a Gene in the Fungus Leptosphaeria maculans Allows Increased Frequency of Penetration of Stomatal Apertures of Arabidopsis thaliana
Mol Plant, May 9, 2008; (2008) ssn014v1.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
C. N.I. Pang, K. Lin, M. A. Wouters, J. Heringa, and R. A. George
Identifying foldable regions in protein sequence from the hydrophobic signal
Nucleic Acids Res., February 2, 2008; 36(2): 578 - 588.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
J. Cheng
DOMAC: an accurate, hybrid protein domain prediction server
Nucleic Acids Res., July 13, 2007; 35(suppl_2): W354 - W356.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
M. Levitt
Growth of novel protein structural data
PNAS, February 27, 2007; 104(9): 3183 - 3188.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
J. Xu, J. Zhang, L. Wang, J. Zhou, H. Huang, J. Wu, Y. Zhong, and Y. Shi
Solution structure of Urm1 and its implications for the origin of protein modifiers
PNAS, August 1, 2006; 103(31): 11625 - 11630.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
L. Chen, W. Wang, S. Ling, C. Jia, and F. Wang
KemaDom: a web server for domain prediction using kernel machine with local context.
Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W158 - W163.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
J. E. Gewehr and R. Zimmer
SSEP-Domain: protein domain prediction by alignment of secondary structure elements and profiles
Bioinformatics, January 15, 2006; 22(2): 181 - 187.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
K. Bryson, L. J. McGuffin, R. L. Marsden, J. J. Ward, J. S. Sodhi, and D. T. Jones
Protein structure prediction servers at University College London
Nucleic Acids Res., July 1, 2005; 33(suppl_2): W36 - W38.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
L. Jaroszewski, L. Rychlewski, Z. Li, W. Li, and A. Godzik
FFAS03: a server for profile-profile sequence alignments
Nucleic Acids Res., July 1, 2005; 33(suppl_2): W284 - W288.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
H. K. Saini and D. Fischer
Meta-DP: domain prediction meta-server
Bioinformatics, June 15, 2005; 21(12): 2917 - 2920.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
K. Bae, B. K. Mallick, and C. G. Elsik
Prediction of protein interdomain linker regions by a hidden Markov model
Bioinformatics, May 15, 2005; 21(10): 2264 - 2270.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
J. Liu and B. Rost
Sequence-based prediction of protein domains
Nucleic Acids Res., July 7, 2004; 32(12): 3522 - 3530.
[Abstract] [Full Text] [PDF]


Home page
Protein Sci.Home page
D. R. Smyth, M. K. Mrozkiewicz, W. J. McGrath, P. Listwan, and B. Kobe
Crystal structures of fusion proteins with large-affinity tags
Protein Sci., July 1, 2003; 12(7): 1313 - 1322.
[Abstract] [Full Text] [PDF]




HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
Copyright © 2002 by The Protein Society.