Protein Science
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


This Article
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow Supplemental Research Data
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Hung, L.-H.
Right arrow Articles by Samudrala, R.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Hung, L.-H.
Right arrow Articles by Samudrala, R.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?
Protein Science (2003), 12:288-295.
Copyright © 2003 The Protein Society

Accurate and automated classification of protein secondary structure with PsiCSI

Ling-Hong Hung and Ram Samudrala

Computational Genomics, Department of Microbiology, University of Washington, Seattle, Washington 98109, USA

Reprint requests to Ram Samudrala, Computational Genomics, Department of Microbiology, University of Washington, Box 357242, Rosen Building, 960 Republican St., Seattle, WA 98109, USA; e-mail: ram{at}compbio.washington.edu; fax: (206) 732-6055.

PsiCSI is a highly accurate and automated method of assigning secondary structure from NMR data, which is a useful intermediate step in the determination of tertiary structures. The method combines information from chemical shifts and protein sequence using three layers of neural networks. Training and testing was performed on a suite of 92 proteins (9437 residues) with known secondary and tertiary structure. Using a stringent cross-validation procedure in which the target and homologous proteins were removed from the databases used for training the neural networks, an average 89% Q3 accuracy (per residue) was observed. This is an increase of 6.2% and 5.5% (representing 36% and 33% fewer errors) over methods that use chemical shifts (CSI) or sequence information (Psipred) alone. In addition, PsiCSI improves upon the translation of chemical shift information to secondary structure (Q3 = 87.4%) and is able to use sequence information as an effective substitute for sparse NMR data (Q3 = 86.9% without 13C shifts and Q3 = 86.8% with only H{alpha} shifts available). Finally, errors made by PsiCSI almost exclusively involve the interchange of helix or strand with coil and not helix with strand (<2.5 occurrences per 10000 residues). The automation, increased accuracy, absence of gross errors, and robustness with regards to sparse data make PsiCSI ideal for high-throughput applications, and should improve the effectiveness of hybrid NMR/de novo structure determination methods. A Web server is available for users to submit data and have the assignment returned.

Keywords: NMR; chemical shifts; secondary structure; neural networks


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?


This article has been cited by other articles:


Home page
Nucleic Acids ResHome page
L.-H. Hung, S.-C. Ngan, T. Liu, and R. Samudrala
PROTINFO: new algorithms for enhanced protein structure predictions
Nucleic Acids Res., July 1, 2005; 33(suppl_2): W77 - W80.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
S. Zubkov, W. J. Lennarz, and S. Mohanty
Structural basis for the function of a minimembrane protein subunit of yeast oligosaccharyltransferase
PNAS, March 16, 2004; 101(11): 3821 - 3826.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
L.-H. Hung and R. Samudrala
PROTINFO: secondary and tertiary protein structure prediction
Nucleic Acids Res., July 1, 2003; 31(13): 3296 - 3299.
[Abstract] [Full Text] [PDF]




HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
Copyright © 2003 by The Protein Society.