Protein Science
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


This Article
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by YU, L.
Right arrow Articles by SMITH, T. F.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by YU, L.
Right arrow Articles by SMITH, T. F.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?

Protein Science, Vol 7, Issue 12 2499-2510, Copyright © 1998 by Cold Spring Harbor Laboratory Press


ARTICLE

A homology identification method that combines protein sequence and structure information

L. YU, J. V. WHITE and T. F. SMITH
BioMolecular Engineering Research Center, College of Engineering, Boston University, 36 Cummington Street, Boston, Massachusetts 02215

A new method is presented for identifying distantly related homologous proteins that are unrecognizable by conventional sequence comparison methods. The method combines information about functionally conserved sequence patterns with information about structure context. This information is encoded in stochastic discrete state-space models (DSMs) that comprise a new family of hidden Markov models. The new models are called sequence-pattern-embedded DSMs (pDSMs). This method can identify distantly related protein family members with a high sensitivity and specificity. The method is illustrated with trypsin-like serine proteases and globins. The strategy for building pDSMs is presented. The method has been validated using carefully constructed positive and negative control sets. In addition to the ability to recognize remote homologs, pDSM sequence analysis predicts secondary structures with higher sensitivity, specificity, and Q3 accuracy than DSM analysis, which omits information about conserved sequence patterns. The identification of trypsin-like serine proteases in new genomes is discussed.
Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?


This article has been cited by other articles:


Home page
Proc. Natl. Acad. Sci. USAHome page
C. Phelps, V. Gburcik, E. Suslova, P. Dudek, F. Forafonov, N. Bot, M. MacLean, R. J. Fagan, and D. Picard
Fungi and animals may share a common ancestor to nuclear receptors
PNAS, May 2, 2006; 103(18): 7077 - 7081.
[Abstract] [Full Text] [PDF]


Home page
J. Virol.Home page
M. B. Gill, J.-E. Murphy, and J. D. Fingeroth
Functional Divergence of Kaposi's Sarcoma-Associated Herpesvirus and Related Gamma-2 Herpesvirus Thymidine Kinases: Novel Cytoplasmic Phosphoproteins That Alter Cellular Morphology and Disrupt Adhesion
J. Virol., December 1, 2005; 79(23): 14647 - 14659.
[Abstract] [Full Text] [PDF]




HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
Copyright © 1998 by The Protein Society.