Protein Science
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH
 QUICK SEARCH:   [advanced]


     


Published online before print February 1, 2006
Protein Science, DOI: 10.1110/ps.051745906
Copyright © 2006 The Protein Society
This Article
Right arrow Full Text (PDF)
Right arrow Supplemental Research Data
Right arrow All Versions of this Article:
ps.051745906v1
15/3/509    most recent
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Wistrand, M.
Right arrow Articles by Sonnhammer, E. L.L.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Wistrand, M.
Right arrow Articles by Sonnhammer, E. L.L.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?

A general model of G protein-coupled receptor sequences and its application to detect remote homologs

Markus Wistrand1, Lukas Käll1 and Erik L.L. Sonnhammer

Center for Genomics and Bioinformatics, Karolinska Institutet, S-17177 Stockholm, Sweden

(RECEIVED August 1, 2005; FINAL REVISION October 7, 2005; ACCEPTED November 15, 2005)

G protein-coupled receptors (GPCRs) constitute a large superfamily involved in various types of signal transduction pathways triggered by hormones, odorants, peptides, proteins, and other types of ligands. The superfamily is so diverse that many members lack sequence similarity, although they all span the cell membrane seven times with an extracellular N and a cytosolic C terminus. We analyzed a divergent set of GPCRs and found distinct loop length patterns and differences in amino acid composition between cytosolic loops, extracellular loops, and membrane regions. We configured GPCRHMM, a hidden Markov model, to fit those features and trained it on a large dataset representing the entire superfamily. GPCRHMM was benchmarked to profile HMMs and generic transmembrane detectors on sets of known GPCRs and non-GPCRs. In a cross-validation procedure, profile HMMs produced an error rate nearly twice as high as GPCRHMM. In a sensitivity-selectivity test, GPCRHMM’s sensitivity was about 15% higher than that of the best transmembrane predictors, at comparable false positive rates. We used GPCRHMM to search for novel members of the GPCR superfamily in five proteomes. All in all we detected 120 sequences that lacked annotation and are potentially novel GPCRs. Out of those 102 were found in Caenorhabditis elegans, four in human, and seven in mouse. Many predictions (65) belonged to Pfam domains of unknown function. GPCRHMM strongly rejected a family of arthropod-specific odorant receptors believed to be GPCRs. A detailed analysis showed that these sequences are indeed very different from other GPCRs. GPCRHMM is available at http://gpcrhmm.cgb.ki.se.

Keywords: Hidden Markov model; G protein-coupled receptors; protein classification

Article published online ahead of print. Article and publication date are at http://www.proteinscience.org/cgi/doi/10.1110/ps.051745906.


Reprint requests to: Erik L.L. Sonnhammer, Center for Genomics and Bioinformatics, Karolinska Institutet, S-17177 Stockholm, Sweden; e-mail: Erik.Sonnhammer{at}ki.se; fax: 46-8-337983.

Supplemental material: see www.proteinscience.org

1 These authors contributed equally to this work.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?


This article has been cited by other articles:


Home page
ScienceHome page
M. Ditzen, M. Pellegrino, and L. B. Vosshall
Insect Odorant Receptors Are Molecular Targets of the Insect Repellent DEET
Science, March 28, 2008; 319(5871): 1838 - 1842.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
O. Sakarya, K. S. Kosik, and T. H. Oakley
Reconstructing ancestral genome content based on symmetrical best alignments and Dollo parsimony
Bioinformatics, March 1, 2008; 24(5): 606 - 612.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
M. N. Davies, A. Secker, A. A. Freitas, M. Mendao, J. Timmis, and D. R. Flower
On the hierarchical classification of G protein-coupled receptors
Bioinformatics, December 1, 2007; 23(23): 3113 - 3118.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
H. Kim, K. Melen, M. Osterberg, and G. von Heijne
A global topology map of the Saccharomyces cerevisiae membrane proteome
PNAS, July 25, 2006; 103(30): 11142 - 11147.
[Abstract] [Full Text] [PDF]




HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH
Copyright © 2006 by The Protein Society.