|
|
||||||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Center for Genomics and Bioinformatics, Karolinska Institutet, S-17177 Stockholm, Sweden
(RECEIVED August 1, 2005; FINAL REVISION October 7, 2005; ACCEPTED November 15, 2005)
G protein-coupled receptors (GPCRs) constitute a large superfamily involved in various types of signal transduction pathways triggered by hormones, odorants, peptides, proteins, and other types of ligands. The superfamily is so diverse that many members lack sequence similarity, although they all span the cell membrane seven times with an extracellular N and a cytosolic C terminus. We analyzed a divergent set of GPCRs and found distinct loop length patterns and differences in amino acid composition between cytosolic loops, extracellular loops, and membrane regions. We configured GPCRHMM, a hidden Markov model, to fit those features and trained it on a large dataset representing the entire superfamily. GPCRHMM was benchmarked to profile HMMs and generic transmembrane detectors on sets of known GPCRs and non-GPCRs. In a cross-validation procedure, profile HMMs produced an error rate nearly twice as high as GPCRHMM. In a sensitivity-selectivity test, GPCRHMMs sensitivity was about 15% higher than that of the best transmembrane predictors, at comparable false positive rates. We used GPCRHMM to search for novel members of the GPCR superfamily in five proteomes. All in all we detected 120 sequences that lacked annotation and are potentially novel GPCRs. Out of those 102 were found in Caenorhabditis elegans, four in human, and seven in mouse. Many predictions (65) belonged to Pfam domains of unknown function. GPCRHMM strongly rejected a family of arthropod-specific odorant receptors believed to be GPCRs. A detailed analysis showed that these sequences are indeed very different from other GPCRs. GPCRHMM is available at http://gpcrhmm.cgb.ki.se.
Keywords: Hidden Markov model; G protein-coupled receptors; protein classification
Article published online ahead of print. Article and publication date are at http://www.proteinscience.org/cgi/doi/10.1110/ps.051745906.
Reprint requests to: Erik L.L. Sonnhammer, Center for Genomics and Bioinformatics, Karolinska Institutet, S-17177 Stockholm, Sweden; e-mail: Erik.Sonnhammer{at}ki.se; fax: 46-8-337983.
Supplemental material: see www.proteinscience.org
![]()
CiteULike
Connotea
Del.icio.us
Digg
Reddit
Technorati What's this?
This article has been cited by other articles:
![]() |
M. Ditzen, M. Pellegrino, and L. B. Vosshall Insect Odorant Receptors Are Molecular Targets of the Insect Repellent DEET Science, March 28, 2008; 319(5871): 1838 - 1842. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. Sakarya, K. S. Kosik, and T. H. Oakley Reconstructing ancestral genome content based on symmetrical best alignments and Dollo parsimony Bioinformatics, March 1, 2008; 24(5): 606 - 612. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. N. Davies, A. Secker, A. A. Freitas, M. Mendao, J. Timmis, and D. R. Flower On the hierarchical classification of G protein-coupled receptors Bioinformatics, December 1, 2007; 23(23): 3113 - 3118. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Kim, K. Melen, M. Osterberg, and G. von Heijne A global topology map of the Saccharomyces cerevisiae membrane proteome PNAS, July 25, 2006; 103(30): 11142 - 11147. [Abstract] [Full Text] [PDF] |
||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH |