Protein Science
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


Protein Science (2004), 13:1071-1087. Published by Cold Spring Harbor Laboratory Press. Copyright © 2004 The Protein Society
This Article
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Marti-Renom, M. A.
Right arrow Articles by Sali, A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Marti-Renom, M. A.
Right arrow Articles by Sali, A.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?

Alignment of protein sequences by their profiles

Marc A. Marti-Renom, M.S. Madhusudhan and Andrej Sali

Departments of Biopharmaceutical Sciences and Pharmaceutical Chemistry, and California Institute for Quantitative Biomedical Research, University of California at San Francisco, San Francisco, California 94143, USA

(RECEIVED August 18, 2003; FINAL REVISION December 19, 2003; ACCEPTED January 9, 2004)



Abstract

The accuracy of an alignment between two protein sequences can be improved by including other detectably related sequences in the comparison. We optimize and benchmark such an approach that relies on aligning two multiple sequence alignments, each one including one of the two protein sequences. Thirteen different protocols for creating and comparing profiles corresponding to the multiple sequence alignments are implemented in the SALIGN command of MODELLER. A test set of 200 pairwise, structure-based alignments with sequence identities below 40% is used to benchmark the 13 protocols as well as a number of previously described sequence alignment methods, including heuristic pairwise sequence alignment by BLAST, pairwise sequence alignment by global dynamic programming with an affine gap penalty function by the ALIGN command of MODELLER, sequence-profile alignment by PSI-BLAST, Hidden Markov Model methods implemented in SAM and LOBSTER, pairwise sequence alignment relying on predicted local structure by SEA, and multiple sequence alignment by CLUSTALW and COMPASS. The alignment accuracies of the best new protocols were significantly better than those of the other tested methods. For example, the fraction of the correctly aligned residues relative to the structure-based alignment by the best protocol is 56%, which can be compared with the accuracies of 26%, 42%, 43%, 48%, 50%, 49%, 43%, and 43% for the other methods, respectively. The new method is currently applied to large-scale comparative protein structure modeling of all known sequences.

Keywords: protein sequence alignment; sequence profiles; comparative protein structure modeling


Reprint requests to: Marc A. Marti-Renom, Mission Bay Genentech Hall, University of California, San Francisco, San Francisco, CA 94143, USA; e-mail: marcius{at}salilab.org; fax: (415) 514-4231.

Article and publication are at http://www.proteinscience.org/cgi/doi/10.1110/ps.03379804


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?


This article has been cited by other articles:


Home page
BioinformaticsHome page
A. Poleksic and M. Fienup
Optimizing the size of the sequence profiles to increase the accuracy of protein sequence alignments generated by profile-profile algorithms
Bioinformatics, May 1, 2008; 24(9): 1145 - 1153.
[Abstract] [Full Text] [PDF]


Home page
Protein Sci.Home page
F. P. Davis, D. T. Barkan, N. Eswar, J. H. McKerrow, and A. Sali
Host pathogen protein interactions predicted by comparative modeling
Protein Sci., December 1, 2007; 16(12): 2585 - 2596.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
J. S. Papadopoulos and R. Agarwala
COBALT: constraint-based alignment tool for multiple protein sequences
Bioinformatics, May 1, 2007; 23(9): 1073 - 1079.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
H. Rangwala and G. Karypis
Incremental window-based protein sequence alignment algorithms
Bioinformatics, January 15, 2007; 23(2): e17 - e23.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
D. Chivian and D. Baker
Homology modeling using parametric alignment ensemble generation with consensus and energy-based model selection
Nucleic Acids Res., October 18, 2006; 34(17): e112 - e112.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
J. Cheng and P. Baldi
A machine learning information retrieval approach to protein fold recognition
Bioinformatics, June 15, 2006; 22(12): 1456 - 1463.
[Abstract] [Full Text] [PDF]


Home page
Protein Eng Des SelHome page
T. Muramatsu and M. Suwa
Statistical analysis and prediction of functional residues effective for GPCR-G-protein coupling selectivity
Protein Eng. Des. Sel., June 1, 2006; 19(6): 277 - 283.
[Abstract] [Full Text] [PDF]


Home page
Protein Eng Des SelHome page
M.S. Madhusudhan, M. A. Marti-Renom, R. Sanchez, and A. Sali
Variable gap penalty for protein sequence-structure alignment
Protein Eng. Des. Sel., March 1, 2006; 19(3): 129 - 133.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
U. Pieper, N. Eswar, F. P. Davis, H. Braberg, M. S. Madhusudhan, A. Rossi, M. Marti-Renom, R. Karchin, B. M. Webb, D. Eramian, et al.
MODBASE: a database of annotated comparative protein structure models and associated resources
Nucleic Acids Res., January 1, 2006; 34(suppl_1): D291 - D295.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
H. Rangwala and G. Karypis
Profile-based direct kernels for remote homology detection and fold recognition
Bioinformatics, December 1, 2005; 21(23): 4239 - 4247.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
W. Li, J. Wang, and J.-A. Feng
NdPASA: a pairwise sequence alignment server for distantly related proteins
Bioinformatics, October 1, 2005; 21(19): 3803 - 3805.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
L. Jaroszewski, L. Rychlewski, Z. Li, W. Li, and A. Godzik
FFAS03: a server for profile-profile sequence alignments
Nucleic Acids Res., July 1, 2005; 33(suppl_2): W284 - W288.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
J. Espadaler, R. Aragues, N. Eswar, M. A. Marti-Renom, E. Querol, F. X. Aviles, A. Sali, and B. Oliva
Detecting remotely related proteins by their interactions and sequence similarity
PNAS, May 17, 2005; 102(20): 7151 - 7156.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
J. Soding
Protein homology detection by HMM-HMM comparison
Bioinformatics, April 1, 2005; 21(7): 951 - 960.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
G. E. Crooks and S. E. Brenner
An alternative model of amino acid replacement
Bioinformatics, April 1, 2005; 21(7): 975 - 980.
[Abstract] [Full Text] [PDF]


Home page
Protein Sci.Home page
Z. Zhang, S. Kochhar, and M. G. Grigorov
Descriptor-based protein remote homology identification
Protein Sci., February 1, 2005; 14(2): 431 - 444.
[Abstract] [Full Text] [PDF]


Home page
Biophys. JHome page
A. Leo-Macias, P. Lopez-Romero, D. Lupyan, D. Zerbino, and A. R. Ortiz
An Analysis of Core Deformations in Protein Superfamilies
Biophys. J., February 1, 2005; 88(2): 1291 - 1299.
[Abstract] [Full Text] [PDF]




HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
Copyright © 2004 by The Protein Society.