Protein Science, Vol 3, Issue 10 1847-1857, Copyright © 1994 by Cold Spring Harbor Laboratory Press
ARTICLE
T. M. KLINGLER and D. L. BRUTLAG
Department of Biochemistry and Section on Medical Informatics, Stanford University School of Medicine, Stanford, California 94305-5307
We have developed a new representation for structural and functional motifs in protein sequences based on correlations between pairs of amino acids and applied it to α-helical and β-sheet sequences. Existing probabilistic methods for representing and analyzing protein sequences have traditionally assumed conditional independence of evidence. In other words, amino acids are assumed to have no effect on each other. However, analyses of protein structures have repeatedly demonstrated the importance of interactions between amino acids in conferring both structure and function. Using Bayesian networks, we are able to model the relationships between amino acids at distinct positions in a protein sequence in addition to the amino acid distributions at each position. We have also developed an automated program for discovering sequence correlations using standard statistical tests and validation techniques. In this paper, we test this program on sequences from secondary structure motifs, namely α-helices and β-sheets. In each case, the correlations our program discovers correspond well with known physical and chemical interactions between amino acids in structures. Furthermore, we show that, using different chemical alphabets for the amino acids, we discover structural relationships based on the same chemical principle used in constructing the alphabet. This new representation of 3-dimensional features in protein motifs, such as those arising from structural or functional constraints on the sequence, can be used to improve sequence analysis tools including pattern analysis and database search.
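The abstract mentions discovering correlations between pairs of sequence positions with "standard statistical tests" but does not name the test. As an illustration only, the sketch below assumes a Pearson chi-square test of independence on equal-length sequences. The function names `chi_square` and `rank_pairs` and the toy sequences are hypothetical and do not come from the paper.

```python
from collections import Counter
from itertools import combinations


def chi_square(seqs, i, j):
    """Pearson chi-square statistic for independence of the residues
    at positions i and j (assumed test; not taken from the paper)."""
    n = len(seqs)
    pair_counts = Counter((s[i], s[j]) for s in seqs)
    counts_i = Counter(s[i] for s in seqs)
    counts_j = Counter(s[j] for s in seqs)
    stat = 0.0
    for a, count_a in counts_i.items():
        for b, count_b in counts_j.items():
            # Expected count if the two positions were independent.
            expected = count_a * count_b / n
            observed = pair_counts.get((a, b), 0)
            stat += (observed - expected) ** 2 / expected
    return stat


def rank_pairs(seqs):
    """Score every pair of positions and sort from most to least correlated."""
    length = len(seqs[0])
    scores = {
        (i, j): chi_square(seqs, i, j)
        for i, j in combinations(range(length), 2)
    }
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Hypothetical toy data: positions 0 and 3 always carry opposite charges.
toy = ["EAAK", "KAAE", "EAAK", "KAAE", "EGAK", "KGAE"]
best_pair, best_score = rank_pairs(toy)[0]
```

On this toy input the highest-scoring pair is positions 0 and 3, which always co-vary. A real analysis would also need a significance threshold, a correction for testing many pairs, and enough sequences for the expected counts to be reliable.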