Protein Science
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


This Article
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by RODIONOV, M. A.
Right arrow Articles by JOHNSON, M. S.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by RODIONOV, M. A.
Right arrow Articles by JOHNSON, M. S.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?

Protein Science, Vol 3, Issue 12 2366-2377, Copyright © 1994 by Cold Spring Harbor Laboratory Press


ARTICLE

Residue-residue contact substitution probabilities derived from aligned three-dimensional structures and the identification of common folds

M. A. RODIONOV and M. S. JOHNSON
ICRF Unit of Structural Molecular Biology, Department of Crystallography, Birkbeck College, University of London, Malet Street, London WC1E 7HX, United Kingdom Institute of Bioorganic Chemistry, Belarus Academy of Sciences, Zhodinscaya str. 5/2, Minsk-141, Republic of Belarus 220141

We report the derivation of scores that are based on the analysis of residue-residue contact matrices from 443 3-dimensional structures aligned structurally as 96 families, which can be used to evaluate sequence-structure matches. Residue-residue contacts and the more than 3 X 10(6) amino acid substitutions that take place between pairs of these contacts at aligned positions within each family of structures have been tabulated and segregated according to the solvent accessibility of the residues involved. Contact maps within a family of structures are shown to be highly conserved (~75%) even when the sequence identity is approaching 10%. In a comparison involving a globin structure and the search of a sequence databank (>21,000 sequences), the contact probability scores are shown to provide a very powerful secondary screen for the top scoring sequence-structure matches, where between 69% and 84% of the unrelated matches are eliminated. The search of an aligned set of 2 globins against a sequence databank and the subsequent residue contact-based evaluation of matches locates all 618 globin sequences before the first non-globin match. From a single bacterial serine proteinase structure, the structural template approach coupled with residue-residue contact substitution data lead to the detection of the mammalian serine proteinase family among the top matches in the search of a sequence databank.
Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?


This article has been cited by other articles:


Home page
BioinformaticsHome page
G. E. Crooks, R. E. Green, and S. E. Brenner
Pairwise alignment incorporating dipeptide covariation
Bioinformatics, October 1, 2005; 21(19): 3704 - 3710.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
J. Kleinjung, J. Romein, K. Lin, and J. Heringa
Contact-based sequence alignment
Nucleic Acids Res., April 30, 2004; 32(8): 2464 - 2473.
[Abstract] [Full Text] [PDF]


Home page
Physiol. Rev.Home page
R. E. Weber and S. N. Vinogradov
Nonvertebrate Hemoglobins: Functions and Molecular Adaptations
Physiol Rev, April 1, 2001; 81(2): 569 - 628.
[Abstract] [Full Text] [PDF]




HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
Copyright © 1994 by The Protein Society.