Protein Science
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


This Article
Right arrow Full Text
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Larson, S. M.
Right arrow Articles by Pande, V. S.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Larson, S. M.
Right arrow Articles by Pande, V. S.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?
Protein Science (2002), 11:2804-2813.
Copyright © 2002 The Protein Society

Thoroughly sampling sequence space: Large-scale protein design of structural ensembles

Stefan M. Larson1, Jeremy L. England2, John R. Desjarlais3 and Vijay S. Pande1

1 Chemistry Department and Biophysics Program, Stanford University, Stanford, California 94305, USA
2 Biochemical Sciences Program, Harvard College, Cambridge, Massachusetts 02138, USA
3 Xencor, Inc., Monrovia, California 91016, USA

Reprint requests to: Vijay S. Pande, Chemistry Department, Stanford University, Stanford, CA 94305, USA; e-mail: pande{at}stanford.edu; fax: (650) 723-4817.

Modeling the inherent flexibility of the protein backbone as part of computational protein design is necessary to capture the behavior of real proteins and is a prerequisite for the accurate exploration of protein sequence space. We present the results of a broad exploration of sequence space, with backbone flexibility, through a novel approach: large-scale protein design to structural ensembles. A distributed computing architecture has allowed us to generate hundreds of thousands of diverse sequences for a set of 253 naturally occurring proteins, allowing exciting insights into the nature of protein sequence space. Designing to a structural ensemble produces a much greater diversity of sequences than previous studies have reported, and homology searches using profiles derived from the designed sequences against the Protein Data Bank show that the relevance and quality of the sequences is not diminished. The designed sequences have greater overall diversity than corresponding natural sequence alignments, and no direct correlations are seen between the diversity of natural sequence alignments and the diversity of the corresponding designed sequences. For structures in the same fold, the sequence entropies of the designed sequences cluster together tightly. This tight clustering of sequence entropies within a fold and the separation of sequence entropy distributions for different folds suggest that the diversity of designed sequences is primarily determined by a structure's overall fold, and that the designability principle postulated from studies of simple models holds in real proteins. This has important implications for experimental protein design and engineering, as well as providing insight into protein evolution.

Keywords: Protein design; sequence space; designability; backbone flexibility; distributed computing

Abbreviations: RMSD, root-mean-square deviation • PDB, Protein Data Bank


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?


This article has been cited by other articles:


Home page
Protein Eng Des SelHome page
K. A. Crowhurst and S. L. Mayo
NMR-detected conformational exchange observed in a computationally designed variant of protein G{beta}1
Protein Eng. Des. Sel., June 26, 2008; (2008) gzn035v1.
[Abstract] [Full Text] [PDF]


Home page
Biophys. JHome page
H. K. Fung, C. A. Floudas, M. S. Taylor, L. Zhang, and D. Morikis
Toward Full-Sequence De Novo Protein Design with Flexible Templates for Human Beta-Defensin-2
Biophys. J., January 15, 2008; 94(2): 584 - 599.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
L. Meyerguz, J. Kleinberg, and R. Elber
From the Cover: The network of sequence flow between protein structures
PNAS, July 10, 2007; 104(28): 11627 - 11632.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
T. P. Treynor, C. L. Vizcarra, D. Nedelcu, and S. L. Mayo
Computationally designed libraries of fluorescent proteins evaluated by preservation and diversity of function
PNAS, January 2, 2007; 104(1): 48 - 53.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
K. A. Reynolds, J. M. Thomson, K. D. Corbett, C. R. Bethel, J. M. Berger, J. F. Kirsch, R. A. Bonomo, and T. M. Handel
Structural and Computational Characterization of the SHV-1 beta-Lactamase-beta-Lactamase Inhibitor Protein Interface
J. Biol. Chem., September 8, 2006; 281(36): 26745 - 26753.
[Abstract] [Full Text] [PDF]


Home page
Mol Biol EvolHome page
J. D. Bloom, D. A. Drummond, F. H. Arnold, and C. O. Wilke
Structural Determinants of the Rate of Protein Evolution in Yeast
Mol. Biol. Evol., September 1, 2006; 23(9): 1751 - 1761.
[Abstract] [Full Text] [PDF]


Home page
Biophys. JHome page
M. C. Saraf, G. L. Moore, N. M. Goodey, V. Y. Cao, S. J. Benkovic, and C. D. Maranas
IPRO: An Iterative Computational Protein Library Redesign and Optimization Procedure
Biophys. J., June 1, 2006; 90(11): 4167 - 4180.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
R. Chakrabarti, A. M. Klibanov, and R. A. Friesner
Sequence optimization and designability of enzyme active sites
PNAS, August 23, 2005; 102(34): 12035 - 12040.
[Abstract] [Full Text] [PDF]


Home page
Protein Eng Des SelHome page
H. Liao, W. Yeh, D. Chiang, R.L. Jernigan, and B. Lustig
Protein sequence entropy is closely related to packing density and hydrophobicity
Protein Eng. Des. Sel., February 1, 2005; 18(2): 59 - 64.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
B. Qian, A. R. Ortiz, and D. Baker
Improvement of comparative model accuracy by free-energy optimization along principal components of natural structural variation
PNAS, October 26, 2004; 101(43): 15346 - 15351.
[Abstract] [Full Text] [PDF]


Home page
ScienceHome page
B. Kuhlman, G. Dantas, G. C. Ireton, G. Varani, B. L. Stoddard, and D. Baker
Design of a Novel Globular Protein Fold with Atomic-Level Accuracy
Science, November 21, 2003; 302(5649): 1364 - 1368.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
S. Brown, N. J. Fawzi, and T. Head-Gordon
Coarse-grained sequences for protein folding and design
PNAS, September 16, 2003; 100(19): 10712 - 10717.
[Abstract] [Full Text] [PDF]




HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
Copyright © 2002 by The Protein Society.