|
|
||||||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1 Biochemistry and Molecular Biophysics, California Institute of Technology, Pasadena, California 91125, USA
2 Division of Chemistry and Chemical Engineering, California Institute of Technology, Pasadena, California 91125, USA
3 Howard Hughes Medical Institute, California Institute of Technology, Pasadena, California 91125, USA
(RECEIVED January 24, 2006; FINAL REVISION May 12, 2006; ACCEPTED May 15, 2006)
| Abstract |
|---|
|
|
|---|
Keywords: protein design; electrostatics; engrailed; N-capping
| Introduction |
|---|
|
|
|---|
Electrostatic interactions contribute significantly to protein stability and function. Not only must intraprotein interactions be considered (hydrogen bonding, salt bridges, etc.), but effects due to the aqueous environment such as polar solvation and solvent screening need to be evaluated as well. It is computationally intractable to consider all individual water molecules surrounding a protein during a protein design calculation. Continuum approaches that consider the solvent at a macroscopic level using various numerical solutions of the Poisson-Boltzmann equation (Honig and Nicholls 1995) have been used to predict side chain pKa's, the electrostatic component of binding, and other biologically important processes. However, a full Poisson-Boltzmann calculation is far too time-consuming to be used at each step of a protein design calculation. Various methods attempting to reproduce the accuracy of Poisson-Boltzmann calculations within the restrictions of protein design include the adaptation of a solvent exclusion method (Lazaridis and Karplus 1999) in designing a novel protein fold (Kuhlman et al. 2003), a modified Tanford-Kirkwood approach (Havranek and Harbury 1999) to design specific proteinprotein interactions (Havranek and Harbury 2003), use of a Born method in a new protein design force field (Pokala and Handel 2004), and a highly parameterized set of simple terms (Wisz and Hellinga 2003) in designing enzymatic activity onto a previously catalytically inactive scaffold (Dwyer et al. 2004). Work in this lab led to the development of a two-body decomposable implementation of a Poisson-Boltzmann calculation useful in protein design (Marshall et al. 2005).
Previous design studies have shown both the importance of electrostatics and the need to improve the electrostatic component (Marshall et al. 2002) of our protein design algorithm, ORBIT (Dahiyat and Mayo 1996, 1997). Local interactions were shown to be underrepresented and hydrogen bonding was overrepresented relative to long-range Coulombic interactions.
Here we show that a comparison of ORBIT electrostatic energies and those calculated using the finite difference Poisson-Boltzmann implementation in DelPhi (Rocchia et al. 2001) allowed a parameterization of the simple Coulombic equation term used in ORBIT. By scaling the dielectric value it is possible to approximate the energies calculated using the more accurate Poisson-Boltzmann method. Local interactions (side chainbackbone) and longer range interactions (side chainside chain) are parameterized separately. The polar solvation model used in this study penalizes the burial of non-hydrogen-bonded, non-backbone polar hydrogens.
| Results |
|---|
|
|
|---|
helices (Fig. 1). The wild-type sequence is not optimized for stability with seven positive charges distributed across a small amount of surface. The protein sequence resulting from a design calculation with the unoptimized electrostatic terms of ORBIT is NC0 (Marshall et al. 2002). While eliminating the charge excess, this designed protein was shown to have incorporated a number of unfavorable electrostatic interactions relative to wild type: a reduced number of N-capping interactions and an increased number of potentially destabilizing interactions with the helix dipole. NC0 has a stability slightly greater than wild type and is used as the baseline for ORBIT's electrostatic performance in this study. An updated rotamer library that was shown to lead to similar designed sequences as a previous library was used in the study reported here. The sequence designed with this new library but with the unoptimized electrostatic term is NC0_new. This protein was designed as a control for the new electrostatic term.
|
Circular dichroism wavelength scans indicated that the designed proteins were well folded and
-helical. Thermal denaturation studies were carried out on NC0, NC0_new, and Dielec_H (Fig. 2). All proteins unfolded completely and reversibly. For comparison, the thermal denaturation curve of wild-type engrailed is also included.
|
|
|
| Discussion |
|---|
|
|
|---|
| Materials and methods |
|---|
|
|
|---|
1 domain of protein G), 1rge (ribonuclease SA), 1rhe (Rhe VL), 1whi (L14 ribosomal protein), 1tta
[PDB]
(transthyretin), 2rn2 (ribonuclease H), 3lzm (T4 lysozyme), and 1amm (
B-crystalin). The DelPhi (Rocchia et al. 2001) calculations used a grid spacing of 2.0 grids Å1, an interior dielectric of 4.0, an exterior dielectric of 80.0, 0.050 M salt, and a probe radius of 1.4 Å. PARSE charges and radii were used (Sitkoff et al. 1994). In order to more directly compare with the terms in the ORBIT force field, the DelPhi results for both unfolded and folded states were separated into backbone and side chain desolvation and screened Coulombic interactions. The description of the unfolded and folded states of the backbone and the side chains can be found in Figures 13 in Marshall et al. (2005). The dielectric value used in the ORBIT Coulombic term is then scaled to more closely agree with the values calculated with DelPhi. Correlations coefficients of the fits between DelPhi electrostatic energies and scaled Coulombic energies (side chain/side chain and side chain/backbone) are >0.9 (data not shown). In order to facilitate comparison with previous work (Marshall et al. 2002), electrostatic calculations in Table 1 were performed with the same DelPhi parameters as above, with the exception of a probe radius of 0 Å.
The preparation of the engrailed homeodomain PDB (Berman et al. 2000) structure, 1enh, and the designed positions are the same surface positions as reported in Marshall et al. (2002). Residues allowed at the designed positions were Ala, Ser, Thr, Asp, Asn, His, Glu, Gln, Lys, and Arg. Rotamers are derived from the rotamer library of Dunbrack and Karplus (1994) with expansion of 1 standard deviation about angles
1 and
2 of aliphatic residues, expansion of 1 standard deviation around
1 of hydrophobic residues, and no expansion of polar residue dihedral angles.
The force field in ORBIT contains van der Waals, Coulombic, hydrogen-bond, and solvation terms (Gordon et al. 1999). The hydrogen-bond term is geometry- and hybridization-dependent, as described in Dahiyat et al. (1997). The polar hydrogen burial term is calculated as 2.0 kcal/mol for each nonbackbone, non-hydrogen-bonded buried polar hydrogen, as described (Dahiyat et al. 1997). Sequence optimization was performed using DEE (Desmet et al. 1992; Goldstein 1994; Gordon and Mayo 1998) or HERO (Gordon et al. 2003). The one best sequence for each design was expressed and purified for biophysical analysis.
Genes for the engrailed variants were prepared by recursive PCR (Prodromou and Pearl 1992) and cloned into pET-11a (Novagen). Wild-type engrailed expresses poorly and was cloned into a plasmid that had been engineered to include N-terminal His-tags and a ubiquitin domain with a ubiquitin-specific cleavage site (Pilon et al. 1997). DNA sequencing confirmed the identity of all variants. Proteins were expressed in BL21(DE3) E. coli cells (Stratagene) and isolated with freeze-thaw (Johnson and Hecht 1994) or sonication. Proteins were purified by HPLC as in Marshall et al. (2002) or nickel exchange columns (Qiagen). Cleavage of the protein of interest from the fusion domain occurred by use of the protease UCH-L3 (Boston Biochem) at 37°C for 14 h. Proteins were confirmed with MALDI-TOF mass spectrometry. Temperature denaturation circular dichroism was carried out as described (Marshall et al. 2002).
| Footnotes |
|---|
Article published online ahead of print. Article and publication date are at http://www.proteinscience.org/cgi/doi/10.1110/ps.062105506.
| Acknowledgments |
|---|
| References |
|---|
|
|
|---|
Berman H.M., Westbrook J., Feng Z., Gilliland G., Bhat T.N., Weissig H., Shindyalov I.N., Bourne P.E. 2000. The Protein Data Bank. Nucleic Acids Res. 28: 235242.
Dahiyat B.I. and Mayo S.L. 1996. Protein design automation. Protein Sci. 5: 895903.[Abstract]
Dahiyat B.I. and Mayo S.L. 1997. De novo protein design: Fully automated sequence selection. Science 278: 8287.
Dahiyat B.I., Gordon D.B., Mayo S.L. 1997. Automated design of the surface positions of protein helices. Protein Sci. 6: 13331337.[Abstract]
Desmet J., De Maeyer M., Hazes B., Lasters I. 1992. The dead-end elimination theorem and its use in protein side-chain positioning. Nature 356: 539542.[CrossRef]
Dill K.A. 1990. Dominant forces in protein folding. Biochemistry 29: 71337155.[CrossRef][Medline]
Dunbrack R.L. and Karplus M. 1994. Conformational analysis of the backbone-dependent rotamer preferences of protein side-chains. Nat. Struct. Biol. 1: 334340.[CrossRef][Medline]
Dwyer M.A., Looger L.L., Hellinga H.W. 2004. Computational design of a biologically active enzyme. Science 304: 19671971.
Gilson M.K. and Honig B. 1989. Destabilization of an
-helix bundle protein by helix dipoles. Proc. Natl. Acad. Sci. 86: 15241528.
Goldstein R.F. 1994. Efficient rotamer elimination applied to protein side-chains and related spin-glasses. Biophys. J. 66: 13351340.
Gordon D.B. and Mayo S.L. 1998. Radical performance enhancements for combinatorial optimization algorithms based on the dead-end elimination theorem. J. Comput. Chem. 19: 15051514.[CrossRef]
Gordon D.B., Marshall S.A., Mayo S.L. 1999. Energy functions for protein design. Curr. Opin. Struct. Biol. 9: 509513.[CrossRef][Medline]
Gordon D.B., Hom G.K., Mayo S.L., Pierce N.A. 2003. Exact rotamer optimization for protein design. J. Comput. Chem. 24: 232243.[CrossRef][Medline]
Havranek J.J. and Harbury P.B. 1999. Tanford-Kirkwood electrostatics for protein modeling. Proc. Natl. Acad. Sci. 96: 1114511150.
Havranek J.J. and Harbury P.B. 2003. Automated design of specificity in molecular recognition. Nat. Struct. Biol. 10: 4552.[CrossRef][Medline]
Honig B. and Nicholls A. 1995. Classical electrostatics in biology and chemistry. Science 268: 11441149.
Johnson B.H. and Hecht M.H. 1994. Recombinant proteins can be isolated from E. coli cells by repeated cycles of freezing and thawing. Biotechnology (N. Y.) 12: 13571360.[CrossRef][Medline]
Kuhlman B., Dantas G., Ireton G.C., Varani G., Stoddard B.L., Baker D. 2003. Design of a novel globular protein fold with atomic-level accuracy. Science 302: 13641368.
Lazaridis T. and Karplus M. 1999. Effective energy function for proteins in solution. Proteins 35: 133152.[CrossRef][Medline]
Marshall S.A., Morgan C.S., Mayo S.L. 2002. Electrostatics significantly affect the stability of designed homeodomain variants. J. Mol. Biol. 316: 189199.[CrossRef][Medline]
Marshall S.A., Vizcarra C.M., Mayo S.L. 2005. One- and two-body decomposable Poisson-Boltzmann methods for protein design calculations. Proteins 14: 12931304.
Pilon A., Yost P., Chase T.E., Lohnas G., Burkett T., Roberts S., Bentley W.E. 1997. Ubiquitin fusion technology: Bioprocessing of peptides. Biotechnol. Prog. 13: 374379.[CrossRef][Medline]
Pokala N. and Handel T.M. 2004. Energy functions for protein design I: Efficient and accurate continuum electrostatics and solvation. Protein Sci. 13: 925936.
Prodromou C. and Pearl L.H. 1992. Recursive PCR: A novel technique for total gene synthesis. Protein Eng. 5: 827829.
Rocchia W., Alexov E., Honig B. 2001. Extending the applicability of the nonlinear Poisson-Boltzmann equation: Multiple dielectric constants and multivalent ions. J. Phys. Chem. B 105: 65076514.[CrossRef]
Sengupta D., Behera R.N., Smith J.C., Ullmann G.M. 2005. The
helix dipole: Screened out? Structure 13: 849855.
Sitkoff D., Sharp K.A., Honig B. 1994. Accurate calculation of hydration free energies using macroscopic solvent models. J. Phys. Chem. 98: 19781988.[CrossRef]
Wisz M.S. and Hellinga H.W. 2003. An empirical model for electrostatic interactions in proteins incorporating multiple geometry-dependent dielectric constants. Proteins 51: 360377.[CrossRef][Medline]
![]()
CiteULike
Connotea
Del.icio.us
Digg
Reddit
Technorati What's this?
This article has been cited by other articles:
![]() |
K. L. Schweiker, A. Zarrine-Afsar, A. R. Davidson, and G. I. Makhatadze Computational design of the Fyn SH3 domain with increased stability through optimization of surface charge charge interactions Protein Sci., December 1, 2007; 16(12): 2694 - 2702. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Brock, K. Talley, K. Coley, P. Kundrotas, and E. Alexov Optimization of Electrostatic Interactions in Protein-Protein Complexes Biophys. J., November 15, 2007; 93(10): 3340 - 3352. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |