|
|
||||||||
1 Department of Medical Chemistry, University of Szeged, HU-6720 Szeged, Hungary
2 Department of Biochemistry, Uppsala University, SE-75123 Uppsala, Sweden
Reprint requests to: David van der Spoel, Department of Biochemistry, Uppsala University, Box 576, SE-75123 Uppsala, Sweden; e-mail: spoel{at}xray.bmc.uu.se; fax: 46-18-511755.
(RECEIVED January 17, 2002; FINAL REVISION April 16, 2002; ACCEPTED April 16, 2002)
Article and publication are at http://www.proteinscience.org/cgi/doi/10.1110/ps.0202302.
| Abstract |
|---|
|
|
|---|
Keywords: Binding site; drug research; complex; flexible ligand
| Introduction |
|---|
|
|
|---|
In the past decade, molecular docking has proven to be an important tool of computer-aided drug design. Basically, three steps are necessary for successful prediction of a target/ligand complex: (1) definition of the structure of the target molecule, (2) location of the binding site, and (3) determination of the binding mode. Ideally, the structure of the target molecule should be determined experimentally, although some applications of docking have been reported based on a modelled target (Stigler et al. 1999; Menziani et al. 2001; Hetényi et al. 2002). The second step, location of the binding site, can be taken computationally as well, and there are several approaches for finding binding pockets on a protein molecule. The simplest algorithms use shape-based fitting of the ligand to the macromolecular surface (Hendlich et al. 1997; Brady and Stouten 2000). Alternatively, an empirical method for identifying interaction sites based on known protein-ligand complexes (Verdonk et al. 2001) has been reported. The third step is the "typical" application for docking algorithms: Given the binding site on a target molecule, determine the binding mode of a ligand. A large number of programs have been developed to this end, for example, DOCK (Shoichet and Kuntz 1993), AutoDock (Morris et al. 1996, 1998), and some more recent algorithms described in Budin et al. (2001) and Pang et al. (2001).
In most published docking applications, only the third step is taken, and the binding modes of small ligands have been reproduced (Stigler et al. 1999; Sotriffer et al. 1999, 2000). In all these cases, the binding site was predetermined, and therefore, the search space was limited to that region of the protein in the docking simulations. The convincing results of such studies hint to the possibility of applying the same methodology to searching a space larger than a single binding site. In a recent report, protein-protein interactions were reproduced successfully by a combination of ab initio docking and nuclear magnetic resonance data (Morelli et al. 2001). These investigators used shape-based fitting as the main step of the searching process because of the large size of the proteins. Protein-protein interactions were modeled by docking a completely rigid oligopeptide fragment of a protein using the entire surface of the neighboring protein as a target (Neurath et al. 1996). However, for molecules with many degrees of freedom (e.g., flexible peptides), such shape-based fitting or docking of a rigid ligand to the protein would not be fruitful (Klepeis et al. 1998). We have previously scanned the entire surface of an amyloid peptide for possible binding sites of flexible ß-sheet breaker peptides (Hetényi et al. 2002) and predicted a complex with features that agree very well with nuclear magnetic resonance data. There are, however, very few reports dealing with the latter kind of blind docking, in which a flexible ligand was docked to a target without prior knowledge of the binding site.
In the present study, we used the AutoDock program package (Morris et al. 1998) to test whether it is possible to find the binding sites (and binding modes) of flexible peptides on a protein without any prior knowledge of their location and conformation. A parameter set based on the AMBER force field (Cornell et al. 1995) and the possibility of using flexible as well as fixed torsions for the ligands during the docking procedure make AutoDock an appropriate tool for such a test.
| Results and Discussion |
|---|
|
|
|---|
|
|
|
Rigid ligands
Table 2
contains the results of blind docking of rigid ligands, in which the ligand had the conformation it has in the experimental complex. Therefore, the docking algorithm only has to optimize the position and orientation of the ligand molecule. In all cases, the minimum energies Edocked of the first classes and the average ones of subclasses (see Materials and Methods for definition of the term "subclass" in this study) were lower than or close to the energies Edocked calculated for the original crystal structures (Table 1
). The reasons for this are the finite grid spacing and the limited accuracy of the force field. For the simplest system, benzamidine-trypsin (A), not much difference was found between the three types of jobs (with different number of trials and energy evaluations, see Materials and Methods). Hence, we performed only one type of the three jobs (100 trials and 50 x 106 energy evaluations per trial) for the other systems.
The RMSD values of the first class and its subclass were generally <0.7 Å. For system B, the RMSD value was >1.0 Å. In that case, a piece of the central part of the cypA-binding loop of the p24 (HIV capsid) protein was used as a ligand. This particular fragment of the loop proved to be too small to mimic perfectly the fit of the central part of the loop of p24 on the cypA molecule. With the use of an additional amino acid (His) of the p24-loop in the fragment to be docked, a more accurate fit was obtained (system G), indicating that it may be possible to predict protein interactions by blind docking of a fragment. However, one should be careful with the selection of the appropriate fragment of a protein for such a modeling as both the presence and the absence of a single amino acid could, in principle, result in different binding modes or even different binding positions.
The energy minima of all jobs were in the subclasses of the first classes, namely, the structures with low RMSD values with respect to the crystal structure. One can therefore conclude that blind docking is a reliable technique for finding the binding site of rigid ligands up to at least 30 heavy atoms. In many pharmaceutical applications, the ligand is almost rigid or has a well-defined conformation (Sotriffer et al. 2000).
Flexible ligands
The real challenge for docking is the use of flexible ligand molecules, that is, those with rotatable torsion angles. The results of these types of dockings are listed in Table 3
. In all cases, the energies (Edocked) of the first classes (subclasses) were lower than or close to Edocked calculated for the original crystal structures (Table 1
). We found this for the rigid ligands as well (Table 2
), but here the effect is somewhat larger in some cases, owing to the extra degrees of freedom. In other cases, the rigid ligands do find lower energies (e.g., Cw) and lower RMSD, most likely because one effectively has more energy evaluations for searching the right conformation.
Efficiency and robustness
For small peptides such as Ile-Val (system C and Cw) and Gly-Tyr (system D), RMSD values <1.0 Å (Figs. 1a, 2a![]()
) and 1.5 Å (Fig. 1b
), respectively, were achieved. Moreover, the lowest energy conformations of 100 out of 500 docking trials were the members of the subclasses in all cases. The occupancies (N) of the classes and their subclasses were lower, than those of C, Cw, and D in Table 2
. This is owing to the larger number of degrees of freedom of the ligand: The search algorithm was used not only for finding the correct binding position/orientation but also for searching the conformation of the ligand in this case. Therefore, the number N of successful docking trials was lower. For Ile-Val, both solvated (Cw) and water-free (C) targets were used. The presence of water molecules right above the binding site did not hinder docking to the site: The occupancies N were smaller, but the Edocked values were lower than in the case of "dry" protein. This trend was found also for the crystal energies and the Edocked values of rigid ligands owing to interactions with the water molecules (Tables 1, 2![]()
). In system B, the real binding position of the loop was found only in the second class. This limited success is caused by the length of the peptide fragment that was used as a ligand (for details, see Rigid Ligands section). Tripeptides, such as Gly-Ala-Trp (system E; Fig. 2b
) and Ace-Ala-Pro-Tyr (system F; Fig. 1c
), were also docked successfully to their respective pockets (Table 3
). The Gly-Ala-Trp (
-chymotrypsin) system is a difficult task for docking calculations. The crystal structure of this complex (Harel et al. 1991) contains an average structure of a covalently bound ligand and a nonbonded complex. Obviously, the molecular mechanics-based docking technique is developed for only nonbonded interactions. Despite the problematic crystal structure, the hydrophobic pocket (the most important part of the binding site, see Fig. 2b
) of
-chymotrypsin was identified reproducibly. Together with system B, the latter example proves the robustness of the method: The binding location was found even when only part of the ligand or of the site was defined properly. In the complex of SG protease and Ace-Pro-Ala-Pro-Tyr (system I), the first Pro residue of the peptide has no specific contacts with the protein (and there was only a small energy difference between the crystal structures of systems F and I; see Table 1
), and hence, docking was not as successful as with the truncated peptide Ace-Ala-Pro-Tyr (Fig. 1c
). However, because the rigid ligand docks with 2 kcal/mole lower energy than the flexible one, and with low RMSD with respect to the crystal structure, we should conclude that the poor results with system I are owing to insufficient searching.
|
|
|
|
Another limitation of blind docking is the size of the system or the computer time one can spend to perform enough trials (Tables 2, 3![]()
). An example for this limitation is system I, which has the largest ligand (Ace-Pro-Ala-Pro-Tyr) used in our investigations. Although blind docking was successful using the truncated Ace-Ala-Pro-Tyr form of this ligand (system F), the longer tetrapeptide was docked with limited success only (large RMSD values for the first rank) to the target. Additionally, the weak interactions between Pro1 and the protein makes this peptide a difficult ligand for docking. Finally, the possibility of accurately docking a ligand to a target depends critically on the quality of the target structure, and AutoDock, like other docking programs, has as of yet no way to treat flexible target, although a work-around using multiple target structures has been tried with success (Österberg et al. 2002).
Recommendations
Based on our results (Tables 2,3![]()
; Fig. 4
), we recommend the use of a fairly large population size (e.g., 250) and at least 10 million energy evaluations per trial for blind docking of flexible peptide ligands to proteins. The number of trials should be
100. For rigid ligands, more modest requirements will do; the default population size of 50 and 50 trials should suffice in most cases. Obviously, a finer grid size would be advantageous for the quality of docking results (Hetényi et al. 2002); however, the memory requirements of the algorithm scale as the inverse third power of the grid spacing make it difficult to go much beyond our current grid spacing.
| Conclusions |
|---|
|
|
|---|
Blind docking can be a useful tool for exploration of possible binding sites of drugs on their target proteins, if the active site of the drug is unknown. It is particularly attractive that a single method can be used for binding site search and accurate docking of ligands.
| Materials and methods |
|---|
|
|
|---|
Docking procedure
Mass-centered grid maps were generated with 0.55 Å spacing by the AutoGrid program for the whole protein target. Lennard-Jones parameters 1210 and 126 (supplied with the program package) were used for modeling H-bonds and Van der Waals interactions, respectively. The distance-dependent dielectric permittivity of Mehler and Solmajer (1991) was used for the calculation of the electrostatic grid maps. The Lamarckian genetic algorithm (LGA) and the pseudo-Solis and Wets methods were applied for minimization using default parameters (Table 4
, except as indicated). The number of generations was set to 10 million in all trials (runs), and the stopping criterion was therefore defined by the total number of energy evaluations. Random starting positions on the entire protein surface, random orientations, and torsions (flexible ligands only) were used for the ligands. Three different sets of jobs were performed, (1) 100 trials, 10 x 106 energy evaluations; (2) 500 trials, 5 x 106 energy evaluations; and (3) 100 trials and 50 x 106 energy evaluations (except D and I, 500 trials). The computational cost in energy evaluations of the different job types is then (1) 109, (2) 2.5 x 109, and (3) 5 x 109 (D and I: 25 x 109). For all job types, the populations in the genetic algorithm was 50. Some jobs with different parameters were performed, as indicated in the Results section.
|
Second, within each class, the positional RMSD of the nonhydrogen atoms of each ligand structure with respect to the crystal ligand structure was calculated. Structures having RMSD <2 Å were placed in subclasses of the classes. Therefore, in this study, subclasses are defined as the subset of a class that contains the ligand conformations that are structurally closest to the crystallographic ones. That is, ligands placed in the subclass of the first class of a job have the best energies and the best correspondence with the crystal structure. Note that a subclass of any class may be empty if no structure with low RMSD with respect to the crystal structure is found.
In real blind docking jobs (in which the crystallographic position of the ligand on the target is unknown, and no further experimental information is available), one can only assume that the energy minimum represents the best model for the real ligand binding site and binding mode. In that case, one can cluster the results directly by calculating RMSD with the energy minimum. Here we seek to distinguish trials that find the binding site (class) and trials that find both the binding site (class) and the binding mode (subclass).
The VMD program (Humphrey et al. 1996) was used for graphical interpretation and representation of results. Image rendering was performed using Raster 3D (Merritt and Bacon 1997).
| Acknowledgments |
|---|
The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.
| References |
|---|
|
|
|---|
Brady, Jr., G.P. and Stouten, P.F.W. 2000. Fast prediction and visualization of protein binding pockets. J. Comput. Aided Mol. Des. 14: 383401.[CrossRef][Medline]
Budin, N., Majeux, N., and Caflisch, A. 2001. Fragment-based flexible ligand docking by evolutionary optimization. Biol. Chem. 382: 13651372.[CrossRef][Medline]
Cornell, W.D., Cieplak, P., Bayly C.I., Gould, I.R., Merz, Jr., K.M., Ferguson, D.M., Spellmeyer, D.C., Fox, T., Caldwell, J.W., and Kollman, P.A. 1995. A second generation force field for the simulation of nucleic acids, proteins, and organic molecules. J. Am. Chem. Soc. 117: 51795197.[CrossRef]
Friedman, A.R., Roberts, V.A., and Tainer, J.A. 1994. Predicting molecular interactions and inducible complementarity: Fragment docking of Fab-peptide complexes. Proteins 20: 1524.[CrossRef][Medline]
Gamble, T.R., Vajdos, F.F., Yoo, S., Worthylake, D.K., Houseweart, M., Sunquist, W.I., and Hill, C.P. 1996. Crystal structure of human cyclophilin A bound to the amino-terminal domain of HIV-1 capsid. Cell 87: 12851294.[CrossRef][Medline]
Harel, M., Su, C.-T., Frolow, F., Silman, I., and Sussman J.L. 1991.
-Chymotrypsin is a complex of
-chymotrypsin with its own autolysis products. Biochemistry 30: 52175225.[CrossRef][Medline]
Hendlich, M., Rippmann, F., and Barnickel, G. 1997. LIGSITE: Automatic and efficient detection of potential small molecule-binding sites in proteins. J. Mol. Graph. Model. 15: 359363.[CrossRef][Medline]
Hetényi, C., Körtvélyesi, T., and Penke, B. 2002. Mapping of the possible binding sequences of two ß-sheet breaker peptides on beta amyloid peptide of Alzheimer's disease. Bioorg. Med. Chem. 10: 15871593.[CrossRef][Medline]
Holland, D.R., Hausrath, A.C., Juers, D., and Matthews, B.W. 1995. Structural analysis of zinc substitutions in the active site of thermolysin. Protein Sci. 4: 19551965.[Abstract]
Humphrey, W., Dalke, A., and Schulten, K. 1996. VMD: Visual molecular dynamics. J. Mol. Graphics 14: 3338.[CrossRef][Medline]
James, M.N.G., Sielecki, A.R., Brayer, C.D., Delbaere, L.T.J., and Bauer, C.-A. 1980. Structures of product and inhibitor complexes of Streptomyces griseus protease A at 1.8 Å resolution. J. Mol. Biol. 144: 4388.[CrossRef][Medline]
Klepeis, J.L., Ierapetritou, M.G., and Floudas, C.A. 1998. Protein folding and peptide docking: A molecular modeling and global optimization approach. Comp. Chem. Eng. 22: S3S10.[CrossRef]
Lindahl, E., Hess, B., and van der Spoel, D. 2001. GROMACS 3.0: A package for molecular simulation and trajectory analysis. J. Mol. Model. 7: 306317.
Marquart, M., Walter, J., Deisenhofer, J., Bode, W., and Huber, R. 1983. The geometry of the reactive site and of the peptide groups in trypsin, trypsinogen and its complexes with inhibitors. Acta Cryst. B 39: 480490.[CrossRef]
Mehler, E.L. and Solmayer, T. 1991. Electrostatic effects in proteins: Comparison of dielectric and charge models. Protein Eng. 4: 903910.
Menziani, M.C., De Rienzo, F., Cappelli, A., Anzini, M., and De Benedetti, P.G. 2001. A computational model of the 5-HT3 receptor extracellular domain: search for ligand binding sites. Theor. Chem. Acc. 106: 98104.[CrossRef]
Merritt, E.A. and Bacon, D.J. 1997. Raster3D: Photorealistic molecular graphics. Methods Enzymol. 277: 505524.[Medline]
Minke, W.E., Diller, D.J., Hol, W.G.J., and Verlinde, C.L.M.J. 1999. The role of waters in docking strategies with incremental flexibility for carbohydrate derivatives: Heat-labile enterotoxin, a multivalent test-case. J. Med. Chem. 42: 17781788.[CrossRef][Medline]
Morelli, X.J., Palma, P.N., Guerlesquin, F., and Rigby, A. 2001. A novel approach for assessing macromolecular complexes combining soft-docking calculations with NMR-data. Protein Sci. 10: 21312137.
Morris, G.M., Goodsell, D.S., Huey, R., and Olson, A.J. 1996. Distributed automated docking of flexible ligands to proteins: Parallel applications of AutoDock 2.4. J. Comp. Aided. Mol. Des. 10: 293304.
Morris, G.M., Goodsell, D.S., Halliday, R.S., Huey, R., Hart, W.E., Belew, R.K., and Olson, A.J. 1998. Automated docking using a Lamarckian genetic algorithm and an empirical binding free energy function. J. Comp. Chem. 19: 16391662.[CrossRef]
Neurath, A.R., Jiang, S., Strick, N., Lin, K., Li, Y.-Y., and Debnath, A.K. 1996. Bovine ß-lactoglobulin modified by 3-hydroxyphtalic anhydride blocks the CD4 cell receptor for HIV. Nature Med. 2: 230234.[CrossRef][Medline]
Österberg, F., Morris, G.M., Sanner, M.F., Olson, A.J., and Goodsell, D.S. 2002. Automated docking to multiple target structures: Incorporation of protein mobility and structural water heterogeneity in AutoDock. Proteins 46: 3440.[CrossRef][Medline]
Pang, Y.-P., Perola, E., Xu, K., and Prendergast, F.G. 2001. EUDOCK: A computer program for identification of drug interaction sites in macromolecules and drug leads from chemical databases. J. Comp. Chem. 22: 17501771.[CrossRef]
Pappu, R.V., Hart, R.K., and Ponder, J.W. 1998. Analysis and application of potential energy smoothing and search methods for global optimization. J. Phys. Chem. B 102: 97259742.[CrossRef]
Rees, D.C. and Lipscomb, W.N. 1983. Crystallographic studies on apocarboxypeptidase A and the complex with glycyl-L-tyrosine. Proc. Natl. Acad. Sci. 80: 71517154.
Shoichet, B.K. and Kuntz, I.D. 1993. Matching chemistry and shape in molecular docking. Protein Eng. 6: 723732.
Sotriffer, C.A., Flader, W., Cooper, A., Rode, B.M., Linthicum, D.S., Liedl, K.R., and Varga, J.M. 1999. Ligand binding by antibody IgE Lb4: Assessment of binding site preferences using microcalorimetry, docking and free energy simulations. Biophys. J. 76: 29662977.
Sotriffer, C.A., Ni, H., and McCammon, J. A. 2000. Active site binding modes of HIV-1 integrase inhibitors. J. Med. Chem. 43: 41094117.[CrossRef][Medline]
Stigler, R.-D., Hoffmann, B., Abagyan, R., and Schneider-Mergener, J. 1999. Soft docking an L and a D peptide to an anticholera toxin antibody using internal coordinate mechanics. Structure 7: 663670.[Medline]
Verdonk, M.L., Cole, J.C., Watson, P., Gillet, V., and Willett, P. 2001. SuperStar: Improved knowledge-based interaction fields for protein binding sites. J. Mol. Biol. 307: 841859.[CrossRef][Medline]
![]()
CiteULike
Connotea
Del.icio.us
Digg
Reddit
Technorati What's this?
This article has been cited by other articles:
![]() |
G. A. Biagini, N. Fisher, N. Berry, P. A. Stocks, B. Meunier, D. P. Williams, R. Bonar-Law, P. G. Bray, A. Owen, P. M. O'Neill, et al. Acridinediones: Selective and Potent Inhibitors of the Malaria Parasite Mitochondrial bc1 Complex Mol. Pharmacol., May 1, 2008; 73(5): 1347 - 1355. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. Shao, V. A. Novakovic, J. F. Head, B. A. Seaton, and G. E. Gilbert Crystal Structure of Lactadherin C2 Domain at 1.7A Resolution with Mutational and Computational Analyses of Its Membrane-binding Motif J. Biol. Chem., March 14, 2008; 283(11): 7230 - 7241. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. H. Streiff, T. W. Allen, E. Atanasova, N. Juranic, S. Macura, A. R. Penheiter, and K. A. Jones Prediction of Volatile Anesthetic Binding Sites in Proteins Biophys. J., November 1, 2006; 91(9): 3405 - 3414. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Behr, C. Hoffmann, G. Ottolina, and K.-N. Klotz Novel Mutants of the Human beta1-Adrenergic Receptor Reveal Amino Acids Relevant for Receptor Activation J. Biol. Chem., June 30, 2006; 281(26): 18120 - 18125. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. J. Yonkunas, Y. Xu, and P. Tang Anesthetic Interaction with Ketosteroid Isomerase: Insights from Molecular Dynamics Simulations Biophys. J., October 1, 2005; 89(4): 2350 - 2356. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Li, X. Geng, M. Yonkunas, A. Su, E. Densmore, P. Tang, and P. Drain Ligand-dependent Linkage of the ATP Site to Inhibition Gate Closure in the KATP Channel J. Gen. Physiol., August 29, 2005; 126(3): 285 - 299. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Kovacs, J. Toth, C. Hetenyi, A. Malnasi-Csizmadia, and J. R. Sellers Mechanism of Blebbistatin Inhibition of Myosin II J. Biol. Chem., August 20, 2004; 279(34): 35557 - 35563. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |