|
|
||||||||
Laboratoire dEnzymologie et Biochimie Structurales, CNRS, 91198-Gif-sur-Yvette, France
Reprint requests to: Joël Janin, Laboratoire dEnzymologie et Biochimie Structurales, CNRS, 91198-Gif-sur-Yvette, France; e-mail: janin{at}lebs.cnrs-gif.fr; fax +33.1.69 82 31 29.
| Abstract |
|---|
|
|
|---|
Keywords: proteinprotein interaction; protein docking; structure prediction; conformation change
Article and publication are at http://www.proteinscience.org/cgi/doi/10.1110/ps.041081905.
| Introduction |
|---|
|
|
|---|
Macromolecular interaction is a central theme of functional genomics, subject to large-scale genetic and biochemical studies in model organisms. Most gene products, whether protein or RNA, perform their functions in cells by interacting with other gene products. In yeast, the set of macromolecular interactionsthe interactomeis at least an order of magnitude larger than the set of gene products. In humans, there may be hundreds of thousands of physical interactions, either pair wise contacts or multicomponent assemblies of polypeptide chains, that have physiological and possibly medical relevance. These assemblies are poorly represented in the Protein Data Bank (PDB), and they are likely to remain so in coming years. Structural genomics programs are designed to determine X-ray/NMR structures on a genome scale for individual gene products, not for multicomponent assemblies. Specific research programs combining crystallography, NMR, and cryoelectron microscopy, are being set up to do that, but the sampling of the interactome is likely to remain sparse in coming years while the space of individual protein structures are being progressively filled. Thus, there is a good case for testing in silico methods that generate structural models of the assemblies by docking their components. In many cases, we cannot access their atomic structures by experiment. In all cases, reliable models will greatly help in designing experiments, but we need objective estimates of the model quality and of the performance of docking methods.
| The targets of CAPRI |
|---|
|
|
|---|
The ideal CAPRI target is the unpublished X-ray structure of a complex between two proteins of known structure, submitted for "unbound" docking prediction. Due to the paucity of those, "bound/unbound" complexes between a protein of known structure and a novel protein are also acceptable. In less than 4 years, 17 complexes have been submitted as targets, two to four at a time, in five rounds of predictions (Table 1
). Five targets were "unbound," the others, "bound/unbound." On day zero of each round, component atomic coordinates are communicated to registered predictor groups who have a few weeks to submit models of the complexes to the http://capri.ebi.ac.uk Web site managed by K. Henrick at the European Bioinformatics Institute (Hinxton, UK). After that deadline, the CAPRI assessors (S. Wodak and R. Mendez, Free University of Brussels, Belgium) may start evaluating the models against the target X-ray structures.
|
| Assessing docking predictions |
|---|
|
|
|---|
atoms of L in the model and target, and by the rotation angle
L and translation dL needed to further superimpose L. However, Lrms,
L and dL concern the whole ligand, often a large protein in CAPRI targets, and they may not represent the quality of the fit where it matters, that is, in the contact regions. Thus, another useful parameter in assessing the geometry of the model is the interface RMS distance Irms, calculated on C
s of the epitopes only. The biological quality of the models was judged on the prediction of, first, the R and L epitopes, then, of the pairwise contacts between R and L residues. If the R epitope comprises NR residues in the target, nR of which make ligand contacts in the model, the ratio fR = nR/NR measures how well the model predicts the R epitope; its equivalent fL does the same for the L epitope. A good model should also say which residues of R are in contact with which residues of L. This is measured by the fraction of native contacts fnc = nc/Nc, where Nc is the number of residue pairs in contact in the target, and nc the number of those native contacts that are present in the model. Because nc can be artificially increased by pushing the ligand into the receptor, we also reject all models that have too many nonnative contacts: more than the average number in other models plus two standard deviations.
The parameters Irms, Lrms, and fnc were combined to rank models. In models of the "high-quality" and "good" categories, a majority of epitope residues and at least 30% of the contact pairs are correctly predicted (fnc > 0.3), and the L epitope is very close to its position in the X-ray structure (Irms < 2 Å). Such models reproduce the overall geometry and many biologically significant features of the interaction, but not necessarily its atomic details. Models with 10%30% of the native contact pairs and Irms between 2 Å and 4 Å, are placed in the "acceptable" category. Although their geometry is poor, they should still be useful for site-directed mutagenesis and other experiments, because a large part of the epitopes must be correctly identified to yield fnc > 0.1.
| Success and failure on CAPRI targets |
|---|
|
|
|---|
|
|
Information from the literature could be used to guide docking in most cases. It was decisive in the case of T07, a complex between a bacterial superantigen and the T-cell receptor (Sundberg et al. 2002): a simple sequence homology search could locate a similar structure already in the PDB. In contrast, the fair amount of biochemical data that was available on HPr and the kinase in T01, or on dockerin and cohesin in T11, proved to be insufficient in the presence of conformation changes. Biochemical information frequently concerns the binding regions or residues, seldomly their pair-wise contacts or the geometry of the binding. Most docking procedures incorporate such information, either to limit the rigid-body search or to filter candidate solutions issued from the search. Experience shows that it is very useful, but not foolproof. Seven CAPRI targets were complexes of "unbound" protein antigens with "bound" antibody fragments (Table 2
). Antibodies bind antigen through hypervariable loops of their VL and VH domains, which puts constraints on docking solutions. Five targets of this type have had good predictions: T02 and T03, two Fab complexes with large viral proteins, the T06 complex of
-amylase with the VHH domain of a camel single-chain antibody domain, T13, and T19. Predictors submitted models of these targets in which the epitope recognized by the antibody was correctly identified, with no help from the literature in several cases, and the geometry of binding was essentially right. In contrast, they failed entirely on two other complexes with
-amylase (T04T05) in which the VHH domain makes a lateral contact that implicates frame-work residues, and only one hypervariable loop (Fig. 3
). Conformation changes are not to be blamed here, for the antibody moiety was "bound" and the antigen essentially unchanged. "A posteriori" analysis indicates that the mode of binding seen in the X-ray structure either was outside the limits set to the search, or it had been rejected from the docking solutions as incompatible with established rules of antigenantibody recognition.
|
|
| Progress in predictions |
|---|
|
|
|---|
| Structural biologists, please help with targets! |
|---|
|
|
|---|
| Acknowledgments |
|---|
| References |
|---|
|
|
|---|
Ben-Zeev, E., Berchanski, A., Heifetz, A., Shapira, B., and Eisenstein, M. 2003. Prediction of the unknown: Inspiring experience with the CAPRI experiment. Proteins 52: 416.[CrossRef][Medline]
Bressanelli, S., Stiasny, K., Allison, S.L., Stura, E.A., Duquerroy, S., Lescar, J., Heinz, F.X., and Rey, F.A. 2004. Structure of a flavivirus envelope glycoprotein in its low pH-induced membrane fusion conformation. EMBO J. 23: 728738.[CrossRef][Medline]
Carvalho, A.L., Dias, F.M.V., Prates, J.A.M., Nagy, T., Gilbert, H.J., Davies, G.J., Ferreira, L.M.A., Romao, M.J., and Fontes, C.M.G.A. 2004. Cellulosome assembly revealed by the crystal structure of the cohesindockerin complex. Proc. Natl. Acad. Sci. 100: 1380913014.
Chen, R., Mintseris, J., Janin, J., and Weng, Z. 2003. A proteinprotein docking benchmark. Proteins 52: 8891.[CrossRef][Medline]
Desmyter, A., Spinelli, S., Payan, F., Lauwereys, M., Wyns, L., Muyldermans, S., and Cambillau, C. 2002. Three camelid VHH domains in complex with porcine pancreatic
-amylase. J. Biol. Chem. 277: 2364523650.
Eghiaian, F., Grosclaude, J., Lesceu, S., Debey, P., Doublet, B., Treguer, E., Rezaei, H., and Knossow, M. 2004. Insight into the PrPC
PrPSc conversion from the structures of antibody-bound ovine prion scrapie-susceptibility variants. Proc. Natl. Acad. Sci. 101: 1025410259.
Fieulaine, S., Morera, S., Poncet, S., Mijakovic, I., Galinier, A., Janin, J., Deutscher, J., and Nessler, S. 2002. X-ray structure of a bifunctional protein kinase in complex with its protein substrate HPr. Proc. Natl. Acad. Sci. 99: 1343713441
Graille, M., Mora, L., Buckingham, R.H., van Tilbeurgh, H., and de Zamaroczy, M. 2004. Structural inhibition of the colicin D tRNase by the tRNA-mimicking immunity protein. EMBO J. 23: 14741482.[CrossRef][Medline]
Halperin, I., Ma, B., Wolfson, H., and Nussinov, R. 2002. Principles of docking: An overview of search algorithms and a guide to scoring functions. Proteins 47: 409443.[CrossRef][Medline]
Janin, J., Henrick, K., Moult, J., Ten Eyck, L., Sternberg, M., Vajda, S., Wakser, I., and Wodak, S.J. 2003. CAPRI: A critical assessment of predicted interactions. Proteins 52: 29.[CrossRef][Medline]
Mendez, R., Leplae, R., De Maria, L., and Wodak, S.J. 2003. Assessment of blind predictions of proteinprotein interactions: Current status of docking methods. Proteins 52: 5167.[CrossRef][Medline]
Sansen, S., De Ranter, C.J., Gebruers, K., Brijs, K., Courtin, C.M., Delcour, J.A., and Rabijns, A. 2004. Structural basis for inhibition of Aspergillus niger xylanase by triticum aestivum xylanase inhibitor-I. J. Biol. Chem. 279: 3602236028.
Schneidman-Duhovny, D., Inbar, Y., Polak, V., Shatsky, M., Halperin, I., Benyamini, H., Barzilai, A., Dror, O., Haspel, N., Nussinov, R., et al. 2003. Taking geometry to its edge: Fast unbound rigid (and hinge-bent) docking. Proteins 52: 107112[CrossRef][Medline]
Smith, G.R. and Sternberg, M.J. 2002. Prediction of proteinprotein interactions by docking methods. Curr. Opin. Struct. Biol. 12: 2835.[CrossRef][Medline]
Sunberg, E.J., Li, H., Llera, A.S., McCormick, J.K., Tormo, J., Schlievert, P.M., Karjalainen, K., and Mariuzza, R.A. 2002. Structures of two streptococcal superantigens bound to TCR
chains reveals diversity in the architecture of T cell signalling complexes. Structure 10: 687699.[Medline]
Takagi, J., Yang, Y., Liu, J.H., Wang, J.H., and Springer, T.A. 2003. Complex between nidogen and laminin fragments reveals a paradigmatic
-propeller interface. Nature 424: 969973.[CrossRef][Medline]
Terrak, M., Kerff, F., Langsetmo, K., Tao, T., and Dominguez, R. 2004. Structural basis of protein phosphatase 1 regulation. Nature 429: 780784.[CrossRef][Medline]
Thouvenin, E., Schoehn, G., Rey, F., Petitpas, I., Mathieu, M., Vaney, M.C., Cohen, J., Kohli, E., Pothier, P., and Hewat, E. 2001. Antibody inhibition of the transcriptase activity of rotavirus DLP: A structural view. J. Mol. Biol. 307: 161172.[CrossRef][Medline]
Valencia, A. and Pazos, F. 2002. Computational methods for the prediction of protein interactions. Curr. Opin. Struct. Biol. 12: 368373.[CrossRef][Medline]
Venclovas, C., Zemla, A., Fidelis, K., and Moult, J. 2003. Assessment of progress over the CASP experiments. Proteins 53: 585589.
Wodak, S. and Janin, J. 2002. The structural basis of macromolecular recognition. Adv. Protein Chem. 61: 968.[Medline]
Wodak, S.J. and Mendez, R. 2004. Predictions of proteinprotein interactions: The CAPRI experiment, its evaluation and implications. Curr. Opin. Struct. Biol. 14: 242249.[CrossRef][Medline]
![]()
CiteULike
Connotea
Del.icio.us
Digg
Reddit
Technorati What's this?
This article has been cited by other articles:
![]() |
V. I. Lesk and M. J. E. Sternberg 3D-Garden: a system for modelling protein-protein complexes based on conformational refinement of ensembles generated with the marching cubes algorithm Bioinformatics, May 1, 2008; 24(9): 1137 - 1144. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Pulim, J. Bienkowska, and B. Berger LTHREADER: Prediction of extracellular ligand-receptor interactions in cytokines using localized threading Protein Sci., February 1, 2008; 17(2): 279 - 292. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. S. Negi, C. H. Schein, N. Oezguen, T. D. Power, and W. Braun InterProSurf: a web server for predicting interacting sites on protein surfaces Bioinformatics, December 15, 2007; 23(24): 3397 - 3399. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Lorenzen and Y. Zhang Monte Carlo refinement of rigid-body protein docking structures with backbone displacement and side-chain optimization Protein Sci., December 1, 2007; 16(12): 2716 - 2725. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Rossi, M. Isorce, S. Morin, J. Flocard, K. Arumugam, S. Crouzy, M. Vivaudou, and S. Redon Adaptive torsion-angle quasi-statics: a general simulation method with applications to protein structure analysis and design Bioinformatics, July 1, 2007; 23(13): i408 - i417. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. Schueler-Furman, C. Wang, P. Bradley, K. Misura, and D. Baker Progress in Modeling of Protein Structures and Interactions Science, October 28, 2005; 310(5748): 638 - 642. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |