|
|
||||||||
1 Department of Biochemistry, National University of Singapore, Singapore 119260
2 Research Institute for Biotechnology, Macquarie University, NSW 2109, Australia
Reprint requests to: Shoba Ranganathan, MU Biotechnology Research Institute, Building F7B-233, Macquarie University, NSW 2109, Australia; e-mail: shoba.ranganathan{at}mq.edu.au; fax: +61-2-98508313.
(RECEIVED January 12, 2004; FINAL REVISION April 16, 2004; ACCEPTED May 27, 2004)
| Abstract |
|---|
|
|
|---|
RMSD < 1.00 Å. Keywords: major histocompatibility complex; epitope prediction; flexible docking; immunology; Monte Carlo
Article and publication are at http://www.proteinscience.org/cgi/doi/10.1110/ps.04631204.
| Introduction |
|---|
|
|
|---|
MHC class I molecules are heterodimers, consisting of a heavy
-chain of about 45 kDa, and a light chain, 32-microglobulin (32M) of about 12 kDa (Klein 1986). Class I ligands are derived from endogenously expressed proteins that are degraded by cytosolic proteinases with typical length of 812 amino acids. The proteolytic fragments are transported into the endoplasmic reticulum in an ATP-dependent fashion by the transporter associated with antigen processing (TAP), where they bind to vacant receptor sites on newly synthesized MHC class I molecules. The MHCpeptide complex is subsequently transported to the cell surface and presented for recognition by the T-cell receptors (TCRs) of CD8+ cytotoxic T-cells (CTL).
Class II peptides are heterodimeric glycoproteins consisting of an
-chain (34 kDa) and
-chain (29 kDa) with very similar overall quaternary structure to that of class I molecules (Brown et al. 1993; Stern et al. 1994; Stern and Wiley 1994). Class II ligands are of variable length, usually of 925 amino acids and are derived mainly from exogenous or transmembrane proteins, as well as from cytosolic proteins that are degraded by various proteinases originating from the lysosomal compartment. After displacing the endogenous MHC class II ligand known as the CLIP peptide in the late endosomal/lysosomal compartment, this MHC-peptide complex is also transported to the cell surface and presented for recognition by the TCRs of CD4+ T cells.
In the design of molecular vaccines for the treatment of diseases, identification of T-cell epitopes from immunologically relevant antigens is an important prerequisite. The first step in T-cell-mediated immune response is the binding of antigenic peptides to MHC receptors, which serves as a necessary although not sufficient condition for epitope recognition (Flower 2003). The second step is the recognition and binding of T-cell receptors, which then initiate the immune response cascade. The experimental identification of T-cell epitopes is a time-consuming and expensive process due to the large number and diverse nature of MHC alleles and candidate peptides. Current computational techniques focus on the identification of potential MHC-binding candidate peptides, and can be broadly classified into two categories: (1) sequence-based approaches such as sequence motifs (Falk et al. 1991), matrix models (Parker et al. 1994; Davenport et al. 1995; Gulukota et al. 1997; Godkin et al. 1998; Rammensee et al. 1999), Artificial Neural Network (Brusic et al. 1998), Hidden Markov Model (Lim et al. 1996; Mamitsuka 1998; Brusic et al. 2002), and Support Vector Machine (Dönnes and Elofsson 2002; Bhasin and Raghava 2004) for large-scale screening of potential T-cell epitopes from protein sequence databanks; and (2) structure-based approaches such as homology modeling (Lim et al. 1996; Michielin et al. 2000), protein threading (Altuvia et al. 1995), and docking techniques (Caflisch et al. 1992; Rosenfeld et al. 1993, 1995; Sezerman et al. 1996; Rognan et al. 1999; Desmet et al. 2000; Michielin and Karplus 2002), which utilize three-dimensional data for the detailed structural analysis of interactions between the MHC and the bound short antigenic peptides. The former are more suitable for large-scale screening of potential T-cell epitopes, while the latter are better suited for detailed analysis of short immunogenic regions of antigens. Although sequence-based techniques are well established, a severe limitation of such approaches is the heavy reliance on the availability of large comprehensive training sets of peptides. This approach is not suitable for accurate prediction of situations where insufficient experimental data are available. As such, the coverage of sequence-based techniques is limited to subsets of binding peptides that belong to the most numerous groups and cannot generate reliable data for peptides that are least represented in the dataset. To date, developments of structure-based techniques are poorly developed and lagging far behind sequence-based procedures due to the relatively higher complexity in their development and excessive computational costs. Despite their slow progress, structure-based techniques are highly promising (Altuvia et al. 1995) and play a significant role as a predictive tool in detailed selection of peptides for binding studies, planning of experiments, and better understanding of biological processes involved in the stimulation of T-cell-mediated immune response. A preliminary analysis of the structural descriptors defining the MHCpeptide interactions has been carried out by our group (Govindarajan et al. 2003). The ability of structure-based approaches to reliably predict MHC binding peptides and thereby potential T-cell epitopes clearly has major implications for clinical immunology, particularly in the area of vaccine design. However, prior to predicting binding peptides based on structural approaches, it is crucial to improve the speed and accuracy of docking a known antigenic peptide to its cognate receptor. Once this problem has been successfully addressed, the methodology will be applied to predict antigenic regions from a protein sequence and understand which peptides are promiscuous and which ones are allele-specific.
In this article, we present a novel computational protocol for rapid and precise modeling of the bound conformations of flexible peptides to MHC receptors and apply it primarily to the remodeling of 40 (29 class I and 11 class II) nonredundant MHCpeptide complexes for which crystal structures are available in the Protein Data Bank (PDB). Our docking protocol consists of three steps: (1) peptide residues near the ends of the binding groove are docked by using an efficient pseudo-Brownian rigid body docking procedure followed by (2) loop closure of the remaining backbone structure by satisfaction of spatial constraints, and subsequently, (3) refinement of the entire backbone and the interacting side chains of the ligand and the receptor experiencing atomic clash at the MHCpeptide interface. In our preliminary experiments, the proposed docking procedure generated as many as 33 out of 40 peptides with C
RMSD of less than 1.00 Å from the crystal structures. To the best of our knowledge, these results represent up to fivefold increase in accuracy compared to available flexible docking techniques in the remodeling of MHCpeptide complexes, establishing the efficacy of out procedure to model highly accurate MHCpeptide complex structures and permitting the conformational sampling of the peptide in the binding groove. As a second test, we have attempted docking new peptides onto a single template structure, to verify whether our method can accurately model the bound conformations of novel peptides bound to a specific MHC allele. With 15 peptides derived from Class I and Class II complexes and three structural templates, the C
conformation of the docked peptides was within an RMSD of 1.00 Å in 11 structures.
| Results |
|---|
|
|
|---|
RMSD values of the N and C termini reveal their relatively fixed locations within the groove across both classes of MHCpeptide complexes. Leveraging on this observation, a three-step docking procedure (detailed in Materials and Methods) as shown in Figure 1
|
|
out of 40 nonredundant complexes (extracted from a larger dataset listed in Table 1
RMSD of 1.00 Å. The RMSD for the near-native solution ranges from 0.09 Å (complex 1G7Q
[PDB]
) to 1.53 Å (complex 1JF1
[PDB]
). Table 2
|
|
Docking MHC-binding peptides onto a single template
We next applied our technique to the modeling of 15 nonredundant peptides (13 class I and 2 class II) for which crystal structures are available into a single template. This stage of the testing is critical to determine the capability of our procedure to model unknown peptides onto available templates. Due to the deficiency of available class II crystal structures, only two class II peptides are tested in this stage. Our procedure constantly found a solution with RMSD below 1.48 Å. Table 3
shows the results obtained from this experiment.
|
RMSD of the modeled peptides.
|
| Discussion |
|---|
|
|
|---|
RMSD below 1.00 Å for 33 peptides by remodeling peptide-bound MHC structures. The worst structure was generated from the remodeling of the bound peptide ELAGIG ILTV from complex 1JF1
[PDB]
with C
RMSD of 1.53 Å. The loop formed around residues 5 to 7 was erroneously predicted, and this disorientation is a direct consequence of missing water molecules positioned around the loop in the template, which resulted in incorrect positioning of interacting residues. In the absence of explicit water molecules, the predicted conformation of our peptide is energetically more favorable than the crystal conformation. Nonetheless, our procedure can correctly predict the conformation of residues that extends into the binding cleft and identify essential contacts with the MHC receptor as shown in Figure 3
|
| Materials and methods |
|---|
|
|
|---|
The peptide-docking procedure
Our peptide-docking procedure exploits the highly conserved backbone conformation of peptide termini, originally noted for HLA-Aw68 (Guo et al. 1992), and confirmed by the analysis of all available high-quality crystal structures in the Protein Data Bank (PDB) for both MHC class I and class II complexes (Table 1
). Structural comparison of all available MHC class I crystal structures to date reveals highly conserved backbone of peptide termini residues with C
RMSD of 0.020.29 Å and 0.000.25 Å for the peptide N- and C-terminal ends, respectively. A similar highly conserved backbone conformation is observed at the ends of the core peptide fragments in the binding cleft of MHC class II alleles with the C
RMSD in the range 0.010.22 Å and 0.020.27 Å for the two peptide termini, respectively. The structure comparison results are detailed in Table 1
.
Our docking procedure (Fig. 1
) for MHC class I peptides and the core residues of MHC class II peptides is performed in three steps: (1) rigid docking of residues at the ends of the binding groove; (2) loop closure of central residues by satisfaction of spatial constraints; (3) followed by iterative ab initio refinements of backbone and ligand interacting side-chain dihedral angles to eliminate or minimize atomic clash regions at the MHC receptorpeptide interface, using a Monte Carlo procedure. The first two steps were used to generate an initial model, which is further refined in the last step to produce the final product.
Rigid docking of residues at the ends of binding groove
The backbone of peptide termini as shown in Table 1
is highly conserved over peptides in different MHC alleles (class I and class II), but due to the allelic variability in the binding grooves, there are subtle differences in the position of these peptide termini. This rigid docking step is adopted to ensure a best fit of the peptide termini before the loop closure procedure in the next step. A fast soft interaction energy function (Fernández-Recio et al. 2002) is utilized to sample different positions and orientations of peptide fragments at the ends of the binding grooves using an Internal Coordinate Mechanics (ICM) global optimization algorithm, with flexible ligand interface side chains and a grid map representation of the receptor energy localized to small cubic regions of side 1.00 Å from the backbone of the ligand.
The ligand side-chain torsions within the grid map were changed in each random step using a Biased Monte Carlo procedure, which begins by pseudorandomly selecting a set of torsion angles in the ligand and subsequently finding the local energy minimum about those angles. New conformations are adopted upon satisfaction of the Metropolis criteria with probability min(1,exp[
G/RT]), where R is the universal gas constant and T is the absolute temperature of the simulation. For the current study, T was set to 300 K. Loose restraints were imposed on the positional variables of the ligand molecule to keep it close to the starting conformation. The optimal energy function adopted for our stimulations consisted of the internal energy of the ligand and the intermolecular energy of the optimized potential maps:
![]() |
The internal energy function incorporates internal van der Waals interactions, hydrogen bonding, and torsion energy (calculated using an extension of the Empirical Conformational Energy Program for Peptides 3 (ECEPP/3; Nemethy et al. 1992; force field parameters), as well as a electrostatic energy with a distance-dependent dielectric constant (e = 4r; Fernández-Recio et al. 2002). The final energy includes the configurational entropy of side chains and the surface-based solvation energy to select the best-refined solutions.
Loop closure of center residues
An initial conformation of the central loop is generated using the program MODELLER (Sali and Blundell 1993) by satisfaction of spatial constraints based on the allowed subspace for backbone dihedrals in accordance with the conformations of peptides docked into the ends of the binding groove. A comprehensive coverage of modeling by satisfaction of spatial restraints is given in the literature (Sali and Blundell 1993). In brief, this is performed in three steps: (1) distance and dihedral angle restraints on the entire peptide sequence are derived from its alignment with the sequences of probes docked into the binding groove; (2) the restraints on spatial features of the unknown center residues are derived by extrapolation from the known 3D structures of probes in the alignment, expressed as probability density functions; and (3) spatial restraints on the unknown center residues are satisfied by optimization of the molecular probability density function using a variable target function technique that applies the conjugate gradients algorithm to positions of all nonhydrogen atoms.
Model refinements
To improve the accuracy of the initial model, partial refinement was performed for the ligand backbone and side chains as well as detected atomic clash regions at the receptor-ligand interface, using the ICM Biased Monte Carlo procedure (Abagyan and Totrov 1999). The initial stages of the refinement attempt to overcome the penalty derived from the preliminary rigid docking of terminal residues by introducing partial flexibility to the ligand backbone. Restraints are imposed upon the positional variables of the C
atoms of probes to keep it close to the starting conformation. The energy function adopted for this refinement step is:
![]() |
Refinements of ligand and receptor side-chain torsions in the vicinity of 4.00 Å from the receptor were performed upon the final backbone structure.
| Acknowledgments |
|---|
| References |
|---|
|
|
|---|
Altuvia, Y., Schueler, O., and Margalit, H. 1995. Ranking potential binding peptides to MHC molecules by a computational threading approach. J. Mol. Biol. 249: 244250.[CrossRef][Medline]
Bhasin, M. and Raghava, G.P.S. 2004. Analysis and prediction of affinity of TAP binding peptides using cascade SVM. Protein Sci. 13: 596607.
Brown, J.H., Jardetzky, T.S., Gorga, J.C., Stern, L.J., Urban, R.G., Strominger, J.L., and Wiley, D.C. 1993. Three-dimensional structure of the human class II histocompatibility antigen HLA-DR1. Nature 364: 3339.[CrossRef][Medline]
Brusic, V., Rudy, G., Honeyman, M., Hammer, J., and Harrison, L. 1998. Prediction of MHC class II-binding peptides using an evolutionary algorithm and artificial neural network. 1998. Bioinformatics 14: 121130.
Brusic, V., Petrovsky, N., Zhang, G., and Bajic, V.B. 2002. Prediction of promiscuous peptides that bind HLA class I molecules. Immunol. Cell Biol. 80: 280285.[CrossRef][Medline]
Caflisch, A., Niederer, P., and Anliker, M. 1992. Monte Carlo docking of oligopeptides to proteins. Proteins 13: 223230.[CrossRef][Medline]
Davenport, M.P., Ho Shon, I., and Hill, A.V. 1995. An empirical method for the prediction of T-cell epitopes. Immunogenetics 42: 392397.[Medline]
Desmet, J., Maeyer, M.D., Spriet, J., and Lasters, I. 2000. Flexible docking of peptide ligands to proteins. Methods Mol. Biol. 143: 359376.[Medline]
Dönnes, P. and Elofsson, A. 2002. Prediction of MHC class I binding peptides, using SVMHC. BMC Bioinformatics 3: 18.[CrossRef][Medline]
Falk, K., Rötzschke, O., Stevanovic, S., Jung, G., and Rammensee, H.G. 1991. Allele-specific motifs revealed by sequencing of self-peptides eluted from MHC molecules. Nature 351: 290296.[CrossRef][Medline]
Fernández-Recio, J., Totrov, M., and Abagyan, R. 2002. Soft proteinprotein docking in internal coordinates. Protein Sci. 11: 280291.
Flower, D.R. 2003. Databases and data mining for computational vaccinology. Curr. Opin. Drug Discov. Devel. 6: 396400.[Medline]
Godkin, A.J., Davenport, M.P., Willis, A., Jewell, D.P., and Hill, A.V. 1998. Use of complete eluted peptide sequence data from HLA-DR and -DQ molecules to predict T cell epitopes and the influence of the nonbinding terminal regions of ligands in epitope selection. J. Immunol. 161: 850858.
Govindarajan, K.R., Kangueane, P., Tan, T.W., and Ranganathan, S. 2003. MPID: MHC-peptide interaction database for sequence-structure-function information on peptides binding to MHC molecules. Bioinformatics 19: 309310.
Gulukota, K., Sidney, J., Sette, A., and DeLisi, C. 1997. Two complementary methods for predicting peptides binding major histocompatibility complex molecules. J. Mol. Biol. 267: 12581267.[CrossRef][Medline]
Guo, H.C., Jardetzky, T.S., Garrett, T.P.J., Lane, W.S., Strominger, J.L., and Wiley, D.C. 1992. Different length peptides bind to HLA-Aw68 similarly at their ends but bulge out in the middle. Nature 360: 364366.[CrossRef][Medline]
Klein, J. 1986. Natural history of the major histocompatibility complex. J. Wiley & Sons, New York.
Lim, S.K., Kim, S., Lee, H.G., Lee, K.Y., Kwon, T.J., and Kim, K. 1996. Selection of peptides that bind to the HLA-A2.1 molecule by molecular modeling. Mol. Immunol. 33: 221230.[CrossRef][Medline]
Mamitsuka, H. 1998. Predicting peptides that bind to MHC molecules using supervised learning of Hidden Markov Models. Proteins 33: 460474.[CrossRef][Medline]
Michielin, O. and Karplus, M. 2002. Binding free energy differences in a TCRpeptideMHC complex induced by a peptide mutation: A stimulation analysis. J. Mol. Biol. 324: 547569.[CrossRef][Medline]
Michielin, O., Luescher, I., and Karplus, M. 2000. Modeling the TCRMHCpeptide complex. J. Mol. Biol. 300: 12051235.[CrossRef][Medline]
Nemethy, G., Gibson, K.D., Palmer, K.A., Yoon, C.N., Paterlini, G., Zagari, A., Rumsey, S., and Scheraga, H.A. 1992. Energy parameters in polypeptides, 10: Improved geometric parameters and nonbonded interactions for use in the ECEPP/3 algorithm, with application to praline-containing peptides. J. Phys. Chem. 96: 64726484.[CrossRef]
Parker, K.C., Bednarek, M.A., and Coligan, J.E. 1994. Scheme for ranking potential HLA-A2 binding peptides based on independent binding of individual peptide side-chains. J. Immunol. 152: 163175.[Abstract]
Rammensee, H., Bachmann, J., Emmerich, N.P., Bachor, O.A., and Stevanovi
, S. 1999. SYFPEITHI: Database for MHC ligands and peptide motifs. Immunogenetics 50: 213219.[CrossRef][Medline]
Rognan, D., Laumoeller, S.L., Holm, A., Buus, S., and Tschinke, V. 1999. Predicting binding affinities of protein ligands from three-dimensional models: Application to peptide binding to class I major histocompatibility proteins. J. Med. Chem. 42: 46504658.[CrossRef][Medline]
Rosenfeld, R., Zheng, Q., Vajda, S., and DeLisi, C. 1993. Computing the structure of bound peptides: Application to antigen recognition by class I major histocompatibility complex receptors. J. Mol. Biol. 234: 515521.[CrossRef][Medline]
. 1995. Flexible docking of peptides to class I major-histocompatibility-complex receptors. Genet. Anal. 12: 121.[Medline]
Sali, A. and Blundell, T.L. 1993. Comparative protein modelling by satisfaction of spatial restraints. J. Mol. Biol. 234: 779815.[CrossRef][Medline]
Sezerman, U., Vajda, S., and DeLisi, C. 1996. Free energy mapping of class I MHC molecules and structural determination of bound peptides. Protein Sci. 5: 12721281.[Abstract]
Stern, L.J. and Wiley, D.C. 1994. Antigenic peptide binding by class I and class II histocompatibility proteins. Structure 2: 245251.[Medline]
Stern, L.J., Brown, J.H., Jardetzky, T.S., Gorga, J.C., Urban, R.G., Strominger, J.L., and Wiley, D.C. 1994. Crystal structure of the human class II MHC protein HLADR1 complexed with an influenza virus peptide. Nature 368: 215221.[CrossRef][Medline]
![]()
CiteULike
Connotea
Del.icio.us
Digg
Reddit
Technorati What's this?
This article has been cited by other articles:
![]() |
N. Kumar and D. Mohanty MODPROPEP: a program for knowledge-based modeling of protein-peptide complexes Nucleic Acids Res., July 13, 2007; 35(suppl_2): W549 - W555. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. C. Tong, T. W. Tan, and S. Ranganathan Methods and protocols for prediction of immunogenic epitopes Brief Bioinform, March 1, 2007; 8(2): 96 - 108. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. C. Tong, T. W. Tan, and S. Ranganathan In silico grouping of peptide/HLA class I complexes using structural interaction characteristics Bioinformatics, January 15, 2007; 23(2): 177 - 183. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Zhang, C. Shao, D. Zheng, and Y. Gao An Integrated Machine Learning System to Computationally Screen Protein Databases for Protein Binding Peptide Ligands Mol. Cell. Proteomics, July 1, 2006; 5(7): 1224 - 1232. [Abstract] [Full Text] [PDF] |
||||
![]() |
P. Donnes and O. Kohlbacher SVMHC: a server for prediction of MHC-binding peptides. Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W194 - W197. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. C. Tong, G. L. Zhang, T. W. Tan, J. T. August, V. Brusic, and S. Ranganathan Prediction of HLA-DQ3.2{beta} Ligands: evidence of multiple registers in class II binding peptides Bioinformatics, May 15, 2006; 22(10): 1232 - 1238. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Madurga, I. Belda, X. Llora, and E. Giralt Design of enhanced agonists through the use of a new virtual screening method: Application to peptides that bind class I major histocompatibility complex (MHC) molecules Protein Sci., August 1, 2005; 14(8): 2069 - 2079. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |