|
|
||||||||
1 Department of Biochemistry, University of Texas Southwestern Medical Center, Dallas, Texas 753909050, USA
2 Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, Dallas, Texas 753909050, USA
Reprint requests to: Dr. Nick V. Grishin, Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, 5323 Harry Hines Blvd, Dallas, TX 753909050, USA; e-mail: grishin{at}chop.swmed.edu; fax: (214) 6489099.
(RECEIVED July 11, 2001; FINAL REVISION November 9, 2001; ACCEPTED November 25, 2001)
Article and publication are at http://www.proteinscience.org/cgi/doi/10.1110/ps.27302.
| Abstract |
|---|
|
|
|---|
Keywords: Metalloprotease; homology; structural comparison; sequence motif; catalytic mechanism
| Introduction |
|---|
|
|
|---|
Certain gram-positive bacteria develop spores under severe physical or chemical conditions. In spores of Bacillus and Clostridium species, the DNA is protected by a group of small acid-soluble proteins (SASP; Setlow 1988,1995). Spore germination requires degradation of these proteins through the action of a specific germination protease (GPR; Setlow 1988). GPRs from a few Bacillus or Clostridium species form a typical singleton group of very similar homologs, showing no detectable sequence similarity to other protease families. The crystal structure has been determined for GPR from Bacillus megaterium (Ponnuraj et al. 2000). However, little information has been derived regarding the enzyme's active site location and catalytic mechanism (Ponnuraj et al. 2000).
By structural comparison and sequence analysis, we show that GPR is homologous to the bacterial NiFe hydrogenase processing protease HybD, an enzyme with a known structure (Fritsche et al. 1999). GPR is likely to be regulated by metal ions and its active site location is predicted to be similar to that of HybD.
| Results and Discussion |
|---|
|
|
|---|
ß
; the central beta sheet consists of 5 strands in the order of baced, and strand d is antiparallel to the rest of the four strands (Fig. 1a,b
|
30° with each other. The second motif is located at the end of strand c, with an invariant Asp and a small residue following it (Fig. 2
|
Despite the structural similarity and the conservation of two functional motifs in GPR and HybD, PSI-BLAST searches were not able to identify their similarities at significant levels. Interestingly, during similarity searches, we identified another distant group homologous to GPRs (Fig. 2
, middle group). For example, PSI-BLAST searches starting from C. acetabutylicum GPR identified a hypothetical protein (gi586851) at e-value 0.81, with both aforementioned motifs conserved. In the non-redundant database (June 2001), there are only two hypothetical proteins in this group (gi586851 and gi10176679). We call them "intermediate" sequences. To alleviate the restrictions of small sequence numbers, we also tried to retrieve sequences from the ERGO database (http://igweb.integratedgenomics.com/ERGO/) and the complete and unfinished genome DNA databases at NCBI (http://www.ncbi.nlm.nih.gov/Microb_blast/unfinishedgenome.html). A total of seven GPRs and nine intermediate sequences from different Bacillus and Clostridium species were found (Fig. 2
). Additional sequences improved the profile and helped homology detection. For example, we used the profile generated from the alignment of the nine intermediate sequences for PSI-BLAST searches and detected GPR sequences at more significant e-value levels (<0.1). The succession of the predicted secondary structure elements in the intermediate group is consistent with the two known structures (Figure 2
). All the intermediate sequences are also from the Bacillus or Clostridium species, apparently forming another singleton group. It is highly likely that they are proteases that may also participate in some process unique for Bacillus and Clostridium species, such as sporulation. Indeed, there is evidence of a second SASP-specific protease in B. megaterium besides GPR (Postemsky et al. 1978; Setlow 1988). Whereas each Bacillus species has only one intermediate sequence, Clostridium acetabutylicum has four, one of which (INTER_Ca4; Fig. 2
) is predicted to have no activity because both the key acidic residues are mutated. Like the GPRs, intermediate sequences do not have a conserved residue corresponding to the third metal ligand of HybD.
High sequence divergence makes it difficult to build a reliable evolutionary tree for all these homologous proteins. Instead, we used a distance diagram (V.N. Grishin and N.V. Grishin, unpubl.; see Materials and Methods for details) to illustrate the similarities among them. The distance diagram shown in Figure 3
is a two-dimensional projection of the multi-dimensional Euclidian space, with the points representing individual sequences. The distance between two points reflects the dissimilarity of the two sequences. Three well-separated groups of points are obvious in the diagram, corresponding to the three distantly related sequence groups. The group of hydrogenase maturation proteases has many representatives. The GPRs form a typical singleton group with the seven sequences being very close to each other, whereas the group of intermediate sequences is more dispersed (the predicted inactive protein from C. acetabutylicum shows the largest deviation from other sequences).
|
Although there is no direct evidence that metals function in the catalysis of GPRs, the homology to hydrogenase maturation proteases and the abundance of divalent metal ions in the spores (Ponnuraj et al. 2000; Setlow 1994) indicate strongly that they do. The GPR substrates are the SASPs that are classified into two types,
/ß and
(Setlow 1988). Analysis of their cleavage sites revealed a short motif with two conserved acidic residues, [ED]hXXE. Cleavage was found to occur after the first acidic residue. The acidic residues could act as metal ligands in substrate recognition or as the general base in catalysis. There seems to be no similarity around the cleavage sites of the NiFe hydrogenases and the substrates of GPRs, except that in both cases the position immediately after the cleavage site is usually occupied by a hydrophobic residue. This residue was shown to be important for the processing of the E. coli hydrogenase (HycE; Theodoratou et al. 2000b). Difference of cleavage site signatures implies that the exact mechanisms for GPRs and hydrogenase maturation proteases could be different. The exact roles of metals and the key residues involved in the catalysis of GPR still need to be addressed by experimental studies, but this work points to the location of the active site in GPR and residues that should be important for its function.
| Materials and methods |
|---|
|
|
|---|
Multiple alignments, secondary structure predictions, and generation of sequence profile
Multiple sequence alignments were constructed using the T-Coffee program (Notredame et al. 2000) for each sequence group. These alignments were merged manually based on the structural superposition of GPR (PDB entry 1c8b, chain A) and HybD (PDB entry 1cfz, chain A), in conjunction with sequence conservation patterns and secondary structure predictions from the Jpred2 server (Cuff and Barton 2000). The sequence alignment of the intermediate group was used as an input to construct a profile (-B option in the blastpgp program; Altschul et al. 1997) for a new round of PSI-BLAST searches starting from each intermediate sequence.
Euclidian space mapping and distance diagram
The conserved segments of each sequence shown in Figure 2
were used to calculate pairwise identity fractions qij between each sequence pair i and j. The identity fractions were converted to evolutionary distances with the formula
|
|
|
|
2 of each density was identical for all points. Starting from each point, the local maximum of the sum of such Gaussians was found. The points giving rise to the same local maximum were grouped together.
| Acknowledgments |
|---|
The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.
| References |
|---|
|
|
|---|
Appleby, T.C., Erion, M.D., and Ealick, S.E. 1999. The structure of human 5`-deoxy-5`-methylthioadenosine phosphorylase at 1.7 Å resolution provides insights into substrate binding and catalysis. Structure Fold Des. 7: 629641.[Medline]
Aravind, L. and Koonin, E.V. 1999. Gleaning non-trivial structural, functional, and evolutionary information about proteins by iterative database searches. J. Mol. Biol. 287: 10231040.[CrossRef][Medline]
Bode, W., Gomis-Ruth, F.X., Huber, R., Zwilling, R., and Stocker, W. 1992. Structure of astacin and implications for activation of astacins and zinc-ligation of collagenases. Nature 358: 164167.[CrossRef][Medline]
Burger, A., Voges, D., Demange, P., Perez, C.R., Huber, R., and Berendes, R. 1994. Structural and electrophysiological analysis of annexin V mutants. Mutagenesis of human annexin V, an in vitro voltage-gated calcium channel, provides information about the structural features of the ion pathway, the voltage sensor, and the ion selectivity filter. J. Mol. Biol. 237: 479499.[CrossRef][Medline]
Christianson, D.W., David, P.R., and Lipscomb, W.N. 1987. Mechanism of carboxypeptidase A: Hydration of a ketonic substrate analogue. Proc. Natl. Acad. Sci. 84: 15121515.
Christianson, D.W., Mangani, S., Shoham, G., and Lipscomb, W.N. 1989. Binding of D-phenylalanine and D-tyrosine to carboxypeptidase A. J. Biol. Chem. 264: 1284912853.
Cort, J.R., Yee, A., Edwards, A.M., Arrowsmith, C.H., and Kennedy, M.A. 2000. Structure-based functional classification of hypothetical protein MTH538 from Methanobacterium thermoautotrophicum. J. Mol. Biol. 302: 189203.[CrossRef][Medline]
Crawford, J.L., Lipscomb, W.N., and Schellman, C.G. 1973. The reverse turn as a polypeptide conformation in globular proteins. Proc. Natl. Acad. Sci. 70: 538542.
Cuff, J.A. and Barton, G.J. 2000. Application of multiple sequence alignment profiles to improve protein secondary structure prediction. Proteins 40: 502511.[CrossRef][Medline]
Dietmann, S., Park, J., Notredame, C., Heger, A., Lappe, M., and Holm, L. 2001. A fully automatic evolutionary classification of protein folds: Dali Domain Dictionary version 3. Nucleic Acids Res. 29: 5557.
Eddy, S. R. 1996. Hidden Markov models. Curr. Opin. Struct. Biol. 6: 361365.[CrossRef][Medline]
Eisenstein, E., Gilliland, G.L., Herzberg, O., Moult, J., Orban, J., Poljak, R.J., Banerjei, L., Richardson, D., and Howard, A.J. 2000. Biological function made crystal clear - annotation of hypothetical proteins via structural genomics. Curr. Opin. Biotechnol. 11: 2530.[CrossRef][Medline]
Esnouf, R.M. 1997. An extensively modified version of MolScript that includes greatly enhanced coloring capabilities. J. Mol. Graph. Model. 15:, 132134, 112113.[CrossRef][Medline]
Fritsche, E., Paschos, A., Beisel, H.G., Bock, A., and Huber, R. 1999. Crystal structure of the hydrogenase maturating endopeptidase HYBD from Escherichia coli. J. Mol. Biol. 288: 989998.[CrossRef][Medline]
Gribskov, M., McLachlan, A.D., and Eisenberg, D. 1987. Profile analysis: Detection of distantly related proteins. Proc. Natl. Acad. Sci. 84: 43554358.
Holm, L. and Sander, C. 1995. Dali: A network tool for protein structure comparison. Trends Biochem. Sci. 20: 478480.[CrossRef][Medline]
Kim, S. H. 1998. Shining a light on structural genomics. Nat. Struct. Biol. (Suppl.) 5: 643645.
Koppensteiner, W.A., Lackner, P., Wiederstein, M., and Sippl, M.J. 2000. Characterization of novel proteins based on known protein structures. J. Mol. Biol. 296: 11391152.[CrossRef][Medline]
Long, S.B., Casey, P.J., and Beese, L.S. 2000. The basis for K-Ras4B binding specificity to protein farnesyltransferase revealed by 2 Å resolution ternary complex structures. Structure Fold. Des. 8: 209222.[Medline]
Maier, T. and Bock, A. 1996. Generation of active [NiFe] hydrogenase in vitro from a nickel-free precursor form. Biochemistry 35: 1008910093.[CrossRef][Medline]
Makarova, K.S. & Grishin, N.V. 1999. The Zn-peptidase superfamily: Functional convergence after evolutionary divergence. J. Mol. Biol. 292: 1117.[CrossRef][Medline]
Menon, A.L. and Robson, R.L. 1994. In vivo and in vitro nickel-dependent processing of the [NiFe] hydrogenase in Azotobacter vinelandii. J. Bacteriol. 176: 291295.
Murzin, A.G. 1998. How far divergent evolution goes in proteins. Curr. Opin. Struct. Biol. 8: 380387.[CrossRef][Medline]
Murzin, A.G. 1999. Structure classification-based assessment of CASP3 predictions for the fold recognition targets. Proteins (Suppl.) 3: 88103.
Murzin, A.G., Brenner, S.E., Hubbard, T., and Chothia, C. 1995. SCOP: A structural classification of proteins database for the investigation of sequences and structures. J. Mol. Biol. 247: 536540.[CrossRef][Medline]
Notredame, C., Higgins, D.G., and Heringa, J. 2000. T-Coffee: A novel method for fast and accurate multiple sequence alignment. J. Mol. Biol. 302: 205217.[CrossRef][Medline]
Ponnuraj, K., Rowland, S., Nessi, C., Setlow, P., and Jedrzejas, M. J. 2000. Crystal structure of a novel germination protease from spores of Bacillus megaterium: Structural arrangement and zymogen activation. J. Mol. Biol. 300: 110.[CrossRef][Medline]
Postemsky, C.J., Dignam, S.S., and Setlow, P. 1978. Isolation and characterization of Bacillus megaterium mutants containing decreased levels of spore protease. J. Bacteriol. 135: 841850.
Rossmann, R., Sauter, M., Lottspeich, F., and Bock, A. 1994. Maturation of the large subunit (HYCE) of Escherichia coli hydrogenase 3 requires nickel incorporation followed by C-terminal processing at Arg537. Eur. J. Biochem. 220: 377384.[Medline]
Sali, A. 1998. 100,000 protein structures for the biologist. Nat. Struct. Biol. 5: 10291032.[CrossRef][Medline]
Schmitt, E., Panvert, M., Blanquet, S., and Mechulam, Y. 1998. Crystal structure of methionyl-tRNAfMet transformylase complexed with the initiator formyl-methionyl-tRNAfMet. EMBO J. 17: 68196826.[CrossRef][Medline]
Scott, D.L., Otwinowski, Z., Gelb, M.H., and Sigler, P.B. 1990. Crystal structure of bee-venom phospholipase A2 in a complex with a transition-state analogue. Science 250: 15631566.
Setlow, P. 1988. Small, acid-soluble spore proteins of Bacillus species: Structure, synthesis, genetics, function, and degradation. Annu. Rev. Microbiol. 42: 319338.[CrossRef][Medline]
Setlow, P. 1994. Mechanisms which contribute to the long-term survival of spores of Bacillus species. Soc. Appl. Bacteriol. Symp. Ser. 23: 49S60S.[Medline]
Setlow, P. 1995. Mechanisms for the prevention of damage to DNA in spores of Bacillus species. Annu. Rev. Microbiol. 49: 2954.[CrossRef][Medline]
Strater, N., Sun, L., Kantrowitz, E.R., and Lipscomb, W.N. 1999. A bicarbonate ion as a general base in the mechanism of peptide hydrolysis by dizinc leucine aminopeptidase. Proc. Natl. Acad. Sci. 96: 1115111155.
Terwilliger, T.C., Waldo, G., Peat, T.S., Newman, J.M., Chu, K., and Berendzen, J. 1998. Class-directed structure determination: Foundation for a protein structure initiative. Protein Sci. 7: 18511856.[Abstract]
Theodoratou, E., Paschos, A., Magalon, A., Fritsche, E., Huber, R., and Bock, A. 2000a. Nickel serves as a substrate recognition motif for the endopeptidase involved in hydrogenase maturation. Eur. J. Biochem. 267: 19951999.[Medline]
Theodoratou, E., Paschos, A., Mintz, W., and Bock, A. 2000b. Analysis of the cleavage site specificity of the endopeptidase involved in the maturation of the large subunit of hydrogenase 3 from Escherichia coli. Arch. Microbiol. 173: 110116.[CrossRef][Medline]
Volbeda, A., Charon, M. H., Piras, C., Hatchikian, E. C., Frey, M., and Fontecilla-Camps, J.C. 1995. Crystal structure of the nickel-iron hydrogenase from Desulfovibrio gigas. Nature 373: 580587.[CrossRef][Medline]
Walker, D.R. and Koonin, E.V. 1997. SEALS: A system for easy analysis of lots of sequences. Intelligtent Systems for Molecular Biology 5: 333339.
Zarembinski, T.I., Hung, L.W., Mueller-Dieckmann, H.J., Kim, K.K., Yokota, H., Kim, R., and Kim, S.H. 1998. Structure-based assignment of the biochemical function of a hypothetical protein: A test case of structural genomics. Proc. Natl. Acad. Sci. 95: 1518915193.
Zhang, H., Seabra, M.C., and Deisenhofer, J. 2000. Crystal structure of Rab geranylgeranyltransferase at 2.0 Å resolution. Structure Fold. Des. 8: 241251.[Medline]
![]()
CiteULike
Connotea
Del.icio.us
Digg
Reddit
Technorati What's this?
This article has been cited by other articles:
![]() |
T. M. Carroll and P. Setlow Site-Directed Mutagenesis and Structural Studies Suggest that the Germination Protease, GPR, in Spores of Bacillus Species Is an Atypical Aspartic Acid Protease J. Bacteriol., October 15, 2005; 187(20): 7119 - 7125. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Pei and N. V. Grishin The P5 protein from bacteriophage phi-6 is a distant homolog of lytic transglycosylases Protein Sci., May 1, 2005; 14(5): 1370 - 1374. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |