|
|
||||||||
1 Institut de Biochimie et de Biophysique Moléculaire et Cellulaire, Centre National de la Recherche Scientifique-Unité Mixte de Recherche (CNRS-UMR 8619) and
2 Institut de Génétique et Microbiologie (CNRS-UMR 8621), Université Paris-Sud, 91405 Orsay, France
3 Laboratoire dEnzymologie et Biochimie Structurale (CNRS-UPR [Unité Propre de Recherche] 9063), 91198 Gif sur Yvette, France
4 Centre de Bioingénierie Gilbert Durand, UMR-CNRS 5504, UR-INRA 792, F-31077, Toulouse, France
Reprint requests to: Herman van Tilbeurgh, Institut de Biochimie et de Biophysique Moléculaire et Cellulaire (CNRS-UMR 8619), Université Paris-Sud, Bât. 430, 91405 Orsay, France; e-mail: Herman.Van-Tilbeurgh{at}ibbmc.u-psud.fr; fax: +00-331-69-85-37-15.
(RECEIVED September 14, 2004; FINAL REVISION September 16, 2004; ACCEPTED September 16, 2004)
| Abstract |
|---|
|
|
|---|
-strands organized in two parallel packed
-sheets. The protein has strong structural resemblance to the plant storage and ligand binding proteins (canavalin, glycinin, auxin binding protein) but also to some plant and bacterial enzymes (epimerase, germin). The protein forms homodimers in the crystal, confirming measurements of its molecular mass in solution. Two monomers have their
-sheet packed together to form the dimer. The presence of a hydrophobic ligand in a well conserved pocket inside the barrel and local sequence similarity with bacterial epimerases may suggest a biochemical function for this protein. Keywords: jelly-roll motif; cupin superfamily; structural genomics; YML079wp; S. cerevisiae
Article and publication are at http://www.proteinscience.org/cgi/doi/10.1110/ps.041121305.
| Introduction |
|---|
|
|
|---|
Although we prioritized our targets for the discovery of new folds, the protein structures solved in our project inevitably brought us to investigate about their biochemical function. Structure might indeed reveal features undetectable by sequence analysis: (1) analogies with proteins of known function and (2) the presence of functional sites (Stark and Russell 2003). In these favorable cases structure provides a hypothesis for biochemical function, testable through site-directed mutagenesis and cocrystallization. Sometimes the crystal structure reveals the presence of a fortuitous ligand in an unexpected binding pocket providing some hints on the binding specificity of the protein (Zarembinski et al. 1998; Liger et al. 2004). The precise biochemical and cellular functions of the protein remain, however, frequently unknown and firm confirmation necessitates further experimental investigation (Graille et al. 2004).
The yeast YML079w gene codes for a hypothetical protein of 201 amino acids (molecular weight 22,460 Da and a calculated isoelectric point of 5.2) of unknown function. The sequence has similarity with a number of bacterial open reading frames (sequence identity around 30%). Homologous genes are also present in the eukaryotic organisms Neurospora crassa and Arabidopsis thaliana (35% sequence identity). None of these sequences is related to a protein of known function. Threading protocols do not propose a reasonable structure prediction for this ORF. Since the structure and function of this protein are unknown, it provided an excellent target in our structural genomics project. We determined the crystal structure of this protein by single wavelength anomalous scattering measurements on Semethionine substituted crystals and refined it to 1.75 Å resolution. The structure of YML079wp defines a new sequence family of the well-known vivilin/germin jelly-roll fold.
| Results and Discussion |
|---|
|
|
|---|
-strands (residues 6569). Electron density is missing for these regions in the three independent copies of the molecule in the crystal, suggesting this is due to inherent mobility. The three molecules in the asymmetric unit have identical structures (RMSD between the molecules: 0.37 Å). As measured by analytic gel filtration, YML079wp forms a dimer in solution (results not shown). Analysis of packing in the crystals shows that two subunits form a tight homodimer related by a local twofold axis and the third protomer is part of second dimer that is generated by a regular crystal symmetry axis.
|
-strands and three
-helices. Ten
-strands are part of two five-stranded anti-parallel
-sheets (S1, strands
1
4
11
6
9 and S2, strands
12
5
10
7
8), packed parallel against each other and forming a
-barrel known as jelly-roll motif. Two extra strands,
2 and
3 are connected by the mobile region and form a handle-like extension from the roll. The N-terminal part of the molecule starts with an extended conformation that covers the back of
-sheet S2, followed by helix
1. At the C-terminal part, a small two-helical anti-parallel bundle lies against the back of sheet S2.
|
-sheet handle extension as observed in YML079wp. The N-and C-terminal regions that are not part of the jelly-roll motif are also very different between these proteins and YML079wp.
|
|
1,
4 and
5; Fig. 3A
|
The best structural match of this YML079wp base binding pocket is found with the nucleotide binding pocket of RmlC. However in YML079wp, all the
-strands contributing to the pocket are from the same monomer, while in the available RmlC structures, these strands come from different monomers (Christendat et al. 2000; Giraud et al. 2000). The amino acid stretch at the active site entrance shows some degree of conservation. The RmlC family is characterized by the presence of a 24RG
F
E29 signature (where
is a hydrophobic residue, numbering according to Salmonella enterica RmlC). In RmlC structures, this peptide is localized in strand
3, which through its interaction with a strand from the other monomer, extends the RmlC
-sheet responsible for homodimer formation. This RmlC strand coincides with YML079wp strand
1 that bears the conserved sequence 45GG
(R/K)E50 (Fig. 2
). Two residues from this peptide seem to play a critical role in ligand binding. First, the aromatic side chain of the RmlC conserved Phe27 and of the YML079wp Phe48 (according to YML079wp numbering) is involved in stacking with the base moiety of the respective ligands. Second, the glutamate at positions 29 in RmlC and 50 in YML079wp is hydrogen-bonded to a nitrogen atom from the thymidine or the guanine bases, respectively.
Further analysis of the superposition between the YML079wp and the S. suis RmlC complexed to a substrate analog shows that the RmlC sugar moiety binding cavity (Dong et al. 2003), is also present in YML079wp (Fig. 3C
). In RmlC, this cavity harbors the residues critical for enzymatic activity and/or substrate binding: His63, Lys73, Tyr133, and Asp170 (Salmonella enterica numbering). RmlC His63 matches with YML079wp His96, which is totally conserved in all the structural homologs, even those with no documented catalytic activity such as auxin-binding protein, germin, or canavalin. This YML079wp cavity is made by well-conserved residues (His42, His96, Tyr84) and also coincides with the auxin-binding pocket. A carboxylate group (Glu44), which is almost totally conserved in the related sequence family is situated close to His96 (Fig. 2
). No strictly conserved residues corresponding to RmlC Lys73 and Tyr 133 are present. We were not able to demonstrate epimerase activity on a set test UDP-substrates. In conclusion, the discussed pocket in YML079wp is very likely involved in binding of a ligand that remains to be identified, but that probably contains an aromatic group. The pocket also has some but not all characteristics of active site pockets of a few enzymes within the cupin superfamily.
Conclusion
The structure of YML079wp shows that it is a new member of the jelly-roll fold superfamily and hence we propose to annotate the gene as CFF1 for Cupin Fold Family 1. This fold is already observed in a number of plant storage proteins and enzymes from plant and bacterial origin. All these proteins share a conserved surface pocket that in many cases has been identified as substrate or ligand binding sites. Sequence analogies with RmlC epimerase in the base binding pocket and conservation of a catalytic base (His96) suggest that YML079wp could be an enzyme involved in the modification of nucleotide sugars. This hypothesis is tempered, however, by the fact that some conserved residues are also present in storage proteins that have no documented ligand binding and enzymatic capacities. A number of epimerase reactions on nucleotide sugars are known in yeast for which no enzyme was associated yet. This hypothesis is under investigation for YML079wp.
| Materials and methods |
|---|
|
|
|---|
Crystallization and resolution of the structure
Se-Met substituted protein crystals were grown from a mixture of equal volumes of 9 mg/ml protein solution in 20 mM Tris-HCl (pH 8), 10 mM
mercaptoethanol, 100 mM sodium chloride and a reservoir solution of 20%24% PEG4000, 0.1 M Na-citrate (pH 5.6) and 0.2 M ammonium acetate. For data collection, crystals were flash-frozen after transfer into the crystallization solution containing 30% glycerol. The protein crystallized in the cubic space group I432 (a = b = c = 208 Å) with three subunits per asymmetric unit and the solvent content was 55%. Single wavelength anomalous diffraction (SAD) data to 2.8 Å resolution were collected from a crystal flash cooled at 100 K on the ESRF BM30A beam line in Grenoble. The data were processed using the MOSFLM and SCALA programs from the CCP4 package. The nine Selenium sites were found by Patterson methods with the program SOLVE (Terwilliger and Berendzen 1999). The experimental map was improved by solvent modification within the program RESOLVE (Terwilliger and Berendzen 1999). The resulting map was of very good quality and most of the protein model (181 out of 201 residues) was built using the molecular graphics program TURBO-FRODO (http://afmb.cnrs-mrs.fr/TURBO_FRODO/). Refinement was performed with the maximum likelihood target using amplitudes and experimental phases within the CNS program (Brunger et al. 1998) followed by manual checking and model rebuilding using TURBO-FRODO. The final model yielding crystallographic refinement values of R = 22.3% and Rfree = 28.4% contains 570 residues provided by three protein chains and 180 water molecules. After refinement, strong residual density remained in a pocket on the surface between two aromatic residues. The flatly shaped electron density was modeled as an adenine base because ATP was present in the crystallization solution. Residues 65 to 68 and five amino-terminal residues were not defined in the 2Fo-Fc electron density map, and are absent from the final model. The statistics on data collection, phasing and refinement are summarized in Table 1
.
In order to establish any nucleotide preference, we cocrystallized the native protein in presence of many different nucleotides. Good diffracting crystals were obtained with 5 mM GTP in the reaction mixture. They diffracted to 1.7 Å resolution on the ESRF BM30A beam line. The data were processed using the XDS program (Kabsch 1993). As observed for the crystals grown in the presence of ATP, the space group is I432 with three molecules in the asymmetric unit. The structure was refined with the CNS program to 1.75 Å resolution, followed by manual checking and model rebuilding using TURBO-FRODO. Clear residual electron density was modeled as the guanine moiety of the GTP introduced in the crystallization assays. Although resolution was higher, the residues which were not present in the ATP-bound form could not have been modeled into the electron density maps. The atomic coordinates and structure factors for the YML079wp GTP and ATP-bound structures have been deposited into the Brookhaven Protein Data Bank under the accession numbers 1XE7 and 1XE8, respectively.
| Footnotes |
|---|
6 These authors contributed equally to this work. ![]()
| References |
|---|
|
|
|---|
Anand, R., Dorrestein, P.C., Kinsland, C., Begley, T.P., and Ealick, S.E. 2002. Structure of oxalate decarboxylase from Bacillus subtilis at 1.75 Å resolution. Biochemistry 41: 76597669.[CrossRef][Medline]
Berrisford, J.M., Akerboom, J., Turnbull, A.P., de Geus, D., Sedelnikova, S.E., Staton, I., McLeod, C.W., Verhees, C.H., van der Oost, J., Rice, D.W., et al. 2003. Crystal structure of Pyrococcus furiosus phosphoglucose isomerase. Implications for substrate binding and catalysis. J. Biol. Chem. 278: 3329033297.
Bertone, P., Kluger, Y., Lan, N., Zheng, D., Christendat, D., Yee, A., Edwards, A.M., Arrowsmith, C.H., Montelione, G.T., and Gerstein, M. 2001. SPINE: An integrated tracking database and data mining approach for identifying feasible targets in high-throughput structural proteomics. Nucleic Acids Res. 29: 28842898.
Brenner, S.E. 2000. Target selection for structural genomics. Nat. Struct. Biol. 7 (Suppl.): 967969.
Brunger, A.T., Adams, P.D., Clore, G.M., DeLano, W.L., Gros, P., Grosse-Kunstleve, R.W., Jiang, J.S., Kuszewski, J., Nilges, M., Pannu, N.S., et al. 1998. Crystallography & NMR system: A new software suite for macro-molecular structure determination. Acta Crystallogr. D Biol. Crystallogr. 54: 905921.[CrossRef][Medline]
Christendat, D., Saridakis, V., Dharamsi, A., Bochkarev, A., Pai, E.F., Arrow-smith, C.H., and Edwards, A.M. 2000. Crystal structure of dTDP-4-keto-6-deoxy-D-hexulose 3,5-epimerase from Methanobacterium thermoautotrophicum complexed with dTDP. J. Biol. Chem. 275: 2460824612.
Dong, C., Major, L.L., Allen, A., Blankenfeldt, W., Maskell, D., and Naismith, J.H. 2003. High-resolution structures of RmlC from Streptococcus suis in complex with substrate analogs locate the active site of this class of enzyme. Structure (Camb.) 11: 715723.
Dunwell, J.M. 1998. Cupins: A new superfamily of functionally diverse proteins that include germins and plant storage proteins. Biotechnol. Genet. Eng. Rev. 15: 132.[Medline]
Giraud, M.F., Leonard, G.A., Field, R.A., Berlind, C., and Naismith, J.H. 2000. RmlC, the third enzyme of dTDP-L-rhamnose pathway, is a new class of epimerase. Nat. Struct. Biol. 7: 398402.[CrossRef][Medline]
Gong, W.M., Liu, H.Y., Niu, L.W., Shi, Y.Y., Tang, Y.J., Teng, M.K., Wu, J.H., Liang, D.C., Wang, D.C., Wang, J.F., et al. 2003. Structural genomics efforts at the Chinese Academy of Sciences and Peking University. J. Struct. Funct. Genomics 4: 137139.[Medline]
Gouet, P., Courcelle, E., Stuart, D.I., and Metoz, F. 1999. ESPript: Multiple sequence alignments in PostScript. Bioinformatics 15: 305308.
Graille, M., Quevillon-Cheruel, S., Leulliot, N., Zhou, C.Z., de la Sierra Gallay, I.L., Jacquamet, L., Ferrer, J.L., Liger, D., Poupon, A., Janin, J., et al. 2004. Crystal structure of the YDR533c S. cerevisiae protein, a class II member of the Hsp31 family. Structure (Camb.) 12: 839847.
Kabsch, W. 1993. Automatic processing of rotation diffraction data from crystals of initially unknown symmetry and cell constants. J. Appl. Crystallogr.: 795800.
Ko, T.P., Day, J., and McPherson, A. 2000. The refined structure of canavalin from jack bean in two crystal forms at 2.1 and 2.0 Å resolution. Acta Crystallogr. D Biol. Crystallogr. 56: 411420.[CrossRef][Medline]
Liger, D., Graille, M., Zhou, C.Z., Leulliot, N., Quevillon-Cheruel, S., Blondeau, K., Janin, J., and van Tilbeurgh, H. 2004. Crystal structure and functional characterization of yeast YLR011wp, an enzyme with NAD(P)H-FMN and ferric iron reductase activities. J. Biol. Chem. 279: 3489034897.
Marti-Renom, M.A., Stuart, A.C., Fiser, A., Sanchez, R., Melo, F., and Sali, A. 2000. Comparative protein structure modeling of genes and genomes. Annu. Rev. Biophys. Biomol. Struct. 29: 291325.[CrossRef][Medline]
Quevillon-Cheruel, S., Collinet, B., Zhou, C.Z., Minard, P., Blondeau, K., Henkes, G., Aufrere, R., Coutant, J., Guittet, E., Lewit-Bentley, A., et al. 2003. A structural genomics initiative on yeast proteins. J. Synchrotron Radiat. 10: 48.[Medline]
Stark, A. and Russell, R.B. 2003. Annotation in three dimensions. PINTS: Patterns in non-homologous tertiary structures. Nucleic Acids Res. 31: 33413344.
Terwilliger, T.C., and Berendzen, J. 1999. Automated MAD and MIR structure solution. Acta Crystallogr. D Biol. Crystallogr. 55: 849861.[CrossRef][Medline]
Thompson, J.D., Higgins, D.G., and Gibson, T.J. 1994. CLUSTAL W: Improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 22: 46734680.
Woo, E.J., Dunwell, J.M., Goodenough, P.W., Marvier, A.C., and Pickersgill, R.W. 2000. Germin is a manganese containing homohexamer with oxalate oxidase and superoxide dismutase activities. Nat. Struct. Biol. 7: 10361040.[CrossRef][Medline]
Woo, E.J., Marshall, J., Bauly, J., Chen, J.G., Venis, M., Napier, R.M., and Pickersgill, R.W. 2002. Crystal structure of auxin-binding protein 1 in complex with auxin. Embo J. 21: 28772885.[CrossRef][Medline]
Zarembinski, T.I., Hung, L.W., Mueller-Dieckmann, H.J., Kim, K.K., Yokota, H., Kim, R., and Kim, S.H. 1998. Structure-based assignment of the biochemical function of a hypothetical protein: A test case of structural genomics. Proc. Natl. Acad. Sci. 95: 1518915193.
![]()
CiteULike
Connotea
Del.icio.us
Digg
Reddit
Technorati What's this?
This article has been cited by other articles:
![]() |
C. R. Simmons, Q. Liu, Q. Huang, Q. Hao, T. P. Begley, P. A. Karplus, and M. H. Stipanuk Crystal Structure of Mammalian Cysteine Dioxygenase: A NOVEL MONONUCLEAR IRON CENTER FOR CYSTEINE THIOL OXIDATION J. Biol. Chem., July 7, 2006; 281(27): 18723 - 18733. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Quevillon-Cheruel, N. Leulliot, M. Graille, N. Hervouet, F. Coste, H. Benedetti, C. Zelwer, J. Janin, and H. Van Tilbeurgh Crystal structure of yeast YHR049W/FSH1, a member of the serine hydrolase family Protein Sci., May 1, 2005; 14(5): 1350 - 1356. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |