|
|
||||||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
The Scripps Research Institute (TSRI), Department of Molecular Biology and Joint Center for Structural Genomics, La Jolla, California 92037, USA
Reprint requests to: Kurt Wüthrich, The Scripps Research Institute, Department of Molecular Biology and Joint Center for Structural Genomics, La Jolla, CA 92037, USA; e-mail: wuthrich{at}scripps.edu; fax: (858) 784-8014.
(RECEIVED August 5, 2005; FINAL REVISION August 5, 2005; ACCEPTED August 10, 2005)
| Abstract |
|---|
|
|
|---|
/
-topology formed by the regular secondary structures
1
1
2
2
3
4
3
5310
4, with a small anti-parallel
-sheet of
-strands 1 and 2, and a mixed parallel/anti-parallel
-sheet of
-strands 35. Similar folds have previously been observed in other proteins, with amino acid sequence identity as low as 3% and a variety of different functions. There are also 216 sequence homologs of TM0487, which all have the signature sequence of domains of unknown function 59 (DUF59), for which no three-dimensional structures have as yet been reported. The TM0487 structure thus presents a platform for homology modeling of this large group of DUF59 proteins. Conserved among most of the DUF59s are 13 hydrophobic residues, which are clustered in the core of TM0487. A putative active site of TM0487 consisting of residues D20, E22, L23, T51, T52, and C55 is conserved in 98 of the 216 DUF59 sequences. Asp20 is buried within the proposed active site without any compensating positive charge, which suggests that its pKa value may be perturbed. Furthermore, the DUF59 family includes ORFs that are part of a conserved chromosomal group of proteins predicted to be involved in FeS cluster metabolism. Keywords: NMR structure determination; Thermotoga maritima; structural genomics; DUF59 proteins
Article published online ahead of print. Article and publication date are at http://www.proteinscience.org/cgi/doi/10.1110/ps.051755805.
| Introduction |
|---|
|
|
|---|
The Thermotoga maritima protein TM0487 (Nelson et al. 1999) contains the signature sequence of the DUF59 family. This 103-residue conserved hypothetical protein was selected for structural studies in the context of the research program of the Joint Center for Structural Genomics (JCSG) (Lesley et al. 2002). TM0487 has a molecular weight of 11,490 Da and a calculated isoelectric point of 4.39 and was identified as a target for NMR structure determination based on 1D 1H NMR screening (Peti et al. 2004). This study describes the NMR structure determination of TM0487, and based on three-dimensional structure comparisons and amino acid sequence analysis, we discuss structural and functional roles of the conserved residues found in proteins belonging to the DUF59 family.
| Results |
|---|
|
|
|---|
|
|
/
-topology with five helices; one small, two-stranded, anti-parallel
-sheet composed of
-strands 1 and 2; and a three-stranded, mixed parallel/anti-parallel
-sheet of
-strands 35 (Fig. 1B
1
1
2
2
3
4
3
5310
4. An all-heavy atom presentation of the bundle of 20 DYANA conformers used to represent the TM0487NMR structure is presented in Figure 1A
-positions. The backbone is well-defined, with the sole exceptions of the N-terminal tetrapeptide segment and the C-terminal helix
4. The majority of the side chains are well-defined, with local displacement calculated for the heavy atoms of <1.0 Å (side chains shown in blue). These best-defined side chains are mostly located in the core of the TM0487 structure, and have, on average, <35% of their surface exposed to the solvent. | Discussion |
|---|
|
|
|---|
|
/
-motif present in these proteins and in TM0487 represents a structural scaffold that is utilized as an architectural module for a variety of different functions. Although the three-dimensional structures of these proteins have a considerable degree of similarity, they have distinct disordered loops or secondary structure elements that cause distortions of the canonical
1
1
2
2
3
4
3
5310
4-topology.
Identification of a putative active site
Thirteen hydrophobic residues conserved in the DUF59s are positioned in the core of the TM0487 structure (shown in brown in Fig. 2B
), suggesting that they are involved in the stabilization of closely related folds in all of these proteins. An acidic residue at position 20 and the cysteine at position 55 are proximal to each other in the structure of TM0487 (in the bundle of conformers of Fig. 1A
, the average distance between C
of D20 and S
of C55 is 6.3 Å). A search for similar three-dimensional structure patterns in nonhomologous tertiary structures, using the PINTS server (Stark and Russell 2003), identified closest similarity of TM0487 with the hybrid cluster protein (HCP) from Desulfovibrio desulfuricans (PDB ID 1oa0
[PDB]
), which has not yet been assigned a specific function (Aragão et al. 2003). The 216 nonredundant sequences that contain the DUF59 domain were then realigned by CLUSTALW (1.82; Chenna et al. 2003) in such a way that only sequences with the following traits were retained, i.e., an acidic residue at position 20, T51, T52, or S52, and C55 (residue numbering of TM0487). A serine rather than a threonine at position 52 was found in 37 of these sequences, and was included due to its similarity to threonine. Two gaps were allowed in the sequence alignment (Fig. 2A
), to avoid interruption of putative secondary structure elements observed in the TM0487 NMR structure. This alignment revealed three new positions with highly conserved traits, i.e., an acidic residue at position 22, a hydrophobic residue at position 23, and proline at position 56. Among the 216 DUF59s, we thus found a group of 98 nonredundant sequences containing acidic residues at positions 20 and 22; a hydrophobic residue at position 23; threonine 51; threonine or serine at position 52; cysteine 55; and proline 56. Residues 22, 23, 51, 52, and 55 form a contiguous surface in the TM0487 structure (green in Fig. 2B
), suggesting that they might form an active site for this class of proteins. It is also worth noting that the buried side-chain carboxyl group of D20 does not have a nearby positively charged group within the protein structure, which may then suggest it has a perturbed pKa-value. Finally, the strictly conserved residues C55 and P56 (Fig. 2A
) are the first two residues in a type IV
-turn (Richardson 1981), and proline in position 56 might have a role in properly positioning C55 in the active site.
In a comparison of the putative active site of TM0487 with the structure of one of the FeS clusters in the HCP protein from D. desulfuricans, a positive match was identified between the residues T51, T52, and C55 of TM0487, and the residues T75, T12, and C18 in this FeS cluster (red labels in Fig. 3
). Although there is apparent near-identity in the spatial arrangement of this group of three chemically identical residues (Fig. 3
), the three cysteines in positions 6, 9, and 24 of the FeS cluster of HCP are missing in TM0487. This does not a priori exclude the presence of an FeS cluster in TM0487, since other FeS clusters have been described with only two or three cysteines, with the other "cysteine positions" being occupied by Asp or Glu (Aragão et al. 2003; Johnson et al. 2005). In conclusion, although some closely homologous proteins to TM0487 are part of a conserved chromosomal cluster of proteins related to FeS cluster assembly (Zheng et al. 1993, 1998), the evidence presently available leaves open whether or not the proposed active site residues (green in Figs. 2B
, 3A
) might form a novel type of FeS cluster environment.
|
| Materials and methods |
|---|
|
|
|---|
Data collection
All NMR experiments were performed at 313 K on Bruker Avance600 and Avance900 spectrometers equipped with TXI-HCN-z or TXI-HCN-xyz gradient probeheads.
NMR data analysis and structure calculation
The sequence-specific resonance assignments were previously described (Almeida et al. 2004). The 1H, 13C, and 15N chemical shifts have been deposited in the BioMagResBank (http://www.bmrb.wisc.edu) under accession number 5976.
To collect the input for the structure calculation, three-dimensional 15N-resolved [1H,1H]-NOESY and three-dimensional 13C-resolved [1H,1H]-NOESY spectra (both recorded at 900 MHz with a mixing time of 75 msec) were processed with PROSA (Güntert et al. 1992) and analyzed with an automated routine for NOESY peak picking and NOE assignment, ATNOS/CANDID (Herrmann et al. 2002a,b) implemented in the torsion angle dynamics program DYANA (Güntert et al. 1997). The input for ATNOS/CANDID/DYANA (Herrmann et al. 2002a, b) consisted of the amino acid sequence of TM0487, the chemical-shift lists obtained from the previous sequence-specific resonance assignment (Almeida et al. 2004), and the three-dimensional heteronuclear-resolved [1H,1H]-NOESY spectra. Stereospecific assignments of 42 valine and leucine isopropyl methyl groups had been determined experimentally by biosynthetic fractional 13C-labeling (Senn et al. 1989). These 42 stereospecific assignments, and 106 backbone dihedral angle constraints derived from the C
and C
chemical shifts (Spera and Bax 1991; Luginbühl et al. 1995) were added to the input for each cycle of structure calculation. The standard iterative protocol with seven cycles of ATNOS peak picking, CANDID NOE assignment, and three-dimensional structure calculation with DYANA was applied (Herrmann et al. 2002a,b). During the first six ATNOS/CANDID cycles, ambiguous distance constraints were used (Nilges et al. 1997). In the second and subsequent cycles, the intermediate three-dimensional protein structures were used as an additional guide for the interpretation of the NOESY spectra. For the final structure calculation in cycle 7, only distance constraints were retained that could be unambiguously assigned based on the intermediate protein structure from cycle 6. The 20 conformers from cycle 7 with the lowest residual DYANA target function values were energy minimized in a water shell with the program OPALp (Luginbühl et al. 1996, Koradi et al. 2000), using the AMBER force field (Cornell et al. 1995). The program MOLMOL (Koradi et al. 1996) was used to analyze the resulting bundle of 20 energy-minimized conformers and to prepare the figures showing molecular models.
Data validation and deposition
Analysis of the stereochemical quality of the NMR structure bundle was accomplished using the JCSG Validation Central Suite (http://www.jcsg.org), which integrates seven validation tools: Procheck 3.5.4, Sfcheck 4.0, Prove 2.5.1, ERRAT, WASP, DDQ 2.0, and Whatcheck. The atomic coordinates of the refined structure have been deposited in the PDB (http://www.rcsb.org/pdb), with codes 1uwx
[PDB]
for the bundle of 20 structures and 1wcj for the structure with the smallest RMSD to the mean coordinates of the bundle.
| Footnotes |
|---|
2 Department of Molecular Pharmacology, Physiology and Biotechnology, Brown University, Providence, RI 02912, USA. ![]()
| Acknowledgments |
|---|
| References |
|---|
|
|
|---|
Aragão, D., Macedo, S., Mitchell, E.P., Romão, C.V., Liu, M.Y., Frazão, C., Saraiva, L.M., Xavier, A.V., LeGall, J., van Dongen, et al. 2003. Reduced hybrid cluster proteins (HCP) from Desulfovibrio desulfuricans ATCC 27774 and Desulfovibrio vulgaris (Hildenborough): X-ray structures at high resolution using synchrotron radiation. J. Biol. Inorg. Chem. 8: 540548.[Medline]
Chen, X., Court, D.L., and Ji, X. 1999. Crystal structure of Era: A Gtpase-dependent cell cycle regulator containing an RNA-binding motif. Proc. Nat. Acad. Sci. 96: 83968401.
Chenna, R., Sugawara, H., Koike, T., Lopez, R., Gibson, T.J., Higgins, D.G., and Thompson, J.D. 2003. Multiple sequence alignment with the Clustal series of programs. Nucleic Acids Res. 31: 34973500.
Conti, E., Franks, N.P., and Brick, P. 1996. Crystal structure of firefly luciferase throws light on a superfamily of adenylate-forming enzymes. Structure 4: 287298.[Medline]
Cornell, W.D., Cieplak, P., Bayly, C.I., Gould, I.R., Merz, J.K.M., Fergunson, D.M., Spellmeyer, D.C., Fox, T., Caldwell, J.W., and Kollman, P.A. 1995. A second generation force field for the simulation of proteins, nucleic acids, and organic molecules. J. Am. Chem. Soc. 117: 51795197.[CrossRef]
Ferrández, A., Miñambres, B., García, B., Olivera, E.R., Luengo, J.M., García, J.L., and Díaz, E. 1998. Catabolism of phenylacetic acid in Escherichia coli. Characterization of a new aerobic hybrid pathway. J. Biol. Chem. 273: 2597425986.
Gorman, M.A., Morera, S., Rothwell, D.G., De La Fortelle, E., Mol, C.D., Tainer, J.A., Hickson, I.D., and Freemont, P.S. 1997. The crystal structure of the human DNA repair endonuclease Hap1 suggests the recognition of extra-helical deoxyribose at DNA abasic sites. EMBO J. 16: 65486558.[CrossRef][Medline]
Guex, N. and Peitsch, M.C. 1997. SWISS-MODEL and the Swiss-Pdb Viewer: An environment for comparative protein modeling. Electrophoresis 18: 27142723.[CrossRef][Medline]
Güntert, P., Dötsch, V., Wider, G., and Wüthrich, K. 1992. Processing of multi-dimensional NMR data with the new software PROSA. J. Biomol. NMR 2: 619629.[CrossRef]
Güntert, P., Mumenthaler, C., and Wüthrich, K. 1997. Torsion angle dynamics for NMR structure calculation with the new program DYANA. J. Mol. Biol. 273: 283298.[CrossRef][Medline]
Herrmann, T., Güntert, P., and Wüthrich, K. 2002a. Protein NMR structure determination with automated NOE-identification in the NOESY spectra using the new software ATNOS. J. Biomol. NMR 24: 171189.[CrossRef][Medline]
. 2002b. Protein NMR structure determination with automated NOE assignment using the new software CANDID and the torsion angle dynamics algorithm DYANA. J. Mol. Biol. 319: 209227.[CrossRef][Medline]
Holm, L. and Sander, C. 1993. Protein structure comparison by alignment of distance matrices. J. Mol. Biol. 233: 123138.[CrossRef][Medline]
Johnson, D., Dean, D.R., Smith, A.D., and Johnson, M.K. 2005. Structure, function, and formation of biological iron-sulfur clusters. Annu. Rev. Biochem. 74: 247281.[CrossRef][Medline]
Koradi, R., Billeter, M., and Wüthrich, K. 1996. MOLMOL: A program for display and analysis of macromolecular structures. J. Mol. Graphics 14: 5155.[CrossRef][Medline]
Koradi, R., Billeter, M., and Güntert, P. 2000. Point-centered domain decomposition for parallel molecular dynamics simulation. Comput. Phys. Commun. 124: 139147.[CrossRef]
Laskowski, R.A., Macarthur, M.W., Moss, D.S., and Thornton, J.M. 1993. PROCHECK: A program to check the stereochemical quality of protein structures. J. Appl. Cryst. 26: 283291.[CrossRef]
Lesley, S.A., Kuhn, P., Godzik, A., Deacon, A.M., Mathews, I., Kreusch, A., Spraggon, G., Klock, H.E., Mcmullan, D., Shin, T., et al. 2002. Structural genomics of the Thermotoga maritima proteome implemented in a high-throughput structure determination pipeline. Proc. Natl. Acad. Sci. 99: 1166411669.
Lezhneva, L., Amann, K., and Meurer, J. 2004. The universally conserved HCF101 protein is involved in the assembly of [4Fe4S]-cluster-containing complexes in Arabidopsis thaliana chloroplasts. Plant J. 37: 174185.[Medline]
Luginbühl, P., Szyperski, T., and Wüthrich, K. 1995. Statistical basis for the use of 13C
chemical shifts in protein structure determination. J. Magn. Reson. 109: 229233.[CrossRef]
Luginbühl, P., Güntert, P., Billeter, M., and Wüthrich, K. 1996. The new program OPAL for molecular dynamics simulations and energy refinements of biological macromolecules. J. Biomol. NMR 8: 136146.[Medline]
Marchler-Bauer, A., Panchenko, A.R., Shoemaker, B.A., Thiessen, P.A., Geer, L.Y., and Bryant, S.H. 2002. CDD: A database of conserved domain alignments with links to domain three-dimensional structure. Nucleic Acids Res. 30: 281283.
Nelson, K.E., Clayton, R.A., Gill, S.R., Gwinn, M.L., Dodson, R.J., Haft, D.H., Hickey, E.K., Peterson, L.D., Nelson, W.C., Ketchum, K.A., et al. 1999. Evidence for lateral gene transfer between archaea and bacteria from the genome sequence of Thermotoga maritima. Nature 399: 323329.[CrossRef][Medline]
Nilges, M., Macias, M.J., ODonoghue, S.I., and Oschkinat, H. 1997. Automated NOESY interpretation with ambiguous distance restrains: The refined NMR solution structure of the pleckstrin homology domain from
-spectrin. J. Mol. Biol. 269: 408422.[CrossRef][Medline]
Olivera, E.R., Minambres, B., Garcia, B., Muniz, C., Moreno, M.A., Ferrández, A., Diaz, E., Garcia, J.L., and Luengo, J.M. 1998. Molecular characterization of the phenylacetic acid catabolic pathway in Pseudomonas putida U: The phenylacetyl-CoA catabolon. Proc. Natl. Acad. Sci. 95: 64196424.
Peti, W., Etezady-Esfarjani, T., Herrmann, T., Klock, H.E., Lesley, S.A., and Wüthrich, K. 2004. NMR for structural proteomics of Thermotoga maritima: Screening and structure determination. J. Struct. Funct. Genomics 5: 205215.[CrossRef][Medline]
Richardson, J.S. 1981. The anatomy and taxonomy of protein structure. Adv. Protein Chem. 34: 167339.[Medline]
Sattler, J., Schleucher, J., and Griesinger, C. 1999. Heteronuclear multidimensional NMR experiments for the structure determination of proteins in solution employing pulsed field gradients. Prog. Nucl. Magn. Reson. Spectrosc. 34: 93158.[CrossRef]
Senn, H., Werner, B., Messerle, B.A., Weber, C., Traber, R., and Wüthrich, K. 1989. Stereospecific assignment of the methyl 1H NMR lines of valine and leucine in polypeptides by nonrandom 13C labelling. FEBS Lett. 249: 113118.[CrossRef]
Spera, S. and Bax, A. 1991. Empirical correlation between protein backbone conformation and 13C
and 13C
nuclear magnetic resonance chemical shifts. J. Am. Chem. Soc. 113: 54905492.[CrossRef]
Stark, A. and Russell, R.B. 2003. Annotation in three dimensions. PINTS: Patterns in non-homologous tertiary structures. Nucleic Acids Res. 326: 13071316.
Stöckel, J. and Oelmüller, R. 2004. A novel protein for photosystem I biogenesis. J. Biol. Chem. 279: 1024310251.
Tesmer, J.J., Klem, T.J., Deras, M.L., Davisson, V.J., and Smith, J.L. 1996. The crystal structure of GMP synthetase reveals a novel catalytic triad and is a structural paradigm for two enzyme families. Nat. Struct. Biol. 3: 7486.[CrossRef][Medline]
Wüthrich, K. 1986. NMR of proteins and nucleic acids. Wiley, New York.
Zheng, L., White, R.H., Cash, V.L., Jack, R.F., and Dean, D.R. 1993. Cysteine desulfurase activity indicates a role for NIFS in metallocluster biosynthesis. Proc. Natl. Acad. Sci. 90: 27542758.
Zheng, L., Cash, V.L., Flint, D.H., and Dean, D.R. 1998. Assembly of iron-sulfur clusters: Identification of an iscSUA-hscBA-fdx gene cluster from Azotobacter vinelandii. J. Biol. Chem. 273: 1326413272.
![]()
CiteULike
Connotea
Del.icio.us
Digg
Reddit
Technorati What's this?
This article has been cited by other articles:
![]() |
I. C. Sutcliffe, G. W. Black, and D. J. Harrington Bioinformatic insights into the biosynthesis of the Group B carbohydrate in Streptococcus agalactiae Microbiology, May 1, 2008; 154(5): 1354 - 1363. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |