|
|
||||||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Center for Eukaryotic Structural Genomics, Department of Biochemistry, University of Wisconsin-Madison, Madison, Wisconsin 53706-1544, USA
Reprint requests to: John L. Markley, Center for Eukaryotic Structural Genomics, Department of Biochemistry, 433 Babcock Drive, University of Wisconsin-Madison, Madison, WI 53706-1544, USA; e-mail: markley{at}nmrfam.wisc.edu; fax: (608) 262-3759.
(RECEIVED November 16, 2004; FINAL REVISION December 21, 2004; ACCEPTED December 21, 2004)
| Abstract |
|---|
|
|
|---|
-barrel sandwich. A two-stranded anti-parallel
-sheet, which seals off one end of the
-barrel, is flanked by two flexible loops rich in acidic amino acids. Although this fold often provides a ligand binding site, the structure did not reveal an appreciable cavity inside the
-barrel. The three-dimensional structure of At3g04780.1-des15 provides an entry point for understanding its functional role and those of its mammalian homologs. Keywords: structural genomics; Arabidopsis thaliana; NMR; TXNL_HUMAN ortholog
Article published online ahead of print. Article and publication date are at http://www.proteinscience.org/cgi/doi/10.1110/ps.041246805.
| Introduction |
|---|
|
|
|---|
The goals of the Center for Eukaryotic Structural Genomics (CESG) are to develop the technology for producing eukaryotic proteins and for determining their three-dimensional structures. CESG applies this technology in determining structures of novel eukaryotic proteins. The structure described here is the product of gene At3g04780.1 from Arabidopsis thaliana, which was chosen as a fold-space target. The full-length sequence of At3g04780.1 revealed little sequence identity with any other protein with known structure or function. However, part of this protein shows a 42% sequence identity with the C-terminal domain of the 32-kDa human thioredoxin-like protein (TXNL_HUMAN) (Fig. 1
; Lee et al. 1998). The C-terminal domain of the TXNL_HUMAN is rich in acidic residues and has a calculated pI of 4.3, which is also a distinct feature for At3g04780.1 (calculated pI of 4.9). TXNL_HUMAN is expressed in cytoplasm. Although the molecular function of this protein is unknown, its genomic location has suggested that it plays a role in cell apoptosis and cancer (Miranda-Vizuete and Spyrou 2000). The X-ray structure of the N-terminal domain of TXNL_HUMAN has been solved and shown to be a member of thioredoxin family (Jin et. al. 2002). However, attempts to crystallize the C-terminal domain of this protein proved unsuccessful. We report here the NMR structure of At3g04780.1-des15 (Fig. 2
), which allows one to model the three-dimensional structure of the C-terminal domain of TXNL_HUMAN and provides the foundation for future functional studies of this and other homologous proteins.
|
|
| Results and Discussion |
|---|
|
|
|---|
|
|
-sandwich jelly roll structure. The
-sandwich is formed by face-to-face packing of two anti-parallel
-sheets, sheet A composed of four strands (
1, residues 1415;
5, 7178;
8, 123124; and
11, 154160) and sheet B composed of five strands (
2, 2627;
4, 6070;
6, 8592;
7, 109112; and
9, 133143) (Fig. 3B,C
-barrel is sealed by a third anti-parallel
-sheet (sheet C), which is composed of two
strands (
3, 5253 and
10, 151152). Flexible loops flanking sheet C as well as loops encompassing sheets A and B protrude and are exposed to the solvent. The loop residues are largely composed of hydrophilic residues. On the opposite side, the extension and curvature of strands
4,
5, and
7 serve to seal the
-barrel. Other secondary structural elements include four short helices (
1, residues 1619;
2, 3740;
3, 99102; and
4, 115118), arranged at discrete regions of the protein.
The VAST (http://www.ncbi.nlm.nih.gov/Structure/VAST/vastsearch.html) server was used to search the Protein Data Bank for the structural homologs of At3g04780.1-des15. Among the closest structural neighbors are the Saccharomyces cerevisiae anaphase-promoting complex subunit DOC1/Apc10 (PDB accession number 1GQP
[PDB]
) with C
r.m.s.d. of 2.9 Å for 127 aligned residues, the Anguilla anguilla fucose binding lectin (1K12) with C
r.m.s.d. of 3.0 Å for 129 aligned residues, and the galactose binding domain of Dactylium dendriodes galactose oxidase (1GOF) with C
r.m.s.d. of 3.1 Å for 128 aligned residues (Fig. 4
). DOC1/Apc10 is involved in proteinprotein interactions, and the other two proteins bind sugars (Ito et al. 1991; Au et al. 2002; Bianchet et al. 2002). Coincidentally, the molecular interactions in these three disparate proteins occur at a region structurally equivalent to sheet C and its two flanking loops of At3g04780.1-des15. This region in Doc1/APC10 was proposed to mediate its conserved function in forming the anaphase-promoting complex (APC) (Au et al. 2002); the corresponding regions in fucose binding lectin and galactose oxidase serve as substrate binding pockets (Ito et al. 1991; Bianchet et al. 2002). These observations suggest that this region of At3g04780.1 may play an important role in its biological function. This implication is further reinforced by another finding: Structural analysis of At3g04780.1-des15 reveals that, although its charged residues are spread over the surface, the highest concentration of acidic residues occurs in the proximal region of sheet C (Fig. 3D,E
). In particular, the Pro80-Glu81-Glu82-Glu83-Gly84-Pro85 sequence motif (Pro-Asp-Asn-Gly-Gln-Gly-Pro in TXNL_HUMAN), located on the one of the two loops flanking sheet C is a unique feature of At3g04780.1-des15.
|
The three-dimensional structure of At3g04780.1-des15 provides an entry point for understanding its functional role. The structure suggests further experiments designed to screen for ligands and/or other proteins involved in the biological role of At3g04780.1 and its mammalian orthologs.
| Materials and methods |
|---|
|
|
|---|
1 mM [U-13C,U-15N]-At3g04780.1-des15, 50 mM KH2PO4, 90% H2O/10% D2O, and 10 mM DTT at pH 7.2.
All NMR spectra were recorded at the National Magnetic Resonance Facility at Madison (NMRFAM) on a Varian Inova 600 spectrometer equipped with a cryogenic probe. The temperature of the sample was held at 25°C. Data sets collected for resonance assignments and structure determination included 2D [1H, 15N] HSQC, 2D [1H, 13C] HSQC, 2D (HB)CB(CGCD)HD, 2D (HB)CB(CGCDCE)HE, 3D HNCACB, HN(CO)CACB, HNCO, HCCONH, CCONH, HCCH-COSY, 15N edited [1H, 1H] NOESY (
mix = 100 msec), and 13C edited [1H, 1H] NOESY (
mix = 100 msec).
We used published approaches for NMR spectral processing, data analysis, and resonance assignment (Song et al. 2004). The GARANT (Bartels et al. 1996) program was used as a semiautomatic approach for determining the backbone and aliphatic side chain resonance assignments. Information from 2D (HB)CB (CGCD)HD and 2D (HB)CB(CGCDCE)HE (Yamazaki et al. 1993) data sets, which correlate the 13C
of an aromatic residue with its 1H
/
, were used to assign the aromatic side chains of Phe, Tyr, and His. The backbone assignments were 99% complete (Fig. 2
), and the side chain assignments were
90% complete. The raw, time-domain, NMR data sets and chemical shift assignments have been deposited in the BioMagResBank database under BMRB accession number 6341
[BMRB]
.
The TALOS (Cornilescu et al. 1999) software package was used to predict
and
torsion angles from the assigned chemical shifts as restraints for structure calculations. The automated CANDID iterative refinement module of the CYANA software package (Herrmann et al. 2002) was used in generating the initial NOE assignments and the initial set of structural models. Additional NOE assignments were then added and erroneous ones corrected through examination of NMR spectra prior to recalculation of structures by CYANA (Güntert et al. 1997). The final structure refinement by CYANA included 76 hydrogen bond constraints, which were generated from NOEs observed as characteristic for the
-helices and
-sheets. The 20 structures with the lowest target function were chosen for further refinement by X-PLOR (Brünger 1992; Schwieters et al. 2003), in which physical force field terms and explicit water solvent molecules were added to the experimental constraints. The final 20 NMR structures of At3g04780.1 were validated by Procheck-NMR (Laskowski et al. 1996), and the statistics for these are listed in Table 1
. The coordinates for these structural models have been deposited in the Protein Data Bank (PDB) under the accession number 1XOY
[PDB]
.
| Acknowledgments |
|---|
This research was supported by the NIH Protein Structure Initiative through grant 1 P50 GM64598. NMR data were collected and analyzed in the National Magnetic Resonance Facility at Madison, which is supported by National Institutes of Health grants P41RR02301 (Biomedical Research Technology Program, National Center for Research Resources) and P41GM66326 (National Institute of General Medical Sciences). Equipment in the facility was purchased with funds from the University of Wisconsin, the National Institutes of Health (P41GM66326, P41RR02301, RR02781, RR08438), the National Science Foundation (DMB-8415048, OIA-9977486, BIR-9214394), and the U.S. Department of Agriculture.
| References |
|---|
|
|
|---|
Au, S.W., Leng, X., Harper, J.W., and Barford, D. 2002. Implications for the ubiquitination reaction of the anaphase-promoting complex from the crystal structure of the Doc1/Apc10 subunit. J. Mol. Biol. 316: 955968.[CrossRef][Medline]
Bartels, C., Billeter, M., Güntert, P., and Wüthrich, K. 1996. Automated sequence-specific NMR assignment of homologous proteins using the program GARANT. J. Biomol. NMR 7: 207213.
Bianchet, M.A., Odom, E.W., Vasta, G.R., and Amzel, L.M. 2002. A novel fucose recognition fold involved in innate immunity. Nat. Struct. Biol. 9: 628634.[Medline]
Brünger, A.T. 1992. X-PLOR version 3.1: A system for X-ray crystallography and NMR. Yale University Press, New Haven, CT.
Cornilescu, G., Delaglio, F., and Bax, A. 1999. Protein backbone angle restraints from searching a database for chemical shift and sequence homology. J. Biomol. NMR 13: 289302.[CrossRef][Medline]
Güntert, P., Mumenthaler, C., and Wüthrich, K. 1997. Torsion angle dynamics for NMR structure calculation with the new program DYANA. J. Mol. Biol. 273: 283298.[CrossRef][Medline]
Herrmann, T., Guntert, P., and Wuthrich, K. 2002. Protein NMR structure determination with automated NOE assignment using the new software CANDID and the torsion angle dynamics algorithm DYANA. J. Mol. Biol. 319: 209227.[CrossRef][Medline]
Ito, N., Phillips, S.E., Stevens, C., Ogel, Z.B., McPherson, M.J., Keen, J.N., Yadav, K.D., and Knowles, P.F. 1991. Novel thioether bond revealed by a 1.7 Å crystal structure of galactose oxidase. Nature 350: 8790.[CrossRef][Medline]
Jeon, W.B., Aceti, D.J., Bingman, C.A., Vojtik, F.C., Olson, A.C., Ellefson, J.M., Mccombs, J.E., Sreenath, H.K., Blommel, P.G., Seder, K.D., et al. 2004. High-throughtput purification and quality assurance of Arabidopsis thaliana proteins for eukaryotic structural genomics. J. Struct. Funct. Genomics (in press).
Jin, J., Chen, X., Zhou, Y., Bartlam, M., Guo, Q., Liu, Y., Sun, Y., Gao, Y., Ye, S., Li, G., et al. 2002. Crystal structure of the catalytic domain of a human thioredoxin-like protein. Eur. J. Biochem. 269: 20602068.[Medline]
Koradi, R., Billeter, M., and Wüthrich, K. 1996. MOLMOLA program for display and analysis of macromolecular structures. J. Mol. Graph. 14: 5155.[CrossRef][Medline]
Laskowski, R.A., Rullmann, J.A.C., MacArthur, M.W., Kaptein, R., and Thornton, J.M. 1996. AQUA and PROCHECK-NMR: Programs for checking the quality of protein structures solved by NMR. J. Biomol. NMR 8: 477486.[Medline]
Lee, K.K., Murakawa, M., Takahashi, S., Tsubuki, S., Kawashima, S., Sakamaki, K., and Yonehara, S. 1998. Purification, molecular cloning, and characterization of TRP32, a novel thioredoxin-related mammalian protein of 32 kDa. J. Biol. Chem. 273: 1916019166.
Miranda-Vizuete, A. and Spyrou, G. 2000. Genomic structure and chromosomal localization of human thioredoxin-like protein gene (txl). DNA Seq. 10: 419424.[Medline]
Mittl, P.R. and Grutter, M.G. 2001. Structural genomics: Opportunities and challenges. Curr. Opin. Chem. Biol. 5: 402408.[CrossRef][Medline]
Schwieters, C.D., Kuszewski, J.J., Tjandra, N., and Clore, G.M. 2003. The Xplor-NIH NMR molecular structure determination package. J. Magn. Reson. 160: 6573.[CrossRef][Medline]
Song, J., Vinarov, D.A., Tyler, E.M., Shahan, M.N., Tyler, R.C., and Markley, J.L. 2004. Hypothetical protein At2g24940.1 from Arabidopsis thaliana has a cytochrome B5-like fold. J. Biomol. NMR (in press).
Terwilliger, T.C. 2000. Structural genomics in North America. Nat. Struct. Biol. 7 (Suppl.): 935939.
Tyler, R., Sreenath, H.K., Aceti, D.J., Bingman, C.A., Singh, S., Markley, J.L., and Fox, B.G. 2004. Auto-induction medium for the production of [U-15N] and [U-13C, U-15N]-labeled proteins for NMR screening and structure determination. Prot. Expr. Purif. 30: 215218.
Weichsel, A., Gasdaska, J.R., Powis, G., and Montfort, W.R. 1996. Crystal structures of reduced, oxidized, and mutated human thioredoxins: Evidence for a regulatory homodimer. Structure 4: 735751.[Medline]
Yamazaki, T., Forman-Kay, J.D., and Kay, L.E. 1993. 2-Dimensional NMR experiments for correlating 13C
and 1H
/
chemical-shifts of aromatic residues in 13C-labeled proteins via scalar couplings. J. Am. Chem. Soc. 115: 1105411055.[CrossRef]
![]()
CiteULike
Connotea
Del.icio.us
Digg
Reddit
Technorati What's this?
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |