|
|
||||||||
ß fold: Solution structure of the hypothetical protein YML108W from Saccharomyces cerevisiae
1 Division of Molecular and Structural Biology, Ontario Cancer Institute and Department of Medical Biophysics, University of Toronto, Toronto, Ontario M5G 2M9, Canada
2 Biological Sciences Division, Pacific Northwest National Laboratory, Richland, Washington 99352, USA
Reprint requests to: Cheryl H. Arrowsmith, Ontario Cancer Institute, 610 University Avenue, Toronto, Ontario M5G 2M9, Canada; e-mail: carrow{at}uhnres.utoronto.ca; fax: (416) 946-6529.
(RECEIVED November 27, 2002; FINAL REVISION January 28, 2003; ACCEPTED February 6, 2003)
Northeast Structural Genomics Consortium
Article and publication are at http://www.proteinscience.org/cgi/doi/10.1110/ps.0240903.
| Abstract |
|---|
|
|
|---|
-helices, with an overall topology of ßß
ßß
. Strand ß1 runs parallel to ß4, and ß2:ß1 and ß4:ß3 pairs are arranged in an antiparallel fashion. Although this fold belongs to the split ß
ß family, it appears to be unique among this family; it is a novel arrangement of secondary structure, thereby expanding the universe of protein folds. Keywords: Heteronuclear NMR; protein fold; Saccharomyces cerevisiae; structural proteomics
| Introduction |
|---|
|
|
|---|
Here, we describe the solution structure of the protein encoded by ORF YML108W from Saccharomyces cerevisiae. The biological function of YML108W is unknown, and a BLAST search against all the databases did not reveal sequence similarity to any other protein or protein family. Three possible outcomes were predicted for this project: (1) YML108W would have, in spite of the absence of any sequence similarity, a previously known fold; (2) YML108W would have a new fold and define a new superfamily; or (3) YML108W would have a known fold, yet still define a new family. In fact, we find that the general architecture (split ß
ß) of YML108W is quite common. However, we were unable to find any protein with the same order and configuration of secondary structure elements, so we believe YML208W defines a new subfamily of the split ß
ß sandwiches.
| Results and Discussion |
|---|
|
|
|---|
|
|
-helices (Leu 50Glu 69 and Ile 90Asn 105) and a four-stranded ß-sheet with a strand order 2143, with an overall topology ßß
ßß
(Fig. 2A
-helices are located on the same side of the ß-sheet and run antiparallel to each other. Long-range NOEs between the helices and the sheet established their relative orientation. The residues connecting the first two ß-strands (Asp 15Gly 32) form a large disordered loop.
|
/ß domains of a number of proteins. This is not surprising given the topology of YML108W (ßß
ßß
). However, in all of the proteins but one, the two central ß-strands are antiparallel. The only exception, a fragment of the formaldehyde ferredoxin oxidoreductase (PDB accession no. 1B25
[PDB]
, Z-score 2.0), contains an insertion of two antiparallel ß-strands and an
-helix between the corresponding ß3 and ß4 strands from YML108W, and is missing the second
-helix (Fig. 2B
An interactive analysis of the SCOP database (Lo Conte et al. 2000) and the classification of
/ß folds by Orengo and Thornton (1993), reveals that the structure of YML108W resembles the ß-grasp fold, the POZ domain fold (SCOP classification), and the split ß
ß fold (Orengo and Thornton 1993). Thus, proteins like the B1 domain of protein G (PDB accession no. 2GB1
[PDB]
) contain a four-stranded ß-sheet, with the two central ß-strands running parallel to each other, but the second
-helix is missing (Fig. 2C
). The chain E of the Cytoplasmic ß Subunit-T1 Assembly Of Voltage-Dependent K Channels (PDB accession no. 1EXB
[PDB]
) also contains a four-stranded ß-sheet with parallel central ß-strands, but there is an insertion of an extra
-helix between ß2 and ß3, and the packing of the
-helices is quite different from that in YML108W (Fig. 2D
). The structures of the carboxy-terminal domain of the Escherichia coli arginine repressor (PDB accession no. 1XXA
[PDB]
) and YML108W are very similar, except that in the former, the two central ß-strands are antiparallel (Fig. 2E
). Finally, we found one protein with parallel central ß-strands and containing two
-helices (amino-terminal domain of the initiation factor 3; PDB accession no. 1TIF
[PDB]
), but, in this case, the two
-helices run parallel to each other, with the second
-helix protruding from the body of the amino-terminal domain toward the carboxy-terminal domain (Fig. 2F
).
We believe that the structure of YML108W represents another subfamily of the split ß
ß fold. This data point in protein fold space adds to the diversity of folds available to proteins and will facilitate the understanding of the relationship between sequence and three-dimensional structure.
| Materials and methods |
|---|
|
|
|---|
NMR spectroscopy
All NMR spectra were recorded at 25°C on a Varian INOVA 600 MHz spectrometer equipped with pulsed-field gradient triple-resonance probes. Linear prediction was used in the 13C and 15N dimensions to improve the digital resolution. Spectra were processed using the NMRPipe software package (Delaglio et al. 1995) and analyzed with XEASY (Bartels et al. 1995). SPSCAN (Glaser and Wüthrich) was used to convert nmrPipe formatted spectra into XEASY. The assignments of the 1H, 15N, 13CO, and 13C resonances were based on the following experiments: CBCA(CO)NH, HNCACB, CC(CO)NH-TOCSY, HNCO, HNHA, 15N-edited TOCSY-HSQC, and HCCH-TOCSY (Bax et al. 1994; Kay 1997). The backbone resonance assignment was achieved mainly by the combined analysis of the HNCACB and CBCA(CO)NH data. The side-chain resonances were identified mainly by the analysis of HCCH-TOCSY. Aromatic ring resonances were assigned on the basis of the analysis of heteronuclear NOESY optimized for the detection of aromatic 13C/1H resonances. In the 1H-15N HSQC, 84% backbone amide, resonances were assigned. Of the other resonances, 96% have been assigned for C
, 93% for H
, and 96% for C'. Moreover, 95% aliphatic side chains have been assigned for YML108W.
Structure calculation
For structure calculation purposes, a 15N-edited and a 13C-edited NOESY-HSQC (
m = 150 msec; Kay et al. 1992, Pascal et al. 1994) were acquired. NOE cross-peak assignments were obtained by using a combination of manual and automatic procedures. An initial fold of the protein was calculated on the basis of unambiguously assigned NOEs, with subsequent use of the module CANDID within the program CYANA (Herrmann et al. 2002). CANDID/CYANA performs automated assignment and distance calibration of NOE intensities, structure calculation with torsion angle dynamics, and automatic NOE upper-distance limit violation analysis. Peak analysis of the NOESY spectra were generated by interactive peak picking with the program XEASY. Backbone dihedral restraints were derived from 1H
and 13C
secondary chemical shifts using TALOS (Cornilescu et al. 1999). The program MOLMOL (Koradi et al. 1996) was used to analyze the resulting 10 energy-minimized conformers and to prepare drawings of the structures.
Accession numbers
The chemical shifts have been submitted to the BMRB (accession no. 5568), and the structure ensemble and NOE constraint file has been submitted to the PDB (accession no. 1N6Z
[PDB]
).
| Acknowledgments |
|---|
The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.
| References |
|---|
|
|
|---|
Bax, A., Vuister, G.W., Grzesiek, S., Delaglio, F., Wang, A.C., Tschudin, R., and Zhu, G. 1994. Measurement of homo- and heteronuclear J couplings from quantitative J correlation. Methods Enzymol. 239: 79105.[Medline]
Berman, H.M., Westbrook, J., Feng, Z., Gilliland, G., Bhat, T.N., Weissig, H., Shindyalov, I.N., and Bourne, P.E. 2000. The Protein Data Bank. Nucleic Acids Res. 28: 235242.
Burley, S.K. and Bonanno, J.B. 2002. Structuring the universe of proteins. Annu. Rev. Genomics Hum. Genet. 3: 243262.[CrossRef][Medline]
Cornilescu, G., Delaglio, F., and Bax, A. 1999. Protein backbone angle restraints from searching a database for chemical shift and sequence homology. J. Biomol. NMR 13: 289302.[CrossRef][Medline]
Cort, J.R., Yee, A., Edwards, A.M., Arrowsmith, C.H., and Kennedy, M.A. 2000. NMR structure determination and structure-based functional characterization of conserved hypothetical protein MTH1175 from Methanobacterium thermoautotrophicum. J. Struct. Func. Genomics 1: 1525.
Delaglio, F., Grzesiek, S., Vuister, G.W., Zhu, G., Pfeifer, J., and Bax, A. 1995. NMRPipe: A multidimensional spectral processing system based on UNIX pipes. J. Biomol. NMR 6: 277293.[Medline]
Glaser, R. and Wüthrich, K. SPSCAN (software). http://www.mol.biol.ethz.ch/wuthrich/software/spscan.
Herrmann, T., Güntert, P, and Wüthrich, K. 2002. Protein NMR structure determination with automated NOE assignment using the new software CANDID and the torsion angle dynamics algorithm DYANA. J. Mol. Biol. 319: 209227.[CrossRef][Medline]
Holm, L. and Sander, C. 1993. Protein structure comparison by alignment of distance matrices. J. Mol. Biol. 233: 123138.[CrossRef][Medline]
. 1999. Protein folds and families: Sequence and structure alignments. Nucleic Acids Res. 27: 244247.
Kay, L.E. 1997. NMR methods for the study of protein structure and dynamics. Biochem. Cell. Biol. 75: 115.[CrossRef][Medline]
Kay, L.E., Keifer, P., and Saarinen, T. 1992. Pure absorption gradient enhanced heteronuclear single quantum correlation spectroscopy with improved sensitivity. J. Am. Chem. Soc. 114: 1066310665.[CrossRef]
Koradi, R., Billeter, M., and Wüthrich, K. 1996. MOLMOL: A program for display and analysis of macromolecular structures. J. Mol. Graph. 14: 5155.[CrossRef][Medline]
Liu, J. and Rost, B. 2002. Target space for structural genomics revisited. Bioinformatics 18: 922933.
Lo Conte, L., Ailey, B., Hubbard, T.J., Brenner, S.E., Murzin, A.G., and Chothia, C. 2000. SCOP: A structural classification of proteins database. Nucleic Acids Res. 28: 257259.
Orengo, C.A. and Thornton, J.M. 1993.
plus ß folds revisited: Some favoured motifs. Structure 1: 105120.[Medline]
Orengo, C.A., Michie, A.D., Jones, S., Jones, D.T., Swindells, M.B., and Thornton, J.M. 1997. CATHa hierarchic classification of protein domain structures. Structure 5: 10931108.[Medline]
Pascal, S.M., Muhandiram, D.R., Yamazaki, T., Forman-Kay, J.D., and Kay, L.E. 1994. Simultaneous acquisition of 15N-edited and 13C-edited NOE spectra of proteins dissolved in H2O. J. Magn. Reson. B. 103: 197201.[CrossRef]
Schneider, R., de Daruvar, A., and Sander, C. 1997. The HSSP database of protein structure-sequence alignments. Nucleic Acids Res. 25: 226230.
Yang, A.S. and Honig, B. 2000. An integrated approach to the analysis and modeling of protein sequences and structures. III. A comparative study of sequence conservation in protein structural families using multiple structural alignments. J. Mol. Biol. 301: 679689.[CrossRef][Medline]
Yang, Z., Savchenko, A., Yakunin, A., Zhang, R., Edwards, A., Arrowsmith, C.H., and Tong, L. 2002. Aspartate dehydrogenase, a novel enzyme identified from structural and functional studies of TM1643. J. Biol. Chem. [online ahead of print].
Zarembinski, T.I., Hung, L.W., Mueller-Dieckmann, H.J., Kim, K.K., Yokota, H., Kim, R., and Kim, S.H. 1998. Structure-based assignment of the biochemical function of a hypothetical protein: A test case of structural genomics. Proc. Natl. Acad. Sci. 95: 189193.
![]()
CiteULike
Connotea
Del.icio.us
Digg
Reddit
Technorati What's this?
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |