|
|
||||||||
1 Department of Chemistry, University of Saskatchewan, Saskatoon, Saskatchewan S7N 5C9, Canada
2 Ontario Center for Structural Proteomics, University Health Network, Toronto, Ontario M5G 2C4, Canada
(RECEIVED November 17, 2006; FINAL REVISION December 11, 2006; ACCEPTED December 12, 2006)
| Abstract |
|---|
|
|
|---|
Keywords: transcription factor; winged-helix; crystal structure
| Introduction |
|---|
|
|
|---|
|
The structure consists of a dimer and shows similarities to the winged-helix motif adopted by other transcription factors, including the MarR family of proteins (Alekshun et al. 2001). All of these proteins form a biological dimer, and the structure of PA1607 is consistent with this, though the formation of the dimer is altered in PA1607. Additionally, some proteins with similar structure have been proposed to have a metal-binding site (Cook et al. 1998). This structure suggests that the metal-binding site is not conserved in PA1607. The potential roles for PA1607 as a DNA-binding proteins as either a metallo-regulatory protein or an oxidative stress sensor are discussed with respect to the similar structures.
| Results and Discussion |
|---|
|
|
|---|
/
motif, containing six
-helices, four
-strands, and one 310 helix (Fig. 1B). This fold is representative of the winged-helix repressor DNA-binding domain (CATH no. 1.10.10.10
[EC]
) (Orengo et al. 1997). Helices
5 and
6 form one long helix, with a 35° kink in the middle, likely induced by a highly conserved Pro at the N terminus of helix 6. The first two
-strands, along with
4 and
5 form the "winged-helix" motif (wHTH), with the two
-strands forming a short anti-parallel
-sheet. The other two strands form part of the dimer interface as described below. There are three
-turns located in the protein, as defined by DSSP (Kabsch and Sanders 1983). All three turns are classical type I turns, as defined by Hutchinson and Thornton (1994).
The structure contains four sulfate molecules, all associated with arginine residues (Arg39 and Arg117 of chain A, and Arg39 and Arg125 of chain B) on the surface of the protein. The sulfate at Arg125 is located on the twofold axis. Also located close to this position is sugar residue, modeled as
-D-glucose, likely from the sucrose used as a cryoprotectant, though the other portion of the sucrose could not be located in any density.
Dimer formation
PA1607 protein crystallizes as a dimer, consistent with the other members of this structural superfamily and many other transcription factors. The dimerization surface buries 31% of the solvent accessible surface area (ASA) of each monomer and includes 27 hydrogen bonds and two salt bridges (Jones and Thornton 1996; Table 1). The contacts consist mostly of hydrophobic atoms (67%) and the dimer is predominantly stabilized by contacts made by
-helices 1, 2, 5, and 6. Additional contacts are made by the extended loop at the C-terminal end of the protein. These contacts include the formation of two short anti-parallel
-sheets between the
3 strands from each monomer and the
4 strands (
3 and
3' form one sheet and
4 and
4' form another sheet) (Fig. 1B).
|
6 of the other monomer. The
5 and
6 helices form an extensive "anti-parallel" two-helix bundle across the dimer interface with
5' and
6' (Fig. 1B). The helix
6 in particular maintains a high degree of hydrophobic conservation (Fig. 1A) that appears to be essential for dimer formation.
Comparison to other structures
A search of the Protein Data Bank (Berman et al. 2000) using the Dali server (Holm and Sanders 1993) yields many structures with a Z-score over 7, almost all identified as transcription factors. Eight of these structures have a Z-score over 9, but none greater than 9.8. The best matches are MarR (PDB code: 1JGS
[PDB]
), a regulator of antibiotic resistance in Escherichia coli (Alekshun et al. 2001), a SlyA transcriptional regulator (PDB code: 1LJ9
[PDB]
) from Enterococcus faecalis (Wu et al. 2003), SmtB (PDB code: 1SMT), a metallothionein repressor protein from Synechococcus PCC7942 (Cook et al. 1998), and OhrO (PDB Code: 1Z91), a MarR-like protein from Bacillus subtilis (Hong et al. 2005). The sequence identity between PA1607 and each of these proteins is <20%. The DALI search shows that all of the similar structures have <23% identity with PA1607 (Fig. 2A). In each monomer, the section from
1 to
6 (residues 20100) is the most highly conserved structurally, including the kinked helix at
5/
6, which in some of the reports is considered to be one helix.
|
6 that plays a critical role in the dimer interface of PA1607 is not conserved as a hydrophobic helix in any of the similar structures and is not utilized in dimer formation in all structures. This results in very different orientations of the dimers for all of the structures (Fig. 2C). The structures where this helix does make extensive dimer contacts (1smt and 1z91) still do not retain similar overall dimer geometries. This can be seen by comparing the
-carbon RMSD values of each monomer of PA1607 with the corresponding monomer from the other proteins. The conserved structural core of PA1607 in monomer A (superimposed structures) fits well with each of the other structures (RMSD < 2.5 Å). However, when this superposition (Fig. 2C) is used to compare the same structural core of monomer B from each dimer, the RMSD values increase to between 16 Å and 41 Å (Table 1, last column).
DNA binding
Hong et al. (2005) determined the structure of OhrO bound to a 29-base-pair oligonucleotide and give insight into the binding of these types of protein to DNA. OhrO undergoes a conformational shift in the DNA-bound structure, with the majority of the changes taking place in the wHTH region (in PA1607, this is
4,
1,
2, and
5). These changes result in a 25° rotation and a 16-Å movement in the tips of the wings. In OhrO, the specific contacts between the DNA and protein are quite limitedthere are two regions that appear to be conserved as DNA-binding regions. These regions are also consistent with regions proposed to be DNA-binding regions for SmtB (Cook et al. 1998). In OhrO, the DNA-binding regions consist of side-chain contacts between Tyr65, Leu66, Asp67, Ser68, Thr70, Thr72, Lys76, Arg77 (see Fig. 2C, blue triangles), and the DNA, plus the carbonyl of Gly69 for region 1 and Arg86, Arg88, Asp92, Glu93, and Arg94 for region 2 (Fig. 2A, blue triangles). Asp92 and Arg94 are highly conserved in the MarR family members, and Arg94 has been shown to play an important role in DNA binding for MarR and MexR (Alekshun et al. 2001; Saito et al. 2003).
In PA1607, neither of these regions is conserved in the proteins similar to PA1607 (Fig. 1A), although there are conserved residues found in these regions (Fig. 1A, blue underlined regions), particularly in region 1. In region 2, the arginine residues that are important for DNA binding in the MarR family of proteins are missing from the PA1607 family. The lack of conserved residues, particularly arginines, in the tip of the wing of PA1607 that are seen in other wHTH proteins suggests that if PA1607 is actually a DNA-binding protein, the mode of recognition of and binding to DNA will be different than is seen in other members of this structural family.
Metal binding
In the report on the structure of SmtB (Cook et al. 1998), it was proposed that there are several Zn2+-binding sites located in the protein. SmtB was crystallized without Zn2+; however, soaks of the crystals with Hg2+ revealed the presence of four Hg2+-binding sites in the derivative structure and these sites are suggested to be the location of the Zn2+-binding sites. There is one independent site per dimer, consisting of residues Cys61, Asp64, and His97. The other two sites are found in the dimer interface, these sites being composed of residues Asp104 and His106 from one monomer and His117 and Glu120 from the other monomer (Fig. 2A, red stars). The structural alignment of the similar proteins shows that these residues are not conserved in PA1607.
Additionally, PA1607 contains only a single cysteine residue (Cys12) that, although highly conserved among similar proteins (Fig. 1A), is located away from the regions in SmtB that are believed to form the metal-binding sites. This cysteine residue is within 3.3 Å of a conserved arginine residue (Arg31) from the other monomer, making it more likely that this residue is playing a role in dimerization rather than metal binding. There is also a second conserved arginine from the other monomer (Arg97) located 5.0 Å from Cys12. This combination makes it unlikely that Cys12 is playing a role in metal binding.
Oxidative stress sensor
A cysteine residue near the N terminus in OhrO (Cys15) is a putative sensor of reactive oxygen species and acts as a regulator of the DNA binding of OhrO (Hong et al. 2005). In OhrO, the high reactivity of Cys15 is attributed to the presence of two conserved tyrosine residues (29 and 40from the other monomer) that are thought to lower the pKa, in conjunction with the positively charged end of the helix dipole (Wada 1976; Hol et al. 1978; Kortemme and Creighton 1995). In PA1607, the conserved Cys12 (Fig. 1A) could play a similar role in sensing oxidative stress. This cysteine residue is located between the two monomers and interacts with conserved arginine residues from the other monomer. These arginines would act to stabilize the negatively charged thiolate that would be the sensor residue. Oxidation of Cys12 could therefore act to signal in a manner analogous to Cys15 in OhrO.
Conclusions
Without any biochemical data, it is difficult to suggest any roles for PA1607 other than that it is likely to be a DNA-binding protein, based on the structure and sequence alignment. From the structural comparisons, it seems unlikely that PA1607 has a metal-binding site similar to SmtB; however, it is possible that this protein plays a role in sensing oxidative stress in an manner analogous to OhrO.
| Materials and methods |
|---|
|
|
|---|
Data collection
Diffraction data (Table 2) were collected at beamline 19ID of APS, Argonne National Laboratory, following the approach described earlier (Walsh et al. 1999). The two-wavelength inversed beam MAD inflection and peak wavelengths up to 1.85 Å were collected from one Se-Met labeled protein crystal at 100 K with 4-sec exposure/1°/frame using a 180-mm crystal-to-detector distance. The total oscillation range was 180°. All data were processed and scaled with HKL2000 (Otwinowski and Minor 1997). The space group was determined to be C2 with unit-cell dimensions of a = 46.88, b = 78.87, c = 78.94,
=
= 90, and
= 91.64. Calculation of the Matthews volume (Matthews 1968) indicated that the unit cell contained two monomers (Vm = 2.25 Å3Da1, solvent content = 45.4%), assuming a molecular weight of 16.2 kDa.
|
The peak wavelength datum was chosen for structure refinement as it had slightly better statistics than the inflection point wavelength (Table 2). Refinement was carried out using REFMAC5, using TLS and isotropic B-factor refinement. The final R-factor was 0.181 and the free R was 0.251 (Table 2). The N termini of both monomers were missing from any electron density maps and the missing N-terminal residues were not included in the structure refinement. Monomer A therefore starts from residue 5 (threonine) and monomer B from residue 6 (serine).
| Footnotes |
|---|
Article and publication are at http://www.proteinscience.org/cgi/doi/10.1110/ps.062668207.
| Acknowledgments |
|---|
|
|
|---|
| References |
|---|
|
|
|---|
Berman, H.M., Westbrook, J., Feng, Z., Gilliland, G., Bhat, T.N., Weissig, H., Shindyalov, I.N., and Bourne, P.E. 2000. The protein data bank. Nucleic Acids Res. 28: 235242.
Collaborative Computational Project, Number 4 1994. The CCP4 suite: Programs for protein crystallography. Acta Crystallogr. D Biol. Crystallogr. 50: 760763.[CrossRef][Medline]
Cook, W.J., Kar, S.R., Taylor, K.B., and Hall, L.M. 1998. Crystal structure of the cyanobacterial metallothionein repressor SmtB: A model for metalloregulatory proteins. J. Mol. Biol. 275: 337346.[CrossRef][Medline]
Delano, W.L.. 2002. The PyMOL molecular graphics system. Delano Scientific, San Carlos, CA.
Emsley, P. and Cowtan, K. 2004. COOT: Model-building tools for molecular graphics. Acta Cryst. D 60: 21262132.[CrossRef][Medline]
Gouet, P., Courcelle, E., Stuart, D.I., and Metoz, F. 1999. ESPript: Analysis of multiple sequence alignments in PostScript. Bioinformatics 15: 305308.
Hendrickson, W.A.. 1985. Analysis of protein structure from diffraction measurement at multiple wavelengths. Trans. Am. Cryst. Assn. 21: 1121.
Hol, W.G., van Duijnen, P.T., and Berendsen, H.J. 1978. The
-helix dipole and the properties of proteins. Nature 273: 443446.[CrossRef][Medline]
Holm, L. and Sanders, C. 1993. Dali ver. 2.0. J. Mol. Biol. 233: 123138.[CrossRef][Medline]
Hong, M., Fuangthong, M., Helmann, J.D., and Brennan, R.G. 2005. Structure of an OhrO-ohrA operator complex reveals the DNA binding mechanism of the MarR family. Mol. Cell. 20: 131141.[CrossRef][Medline]
Hutchinson, E.G. and Thornton, J.M. 1994. A revised set of potentials for
-turn formation in proteins. Protein Sci. 3: 22072216.[Abstract]
Jones, S. and Thornton, J.M. 1996. Principles of proteinprotein interactions derived from structural studies. Proc. Natl. Acad. Sci. 93: 1320.
Kabsch, W. and Sanders, C. 1983. Dictionary of protein secondary structure: Pattern recognition of hydrogen-bonded and geometrical features. Biopolymers 22: 25772637.[CrossRef][Medline]
Kortemme, T. and Creighton, T.E. 1995. Ionisation of cysteine residues at the termini of model
-helical peptides. Relevance to unusual thiol pKa values in proteins of the thioredoxin family. J. Mol. Biol. 253: 799812.[CrossRef][Medline]
Lamzin, V.S., Perrakis, A., and Wilson, K.S. 2001. The ARP/wARP suite of automated construction and refinement of protein models. In International tables for crystallography, vol. F: Crystallography of biological macromolecules (eds. M.G. Rossman and E. Arnold), pp. 720722. Kluwer Academic Publishers, Dordrecht, The Netherlands.
Marchler-Bauer, A. and Bryant, S.H. 2004. CD-search: Protein domain annotations on the fly. Nucleic Acids Res. 32: W327W331.
Matthews, B.W.. 1968. Solvent content of protein crystals. J. Mol. Biol. 33: 491497.[Medline]
Orengo, C.A., Michie, A.D., Jones, S., Jones, D.T., Swindells, M.B., and Thornton, J.M. 1997. CATHA hierarchic classification of protein domain structures. Structure 5: 10931108.[Medline]
Otwinowski, Z. and Minor, W. 1997. Processing of X-ray diffraction data collected in oscillation mode. Methods Enzymol. 276: 307326.
Pettersen, E.F., Goddard, T.D., Huang, C.C., Couch, G.S., Greenblatt, D.M., Meng, E.C., and Ferrin, T.E. 2004. UCSF ChimeraA visualization system for exploratory research and analysis. J. Comput. Chem. 25: 16051612.[CrossRef][Medline]
Saito, K., Akama, H., Yoshihara, E., and Nakae, T. 2003. Mutations affecting DNA-binding activity of the MecR repressor of mexRmexAmexBoprM operon expression. J. Bacteriol. 185: 61956198.
Terwilliger, T.. 2000. Maximum-likelihood density modification. Acta Crystallogr. D 56: 965972.[CrossRef][Medline]
Wada, A.. 1976. The
-helix as an electric macro-dipole. Adv. Biophys. 9: 163.[Medline]
Walsh, M.A., Evans, G., Sanishvili, R., Dementieva, I., and Joachimiak, A. 1999. MAD data collectionCurrent trends. Acta Crystallogr. D 55: 17261732.[CrossRef][Medline]
Weeks, C.M., Blessing, R.H., Miller, R., Mungee, R., Potter, S.A., Rappleye, J., Smith, G.D., Xu, H., and Furey, W. 2002. Towards automated protein structure determination: BnP, the BnP-PHASES interface. Z. Kristallogr. 217: 686693.[CrossRef]
Wu, R.-y., Zhang, R.-g., Zagnitkov, O., Dementieva, I., Maltzev, N., Watson, J.D., Laskowski, R., Gornicki, P., and Joachimiak, A. 2003. Crystal structure of Enterococcus faecalis SlyA-like transcriptional factor. J. Biol. Chem. 278: 2024020244.
Zhang, R.G., Skarina, T., Katz, J.E., Beasley, S., Khachatryan, A., Vyas, S., Arrowsmith, C.H., Clarke, S., Edwards, A., and Joachimiak, A., et al. 2001. Structure of Thermotoga maritima stationary phase survival protein SurE: A novel acid phosphatase. Structure 9: 10951106.[Medline]
![]()
CiteULike
Connotea
Del.icio.us
Digg
Reddit
Technorati What's this?
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |