|
|
||||||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1 Ontario Center for Structural Proteomics, University Health Network, University of Toronto, Toronto, Ontario M5G 1L7, Canada
2 Banting and Best Department of Medical Research, University of Toronto, Toronto, Ontario M5G 1L6, Canada
(RECEIVED February 13, 2006; FINAL REVISION February 13, 2006; ACCEPTED February 22, 2006)
| Abstract |
|---|
|
|
|---|
-helices and
-strands, with a six-stranded antiparallel
-sheet at the core. C-PhnF monomers form a dimer by establishing interdomain eight-strand
-sheets that include core antiparallel and N-terminal two-strand parallel
-sheets from each monomer. C-PhnF shares strong structural similarity with the chorismate lyase fold, which features a buried active site locked behind two helix-turn-helix loops. The structural comparison of the C-PhnF and UbiC proteins allows us to propose that a similar site in the PhnF structure is adapted for effector binding. Keywords: effector binding domain; transcriptional regulator; HutC; PhnF; chorismate lyase fold; GntR family
| Introduction |
|---|
|
|
|---|
This is true for the GntR family of transcriptional regulators, which gathers close to 2000 members in both bacterial and archael genomes (Haydon and Guest 1991; Rigali et al. 2002). The proteins in the GntR family share a characteristic version of the N-terminal winged helix-turn-helix (wHTH) DNA-binding domain. This output domain is coupled with the C-terminal signaling domain responding to a range of stimuli in the form of different small molecules. Recent sequence analysis of the GntR proteins revealed the presence of several distinct groups with different types of the C-terminal signaling domains (Rigali et al. 2002). According to this analysis, the GntR family has been divided into four major subfamilies, FadR, HutC, MocR, and YtrA, where each subfamily is categorized by a specific type of the C-terminal domain. While the structure of FadR alone and in complex with its effector and operator DNA has been recently determined (van Aalten et al. 2000, 2001; Xu et al. 2001), no structural information is available for the other three subfamilies of GntR-like regulators.
The HutC subfamily represents more then 30% of all GntR members. It had been named after the HutC regulator from Pseudomonas putida, which represses the expression of histidine utilization genes and is, in turn, inactivated by the binding of uroconate (Allison and Phillips 1990). Other characterized members of the HutC family include FarR from Escherichia coli, which regulates the expression of citric acid cycle genes and responds to long-chain fatty acids (Quail et al. 1994), the trehalose operon repressor TreR from Bacillus subtilis that is inhibited by trehalose-6-phosphate (Schock and Dahl 1996), and a number of proteins (KorSA, KorA, and TraR) that are repressors of the genes involved in conjugative plasmid transfer in Streptomyces species (Kendall and Cohen 1988; Hagege et al. 1993; Kataoka et al. 1994). PhnF from Escherichia coli, which is the focus of this article, belongs to the phn operon that is involved in transport and biodegradation of phosphonates, (Pn)-compounds having carbonphosphorous (CP) bonds (Metcalf and Wanner 1993). Based on its sequence similarity to the HutC proteins, PhnF is predicted to regulate the expression of the phn genes and respond to alkylphosphonates. However, its specific regulatory function, its ligand, or the operator sites have not yet been established. A project was undertaken aimed at obtaining structural information on the HutC subfamily ligand-binding domain using X-ray crystallography. In order to increase the chance of successful expression, purification, and crystallization of this protein, only the C-terminal ligand-binding domain, defined by bioinformatics analysis, was included.
| Results and Discussion |
|---|
|
|
|---|
|
-sheet, a two-stranded parallel
-sheet, and four short
helices. The six-stranded antiparallel
-sheet forms the core of the structure and was constructed by two
-hairpins (S2S4 and S5S7), which are linked by an
-hairpin (H2H3). The
-hairpin (H2H3) and the two-stranded parallel
-sheet (S1, S8) followed by a loop (L1) clasp the core
-sheet from one side, while helices H1 and H4 are positioned on the other side. The C-PhnF fold showed no similarity with the FadR structure confirming divergence between the FadR and HutC subfamilies of GntR regulators. On the other hand, the C-PhnF fold had the strongest match with the chorismate lyase (UbiC) structure (PDB code 1FW9; Fig. 2A), with a root-mean-square deviation (RMSD) 3.1 Å and z-score of 10.7 over 125 amino acids according to the Dali search (Heger and Holm 2003). A prediction of the structural similarity between the UbiC fold and HutC regulators' C-terminal domains had been recently made using sensitive sequence profile analysis (Aravind and Anantharaman 2003).
|
-sheet from one side (Fig. 2A). Determination of the C-PhnF structure allowed performing a comparison of these structures in detail.
The superimposable sections of the UbiC and C-PhnF structures are the central
-sheet, the
-hairpin (H2H3 in C-PhnF) and the
-helix (H1 in C-PhnF) inserted between the first and the second strand of the
-sheet (Fig. 2A), while the N and C termini of these proteins contain elements that cannot be matched. In the C-PhnF structure these elements primarily involve the two
-strands (S1 and S8 in Fig. 2A), which form a two-strand parallel
-sheet and participate in dimerization and connection with the N-terminal DNA binding domain in context of the full-length PhnF.
The C-PhnF domain crystallized as a symmetric dimer with a tight interdomain interface (Fig. 2B). This interface is formed by the association of the two-stranded parallel
-sheet (S1, S8) from one monomer and a six-stranded antiparallel
-sheet (S2S7) from the other monomer into an eight-stranded
-sheet (Fig. 2B). The intersubunit contacts are mediated by hydrogen bonding between S8 strand of one monomer and S5 strand of the other monomer. The dimer is an expected minimal functional unit for full-length PhnF, as this is a common feature among regulators with wHTH DNA-binding domains in general and characterized GntR family members in particular (Miwa and Fujita 1988; van Aalten et al. 2000; Kalivoda et al. 2003). Dimerization of the C-PhnF domain was confirmed by gel filtration results (for details, see Materials and Methods). Although the absence of the DNA-binding domain in the current structure does not permit full characterization of the PhnF multimerization mechanism, the tight interaction between the C-PhnF domains and the symmetry of their dimer strongly suggests that observed interactions are relevant in context of the full-length PhnF.
By comparison with characterized GntR regulators, the C-PhnF domain is expected to bind an effector molecule and transfer this signal to the DNA-binding domain. The nature of this effector remains unknown, although the involvement of PhnF in the regulation of genes responsible for uptake and degradation of alkylphosphonates (Metcalf and Wanner 1993) indicates that its stimuli might be an intermediate of alkylphosphonate metabolism. Structural similarity between C-PhnF and UbiC raises the question whether the location of the effector binding pocket in C-PhnF corresponds to the location of the active site in UbiC. None of the amino acids participating in the binding of the reaction product in UbiC are conserved in PhnF, which is not surprising, since PhnF is not expected to bind either chorismate or 4-hydroxybenzoate. On the other hand, using sequence analysis of various chorismate lyase orthologs and HutC family transcription factors Aravind and Anantharaman (2003) determined that there is weak conservation of certain types of amino acids within the PhnF region that aligns with the UbiC active site. In UbiC these conserved residues define the borders of the active site (Fig. 3), suggesting that in the HutC proteins corresponding amino acids might define the borders of the effector-binding pocket. The conserved amino acid types proposed by Aravind and Anantharaman (2003) include a bulky residue in the middle and toward the end of strand 2 (Arg76 and Leu80 in UbiC) and a polar residue in the middle of strand 3 (Thr92 in UbiC) (Fig. 3). In C-PhnF these residues correspond to Leu132, Arg136, and His148, respectively, and in agreement with the above prediction they are positioned in a similar fashion, with their side chains facing the same side of the central
-sheet (Fig. 3). Nevertheless, the potential pocket secured by these residues in the PhnF structure is largely occupied by the Ile146 side chain, which is situated prior to His148 on strand 3 (Fig. 3). In UbiC, the corresponding position is occupied by Gly90, leaving enough space for the reaction product (Fig. 3).
|
-helix formed in this region in one of them (Fig. 2B). This observation together with high B-factor (data not shown) for amino acids 9499 compared to the rest of C-PhnF polypeptide points to high flexibility of this region, which may be required for the effector binding.
Several residues conserved among known PhnF orthologs such as Leu94, Ser166, Arg181, Val210, and Ser228 outline the cavity proximal to the region corresponding to the UbiC active site (Fig. 3). This cavity is formed by L1, H2H3 loops, and
-strands 38, and is the largest cavity on the C-PhnF surface according to surface calculations by CASTp (Binkowski et al. 2003). Relativly low sequence conservation among PhnF proteins indicates that accumulation of conserved residues in this cavity is of functional relevance.
This structural analysis drove us to the conclusion that the PhnF region corresponding to UbiC active site is an attractive candidate for the effector-binding site, which may also include the large cavity formed on the C-PhnF surface.
Interestingly, the surface of one of the C-PhnF molecules contained electron density that could not be ascribed to the protein (Fig. 2A).
-D-fructopyranose was modeled into this density to facilitate crystallographic analysis, although no exogenous sugars were added during the purification or crystallization. The modeled molecule forms hydrogen bonds with the terminal amines of Arg231 (2.89, 2.81 Å) and the backbone oxygen of Glu200 (2.46 Å). It is unclear if the found molecule has any functional relevance or it is an artifact of purification or the crystallization process.
To conclude, the C-PhnF structure represents another example of a common fold adapted for enzymatic (UbiC-type) as well as regulatory (HutC-type) functions. According to the earlier prediction (Aravind and Anantharaman 2003), both activities arise from a common small molecule-binding domain, which in the case of HutC regulators had been fused to N-terminal DNA-binding domain and tailored to specifically recognize a diverse range of molecules (Allison and Phillips 1990; Quail et al. 1994; Schock and Dahl 1996). Presented data offers the first insight into the molecular mechanisms behind these evolutional processes, as well as provides the necessary structural scaffold for further biochemical and structural studies of HutC-type transcriptional regulators.
| Materials and methods |
|---|
|
|
|---|
Protein crystallization
The C-PhnF domain was crystallized by vapor diffusion in hanging drops by mixing 2 µL of the protein solution (20 mg/mL) with 2 µL of a 20% PEG3350, 0.2 M Mg formate, and 50 mM TRIS (pH 8.0) solution and equilibrated at 20°C over 500 µL of this solution. Crystals, which appeared after 1 d, were flash-frozen in liquid nitrogen prior to data collection.
Data collection, structure determination, and refinement
Diffraction data were collected at the 19ID beamline at the Advanced Photon Source, Argonne National Laboratory. The data set of 360 images complete to 2 Å at 0.97944 Å wavelength and another 180 images for higher resolution up to 1.7 Å at the same wavelength were collected from a single Se-Met-labeled protein crystal. The space group was P21 with the cell dimensions of a = 46.8 Å, b = 51.6 Å, c = 64.0 Å, and
= 103.6° (Table 1). The data were processed and phased using HKL2000 (Otwinowski and Minor 1997). The structure was determined by SAD phasing using SOLVE (Terwilliger and Berendzen 1999), and an initial model was built after density modification using RESOLVE (Terwilliger 2000). The model was improved using ARP/warp (Perrakis et al. 1999) and refined using REFMAC5 (Murshudov et al. 1997) at 1.7 Å resolution. The final R was 0.187 and the Rfree was 0.241. The refined model has an excellent geometry with no outliers according to PROCHECK (Morris et al. 1992).
|
Accession numbers
Coordinates and restraints have been deposited in the Protein Data Bank, Research Collaboratory for Structural Bioinformatics, Rutgers University, New Brunswick, NJ (http://www.rcsb.org/) under PDB code 2FA1.
| Footnotes |
|---|
Article published online ahead of print. Article and publication date are at http://www.proteinscience.org/cgi/doi/10.1110/ps.062146906.
Abbreviations: C-PhnF, C-terminal domain of PhnF; C-P, carbon-phosphate; DB, DNA binding domain; HTH, helix-turn-helix; SAD, single wavelength anomalous dispersion; Pn, Phosphonates; TEV protease, Tobacco Etch Virus protease; wHTH, winged helix-turn-helix.
| Acknowledgments |
|---|
| References |
|---|
|
|
|---|
Aravind L. and Anantharaman V. 2003. HutC/FarR-like bacterial transcription factors of the GntR family contain a small molecule-binding domain of the chorismate lyase fold FEMS Microbiol. Lett. 222: 1723.[CrossRef][Medline]
Binkowski T.A., Naghibzadeh S., Liang J. 2003. CASTp: Computed Atlas of Surface Topography of proteins Nucleic Acids Res. 31: 33523355.
Gallagher D.T., Mayhew M., Holden M.J., Howard A., Kim K.J., Vilker V.L. 2001. The crystal structure of chorismate lyase shows a new fold and a tightly retained product Proteins 44: 304311.[CrossRef][Medline]
Gallegos M.T., Schleif R., Bairoch A., Hofmann K., Ramos J.L. 1997. Arac/XylS family of transcriptional regulators Microbiol. Mol. Biol. Rev. 61: 393410.[Abstract]
Hagege J., Pernodet J.L., Sezonov G., Gerbaud C., Friedmann A., Guerineau M. 1993. Transfer functions of the conjugative integrating element pSAM2 from Streptomyces ambofaciens: Characterization of a kil-kor system associated with transfer J. Bacteriol. 175: 55295538.
Haydon D.J. and Guest J.R. 1991. A new family of bacterial regulatory proteins FEMS Microbiol. Lett. 63: 291295.[Medline]
Heger A. and Holm L. 2003. Exhaustive enumeration of protein domain families J. Mol. Biol. 328: 749767.[CrossRef][Medline]
Henikoff S., Haughn G.W., Calvo J.M., Wallace J.C. 1988. A large family of bacterial activator proteins Proc. Natl. Acad. Sci. 85: 66026606.
Kalivoda K.A., Steenbergen S.M., Vimr E.R., Plumbridge J. 2003. Regulation of sialic acid catabolism by the DNA binding protein NanR in Escherichia coli J. Bacteriol. 185: 48064815.
Kataoka M., Kosono S., Seki T., Yoshida T. 1994. Regulation of the transfer genes of Streptomyces plasmid pSN22: In vivo and in vitro study of the interaction of TraR with promoter regions J. Bacteriol. 176: 72917298.
Kendall K.J. and Cohen S.N. 1988. Complete nucleotide sequence of the Streptomyces lividans plasmid pIJ101 and correlation of the sequence with genetic properties J. Bacteriol. 170: 46344651.
Metcalf W.W. and Wanner B.L. 1993. Mutational analysis of an Escherichia coli fourteen-gene operon for phosphonate degradation, using TnphoA' elements J. Bacteriol. 175: 34303442.
Miwa Y. and Fujita Y. 1988. Purification and characterization of a repressor for the Bacillus subtilis gnt operon J. Biol. Chem. 263: 1325213257.
Morris A.L., MacArthur M.W., Hutchinson E.G., Thornton J.M. 1992. Stereochemical quality of protein structure coordinates Proteins 12: 345364.[CrossRef][Medline]
Murshudov G.N., Vagin A.A., Dodson E.J. 1997. Refinement of macromolecular structures by the maximum-likelihood method Acta Crystallogr. D Biol. Crystallogr. 53: 240255.[CrossRef][Medline]
Nasser W., Reverchon S., Condemine G., Robert-Baudouy J. 1994. Specific interactions of Erwinia chrysanthemi KdgR repressor with different operators of genes involved in pectinolysis J. Mol. Biol. 236: 427440.[CrossRef][Medline]
Otwinowski Z. and Minor W. In Processing of X-ray diffraction data collected in oscillation mode (eds. Carter Jr. C.W. and Sweet R.M.) . pp. 307326. 1997. Academic Press, New York.
Perez-Rueda E. and Collado-Vides J. 2000. The repertoire of DNA-binding transcriptional regulators in Escherichia coli K-12 Nucleic Acids Res. 28: 18381847.
Perez-Rueda E., Collado-Vides J., Segovia L. 2004. Phylogenetic distribution of DNA-binding transcription factors in bacteria and archaea Comput. Biol. Chem. 28: 341350.[CrossRef][Medline]
Perrakis A., Morris R., Lamzin V.S. 1999. Automated protein model building combined with iterative structure refinement Nat. Struct. Biol. 6: 458463.[CrossRef][Medline]
Quail M.A., Dempsey C.E., Guest J.R. 1994. Identification of a fatty acyl responsive regulator (FarR) in Escherichia coli FEBS Lett. 356: 183187.[CrossRef][Medline]
Ramos J.L., Martinez-Bueno M., Molina-Henares A.J., Teran W., Watanabe K., Zhang X., Gallegos M.T., Brennan R., Tobes R. 2005. The TetR family of transcriptional repressors Microbiol. Mol. Biol. Rev. 69: 326356.
Rigali S., Derouaux A., Giannotta F., Dusart J. 2002. Subdivision of the helix-turn-helix GntR family of bacterial regulators in the FadR, HutC, MocR, and YtrA subfamilies J. Biol. Chem. 277: 1250712515.
Rosinski J.A. and Atchley W.R. 1999. Molecular evolution of helix-turn-helix proteins J. Mol. Evol. 49: 301309.[CrossRef][Medline]
Schock F. and Dahl M.K. 1996. Expression of the tre operon of Bacillus subtilis 168 is regulated by the repressor TreR J. Bacteriol. 178: 45764581.
Terwilliger T.C. 2000. Maximum-likelihood density modification Acta Crystallogr. D Biol. Crystallogr. 56: 965972.[CrossRef][Medline]
Terwilliger T.C. and Berendzen J. 1999. Automated MAD and MIR structure solution Acta Crystallogr. D Biol. Crystallogr. 55: 849861.[CrossRef][Medline]
Ulrich L.E., Koonin E.V., Zhulin I.B. 2005. One-component systems dominate signal transduction in prokaryotes Trends Microbiol. 13: 5256.[CrossRef][Medline]
van Aalten D.M., DiRusso C.C., Knudsen J., Wierenga R.K. 2000. Crystal structure of FadR, a fatty acid-responsive transcription factor with a novel acyl coenzyme A-binding fold EMBO J. 19: 51675177.[CrossRef][Medline]
van Aalten D.M., DiRusso C.C., Knudsen J. 2001. The structural basis of acyl coenzyme A-dependent regulation of the transcription factor FadR EMBO J. 20: 20412050.[CrossRef][Medline]
Xu Y., Heath R.J., Li Z., Rock C.O., White S.W. 2001. The FadR.DNA complex. Transcriptional control of fatty acid metabolism in Escherichia coli J. Biol. Chem. 276: 1737317379.
Zhang R.G., Kim Y., Skarina T., Beasley S., Laskowski R., Arrowsmith C., Edwards A., Joachimiak A., Savchenko A. 2002. Crystal structure of Thermotoga maritima 0065, a member of the IclR transcriptional factor family J. Biol. Chem. 277: 1918319190.
![]()
CiteULike
Connotea
Del.icio.us
Digg
Reddit
Technorati What's this?
This article has been cited by other articles:
![]() |
S. Gebhard and G. M. Cook Differential Regulation of High-Affinity Phosphate Transport Systems of Mycobacterium smegmatis: Identification of PhnF, a Repressor of the phnDCE Operon J. Bacteriol., February 15, 2008; 190(4): 1335 - 1343. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y.-G. Gao, M. Yao, H. Itou, Y. Zhou, and I. Tanaka The structures of transcription factor CGL2947 from Corynebacterium glutamicum in two crystal forms: A novel homodimer assembling and the implication for effector-binding mode Protein Sci., September 1, 2007; 16(9): 1878 - 1886. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. L. Lorca, A. Ezersky, V. V. Lunin, J. R. Walker, S. Altamentova, E. Evdokimova, M. Vedadi, A. Bochkarev, and A. Savchenko Glyoxylate and Pyruvate Are Antagonistic Effectors of the Escherichia coli IclR Transcriptional Regulator J. Biol. Chem., June 1, 2007; 282(22): 16476 - 16491. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |