|
|
||||||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Center for Advanced Research in Biotechnology, University of Maryland Biotechnology Institute, Rockville, Maryland 20850, USA
Reprint requests to: John Orban, Center for Advanced Research in Bio-technology, University of Maryland Biotechnology Institute, 9600 Gudel-sky Drive, Rockville, MD 20850, USA; e-mail: orban{at}umbi.umd.edu; fax: (301) 738-6255.
(RECEIVED September 2, 2004; FINAL REVISION November 3, 2004; ACCEPTED November 4, 2004)
| Abstract |
|---|
|
|
|---|
sandwich architecture consisting of a central four-stranded
-sheet with the
2-helix packed against one side of the
-sheet and four
-helices (
1,
3,
4,
5) on the other side. There is structural homology with the eukaryotic matrix metalloproteases (MMPs), but little sequence similarity except for a conserved region containing three histidines that appears in both the MMPs and throughout the HI0004 family of proteins. The solution structure of HI0004 is compared with the X-ray structure of an Aquifex aeolicus homolog, AQ_1354, which has 36% sequence identity over 148 residues. Despite this level of sequence homology, significant differences exist between the two structures. These differences are described along with possible functional implications of the structures. Keywords: Haemophilus influenzae; NMR; structural genomics; MMP; hydrolase
Abbreviations: NMR, nuclear magnetic resonance NOE, nuclear Over-hauser effect NOESY, NOE spectroscopy EDTA, ethylenediaminetetra-acetic acid HSQC, heteronuclear single quantum coherence RMSD, root mean square deviation
Article published online ahead of print. Article and publication date are at http://www.proteinscience.org/cgi/doi/10.1110/ps.041096705.
| Introduction |
|---|
|
|
|---|
As part of our structural genomics effort, we describe here the solution NMR structure of the 154-residue protein HI0004 from Haemophilus influenzae. HI0004 belongs to a large family of more than 150 protein sequences with homologs spanning organisms from bacteria to humans. Recent mutagenesis experiments in Haemophilus influenzae indicate that HI0004 is an essential gene (Akerley et al. 2002), but no function has yet been assigned. The structural analysis presented here provides insights into the function of this protein. In addition, the NMR structure is compared with the recent X-ray structure of a homolog from Aquifex aeolicus with 36% sequence identity (Oganesyan et al. 2003). A significant difference in the position of one of the helical elements is observed between the two structures despite the relatively high sequence similarity. The possible functional relevance of these structural differences is discussed.
| Results and Discussion |
|---|
|
|
|---|
and
) were obtained from an analysis of chemical shift data using TALOS (Cornilescu et al. 1999). Hydrogen bond restraints were also used, but these were only employed at the later stages of refinement where the identity of hydrogen bond donoracceptor pairs was readily apparent. Residual dipolar couplings (RDCs) provided an independent verification of the solution structure (Bax 2003). The overall backbone RMSD (residues 4146) in the final ensemble of 20 structures is 0.96 Å, and the backbone RMSD in the secondary structure regions is 0.67 Å.
|
|
sandwich architecture consisting of a central four-stranded
-sheet with the
2-helix packed against one side of the
-sheet and four
-helices (
1,
3,
4,
5) on the other side (Fig. 2A,B
1-,
4-, and
5-helices make extensive contacts with the
-sheet and each other, whereas the long axis of
3 is oriented almost perpendicularly to the
-sheet. The arrangement of the secondary structure elements is
with a
-strand order of 1243. The
1-,
2-, and
4-strands are parallel and
3 is antiparallel.
|
1
2,
3
4, and
4
5 are particularly mobile based on 15N-transverse relaxation rates (Fig. 1B
2 and the first turn of
5 also exhibit motions on the nanosecond-to-picosecond timescale. In addition, the C-terminal residues 147154 and the N-terminal residues 13 are disordered and have few detectable inter-residue NOE contacts (Fig. 1A
Structural homologs of HI0004
An initial search for structural homologs of HI0004 using CE (Shindyalov and Bourne 1998) revealed that the closest structures were matrix metalloprotease (1mmr, Z-score 4.2) and human fibroblast collagenase (1ayk, Z-score 4.2). These proteins are both zinc-dependent matrix metalloproteases (MMPs), share a similar
-sheet topology with HI0004, and have comparable locations for two of their helices. The C
superposition of these structures with HI0004 gave RMSDs of 3.9 Å. The structural homology with MMPs suggests a hydrolase function for HI0004. Although there are structural similarities, very little sequence homology (8%14%) exists between the eukaryotic MMPs and HI0004. The exception is a conserved region containing three His residues that appears in both the MMPs and throughout the HI0004 family of proteins (Fig. 3
). In the MMP structures, the three histidines are close to each other and bind Zn2+ with tetrahedral coordination, the fourth ligand being a water molecule. The catalytic zinc in the active site is essential for full protease activity. In the structure of HI0004, two of the histidines, H114 and H118, are proximal but the third, H124, is over 10 Å away (Fig. 2B
). However, it must be emphasized that the NMR structure of HI0004 was determined in the absence of zinc ions due to the use of EDTA in the purification buffers (Cheon Yeh et al. 2004).
|
Comparison with the X-ray structure of AQ_1354 from Aquifex aeolicus
The HI0004 and AQ_1354 structures both have the same number and order of secondary structure elements (Fig. 2B,C
). The arrangement of the four
-strands is the same and four of the five
-helices are positioned similarly. Superposition of the C
atoms of HI0004 and AQ_1354 gives a RMSD of 3.4 Å (Z-score 5.3). However, there is an obvious difference in the position of the
5-helix. In the HI0004 NMR structure,
5 is oriented at an angle of about 30° with respect to the
-strands, is between the
1- and
4-helices, and has some contacts to the
-sheet (Fig. 2B
). By contrast, the
5-helix in the AQ_1354 X-ray structure is approximately perpendicular to the
-strands, packing on top of
1 and
4, and making no contacts with the
-sheet (Fig. 2C
). The position of the
5-helix in HI0004 is well-defined and is constrained by more than 40 long-range reciprocal NOE contacts. These NOEs are inconsistent with the conformation of the
5-helix seen in the X-ray structure.
Other differences are also noteworthy. The
1-helix is shorter in the NMR structure of HI0004 (10 residues vs. 16 in AQ_1354) and is also oriented at about 80° relative to
4, allowing
5 to pack between these two helices. In AQ_1354, the
1- and
4-helices are more closely packed and almost parallel. Further, the
1
1 loop is longer in HI0004 (10 residues vs. 6 in AQ_1354) as is the
1
2 loop (6 residues vs. 2 in AQ_1354). The orientation of
2, the lone helix on the other side of the
-sheet, is more parallel with the
-strands in the NMR structure of HI0004. The
3-and
4-strands are shorter in HI0004, leaving a longer, flexible loop (10 residues vs. 4 in AQ_1354) connecting these strands.
Since both the X-ray and NMR structures were determined in the absence of zinc, it is not yet clear why there is such a difference in their structures. This may be due to the different techniques used for structure determination or there may be sufficient differences in the sequences of HI0004 and AQ_1354 to cause the structural variance. For example, A109 is located in the
4-helix of HI0004, and its buried methyl group has extensive hydrophobic contacts with A28, I140, and F146. These interactions help to constrain the position of the
5-helix between
4 and
1 in the NMR structure. In AQ_1354, the corresponding residue to A109 is K110, and this would be expected to disrupt an equivalent set of hydrophobic interactions for steric and electronic reasons. Other residues may also possibly contribute to the structural differences.
Binding of Zn2+ to HI0004
Since the structurally related MMPs are zinc-dependent, we examined the effect of Zn2+ on HI0004 using chemical shift perturbation. Binding of zinc to the three conserved histidine residues in the HI0004 NMR structure (H114, H118, and H124) would require a conformational change, since H124, located in the
4
5 loop, is well removed from the other two conserved histidines in the
4-helix, H114, and H118 (Fig. 2B
). In contrast, the corresponding three conserved histidines in the X-ray structure of AQ_1354 (H115, H119, and H125) are in close proximity so that little change from this conformation would be needed for zinc binding (Fig. 2C
). Figure 4A
shows that significant shift changes were detected for some peaks in the 15N HSQC spectrum of HI0004 upon zinc addition. Binding of zinc resulted in two distinct sets of peaks in the HSQC spectrum corresponding with the free and bound forms of HI0004 in slow exchange on the chemical shift time scale. The HSQC spectrum did not change beyond the addition of 1.4 equivalents of Zn2+. Since peak positions could not be followed as a function of zinc concentration due to slow exchange, backbone resonances were assigned for the zinc-bound form of HI0004 using standard methods (Grzesiek and Bax 1992a,b). The chemical shift changes are shown as a function of sequence in Figure 4B
and are mapped onto the HI0004 structure in Figure 4C
. Changes in chemical shift occur for the conserved histidines H114, H118, and H124 and adjacent residues as might be expected. However, shift perturbations are also observed in the
4
5 loop,
5-helix,
1-helix, and
-sheet. Indeed, many of the residues with shift changes either make contacts to or are in the
5-helix or
4
5 loop. In addition, a large change is observed in the main chain C
shift of P149, which is in the disordered C-terminal tail of HI0004. The shift perturbation results suggest that a structural change involving the
4
5 loop and
5-helix may occur upon zinc binding.
|
5-helix from its position in the NMR structure to a conformation similar to that observed in the X-ray structure of AQ_1354.
|
5-helix. Consequently, there is only a small region of corresponding exposed hydrophobic surface for HI0004. Based on the RDC results, the conformation of the zinc-bound form of HI0004 is likely similar to the X-ray structure of AQ_1354, and this is probably the active conformation. Therefore, it appears that zinc-binding to HI0004 could serve at least two functions: It may provide the catalytic site metal ion required for enzymatic activity and may also promote the conformational change needed to expose the hydrophobic patch. This solvent-accessible hydrophobic surface is likely to be biologically relevant for molecular recognition since it is both well conserved and next to the putative active site. Although the specific enzymatic function of these proteins is still unknown (Oganesyan et al. 2003), the structural analysis presented here suggests that the substrate(s) for AQ_1354 and HI0004 may have a significant hydrophobic component. Moreover, it has been noted that homologs of HI0004 are found on a short operon together with diacylglycerol kinase in other organisms (Chen et al. 1998), raising the possibility that proteins in this family may act on intermediates in membrane lipid biosynthetic pathways. Further work will be needed to test these ideas.
|
| Materials and methods |
|---|
|
|
|---|
15N-transverse relaxation rates (R2) were obtained from 10 CPMG experiments, with relaxation delays ranging from 16 msec to 144 msec. Experiments were acquired in two single interleaved matrices to ensure uniformity of experimental conditions. Steady-state {1H}-15N NOEs were determined from the peak intensity ratio of two experiments, with and without saturation. The relaxation delay was 2 sec, with 3 sec of saturation for NOE experiments. A reference spectrum was acquired with a 5-sec relaxation delay. For titration experiments, a zinc chloride solution was added to HI0004 in 10 mM Tris buffer (pH 7.0), in steps of 0.2 equivalents up to 2 mol equivalents and a 15N HSQC spectrum was recorded after each addition. The total chemical shift change upon zinc addition was calculated using equation 1
![]() | (1) |
with the following weighting factors WHN = 1, WN = 0.154, and WC
= 0.276 (Ayed et al. 2001).
Main chain amide RDCs (1DNH) were measured using a 1H-15N IPAP-HSQC experiment (Ottiger et al. 1998) with Pf1 phage as the alignment medium (Hansen et al. 1998). Resolved RDCs of all NH bond vectors in well-defined areas of secondary structure were used in combination with the high resolution NOE-derived structure to calculate the axial and rhombic components of the alignment tensor using PALES (Zweckstetter and Bax 2000). Theoretical RDCs were calculated using the singular value decomposition method (Losonczi et al. 1999) with either the NMR structure of HI0004 or a homology model of HI0004 based on the X-ray structure of AQ_1354 as inputs. Q-factors were determined as described previously (Bax 2003).
Structure calculations
Interproton distance restraints were obtained from 3D 15N-NOESY (
m 100 msec) and 3D 13C-NOESY (
m 100 msec) spectra run on a sample in water and a homonuclear 2D NOESY (
m 120 msec) spectrum collected on a sample in D2O. Distance restraints were divided into five classes based on peak intensities: very strong (1.82.7 Å), strong (1.83.1 Å), medium (1.83.5 Å), weak (2.34.2 Å), and very weak (2.85.0 Å). For equivalent protons or nonstereospecifically assigned protons, pseudoatoms were introduced. Prochiral groups were given floating stereospecific assignments until they could be unambiguously assigned from the structure. Backbone dihedral angle restraints were identified using TALOS (Cornilescu et al. 1999) and chemical shift data. Amide hydrogen exchange was investigated by recording 15N HSQC spectra of lyophilized protein freshly dissolved in D2O. Hydrogen bond restraints were derived using standard criteria on the basis of the amide deuterium exchange experiment, chemical shifts, and NOE data (Parsons et al. 2001). Structures were calculated initially from an extended polypeptide chain using CNS 1.1 (Brunger et al. 1998) and standard simulated annealing and torsion angle dynamics protocols. Final force constants were 1000 kcal mol1 Å2 for bond angles, 500 kcal mol1 rad2 for angles and improper torsions, 40 kcal mol1 Å2 for experimental distance restraints, 200 kcal mol1 rad2 for dihedral angle restraints, and 4 kcal mol1 Å4 for the van der Waals repulsion term. The 20 best structures were chosen based on a low total energy, no NOE distance violations greater than 0.35 Å, fewer than 10 bad contacts per 100 residues, and a dihedral angle G factor greater than 0.5. Structures were analyzed with PROCHECK (Laskowski et al. 1996) and QUANTA (Molecular Simulations Inc.) and displayed with PyMOL (DeLano Scientific).
Protein Data Bank accession code
Atomic coordinates for the ensemble of 20 structures have been deposited in the RCSB Protein Data Bank (accession code 1xax).
| Acknowledgments |
|---|
| References |
|---|
|
|
|---|
Ayed, A., Mulder, F.A., Yi, G.S., Lu, Y., Kay, L.E., and Arrowsmith, C.H. 2001. Latent and active p53 are identical in conformation. Nat. Struct. Biol. 8: 756760.[CrossRef][Medline]
Bax, A. 2003. Weak alignment offers new NMR opportunities to study protein structure and dynamics. Protein Sci. 12: 116.
Brunger, A.T., Adams, P.D., Clore, G.M., DeLano, W.L., Gros, P., Grosse, K.R., Jiang, J.S., Kuszewski, J., Nilges, M., Pannu, N.S., et al. 1998. Crystallography & NMR system: A new software suite for macromolecular structure determination. Acta Crystallogr. D Biol. Crystallogr. 54: 905921.[CrossRef][Medline]
Chen, P., Novak, J., Qi, F., and Caufield, P.W. 1998. Diacylglycerol kinase is involved in regulation of expression of the lantibiotic mutacin II of Streptococcus mutans. J. Bacteriol. 180: 167170.
Cheon Yeh, D., Parsons, J.F., Parsons, L.M., Liu, F., Eisenstein, E., and Orban, J. 2004. NMR assignment of the hypothetical protein HI0004 from Haemophilus influenzaeA putative essential gene product. J. Biomol. NMR 29: 101102.[CrossRef][Medline]
Cornilescu, G., Delaglio, F., and Bax, A. 1999. Protein backbone angle restraints from searching a database for chemical shift and sequence homology. J. Biomol. NMR 13: 289302.[CrossRef][Medline]
Delaglio, F., Grzesiek, S., Vuister, G.W., Zhu, G., Pfeifer, J., and Bax, A. 1995. NMRPipe: A multidimensional spectral processing system based on UNIX pipes. J. Biomol. NMR 6: 277293.[Medline]
Eisenstein, E., Gilliland, G.L., Herzberg, O., Moult, J., Orban, J., Poljak, R.J., Banerjei, L., Richardson, D., and Howard, A.J. 2000. Biological function made crystal clearAnnotation of hypothetical proteins via structural genomics. Curr. Opin. Biotechnol. 11: 2530.[CrossRef][Medline]
Fleischmann, R.D., Adams, M.D., White, O., Clayton, R.A., Kirkness, E.F., Kerlavage, A.R., Bult, C.J., Tomb, J.-F., Dougherty, B.A., Merrick, J.M., et al. 1995. Whole-genome random sequencing and assembly of Haemophilus influenzae Rd. Science 269: 496512.
Grzesiek, S. and Bax, A. 1992a. An efficient experiment for sequential backbone assignment of medium-sized isotopically enriched proteins. J. Magn. Reson. 99: 201207.
. 1992b. Improved 3D triple-resonance NMR techniques applied to a 31 kDa protein. J. Magn. Reson. 96: 432440.
Hansen, M.R., Mueller, L., and Pardi, A. 1998. Tunable alignment of macromolecules by filamentous phage yields dipolar coupling interactions. Nat. Struct. Biol. 5: 10651074.[CrossRef][Medline]
Laskowski, R.A., Rullmann, J.A., MacArthur, M.W., Kaptein, R., and Thornton, J.M. 1996. AQUA and PROCHECK-NMR: Programs for checking the quality of protein structures solved by NMR. J. Biomol. NMR 8: 477486.[Medline]
Lim, K., Zhang, H., Tempczyk, A., Bonander, N., Toedt, J., Howard, A., Eisenstein, E., and Herzberg, O. 2001. Crystal structure of YecO from Haemophilus influenzae (HI0319) reveals a methyltransferase fold and a bound S-adenosylhomocysteine. Proteins 45: 397407.[CrossRef][Medline]
Losonczi, J.A., Andrec, M., Fischer, M.W.F., and Prestegard, J.H. 1999. Order matrix analysis of residual dipolar couplings using singular value decomposition. J. Magn. Reson. 138: 334342.[CrossRef][Medline]
Oganesyan, V., Busso, D., Brandsen, J., Chen, S., Jancarik, J., Kim, R. and Kim, S.H. 2003. Structure of the hypothetical protein AQ_1354 from Aquifex aeolicus. Acta Crystallogr. D Biol. Crystallogr. 59: 12191223.[CrossRef][Medline]
Ottiger, M., Delaglio, F., and Bax, A. 1998. Measurement of J and dipolar couplings from simplified two-dimensional NMR spectra. J. Magn. Reson. 131: 373378.[CrossRef][Medline]
Parsons, L. and Orban, J. 2004. Structural genomics and the metabolome: Combining computational and NMR methods to identify target ligands. Curr. Opin. Drug Discov. Dev. 7: 6268.[Medline]
Parsons, L., Eisenstein, E., and Orban, J. 2001. Solution structure of HI0257, a bacterial ribosome binding protein. Biochemistry 40: 1097910986.[CrossRef][Medline]
Parsons, L.M., Yeh, D.C., and Orban, J. 2004. Solution structure of the highly acidic protein HI1450 from Haemophilus influenzae, a putative double-stranded DNA mimic. Proteins 54: 375383.[CrossRef][Medline]
Shindyalov, I.N. and Bourne, P.E. 1998. Protein structure alignment by incremental combinatorial extension (CE) of the optimal path. Protein Eng. 11: 739747.
Thompson, J.D., Higgins, D.G., and Gibson, T.J. 1994. CLUSTAL W: Improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 22: 46734680.
Zweckstetter, M. and Bax, A. 2000. Prediction of sterically induced alignment in a dilute liquid crystalline phase: Aid to protein structure determination by NMR. J. Am. Chem. Soc. 122: 37913792.[CrossRef]
![]()
CiteULike
Connotea
Del.icio.us
Digg
Reddit
Technorati What's this?
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |