|
|
||||||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
1 Macromolecular Crystallography Laboratory, National Cancer Institute at Frederick, Frederick, Maryland 21702-1201, USA
2 Basic Research Program, SAIC-Frederick, Frederick, Maryland 21702, USA
3 Laboratory of Cell Biology, National Cancer Institute, Bethesda, Maryland 20892, USA
4 Shemyakin-Ovchinnikov Institute of Bioorganic Chemistry, Russian Academy of Sciences, Moscow 117997, Russia
Reprint requests to: Alexander Wlodawer, National Cancer Institute, MCL, Building 536, Room 5, Frederick, MD 21702-1201, USA; e-mail: wlodawer{at}ncifcrf.gov; fax: (301) 846-6128.
(RECEIVED July 28, 2005; FINAL REVISION July 28, 2005; ACCEPTED August 10, 2005)
| Abstract |
|---|
|
|
|---|
-sheets and a single long
-helix. Analysis of recent PDB depositions identified a similar fold in BPP1347 (PDB code 1ZBO
[PDB]
), a 203-amino acid protein of unknown function from Bordetella parapertussis, crystallized as part of a structural genomics effort. BPP1347 shares sequence homology with Lon N-domains and with a family of other independently expressed proteins of unknown functions. We postulate that, as is the case in Lon proteases, this structural domain represents a general protein and polypeptide interaction domain. Keywords: ATP-dependent proteases; protein domain; structural genomics
Article published online ahead of print. Article and publication date are at http://www.proteinscience.org/cgi/doi/10.1110/ps.051736805.
| Introduction |
|---|
|
|
|---|
The enzymatic properties of Escherichia coli Lon protease (EcLon) have been extensively studied (Goldberg et al. 1994; Gottesman et al. 1997; Melnikov et al. 2000). Lon couples ATP hydrolysis to structural disruption and processive degradation of proteins into peptides of 512 amino acids. EcLon is active as an oligomer of identical 784-amino-acid polypeptide chains (Botos et al. 2004b). Three functional domains have been identified within each subunit: the central region (A domain), an ATPase belonging to the AAA+ superfamily (Neuwald et al. 1999); the C-terminal region (P domain), a unique serine protease with a serine-lysine catalytic dyad (Rotanova et al. 2004); and the ~300-amino-acid N-terminal region (N domain), which is divided into two or more subdomains and is expected by analogy with other ATP-dependent proteases to participate in recognition and binding of target proteins or their adaptors (Iyer et al. 2004; Rotanova et al. 2004). However, detailed analysis of the organization and structural interactions between the domains of Lon proteases is lacking. To date, crystal structures have been solved for a small
-helical portion of the EcLon A domain (residues 491584) (Botos et al. 2004b) and for the P domains of E. coli (Botos et al. 2004a), Methanococcus jannaschii (Im et al. 2004), and Archaeoglobus fulgidus (Botos et al. 2005) Lons. By contrast, no structural data for the N domain of Lon have been available until now, and its lack of sequence similarity to any proteins of known structure has prevented homology modeling. This report provides initial data on the structure of the N domain of EcLon.
| Results and Discussion |
|---|
|
|
|---|
|
-strands,
1 (916) and
2 (2631), are followed by the sole prominent helix,
1 (3445). Strand
3 (4954) leads to a very irregular turn that varies considerably between different crystallographically independent molecules (Fig. 2
2 is made by residues 6567. The remainder of the fragment is made of a long strand
4 (7182), even longer
5 (88105), which is slightly distorted in the middle in molecules D and E, and the terminal strand
6 (110116). Strands
1,
3,
4, and part of strand
5 form a mixed
-sheet, while strands
2, the rest of strand
5, and strand
4 form an anti-parallel sheet almost exactly perpendicular to the former. Finally, a third anti-parallel sheet is formed by strands 1, 6, and 5. A turn involving residues 8385 is also quite variable between different molecules (Fig. 2
|
|
pairs. Structure-based sequence alignment shows 22 identical residues in the aligned structures (19% identity). A similar degree of sequence homology can also be observed between EcLon and the C-terminal half of BPP1347 (Fig. 3
|
-helices with a topology that appears to be unique, since a DALI search found no structural homologs of this domain. Secondary structure prediction based on the sequences of >100 Lon proteins predicted a mostly helical structure for residues 119209 of Lon, with turns or loops between helices at nearly the precise positions observed in the structure of BBP1347. Thus, we postulate that the N-terminal domain of EcLon is likely to have the same overall structure as BBP1347.
The genome sequence of B. parapertussis encodes two full-length homologs of Lon, BPP1777 and BPP1033, and there is little reason to question that BPP1347 is a standalone homolog of the Lon N domain. The coding region of BPP1347 is followed by another open reading frame not related to Lon proteases and has numerous stop codons in the other reading frames. BPP1347 is representative of a large, diverse family of proteins or domains of larger proteins. A BLAST search using either EcLon -N209 or BPP1347 as a query revealed >100 hypothetical proteins that had significant homology to Lon N domain (e = 1060 106), but lacked AAA+ or Lon P domains and thus could not be ATP-dependent proteases. The conserved domains database on the National Center for Biotechnology Information (NCBI) Web site (http://www.ncbi.nlm.nih.gov/) identified these proteins as members of COG2802, a conserved domain related to Lon N domain (Fig. 3
). The regions of homology within the COG correspond to regions of similarity between EcLon -N209 and BBP1347.
It is not known if, and under what conditions, BBP1347 is expressed in vivo, as is the case for most proteins possessing this conserved domain. The N domain of EcLon apparently has protein binding ability, as shown by the ability of isolated N domains to inhibit Lon protease activity in vivo (F.S. Rasulova and M.R. Maurizi, unpubl.) and by decreased affinity of N domain-deleted forms of Lon for unfolded substrates such as casein (Rasulova et al. 1998; Roudiak and Shrader 1998). We speculate that the function of the Lon Nlike regions in the more distantly related proteins and domains also involves similar binding activity directed at unfolded proteins or polypeptides. The combination of structural genomics and targeted structural studies (Wlodawer 2005) presented here has provided significant new information that should help in establishing the role of the N-terminal domain of Lon proteases and perhaps provide a means of investigating the function of a novel protein family.
| Materials and methods |
|---|
|
|
|---|
After concentration to 10 mg/mL by ultrafiltration using a Centriprep apparatus with a YM3 membrane, LonN119 was screened for crystallization conditions (Jancarik and Kim 1991) by the hanging drop, vapor-diffusion method (McPherson 1982), using the Hampton (Hampton Research) and Wizard (Emerald Biostructures) screening kits. Diffraction-quality crystals were grown in 30% PEG 3000, 0.1M CHES (pH 9.5), as well as in 20% PEG 8000, 0.1M CHES (pH 9.5). The largest crystals grew in 14 d at room temperature to the size of 0.4 x 0.1 x 0.05 mm. Before flash freezing, the crystals were transferred into a cryoprotectant solution consisting of 80% mother liquor and 20% ethylene glycol.
The 2.4 Å data collected from an orthorhombic crystal of a SeMet derivative (Table 1
) were measured on beamline X9B in BNL at the wavelength of 0.9788 Å and processed with HKL2000 (Otwinowski and Minor 1997). SHELXD/E was used to solve the substructure and the AUTO-SHARP (Global Phasing Ltd.) for phasing and refinement, followed by SOLO-MON solvent flipping. The model was initially built automatically by the program RESOLVE (Terwilliger 2001), which located 441 residues164 with side chains and 277 without. The rest of the model was built manually into the initial map by using the program O (Jones and Kjeldgaard 1997). Only preliminary refinement of this model was attempted (Table 1
). The native X-ray data were collected from a monoclinic crystal on a Mar345 detector mounted on a Rigaku RU-200 rotating anode X-ray generator, operated at 50 kV and 100 mA. The Cu K
radiation was focused by an MSC/Osmic mirror system. Native data were collected to 2 Å resolution and were processed and scaled with HKL2000 (Otwinowski and Minor 1997; Table 1
). The native structure was solved by molecular replacement with program EPMR (Kissinger et al. 1999), with a search model consisting of a dimer created from monomers A and B of the orthorhombic structure. Initial rigid body refinement with CNS (Brünger et al. 1998), using maximum-likelihood targets, was followed by simulated annealing (Brünger et al. 1990) with Engh and Huber (1991) parameters. Final rounds of refinement were carried out with CNS, leading to a model with an R of 21.5% and Rfree (Brünger 1992) of 27.2% for all data between 30 and 2.03 Å resolution. The Ramachandran plot for the final structure, obtained with the program PROCHECK (Laskowski et al. 1993), showed 88% of the residues in the core region, 9.5% in the additionally allowed region, and 2.5% in generously allowed region. The coordinates and the structure factors have been submitted to the PDB with accession code 2ANE for immediate release.
| Footnotes |
|---|
6 Present address: Laboratory of Microbial Pathogenesis, Navy Medical Research Institute, Silver Spring, MD 20910, USA. ![]()
| Acknowledgments |
|---|
| References |
|---|
|
|
|---|
domain of E. coli Lon protease at 1.9 Å resolution. J. Struct. Biol. 146: 113122.[CrossRef][Medline]
Botos, I., Melnikov, E.E., Cherry, S., Tropea, J.E., Khalatova, A.G., Rasulova, F., Dauter, Z., Maurizi, M.R., Rotanova, T.V., Wlodawer, A., et al. 2004b. The catalytic domain of Escherichia coli Lon protease has a unique fold and a Ser-Lys dyad in the active site. J. Biol. Chem. 279: 81408148.
Botos, I., Melnikov, E.E., Cherry, S., Kozlov, S., Makhovskaya, O.V., Tropea, J.E., Gustchina, A., Rotanova, T.V., and Wlodawer, A. 2005. Atomic-resolution crystal structure of the proteolytic domain of Archaeoglobus fulgidus Lon reveals the conformational variability in the active sites of Lon proteases. J. Mol. Biol. 351: 144157.[CrossRef][Medline]
Brünger, A.T. 1992. The free R value: A novel statistical quantity for assessing the accuracy of crystal structures. Nature 355: 472474.[CrossRef]
Brünger, A.T., Krukowski, A., and Erickson, J.W. 1990. Slow-cooling protocols for crystallographic refinement by simulated annealing. Acta Crystallogr. A 46: 585593.
Brünger, A.T., Adams, P.D., Clore, G.M., DeLano, W.L., Gros, P., Grosse-Kunstleve, R.W., Jiang, J.S., Kuszewski, J., Nilges, M., Pannu, N.S., et al. 1998. Crystallography and NMR system: A new software suite for macromolecular structure determination. Acta Crystallogr. D Biol. Crystallogr. 54: 905921.[CrossRef][Medline]
Cohen, G.E. 1997. ALIGN: A program to superimpose protein coordinates, accounting for insertions and deletions. J. Appl. Crystallogr. 30: 11601161.
Engh, R. and Huber, R. 1991. Accurate bond and angle parameters for X-ray protein-structure refinement. Acta Crystallogr. A 47: 392400.[CrossRef]
Goldberg, A.L., Moerschell, R.P., Chung, C.H., and Maurizi, M.R. 1994. ATP-dependent protease La (lon) from Escherichia coli. Methods Enzymol. 244: 350375.[Medline]
Gottesman, S. and Maurizi, M.R. 1992. Regulation by proteolysis: Energy-dependent proteases and their targets. Microbiol. Rev. 56: 592621.
Gottesman, S., Wickner, S., and Maurizi, M.R. 1997. Protein quality control: Triage by chaperones and proteases. Genes & Dev. 11: 815823.
Holm, L. and Sander, C. 1993. Protein structure comparison by alignment of distance matrices. J. Mol. Biol. 233: 123138.[CrossRef][Medline]
Im, Y.J., Na, Y., Kang, G.B., Rho, S.H., Kim, M.K., Lee, J.H., Chung, C.H., and Eom, S.H. 2004. The active site of a Lon protease from Methanococcus jannaschii distinctly differs from the canonical catalytic dyad of Lon proteases. J. Biol. Chem. 279: 5345153457.
Iyer, L.M., Leipe, D.D., Koonin, E.V., and Aravind, L. 2004. Evolutionary history and higher order classification of AAA+ ATPases. J. Struct. Biol. 146: 1131.[CrossRef][Medline]
Jancarik, J. and Kim, S.H. 1991. Sparse matrix sampling: A screening method for crystallization of proteins. J. Appl. Crystallogr. 21: 916924.[CrossRef]
Jones, T.A. and Kjeldgaard, M. 1997. Electron-density map interpretation. Methods Enzymol. 277: 173208.
Kissinger, C.R., Gehlhaar, D.K., and Fogel, D.B. 1999. Rapid automated molecular replacement by evolutionary search. Acta Crystallogr. D Biol. Crystallogr. 55: 484491.[CrossRef][Medline]
Laskowski, R.A., MacArthur, M.W., Moss, D.S., and Thornton, J.M. 1993. PROCHECK: Program to check the stereochemical quality of protein structures. J. Appl. Crystallogr. 26: 283291.[CrossRef]
Marchler-Bauer, A. and Bryant S.H. 2004. CD-Search: Protein domain annotations on the fly. Nucleic Acids Res. 32: W327W331.
McPherson, A. 1982. Preparation and analysis of protein crystals. John Wiley and Sons, Inc., New York.
Melnikov, E.E., Tsirulnikov, K.B., and Rotanova, T.V. 2000. Coupling of proteolysis to ATP hydrolysis upon Escherichia coli Lon protease functioning, I: Kinetic aspects of ATP hydrolysis. Bioorg. Khim. 26: 530538.[Medline]
Neuwald, A.F., Aravind, L., Spouge, J.L., and Koonin, E.V. 1999. AAA+: A class of chaperone-like ATPases associated with the assembly, operation, and disassembly of protein complexes. Genome Res. 9: 2743.
Otwinowski, Z. and Minor, W. 1997. Processing of X-ray diffraction data collected in oscillation mode. Methods Enzymol. 276: 307326.
Rasulova, F.S., Dergousova, N.I., Melnikov, E.E., Ginodman, L.M., and Rotanova, T.V. 1998. Synthesis and characterisation of ATP-dependent forms of Lon-proteinase with modified N-terminal domain from Escherichia coli. Bioorg. Khim. 24: 370375.[Medline]
Rotanova, T.V., Melnikov, E.E., Khalatova, A.G., Makhovskaya, O.V., Botos, I., Wlodawer, A., and Gustchina, A. 2004. Classification of ATP-dependent proteases Lon and comparison of the active sites of their proteolytic domains. Eur. J. Biochem. 271: 48654871.[Medline]
Roudiak, S.G. and Shrader, T.E. 1998. Functional role of the N-terminal region of the Lon protease from Mycobacterium smegmatis. Biochemistry 37: 1125511263.[CrossRef][Medline]
Terwilliger, T.C. 2001. Map-likelihood phasing. Acta Crystallogr. D Biol. Crystallogr. 57: 17631775.[CrossRef][Medline]
Wlodawer, A. 2005. Giving credit where credit is due. Nat. Struct. Mol. Biol. 12: 634.[Medline]
![]()
CiteULike
Connotea
Del.icio.us
Digg
Reddit
Technorati What's this?
This article has been cited by other articles:
![]() |
M. Chruszcz, A. Wlodawer, and W. Minor Determination of Protein Structures--A Series of Fortunate Events Biophys. J., July 1, 2008; 95(1): 1 - 9. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. V. Rotanova, I. Botos, E. E. Melnikov, F. Rasulova, A. Gustchina, M. R. Maurizi, and A. Wlodawer Slicing a protease: Structural features of the ATP-dependent Lon proteases gleaned from investigations of isolated domains. Protein Sci., August 1, 2006; 15(8): 1815 - 1828. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |