Protein Science
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


Published online before print February 2, 2005, 10.1110/ps.04975305
Protein Science (2005), 14:756-764. Published by Cold Spring Harbor Laboratory Press. Copyright © 2005 The Protein Society
This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
ps.04975305v1
14/3/756    most recent
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Nameki, N.
Right arrow Articles by Yokoyama, S.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Nameki, N.
Right arrow Articles by Yokoyama, S.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?

Solution structure of the PWWP domain of the hepatoma-derived growth factor family

Nobukazu Nameki1, Naoya Tochio1, Seizo Koshiba1, Makoto Inoue1, Takashi Yabuki1, Masaaki Aoki1, Eiko Seki1, Takayoshi Matsuda1, Yukiko Fujikura1, Miyuki Saito1, Masaomi Ikari1, Megumi Watanabe1, Takaho Terada1, Mikako Shirouzu1, Mayumi Yoshida1, Hiroshi Hirota1, Akiko Tanaka1, Yoshihide Hayashizaki1, Peter Güntert2, Takanori Kigawa1 and Shigeyuki Yokoyama1,3,4

1 RIKEN Genomic Sciences Center, 1-7-22 Suehiro-cho, Tsurumi, Yokohama 230-0045, Japan
2 Tatsuo Miyazawa Memorial Program, RIKEN Genomic Sciences Center, 1-7-22 Suehiro-cho, Tsurumi, Yokohama 230-0045, Japan
3 RIKEN Harima Institute at SPring-8, 1-1-1 Kouto, Mikazuki-cho, Sayo, Hyogo 679-5148, Japan
4 Department of Biophysics and Biochemistry, Graduate School of Science, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-0033, Japan

Reprint requests to: Shigeyuki Yokoyama, Department of Biophysics and Biochemistry, Graduate School of Science, The University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-0033, Japan; e-mail: yokoyama{at}biochem.s.u-tokyo.ac.jp; fax: +81-45–503-9195.

(RECEIVED July 6, 2004; FINAL REVISION October 4, 2004; ACCEPTED November 9, 2004)


    Abstract
 TOP
 Abstract
 Introduction
 Results and Discussion
 Materials and methods
 References
 
Among the many PWWP-containing proteins, the largest group of homologous proteins is related to hepatoma-derived growth factor (HDGF). Within a well-conserved region at the extreme N-terminus, HDGF and five HDGF-related proteins (HRPs) always have a PWWP domain, which is a module found in many chromatin-associated proteins. In this study, we determined the solution structure of the PWWP domain of HDGF-related protein-3 (HRP-3) by NMR spectroscopy. The structure consists of a five-stranded {beta}-barrel with a PWWP-specific long loop connecting {beta}2 and {beta}3 (PR-loop), followed by a helical region including two {alpha}-helices. Its structure was found to have a characteristic solvent-exposed hydrophobic cavity, which is composed of an abundance of aromatic residues in the {beta}1/{beta}2 loop ({beta}-{beta} arch) and the {beta}3/{beta}4 loop. A similar ligand binding cavity occurs at the corresponding position in the Tudor, chromo, and MBT domains, which have structural and probable evolutionary relationships with PWWP domains. These findings suggest that the PWWP domains of the HDGF family bind to some component of chromatin via the cavity.

Keywords: NMR; HDGF; HATH region; PR-loop; {beta}-{beta} arch; cavity; protein structure

Article published online ahead of print. Article and publication date are at http://www.proteinscience.org/cgi/doi/10.1110/ps.04975305.


    Introduction
 TOP
 Abstract
 Introduction
 Results and Discussion
 Materials and methods
 References
 
The PWWP domain, which was named after the relatively well-conserved residues, Pro-Trp-Trp-Pro, is composed of ~90 residues and is present within all eukaryotes (Stec et al. 1998). PWWP domains have been found in >60 proteins, which are involved in the processes of transcriptional regulation, DNA repair, and DNA methylation (Stec et al. 2000).

Recently, the function of the PWWP domain as a chromatin targeting module has been indicated in the mammalian DNA methyltransferases, Dnmt3a and Dnmt3b, which methylate genomic DNA during gametogenesis and early embryonic development (Ge et al. 2004). Experiments using mammalian cells have shown that the PWWP domain is required for the association of Dnmt3a and Dnmt3b with chromatin, suggesting that the PWWP-mediated chromatin targeting is essential for the enzyme function during development. The functional importance of the PWWP-mediated chromatin targeting was also suggested by the fact that a missense mutation in the PWWP domain causes ICF (immunodeficiency, centromeric heterochromatin instability, and facial anomalies) syndrome, which is characterized by hypomethylation in classical satellite DNA (Shirohzu et al. 2002). Experiments revealed that the mutant protein had completely lost its chromatin targeting capacity (Ge et al. 2004). However, the ligand and its functional interaction mechanism are still unknown.

Among the PWWP-containing proteins, the largest homologous group is related to the hepatoma-derived growth factor (HDGF), which has growth-stimulating activity in several specific cells (Stec et al. 2000). The PWWP domain always occurs at the N-termini of HDGF and the HDGF-related proteins (HRPs). Its sequence is well conserved in the HDGF family (Fig. 1AGo), and is also termed the HATH (homologous to amino terminus of HDGF) region (Izumoto et al. 1997). In contrast, the region outside the PWWP domain has different sequences in the HDGF family, and is termed a gene-specific region.



View larger version (65K):
[in this window]
[in a new window]
 
Figure 1. (A) Sequence alignment of the N-terminal regions of the HDGF family members. Accession codes used are as follows: in Mus musculus, HRP-3, NP_038914 [GenBank] ; HDGF, BAA09838 [GenBank] HRP-2, NP_032259 [GenBank] ; LEDGF, NP_598709 [GenBank] ; HRP-1, NP_032258 [GenBank] ; in Bos taurus, HRP-4, CAB40348 [GenBank] Gray-colored residues indicate that more than half of the six residues in the same position are identical [the following residues in parentheses are grouped into the same type: (I, L, M, V) and (E, D)]. The PWWP motif is indicated by a black background. (B) Alignment of the Pfam PWWP domains (Bateman et al. 2004) shown in the HMM-Logo format, where the tallest residues are invariant (Schuster-Böckler et al. 2004). The sequence of the PWWP domain of HRP-3 is shown below with the secondary structural elements colored. Circles and asterisks indicate highly conserved residues involved in the interactions between {alpha}3 and the {beta}-barrel substructure, and the formation of the cavity, respectively, as described in the text.

 
The first member, HDGF, was initially isolated as a heparin binding growth factor from the conditioned medium of HuH-7, a hepatoma cell line. It is expressed in various normal tissues and is amplified in several tumor cell lines (Wanschura et al. 1996; Everett 2001). HDGF participates in many cellular processes, such as astrocyte proliferation (Matsuyama et al. 2001), renal development (Oliver and Al-Awqati 1998), vascular lesion formation (Everett et al. 2000), and cardiovascular differentiation (Everett 2001). HDGF has mitogenetic activity for the growth of a variety of cells, including fibroblasts, hepatoma cells, vascular smooth muscle cells, and endothelial cells (Nakamura et al. 1994; Oliver and Al-Awqati 1998; Everett et al. 2000). HDGF contains two bipartite nuclear localization signals, but lacks a hydrophobic signal sequence for secretion (Everett et al. 2001; Kishima et al. 2002). The nuclear localization signals were shown to be required for HDGF to stimulate DNA replication (Everett et al. 2001; Kishima et al. 2002).

Five HDGF-related proteins (HRPs) have been identified thus far: mouse HRP-1 (Izumoto et al. 1997), mouse HRP-2 (Izumoto et al. 1997), human and mouse HRP-3 (Ikegame et al. 1999; Abouzied et al. 2004), bovine HRP-4 (Dietz et al. 2002), and human LEDGF (lens epithelium-derived growth factor) (Singh et al. 2000a). In addition to the PWWP domain at the N-terminus, these proteins always have putative bipartite nuclear localization signals. The mitogenic activity was confirmed with HRP-3 in fibroblasts and renal epithelial cells, HRP-4 in fibroblasts, and LEDGF in lens epithelial cells, keratinocytes, and fibroblasts. The mRNA expression shows a broad tissue distribution of HRP-2 as well as HDGF, whereas those of HRP-1 and HRP-4 are restricted to testis and that of HRP-3 is restricted to brain. All of the HRPs, as well as HDGF, were detected in the nuclei. Presumably, the HDGF family proteins are primarily targeted to the nucleus via the nuclear localization signals. Except for their growth factor activities, the functions of HRPs 1–4 are largely unknown, while the different functions of LEDGF have recently been revealed. By virtue of its homology to HDGF, the protein was originally found to be a novel growth factor on lens epithelial cells, keratinocytes, and fibroblasts, and hence it was termed LEDGF (Singh et al. 2000b). In addition, LEDGF has been shown to be a survival factor and a transcriptional activator, which rescues cells under stress by activating the expression of stress-related genes via its binding to their promoter elements (Singh et al. 2000b; Shinohara et al. 2002).

To obtain insights into the functions of the PWWP domains of the HDGF family, we determined the solution structure of the PWWP domain of mouse HRP-3 by heteronuclear NMR methods. Detailed structural comparisons with several domains having probable evolutionary relationships revealed a characteristic cavity as a putative binding site of the PWWP domain as well as the distinguishing structural features.


    Results and Discussion
 TOP
 Abstract
 Introduction
 Results and Discussion
 Materials and methods
 References
 
Resonance assignments and structural description
Samples of the 13C/15N-labeled PWWP domain of HRP-3, composed of 96 residues, were prepared for structure determination by a cell-free protein expression system. The protein sample has tag sequences (14 residues in total) at the N-terminus (GSSGSSG) and the C-terminus (SGPSSG), which are both derived from the expression vector. NMR resonances were assigned using conventional heteronuclear methods with the 13C/15N-labeled protein. The backbone resonance assignments were complete, with the exception of the first three residues in the tag sequence region at the N-terminus. Tertiary structures were calculated using the CYANA software package (Güntert et al. 1997; Herrmann et al. 2002), based on a total of 1377 NOE-derived distance restraints and 47 backbone torsion angle restraints (Table 1Go). A best-fit superposition of the ensemble of the 20 lowest-energy calculated structures is shown in Figure 2AGo. The root mean square deviation (RMSD) from the mean structure was 0.30 ± 0.03 Å for the backbone (N, C{alpha}, C') atoms and 0.76 ± 0.03 Å for all heavy (nonproton) atoms in the well-ordered region (residues 13–29, 44–68, and 81–90). The statistics of the structures are summarized in Table 1Go.


View this table:
[in this window]
[in a new window]
 
Table 1. Summary of conformational constraints and statistics of the final 20 best structures of the PWWP domain of mouse HRP-3
 


View larger version (44K):
[in this window]
[in a new window]
 
Figure 2. (A) Stereo view illustrating a trace of the backbone atoms for the ensemble of the 20 lowest energy structures of the mouse HRP-3 PWWP domain (residues 8–90). (B) Ribbon diagrams of the PWWP domains from the mouse HRP-3, the S. pombe protein SPBC215.07c (PDB code 1H3Z [PDB] ), and the mouse DNA methyltransferase Dnmt3b (1KHC) in the same view. Four residues in parentheses indicate those of the PWWP motif of each PWWP domain. The {alpha}-helices, {beta}-strands and the PR-loop are colored pink, green, and dark orange, respectively.

 
The NMR results show that the PWWP domain has a {beta}-barrel followed by a helical region, with the N- and C-termini on the same side of the molecule (Fig. 2BGo). The {beta}-barrel consists of five antiparallel strands ({beta}1: 14–18, {beta}2: 24–29, {beta}3: 44–48, {beta}4: 53–57; {beta}5: 62–64). Between {beta}4 and {beta}5 occurs a 310-helix ({alpha}1:59–61). The helical region consists of a one-turn {alpha}-helix ({alpha}2: 65–68), a coil region (69–72), and a 10-residue {alpha}-helix ({alpha}3: 81–90).

The common topology for the PWWP structures
The structures of the PWWP domains were reported from mouse DNA methyltransferase Dnmt3b (Qiu et al. 2002) and from a Schizosaccharomyces pombe protein, SPBC215.07c, of unknown function (Slater et al. 2003). Structural comparisons showed significant similarities in the {beta}-barrel region, and variations in the following helical region, revealing the structural essence of the PWWP domains (Fig. 2BGo). In the helical region, only the positioning of an {alpha}-helix equivalent to {alpha}3 (the {alpha}3 equivalent) is virtually identical among the three PWWP structures, despite the apparent lack of sequence conservation (Fig. 1BGo). This common {alpha}-helix is always placed in a groove between {beta}2 and the {beta}3/{beta}4 loop, in a fixed direction (Fig. 2BGo). The interactions involve highly conserved residues, including the third (Trp25) and fourth (Pro26) residues of the PWWP motif in {beta}2 (Fig. 3AGo). The aromatic ring of Trp25 vertically stacks with that of Phe81 (Tyr in the Dnmt3b domain and Leu in the S. pombe domain) at the beginning of {alpha}3, and the opposite side is partially solvent accessible. The H{varepsilon}1 of the indole ring of Trp25 hydrogen-bonds with the O of Phe49 in the {beta}3/{beta}4 loop. The fourth residue, Pro26, located in the middle of {beta}2, is buried with {alpha}3. In addition, the aromatic ring of Phe49 in the {beta}3/{beta}4 loop is snugly sandwiched between {alpha}2 and {alpha}3. This PWWP architecture involving the highly conserved residues, Trp25, Pro26, and Phe49, is essentially the same among the three PWWP structures. The positioning of {alpha}3, {beta}2, and the {beta}3/{beta}4 loop is probably retained in all of the PWWP domains, and is involved in forming a cavity as a putative binding site, as described later. It thus appears that the common topology for the PWWP structures is {beta}-{beta}-{beta}-{beta}-310-{beta}-the {alpha}3 equivalent.



View larger version (61K):
[in this window]
[in a new window]
 
Figure 3. (A) Stereo view illustrating the interactions between {alpha}3 and the {beta}-barrel substructure. {beta}1, {beta}2 and the connecting loop ({beta}-{beta} arch) are depicted in green, while {alpha}3 is pink. Shown are the PHWP residues (hot pink), Phe49 and the {beta}3/{beta}4 loop (brown), and Phe81 (navy). A hydrogen bond is depicted by a red dotted line. (B) Comparison of the second residue of the PWWP motif between the HRP-3 and S. pombe domains. {beta}1, {beta}2 and {beta}5 are depicted in green. van der Waals surfaces are shown for the side chain atoms of His24 (hot pink) and Phe63 (navy) in the HRP-3 domain, and for those of Trp138 (hot pink) and Thr184 (navy) in the S. pombe domain. Residues in the {beta}-{beta} arch are colored orange.

 
Role of the unique His residue among the PHWP residues
Although the third (Trp) and fourth (Pro) residues of the PWWP motif are highly conserved, the other residues vary (Fig. 1BGo). The first residue of the PWWP motif is located at the end of a loop, with the side chain accessible to the solvent, and thus its position could be occupied by several different residues. The second residue position is usually occupied by aromatic residues. The His residue at the second position is unique to the PWWP domains of the HDGF family. The PWWP structure with the PHWP residues shows that the imidazole ring of His24 in {beta}2 is located over {beta}1, and vertically contacts the aromatic ring of Phe63 in {beta}5 (Fig. 3BGo). Instead of the His and Phe residues, the PWWP domain of the S. pombe SPBC215.07c protein with the PWWP residues has Trp138 and Thr184 at the corresponding regions, respectively, and the indole ring of Trp is located over {beta}1 and stacks well with the methyl of Thr in {beta}5 (Slater et al. 2003). Hence, by comparison between the two PWWP domains, it follows that the indole ring of Trp, which is larger than that of His by a phenyl ring, compensates well for the space created by the replacement of Phe by Thr in {beta}5. One side of each of the aromatic rings of His24 and Trp138 in the HRP-3 and S. pombe domains interacts with the aliphatic portions of the invariant Lys residues in {beta}1, Lys18 and Lys132, respectively (Fig. 3BGo). A similar conformation also exists in the Dnmt3b domain, with Trp and Val in the corresponding positions (Qiu et al. 2002). Hence, despite these residue differences, the structural role of the second residue of the PWWP motif is virtually identical among the three PWWP structures. As an exception to the aromatic residues, Ala, Ser, or Pro occurs at the second residue position in a small number of PWWP domains (Fig. 1BGo). It will be interesting to examine the PWWP structures with these aliphatic residues at the second position of the PWWP motif.

{beta}-{beta} Arch
The loop connecting {beta}1 and {beta}2 is composed of five residues, 19MKGYP23, in the region neighboring the conserved Lys18 through the first residue, Pro23, of the PWWP motif (Fig. 3BGo). According to the Ramachandran nomenclature (Efimov 1993), this region adopts a {beta}{beta}{alpha}L{beta}{beta} conformation. Its conformation provides a reverse turn in the polypeptide chain and facilitates its transition from one layer to the other. Thus, this loop is called a {beta}-{beta} arch, but it is not a canonical {beta}-turn (Efimov 1993). The arch formation allows the side chains of Met19 and Tyr22 to be directed to the {beta}3/{beta}4 loop, and to participate as components of the cavity, as described later. A similar {beta}{beta}{alpha}L{beta}{beta} conformation is seen in the range of 133MSGFP137 in the S. pombe domain (Slater et al. 2003) (in the X-ray structure of the Dnmt3b domain, the electron density is not observed for Phe243, corresponding to Tyr22 in the HRP-3 domain). In all of the PWWP domains, the {beta}-{beta} arch region always contains five residues, and the predominant residue pattern is M/L/V-K-G-F/Y/H-P/S/A (Fig. 1BGo), suggesting the conservation of the {beta}-{beta} arch conformation among all of the domains, presumably also to allow the cavity formation.

The proline-rich loop (PR-loop) connecting {beta}2 and {beta}3
Within the {beta}-barrel substructure of the three PWWP domains, the only apparent differences were found in a loop connecting {beta}2 and {beta}3, which includes short helices in two domains (Fig. 2BGo). The sequence and length (8 ~ 20 residues) of this region vary among all of the PWWP domains (Fig. 1BGo). This loop is relatively rich in prolines, and thus we call it the proline-rich loop, PR-loop, in this paper. The characteristic feature of the PR-loop is highlighted by comparison with the corresponding loop in the Tudor domain, which has the most significant similarity to the {beta}-barrel substructure of the PWWP domain (Qiu et al. 2002; Maurer-Stroh et al. 2003; Fig. 4Go). In the Tudor domain, the presence of a {beta}-bulge in {beta}2 allows the {beta}-strand to bend sufficiently to form a {beta}-barrel. Consequently, {beta}2 and {beta}3 form a longer {beta}-sheet than that found in the PWWP domain, and the region connecting {beta}2 and {beta}3 forms a short loop or a turn. In contrast, in the PWWP domain, the lack of a {beta}-bulge in {beta}2 prevents the formation of such a long {beta}-sheet, and at the position equivalent to the {beta}-bulge, the PR-loop begins. Accordingly, {beta}2 and {beta}3 cross each other like the character "{lambda}," so that the connecting loop, the PR-loop, is stretched in a characteristic form.



View larger version (31K):
[in this window]
[in a new window]
 
Figure 4. Topological comparisons of the PWWP and Tudor domains. Note that the region depicted by a broken line region corresponds to a 310-helix in the two domains. An asterisk indicates a {beta}-bulge.

 
A characteristic cavity as a putative binding site
A possible functional role of the PWWP domain is suggested by its structural characteristics, which are related to its putative evolutionary relationships. Extensive structural and sequence analyses revealed a family of homologous motifs containing a three-{beta}-stranded core element, which includes the PWWP, Tudor, chromo (chromatin-binding), and MBT (malignant brain tumor) domains (Maurer-Stroh et al. 2003). The last three domains have a ligand interaction site located at the corresponding position, where a richly aromatic site involving the {beta}-{beta} arch and the {beta}3/{beta}4 loop plays a major role (in the chromodomain, the {beta}2 equivalent is missing) (Sathyamurthy et al. 2003; Wang et al. 2003; Fig. 5AGo). The binding sites seem to be sandwiched by a clothespin made of the two sticks of {beta}1/{beta}2 and {beta}3/{beta}4. As for the ligand, the Tudor domain of the survival motor neuron (SMN) protein binds to methylated Arg residues in the C-terminal Arg-Gly-Gly rich tails of the Sm proteins (Selenko et al. 2001). The chromodomain binds to the methylated lysine peptide from the histone H3 tail (Jacobs and Khorasanizadeh 2002; Nielsen et al. 2002). Although the function of the MBT repeats is unknown, one ligand binding pocket per MBT repeat seems to exist at the corresponding position, which in the X-ray structure can accommodate the morpholino ring of 2-(N-morpholino) ethanesulfonic acid (MES) or the proline ring of the C-terminal peptide segment (Wang et al. 2003).



View larger version (53K):
[in this window]
[in a new window]
 
Figure 5. (A) Binding sites of the Tudor (PDB code 1G5V [PDB] ), MBT (1OZ3) and chromo (1KNA) domains. In the MBT domain, the region encompassing residues 349–44 is shown. The equivalents to {beta}1 and {beta}2 in the PWWP domain are colored blue, while those corresponding to {beta}3 and {beta}4 are dark green. Aromatic residues and negatively charged residues involved in ligand binding are colored dark orange and red, respectively. (B) A cavity as a putative binding site in the PWWP domain (left) and molecular surface representations showing hydrophobic residues in green (right). The figure on the right was prepared with GRASP (Nicholls et al. 1991). Residues involved in cavity formation are shown. Arrows indicate the cavity as a putative binding site. The red hatched circle indicates the position corresponding to that in which a missense mutation causes ICF syndrome in the human Dnmt3b PWWP domain, as described in the text. (C) Molecular surface representations of the electrostatic potential (blue, positive; red, negative) of the PWWP domain, calculated by GRASP. The middle view is in the same orientation as those in (B). On the left and right, the views are rotated by 90° around the vertical axis.

 
The corresponding position of the PWWP structure has a solvent-exposed hydrophobic cavity, which is composed of Trp25 in {beta}2 of the PWWP motif, Met19 and Tyr22 in the {beta}-{beta} arch (the {beta}1/{beta}2 loop), Thr51 in the {beta}3/{beta}4 loop and Phe48 in {beta}3 (Fig. 5BGo). Many PWWP domains possess M/L/V at position 19, Y/F/H at 22, and T/S/D/H at 51; and the more conserved residues, W (in a few, Y) at 25 and F (W) at 48 (Fig. 1BGo). In addition to these residues in the {beta}-barrel region, only in the PWWP domains, the {alpha}3 equivalent, which interacts with {beta}2 (particularly, Trp25) and the {beta}3/{beta}4 loop, appears to contribute to cavity formation. At the side of the cavity, a negatively charged residue, Glu53, exists in {beta}4, which is an interaction site in the Tudor and chromo domains (Fig. 5AGo). Its probable evolutionary relationship to those domains that bind to the modified residues suggests that the PWWP domain binds to a modified residue of some chromatin component via the cavity. In the PWWP domains of the HDGF family, the cavity residues are all identical, suggesting the same ligand for a minimal element. The possibility cannot be excluded that other elements of the ligand, for example the flanking residues, differ depending on the proteins. It is notable that this cavity is followed by a hydrophobic patch composed mainly of residues in the PR-loop (Fig. 5BGo), which is the only region that differs in sequence and length among the PWWP domains of the HDGF family (Fig. 1AGo). The PR-loop may play specific roles in the interaction of other elements of the ligand, conferring ligand variations to each PWWP domain.

In the human Dnmt3b PWWP domain, a missense mutation (S282P, the corresponding position is Ala55 in {beta}4 of the PWWP domain in this study) causes ICF syndrome (Shirohzu et al. 2002). Intriguingly, this position is quite close to the cavity, or could be one component of the cavity structure (Fig. 5BGo). This mutation may alter the structure or the volume of the cavity so significantly as to affect the ligand binding affinity and/or specificity.

The possible chromatin binding property of the PWWP domain is supported by its characteristic electrostatic charge distribution (Fig. 5CGo). The charged residues are clustered in distinctly polarized patches, which surround the putative binding cavity. A similar surface electrostatic charge distribution exists in the bromodomain, which binds selectively to multiply acetylated histone H4 peptides (Jacobson et al. 2000). Upon binding, the charged surfaces may make non-specific contacts with the nucleic acid on the basic side, and contact the core histone molecules along the acidic surface.

Concluding remarks
Recent in vivo experiments have demonstrated that the PWWP domain of DNA methyltransferases is essential for the chromatin targeting of the enzymes, suggesting a direct interaction between the PWWP domain and chromatin (Ge et al. 2004). It is quite likely that some chromatin modifications, such as modified residues, are recognized by the PWWP-containing proteins via the PWWP domain, in which the cavity shown in this study plays a key role in the interaction.

Among the HDGF family members, LEDGF preferentially associates with condensed chromatin areas (Nishizawa et al. 2001). LEDGF confers cell protection against stress-induced apoptosis by binding to the promoter elements of heat shock and stress-related genes for activation (Singh et al. 2000b; Shinohara et al. 2002), and the PWWP domain may simultaneously bind to the chromatin surrounding the genes to facilitate or regulate the DNA binding. During apoptosis, interestingly, LEDGF is reportedly cleaved at three sites, by caspases-3 and -7, into two fragments lacking the PWWP domain (65 and 58 kDa), and this cleavage abrogates its pro-survival function (Wu et al. 2002). One of the cleavage sites (-DEVPD{uparrow}G-) is located in the PR-loop of LEDGF. These findings suggest the importance of the PWWP-mediated chromatin targeting for the protein function, as also seen with Dnmt3b. Similarly, the other HRPs and HDGF would be translocated to the nucleus by the nuclear localization signals, where they would associate via the PWWP domains with chromatin for their specific functions, which presumably depend on each gene-specific region in the C-terminus. Thus, the chromatin binding of these proteins seems to affect transcriptional processes, and consequently to regulate various cellular processes and development. The structural features of the PWWP domains provide insights for further experiments in the search for a ligand.


    Materials and methods
 TOP
 Abstract
 Introduction
 Results and Discussion
 Materials and methods
 References
 
Protein expression and purification
The DNA encoding the PWWP domain of mouse HRP-3 (Glu9-Asn90) was subcloned by PCR from the mouse full-length cDNA clone, with the ID RIKEN cDNA 4632410H03 (Okazaki et al. 2002). This DNA fragment was cloned into the expression vector pCR2.1 (Invitrogen) as a fusion with an N-terminal 6-His affinity tag and a TEV protease cleavage site. The 13C/15N-labeled fusion protein was synthesized by a cell-free protein expression system, as described (Kigawa et al. 1999, 2004). The solution was first adsorbed to a HiTrap Chelating column (Amersham Biosciences), which was washed with buffer A (50 mM sodium phosphate buffer, pH 8.0, containing 500 mM sodium chloride and 20 mM imidazole) and was eluted with buffer B (50 mM sodium phosphate buffer,pH 8.0, containing 500 mM sodium chloride and 500 mM imidazole). To remove the His-tag, the eluted protein was incubated at 30°C for 1 h with the TEV protease. After dialysis against buffer A without imidazole, the dialysate was mixed with imidazole (20 mM final concentration) and then applied to a HiTrap Chelating column, which was washed with buffer A. The flow through fraction was dialyzed against buffer C (20 mM sodium phosphate buffer, pH 7.2, containing 1 mM PMSF). The dialysate was fractionated on a HiTrap SP column by a concentration gradient of buffer C and buffer D (20 mM sodium phosphate buffer, pH 7.2, containing 1 M sodium chloride and 1 mM PMSF). The PWWP-containing fractions were collected.

For NMR measurements, the purified protein was concentrated to 0.7 mM in 1H2O/2H2O (9:1) 20 mM sodium phosphate buffer (pH 6.0), containing 100 mM NaCl, 1 mM 1,4-DL-dithiothreitol-d10 (d-DTT), and 0.02% NaN3. It was stable for at least 6 mo, when stored at 4°C.

NMR spectroscopy, structure determination, and analysis
All NMR measurements were performed at 25°C on Bruker AVANCE 600 and AVANCE 800 spectrometers. Sequence-specific backbone chemical shift assignments (Wüthrich 1986) were made with the 13C/15N-labeled sample, using standard triple-resonance experiments (Bax 1994; Cavanagh et al. 1996). Assignments of side chains were obtained from HBHACONH, HCCCONNH, CCCONNH, HCCH-TOCSY, and HCCH-COSY spectra. 15N- and 13C-edited NOESY spectra with 80- and 40-msec mixing times were used to determine the distance restraints. The spectra were processed with the program NMRpipe (Delaglio et al. 1995). The program KUJIRA (N. Kobayashi, pers. comm.), created on the basis of NMRView (Johnson and Blevins 1994), was employed for optimal visualization and spectral analysis.

Automated NOE cross-peak assignments (Herrmann et al. 2002) and structure calculations with torsion angle dynamics (Güntert et al. 1997) were performed using the software package CYANA1.0.7 (Güntert 2003). Peak lists of the two NOESY spectra were generated as input with the program NMRView (Johnson and Blevins 1994). The input further contained the chemical shift list corresponding to the sequence-specific assignments. Dihedral angle restraints were derived using the program TALOS (Cornilescu et al. 1999). No hydrogen bond constraints were used.

A total of 100 structures were independently calculated. The 20 conformers of the CYANA cycle 7 with the lowest final CYANA target function values were energy-minimized in a water shell with the program OPALp (Koradi et al. 2000), using the AMBER force field (Cornell et al. 1995). The structures were validated using PROCHECK-NMR (Laskowski et al. 1996). The program MOLMOL (Koradi et al. 1996) was used to analyze the resulting 20 conformers, and to prepare drawings of the structures, unless otherwise noted in the legends.

The atomic coordinates of the 20 energy-minimized CYANA conformers of the mouse HRP-3 PWWP domain have been deposited at the RCSB Protein Data Bank (www.rcsb.org) under the PDB Accession code 1N27 [PDB] .


    Acknowledgments
 
We are indebted to Dr. Yutaka Muto for valuable suggestions and comments on the structural analysis. We thank Emi Nunokawa, Natsuko Matsuda, Yoko Motoda, Yukako Miyata, Atsuo Kobayashi, Noriko Hirakawa, Noriko Sakagami, Fumiko Hiroyasu, Yasuko Tomo, Takushi Harada, Miyuki Sato, Mari Hirato, Satoko Yasuda, and Takashi Osanai for their technical assistance. We also thank Tomoko Nakayama and Kiyomi Yajima for expert secretarial assistance. This work was supported by the RIKEN Structural Genomics/Proteomics Initiative (RSGI), the National Project on Protein Structural and Functional Analyses, Ministry of Education, Culture, Sports, Science and Technology of Japan.


    References
 TOP
 Abstract
 Introduction
 Results and Discussion
 Materials and methods
 References
 
Abouzied, M.M., Baader, S.L., Dietz, F., Kappler, J., Gieselmann, V., and Franken, S. 2004. Expression patterns and different subcellular localization of the growth factors HDGF (hepatoma-derived growth factor) and HRP-3 (HDGF-related protein-3) suggest functions in addition to their mitogenic activity. Biochem. J. 378: 169–176.[CrossRef][Medline]

Bateman, A., Coin, L., Durbin, R., Finn, R.D., Hollich, V., Griffiths-Jones, S., Khanna, A., Marshall, M., Moxon, S., Sonnhammer, E.L., et al. 2004. The Pfam protein families database. Nucleic Acids Res. 32: D138–D141.[Abstract/Free Full Text]

Bax, A. 1994. Multidimensional nuclear magnetic resonance methods for protein studies. Curr. Opin. Struct. Biol. 4: 738–744.[CrossRef]

Cavanagh, J., Fairbrother, W.J., Palmer III, A.G., and Skelton, N.J. 1996. Protein NMR spectroscopy, principles and practice. Academic Press, San Diego, CA.

Cornell, W.D., Cieplak, P., Bayly, C.I., Gould, I.R., Merz, K.M., Ferguson, D.M., Spellmeyer, D.C., Fox, T., Caldwell, J.W., and Kollman, P.A. 1995. A second generation force field for the simulation of proteins, nucleic acids, and organic molecules. J. Am. Chem. Soc. 117: 5179–5197.[CrossRef]

Cornilescu, G., Delaglio, F., and Bax, A. 1999. Protein backbone angle restraints from searching a database for chemical shift and sequence homology. J. Biomol. NMR 13: 289–302.[CrossRef][Medline]

Delaglio, F., Grzesiek, S., Vuister, G.W., Zhu, G., Pfeifer, J., and Bax, A. 1995. NMRPipe: A multidimensional spectral processing system based on UNIX pipes. J. Biomol. NMR 6: 277–293.[Medline]

Dietz, F., Franken, S., Yoshida, K., Nakamura, H., Kappler, J., and Gieselmann, V. 2002. The family of hepatoma-derived growth factor proteins: Characterization of a new member HRP-4 and classification of its subfamilies. Biochem. J. 366: 491–500.[CrossRef][Medline]

Efimov, A.V. 1993. Standard structures in proteins. Prog. Biophys. Mol. Biol. 60: 201–239.[CrossRef][Medline]

Everett, A.D. 2001. Identification, cloning, and developmental expression of hepatoma-derived growth factor in the developing rat heart. Dev. Dyn. 222: 450–458.[CrossRef][Medline]

Everett, A.D., Lobe, D.R., Matsumura, M.E., Nakamura, H., and McNamara, C.A. 2000. Hepatoma-derived growth factor stimulates smooth muscle cell growth and is expressed in vascular development. J. Clin. Invest. 105: 567–575.[Medline]

Everett, A.D., Stoops, T., and McNamara, C.A. 2001. Nuclear targeting is required for hepatoma-derived growth factor-stimulated mitogenesis in vascular smooth muscle cells. J. Biol. Chem. 276: 37564–37568.[Abstract/Free Full Text]

Ge, Y.Z., Pu, M.T., Gowher, H., Wu, H.P., Ding, J.P., Jeltsch, A., and Xu, G.L. 2004. Chromatin targeting of de novo DNA methyltransferases by the PWWP domain. J. Biol. Chem. 279: 25447–25454.[Abstract/Free Full Text]

Güntert, P. 2003. Automated NMR protein structure calculation. Prog. NMR Spectrosc. 43: 105–125.[CrossRef]

Güntert, P., Mumenthaler, C., and Wüthrich, K. 1997. Torsion angle dynamics for NMR structure calculation with the new program DYANA. J. Mol. Biol. 273: 283–298.[CrossRef][Medline]

Herrmann, T., Güntert, P., and Wüthrich, K. 2002. Protein NMR structure determination with automated NOE-identification in the NOESY spectra using the new software ATNOS. J. Biomol. NMR 24: 171–189.[CrossRef][Medline]

Ikegame, K., Yamamoto, M., Kishima, Y., Enomoto, H., Yoshida, K., Suemura, M., Kishimoto, T., and Nakamura, H. 1999. A new member of a hepatoma-derived growth factor gene family can translocate to the nucleus. Biochem. Biophys. Res. Commun. 266: 81–87.[CrossRef][Medline]

Izumoto, Y., Kuroda, T., Harada, H., Kishimoto, T., and Nakamura, H. 1997. Hepatoma-derived growth factor belongs to a gene family in mice showing significant homology in the amino terminus. Biochem. Biophys. Res. Commun. 238: 26–32.[CrossRef][Medline]

Jacobs, S.A. and Khorasanizadeh, S. 2002. Structure of HP1 chromodomain bound to a lysine 9-methylated histone H3 tail. Science 295: 2080–2083.[Abstract/Free Full Text]

Jacobson, R.H., Ladurner, A.G., King, D.S., and Tjian, R. 2000. Structure and function of a human TAFII250 double bromodomain module. Science 288: 1422–1425.[Abstract/Free Full Text]

Johnson, B. and Blevins, R. 1994. NMRView: A computer program for the visualization and analysis of NMR data. J. Biomol. NMR 4: 603–614.[CrossRef]

Kigawa, T., Yabuki, T., Yoshida, Y., Tsutsui, M., Ito, Y., Shibata, T., and Yokoyama, S. 1999. Cell-free production and stable-isotope labeling of milligram quantities of proteins. FEBS Lett. 442: 15–19.[CrossRef][Medline]

Kigawa, T., Yabuki, T., Matsuda, N., Matsuda, T., Nakajima, R., Tanaka, A., and Yokoyama, S. 2004. Preparation of Escherichia coli cell extract for highly productive cell-free protein expression. J. Struct. Funct. Genomics 5: 63–68.[CrossRef][Medline]

Kishima, Y., Yamamoto, H., Izumoto, Y., Yoshida, K., Enomoto, H., Yamamoto, M., Kuroda, T., Ito, H., Yoshizaki, K., and Nakamura, H. 2002. Hepatoma-derived growth factor stimulates cell growth after translocation to the nucleus by nuclear localization signals. J. Biol. Chem. 277: 10315–10322.[Abstract/Free Full Text]

Koradi, R., Billeter, M., and Wüthrich, K. 1996. MOLMOL: A program for display and analysis of macromolecular structures. J. Mol. Graph. 14: 51–55.[CrossRef][Medline]

Koradi, R., Billeter, M., and Güntert, P. 2000. Point-centered domain decomposition for parallel molecular dynamics simulation. Comput. Phys. Commun. 124: 139–147.[CrossRef]

Laskowski, R.A., Rullmannn, J.A., MacArthur, M.W., Kaptein, R., and Thornton, J.M. 1996. AQUA and PROCHECK-NMR: Programs for checking the quality of protein structures solved by NMR. J. Biomol. NMR 8: 477–486.[Medline]

Matsuyama, A., Inoue, H., Shibuta, K., Tanaka, Y., Barnard, G.F., Sugimachi, K., and Mori, M. 2001. Hepatoma-derived growth factor is associated with reduced sensitivity to irradiation in esophageal cancer. Cancer Res. 61: 5714–5717.[Abstract/Free Full Text]

Maurer-Stroh, S., Dickens, N.J., Hughes-Davies, L., Kouzarides, T., Eisenhaber, F., and Ponting, C.P. 2003. The Tudor domain ‘Royal Family’: Tudor, plant Agenet, Chromo, PWWP and MBT domains. Trends Biochem. Sci. 28: 69–74.[CrossRef][Medline]

Nakamura, H., Izumoto, Y., Kambe, H., Kuroda, T., Mori, T., Kawamura, K., Yamamoto, H., and Kishimoto, T. 1994. Molecular cloning of complementary DNA for a novel human hepatoma-derived growth factor. Its homology with high mobility group-1 protein. J. Biol. Chem. 269: 25143–25149.[Abstract/Free Full Text]

Nicholls, A., Sharp, K.A., and Honig, B. 1991. Protein folding and association: Insights from the interfacial and thermodynamic properties of hydrocarbons. Proteins 11: 281–296.[CrossRef][Medline]

Nielsen, P.R., Nietlispach, D., Mott, H.R., Callaghan, J., Bannister, A., Kouzari-des, T., Murzin, A.G., Murzina, N.V., and Laue, E.D. 2002. Structure of the HP1 chromodomain bound to histone H3 methylated at lysine 9. Nature 416: 103–107.[CrossRef][Medline]

Nishizawa, Y., Usukura, J., Singh, D.P., Chylack Jr., L.T., and Shinohara, T. 2001. Spatial and temporal dynamics of two alternatively spliced regulatory factors, lens epithelium-derived growth factor (ledgf/p75) and p52, in the nucleus. Cell Tissue Res. 305: 107–114.[CrossRef][Medline]

Okazaki, Y., Furuno, M., Kasukawa, T., Adachi, J., Bono, H., Kondo, S., Nikaido, I., Osato, N., Saito, R., Suzuki, H., et al. 2002. Analysis of the mouse transcriptome based on functional annotation of 60,770 full-length cDNAs. Nature 420: 563–573.[CrossRef][Medline]

Oliver, J.A. and Al-Awqati, Q. 1998. An endothelial growth factor involved in rat renal development. J. Clin. Invest. 102: 1208–1219.[Medline]

Qiu, C., Sawada, K., Zhang, X., and Cheng, X. 2002. The PWWP domain of mammalian DNA methyltransferase Dnmt3b defines a new family of DNA-binding folds. Nat. Struct. Biol. 9: 217–224.[Medline]

Sathyamurthy, A., Allen, M.D., Murzin, A.G., and Bycroft, M. 2003. Crystal structure of the malignant brain tumor (MBT) repeats in Sex Comb on Midleg-like 2 (SCML2). J. Biol. Chem. 278: 46968–46973.[Abstract/Free Full Text]

Schuster-Böckler, B., Schultz, J., and Rahmann, S. 2004. HMM Logos for visualization of protein families. BMC Bioinformatics 5: 7.[CrossRef][Medline]

Selenko, P., Sprangers, R., Stier, G., Bühler, D., Fischer, U., and Sattler, M. 2001. SMN tudor domain structure and its interaction with the Sm proteins. Nat. Struct. Biol. 8: 27–31.[CrossRef][Medline]

Shinohara, T., Singh, D.P., and Fatma, N. 2002. LEDGF, a survival factor, activates stress-related genes. Prog. Retin. Eye Res. 21: 341–358.[CrossRef][Medline]

Shirohzu, H., Kubota, T., Kumazawa, A., Sado, T., Chijiwa, T., Inagaki, K., Suetake, I., Tajima, S., Wakui, K., Miki, Y., et al. 2002. Three novel DNMT3B mutations in Japanese patients with ICF syndrome. Am. J. Med. Genet. 112: 31–37.[CrossRef][Medline]

Singh, D.P., Kimura, A., Chylack Jr., L.T., and Shinohara, T. 2000a. Lens epithelium-derived growth factor (LEDGF/p75) and p52 are derived from a single gene by alternative splicing. Gene 242: 265–273.[CrossRef][Medline]

Singh, D.P., Ohguro, N., Kikuchi, T., Sueno, T., Reddy, V.N., Yuge, K., Chylack Jr., L.T., and Shinohara, T. 2000b. Lens epithelium-derived growth factor: Effects on growth and survival of lens epithelial cells, keratinocytes, and fibroblasts. Biochem. Biophys. Res. Commun. 267: 373–381.[CrossRef][Medline]

Slater, L.M., Allen, M.D., and Bycroft, M. 2003. Structural variation in PWWP domains. J. Mol. Biol. 330: 571–576.[CrossRef][Medline]

Stec, I., Wright, T.J., van Ommen, G.J., de Boer, P.A., van Haeringen, A., Moorman, A.F., Altherr, M.R., and den Dunnen, J.T. 1998. WHSC1, a 90 kb SET domain-containing gene, expressed in early development and homologous to a Drosophila dysmorphy gene maps in the Wolf-Hirschhorn syndrome critical region and is fused to IgH in t(4;14) multiple myeloma. Hum. Mol. Genet. 7: 1071–1082.[Abstract/Free Full Text]

Stec, I., Nagl, S.B., van Ommen, G.J., and den Dunnen, J.T. 2000. The PWWP domain: A potential protein-protein interaction domain in nuclear proteins influencing differentiation? FEBS Lett. 473: 1–5.[CrossRef][Medline]

Wang, W.K., Tereshko, V., Boccuni, P., MacGrogan, D., Nimer, S.D., and Patel, D.J. 2003. Malignant brain tumor repeats: a three-leaved propeller architecture with ligand/peptide binding pockets. Structure 11: 775–789.[Medline]

Wanschura, S., Schoenmakers, E.F., Huysmans, C., Bartnitzke, S., Van de Ven, W.J., and Bullerdiek, J. 1996. Mapping of the gene encoding the human hepatoma-derived growth factor (HDGF) with homology to the high-mobility group (HMG)-1 protein to Xq25. Genomics 32: 298–300.[CrossRef][Medline]

Wu, X., Daniels, T., Molinaro, C., Lilly, M.B., and Casiano, C.A. 2002. Caspase cleavage of the nuclear autoantigen LEDGF/p75 abrogates its pro-survival function: Implications for autoimmunity in atopic disorders. Cell Death Differ. 9: 915–925.[CrossRef][Medline]

Wüthrich, K. 1986. NMR of proteins and nucleic acids. Wiley, New York.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?


This article has been cited by other articles:


Home page
J. Virol.Home page
M.-C. Shun, Y. Botbol, X. Li, F. Di Nunzio, J. E. Daigle, N. Yan, J. Lieberman, M. Lavigne, and A. Engelman
Identification and Characterization of PWWP Domain Residues Critical for LEDGF/p75 Chromatin Binding and Human Immunodeficiency Virus Type 1 Infectivity
J. Virol., December 1, 2008; 82(23): 11555 - 11567.
[Abstract] [Full Text] [PDF]


Home page
Mol Cancer ResHome page
T. A. Brown-Bryan, L. S. Leoh, V. Ganapathy, F. J. Pacheco, M. Mediavilla-Varela, M. Filippova, T. A. Linkhart, R. Gijsbers, Z. Debyser, and C. A. Casiano
Alternative Splicing and Caspase-Mediated Cleavage Generate Antagonistic Variants of the Stress Oncoprotein LEDGF/p75
Mol. Cancer Res., August 1, 2008; 6(8): 1293 - 1307.
[Abstract] [Full Text] [PDF]


Home page
J. Virol.Home page
L. Vandekerckhove, F. Christ, B. Van Maele, J. De Rijck, R. Gijsbers, C. Van den Haute, M. Witvrouw, and Z. Debyser
Transient and Stable Knockdown of the Integrase Cofactor LEDGF/p75 Reveals Its Role in the Replication Cycle of Human Immunodeficiency Virus
J. Virol., February 15, 2006; 80(4): 1886 - 1896.
[Abstract] [Full Text] [PDF]


Home page
Protein Sci.Home page
S. M. Lukasik, T. Cierpicki, M. Borloz, J. Grembecka, A. Everett, and J. H. Bushweller
High resolution structure of the HDGF PWWP domain: A potential DNA binding domain
Protein Sci., February 1, 2006; 15(2): 314 - 323.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow All Versions of this Article:
ps.04975305v1
14/3/756    most recent
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Nameki, N.