Protein Science
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Otte, L.
Right arrow Articles by Oschkinat, H.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Otte, L.
Right arrow Articles by Oschkinat, H.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?
Protein Science (2003), 12:491-500.
Copyright © 2003 The Protein Society

WW domain sequence activity relationships identified using ligand recognition propensities of 42 WW domains

Livia Otte1,4, Urs Wiedemann2,3,4, Brigitte Schlegel2, José Ricardo Pires2, Michael Beyermann2, Peter Schmieder2, Gerd Krause2, Rudolf Volkmer-Engert1, Jens Schneider-Mergener1 and Hartmut Oschkinat2,3

1 Institut für Medizinische Immunologie, Universitätsklinikum Charité, Schumannstr. 20-21, 10117 Berlin, Germany
2 Forschungsinstitut für Molekulare Pharmakologie, Robert-Rössle-Str. 10, 13125 Berlin, Germany
3 The Berlin Structure Factory Project, Heubnerweg 6, 14059 Berlin, Germany

Reprint requests to: Hartmut Oschkinat, Forschungsinstitut für Molekulare Pharmakologie, Robert-Rössle-Str. 10, 13125 Berlin, Germany; E-mail: Oschkinat{at}fmp-berlin.de; fax: 49 (30) 94793-169.

4 These authors contributed equally to this work. Back

Article and publication are at http://www.proteinscience.org/cgi/doi/10.1110/ps.0233203.

Abbreviations and conventions: Consensus sequences are given according to modified Seefeld Convention 2001 nomenclature (Aasland et al. 2002). Amino acids are represented by standard IUPAC single-character notation with uppercase bold characters representing the most highly conserved positions lowercase characters representing less specific residues lowercase "x" denoting "any amino acid," amino acid alternatives specified within brackets, {phi} representing hydrophobic amino acids, {xi} representing aliphatic amino acids, and phosphorylated amino acids denoted by a po-prefix. Multiple WW domains within one protein are sequentially abbreviated by the protein name, with the sequential number of the respective WW domain as a suffix. WW domains of proteins containing only a single WW domain are represented by the protein name only, omitting a suffix. n.d., not determined.


    Abstract
 TOP
 Abstract
 Introduction
 Results and Discussion
 Materials and methods
 References
 
WW domains mediate protein–protein interactions in a number of different cellular functions by recognizing proline-containing peptide sequences. We determined peptide recognition propensities for 42 WW domains using NMR spectroscopy and peptide library screens. As potential ligands, we studied both model peptides and peptides based on naturally occurring sequences, including phosphorylated residues. Thirty-two WW domains were classified into six groups according to detected ligand recognition preferences for binding the motifs PPx(Y/poY), (p/{phi})P(p,g)PPpR, (p/{phi})PPRgpPp, PPLPp, (p/{xi})PPPPP, and (poS/poT)P (motifs according to modified Seefeld Convention 2001). In addition to these distinct binding motifs, group-specific WW domain consensus sequences were identified. For PPxY-recognizing domains, phospho-tyrosine binding was also observed. Based on the sequences of the PPx(Y/poY)-specific group, a profile hidden Markov model was calculated and used to predict PPx(Y/poY)-recognition activity for WW domains, which were not assayed. PPx(Y/poY)-binding was found to be a common property of NEDD4-like ubiquitin ligases.

Keywords: Protein—protein interaction; WW domain; protein interaction modules; ligand screen; peptide libraries; sequence—activity relationships; protein—protein interaction prediction


    Introduction
 TOP
 Abstract
 Introduction
 Results and Discussion
 Materials and methods
 References
 
Cellular processes depend on finely tuned protein–protein interactions, which together form complex interaction networks. These interactions are often mediated by noncatalytic protein domains. One of these protein interaction modules, the WW domain, is supposed to recognize proline-containing peptide segments via two highly conserved aromatic residues (Y28 and W39—numbering according to hYAP65) (Macias et al. 1996). Broadly speaking, WW domains target two kinds of peptide ligands (Sudol and Hunter 2000): The first consists of several consecutive prolines and one specificity-determining residue, which collectively form a PPII helix. These sequences include the PPxY motif (Chen and Sudol 1995), proline-rich stretches containing arginine or lysine (Bedford et al. 2000), and the PPLP motif (Bedford et al. 1997). The second type of ligand contains a phosphorylated serine or threonine preceding a proline, sharing fewer sequence features with each other compared to the first type (Yaffe et al. 1997) (motifs according to modified Seefeld Convention 2001; see Abbreviations and Conventions).

We attempted to identify specificity-determining residues within WW domains and their ligand-recognition motifs. Thereby, sequence activity relationships could be inferred that then allowed for the prediction of potential activities and interaction partners of yet unexplored WW domains and ligands. These predictions aid the design of experiments to assess the cellular function of WW domain-containing proteins, and hence help to unravel the protein interaction network. As these dynamic networks have to respond to changing environments, mechanisms are required to regulate the underlying protein interactions. Hence, we also investigated the potential regulation of WW domain–ligand interaction by phosphorylation of the ligand (Sudol and Hunter 2000).

To determine the ligand binding propensities of 42 chemically synthesized WW domains (29 human, 2 mouse, 11 yeast), we used NMR spectroscopy for detecting interactions with nonoptimized model peptides in a site-specific manner. As a second, independent approach we used a cellulose-bound array of the 42 WW domains for solid-phase binding assays with peptides derived from naturally occurring proteins. To infer sequence activity relationships, the ligand binding propensities were mapped onto a tree, which clusters WW domains based on sequence similarity. Group-specific consensus sequences of the WW domains were obtained. Substitutional analyses of the ligands revealed group-specific binding patterns suggesting common group-binding mechanisms. Regulation of WW domain–ligand interactions by ligand phosphorylation was analyzed by screening a library consisting of 600 potential kinase target sequences and determination of dissociation constants by fluorescence spectroscopy. Finally, a profile hidden Markov model (HMM) was used to predict the binding propensities towards tyrosine/phospho-tyrosine-containing ligands for the remaining WW sequences found in the SMART database (Schultz et al. 1998).


    Results and Discussion
 TOP
 Abstract
 Introduction
 Results and Discussion
 Materials and methods
 References
 
Identification of WW domain ligand binding propensities by ligand screens
1D-1H NMR spectra of 42 WW domains were recorded with and without peptide ligands of the type GPPPPBG (where B = Y, "Y-peptide"; phospho-tyrosine (poY), "poY-peptide"; phospho-serine (poS), "poS-peptide"). In addition, a cellulose bound array of the 42 WW domains ("WW array") was assayed with peptides from naturally occurring proteins (APPTPPPLPP, "short L-peptide" (Bedford et al. 1997); APPTPPPLPPPLIPPPPPLPP, "long L-peptide"; PPGPPPRGPPPR, "short R-peptide" (Bedford et al. 2000); PPGPPPRGPPPRPPGPPPRGPPPR, "long R-peptide").

According to the 1D-1H NMR reference spectra, without ligand, 30 WW domains were folded, showing the characteristic peak pattern between 9.0–9.6 and 10 ppm (Macias et al. 1996; Figs. 1A, 2GoGo). Two further domains, hWWP3-2 and hPQBP1, were folded only after ligand addition, judged by the appearance of the characteristic peak pattern in the 1D-1H NMR spectra. Sixteen WW domains showed chemical shift perturbations after addition of both the Y- and the poY-peptide. For four more domains only weak perturbations were observed with the poY- but none with the Y-peptide (Fig. 2Go, first and second result columns). WW array experiments using tyrosine- and phospho-tyrosine-containing peptides supported the NMR results. The NMR screen for the poS-peptide revealed seven potential weak interactions, six of these with WW domains also recognizing phospho-tyrosine (Fig. 2Go, third result column). An additional seven domains showed chemical shift changes when exposed to a mixture of model peptides of the type GPPPPBG (where B = E, I, Q, R, and phospho-threonine [poT]).



View larger version (29K):
[in this window]
[in a new window]
 
Figure 1. Determination of WW domain ligand recognition propensities by NMR spectroscopy and WW arrays. (A) 1D-1H NMR spectroscopy. The spectra typically contained three signals in the downfield region: two originating from the two key tryptophan side-chain NH signals (W17 and W39) in the folded state, and one originating from both tryptophans of residual unfolded material. The assignment of the two tryptophan signals was based on the assumption that the signal of the buried W17 did not change upon ligand addition (Pires et al. 2001). (B, C) The WW array. Interaction of 42 WW domains with the peroxidase-labeled peptides PPGPPPRGPPPR (B) and APPTPPPLPP (C). PPGPPPRGPPPR bound to yYFB0 (Spot #5), hFE65 (Spot #20), hPQBP1 (Spot #22), hHYP109-1 (Spot #26), hFBP21-1 (Spot #27), and hFBP21-2 (Spot #28). APPTPPPLPP bound to hFBP11-1 (Spot #23) and hFBP11-2 (Spot #24). The complete assignment of WW array spot numbers and corresponding WW domains can be found in column 3 (Nr.*) of Figure 2Go.

 


View larger version (96K):
[in this window]
[in a new window]
 
Figure 2. Alignment and classification of WW domain sequences according to ligand recognition propensities. The WW domain sequences are grouped according to the experimental results and the clustering obtained by the tree (Fig. 3Go). The consensus sequences were calculated for only those domains exhibiting the group-specific recognition activities. Group-specific residues are indicated in bold type. A priori unfolded WW domains are highlighted in gray. The chemical shift changes of the W39 side-chain NH obtained by 1D-1H NMR spectroscopy are summarized in the first four result columns. Small changes are indicated by pale colors (threshold: 0.03 ppm for Y-, poY-, poT/R-, and 0.01 ppm for poS-peptides), large changes by bright colors (threshold: 0.15 ppm for Y-, poY-, poT/R-, and poS-peptides). "Shift" indicates cases in which a chemical shift change for the side-chain NH signal of W39 was not observed or not observable because of the absence of a tryptophan residue at position 39, but other NH signals changed significantly. For WW domains, which do not have a tryptophan residue at position 39, "none" indicates that no other chemical shift perturbations were observable. The results obtained by at least three independent repetitions of WW array screens were classified as strong signals ( + ), weak signals ( +/- ), and no signals ( - ). Serines replacing cysteines are underlined in the sequences. Residues on the ligand-exposed face of the ß-sheet are highlighted by a beige background. ID: accession code; Nr.* position on WW array; Sequence**: sequence numbering according to hYAP65 WW domain (Pires et al. 2001), insertion between residues 24 and 25 indicated as 24b; Y (Y-peptide): GPPPPYG; poY (poY-peptide): GPPPPpoYG; poS (poS-peptide): GPPPPpoSG; poT/R (poT/R-peptide): VLpoTPPDRL; Rs (short R-peptide): PPGPPPRGPPPR; Rl (long R-peptide): PPGPPPRGPPPRPPGPPPRGPPPR; Ls (short L-peptide): APPTPPPLPP; L(I)l (long L-peptide): APPTPPPLPPPLIPPPPPLPP.

 
In WW array experiments using the short and long R-peptides, respectively 6 and 11 interactions were obtained (Figs. 1B, 2GoGo, fifth and sixth result columns), with the short R-peptide being more selective. Using the short and long L-peptides, respectively, two and nine positive responses were obtained (Figs. 1C, 2GoGo, seventh and eighth result columns).

In summary, binding to specific proline-containing ligands is a general feature of WW domains. In total, 31 WW domains bound to at least one of the tested proline-containing peptides. Ten of the 11 domains that did not show any potential interaction were unfolded, as shown by the 1D-1H NMR spectra. The screening results allowed a definition of six groups of WW domains according to their activity (Fig. 2Go): WW domains recognizing exclusively the Y- and poY-peptide constitute the Y-group. For arginine-containing ligands, two distinct groups could be identified: The Ra-group, whose members bound strongly to both R-peptides and in addition the long L-peptide, and the Rb-group showing strong interactions solely with both arginine-containing peptides. Strong binding to the short and long L-peptide was found for L-group members. Distinct from these, a group of WW domains, which bind to the long L-peptide only, was found. We called this the poly-P-group, because the long L-peptide contains a stretch of five prolines distinguishing it from the short L-peptide. The well-known phospho-serine/threonine-recognizing WW domains hPIN1 and yESS1 (Verdecia et al. 2000) were combined to form the sixth group.

Sequence activity relationships
To cluster the 42 WW domains on the basis of conserved sequence features, a tree representing sequence similarity was calculated (Felsenstein 1989; Page 1996). Only residues on the ligand-exposed face of the ß-sheet were included to bias the tree towards functional relatedness. Mapping of the detected ligand binding propensities (Fig. 2Go) onto this tree revealed a correlation between the Y-, Ra-, and Rb-group and three distinct branches (Fig. 3Go). All domains recognizing exclusively tyrosine- and phospho-tyrosine-containing peptides (Y-group) are clustered into a compact branch (Y-branch), which does not contain domains binding strongly to any other motif. hSMURF2-1, yYJQ8 and four unfolded domains, for all of which no ligands were found, are also found in the same branch. Domains outside this branch showing weak tyrosine- or phospho-tyrosine-affinity invariably bound more strongly to R-, L-, or poT-peptides. All R-, L-, and poT-specific domains are located in the lower half of the tree. In this region two distinct branches cluster Ra-group and Rb-group members, respectively. In contrast to this, the L-, poly-P-, and poS/poT-group WW domains are more dispersed in the lower half of the tree. The phylogenetic tree calculated using all residues does not differ fundamentally from the one presented here (data not shown). Most notably the Y-, Ra-, and Rb-branches are found in this tree as well. The trees differ mainly in the fact that yYJQ8 does not cluster to the Y-branch, and hHYP109-1 not to the Ra-branch.



View larger version (31K):
[in this window]
[in a new window]
 
Figure 3. Mapping of ligand recognition propensities onto a tree of sequence similarity for the 42 WW domains. The tree was constructed from a sequence distance matrix, which was calculated using only the residues on the ligand-exposed face of the ß-sheet [18, 20, 22–26, 28, 30, 32–35, 37, 39] and W17. The WW domain prototype sequence was included for comparison (Macias et al. 2000). Unfolded WW domains are highlighted in gray. The tree revealed three branches correlating with distinct recognition activities (Y-, Ra-, and Rb-branches). Result coloring according to Figure 2Go.

 
To identify specificity-determining residues, consensus sequences were calculated for the Y- and Ra-group and compared to the group-specific residue patterns of the Rb- and poS/poT-group (Figs. 2, 4GoGo). The L- and poly-P-group consensus sequences were omitted, because these groups had not enough members, and thus the respective consensi were not revealing. All other residue patterns share the residues characteristic for WW domains (structural: W17, Y/F29, and P42; functional: Y28, S/T37, and W39). Beyond this the residue patterns show distinct group specific features (Figs. 2, 4GoGo). In the case of the Y-group, the consensus additionally comprises E18, V30, H32, and R35. Except for E18, these residues have been shown by the available complex structures to mediate the tyrosine recognition (Macias et al. 1996). In the case of the two different R-groups, two distinct group-specific residue patterns were found: V20, D22, and W30 for the Ra-group and V18, G20, T/S22, D/E24, Y/F26, and Y30 for the Rb-group. Despite their differences, negatively charged residues in the ß1–ß2-loop in conjunction with an aromatic residue in position 30 are characteristic for both consensus sequences. These negatively charged residues might recognize the arginine of the ligand. Their location in the ß1–ß2-loop closely resembles the location of the S22R/K23-motif of the poS/poT-group members recognizing the poS/poT residues in their ligands, suggesting an analogous ligand recognition mechanism. Contrary to this, the specificity-determining residues of the Y-group are mainly located around the ß2–ß3-loop. The presence of a third aromatic residue on the binding surface of the WW domains of both R-groups might lead to the formation of a second proline binding site (Macias et al. 2002). This second proline binding site might be even more pronounced in the Ra-group by the presence of a tryptophan, which may explain their propensity to bind longer proline only sequences (Fig. 2Go, eighth result column). In fact, hFE65 has already been reported to bind to the peptide PPPPPP (Espanel and Sudol 1999).



View larger version (36K):
[in this window]
[in a new window]
 
Figure 4. Schematic representations of the WW domain ß-sheet indicating group-specific sequence patterns. The sequence consensi obtained for the Y-, Ra-, and Rb-groups are compared to the poS/poT-group consensus (residues on the ligand-exposed face are contained within large squares). Allover conserved binding site residues (Y28, S/T37, and W39) are colored in gray. Residues possibly responsible for ligand specificity are colored according to Figure 2Go.

 
Altogether, conserved residues were found mainly on the ligand-exposed face of the WW domain ß-sheet, supporting our approach of identifying functionally relevant residues. The identification of distinct and reasonable group consensus sequences further strengthens our WW domain classification.

Analysis of group-specific binding motifs
To identify the key residues of the ligand and to address the question of whether our classification shows group-specific binding motifs, substitutional analyses of the identified ligands were performed for all members of the Ra-, Rb-, L-, poly-P-, and poS/poT-group and for selected and predicted members of the Y-group.

In the case of the Y-group substitutional analyses of the respective ligands are already known for several members, for example, hYAP65 (Pires et al. 2001). The characteristic substitutional pattern could be reproduced for selected members of the Y-group and for WW domains predicted to be Y-group members (see below), like hBAG3 shown in Figure 5AGo. The residues P-3, P-2, and Y0 are the key residues recognized by WW domains of the Y-group (Aasland et al. 2002). The substitutions tolerated at positions -1 might be domain- but not group-specific. An amino acid in position -4 is not essential but necessary for binding with higher affinity, as shown by length analyses. Altogether, the conserved binding motif for the Y-group can be summarized as PPxY.



View larger version (43K):
[in this window]
[in a new window]
 
Figure 5. Substitutional analyses of ligands incubated with the respective WW domains. Each residue within the ligand was substituted by 19 naturally occurring L-amino acids (cysteine omitted). All spots in the left-most column are identical and represent the wild-type peptide. All other spots are single substitution analogs, with rows defining the sequence position that is substituted and columns defining the amino acid that replaces the wild-type residue. The relative spot intensities correlate qualitatively with the binding affinities (Kramer et al. 1999). (A) GTPPPPYTVG incubated with hBAG3 WW domain. (B) PPGPPPRGPPP incubated with hFE65 WW domain. (C) PPGPPPRGPPP incubated with hFBP21-2 WW domain. (D) APPTPPPLPP incubated with hFBP11-1 WW domain. (E) PPPLIPPPPPLPP incubated with yPRP40-2 WW domain. (F) TRHPPVLpoTPPDQE incubated with hPIN1 WW domain.

 
The Ra- and Rb-group members were analyzed with single substitution analogs of the peptide PPGPPPRGPPP (short R-peptide except C-terminal arginine). Group-specific and distinct binding motifs were identified for the Ra- and Rb-group (except for yYFB0 showing an intermediate pattern), with the representatives hFE65 and hFBP21-2 shown in Figure 5Go, B and C, respectively. In the case of the Ra-group P-5, P-3, P-2, and R0 were found to be the key residues, which could not be substituted at all (Fig. 5BGo). Positions -4 and -1 were less restricted, but showed a clear preference for glycine or proline, in case of position -4, and proline, in the case of position -1. Although still quite specific, P-6 could be replaced by hydrophobic residues. The four C-terminal positions seemed not to be important for recognition, which was confirmed by length analyses (data not shown). Therefore, the preferred binding motif for the Ra-group can be summarized as (p/{phi})P(p,g)PPpR.

In the case of the Rb-group the key residues comprise P-2, P-1, R0, G1, and P3 (Fig. 5CGo). P-3, P2, and P4 could be replaced by other amino acids, albeit resulting in lower binding affinity. Substitution of the 3 N-terminal positions by different amino acids had only minor effects on the binding affinity. Together this results in (p/{phi})PPRgpPp as the preferred binding motif for the Rb-group.

The main difference between the Ra- and Rb-group binding patterns is the relative position of the key residues with respect to the arginine. In the case of the Rb-group both N-terminal and C-terminal residues provide specificity, whereas in the case of the Ra-group only N-terminal residues are important. The binding pattern of the Ra-group member yYFB0 more closely resembles that of the Rb-group, and therefore, yYFB0 may represent an intermediate form. In fact, the sequence of yYFB0 differs significantly from the other Ra-group consensus in that it has a valine instead of a tryptophan at position 30, and therefore, it is located outside the Ra-branch in the tree.

For the interaction of L-group members with APPTPPPLPP (short L-peptide) P-2, P-1, L0, and P1 were identified as key residues (Fig. 5DGo). In addition, position 2 is essential, as shown by length analyses (data not shown), and preferably proline. This results in the L-group binding motif PPLPp.

To validate the hypothesis that the members of the poly-P-group specifically recognize stretches of proline only, a substitutional analysis of PPPLIPPPPPLPP (part of the long L-peptide) was performed. The specific ligand binding pattern of both WW domains comprised only a stretch of five prolines preceded by a hydrophobic residue (Fig. 5EGo), yielding the preferred poly-P-group binding motif (p/{xi})PPPPP.

Investigation of the poS/poT-group members with substitution analogs of poS/poT-containing peptides showed only two consecutive residues that are essential for recognition (Fig. 5FGo), with the pattern (poS/poT)P comprising the specifically recognized motif.

Taken together, the significance of arginine, leucine, phospho-serine/threonine, and proline only as the specificity-determining residues within the ligands for the respective groups was confirmed by these analyses. A general property of all ligand motifs, except for the poS/poT-group, is a pair of two consecutive prolines, which cannot be substituted at all. The position of this pair with respect to the specificity-determining residue differs between the groups. In the case of the Y-group and Ra-group, the two consecutive prolines are separated by one residue from the specificity-determining residue, whereas in the case of the Rb- and L-group this pair directly precedes the specificity-determining residue within the ligand.

The validity of our classification, until now based on the detected recognition profiles and group consensi, is confirmed by the finding of group-specific binding motifs.

Regulation by ligand phosphorylation
It has previously been shown that certain WW domain–ligand interactions may be regulated by ligand phosphorylation (Sudol and Hunter 2000). To investigate this further, a peptide library consisting of 600 potential tyrosine, serine, and threonine kinase target sequences (present in duplicate in phosphorylated and nonphosphorylated form) was screened with the poS/poT-group members hPIN1, yESS1 (Table 1Go). Both bound a large number of phospho-serine- or phospho-threonine- but no phospho-tyrosine-containing peptides. For hPIN1, selective binding to the phosphorylated peptide only was observed for all 38 binding sequences and dominantly for yESS1 (19 peptides in the phosphorylated form only, seven in both and five in the nonphosphorylated form only). In accordance with the substitutional analyses the common denominator of the identified peptides is (poS/poT)P, with the most strongly binding peptides containing {phi}{phi}poTPP. Interestingly, phospho-threonine-containing peptides bound, in general, more strongly compared to phospho-serine-containing peptides. Replacing phospho-serine by phospho-threonine in previously identified phospho-serine-containing ligands always increased affinity, substantiating this preference for phospho-threonine (data not shown).


View this table:
[in this window]
[in a new window]
 
Table 1. hPIN1 and yESS1 binding human kinase target sequences obtained by a phosphopeptide library screen

 
Following these results, an NMR screen of 39 of the 42 WW domains using a peptide derived from the phospho-peptide library (VLpoTPPDRL, "poT-peptide") was performed (Fig. 2Go, fourth result column). For seven WW domains, a chemical shift change was detected, and in three cases strong chemical shift perturbations were observed, although hFBP21-1 and -2 probably interacted with this peptide due to the presence of an arginine. The fact that the NMR screening results for the poT-peptide and the poS-model-peptide did not correlate (only hPIN1 and yYPR152C recognized both) can be attributed to the different positioning of the prolines with respect to the phosphorylated residue.

In addition to these phospho-serine/threonine-recognizing WW domains the NMR screening also showed that 20 of the 42 WW domains recognize the poY-peptide. This raised the question of WW domain selectivity towards peptides phosphorylated on tyrosine, and could imply a WW domain-poY-ligand interaction regulation. Therefore, we complemented the NMR screens by cellulose-bound model peptide arrays of the type GPPPPBG (where B = 19 natural L-amino acids, cysteine omitted, poS, poT, and poY) incubated with selected tyrosine and phospho-tyrosine recognizing domains. hYAP65, yRSP5-3, and mITCHY-1 bound peptides where B was F or Y (Fig. 6AGo). yRSP5-3 and mITCHY-1 additionally bound the peptide where B was poY, with mITCHY-1 preferring the poY-peptide. Fluorescence titrations of the Y- and poY-peptides with selected WW domains, which showed a chemical shift change larger than 0.15 ppm, yielded dissociation constants below 200 µM (Fig. 6BGo). This correlation between chemical shift change and dissociation constant supports our approach of estimating binding affinities from chemical shift changes. In general, a weak preference for the nonphosphorylated Y-peptide is observed, with the exception of mITCHY-1 binding with higher affinity to the poY-peptide. It is noteworthy that potential phosphorylation sites are predicted for B1PPB2Y-type motifs (where B1 preferentially equals S,E,D, ... and B2 preferentially equals D,K,E, ...) (Blom et al. 1999) and therefore, mITCHY-1 might be an example of phospho-tyrosine recognition.



View larger version (16K):
[in this window]
[in a new window]
 
Figure 6. Analysis of phospho-regulation by ligand array and fluorescence spectroscopy. (A) Interactions of peroxidase-labeled WW domains with 22 variants of the peptide GPPPPBG (where B = 19 natural L-amino acids, cysteine omitted, poS, poT, and poY). (B) Dissociation constants determined by fluorescence spectroscopy titrations. The standard deviation calculated while fitting the 1:1 complex model to the titration series data is given in parentheses after the corresponding dissociation constant.

 
Altogether, these results reveal a class of WW domains that bind phosphorylated peptides. Regulation of these interactions could occur via a positive switch as in the case of serine- or threonine-phosphorylation (Sudol and Hunter, 2000), which is supported by our results. In the case of tyrosine-phosphorylation, we could not verify the proposed negative switch hypothesis, as most nonphosphorylated tyrosine-containing peptides bound only slightly more strongly, with affinities less than a factor of 10 different. mITCHY-1 even preferred the phosphorylated peptide.

Prediction of WW domain binding activities
Based on our experimental results we attempted to predict tyrosine/phospho-tyrosine recognition propensity for the 482 WW domain sequences found in the SMART database (as of 11/15/2001, sequence redundancy was not removed) (Schultz et al. 1998). To achieve this, a profile HMM (Eddy 1998) based on the sequences of the Y-branch was calculated and used for classifying the WW domain sequences. Two hundred twenty-three WW domains were predicted as Y-branch members, including WW domains of the protein families NEDD4-like ubiquitin ligases (138 of 155 protein family members were classified into the Y-branch), membrane associated guanylate kinases (24 of 36), WW containing oxidoreductases (24 of 24), YAP and homologs (9 of 9), Dystrophin (5 of 21), and PEPP2 (2 of 2). Members of these protein families were already present in our set of 42 WW domains. New candidates for tyrosine/phospho-tyrosine recognition include WW domains of the BAG-family molecular chaperone regulator-3 (hBAG3, ID: O95817; mBAG3, ID: Q9JLV1), the syntaxin4-interacting protein (mSYNIP; ID: Q9WV89) and the zinc finger protein 2 from Trypanosoma brucei (tbZFP2; ID: AAK39107). Solid-phase binding studies with cellulose-bound model peptide arrays of the type GPPPPBG (where B = 19 natural L-amino acids, cysteine omitted, poS, poT, and poY) were performed with these candidates and all bound only peptides where B was F or Y (Fig. 6AGo). mSYNIP bound additionally GPPPPpoYG but with lower affinity. Substitutional analyses of the ligand GTPPPPYTVG incubated with these candidates revealed the Y-group-specific binding motif PPxY (shown for hBAG3 in Fig. 5AGo). This altogether validates our prediction for the Y-group.

The exclusive occurrence of all investigated NEDD4like ubiquitin ligase WW domains in the Y-branch (except hSMURF1-1) suggests that binding to tyrosine/phospho-tyrosine-containing peptides is a general feature of this protein family. Moreover, the application of the Y-branch profile HMM to the SMART database WW domain sequences classified the residual NEDD4-like ubiquitin ligases almost exclusively, except for hSMURF1-1 and hSMURF2-2 of various species, in the Y-branch, thus implicating similar binding characteristics. In fact, hSMURF1-1 and hSMURF2-2 and their homologs clearly deviate from the Y-group consensus. This classification by the profile HMM is consistent with in vitro and in vivo studies that indicate that these WW domains interact with proteins containing PY motifs, like ENaC (Kanelis et al. 2001), SMADs (Kavsak et al. 2000), RNA-Polymerase-II (Chang et al. 2000), NF-E2 (Gavva et al. 1997), Atrophin-1 (Wood et al. 1998), and Occludin (Traweger et al. 2002). This can be taken as statistical evidence for the in vivo relevance of this interaction.


    Materials and methods
 TOP
 Abstract
 Introduction
 Results and Discussion
 Materials and methods
 References
 
Synthesis of soluble peptides
WW domains and peptide ligands were synthesized according to standard Fmoc protocols with an automated peptide synthesizer (Abimed), purified using HPLC, and analyzed by MALDI-TOF mass spectrometry.

All WW domains were synthesized with a cysteine attached to their N-terminus either to allow for immobilization on a cellulose membrane or for labeling with peroxidase via a Michael-Addition (see sections below, concerning cellulose-bound peptide libraries). To avoid immobilization and labeling at unwanted residues, all cysteines naturally occurring within the WW domain sequences were substituted by serine.

NMR spectroscopy
WW domain NMR samples (200 or 400 µM) were prepared by dissolving 500 µg or 1 mg of lyophylized protein in 500 µL buffer (10 mM phosphate, 100 mM NaCl, 0.1 mM DTT, 0.1 mM EDTA, and 0.02% NaN3 adjusted to a pH of 6.0), followed by the addition of D2O (50 µL). Ligand stock solutions were prepared by dissolving 16 mg of peptide in 400 µL of the same buffer. Ligand was added to the WW domain samples in a twofold molar excess.

All 1D-1H NMR spectra were recorded on a Bruker DRX600 in standard configuration using an inverse triple resonance probe equipped with triple axis self shielded gradient coils. A sample changer able to handle 60 samples was used for the measurements. The spectrometer was operated using the program ICON-NMR on top of XWIN-NMR (Bruker). All experiments were performed at 285 K; water suppression was achieved using a 3–9–19WATERGATE sequence (Sklenar et al. 1993). Five hundred twelve and 1024 scans were taken for samples containing protein concentrations of 400 and 200 µM, respectively. Eight thousand one hundred ninety-two datapoints were recorded for each spectrum. Data were processed and analyzed using XWIN-NMR.

Cellulose-bound peptide libraries
Soluble synthetic WW domains, all N-terminally labeled with cysteine-aminohexanoic acid, were attached via the thiol group to a maleimide-functionalized Whatman 50 cellulose membrane (Whatman) using a spot robot ("WW array"). The membranes were derivatized with two ß-alanines (Kramer et al. 1999). Subsequently, N-maleoyl-ß-alanine (0.6 M in Dimethylacetamide) was activated with N,N‘-Diisopropylcarbodiimide (0.3 M) and coupled to the ß-alanine membrane. The WW domains were then spotted onto the membrane as 1 mM protein solutions (in 0.1 M phosphate buffer, pH 7.4, containing up to 20% Dioxane). The remaining maleimide functions were quenched with 0.3 M ß-Mercaptoethanol in phosphate-buffered saline, pH 7.4.

Arrays of single substitution analogs of ligands ("Substitutional analyses of ligands") and a library containing 600 phosphopeptides ("phosphopeptide library", see below) were generated by semiautomated spot synthesis (Frank 1992; Kramer and Schneider-Mergener 1998) (Abimed; Software LISA, Jerini AG) on Whatman 50 cellulose membranes as described (Kramer et al. 1999).

Phosphopeptide library
In the cellulose-bound phosphopeptide library 600 phosphorylation sites taken from the SWISS-PROT/TrEMBL or PhosphoBase v.2 (Kreegipuu et al. 1999) databases were included. They comprised 360 serine, 121 threonine, and 119 tyrosine phosphorylation sites, each of which was represented by one peptide spot in the library. The peptides had a length of 13 amino acids, with the respective phosphorylation site located at position 8. The whole library was synthesized in duplicate in a phosphorylated and nonphosphorylated form.

Binding studies of cellulose-bound peptides
The membrane-bound libraries (WW array, substitutional analyses of ligands, and phosphopeptide library) were prewashed once with ethanol and three times with Tris-buffered saline (TBS), pH 8.0, and then blocked for 4 h with blocking buffer (blocking reagent [Sigma-Genosys] in TBS containing 5% sucrose). Subsequently, the membranes were incubated with peroxidase-labeled target compounds (peptide ligands [10 µg/mL] or WW domains [50 µg/mL], respectively) in blocking buffer overnight at 4°C and then washed at least four times with TBS. For detection, a chemiluminescence system (Pierce) was applied, and detection was performed using a LumiImagerTM (Boehringer Mannheim GmbH). Peroxidase labeling of the ligands and WW domains was carried out as described (Toepert et al. 2001), linking a maleimide-activated horseradish peroxidase (Pierce) to their N-terminus through a cysteine, which was introduced during peptide synthesis.

Fluorescence spectroscopy
Changes in the fluorescence emission spectra of selected WW domains for varying concentrations of tyrosine- or phospho-tyrosine-containing ligands were used to determine dissociation constants. A 1:1 complex model was assumed for calculation of the dissociation constants. The samples were excited at a wavelength of 298 nm and a slit width of 5 nm. For each titration step the emission spectrum was recorded in the wavelength range of 320–420 nm using a slit width of 10 nm. Binding experiments were performed at 25°C (waterbath controlled) using a Perkin-Elmer Luminescence Spectrometer LS50B. Stock solutions of WW domains (100 µM) and peptide ligands (32 mM) were prepared in 40 mM phosphate buffer with 100 mM NaCl and 50 mM Dithiothreitol, pH 7.2. The final WW domain concentration was 10 µM, and the ligand was added to defined concentrations up to approx. 1 mM.

Tree calculation
The tree was constructed using residues on the ligand-exposed face of the ß-sheet [18, 20, 22–26, 28, 30, 32–35, 37, 39] and W17 only. The WW domain prototype sequence was also included (Macias et al. 2000). Based on sequence distances calculated with PROTDIST using the Dayhoff amino acid exchange model (PAM Matrix) the tree was calculated with NEIGHBOR, using the neighbor joining algorithm (PHYLIP package v3.6a [Felsenstein 1989]). The tree visualization was performed using TREEVIEW (Page 1996).

Profile hidden Markov model
Sequences of the Y-branch were compiled into a profile hidden Markov model calculated with HMMer 2.2 (Eddy 1998). By applying this profile HMM to the set of 42 WW domains, the profile HMM was validated and the classification threshold was determined. Using this threshold, the profile HMM was used to rank the 482 WW domain sequences from the SMART database (as of 11/15/2001, sequence redundancy was not removed) according to their homology with the model.


    Acknowledgments
 
This project is funded by the Berlin Structure Factory Project and the Deutsche Forschungsgemeinschaft (DFG).

The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.


    References
 TOP
 Abstract
 Introduction
 Results and Discussion
 Materials and methods
 References
 
Aasland, R., Abrams, C., Ampe, C., Ball, L.J., Bedford, M.T., Cesareni, G., Gimona, M., Hurley, J.H., Jarchau, T., Lehto, V.P., et al. 2002. Normalization of nomenclature for peptide motifs as ligands of modular protein domains. FEBS Lett. 513: 141–144.[CrossRef][Medline]

Bedford, M.T., Chan, D.C., and Leder, P. 1997. FBP WW domains and the Abl SH3 domain bind to a specific class of proline-rich ligands. EMBO J. 16: 2376–2383.[CrossRef][Medline]

Bedford, M.T., Sarbassova, D., Xu, J., Leder, P., and Yaffe, M.B. 2000. A novel pro-Arg motif recognized by WW domains. J. Biol. Chem. 275: 10359–10369.[Abstract/Free Full Text]

Blom, N., Gammeltoft, S., and Brunak, S. 1999. Sequence and structure-based prediction of eukaryotic protein phosphorylation sites. J. Mol. Biol. 294: 1351–1362.[CrossRef][Medline]

Chang, A., Cheang, S., Espanel, X., and Sudol, M. 2000. Rsp5 WW domains interact directly with the carboxyl-terminal domain of RNA polymerase II. J. Biol. Chem. 275: 20562–20571.[Abstract/Free Full Text]

Chen, H.I. and Sudol, M. 1995. The WW domain of Yes-associated protein binds a proline-rich ligand that differs from the consensus established for Src homology 3-binding modules. Proc. Natl. Acad. Sci. 92: 7819–7823.[Abstract/Free Full Text]

Eddy, S.R. 1998. Profile hidden Markov models. Bioinformatics 14: 755–763.[Abstract/Free Full Text]

Espanel, X. and Sudol, M. 1999. A single point mutation in a group I WW domain shifts its specificity to that of group II WW domains. J. Biol. Chem. 274: 17284–17289.[Abstract/Free Full Text]

Felsenstein, J. 1989. PHYLIP—Phylogeny inference package (version 3.2). Cladistics 5: 164–166.

Frank, R. 1992. Spot synthesis: An easy technique for the positionally addressable, parallel chemical synthesis on a membrane support. Tetrahedron 48: 9217–9232.[CrossRef]

Gavva, N.R., Gavva, R., Ermekova, K., Sudol, M., and Shen, C.J. 1997. Interaction of WW domains with hematopoietic transcription factor p45/NF-E2 and RNA polymerase II. J. Biol. Chem. 272: 24105–24108.[Abstract/Free Full Text]

Kanelis, V., Rotin, D., and Forman-Kay, J.D. 2001. Solution structure of a Nedd4 WW domain–ENaC peptide complex. Nat. Struct. Biol. 8: 407–412.[CrossRef][Medline]

Kavsak, P., Rasmussen, R.K., Causing, C.G., Bonni, S., Zhu, H., Thomsen, G.H., and Wrana, J.L. 2000. Smad7 binds to Smurf2 to form an E3 ubiquitin ligase that targets the TGF ß receptor for degradation. Mol. Cell 6: 1365–1375.[CrossRef][Medline]

Kramer, A. and Schneider-Mergener, J. 1998. Synthesis and screening of peptide libraries on continuous cellulose membrane supports. Methods Mol. Biol. 87: 25–39.[Medline]

Kramer, A., Reineke, U., Dong, L., Hoffmann, B., Hoffmuller, U., Winkler, D., Volkmer-Engert, R., and Schneider-Mergener, J. 1999. Spot synthesis: Observations and optimizations. J. Pept. Res. 54: 319–327.[CrossRef][Medline]

Kreegipuu, A., Blom, N., and Brunak, S. 1999. PhosphoBase, a database of phosphorylation sites: Release 2.0. Nucleic Acids Res. 27: 237–239.[Abstract/Free Full Text]

Macias, M.J., Hyvonen, M., Baraldi, E., Schultz, J., Sudol, M., Saraste, M., and Oschkinat, H. 1996. Structure of the WW domain of a kinase-associated protein complexed with a proline-rich peptide. Nature 382: 646–649.[CrossRef][Medline]

Macias, M.J., Gervais, V., Civera, C., and Oschkinat, H. 2000. Structural analysis of WW domains and design of a WW prototype. Nat. Struct. Biol. 7: 375–379.[CrossRef][Medline]

Macias, M.J., Wiesner, S., and Sudol, M. 2002. WW and SH3 domains, two different scaffolds to recognize proline-rich ligands. FEBS Lett. 513: 30–37.[CrossRef][Medline]

Page, R.D. 1996. TreeView: An application to display phylogenetic trees on personal computers. Comput. Appl. Biosci. 12: 357–358.[Free Full Text]

Pires, J.R., Taha-Nejad, F., Toepert, F., Ast, T., Hoffmuller, U., Schneider-Mergener, J., Kuhne, R., Macias, M.J., and Oschkinat, H. 2001. Solution structures of the YAP65 WW domain and the variant L30 K in complex with the peptides GTPPPPYTVG, N-(n-octyl)-GPPPY and PLPPY and the application of peptide libraries reveal a minimal binding epitope. J. Mol. Biol. 314: 1147–1156.[CrossRef][Medline]

Schultz, J., Milpetz, F., Bork, P., and Ponting, C.P. 1998. SMART, a simple modular architecture research tool: Identification of signaling domains. Proc. Natl. Acad. Sci. 95: 5857–5864.[Abstract/Free Full Text]

Sklenar, V., Piotto, M., Leppik, R., and Saudek, V. 1993. Gradient-tailored water suppression for H-1-N-15 Hsqc experiments optimized to retain full sensitivity. J. Magn. Reson. A 102: 241–245.[CrossRef]

Sudol, M. and Hunter, T. 2000. NeW wrinkles for an old domain. Cell 103: 1001–1004.[CrossRef][Medline]

Toepert, F., Pires, J.R., Landgraf, C., Oschkinat, H., and Schneider-Mergener, J. 2001. Synthesis of an array comprising 837 variants of the hYAP WW protein domain. Angew. Chem. Int. Ed. Engl. 40: 897–900.[CrossRef][Medline]

Traweger, A., Fang, D., Liu, Y.C., Stelzhammer, W., Krizbai, I.A., Fresser, F., Bauer, H.C., and Bauer, H. 2002. The tight junction-specific protein occludin is a functional target of the E3 ubiquitin-protein ligase itch. J. Biol. Chem. 277: 10201–10208.[Abstract/Free Full Text]

Verdecia, M.A., Bowman, M.E., Lu, K.P., Hunter, T., and Noel, J.P. 2000. Structural basis for phosphoserine-proline recognition by group IV WW domains. Nat. Struct. Biol. 7: 639–643.[CrossRef][Medline]

Wood, J.D., Yuan, J., Margolis, R.L., Colomer, V., Duan, K., Kushi, J., Kaminsky, Z., Kleiderlein, J.J., Sharp, A.H., and Ross, C.A. 1998. Atrophin-1, the DRPLA gene product, interacts with two families of WW domain-containing proteins. Mol. Cell. Neurosci. 11:149–160.[CrossRef][Medline]

Yaffe, M.B., Schutkowski, M., Shen, M., Zhou, X.Z., Stukenberg, P.T., Rahfeld, J.U., Xu, J., Kuang, J., Kirschner, M.W., Fischer, G., et al. 1997. Sequence-specific and phosphorylation-dependent proline isomerization: A potential mitotic regulatory mechanism. Science 278: 1957–1960.[Abstract/Free Full Text]


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?


This article has been cited by other articles:


Home page
J. Biol. Chem.Home page
Y. He, D. H. Hryciw, M. L. Carroll, S. A. Myers, A. K. Whitbread, S. Kumar, P. Poronnik, and J. D. Hooper
The Ubiquitin-Protein Ligase Nedd4-2 Differentially Interacts with and Regulates Members of the Tweety Family of Chloride Ion Channels
J. Biol. Chem., August 29, 2008; 283(35): 24000 - 24010.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
S. K. Shenoy, K. Xiao, V. Venkataramanan, P. M. Snyder, N. J. Freedman, and A. M. Weissman
Nedd4 Mediates Agonist-dependent Ubiquitination, Lysosomal Targeting, and Degradation of the {beta}2-Adrenergic Receptor
J. Biol. Chem., August 8, 2008; 283(32): 22166 - 22176.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
R. Mouchantaf, B. A. Azakir, P. S. McPherson, S. M. Millard, S. A. Wood, and A. Angers
The Ubiquitin Ligase Itch Is Auto-ubiquitylated in Vivo and in Vitro but Is Protected from Degradation by Interacting with the Deubiquitylating Enzyme FAM/USP9X
J. Biol. Chem., December 15, 2006; 281(50): 38738 - 38747.
[Abstract] [Full Text] [PDF]


Home page
Proc. Natl. Acad. Sci. USAHome page
M. Jager, Y. Zhang, J. Bieschke, H. Nguyen, M. Dendle, M. E. Bowman, J. P. Noel, M. Gruebele, and J. W. Kelly
Structure-function-folding relationship in a WW domain
PNAS, July 11, 2006; 103(28): 10648 - 10653.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
P. A. Chong, H. Lin, J. L. Wrana, and J. D. Forman-Kay
An Expanded WW Domain Recognition Motif Revealed by the Interaction between Smad7 and the E3 Ubiquitin Ligase Smurf2
J. Biol. Chem., June 23, 2006; 281(25): 17069 - 17075.
[Abstract] [Full Text] [PDF]


Home page
Sci SignalHome page
M. B. Yaffe
"Bits" and Pieces
Sci. Signal., June 20, 2006; 2006(340): pe28 - pe28.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
E.-J. Jeon, K.-Y. Lee, N.-S. Choi, M.-H. Lee, H.-N. Kim, Y.-H. Jin, H.-M. Ryoo, J.-Y. Choi, M. Yoshida, N. Nishino, et al.
Bone Morphogenetic Protein-2 Stimulates Runx2 Acetylation
J. Biol. Chem., June 16, 2006; 281(24): 16502 - 16511.
[Abstract] [Full Text] [PDF]


Home page
Mol. Cell. Biol.Home page
R. J. Ingham, K. Colwill, C. Howard, S. Dettwiler, C. S. H. Lim, J. Yu, K. Hersi, J. Raaijmakers, G. Gish, G. Mbamalu, et al.
WW Domains Provide a Platform for the Assembly of Multiprotein Networks
Mol. Cell. Biol., August 15, 2005; 25(16): 7092 - 7106.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
O. Y. Fedoroff, S. A. Townson, A. P. Golovanov, M. Baron, and J. M. Avis
The Structure and Dynamics of Tandem WW Domains in a Negative Regulator of Notch Signaling, Suppressor of Deltex
J. Biol. Chem., August 13, 2004; 279(33): 34991 - 35000.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
Y.-H. Jin, E.-J. Jeon, Q.-L. Li, Y. H. Lee, J.-K. Choi, W.-J. Kim, K.-Y. Lee, and S.-C. Bae
Transforming Growth Factor-{beta} Stimulates p300-dependent RUNX3 Acetylation, Which Inhibits Ubiquitination-mediated Degradation
J. Biol. Chem., July 9, 2004; 279(28): 29409 - 29417.
[Abstract] [Full Text] [PDF]


Home page
J. Virol.Home page
G. Heidecker, P. A. Lloyd, K. Fox, K. Nagashima, and D. Derse
Late Assembly Motifs of Human T-Cell Leukemia Virus Type 1 and Their Relative Roles in Particle Release
J. Virol., June 15, 2004; 78(12): 6636 - 6648.
[Abstract] [Full Text] [PDF]


Home page
J. Biol. Chem.Home page
A. Angers, A. R. Ramjaun, and P. S. McPherson
The HECT Domain Ligase Itch Ubiquitinates Endophilin and Localizes to the trans-Golgi Network and Endosomal System
J. Biol. Chem., March 19, 2004; 279(12): 11471 - 11479.
[Abstract] [Full Text] [PDF]


Home page
Protein Sci.Home page
C. M. Kraemer-Pecore, J. T.J. Lecomte, and J. R. Desjarlais
A de novo redesign of the WW domain
Protein Sci., October 1, 2003; 12(10): 2194 - 2205.
[Abstract] [Full Text] [PDF]


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited