|
|
||||||||
Howard Hughes Medical Institute and Department of Biochemistry, University of Texas Southwestern Medical Center, Dallas, Texas 75390-9050, USA
(RECEIVED July 3, 2006; FINAL REVISION August 24, 2006; ACCEPTED August 29, 2006)
| Abstract |
|---|
|
|
|---|
-adaptin, the synaptobrevin N-terminal domains sec22 and Ykt6, and the srx domain of the signal recognition particle receptor also regulate vesicle trafficking pathways by mediating SNARE fusion, recognizing specialized compartments, and interacting with small GTPases that resemble Ypt7. Keywords: fold recognition; SNARE-like superfamily; longin domain; DUF254; CHiPS domain; vesicle transport; lysosome-related organelles; Hermansky-Pudlak Syndrome; HPS-1; HPS-4; Mon1; Ccz1
| Introduction |
|---|
|
|
|---|
Deletion mutants of the yeast DUF254 protein Mon1 or of the protein Ccz1 display a similar phenotype to deletion of Ypt7, and are thought to act as a complex at the tethering stage of vesicle fusion to the yeast vacuole (Wang et al. 2002, 2003). DUF254 proteins have been identified in all major eukaryotic lineages (Cottage et al. 2004), and the Caenorhabditis elegans homolog (SAND-1) also functions in vesicle transport (Poteryaev and Spang 2005). Despite this defined functional preservation, the evolutionary conservation of the complex with Ccz1 remains unresolved. A conserved Ccz1 N-terminal sequence, termed the CHiPS domain, was recently identified in species from all major eukaryotic lineages, suggesting the potential for a preserved functional complex. The CHiPS domain is also present in a human gene (HPS-4) implicated in a known protein trafficking disorder to lysosome-related organelles called Hermansky-Pudlak Syndrome (Hoffman-Sommer et al. 2005).
We identify a conserved domain in DUF254 proteins homologous to domains of known structure that assemble into classic cargo vesicle adapter protein (AP) complexes (Collins et al. 2002; Heldwein et al. 2004) (
-adaptin and µ-adaptin N-terminal domain). In the Structural Classification of Proteins (SCOP) (Murzin et al. 1995), the identified adaptin structures belong to the SNARE-like superfamily also known as longin (Rossi et al. 2004). Longins possess a five-stranded, antiparallel
-sheet (order 21543), surrounded by
-helices on either side. Consistent with a role for DUF254 proteins in vesicle tethering, other known longin domains function in trafficking pathways by mediating SNARE fusion, by recognizing specialized compartments, and by interacting with GTP bound states of small GTPases (Rossi et al. 2004). Our analysis also assigns a longin-like fold to the conserved Ccz1/HPS-4 CHiPS domain, although the CHiPS sequences display little sequence similarity to the DUF254 longin domains. Finally, we identify the HPS-4 binding partner (HPS-1) as a homolog of DUF254 members, supporting an evolutionary conserved role for these complexes in vesicular traffic to specialized lysosome-related compartments.
| Results and Discussion |
|---|
|
|
|---|
The DUF254 longin domain assignment was further justified with results from fold recognition servers assembled by the 3D-JURY meta-server (Ginalski et al. 2003). Reliable scores were assigned to
-adaptin longin domains (1gw5S, 62.00 and 1w63q, 61.43); and all of the top-scoring hits were to longin folds: µ-adaptin N-terminal domain (1w63M, 40), sedlin (1h3qA, 31.71), gliding protein Mglb (1j3wA, 29.00), and signal recognition particle receptor (srx)
subunit (1nrjA, 27.57 and 2fh5A, 27.29). These results agree with a previous report identifying adaptin (1gw5), among other
/
folds, as a potential DUF254 domain (Cottage et al. 2004). Finally, the top ROSETTA (Rohl et al. 2004) fragment assembly model displays a longin-like topology (Fig. 1B). With an exception of the C-terminal helix (shown in white), this model includes all of the µ-adaptin core secondary structural elements (Fig. 1A, yellow strands and blue helices).
|
1,
4, and
5, that are surrounded on either side by helices
1 and
2 (Fig. 1A,D), and the predicted secondary structure topology (






) of the sequences.
HPS-1/DUF254 longin-like domains function in an evolutionary conserved complex
Identification of HPS-1 as a DUF254 homolog supports an evolutionary conserved function of these proteins. The best characterized DUF254 representative (Mon1) functions as a complex with the CHiPS domain protein (Ccz1) in vesicle trafficking pathways leading to the yeast vacuole (Wang et al. 2002, 2003), an organelle that functions analogously to the mammalian lysosome. The CHiPS domain is also found in HPS-4 (Hoffman-Sommer et al. 2005), a protein that interacts with HPS-1 to regulate various lysosome-related organelles (Wei 2006). Interestingly, the genes encoding the yeast DUF254 complex have expanded in human genomes, which contain two closely related DUF254 sequences (Cottage et al. 2004) and the more distantly related HPS-1. This expansion perhaps coincides with specialized cell types that have evolved different lysosome-related organelles. Whether DUF254 proteins in other species interact with CHiPS domains remains to be determined.
DUF254 longin-like domains are homologs of CHiPS domains
The CHiPS domain identified in HPS-4/Ccz1 contains a homologous stretch of
200 residues (Hoffman-Sommer et al. 2005), with a hydrophobicity pattern and predicted secondary structure topology resembling the longin fold. Extensive PSI-BLAST searches with CHiPS queries identified both DUF254 and µ-adaptin longin domains with modest scores (E-values <1). For example, the CHiPS sequence (gi|76154640, range 3165) finds the DUF254 sequence (gi|66508937, range 131246, sixth iteration, E-value 0.013). More sensitive profile-based searches using CHiPS alignments also revealed links to the DUF254: an HPS-4 alignment confidently identified as the top ranked hit a domain of unknown function (DUF1712, E-value 1.8 e06) that includes CHiPS sequences, followed by DUF254 (E-value 6.86e05).
The fold recognition meta-server identified longin structures as top hits to the Ccz1 CHiPS domain: sedlin (1h3qA, 52.17), synaptobrevin (1h8m, 48.83),
-adaptin (1gw5S, 45.17 and 1w63Q, 45.17), µ-adaptin (1w63M, 42.00), vesicle trafficking protein sec22
(1ifq, 36.83), and srx (1nrjA, 36.83); and a meta-server component (BASIC) (Ginalski et al. 2004) identified DUF254 as a confident hit (score 14.17). The top-ranked CHiPS domain ROSETTA model (Fig. 1C) resembles the DUF254 model (Fig. 1B), except for a helix replacing the edge
strand (
2) of the core sheet, although other CHiPS domain secondary structure predictions (Hoffman-Sommer et al. 2005) assign a strand to this region. These collective results support a homologous relationship between the DUF254 longin domain and the CHiPS domain. Such a relationship mimics the AP complex
-adaptin and µ-adaptin longins (Collins et al. 2002; Heldwein et al. 2004), which are thought to have arisen from a genetic duplication in spite of retaining little sequence similarity (Boehm and Bonifacino 2001).
Diverse longin-domain complexes interact with small GTPases
The identified DUF254 homolog µ-adaptin functions as a component of AP complexes that link clathrin to specific membrane cargo and lipids during vesicle budding. AP complex structures AP-1 (Heldwein et al. 2004) and AP-2 (Collins et al. 2002) contain related sets of subunits: two trunk domains (
and
1 in AP-1;
and
2 in AP-2) and two longin-domain subunits (
1 and µ1 in AP-1;
2 and µ2 in AP-2). The longin domains stabilize the core of the tetramer, with each making specific interactions to the other three subunits. AP complexes distinguish membrane compartments by "coincidently" recognizing phosphoinositide (PI-4-P for AP-1) and an organelle specific GTPase (Arf1 for AP-1) (Heldwein et al. 2004). Interestingly, at the Mon1/Ccz1-mediated tethering stage of vacuole fusion (Wang et al. 2003), the Ypt7 GTPase effector complex c-VPS/HOPS also appears to bind phosphoinositide (Stroupe et al. 2006). Perhaps the DUF254/CHiPS longins help organize the effector complex to coincidently recognize phosphoinositide and Ypt7 GTPase to specifically recognize the vacuolar membrane.
Other identified longins function as small GTPase effectors. Their interaction is defined structurally in signal recognition particle receptors (Schwartz and Blobel 2003; Schlenker et al. 2006), which govern cotranslational targeting of secratory and membrane proteins to the endoplasmic reticulum. In these structures, conserved srx longin domain residues shape the GTPase interface (Fig. 2A). An invariant
1
2 loop glycine and a somewhat less conserved
1 helix polar residue dictate interactions with GTPase switch loops (Fig. 2A, black), whose conformations are defined by nucleotide (Fig. 2A, red bonds). Thus, conserved srx residues help identify the longin domain as a GTPase effector (Schwartz and Blobel 2003; Schlenker et al. 2006). To help identify potential functional sites of DUF254/CHiPS longin domains, family sequence conservations were mapped to the srx structure (Fig. 2, B and C, respectively). DUF254/CHiPS conservations map to the same
1
2 loop and
1 helix vicinity (Fig. 2B,C) identified by srx conservations, perhaps suggesting a similar effector interaction for these longin domains. The invariant srx glycine is also an invariant glycine in DUF254 sequences, but is not conserved in
1
2 loop insertions of CHiPS sequences. The srx
1 polar residue is a conserved small residue (mostly glycine) in DUF254 and CHiPS sequences (Fig. 1C). Substitution of a small residue at this polar position might allow a backbone hydrogen bond to replace an ionic bond. Alternatively, another conserved polar position (like the invariant DUF254
1
2 loop lysine) could substitute for this role.
|
| Materials and methods |
|---|
|
|
|---|
To further justify homology, identified sequences were grouped using linkage clustering (0.6 bit per site threshold) (SEALS Package; Walker and Koonin 1997), and the resulting groups were aligned using MAFFT (Katoh et al. 2005) with iterative refinement (ver 5.743; FFT-NS-I option, default values). MAFFT alignments were used to search profile databases KOG (Tatusov et al. 2003) or PFAM (Bateman et al. 2004) using COMPASS (Sadreyev and Grishin 2003).
Identifying longin structures
Mon1 DUF254 (gi|6321314) and HPS-1 N terminus (gi|33286416, range 1135) were submitted to 3D-Jury (Ginalski et al. 2003). Scores >50 were considered significant (>90% correct; Ginalski and Rychlewski 2003). Individual scores and alignments from a component of 3D-Jury, meta-BASIC, also substantiated homologs. Meta-BASIC scores >12 were considered confident (<5% probability of being incorrect; Ginalski et al. 2004).
Protein structure prediction from a ROSETTA fragment assembly (Rohl et al. 2004) was applied to a DUF254 longin domain (gi|66508937, range 117235) and a CHiPS longin domain (gi|49524510, range 11179). For each target sequence, 1000 independent fold decoys were clustered based on RMSD. The coordinates for the center decoy of the cluster containing the most decoys was used to generate DUF254 and CHiPS models using MolScript (Esnouf 1999).
Multiple sequence-structure alignment
Superposition and alignment of identified structures were carried out using VAST (Madej et al. 1995), with some manual adjustments. Multiple sequence alignments generated by MAFFT (Katoh et al. 2005), corresponding to conserved core elements of identified families, were mapped to the resulting structure alignments, using as guides secondary structure predictions from JPRED (Cuff et al. 1998) and Rosetta models (Rohl et al. 2004) and alignments from PSI-BLAST (Altschul et al. 1997), Meta-BASIC (Ginalski et al. 2004), and COMPASS (Sadreyev and Grishin 2003).
Family conservation mappings
To visualize and compare longin conservations, sequences from each longin-like family (srx, DUF254, and CHiPS) were collected with PSI-BLAST (described above). Srx sequences (excluding fragments) belonging to groups with known structures (1nrj and 2fh5) and all DUF254 or CHiPS sequences (excluding fragments and distant HPS1/HPS4 sequences) identified in the initial round of PSI-BLAST were aligned as described above (Katoh et al. 2005). Positional conservations of alignment columns were calculated using an al2co entropy-based conservation measure (Pei and Grishin 2001), and were mapped to srx (1nrjA) based on the multiple sequence-structure alignment, with a rainbow color ramp from blue (least conserved, al2co score 1.2 as minimum value) to red (most conserved, al2co score 2.35 as a maximum value) using MolScript (Esnouf 1999).
| Footnotes |
|---|
Article and publication are at http://www.proteinscience.org/cgi/doi/10.1110/ps.062419006.
| Acknowledgments |
|---|
|
|
|---|
| References |
|---|
|
|
|---|
Bateman, A., Coin, L., Durbin, R., Finn, R.D., Hollich, V., Griffiths-Jones, S., Khanna, A., Marshall, M., Moxon, S., and Sonnhammer, E.L., et al. 2004. The Pfam protein families database. Nucleic Acids Res. 32: (Database issue): D138D141.
Behnia, R. and Munro, S. 2005. Organelle identity and the signposts for membrane traffic. Nature 438: 597604.[CrossRef][Medline]
Boehm, M. and Bonifacino, J.S. 2001. Adaptins: The final recount. Mol. Biol. Cell 12: 29072920.
Bonifacino, J.S. and Glick, B.S. 2004. The mechanisms of vesicle budding and fusion. Cell 116: 153166.[CrossRef][Medline]
Collins, B.M., McCoy, A.J., Kent, H.M., Evans, P.R., and Owen, D.J. 2002. Molecular architecture and functional model of the endocytic AP2 complex. Cell 109: 523535.[CrossRef][Medline]
Cottage, A., Mullan, L., Portela, M.B., Hellen, E., Carver, T., Patel, S., Vavouri, T., Elgar, G., and Edwards, Y.J. 2004. Molecular characterisation of the SAND protein family: A study based on comparative genomics, structural bioinformatics and phylogeny. Cell. Mol. Biol. Lett. 9: 739753.[Medline]
Cuff, J.A., Clamp, M.E., Siddiqui, A.S., Finlay, M., and Barton, G.J. 1998. JPred: A consensus secondary structure prediction server. Bioinformatics 14: 892893.
Esnouf, R.M. 1999. Further additions to MolScript version 1.4, including reading and contouring of electron-density maps. Acta Crystallogr. D Biol. Crystallogr. 55: 938940.[CrossRef][Medline]
Ginalski, K. and Rychlewski, L. 2003. Detection of reliable and unexpected protein fold predictions using 3D-Jury. Nucleic Acids Res. 31: 32913292.
Ginalski, K., Elofsson, A., Fischer, D., and Rychlewski, L. 2003. 3D-Jury: A simple approach to improve protein structure predictions. Bioinformatics 19: 10151018.
Ginalski, K., von Grotthuss, M., Grishin, N.V., and Rychlewski, L. 2004. Detecting distant homology with Meta-BASIC. Nucleic Acids Res. 32: W576W581.
Heldwein, E.E., Macia, E., Wang, J., Yin, H.L., Kirchhausen, T., and Harrison, S.C. 2004. Crystal structure of the clathrin adaptor protein 1 core. Proc. Natl. Acad. Sci. 101: 1410814113.
Hoffman-Sommer, M., Grynberg, M., Kucharczyk, R., and Rytka, J. 2005. The CHiPS DomainAncient traces for the Hermansky-Pudlak syndrome. Traffic 6: 534538.[CrossRef][Medline]
Katoh, K., Kuma, K., Toh, H., and Miyata, T. 2005. MAFFT version 5: Improvement in accuracy of multiple sequence alignment. Nucleic Acids Res. 33: 511518.
Madej, T., Gibrat, J.F., and Bryant, S.H. 1995. Threading a database of protein cores. Proteins 23: 356369.[CrossRef][Medline]
Murzin, A.G., Brenner, S.E., Hubbard, T., and Chothia, C. 1995. SCOP: A structural classification of proteins database for the investigation of sequences and structures. J. Mol. Biol. 247: 536540.[CrossRef][Medline]
Pei, J. and Grishin, N.V. 2001. AL2CO: Calculation of positional conservation in a protein sequence alignment. Bioinformatics 17: 700712.
Poteryaev, D. and Spang, A. 2005. A role of SAND-family proteins in endocytosis. Biochem. Soc. Trans. 33: 606608.[CrossRef][Medline]
Rohl, C.A., Strauss, C.E., Misura, K.M., and Baker, D. 2004. Protein structure prediction using Rosetta. Methods Enzymol. 383: 6693.[Medline]
Rossi, V., Banfield, D.K., Vacca, M., Dietrich, L.E., Ungermann, C., D'Esposito, M., Galli, T., and Filippini, F. 2004. Longins and their longin domains: Regulated SNAREs and multifunctional SNARE regulators. Trends Biochem. Sci. 29: 682688.[CrossRef][Medline]
Sadreyev, R. and Grishin, N. 2003. COMPASS: A tool for comparison of multiple protein alignments with assessment of statistical significance. J. Mol. Biol. 326: 317336.[CrossRef][Medline]
Schlenker, O., Hendricks, A., Sinning, I., and Wild, K. 2006. The structure of the mammalian signal recognition particle (SRP) receptor as prototype for the interaction of small GTPases with Longin domains. J. Biol. Chem. 281: 88988906.
Schwartz, T. and Blobel, G. 2003. Structural basis for the function of the
subunit of the eukaryotic signal recognition particle receptor. Cell 112: 793803.[CrossRef][Medline]
Stroupe, C., Collins, K.M., Fratti, R.A., and Wickner, W. 2006. Purification of active HOPS complex reveals its affinities for phosphoinositides and the SNARE Vam7p. EMBO J. 25: 15791589.[CrossRef][Medline]
Tatusov, R.L., Fedorova, N.D., Jackson, J.D., Jacobs, A.R., Kiryutin, B., Koonin, E.V., Krylov, D.M., Mazumder, R., Mekhedov, S.L., and Nikolskaya, A.N., et al. 2003. The COG database: An updated version includes eukaryotes. BMC Bioinformatics 4: 41.[CrossRef][Medline]
Walker, D.R. and Koonin, E.V. 1997. SEALS: A system for easy analysis of lots of sequences. Proc. Int. Conf. Intell. Syst. Mol. Biol. 5: 333339.[Medline]
Wang, C.W., Stromhaug, P.E., Shima, J., and Klionsky, D.J. 2002. The Ccz1-Mon1 protein complex is required for the late step of multiple vacuole delivery pathways. J. Biol. Chem. 277: 4791747927.
Wang, C.W., Stromhaug, P.E., Kauffman, E.J., Weisman, L.S., and Klionsky, D.J. 2003. Yeast homotypic vacuole fusion requires the Ccz1-Mon1 complex during the tethering/docking stage. J. Cell Biol. 163: 973985.
Wei, M.L. 2006. Hermansky-Pudlak syndrome: A disease of protein trafficking and organelle function. Pigment Cell Res. 19: 1942.[CrossRef][Medline]
Wickner, W. and Haas, A. 2000. Yeast homotypic vacuole fusion: A window on organelle trafficking mechanisms. Annu. Rev. Biochem. 69: 247275.[CrossRef][Medline]
![]()
CiteULike
Connotea
Del.icio.us
Digg
Reddit
Technorati What's this?
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |