|
|
||||||||
1 Howard Hughes Medical Institute, Molecular Biology Institute, and UCLA-DOE Center for Genomics and Proteomics, Los Angeles, California 90095-1570, USA
2 Department of Chemistry and Biochemistry and Department of Biological Chemistry, University of California Los Angeles, Los Angeles, California 90095, USA
Reprint requests to: David Eisenberg, Department of Chemistry and Biochemistry and Department of Biological Chemistry, University of California Los Angeles, Los Angeles, CA 90095, USA; e-mail: david{at}mbi.ucla.edu; fax: (310) 206-3914.
(RECEIVED July 19, 2002; FINAL REVISION December 13, 2002; ACCEPTED December 19, 2002)
Article and publication are at http://www.proteinscience.org/cgi/doi/10.1110/ps.0224703.
| Abstract |
|---|
|
|
|---|
Keywords: Ribonucleoprotein; Sm protein; protein polymerization; uridine binding; OB fold
Abbreviations: Afu, Archaeoglobus fulgidus DTT, dithiothreitol EM, electron microscopy MPD, 2-methyl-2,4-pentanediol Mth, Methanobacterium thermautotrophicum NCS, noncrystallographic symmetry nt, nucleotide OB-fold, oligosaccharide/oligonucleotide-binding fold Pae, Pyrobaculum aerophilum ss(D/R)NA, single-stranded (D/R)NA SmAP, Sm-like archaeal protein snRNP, small nuclear ribonucleoprotein UMP, uridine-5'-monophosphate wt, wild type
| Introduction |
|---|
|
|
|---|
Extensive biochemical and genetic data have shown that a key step in snRNP assembly is stepwise binding of seven cytoplasmic Sm proteins to exported snRNAs (Will and Luhrmann 2001). Each U snRNP is a complex formed from an ~110180-nucleotide (nt) snRNA and two classes of proteins: (1) snRNP-specific proteins that confer snRNP-specific functions (e.g., U1A protein of U1 snRNPs) and (2) the Sm or Sm-like (Lsm) proteins that are common to each snRNP core (Will and Luhrmann 1997). The snRNAs contain a single Sm or Lsm binding site with the uridine-rich consensus sequence PuAU~46GPu (Pu = purine). However, specificity for this sequence is not stringent and there can be redundancy in Sm-snRNA binding (Jones and Guthrie 1990). The Sm sites are predicted to be single-stranded RNA regions flanked by stem-loop structures (Burge et al. 1999; Yu et al. 1999). Sm binding is highly sensitive to modifications of the flanking stem-loops and the Sm site of a given snRNA, and varies from one snRNA to another (Jarmolowski and Mattaj 1993). Sm-snRNA binding also may be modulated by interactions between certain Sm proteins and the survival of motor neurons (SMN) protein complex (Selenko et al. 2001), and by symmetric dimethylation of arginine residues in some of the RG dipeptide repeats of Sm (Brahms et al. 2000; Friesen et al. 2001; Meister et al. 2001) and Lsm (Brahms et al. 2001) proteins by a putative "methylosome" (Friesen et al. 2002). In eukaryotes, Sm D1D2 and EFG heteromers simultaneously bind to snRNA to yield a "subcore" snRNP complex (Raker et al. 1996, 1999; Will and Luhrmann 2001). The final component to join the Sm complex is the B/B'D3 heterodimer, and this triggers hypermethylation of the 5' m7G cap of snRNA to a trimethylated guanosine cap (m3G). The m3G cap and the snRNASm core complex form a bipartite nuclear localization signal that results in transit of the snRNP core to the nucleus, where association of various snRNP-specific proteins completes the assembly process.
The importance of Sm proteins in RNP assemblies is underscored by their phylogenetic distribution: In addition to the canonical Sm and Lsm proteins found in eukaryotes ranging from yeast to humans, an Sm-like archaeal protein ("SmAP") family has been discovered (Salgado-Garrido et al. 1999; Mura et al. 2001). The recent demonstration that the E. coli bacteriophage host factor Hfq is an Sm-like protein provides the first example of a eubacterial Sm protein (Moller et al. 2002; Zhang et al. 2002). These results imply fundamental roles for Sm proteins in the early evolution of RNA metabolism. Sm proteins probably mediate critical RNA-RNA, RNA-protein, and protein-protein interactions in snRNP cores. The vast network of protein-protein interactions in which Sm proteins participate was recently suggested by genome-wide two-hybrid screens of yeast Lsm proteins (Fromont-Racine et al. 2000).
Sm proteins have a tendency to associate into cyclic oligomers. Prompted by biochemical and genetic data, electron microscopic (EM) investigations of U snRNP particles revealed the "doughnut-shaped" ultrastructure of Sm and Lsm cores (Kastner et al. 1990; Achsel et al. 1999). The realization that Sm and Lsm proteins occur in groups of at least seven paralogs within the genome of a given organism suggests that snRNP cores are formed from Sm heteroheptamers, and two recent results verify this. First, Stark et al. (2001) reconstructed a 10 Å-resolution map of the U1 snRNP by cryo-EM and found that a model of the Sm heptamer could be docked into the ring-shaped body of the snRNP. Next, the in vivo stoichiometry of Sm proteins in yeast spliceosomal snRNPs was determined by a differential tag/pull-down assay, showing that the snRNP core domain contains a single copy of each of the seven Sm proteins (Walke et al. 2001). Stable subheptameric Sm complexes have been suggested as intermediates along the snRNP core assembly pathway (e.g., a D1D2EFG complex that binds snRNA; Raker et al. 1996), and ultracentrifugation and EM show that some of these oligomers can form ring-like structures that resemble intact, heptameric snRNP cores (e.g., a (EFG)2 hexamer in Plessel et al. 1997). Such findings emphasize the importance of cyclic Sm heptamers in the snRNP core, and raise the possibility of other oligomeric states.
There is no atomic-resolution structure of a eukaryotic snRNP core. Nonetheless, the crystal structures of Sm-like archaeal proteins from Afu (Toro et al. 2001), Pae (Mura et al. 2001), and Mth (Collins et al. 2001) reveal a cyclic Sm homoheptamer and provide a model for snRNA binding in the snRNP core. Sm monomers fold as strongly bent, five-stranded antiparallel ß-sheets (Kambach et al. 1999a) and form toroidal heptamers that surround a conserved cationic pore. The inner surface of this pore appears to be the oligouridine-binding site. The similarity between SmAP1 monomer and dimer structures and the nearly identical human Sm D1D2 and D3B heterodimers (Kambach et al. 1999b) supports SmAP-based models for the heptameric snRNP core.
| Results |
|---|
|
|
|---|
|
Comparisons of known SmAP monomer, dimer, and heptamer structures
Several structures of Sm proteins and SmAPs are now available, making possible the comparative structural analyses of these proteins, and revealing the strict conservation of the Sm fold. The Mth heptamer structures reported here are virtually identical to the Mth SmAP1 structure reported by Collins et al. (2001), for example, 0.65 Å RMSD for superimposition of the P1 heptamer using main chain atoms. Pairwise comparisons of the Pae, Mth, and Afu SmAP1s show that the compact, ~80-amino acid SmAP1 monomer structures are nearly identical (Fig. 1
, Table 2
). The most similar monomer structures are the Afu/Mth pair (0.51 Å RMSD), and the most dissimilar are Mth/Pae SmAP1 (1.02 Å RMSD). These values do not correlate to pairwise sequence similarities. The overall structure of the dimer interface is strictly conserved between SmAPs and human Sm heterodimers, as emphasized by the view in Figure 1
. Greater RMSDs for heptamer compared to dimer alignments (and dimer compared to monomer alignments) suggests that a large fraction of the structural variation in higher-order SmAP oligomers is due to rigid-body reorientation of monomers with respect to one another. Mapping of the phylogenetic conservation of SmAP residues onto the Pae, Mth, or Afu heptamer structures shows that most of the conserved residues cluster about the cationic pore region (data not shown). The calculated electrostatic potential of the Mth SmAP1 surface reveals a strongly acidic loop-4 (L4) face, as found for Pae SmAP1 (Mura et al. 2001).
|
|
The results of equilibrium sedimentation analyses of wt Mth, wt Pae, and the C8S mutant of Pae SmAP1 reveal the oligomeric states of these SmAP1s in solution, as shown in Figure 2
. Molecular weights were estimated by fitting experimental curves to single exponential models. The calculated molecular weight of wt Pae (Fig. 2A
) suggests that it exists as a 14-mer. Because other data also suggested a disulfide-bonded 14-mer, the single cysteine of Pae SmAP1 was mutated to serine to give the C8S mutant of Pae SmAP1. Sedimentation results with this mutant can be fit only by species with molecular weights much less than that of a heptamer (e.g., the 46.7-kD species shown in Fig. 2B
), suggesting a pentamer (n = 5 gives a Mr of ~45 kD). The monodispersity of the data in Figure 2B
suggests a single, stable subheptameric complex, although a rapidly exchanging mixture of several states (e.g., tetramers, pentamers, and hexamers) cannot be ruled out. In contrast to Pae, sedimentation equilibrium data for Mth SmAP1 show that it only forms a stable, monodisperse heptamer (Fig. 2C
). The concentration dependence of the experimentally calculated Mrs (not shown), as well as the slight upward concavity of the residuals in Figure 2B,C, provide additional evidence for Pae and Mth SmAP1 monomer
oligomer association reactions.
|
|
|
The Mth P212121 crystal structure provides a model for the atomic structure of SmAP1 fibers. In the Mth P1 and P212121 lattices, SmAP1 heptamers form quasihexagonal layers that stack upon one another to give a crystal. In the P1 form these layers are staggered; however, in the P212121 form these layers are in register. Figure 4B
shows how the head-to-tail stacking of SmAP1 heptamers in this crystal form produces cylindrical tubes. A slight tilt of each heptamer with respect to the tube axis (~15°) results in the SmAP1 sevenfold axes being parallel, but not coaxial. Because they are formed by head-to-tail stacking of asymmetric heptamers, these tubes have a defined polarity, and, when rendered as molecular surfaces, they bear a striking resemblance to the EM fibers shown in Figure 3
. The tubes are also consistent with EM fiber dimensions. In addition to providing insights into polymerization and oligomerization states, two of the crystal forms (Pae C2221 and Mth P21) were used to investigate the ligand-binding properties of SmAP1s.
Crystal structures of Mth and Pae SmAP1 bound to various ligands
The 1.90 Å-resolution crystal structure of Mth SmAP1 bound to uridine-5'-monophosphate (UMP) is shown in Figure 5
. The protein was cocrystallized with this ribonucleotide in an effort to determine its likely RNA-binding site (cocrystallization efforts were unsuccessful with single-stranded DNA or RNA oligonucleotides). As shown in Figure 5A
, SmAP1 binds UMP with a 1:1 stoichiometry, so that 14 UMPs are bound to the 14-mer near the pore region. The UMPs bind near the flat face of the Mth heptamer, opposite the highly acidic loop L4 face. The structure of the SmAP1UMP complex is shown in more detail in Figure 5B
, which shows that the binding site is well defined by electron density. The uracil ring intercalates between the guanidinium group of Arg72 and the imidazole ring of His46 (both of these residues are highly conserved in SmAPs). The planes of these three moieties are spaced ~3.6 Å apart, as expected for energetically favorable stacking interactions between conjugated
-systems. Individual protein-UMP contacts are discussed in greater detail below.
|
A UMP binding site was found in the Pae SmAP1UMP co-crystal structure as well, but it is not as clearly defined by electron density as is Mth SmAP1UMP. The PaeUMP structure, which was refined to a resolution of 2.05 Å, is shown in Figure 5C
. UMPs bind to the same face of the heptamer as in Mth (i.e., the "flat face" opposite L4), but are much more distant from the pore. Only the planar uracil fragment of UMP is clearly defined in 2|Fo| - |Fc| electron density maps, and protein-UMP contacts are scarce in this binding site (Pae SmAP1 residues in the region of this uridine are not very conserved). The only close UMP contact is made by the side chain of Asn46, but the geometry of the Asn46...UMP interaction does not satisfy standard hydrogen bond criteria (in terms of both distances and angles), and favorable interactions probably do not exist between the UMP O4 oxygen and the amide nitrogen of the Asn46 side chain, or between the UMP N3 nitrogen and the amide oxygen of Asn46. Also, there are no aromatic side chains in this region to participate in
-stacking interactions with the uracil base. As in Mth SmAP1, additional small-molecule binding sites exist in Pae SmAP1: Many of the modeled glycerol molecules are bound identically near the loop L4 faces (Fig. 5C
).
The structure of an Afu SmAP1U3 complex was recently determined by Toro et al. (2001) and reveals a similar mode of uridine recognition in Afu and Mth SmAP1. The UMP binding site and SmAP1...UMP interactions clearly differ in Mth and Pae SmAP1, and, because the binding site was poorly resolved in the PaeUMP complex, this structure was not included in the comparative analysis shown in Figure 6
. In the Mth and Afu structures, the aromatic pyrimidine ring intercalates between the side chains of the highly conserved Arg/His pair, and specific recognition is achieved by hydrogen bonding of the uracil ring to the side chain of a strictly conserved asparagine residue (Asn48Mth). The main chain amide nitrogen of a highly conserved aspartate (Asp74Mth) also participates in hydrogen bonding to a uracil carbonyl oxygen. The pattern of hydrogen bond donors/acceptors in the Asn48/Asp74Mth pair makes binding specific for a uracil (if RNA) or thymine (if DNA) base. Additional specificity for uracil may be achieved by two means: (1) recognition of the 2' hydroxyl of the ribose (RNA vs. DNA discrimination) and (2) the C5 carbon of the pyrimidine ring of uracil is only 3.8 Å from the backbone carbonyl oxygen of Leu45Mth from an adjacent monomer, thus providing steric and polar discrimination against the methyl on the C5 carbon of thymine. We crystallized Pae and Mth SmAP1 in the presence of various other nucleoside monophosphates (e.g., AMP, CMP, GMP), but there was no evidence for binding of these non-uridine NMPs (data not shown). The only significant differences in uridine recognition by Mth and Afu SmAP1 are highlighted by two arrows in Figure 6B
. These are: (1) hydrogen bonding of an Mth Arg72 side chain from an adjacent monomer to the 2' hydroxyl of the ribose, and (2) hydrogen bonding between a phosphate oxygen and an imidazole nitrogen from the His46Mth residue of an adjacent monomer. Overall, it appears that the mode of uridine recognition is conserved in the SmAP family.
|
|
| Discussion |
|---|
|
|
|---|
-helix, Sm and SmAP structures closely resemble proteins of the oligosaccharide/oligonucleotide binding (OB) fold family (Murzin 1993). The SmSm interface is mostly formed by hydrogen bonding between mainchain atoms of ß4 and ß5 strands from adjacent monomers, thus explaining the reduced sequence conservation of interfacial residues. The recent solution structure of the SMN Tudor domain (Selenko et al. 2001), which interacts with Sm proteins to form snRNP cores, provides the unexpected result that the SMN and Sm monomers have similar OB-like folds.
The Afu, Mth, and Pae SmAP1 heptamer structures are similar (Table 2
), primarily because of the conserved interface between adjacent monomers. The central cationic pore is also highly conserved in terms of sequence and overall structure. However, one of the least conserved features of the SmAP1 heptamers is the calculated electrostatic potential of the surfaces: The L4 face of the Afu surface is very basic, whereas the Pae and Mth L4 faces are intensely acidic. Such differences are likely to be important for modulating putative SmAP-RNA interactions near the pore. The most obvious structural difference between SmAP1 heptamers is the width of the cationic pore. Variation of pore width in Pae (~89 Å diameter) versus Afu and Mth (~1215 Å) is due to main chain and side chain rotamer variations in the L2 and L4 loops. Besides the N- and C-termini, these pore-forming loops are the most structurally variant regions in SmAP1 monomers (Fig. 1
). The most significant differences between SmAP1 heptamer structures and snRNP Sm cores will likely arise from the two largest differences between the Pae, Mth, and Afu SmAP1 sequences and some eukaryotic and archaeal Sm protein sequences: (1) several eukaryotic Sm proteins (e.g., human SmB/B') and some SmAPs have extended C-terminal regions with up to 70 more residues than the ~80-residue core Sm domain and (2) some eukaryotic Sm proteins may have up to 30 more amino acids in the L4 loop. Preliminary results with a Pae SmAP3 homolog that contains 60 additional C-terminal residues show that it too forms heptamers, and that the Sm core heptamer is conserved (C. Mura and D. Eisenberg, unpubl.).
The oligomerization properties of SmAPs
Like the Lsm (but not Sm) proteins, Pae, Mth, and Afu SmAP1 form heptamers in the absence of RNA. In addition to the expected heptamers, SmAP1 exhibits complex self-association properties indicative of 14-mers and subheptameric oligomers. Various oligomeric states were characterized in vitro (primarily by ultracentrifugation, Fig. 2
), revealing roughly spherical disulfide-bonded Pae SmAP1 14-mers and a monodisperse population of Mth SmAP1 heptamers. Additionally, we created a cysteine-free point mutant of Pae SmAP (C8S) and found that it forms subheptameric states (most likely pentamers). Similar plasticity of oligomerization behavior has been reported for human Sm proteins. Lührmann et al. found that a human Sm EFG complex forms a stable oligomermost likely an (EFG)2 hexamerwhose ring-shaped structure resembles intact Sm heteroheptamers by EM (Raker et al. 1996; Plessel et al. 1997). One of these studies also found that stable, subheptameric complexes of human Sm proteins (e.g., a D1D2EFG pentamer) may be intermediates in the Sm-RNA assembly pathway (Raker et al. 1996). In the human Sm D3B structure, the heterodimers pack as (D3B)3 hexamers in the asymmetric unit of the crystal (Kambach et al. 1999b), and Afu SmAP2 has been shown to form hexamers (Toro et al. 2002). The recently discovered E. coli Sm-like protein Hfq is thought to form hexamers as well (Arluison et al. 2002).
We found that Pae and Mth SmAP1 oligomerize into 14-mers, either in vitro (Pae) or in various crystal forms (Pae and Mth). The highly acidic L4 faces are exposed in the barrel-shaped 14-mers, as expected from electrostatic considerations. The heptamerheptamer interface buries much surface area in both Pae (7550 Å2) and Mth (3005 Å2), suggesting the significance of these oligomers. The crystal structure of another SmAP homolog (Pae SmAP3) shows that it also forms 14-mers in the asymmetric unit (C. Mura and D. Eisenberg, unpubl.). The propensity of cyclic SmAPs to crystallize as head-to-head oligomers with dihedral symmetry is shared by another single-stranded RNA binding protein that has an OB-like fold: The trp RNA-binding attenuation protein (TRAP) forms toroidal 11-mers that stack as both head-to-head and head-to-tail 22-mers in the crystal (Antson et al. 1999).
An unexpected property of SmAP1s is their polymerization into well-ordered fibers under physiological conditions. Three lines of evidence suggest that these polar fibers form by the head-to-tail stacking of heptamers (Fig. 3
): differential fiber formation by C8S and wt Pae SmAP1; comparison of measured fiber dimensions with SmAP1 heptamer dimensions; and electrostatic considerations for the packing of highly charged heptameric disks. The packing of Mth SmAP1 heptamers in the P212121 lattice supports our head-to-tail polymerization model, and provides an atomic-resolution model for the fibers (Fig. 4
). The critical role of Cys8 in preventing fiber formation by stabilizing the Pae 14-mer seems significant, given the potentially oxidative cytosol of Pae and other thermophilic archaea (Mallick et al. 2002). Such complex oligomerization properties have not been reported for eukaryotic Sm proteins, and the biological significance of SmAP1 14-mers and homogeneous, fibrillar SmAP polymers is not yet known.
The ligand-binding properties of SmAPs
Comparison of the structures of Mth SmAP1 bound to UMP and Afu SmAP1 bound to oligouridine (U3) reveals a highly conserved mode of RNA recognition in SmAPs. UMP binds near the sevenfold axis, suggesting the pore as a putative RNA binding site. Diagrams of SmAP1...UMP interactions show that both SmAP1s specifically bind the uracil base by a combination of
-stacking and hydrogen-bond interactions with strictly conserved SmAP residues (Fig. 6
). Differences between UMP binding in Mth and Afu are limited to interactions with the ribophosphate moiety, and may not be significant, because Mth SmAP1 was cocrystallized with free UMP nucleotide, whereas Afu SmAP1 was crystallized with a U3 oligouridine. The oligo(U) specificity of RNA binding to Afu SmAP1 is the same as the substrate specificity of eukaryotic Sm proteins (Achsel et al. 2001; Toro et al. 2001). The binding geometry of UMPs in Mth SmAP1 allows them to be strung together into a hypothetical oligouridine that may mimic biologically relevant RNA binding in the Sm core of snRNPs. If all SmAPs specifically bind to an oligouridine site in vivo, then geometric considerations require such an RNA-binding site to lie near the sevenfold symmetry axis (i.e., the pore); however, the uridine-binding site in Pae SmAP1 is distal to the pore and not easily interpretable in electron density maps, suggesting low-affinity binding at this alternative site (Fig. 5
). We note that the same UMP-binding site proximal to the pore in Afu and Mth exists in Pae SmAP1, and that UMP can be docked into this site with only minimal changes to side chain rotamers. Failure of other NMPs to cocrystallize with Mth or Pae SmAP1 supports the specificity of uridine binding that we infer from the Mth and Afu crystal structures. Additional sites occupied by MPD, ethylene glycol, or glycerol are clearly defined by electron density in Mth and Pae SmAP1, and many of the residues in these sites are phylogenetically conserved; however, any biological significance of these additional ligand-binding sites is unknown.
Based on the gel-shift activity of Mth and Pae SmAP1 on supercoiled DNA (Fig. 7
) and the striking similarity of SmAP monomers to the OB fold, we propose that SmAPs may have a generic single-stranded nucleic acid-binding activity (e.g., as a nucleic acid chaperone). We found that SmAP1s nonspecifically gel-shift a variety of negatively supercoiled DNA substrates and that ssDNA oligonucleotides of >20 nt inhibit the gel shift (Fig. 7B
). Because eukaryotic Sm proteins bind to ssRNA, and because SmAP homoheptamers probably do not function identically to eukaryotic Sm heteroheptamers, it is possible that this gel-shift inhibition results from direct binding of the oligonucleotides to SmAP1. The striking resemblance of the SmAP and OB folds corroborates this idea, given that several OB-fold proteins bind to ssDNA nonspecifically. The following recently determined structures are highly similar or identical to the OB-like fold of Sm proteins: the single-stranded DNA-binding domain of replication factor A (Bochkareva et al. 2001); the S1 RNA-binding domain (Bycroft et al. 1997); the single-stranded telomeric DNA binding protein (Mitton-Fry et al. 2002); and the Streptococcus pneumoniae SP14.3 protein (which is fused to a domain that is homologous to ribosomal protein S3; Yu et al. 2001).
Emerging differences between SmAPs and eukaryotic Sm proteins
Eukaryotic Sm and Lsm proteins and their archaeal homologs, which we term Sm-like archaeal proteins, share a number of structural and functional features. Perhaps the most significant similarity is in their 3D and quaternary structures: The monomers are nearly identical, and the SmAP homoheptamer parallels the Sm heteroheptamer that forms snRNP cores. Also, both sets of proteins apparently bind oligouridine-containing RNA. However, several differences are emerging between SmAPs and the snRNP-based roles of canonical, eukaryotic Sm proteins. The results presented here show that SmAPs associate into many oligomeric states besides the standard heptamer (e.g., 14-mers and subheptamers), and can polymerize into homogeneous fibers. No structural information is available for Sm proteins bound to RNA (or any other ligand), and thus it is difficult to evaluate the similarity of uridine binding by eukaryotic Sm proteins and SmAPs. Cross-linking experiments with human Sm heptamers corroborate RNA binding near the pore (Urlaub et al. 2001). The near identity of the Sm and SmAP dimer structures, as well as the strictly conserved mode of uridine recognition between Afu and Mth SmAP1, suggest that the SmAP1 UMP-binding site is an accurate model for RNA binding in the snRNP core. In this model, snRNA wraps around the circumference of the pore, but does not thread through it. Further elucidation of the similarities and differences between archaeal SmAP complexes and the Sm cores of eukaryotic snRNPs will provide insight into the structures and evolution of snRNPs.
| Materials and methods |
|---|
|
|
|---|
H) SmAP1 ORF were kindly provided by the laboratories of Jeffrey H. Miller (UCLA) and John Reeve (Ohio State Univ.), respectively. Primers were designed based on these sequences, and PCR products were cloned into a pET-based expression vector. DNA sequencing of plasmids verified that expressed constructs would contain a C-terminal His6x-tag after a 10-residue serine protease-sensitive linker, that is, wild-type (wt) SmAP1 + GR*GKLAAALEHHHHHH (* indicates intended protease site). Recombinant proteins were overexpressed in BL21(DE3) E. coli, with at least 120 mg of soluble protein produced per liter of cell culture. The Cys8
Ser mutant of Pae SmAP1 was created in a similar manner, except that site-directed mutagenesis was achieved via overlap-extension PCR with an additional pair of primers that contained the mutant site. Harvested cells were thawed and resuspended in a high-salt-concentration buffer, and were lysed by French-press and lysozyme treatment. Cleared supernatants were heated to ~80°C, followed by high-speed centrifugation (37,000g). SmAP1-His6x proteins were further purified by affinity chromatography on Ni2+-charged iminodiacetic acid-sepharose, which afforded >99% purity (as determined by SDS-PAGE and MALDI-TOF mass spectrometry). Because the His6x tag prevents heptamer formation for some SmAPs (C. Mura, unpubl.), the next step was proteolytic removal of the C-terminal tag and its linker (wt Mth SmAP1 is 81 aa, with a Mr of 9029 Da; wt Pae SmAP1 is 80 aa, with a Mr of 8800 Da). Trypsin was used for limited proteolysis, as thrombin was ineffective: Ni2+-column fractions were pooled and dialyzed at room temperature into a phosphate-buffered saline buffer supplemented with 15 mM EDTA (to prevent His-tag mediated aggregation). EDTA was gradually eliminated over 23 buffer exchanges, and porcine trypsin was added at ~1 mg trypsin per 100 mg SmAP1. Complete removal of the tag occurred after ~4 h at 37°C, as assayed by MALDI-TOF spectra of time points. Transfer to 4°C and addition of a protease inhibitor (50 mM PMSF) terminated the reaction. Isoelectric points of ~5.2 and 5.8 were calculated for Mth and Pae SmAP1, respectively; therefore, anion exchange chromatography on a quaternary ammonium matrix (UNO-Q6, BioRad) was used to separate cut (i.e., wt) SmAP1 from trypsin, uncut protein, and any other contaminants. Pae SmAP1 was in 20 mM Tris, pH 8.55 and Mth SmAP1 was in 20 mM Tris pH 8.55, 30 mM EDTA pH 8.0 (EDTA was required for solubility, and did not interfere with chromatography). Both SmAP1s eluted at ~80 mM NaCl in the salt concentration gradient. Protein purity was assayed by SDS-PAGE and MALDI-TOF, and pure fractions were pooled and dialyzed into a buffer for crystallization.
Crystallization and data collection
For crystallization, Pae SmAP1 was in buffer "XB" (10 mM Tris pH 7.8, 5 mM EDTA pH 8.0), and Mth (which requires higher ionic strengths for solubility) was in "XB6ß" (10 mM Tris pH 7.8, 5 mM EDTA pH 8.0, and 0.1 M NaCl). Protein concentrations in these buffers were increased by using Centripreps to reduce sample volumes. After initial sparse matrix screening of conditions, final, optimized Pae SmAP1 crystals of the C2221 form were grown by hanging-drop vapor diffusion. An 11-µL drop [4 µL well buffer + 5 µL wt 29.6 mg/mL Pae SmAP1 + 1 µL 0.1 M dithiothreitol (DTT) + 1 µL 0.1 M uridine-5'-monophosphate (UMP)] was equilibrated against an 800-µL well [0.1 M sodium acetate pH 8.20, 0.1 M ammonium acetate, 8.6% w/v PEG-4000, and 23.8% v/v glycerol] at room temperature (~19.8°C). Orthorhombic crystals reached maximum dimensions of 0.1 x 0.1 x 0.3 mm within 5 d. Hanging drops contained a mixture of the new C2221 crystals and the previously reported C2 form (Mura et al. 2001).
Three forms of Mth SmAP1 crystals were obtained by hanging-drop vapor diffusion at room temperature. For the P1 form, Mth SmAP1 was at 56 mg/mL in buffer XB6ß. The drop was 4 µL of protein + 4 µL of well buffer. The well was 600 µL of [0.1 M sodium citrate pH 5.60, 15% w/v PEG-4000, 0.2 M ammonium acetate]. Crystals grew to maximum dimensions of ~0.1 x 0.1 x 0.25 mm within 7 d. For the P212121 form, Mth SmAP1 was at 42 mg/mL in buffer XB6ß. The drop was 3 µL of protein + 3 µL of well buffer. The well was 600 µL of [0.1 M Tris pH 8.50, 10% v/v isopropanol]. Crystals grew to maximum dimensions of ~0.3 x 0.3 x 0.6 mm within 3 d. For the P21 form, Mth SmAP1 was at 30.3 mg/mL in a modified form of buffer XB6ß that contained a 26-nt single-stranded DNA [10 mM Tris pH ~7.7, 3 mM EDTA pH 8.0, 55 mM NaCl, 0.6 mM ssDNA]. Drops were 2.5 µL of protein/ssDNA + 2.5 µL of well buffer + 1 µL of 0.1 M UMP. The 600-µL well contained 55 µL of 1.0 M sodium citrate (pH 5.6), 5 µL of 1.0 M sodium citrate (pH 8.0), 60 µL of 2.0 M ammonium acetate, 180 µL of neat MPD and 300 µL of sterile dH2O (interestingly, 2.5 M 1,6-hexanediol could be substituted for neat MPD). Crystals grew to maximum dimensions of ~0.15 x 0.15 x 0.25 mm within 7 d.
The C2221 Pae SmAP1 and P21 Mth SmAP1 crystals did not require additional cryosolvent, due to the 23.8% v/v glycerol or 30% v/v MPD in those drops, respectively. The other two Mth SmAP1 crystal forms were cryoprotected as follows: (1) for the P1 form, ethylene glycol was added directly to the drop to a final concentration of ~20% v/v, and crystals were allowed to soak for 20 sec prior to mounting in a cryoloop; (2) for the fragile P212121 crystals, the cryoprotectant was ethylene glycol (mixed with well buffer), and had to be introduced gradually over several hours (in ~5% v/v increments). The P212121 crystals were soaked for only ~23 sec at the final ethylene glycol concentration (30% v/v). Diffraction data were collected either at the synchrotron (P1 and P21 form Mth crystals) or in-house (P212121 Mth and C2221 Pae crystals) on an ADSC Quantum-4 charge-coupled device (CCD) detector. All crystals were mounted in a cryogenic nitrogen stream at -168°C for data collection. After autoindexing, images were indexed/integrated/reduced in DENZO, and reflections were scaled and merged in SCALEPACK (Otwinowski and Minor 1997). Complete data sets were collected from single crystals (Table 1
). Unit cell dimensions for the Pae C2221 form are a = 91.83, b = 113.76, c = 126.59 Å; for the Mth crystals they are: a = 45.07, b = 54.08, c = 62.35 Å,
= 87.58°, ß = 72.86°,
= 81.45° (P1); a = 65.25, b = 109.96, c = 83.76 Å, ß = 95.81° (P21); a = 40.37, b = 114.70, c = 238.60 Å (P212121). The large unit cell edge of the Mth P212121 crystals led to spot overlap for high-resolution reflections (d < 3 Å), so multiple data sets were collected at two 2
values (0°, -12°) for two crystal alignments (related by a 45° azimuthal rotation).
Structure determination, refinement, and validation
Initial phases for the C2221 Pae SmAP1 structure were determined by the stochastic evolutionarily programmed molecular replacement method (EPMR; Kissinger et al. 2001). The most reasonable Matthews coefficient (VM = 2.58 Å3/Da) corresponds to a heptamer in the asymmetric unit (a.u.); therefore, the search model was the identical Pae SmAP1 heptamer from the C2 crystal form (Mura et al. 2001). The EPMR solution was used for manual model building in the program O (Jones et al. 1991), and model refinement in CNS (Brunger et al. 1998). Refinement in CNS proceeded by standard protocols, using the maximum-likelihood target function for amplitudes (mLf), bulk solvent correction, and anisotropic B-factor correction terms. Sevenfold noncrystallographic symmetry (NCS) was determined by calculation of a locked self-rotation function, but NCS restraints were not imposed during refinement. Solvent molecules were added as necessary (water, glycerol, acetate). Refinement of individual atomic positions, isotropic temperature factors, and simulated annealing torsion angle dynamics was performed in most rounds. Each refinement round ended with inspection of the agreement between the model and
A-weighted 2|Fo| |Fc|, |Fo| |Fc|, and, when needed, simulated annealing omit maps.
Determination of the Mth P1 structure proceeded in two steps. First, a homology model of the Mth SmAP1 heptamer was built from the Pae SmAP1 structure using an in-house script (C. Mura and D. Eisenberg, unpubl.), and was used as a search model for molecular replacement (VM = 2.29 Å3/Da for one heptamer in the P1 cell). Then, the EPMR solution was converted to a polyalanine model and subjected to free-atom model refinement with the ARP/wARP program (Perrakis et al. 1999) in the "molrep" mode. Mth SmAP1 side chains were built in the final wARP stage. The Mth P1 structure was refined with CNS, as described above for the Pae structure. The P21 and P212121 Mth structures were solved by molecular replacement with the refined P1 Mth model. Self-rotation functions and |Fo|2 Patterson maps were calculated to deduce the NCS between heptamers in the P21 and P212121 forms (each of which contains 14 monomers per a.u.). Solvent was added as necessary for all structures (see Table 1
), and no NCS restraints were enforced at any point in the refinements. Partial atomic occupancies (q) were restricted to a reasonable range (0.2 < q < 1.5) during latter refinement rounds, in which only the occupancies for atoms of UMP (not for any other ligand or protein atoms) were refined.
Refinement statistics for the Pae and three Mth structures are shown in Table 1
. Each of the four protein models is complete, except for ~611 missing N-term residues in various models (see PDB files). The stereochemistry and geometry of each SmAP1 monomer was validated with PROCHECK (Laskowski et al. 1993) and ERRAT (Colovos and Yeates 1993), and was found to be acceptable (e.g., no residues in the disallowed region of
,
space for the Pae C2221 model). Final model coordinates and diffraction intensity data were submitted to the PDB with ID codes 1JBM, 1LOJ, 1JRI, and 1LNX (see Table 1
).
Analytical ultracentrifugation
The wt Pae protein in 75 mM NaCl, 10 mM Tris, pH 7.8, was examined by sedimentation velocity in a Beckman Optima XL-A analytical ultracentrifuge at 52,000 rpm and 20°C using absorption optics at 273 nm and a 12-mm pathlength double sector cell. The sedimentation coefficient distribution was determined from a g(s) plot using the Beckman Origin-based software (Version 3.01). The peak sedimentation coefficient was corrected for density and viscosity to an S20,wat value by using a value for the partial specific volume at 20°C of 0.743 [calculated from the amino acid composition (Edsall 1943) and corrected to 20°C (Laue et al. 1992)].
Sedimentation equilibrium runs at 20°C were performed on all three proteinswt Mth, wt Pae, and the Pae C8S mutantin 150 mM NaCl, 10 mM Tris, pH 7.8, again using a Beckman Optima XL-A analytical ultracentrifuge. Each protein was examined at three different concentrations and four speeds, using 12-mm pathlength six-sector cells. Protein concentrations used were 3.4, 0.69, and 0.19 mg/mL for wt Pae; 5.9, 1.26, and 0.32 mg/mL for the C8S mutant of Pae; and 4.1, 0.85, and 0.22 mg/mL for wt Mth. Rotor speeds were 8,000, 10,000, 12,500, and 14,500 rpm. Protein concentration was monitored by absorption at 280 nm and, for the lowest protein concentrations, at 232 nm. A partial specific volume of 0.743, calculated as described above, was used for all three proteins. Individual scans were analyzed using the Beckman Origin-based software (Version 3.01) to perform a nonlinear least-squares exponential fit for a single ideal species, thus giving the weight-averaged molecular weight for each protein.
Transmission electron microscopy
The following protein samples were prepared for electron microscopy: (1) 0.5 mg/mL wild-type Mth SmAP1 in 10 mM Tris pH 7.5, 60 mM NaCl; (2) 1.2 mg/mL wild-type Pae SmAP1 in 25 mM Tris pH 7.5, 30 mM NaCl; (3) 1.1 mg/mL C8S mutant Pae SmAP1 in the same buffer as the wt protein; and (4) 1.2 mg/mL wt Pae SmAP1 in reductant buffer (25 mM Tris pH 7.5, 30 mM NaCl, 10 mM DTT). Carbon-coated parlodion support films mounted on copper grids were made hydrophilic immediately before use by high-voltage, alternating-current glow discharge. Protein samples were applied directly onto the grids and allowed to adhere for 2 min. Grids were rinsed with distilled water and negatively stained with 1% w/v uranyl acetate. Specimens were examined in a Hitachi H-7000 electron microscope at an accelerating voltage of 75 kV.
Gel-shift assays
Negatively supercoiled plasmid DNA was prepared by transforming the plasmid into E. coli BL21(DE3) cells and mini-prepping (Qiagen) it from spun-down cells that had reached stationary phase. Several different plasmids were tested, including ones derived from pUC18, pACYC, pET-22b(+) (Novagen), and pCR-Blunt (Invitrogen). Titration of plasmids with ethidium bromide was used to verify the negative superhelicity of the DNA via electrophoretic mobility changes in agarose gels. Single-stranded DNAs of various lengths and sequences were synthesized by Integrated DNA Technologies (e.g., the 26-mer in Fig. 7B
with the following sequence: 5'CGGATCCTCAGTAAAAAGTGCGGA AA3'). Stock solutions of protein were wt Pae at 5.6 mg/mL in buffer XB (see above) or wt Mth at 5.6 mg/mL in buffer XB6ß (see above). Except as noted, buffer, DNA, and protein samples were mixed to produce 25- or 50-µL reactions that were incubated at room temperature (generally for 3060 min). Gel-shift of the DNA was assayed by electrophoresis at a constant voltage (120V) in 1.3% or 1.5% w/v TAE/agarose gels. Examples of typical reactions and concentration ranges are given in Figure 7
. Reactions in which SmAP1 was replaced by single-stranded DNA-binding protein (Stratagene) or by arbitrary Pae proteins unrelated to SmAP1 (e.g., an acid phosphatase) served as positive and negative controls, respectively.
| Note added in proof |
|---|
|
|
|---|
| Acknowledgments |
|---|
The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.
| References |
|---|
|
|
|---|
Achsel, T., Stark, H., and Luhrmann, R. 2001. The Sm domain is an ancient RNA-binding motif with oligo(U) specificity. Proc. Natl. Acad. Sci. 98: 36853689.
Antson, A.A., Dodson, E.J., Dodson, G., Greaves, R.B., Chen, X., and Gollnick, P. 1999. Structure of the trp RNA-binding attenuation protein, TRAP, bound to RNA. Nature 401: 235242.[CrossRef][Medline]
Arluison, V., Derreumaux, P., Allemand, F., Folichon, M., Hajnsdorf, E., and Regnier, P. 2002. Structural modeling of the Sm-like protein Hfq from Escherichia coli. J. Mol. Biol. 320: 705712.[CrossRef][Medline]
Bochkareva, E., Belegu, V., Korolev, S., and Bochkarev, A. 2001. Structure of the major single-stranded DNA-binding domain of replication protein A suggests a dynamic mechanism for DNA binding. EMBO J. 20: 612618.[CrossRef][Medline]
Brahms, H., Raymackers, J., Union, A., de Keyser, F., Meheus, L., and Luhrmann, R. 2000. The C-terminal RG dipeptide repeats of the spliceosomal Sm proteins D1 and D3 contain symmetrical dimethylarginines, which form a major B-cell epitope for anti-Sm autoantibodies. J. Biol. Chem. 275: 1712217129.
Brahms, H., Meheus, L., de Brabandere, V., Fischer, U., and Luhrmann, R. 2001. Symmetrical dimethylation of arginine residues in spliceosomal Sm protein B/B' and the Sm-like protein LSm4, and their interaction with the SMN protein. RNA 7: 15311542.[Abstract]
Brunger, A.T., Adams, P.D., Clore, G.M., DeLano, W.L., Gros, P., Grosse-Kunstleve, R.W., Jiang, J.S., Kuszewski, J., Nilges, M., Pannu, N.S., et al. 1998. Crystallography and NMR system: A new software suite for macromolecular structure determination. Acta Crystallogr. D Biol. Crystallogr. 54: 905921.[CrossRef][Med