|
|
||||||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
S-crystallin by molecular fragment replacement NMR
1 Laboratory of Chemical Physics, NIDDK, and 2 Section on Molecular Structure and Functional Genomics, NEI, National Institutes of Health, Bethesda, Maryland 20892, USA
3 Biochemistry Department, Ohio State University, Columbus, Ohio 43210, USA
Reprint requests to: Ad Bax, Building 5, Room 126, NIH, Bethesda, MD 20892-0520, USA; e-mail: bax{at}nih.gov; fax: (301) 402-0907.
(RECEIVED June 6, 2005; FINAL REVISION September 2, 2005; ACCEPTED September 4, 2005)
| Abstract |
|---|
|
|
|---|
S-crystallin (
S) has been determined by multidimensional triple resonance NMR spectroscopy, using restraints derived from two sets of dipolar couplings, recorded in different alignment media, and supplemented by a small number of NOE distance restraints.
S consists of two topologically similar domains, arranged with an approximate twofold symmetry, and each domain shows close structural homology to closely related (~50% sequence identity) domains found in other members of the
-crystallin family. Each domain consists of two four-strand "Greek key"
-sheets. Although the domains are tightly anchored to one another by the hydrophobic surfaces of the two inner Greek key motifs, the N-arm, the interdomain linker and several turn regions show unexpected flexibility and disorder in solution. This may contribute entropic stabilization to the protein in solution, but may also indicate nucleation sites for unfolding or other structural transitions. The method used for solving the
S structure relies on the recently introduced molecular fragment replacement method, which capitalizes on the large database of protein structures previously solved by X-ray crystallography and NMR. Keywords: alignment; deuteration; liquid crystal; Pf1; relaxation; RDC; residual dipolar coupling; structural proteins; NMR spectroscopy; heteronuclear NMR; new methods; database mining
Abbreviations: MFR, molecular fragment replacement
S, murine
S-crystallin Da, magnitude of the dipolar coupling tensor R, rhombicity of the dipolar coupling tensor
Article published online ahead of print. Article and publication date are at http://www.proteinscience.org/cgi/doi/10.1110/ps.051635205.
| Introduction |
|---|
|
|
|---|
-crystallins are members of the small heat-shock protein superfamily (Dejong et al. 1993). The
- and
-crystallins are evolutionarily related; the 
-crystallin superfamily also includes nonlens members in both prokaryotes and eukaryotes (Lubsen et al. 1988; Wistow 1995; Ray et al. 1997). The
-crystallins seem to be particularly adapted for the highest concentration, central regions of the lens. However, one member of the family,
S-crystallin, is also expressed at high levels in cortical regions of the lens and even in epithelial cells (Wang et al. 2004).
S is one of the most abundantly expressed proteins of the adult lens, and it is highly conserved in evolution (Sinha et al. 1998; Wistow et al. 2002, 2005). A destabilizing F9S mutation in mouse
S leads to the Opj cataract in which the protein unfolds and forms plaques, severely disrupting the cells of the lens cortex (Sinha et al. 2001).
Several structures of members of the
- and
-crystallin family previously have been solved by X-ray crystallography (Kumaraswamy et al. 1996; Norledge et al. 1997; Basak et al. 1998, 2003). They all display two structurally similar domains held together by hydrophobic interdomain interactions. Homology models based on these structures indicate that
S shares the two-domain pairing topology, but yet might differ significantly due to differences in amino acid sequence at the domain interface (Zarina et al. 1994). Despite extensive efforts, crystallization of full-length murine
S has remained elusive, which raises the question of intra- or interdomain flexibility. Interestingly, the C-terminal domains of both human and bovine
S have been crystallized, and their structures reveal remarkable homo-dimeric patterns, with two individual molecules arranged in a manner similar to the N- and C-terminal domains in other members of the
-crystallin family (Basak et al. 1998; Purkiss et al. 2002).
In order to gain insight in potential differences between
S and its
-crystallin family members, we have determined the solution structure of full-length murine
S by NMR spectroscopy. In order to evaluate recently developed technology, we did not follow the conventional NMR approach, which requires extensive analysis of NOE-based experiments for obtaining a sufficiently large number of distance restraints (Wüthrich 1986; Clore and Gronenborn 1989; Wagner 1993). Instead, we utilized the recently introduced molecular fragment replacement (MFR) method (Delaglio et al. 2000; Kontaxis et al. 2005), which is largely based on measurement of residual dipolar couplings (RDCs), measured under weakly aligning conditions (Tjandra and Bax 1997; Clore et al. 1998; Hansen et al. 1998; Prestegard et al. 2000). The present study represents the first application of this technology to a protein not previously solved by other methods. Also, the extensive presence of
-strands, turns, and loops presents a challenging test to the MFR technology, which previously has been shown to be best suited for proteins rich in
-helix. In our study, RDCs are measured in two different alignment media and supplemented by a modest number of backbone HNHN and methylmethyl NOEs. RDCs are directly related to the orientation of the corresponding internuclear vectors relative to the molecular alignment tensor. Together with HNHN NOEs, they contain sufficient information to determine the backbone structure of the N- and C-terminal domains of
S at relatively high precision and accuracy, as well as their relative orientation. However, interdomain methylmethyl NOEs are found to be essential for determining the relative position of the two domains.
Weak alignment of the protein is a prerequisite for measurement of RDCs and can be obtained by dissolving the protein in an anisotropic aqueous medium, with the anisotropy caused by a small volume fraction of particles that orient in a liquid crystalline manner relative to the magnetic field. Phospholipid bicelles (Sanders and Schwonek 1992; Tjandra and Bax 1997), filamentous phage (Clore et al. 1998; Hansen et al. 1998), or polyethyleneglycol analogs (Ruckert and Otting 2000) are commonly used for this purpose. More recently, strained hydrogels (Sass et al. 2000; Tycko et al. 2000; Chou et al. 2001; Ishii et al. 2001; Meier et al. 2002; Ulmer et al. 2003) have been added to this arsenal, and have proven to be particularly robust for aligning macromolecules that are incompatible with any of the common liquid crystalline alignment media (Chou et al. 2002; Cierpicki and Bushweller 2004). For our study of
S, both a stretched hydrogel and an unstrained hydrogel containing 3 mg/mL Pf1 were used to yield two separate alignment tensors. Measurement of RDCs under two different alignment conditions often greatly reduces the orientational degeneracy associated with dipolar couplings measured in a single medium (Ramirez and Bax 1998; Al-Hashimi et al. 2000).
To date, dipolar couplings have been mainly used for refining structures obtained with the conventional, NOE-based approach (Drohat et al. 1999; Kuszewski et al. 2001; Tugarinov and Kay 2003) or modeled on the basis of X-ray crystallographic data of homologous proteins (Chou et al. 2000b). Direct calculation of a structure from dipolar couplings, although feasible in favorable cases (Brenneman and Cross 1990; Andrec et al. 2001; Hus et al. 2001), has proven difficult, particularly when data are incomplete. Structural information contained in the chemical shifts often provides approximate information about the backbone torsion angles of a given residue (Cornilescu et al. 1999; Wishart and Case 2001) and can complement the RDC information. Unfortunately, for proteins rich in the
-sheet, where the structural degeneracy when interpreting dipolar couplings is often most severe, the chemical shift information is often less discriminating than for helical proteins.
MFR is a conceptually rather different approach to structure determination compared to the conventional, NOE-based method. It mimics the substructure approach of Thirup and Jones (Jones and Thirup 1986), widely used in X-ray crystallography, but utilizes a much larger database of model structures. MFR relies on a search of a large fraction of the Protein Data Bank (RCSB) (Berman et al. 2000) for protein backbone fragments that are structurally compatible with RDCs measured for any given fragment in the protein of interest, and at the same time have predicted chemical shifts that roughly agree with those observed experimentally (Kontaxis et al. 2005). Although for small systems and with relatively complete sets of dipolar couplings, such data suffice to assemble complete proteins, occasional errors can result in translation of different elements of the structure relative to one another (Kontaxis et al. 2005). For this reason, we here use a hybrid approach, which relies primarily on MFR for building of the protein backbone, but also uses a modest set of NOEs to prevent such translational errors. No HNHN NOE interactions are observed between the N- and C-terminal domains, and the increased dynamics in the interdomain linker precludes accurate definition of this region of the structure. Therefore, one-bond RDCs cannot accurately position the residues following this flexible section relative to the N-terminal domain. To solve this problem, we resort to the measurement of CH3CH3 NOEs, and only very few such interactions suffice to determine the relative position of the two domains, when the orientation of each domain is already uniquely defined by RDCs.
| Results |
|---|
|
|
|---|
Selection of alignment media for
S
Imposing the required degree of weak alignment (~103) on the protein for facile NMR measurement of dipolar couplings proved challenging for
S. As a result of electrostatic interaction between
S and Pf1, initial attempts to align
S in regular Pf1 medium (Hansen et al. 1998) resulted in protein alignment that was too strong for permitting the measurement of RDCs in the usual fashion under conditions where Pf1 orients in a liquid crystalline manner. At the ionic strength (120 mM) required for monomeric behavior of
S, Pf1 loses liquid crystalline behavior below ~10 mg/mL, resulting in much weaker, field-dependent alignment of Pf1 and thereby of the protein (Zweckstetter and Bax 2001). In contrast, if a liquid crystalline, low ionic strength, dilute Pf1 sample is first "frozen" in its oriented state while inside a strong magnetic field, by initiation of acrylamide gel polymerization,
S at the desired ionic strength can subsequently be diffused into such an oriented sample. Alignment of
S in such a gelled Pf1 sample was found to be independent of magnetic field strength and remained constant for the entire duration over which the various spectra were recorded.
Four sets of one-bond dipolar couplings1DHN, 1DNC', 1DC
C
, and 1DC
C' were measured for perdeuterated
S in each of two alignment media: 3 mg/ mL gelled Pf1, prepared in the manner above (see also Materials and Methods), and 6% (w/v), stretched poly-acrylamide gel.
The above used alignment conditions were close to optimal for perdeuterated
S, but alignment was found to be too strong for measurement of 1DC
H
couplings in a protonated form of the protein. Strong alignment results in large homonuclear 1H-1H couplings, especially for sequential H
-HN interactions in
-strands, which as a result show severe broadening, increased overlap, and weaker 1H resonances. Furthermore, under the strong alignment conditions many 1DC
H
couplings exceed 50 Hz, and thereby adversely affect the uniformity in efficiency of the magnetization transfer steps. Therefore, a second set of samples with weaker alignment was generated for measurement of 1DC
H
couplings (see Experimental section). Although these couplings were of lower accuracy and found to be superfluous in the structure determination process, they provide a useful set of data to validate the correctness of the derived structures.
Experimental restraints used for
S structure determination
The solution structure of
S was determined using a total of 1709 experimental restraints, including 1376 RDCs (Table 1
). Out of 170 nonproline residues, 158 yielded detectable 1H-15N HSQC correlations; conformational exchange on an intermediate timescale resulted in the disappearance of Y59 in the N-terminal domain, and K130, V131, T135, W136, and K152Y156 in the C-terminal domain. The two N-terminal residues were unobservable due to rapid amide hydrogen exchange with solvent. 15N relaxation rates indicated an effective rotational correlation time of ~14 nsec at a concentration of 1 mM, resulting in relatively short transverse relaxation times and broad lines. Spectral quality was enhanced by perdeuteration of the protein, thereby also favoring the use of TROSY-based triple resonance experiments for backbone assignments (Salzmann et al. 1999). Backbone HNHN NOEs in such a perdeuterated protein yield much improved sensitivity compared to such interactions in fully protonated proteins (Torchia et al. 1988). A total of 181 such interactions was obtained from a spectrum recorded with a relatively long mixing time of 120 msec. No HNHN NOEs between the N- and C-terminal domains were detected however, and with uncertainty remaining regarding the precise translation of the N- versus the C-terminal domain, we found it necessary to also record a 13CH313CH3 3D NOE spectrum (Zwahlen et al. 1998a). The side-chain
1 angles for most of the hydrophobic core residues and a smaller fraction of the surface-exposed residues were defined uniquely by 3JNC
and 3JC'C
couplings.
|
S and
B-crystallin, discussed above, is just one example of a pair of proteins whose backbone folds are proven to be similar on the basis of dipolar couplings. The structure of interest (
S) can then be refined by using a structure based on the homologous protein (
B-crystallin) as a starting structure, thereby alleviating the above-mentioned multiple minima problem (Chou et al. 2000b). As a note of caution, we mention that care must be taken when using the homology model rather than data derived from the original structure itself during the refinement process. For example, minor backbone rearrangements in the homology model, relative to the original experimental structure in the database, frequently result in a much poorer fit between experimental RDCs and the homology model than does fitting of the RDCs to corresponding residues in the experimental database structure. This problem can be mitigated by imposing torsion angle restraints on the homology model that, at least in early iterations, harmonically restrain its backbone torsion angles to the values found in the database structure while, for example, restraining the backbone C
atoms to remain harmonically tied to their positions in the original homology model. Although such an approach is relatively straightforward for cases where an adequate homology model can be found, the method would not permit determination of a new protein fold. The latter is possible when using the same strategy described above for the homologous proteins, but applying it to small (typically 710 residues) fragments. Below, we illustrate this so-called MFR approach for the case of
S.
MFR selection of backbone fragments
In the MFR approach, the 177-residue sequence of
S is considered in terms of 171 overlapping seven-residue segments. For each seven-residue segment, a database containing about 800 nonhomologous, high-resolution crystal structures is searched for seven-residue fragments that are compatible with the backbone RDCs measured in the target segment. Full details of adjustable parameters and options for optimization of the MFR search have been provided previously (Kontaxis et al. 2005), and only a brief discussion of this procedure is presented here.
Although the entire database contains over 200,000 of these seven-residue fragments, the evaluation of whether a fragment is compatible with the experimental dipolar couplings is a linear problem that can be solved exceedingly rapidly by singular value decomposition or SVD (Losonczi et al. 1999; Sass et al. 1999), permitting the full database to be searched in a matter of minutes. Typically, the 10 fragments that best fit to the experimental RDCs are retained at the end of this search. In this first round of fragment searching, no prior knowledge about the alignment tensors, applicable for the two media, is used. However, prior to a second search, this alignment information is available from the results of the first search: The magnitudes and relative orientation of the corresponding alignment tensors are extracted from these search results, using a weighting factor that scales exponentially with the inverse of the backbone RMSD of the 10 best-fitting fragments. The magnitude, rhombicity, and relative orientation of the thus-obtained alignment tensors, together with their automatically derived uncertainties, are presented in Table 2
. When separating the results obtained for fragments taken from the N- and C-terminal domains, the difference in alignment parameters is not statistically significant, already strongly suggesting that the two domains are rigidly anchored to one another.
|
|
and C' coordinate restraints are used during this process in order to ensure that the refined fragment remains close to the starting fragment. Nevertheless, this refinement procedure often is very revealing since minor adjustment in backbone torsion angles can dramatically improve the fit in cases where the starting fragment is close to the true structure. In contrast, the improvement tends to be much smaller for incorrect fragments that, by chance, yield a better-than-random fit to the experimental couplings in the MFR search. Figure 1C
/
angles obtained after the refinement.
Calculation of the protein structure
Previously, for relatively small model systems without extensive stretches in the protein for which structural information is missing, assembly of the protein backbone proceeded sequentially: Successive fragments were assembled in a stepwise manner by superimposing overlapping segments of the above selected fragments (Kontaxis et al. 2005). Accumulation of errors and the problem in bridging regions of ill-defined structure pose considerable challenges, however. Instead, here we use a minor variation to the regular NOE-based NMR protocol, starting from a single extended chain, and apply a three-stage simulated annealing scheme. In the first stage, with van der Waals radii scaled to zero, the NOE data are used together with tight backbone torsion restraints (300 kcal/rad2), extracted from the collection of refined MFR fragments. Note that the MFR results provide very tight and comprehensive torsion restraints; in this case 90% of the torsions (318 torsion angles) exhibit RMS uncertainties of < ~7 °. No unique angles could be defined for the N-terminal five residues, flexible residues in the interdomain linker region, and several of the above-mentioned sequential residues for which no amide signals were observable as a result of conformational exchange.
In the second stage, the dipolar couplings are reintroduced, and side-chain
1 and
2 restraints, obtained from 3JNC
, 3JC'C
, and 3JC
C
measurements, are added. In a subsequent, relatively long (100,000 time steps) third stage of simulated annealing, a hydrogen-bond potential of mean force (HB-PMF) is added as an empirical energy term (Grishaev and Bax 2004). The entire protocol is summarized in the flow chart of Figure 2
. The HB-PMF automatically recognizes hydrogen bonds, provided that the starting structure is sufficiently close to the true structure. Indeed, for the N-terminal domain, all but one (R30OL33HN) of the backbonebackbone hydrogen bonds seen in homologous crystal structures have also formed at the end of the final simulated annealing run. One additional H-bond (N37OY69HN) is present in 18 out of the 20 NMR structures but involves a water-mediated H-bond in the X-ray structure of
B-crystallin. In the absence of explicit waters in the NMR structure calculation, such a water-mediated H-bond cannot be detected with our approach, and the weak H-bond seen in most of the NMR structures is likely to also be indirect. For the C-terminal domain, the absence of amide resonances for two stretches of residues resulted in considerably lower resolution for the starting structure, and ~20% of the H-bonds seen in homologous crystal structures are not automatically recognized by the HB-PMF module. Interestingly, if H-bond restraints modeled on the basis of the
B X-ray structures are included as distance restraints during the final stage of simulated annealing, all other restraints are satisfied equally well as without these modeled restraints (Supplemental Material). This suggests that the H-bond network seen in the C-terminal domain of crystallin family members also prevails in solution, and that the missing H-bonds in the solution NMR structure result from the lack of experimental data, caused by conformational exchange broadening.
|
S are topologically similar, and are connected by a six-residue linker. 15N relaxation data indicate that this linker, consisting of residues S88A93, is subject to relatively large amplitude rapid internal dynamics, with the generalized order parameter (Lipari and Szabo 1982) S2 < 0.3 near its midpoint (Fig. 3
-sheets (Fig. 4
- and
-crystallins. Starting from G5, strand
1 (G5Y10) is flanked by two anti-parallel strands
2 (R18C22) and
4 (S37G43). Strand
3 (D25D28) arches away from the first sheet and crosses over to become the edge strand of the second sheet, where it pairs with
8 (G80V84). Strand
3 is quite short with a bulge in the middle and only involves three hydrogen bonds (C26-HNC82-O, C82-HNC26-O, G80-OF29-HN), and is therefore not recognized as a
-strand by the program Molmol (Koradi et al. 1996). Strand
4 pairs up with edge strand
7 (G64P67), whereas the second sheet is completed by
5 (T45E50) and
6 (H57P62) (Fig. 4
2 and
3 contains a conserved "DCDCDC" sequence, which is capable of forming intermolecular disulfide bonds (Skouri-Panet et al. 2001), and which appears to be specifically protected by S-methylation (Lapko et al. 2002). In the present study, a reducing environment is maintained by the presence of dithiotreitol, preventing
S oligomerization.
|
|
5/
6 pair exhibits a pronounced kink of ~90 ° at R51 and G56, resulting in a structural motif that, although also found in most other
-crystallins, is relatively unusual in the Protein Data Bank. For example, no similar motif is found in any of the 80 structures that form the basis for the TALOS program (Cornilescu et al. 1999), which erroneously interprets the Y49R51 chemical shifts as indicative of extended,
-sheet-like backbone angles for E50. However, all low-energy, refined fragments have E50 backbone torsion angles that cluster in the helical region of
/
space (Fig. 1C
B-crystallin (Kumaraswamy et al. 1996). Interestingly, E50 is the last residue that interacts with both
6 and
8 simultaneously. The amide signal of M58 in strand
6 is very weak, and the 1H-15N correlation of Y59 is completely absent, indicative of a conformational exchange process on the µsecmsec timescale. Nevertheless, owing to the overlapping nature of segments in the MFR search, the backbone conformation of these residues remains well defined by dipolar couplings from the flanking residues.
The linker between
6 and
7 (L61Y66) forms a motif known as the
5 "tyrosine corner" (Hemmingsen et al. 1994). In this motif, the Tyr residue adopts a 60 °
1 angle, pointing its ring toward the preceding residues, confirmed in the present study by a small (<0.5 Hz) 3JNC
value. The presence of such a tyrosine corner is often seen in the connection between two Greek-key
-sheets, and has been proposed to play an important role in stabilizing a Greek key over a hairpin connection (Hemmingsen et al. 1994). A similar inter-Greek-key motif (V41W46) is also located between strands
4 and
5. In this case, W46 replaces the Tyr, and again adopts a 60 °
1 angle (3JC'C
= 4.3 Hz), allowing formation of a H-bond from its N
1 to the carbonyl of G43, and hydrophobic packing against V41.
The C-terminal domain closely resembles its N-terminal counterpart, reflecting the evolutionary history of successive gene duplications that characterizes the 
-crystallin superfamily (Wistow et al. 2005). It also consists of two Greek-key
-sheets, connected by a Trp-corner. Compared to the
3
4 linker in the N-terminal domain, the linker between homologous strands
11 and
12 contains two extra residues, allowing formation of a short 310-helix (I117F121). Inter-Greek-key linkers between
7 and
8 and between
15 and
16 also adopt 310-helical structure, and these short helices partly cover the open ends of the wedge arrangement of each pair of Greek-key
-sheets. The amide signals of K130V131, and T135W136 are not observable, whereas others in this region are very weak, indicative again of the presence of conformational exchange on a µsecmsec timescale. Although the amide of W136 is not observable, a value of 3JC'C
= 4.9 Hz is measured for W136 via the amide of I137, indicating
1 = 60 °. This is compatible with the same Trp-corner motif for V131W136 as that found for the homologous region in the N-terminal domain (V41W46). However, in the absence of more NMR data, this region remains ill-defined. N143 is located in the linker between
13 and
14, immediately adjacent to the interdomain interface. Deamidation of this Asn residue has been postulated to disturb the charge balance at the interface, and thereby contribute to cataract formation (Aswad et al. 2000).
The presence of conformational exchange in the region where a "tyrosine corner" is expected in the C-terminal domain (L151Y156) on the basis of its homology to the N-terminal domain, makes it difficult to evaluate whether this motif is present in
S. Although the Tyr again adopts a 60 °
1 angle (3JC'C
= 3.8 Hz), the residue at the Y-2 position is Lys instead of the preferred Gly. The extended
-conformation (
+160 °,
180 °), normally adopted by Gly in this position, is unfavorable for Lys. We speculate that the presence of Lys in this position destabilizes the Tyr corner, and may be responsible for the intermediate timescale conformational exchange in this region, which results in the absence of amide signals for residues D152Y156. The calculated NMR structure suggests that the characteristic Tyr-corner Y156-O
to D152-HN hydrogen bond is populated. Interestingly, the X-ray crystal structures of homodimers of the C-terminal domains of both human and bovine
S indicate the presence of a motif that is distinctly different, with the Y156-O
hydrogen-bonding to K153HN instead of D152HN (Basak et al. 1998; Purkiss et al. 2002), a pattern incompatible with the average NMR structure.
Similar to its remotely related
AF-crystallins,
S is made up of two structurally equivalent domains that pack against each other, mainly through hydrophobic side chains on the backside of the Greek key motifs 1 and 4. The two domains are arranged in a way similar to that seen in the homodimers of the C-terminal
S domain (Basak et al. 1998; Purkiss et al. 2002). In
S, numerous intermethyl NOE interactions (Fig. 4
; Supplemental Material) indicate that hydrophobic interactions, between T45, A47, I60, and V85 in the N-terminal domain and T135, I137, L150, and V175 in the C-terminal domain, play an important role in stabilizing the two-domain structure. In the homodimeric crystal structure of the C-terminal domain, it is the latter set of residues that stabilizes interdomain contacts.
Validation of the structure
For structures refined with RDCs, a fraction of such couplings omitted from the refinement input is commonly used for validation purposes (Clore and Garrett 1999; Drohat et al. 1999), and allows for calculation of a quality factor, Q, in analogy to the free R-factor commonly used in X-ray crystallography. In the MFR approach, such cross-validation is less straightforward, as all couplings initially are used for fragment selection, and repeating the whole procedure from scratch, with a variable subset of dipolar couplings omitted, would be very time consuming. Instead, for validation purposes we here use a set of C
-H
RDCs that was not used in the MFR procedure. Although this set of couplings was measured under much weaker alignment conditions than the other RDCs, and consequently has larger experimental errors, these couplings are found to correlate well with those predicted by the structure (Pearsons correlation coefficient, RP = 0.95; Q = 31%). For reference, the 1.8 Å X-ray structure of ubiquitin yields Q = 23% (Cornilescu et al. 1998).
In terms of Ramachandran statistics, the structures score remarkably high by NMR standards (Table 3
), despite the fact that no database-derived potential of mean force ("rama" term in XPLOR-NIH) was used during structure refinement. Considering the small number of experimental observables used for defining side-chain conformation, mainly 3JC
N and 3JC
C' for
1, packing of the protein also is good by NMR standards, with only a moderate number of steric clashes reported by PROCHECK (Laskowski et al. 1993).
|
| Discussion |
|---|
|
|
|---|
S structure reported here is the first new protein structure to be determined largely by the MFR approach. However, for this protein rich in
-sheet, inclusion of a moderate number of relatively easily accessible HNHN and CH3CH3 NOEs is found to be most helpful. In particular, without the interdomain methylmethyl NOEs the relative position of the N- and C-terminal domains remains undetermined, even though their relative orientation is tightly defined by the dipolar couplings. The individual domains of
S agree closely with the crystal structures (Table 4
S and
B-crystallin is among the lowest obtained when comparing previously solved NMR and X-ray structures for any protein, even in the absence of sequence differences. For the C-terminal domain, differences relative to
B-crystallin (1.09 Å) or relative to the crystal structure of the C-terminal domain of either bovine or human
S (1.07 Å) (Basak et al. 1998; Purkiss et al. 2002) is considerably higher, mainly resulting from the absence of several amide resonances from the NMR spectrum, which makes it impossible for the HB-PMF to identify the applicable hydrogen bonds. Absence of these H-bonds and other restraints in the corresponding regions of the structure results in substantial local uncertainty and increased differences relative to the X-ray structures. Interestingly, however, if the H-bonds observed in the homologous crystal structures are added as input restraints during the final stage of the structure calculation process, all experimental restraints are satisfied equally well as without these restraints, and agreement between the calculated structure and homologous X-ray structures drops to 0.75 Å (Supplemental Material).
|
AF-crystallin have been solved previously. Figure 5
S structure with the closest matches in the database, when superimposing the N-terminal domains. The interdomain angle differs somewhat between the
B and
D crystal structures, and considerably more when comparing
B with the X-ray structures of the homodimeric C-terminal domain of
S (Supplementary Fig. 7). Our
S structure adopts interdomain angles that are intermediate between those of the
B or
D structures, and those seen in the homodimeric C-domain of
S. However, unless the interdomain NOE restraints are artificially tightened (data not shown), the distance between the two domains remains considerably larger than in any of the crystal structures (Fig. 5
S is found to be remarkably similar to the structure of the intact protein. This is compatible with the high degree of flexibility observed for residues 8893 in the interdomain linker of
S (Fig. 3
|
-crystallin homologs,
S has an extra four-residue segment at its N terminus. This is equivalent to the longer N-arms of
-crystallins and of the distantly related, one-domain protein spherulin 3a of the slime mold Physarum polycephalum (Rosinke et al. 1997). These segments are, at least in part, well ordered in X-ray structures. Although in
S the methyl of T3 exhibits a very weak NOE interaction to A84 C
H3, this interaction is most likely to be transient and the overall structure of this segment is poorly defined by the NMR data. Dynamic disorder for residues S1G5 is supported by 15N relaxation measurements, which indicate rapid amide exchange for S1K2 and strongly decreased 15N-{1H} heteronuclear NOE and transverse relaxation rates for T3K5, resulting in low order parameters (Fig. 3
S is highly mobile, possibly contributing to the high solubility of the protein. If this is so, loss of the N-arm, perhaps with aging in the lens, could contribute to reduced solubility and possible aggregation.
Our results indicate that
S adopts a stable, two-domain structure, where the two domains are rigidly held together by hydrophobic interactions and, like the N-arm, the interdomain linker is highly dynamic. In X-ray structures of both
- and
-crystallins, in which the interdomain linker can adopt very different conformations (bent in
-crystallins, extended in
B2-crystallin (Bax et al. 1990), it is generally well ordered and clearly defined. However, the solution structure of
S suggests that these linker peptides in other members of the crystallin family may be quite mobile too.
Except for the N-arm and the linker region, the protein is generally well structured. The disappearance of amide resonances in the two linkers between the Greek-key
-sheets, which adopt a Trp-corner and variant of the Tyr-corner motif in the X-ray structure of the C-terminal
S domain, indicates conformational exchange on an intermediate timescale (µsecmsec) in this area. Unfortunately, the precise nature of this exchange process cannot be established from the available NMR data. Even after exchange of the protein into D2O solvent, many of the amide signals remain present for many weeks, both for residues in the N- and C-terminal domains (Supplemental Material), confirming that the protein is thermodynamically very stable.
The presence of conformational exchange on a µsecmsec timescale indicates that even the well-defined polypeptide backbone of both domains contains regions of considerable flexibility and movement. Since one of the most striking features about cataract formation is that normally well-folded proteins unfold and aggregate, producing light scattering centers, it is conceivable that these regions may be involved in relatively high barrier, first steps in such an unfolding cascade.
| Materials and methods |
|---|
|
|
|---|
S was restricted with NdeI and HindIII, subcloned into the pET17b vector (Novagen) and transformed into Escherichia coli strain BL21(DE3)pLysS (Sinha et al. 2001). Starting with a single colony taken from a freshly streaked agar plate, a 5-mL LB medium was inoculated and subsequently grown to an OD600 of 0.8. Then a 25-mL minimal medium starter culture, containing 15N-NH4Cl and/or 13C-glucose as the sole sources of nitrogen and carbon, respectively, was inoculated with 500 µL of the above LB culture and grown until the OD600 reached 1.0. Next, 1 L of minimal medium containing 15N-NH4Cl and/or 13C-glucose was inoculated with 10 mL of the starter culture. The culture was incubated at 37 °C in a rotating shaker to an OD600 of 0.7, at which point protein expression was induced by addition of IPTG to a final concentration of 1 mM. The cells were harvested after 16 h by pelleting at 3500g for 25 min. Details regarding the purification procedure are described by Sinha et al. (2001). In the case of deuterated protein, D2O was used in the minimal medium instead of H2O, but protonated 13C-glucose was used. Isotropic NMR samples consisted of 0.31.5 mM
S, 25 mM imidazole (pH 6.0), 10 mM KCl, and 0.04% NaN3 (92% H2O/8% D2O), in 270-µL thin-wall Shigemi microcells.
NMR spectroscopy
All NMR experiments were carried out at 25 °C on Bruker DMX500 and DRX600 spectrometers, equipped with single-axis gradient, triple-resonance cryogenic probes, and DMX600 and DRX800 spectrometers equipped with self-shielded, three-axis gradient, triple resonance probes. Sequential backbone assignments were obtained from HNCA and HNCOCA, HNCO, CBCACONH, and CBCANH experiments, and aliphatic side-chain 13C and 1H were assigned from 3D 13C-edited HCCH-TOCSY (Bax and Grzesiek 1993). HNHN interproton distance restraints were determined from 3D 15N-separated NOESY (120 msec mixing) recorded on perdeuterated 15N-labeled
S, and methylmethyl interactions from 3D 13C-edited NOESY (120 msec) (Zwahlen et al. 1998b) on 15N/13C protein. Side-chain
1 angles for aromatic and C
-methyl carrying residues were determined from 3JC'C
and 3JNC
, measured by quantitative J correlation experiments (Grzesiek et al. 1993; Vuister et al. 1993; Hu et al. 1997), as well as 2D {13C
} spin-echo difference experiments (Hu and Bax 1997). All NMR data were processed with NMRPipe software (Delaglio et al. 1995) and analyzed with NMRView (Johnson and Blevins 1994) and NMRDraw (Delaglio et al. 1995).
HNHN distances were calibrated according to the average HNHN distance of 2.9 ± 0.2 Å for the following interactions in anti-parallel
-sheet: 7/22, 9/20, 8/40, 10/38, 41/64, 47/83, 49/ 81, and 26/82, using an empirical 1/r4 dependence of the intensity, which to a first approximation accounts for the effects of spin diffusion (Güntert et al. 1991). A 0.5 Å tolerance was added to both upper and lower bounds for the distances derived in this manner. Methylmethyl distance restraints were obtained from a 3D 13C-edited NOESY spectrum in a similar manner, but using the intermethyl distance of 2.5 Å between two geminal methyl carbons in Leu and Val residues, with an error margin of 0.8 and +0.5 Å, respectively, for the lower limit and upper limit of the derived distance restraint.
RDCs were measured in several alignment media: axially stretched uncharged polyacrylamide gel, as well as filamentous Pf1, aligned either magnetically, or by stabilizing its orientation in a gel matrix. The stretched gel sample was prepared by soaking a premade gel (6% [w/v]), originally cast with a diameter of 5.4 mm, in the
S protein solution for 24 h, followed by radial compression using a funnel-type device to insert it into an open-ended NMR tube with an inner diameter of 4.2 mm (Chou et al. 2001). Experiments were collected within a 2-wk period, before any significant changes were noticeable in the proteins alignment strength. Two filamentous Pf1 samples were prepared, each containing 3 mg/mL Pf1 phage. One such Pf1 sample was originally at very low ionic strength, with no added
S, but in the presence of 5% w/v acrylamide/bisacrylamide (40:1 molar ratio), whose polymerization was initiated in the standard manner immediately prior to insertion of the sample in an 800 MHz NMR magnet. Under the low ionic strength conditions, the Pf1 was ~50% aligned with the magnetic field (corresponding to a 2H quadrupole splitting of 1.4 Hz), and this alignment was maintained after polymerization of the acrylamide. The solidified gel was cleaned by two wash cycles in 20 mL H2O, and subsequently equilibrated in the
S protein solution at the regular buffer conditions. After reinsertion in the NMR tube, the 1.4 Hz 2H quadrupole splitting indicated the Pf1 alignment was maintained by the gel matrix during the entire handling process. A second 3 mg/mL Pf1 sample, whose orientation was not stabilized by gel and which was below the nematic Pf1 threshold, yielded a 0.46 Hz 2H quadrupole splitting, and this sample was used for measurement of the 1DH
C
couplings. 1DNH RDCs were obtained from 2D HSQC-TROSY spectra, recorded in an interleaved mode (Kontaxis et al. 2000). 1DNC', 1DH
C
, and 1DC
C
RDCs were measured from 3D-HNCO (Chou et al. 2000a), 3D-CBCACONH (Chou and Bax 2001), and 2D-HNCOCA (Evenas et al. 2001) experiments, respectively, utilizing quantitative J correlation. 1DC
C' RDCs were obtained from C
-coupled HNCO experiments recorded at 500 MHz (1H frequency), to minimize C' relaxation, dominated by its large chemical shift anisotropy.
MFR selection
All molecular fragment selection was carried out using the program DYNAMO (Kontaxis et al. 2005). In a first iteration, seven-residue fragments out of the 800-protein library were searched on the basis of the backbone dipolar couplings, all normalized to the 1DNH interaction using previously reported scaling factors (Bax et al. 2001). 1DC
H
couplings were not used at any stage, except during structure validation. Results of this initial, SVD-based MFR search were only used to define the magnitudes and relative orientations of the two alignment tensors (Table 2
), in the manner outlined previously (Kontaxis et al. 2005). In the second MFR search, the magnitude and relative orientation were kept fixed on the basis of the results of the first search, and the 10 database fragments exhibiting the best fit to the RDCs were retained for each of the 171 seven-residue fragments of
S. Subsequently, each of the database fragments was subjected to a brief simulated annealing protocol using the DYNAMO program, during which its N, C
, and C' atoms were harmonically restrained to remain close to those of the original database fragment, as were the backbone torsion angles,
and
, using the RDCs as experimental input restraints. Only fragments for which the RMSD between experimental and calculated RDCs after refinement was lower than 1.5 Hz were retained. If no fragments refined to lower than 1.5 Hz, fragments refined to below 2.1 Hz were selected, provided they had all very similar backbone angles (Supplementary Fig. 6e,f). The terminal residues of each seven-residue fragment were discarded, and the average and standard deviation of the backbone torsion angles of the center five residues were retained for use as restraints during structure calculation.
Structure calculations
Structures were calculated by simulated annealing using XPLOR-NIH, starting from a fully extended strand (Schwieters et al. 2003). The protocol consisted of three stages. In the first stage, initially for 1000 1-fs steps at 1000 K, the protein was folded using zero van der Waals radii, 300 kcal/rad2 torsion angular restraints, and an NOE term that was ramped exponentially in 50 steps from 0.002 to 50 kcal/ Å2. Subsequently the temperature was lowered from 1000 to 300 K in 5-K decrements, using 285 time steps of 3 fs each, while keeping the torsion and NOE force constants unchanged, and ramping the van der Waals radii from 0 to 0.5 times their regular value. In the second stage, the temperature was lowered from 300 to 20 K, in 10-K decrements, while ramping the van der Waals radii from 0.002 to 0.8 times their regular value, and exponentially ramping the dipolar force constant from 5 x 104 to 0.5 kcal/Hz2 for DNH couplings. Dipolar force constants for DCC and DCN couplings were five- and eightfold higher, respectively. During the third stage, the temperature was initially started at 2000 K and then linearly decreased from 2000 to 1 K in 10-K decrements, using 500 time steps of 2 fs each. The force constant for the dipolar coupling restraints was ramped exponentially in the regular manner, up to a final value of 0.5 kcal/Hz2 (for couplings normalized to 1DNH), while the backbone and side-chain torsion angle force constants were kept fixed at 10 and 50 kcal mol1 rad2, with the NOE force constant unchanged at its full value. During this third stage, the H-bond PMF potential was turned on with a force constant of 0.6 kcal for the directional term and 0.1 kcal for the linearity term. Of the total of 20 structures used as input for the third stage, all converged and exhibited low energies, with no NOE violations larger than 0.3 Å and only one dipolar coupling violation (D1611DHN in gel) larger than 5 Hz.
Coordinates have been deposited in the RCSB Protein Data Bank under accession numbers 1ZWM [PDB] (experimental restraints only) and 1ZWO (experimental restraints plus homology-modeled H-bond restraints).
| Footnotes |
|---|
| Acknowledgments |
|---|
| References |
|---|
|
|
|---|
Andrec, M., Du, P.C., and Levy, R.M. 2001. Protein backbone structure determination using only residual dipolar couplings from one ordering medium. J. Biomol. NMR 21: 335347.[CrossRef][Medline]
Annila, A., Aitio, H., Thulin, E., and Drakenberg, T. 1999. Recognition of protein folds via dipolar couplings. J. Biomol. NMR 14: 223230.[CrossRef]
Aswad, D.W., Paranandi, M.V., and Schurter, B.T. 2000. Isoaspartate in peptides and proteins: Formation, significance, and analysis. J. Pharm. Biomed. Anal. 21: 11291136.[CrossRef][Medline]
Basak, A.K., Kroone, R.C., Lubsen, N.H., Naylor, C.E., Jaenicke, R., and Slingsby, C. 1998. The C-terminal domains of
S-crystallin pair about a distorted twofold axis. Protein Eng. 11: 337344.
Basak, A., Bateman, O., Slingsby, C., Pande, A., Asherie, N., Ogun, O., Benedek, G.B., and Pande, J. 2003. High-resolution X-ray crystal structures of human
D crystallin (1.25 Å) and the R58H mutant (1.15 Å) associated with aculeiform cataract. J. Mol. Biol. 328: 11371147.[CrossRef][Medline]
Bax, A. 2003. Weak alignment offers new NMR opportunities to study protein structure and dynamics. Protein Sci. 12: 116.
Bax, A. and Grzesiek, S. 1993. Methodological advances in protein NMR. Acc. Chem. Res. 26: 131138.[CrossRef]
Bax, B., Lapatto, R., Nalini, V., Driessen, H., Lindley, P.F., Mahadevan, D., Blundell, T.L., and Slingsby, C. 1990. X-ray-analysis of
-B2-crystallin and evolution of oligomeric lens proteins. Nature 347: 776780.[CrossRef][Medline]
Bax, A., Kontaxis, G., and Tjandra, N. 2001. Dipolar couplings in macro-molecular structure determination. Methods Enzymol. 339: 127174.[Medline]
Berman, H.M., Westbrook, J., Feng, Z., Gilliland, G., Bhat, T.N., Weissig, H., Shindyalov, I.N., and Bourne, P.E. 2000. The Protein Data Bank. Nucleic Acids Res. 28: 235242.
Bloemendal, H., de Jong, W., Jaenicke, R., Lubsen, N.H., Slingsby, C., and Tardieu, A. 2004. Ageing and vision: Structure, stability and function of lens crystallins. Prog. Biophys. Mol. Biol. 86: 407485.[CrossRef][Medline]
Brenneman, M.T. and Cross, T.A. 1990. A method for the analytic determination of polypeptide structure using solid-state nuclear magnetic-resonanceThe metric method. J. Chem. Phys. 92: 14831494.[CrossRef]