|
|
||||||||
1 Department of Chemistry, University of Minnesota, Minneapolis, Minnesota 55455, USA
2 Department of Biochemistry, Molecular Biology, and Biophysics, University of Minnesota, St. Paul, Minnesota 55108, USA
Reprint requests to: George Barany, Dept. of Chemistry, University of Minnesota, 207 Pleasant St. S.E., Minneapolis, MN 55455, USA; e-mail: barany{at}umn.edu; fax: (612) 626-7541.
(RECEIVED November 1, 2001; FINAL REVISION March 14, 2002; ACCEPTED March 15, 2002)
3 Present address: Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, U.K. ![]()
4 Taken in part from Ph.D. thesis of N.C., University of Minnesota, September 2001. ![]()
Article and publication are at http://www.proteinscience.org/cgi/doi/10.1110/ps.4440102.
| Abstract |
|---|
|
|
|---|
50-residue protein in which two BPTI-derived core modules, CM I and CM II, are connected by a 22-atom cross-link. At low temperature and pH 3, homo- and heteronuclear NMR data report a dominant folded (`f') conformation with well-dispersed chemical shifts, i, i+1 periodicity, numerous long-range NOEs, and slowed amide hydrogen isotope exchange patterns that is a four-stranded antiparallel ß-sheet with nonsymmetrical and specific association of CM I and CM II. BetaCore `f' conformations undergo reversible, global, moderately cooperative, non-two-state thermal transitions to an equilibrium ensemble of unfolded `u' conformations. There is a significant energy barrier between `f' and `u' conformations. This is the first designed four-stranded antiparallel ß-sheet that folds in water. Keywords: Protein design; protein folding; ß-sheet protein; core modules; cross-link
Abbreviations: 1D 1H, one-dimensional proton Abu or X,
-amino-n-butyric acid BPTI, bovine pancreatic trypsin inhibitor CM, oxidized core module
H
, chemical shift for
-protons ESMS, electrospray mass spectrometry Fmoc, 9-fluorenylmethoxycarbonyl HPLC, high-performance liquid chromatography HSQC, 15N-1H heteronuclear single quantum coherence 3JHNC
H, vicinal coupling constant kobs, exchange rate constant krc exchange rate constant expected in a random coil Mpa or B, ß-mercaptopropionic acid NMR, nuclear magnetic resonance NOESY, nuclear Overhauser effect spectroscopy SEC, size exclusion chromatography Tm, transition midpoints TOCSY, total correlation spectroscopy
| Introduction |
|---|
|
|
|---|
-helices, ß-sheets, ß-turns, etc. are further stabilized in the final designed proteins by spontaneous self-assemblies (monomeric intramolecular or multimeric such as coiled-coil or bundles), covalent cross-links (e.g., disulfides), and/or metal ligations.
Considerable successes in the design of
-helical proteins (Hill et al. 2000) are in contrast to the myriad challenges of creating ß-sheet proteins (Smith and Regan 1997). Specific intramolecular long-range backbone and side chain interstrand interactions required to achieve these structures are countered by related intermolecular interactions; the latter are manifested by a pronounced tendency to aggregate in aqueous media. Nevertheless, building on information about conformational propensities of residues in ß-turns and ß-sheets revealed in protein structural databases as well as through experimental studies of model ß-hairpins (Blanco et al. 1998; Gellman 1998), de novo design has been carried out for three-stranded antiparallel ß-sheets that fold in water (Kortemme et al. 1998; Schenck and Gellman 1998; De Alba et al. 1999) and in organic solvents (Das et al. 1998; Sharman and Searle 1998), and a four-stranded ß-sheet has been reported to be stable in MeOH and 50% MeOH/H2O (Das et al. 2000). Moreover, a three-stranded antiparallel ß-sheet protein was constructed by redesign of naturally occurring toxin hand motifs (Ottesen and Imperiali 2001), and three-stranded antiparallel ß-sheets derived from the WW domain motif have been studied extensively (Koepf et al. 1999; Jiang et al. 2001).
Here we report the characterization and structural analysis of the first water-soluble designed four-stranded antiparallel ß-sheet, called BetaCore, that has properties approaching those of native proteins. Because of the importance of ß-sheets in protein dimerization, protein-protein recognition, and protein aggregation (Maitra and Nowick 2000), including a likely role in the progression of neurodegenerative diseases (e.g., Alzheimer's, Creutzfeldt-Jakob, and Huntington's), this work may be of interest beyond the initial protein design emphasis.
| Results and Discussion |
|---|
|
|
|---|
We hypothesize further that covalent linkage of two CM units will enhance self-association and mutual stabilization (Carulla et al. 2000; Woodward et al. 2001), and result in an ensemble that favors more compact conformations resembling a protein native state. A dimeric form of BPTI exists in solution and undergoes rapid associationdissociation (Gallagher and Woodward 1989; Ilyina et al. 1997), and molecular modeling studies on dimeric BPTI (assumed to be antiparallel) suggest that residues at the hydrophobic monomer-monomer interface are primarily in CM (Zielenkiewicz et al. 1991). Using oxime-forming ligation chemistry to create covalent CM dimers, six candidates each of
50 residues were produced with variations in position and length of the cross-link (Carulla et al. 2001). One-dimensional proton (1D 1H) NMR at pH 2 was used as a qualitative criterion to gauge ordered structure in the conformational ensembles of each CM dimer, and the candidate with the narrowest line widths and highest degree of chemical shift dispersion, that is, BetaCore, was chosen for the more detailed investigations reported herein.
Note that although the CM building blocks of BetaCore have identical residues except for the ones in the cross-link, combining two CMs results in a formal loss of symmetry. In CM I, Arg17 is replaced to form the cross-link; in CM II, Leu29 is replaced. Previous synthetic protocols (Carulla et al. 2001) were readily adapted to prepare CM I and CM II, and the resultant BetaCore constructs, with site-specific 15N labels (in individual CMs or in both), hence facilitating heteronuclear NMR experiments.
BetaCore is monomeric at pH 3 in the absence of salt
Since most ß-sheet forming peptides have a strong tendency to aggregate, it is imperative to confirm that BetaCore is monomeric under conditions of the NMR structural studies. Size-exclusion chromatography (SEC) at 3°C with 0.3 mM protein in CH3CN10 mM aqueous NaCl (1:19), pH 2, shows an elution volume consistent with monomeric molecular weight (calcd 5919.9 Da) and the absence of noncovalent associations (Carulla et al. 2001). Elution profiles were calibrated with standards of known size; salt and organic solvent were included to suppress, respectively, ionic and hydrophobic interactions between protein and cross-linked agarose/dextran column material.
A clearer gauge of aggregation states was provided by analytical ultracentrifuge sedimentation equilibrium experiments; these were conducted at 5°C, with protein loading concentrations ranging from 2.5 µM to 0.05 mM in water at pH 3, and two rotor speeds. Data analysis gave average molecular weights (i.e.,
4468 Da, with considerable scatter) somewhat lower than the molecular weight calculated from the monomeric amino acid sequence; this is a commonly observed result attributable to the opposition of protein electrostatic effects (note that at pH 3, BetaCore has seven positive charges) and ultracentrifugal forces (Schenck and Gellman 1998; De Alba et al. 1999). Therefore, a second virial coefficient, B, was included to account for nonideality effects, resulting in a good global fit of the data to a single species of average mass (Mav) 5878 Da [95% confidence 5800 to 5945 Da]. These results are consistent with the monomer mass and the absence of significant self-association over the concentration range examined. The fit value of B is 0.691 mL/mg [95% confidence 0.651 to 0.725], consistent with strong repulsion.
Given the plan to characterize BetaCore by NMR at concentrations higher than those used in the sedimentation analysis, 1D 1H NMR spectra were recorded in water at pH 3 and 5°C for samples at 0.05 mM and 0.4 mM. The observation that line widths as well as chemical shifts are indistinguishable is good evidence that the protein is monomeric into the 0.4 mM concentration range.
One means to suppress nonideal sedimentation behavior is to add salt, for example, 20 mM NaCl. Under these conditions at pH 3, the average molecular weights [i.e.,
7400 Da, with a range from
5500 to
9000 depending on concentration] were higher than the calculated monomeric weight. The data are best fit to a monomer-dimer equilibrium, dissociation constant 87 µM, with the monomer Mav of 5911 Da [95% confidence from 5731 to 6033 Da]; any higher stoichiometries (trimer, tetramer, etc.) do not give reasonable fits or estimated monomer masses. Inclusion of a second virial coefficient in the monomer-dimer fit does not provide significant improvement and returns a negative (attractive) value for B.
Based on these controls, the conditions used for NMR experiments are 0.4 mM in water at pH 3 with no added salt. A pH of 3 was chosen because line widths and chemical shift dispersions are optimal (with respect to higher pH), solubility is acceptable, and amide proton-solvent exchange is near the minimum rate.
Assignments of folded conformations of BetaCore
BetaCore samples were prepared containing 15N labels at selected positions in either CM I or CM II (see Fig. 1
and accompanying text for nomenclature). In 15N HSQC spectra recorded at 1°C, the number of peaks is equal to the number of labeled residues in each molecule, with chemical shifts well resolved from the random coil envelope (Fig. 2a
). Moreover, the ten peaks from the BetaCore sample in which CM I is labeled are all different from the nine peaks from the sample in which CM II is labeled at the same sequence positions (note that CM II has a cross-link in place of Leu at position 29). This result clearly indicates asymmetry between the two core modules comprising BetaCore, which can be correlated directly to the presence of the cross-link, and suggests that the observed peaks correspond to folded (`f') conformations.
|
|
H region are broad, making them difficult to define unambiguously. Also, very few NH-side chain cross-peaks are observed, precluding identification of spin systems. However, when the temperature is raised to 15°C, the peaks sharpen substantially, and it is possible to make sequential assignments (Wüthrich 1986) by combined analyses of homonuclear TOCSY, NOESY, and heteronuclear 15N-1H HSQC-TOCSY and 15N-1H HSQC-NOESY spectra. Assignments of all backbone and
95% of side chain protons of folded (`f') conformations were facilitated by use of the selectively 15N-labeled BetaCore proteins in heteronuclear experiments. Additional cross-peaks are observed at 15°C beyond those assigned; all of these are found within the random coil envelope and are likely associated with unfolded (`u') conformations, as discussed in detail below.
Folded and unfolded conformations of BetaCore are in slow exchange
15N HSQC spectra of labeled BetaCore molecules were recorded at different temperatures (1, 5, 15, 25, 35, 45, and 55°C, and back to 1°C). As already indicated, the number of peaks at 1°C matches exactly the number of labeled residues, and all can be assigned in the clearly asymmetric cross-linked CM dimer (Fig. 2a
). As temperature is increased, a new set of peaks with random coil chemical shifts start to appear, and at 35°C, the number of peaks is exactly double the number of labeled residues (Fig. 2b
). At 55°C, the number of peaks again matches the number of labeled residues; all of the peaks observed at low temperature are absent and the only peaks now present are those that grew in with increasing temperature (Fig. 2c
). When a sample evaluated at 55°C is returned to 1°C, the original low temperature spectrum (Fig. 2a
) is observed.
These data are consistent with fully reversible transitions from folded (`f', at 1°C) to unfolded (`u', at 55°C) conformations. Assignments of backbone and side chains of labeled residues for the unfolded protein were made based on heteronuclear 15N-1H HSQC-TOCSY and 15N-1H HSQC-NOESY. The important conclusion for the 19 residues carrying an 15N-label is that not only are the chemical shifts observed at 55°C all in the random coil region, but also the sequence symmetry of the CM covalent dimer is observed in the thermally denatured form. That is, identical residues at the same sequence positions in CM I and CM II do not have the same chemical shifts in folded BetaCore at 1°C, but the shifts are changed and overlapping when BetaCore is unfolded at 55°C.
When two protein conformations interconvert slowly on the NMR time scale (in the millisecond range), each gives rise to a separate peak corresponding to that conformation (Roberts 1993). The fact that throughout the unfolding transition each resonance is represented by two different peaks, one with a chemical shift well resolved and away from the random coil envelope, and the other with a random coil chemical shift, implies that BetaCore is in slow conformational exchange on the NMR time scale between folded and unfolded forms. Indeed, interconversion between these two conformations is much longer than milliseconds, as supported by the absence of exchange cross-peaks (Falzone et al. 1991) under a variety of experimental conditions (see Materials and Methods). As a consequence, full assignments of unfolded conformations have not been possible.
Sequential NOEs and coupling constants imply ß-sheet structure
In the dominant folded `f' conformation of BetaCore, for all residues designed to be in a strand, interresidue NHiNHi+1 NOE cross-peaks are very weak or entirely absent, whereas C
HiNHi+1 NOE cross-peaks are intense (Fig. 3
). This pattern of sequential NOE connectivities implies ß-sheet structure, but does not impart information about strand alignment.
|
H representative of different regions. All coupling constants for potential strand residues are greater than 8 Hz (the value for FII33 is > 9 Hz), consistent with
-120° as is characteristic of ß-sheet structure (Fig. 3
H for AI25 and AII25 are 4.4 and 5.1 Hz, respectively, in agreement with the 3JHNC
H of 4.1 Hz for type I ß-turn residue A25 in native BPTI.
Long-range NOEs indicate that BetaCore is a four-stranded ß-sheet
Long-range NOEs generally provide the most conclusive evidence for tertiary structure in solution. The NOE cross-peaks characteristic of antiparallel ß-sheet are those involving C
HiC
Hj, C
HiNHj+1, and NHi+1NHj-1 protons, where i and j are residues that face each other in adjacent ß-strands and the C
H protons point towards the interior. The shortest distances, and correspondingly, the most intense NOEs, are between C
HiC
Hj protons; these are the main diagnostic for a ß-sheet (Wüthrich 1986). Unambiguously identified C
HC
H NOEs between residues in the same CM, i.e., YI23XI30, YI21TI32, II19VI34, RI17GI36, YII23XII30, YII21TII32, III19VII34, and RII17GII36 (Fig. 4
, thick arrows), are the same as in native BPTI (Wagner et al. 1987), providing compelling evidence that each CM unit in BetaCore samples native-like 4:4 ß-hairpin structure. No NOEs in spectra of BetaCore are consistent with nonnative ß-sheets. This is in contrast to the `oxidized core module' by itself, which is an equilibrium ensemble of conformations among which a major population is similar to the native-like 4:4 ß-hairpin and a minor population approximates 3:5 ß-hairpins (Carulla et al. 2000).
|
HC
H NOEs are observed between residues in different CMs comprising BetaCore. These NOEs: LI29NII24, QI31FII22, F133RII20, and YI35III18 (Fig. 4
A significant number of long-range contacts are observed between side chains of residues on adjacent strands within the same CM (e.g., YI23XI30, YI21TI32, and YII21TII32; a total of 21 NOEs) or between different CMs (e.g., QI31FII22, YI35III18; a total of 15 NOEs) comprising BetaCore (Fig. 4
); these are all consistent with the proposed four-stranded antiparallel ß-sheet. In addition, a number of i, i+2 NOEs are observed between side chains (e.g., YI23AI25, II19YI21, YII23AII25, and III19YII21); these are characteristic of extended conformations (Fig. 4
).
Chemical shift data corroborate BetaCore four-stranded ß-sheet folded conformations
Chemical shift data for
-protons (
H
) provide further evidence that BetaCore adopts four-stranded antiparallel ß-sheet conformations in aqueous solution. Secondary structure profoundly affects
H
, with ß-sheet protons shifted downfield and turn protons shifted upfield relative to expected random coil values (Ösapay and Case 1994). The values of 
H
=
H
observed -
H
random coil (Merutka et al. 1995; Wishart et al. 1995; Andersen et al. 1997) for BetaCore at pH 3 and 15°C, plotted as a function of residue position (Fig. 3
), can be compared to those reported previously (Carulla et al. 2001) for the corresponding single (unlinked) CM units. Overall shapes of these graphs follow expectations from the native-like BPTI ß-sheet, that is, downfield shifts for residues 1824 and 2935 and upfield shifts for residues 2528. The absolute values of the deviations from random coil values are much higher for residues from BetaCore (e.g., YI21 has 
H
= 1.38) than for corresponding residues in CM units (e.g., YI21 has 
H
= 0.38), pointing to considerable stabilization of structure associated with combining two CMs through a covalent cross-link. In comparison, the maximum 
H
in the ß-sheet core of native BPTI, recorded at 68°C, is
1.15 (Wagner and Wüthrich 1982).
Analysis of the fine structure of the chemical shift deviation profile (Fig. 3
) provides additional insights. Note the essential mirror symmetry with respect to the two core modules that form BetaCore: zig-zag shapes are observed for the first strand of CM I and the second strand of CM II, and smoother shapes are observed for the second strand of CM I and the first strand of CM II. Theoretical calculations (Ösapay and Case 1994) provide a precedent for the zig-zag shape matching the hydrogen-bonding pattern in simple ß-hairpins; thus, the alternating residues involved in cross-strand contacts (e.g., residues II19, YI21, YI23, XII30, TII32, and VII34 in Fig. 4
) have a more positive
H
proton shift than residues having no interactions with a neighboring strand (e.g., residues II18, RI20, FI22, QII31, FII33, and YII35 in Fig. 4
). Such two-fold i, i+1 periodicity is also found in several native proteins, including tendamistant, plastocyanin, and interleukin 1ß. However, when a strand is located in the middle of a ß-sheet, each consecutive residue is involved in a contact with one or the other of the two neighboring strands, so the aforementioned i, i+1 periodicity is replaced by a smoother profile (i.e., as actually observed for the second and first strands of CM I and CM II, respectively, in the BetaCore protein; see Figs. 3 and 4![]()
). All of these data are consistent with the proposed four-stranded antiparallel ß-sheet conformation.
The chemical shift deviation profile (Fig. 3
) can even be used to pinpoint the ß-turn and distinguish among possible types. Turn residues are calculated and found (Ösapay and Case 1994) to always show a negative 
H
, with the sole exception that the chemical shift of position 3 of a type I ß-turn is expected to be very close to the random-coil value. Indeed, this is found for both turns of BetaCore (i.e., focus on DI27 and DII27) and agrees with the type I ß-turn found in native BPTI.
H/D exchange in BetaCore supports compact folded structure
Hydrogen isotope exchange experiments were conducted at pH 3 and 5°C, recording 2D 15N HSQC spectra as a function of time on a sample of BetaCore in which ten residues in CM I and nine residues in CM II are labeled with 15N. Exchange rate constants (kobs) for all 19 15N-bound amide hydrogens were measured; comparisons to the rates expected in a random coil (krc) calculated based on the pH, temperature, residue, and N-side neighboring residue (Bai et al. 1993) gives protection factors (Fig. 3
). The observed protection factors in BetaCore are at least an order of magnitude higher than those of corresponding residues in the `oxidized core module' (Carulla et al. 2000), and of the same order of magnitude as reported for a partially folded protein such as [1438]Abu (Barbar et al. 1995).
The exchange pattern in BetaCore is in good agreement with predictions from the four-stranded antiparallel ß-sheet model (Fig. 4
). Highly protected residues (i.e., FI22, FII22, FI33, and FII33; krc/kobs = 3075) engage in hydrogen bonds in the middles of the intramodule strands (two each for CM I and CM II), as well as the signature hydrogen bond of the native-like BPTI type I ß-turn (i.e., GI28 and GII28; krc/kobs 100 and 190, respectively). Moderately protected residues (i.e., LI29, VI34, and GI36; krc/kobs = 1720) are those that engage in intrastrand hydrogen bonds, one within CM I and two between CM I and CM II. Residues that exchange relatively rapidly (i.e., AI16, AII16, GcI17, AI25, AII25, GcII29, VII34, GII36, GI37, and GII37; krc/kobs = 310) include representatives from loop and turn regions, the cross-link, and a solvent-exposed side of ß-strand, none of which are expected to be involved in hydrogen bonding.
Calculation of the family of BetaCore folded structures
Compatibility of the proposed four-stranded antiparallel ß-sheet with all experimentally observed NOE distance constraints, those dihedral angles supported by 3JHNC
H measurements, and the hydrogen bonds deduced from H/D exchange studies, was checked by calculating a three-dimensional model structure (Fig. 5
, Table 1
). The most ordered backbone regions are the four strands; there is some flexibility in the turn regions, and the disulfide-bridged loop portions are considerably disordered (Fig. 5a
). The most disordered section of BetaCore is the essential long cross-link, as best appreciated by viewing the construct from the side (Fig. 5b
). Thus, the cross-link can traverse either face of the four-stranded sheet, with a statistical preference (15 of the best 20 calculated structures) for the face defined by the side chains of YI21, YI23, and YII21. Further structural definition of the cross-link is precluded by the absence of NOEs involving other parts of the molecule. Position
II29, which anchors the cross-link onto CM II, has a 
H
suggestive of random coil conformation (Fig. 3
), and perturbs neighboring residues comprising the CM II turn so that it is somewhat less defined than the corresponding turn in CM I (Figs. 4 and 5a![]()
). The structural calculations are not sufficiently resolved to determine whether the BPTI native-like `twist' is reproduced in ß-hairpins of either CM I or CM II.
|
|
Unfolding transitions of BetaCore
15N HSQC spectra of BetaCore provide evidence for folded (`f') and unfolded (`u') conformations (Fig. 2
). Reversible thermal unfolding is monitored by plotting, as a function of temperature, the volume of each unfolded cross-peak, referenced to the volume of the peak for the same residue at 55°C where the protein is completely unfolded (Fig. 6
). Although for each probe examined the curve is roughly sigmoidal, indicating some cooperativity, the transition midpoints (Tm) vary considerably and hence unfolding is non-two-state. Residues in the strands (i.e., FI22, LI29, FI33, VI34, and FII33) have a Tm
39°C, except for FII22 which has Tm
42°C and VII34 which has Tm
33°C (VII34 is not involved in interstrand hydrogen bonding). Residues in the loops (i.e., AI16 and AII16) and residues in the turn (i.e., AI25 and AII25) have a Tm
33°C. Glycines in the cross-link have a Tm
25°C; these are the only residues with significant levels of `u' at 5°C. For all of these residues, corresponding curves showing decreasing `f', rather than increasing `u', give qualitatively similar results, but Tm estimations are not possible due to ambiguity in the folded baselines.
|
0.8 for the Gly residues that are part of the cross-link, to
0.5 for loop residues, to
0.250.5 for residues in the strands and turns. Second, the absolute values of cross-peak volumes of `u' peaks at 55°C vary over a twofold range, not correlated in any obvious way to the nature or environment of the residue. Third, a subset of the residues examined (i.e., FI33, FII22, FI22, LI29, GI36, and AI25) show a 10%30% increase in the 15N HSQC volume of `f' as the temperature is raised from 5° to 15°C. Fourth, spectra of BetaCore show line width broadening for some but not all peaks. The line widths of the `f' conformation at 5°C range from
15 Hz for cross-link Gly residues, to 1622 Hz for loop residues, to
2334 Hz for strand or turn residues; all become sharper as temperature is increased. The line widths of the `u' conformation at 35°C range from
16 Hz for cross-link Gly residues, to 1220 Hz for loop and turn residues, to
2434 Hz for residues in the strands; again, all sharpen at 55°C. Line broadening and volume reduction reflect the presence of multiple conformations that have different chemical shifts for the same nucleus and interconvert on an intermediate NMR time scale (millisecond to microsecond range) (Roberts 1993). Applying these ideas to BetaCore, we conclude that the ensemble structure at low temperature consists of a dominant, folded conformation in equilibrium with minor conformations that may be exchange broadened and/or sparsely populated. Thermal unfolding of the dominant family of conformations is global, and non-two-state. Different parts of four-stranded BetaCore become random coil-like at different temperatures, and/or the `u' ensemble varies with temperature.
Role of cross-link in structure and stability of BetaCore
The components of BetaCore are two units of a CM that, by itself, is monomeric and favors native-like ß-sheet structure (Carulla et al. 2000). When combined covalently by a reasonably optimized long oxime cross-link, each constituent CM is individually stabilized toward native-like conformation (while minor nonnative conformations are eliminated), and new highly specific intermodule interactions emerge leading to a monomeric collapsed structure with substantially enhanced global stability. The primary role of the cross-link is to make the system unimolecular, thereby increasing the probability of packing interactions between two CMs. The cross-link itself is very flexible, as evidenced by the absence of NOEs, as well as the rapid H/D exchange and low Tm values of reporter residues; it is not in the vicinity of stable, organized structure. Furthermore, the cross-link used here has polar moieties which are likely to facilitate the overall water solubility of BetaCore, in contrast to conceivable more lipophilic cross-linkers that might nucleate aggregation.
The length and flexibility of the cross-link has additional advantages in this system: it allows the two CMs to sample various relative orientations and does not interfere with essential packing. The nonsymmetrical CMCM association actually accessed in BetaCore features a four-stranded antiparallel ß-sheet, where the cross-link connects points on outside strands (i.e., the N-terminal strand of CM I and the C-terminal strand of CM II). Several CM covalent dimers with shorter and/or differently positioned cross-links that were synthesized and evaluated in pilot work gave no or less indication for compact structure (Carulla et al. 2001), findings that in view of the present knowledge of BetaCore structure become plausible. Relatedly, the observed preferred structure, featuring its antiparallel and asymmetric arrangement of chains corresponding to parallel alignment of two CMs, differs from symmetric, antiparallel CM dimer interfaces anticipated in earlier stages of the design process.
Further insights can be gleaned from simulated annealing calculations showing that the cross-link can be accommodated on either face of the ß-sheet structure of BetaCore. This is consistent with the idea that the cross-link used here has no direct role in organizing structure beyond its effect to combine two CMs into a single molecule. It raises new questions about what other cross-link motifs (in terms of structure, length, and positioning) could lead to similar or even enhanced formation of and stabilization of collapsed structure. For example, the C-terminal strand of CM I and the N-terminal strand of CM II could be connected by a rather short cross-link, compatible with the observed four-stranded antiparallel ß-sheet geometry (proximal effect). Alternatively, it may be possible to develop a long transverse cross-linker (distal effect) that packs in a more specific way to a given face of ß-sheet structure, hence mimicking themes found in naturally occurring proteins such as ubiquitin and the immunoglobulin binding domains of proteins G and L.
Significance of BetaCore
Cross-linking core modules can indeed lead to their mutual stabilization to native-like conformations that are significantly more stable than other accessible conformations. BetaCore achieves a four-stranded antiparallel ß-sheet conformation, as supported by well-dispersed chemical shifts, i, i+1 periodicity, numerous long-range NOEs, and slowed amide hydrogen isotope exchange patterns. This conformation reinforces features expected from the structure of the BPTI source on which the CMs are based, but also shows significant interactions not found in BPTI. To the best of our knowledge, this is the first report that a designed protein adopts a four-stranded antiparallel ß-sheet conformation in water.
Folded BetaCore undergoes reversible, global, moderately cooperative, non-two-state thermal transitions to an equilibrium ensemble of unfolded `u' conformations. Folded and unfolded interconvert slowly on the NMR time scale, indicating a significant energy barrier between them. Thus, BetaCore has properties that compare favorably to those of some of the more successful designed proteins in the literature, and approach those of native proteins.
The BetaCore system, as introduced in this paper, provides ample possibilities for further optimization, including the length/positioning of the cross-link and packing of side chains. The roles of individual residues and residue pairs from the strands and turns in affecting the formation and stabilities of ß-sheets, and in controlling the balance between intramolecular interactions leading to collapsed monomeric structure and corresponding intermolecular interactions leading to aggregation, can be evaluated systematically. With respect to the latter, it is possible that the current long transverse cross-link helps provide steric barriers to intermolecular aggregation.
Another research direction suggested by the present work involves identification and synthesis of further core modules with known conformational preferences (
-helix as well as ß-hairpins and sheets)derived from proteins for which structural and H/D exchange data are availableand then combining these covalently through use of appropriate cross-links to test the hypothesis that more stable, compact, folded protein structures will result.
| Materials and methods |
|---|
|
|
|---|
Sedimentation equilibrium analysis
Sedimentation equilibrium experiments were performed on a Beckman Optima XL-A analytical ultracentrifuge. Experiments were conducted at 5°C, with protein concentrations ranging from 2.5 µM to 0.05 mM in water at pH 3 or 20 mM NaCl at pH 3, and two rotor speeds (30,000 and 44,000 rpm). Data were analyzed by nonlinear least square techniques using the program Kdalton (Philo 2000). This program fits data to the following general equation, giving the total concentration at radial position r for a nonideal, reversible association of monomer to N-mer:
![]() |
is the reduced molecular mass given by [M(1 - 
)
2]/(RT), where M is the monomer mass, v is the peptide partial specific volume,
is the solvent density,
is the rotor angular velocity, and T is temperature. As special cases, B is 0 for ideal solutions, and KN is 0 when the fit is to a single species. The partial specific volume of 0.7284 at 5°C was calculated with the program Sednterp (Laue et al. 1992) using the following sequence of natural amino acids as a model for BetaCore: C K A K G G I I R Y F Y N A K D G L V Q T F V Y G G C C K A R I I R Y F Y N A K D G K G G V Q T F V Y G G C. The same program calculates
= 0.99999 and 1.00082 g/mL respectively for samples in water and 20 mM NaCl.
NMR spectroscopy
NMR samples for Betacore were 0.4 mM at pH 3. Samples in D2O were dissolved while working in a glove bag under argon. Spectra of BetaCore were obtained in 90:10 H2O/D2O and 99.9% D2O at 5, 10, 15, 25, 35, 45, and 55°C on a Varian 600 MHz or 800 MHz Inova instrument. Spectra were acquired with the following number of complex points and spectral widths: TOCSY (
m = 65 msec) (Griesinger et al. 1988) and NOESY (
m = 200 msec) (Kumar et al. 1980), F1 (1H) 256, 9000 Hz; F2 (1H) 2048, 9000 Hz; 64 transients; 15N HSQC (Kay et al. 1992), F1 (15N) 128, 2200 Hz; F2 (1H) 1024, 9000 Hz; 16 transients; 15N-1H HSQC-TOCSY (
m = 65 msec) (Zhang et al. 1994) and 15N-1H HSQC-NOESY (Zhang et al. 1994) (
m = 200 msec), F1 (1H) 1024, 9000 Hz; F2 (1H) 96, 9000 Hz; F3 (15N) 32, 2200 Hz; 8 transients; HNHA experiments (Kuboniwa et al. 1994) F1 (1H) 796, 7000 Hz; F2 (1H) 64, 7000 Hz; F3 (15N) 32, 2200 Hz; 32 transients. Suppression of intense solvent resonances was achieved by presaturation or use of the WATERGATE sequence (Piotto et al. 1992). Data were processed and analyzed using the programs NMRPipe (Delaglio et al. 1995) and NMRView (Johnson and Blevins 1994). Data points were weighted using either a 54° or 72° shifted square sine bell in each dimension. Two-dimensional datasets were zero-filled to form 4K x 2K real matrices. Baseline corrections were applied in both dimensions. Three-dimensional datasets were zero filled to form 2K x 0.5K x 0.1K real matrices. Hydrogen isotope exchange rates were obtained by measuring peak volumes versus time in a series of two-dimensional 15N HSQC spectra at pH 3 and 5°C. Pseudo-first-order rate constants were determined from nonlinear least-squares fit of an exponential rate equation to experimental data. Thermal unfolding curves were obtained by measuring peak volumes versus temperature in a series of two-dimensional 15N HSQC spectra. A relaxation delay time of 3 sec was used to permit quantitative volume integration. Reversibility of thermal denaturation was verified by comparison of low-temperature spectra acquired before and after unfolding. Exchange cross-peaks between folded and unfolded conformations are absent under a variety of experimental conditions examined, including: TOCSY (
m = 30 msec, 50 msec, 70 msec), NOESY (
m = 150 msec, 200 msec, 400 msec), 15N-1H HMQC-TOCSY (
m = 30 msec, 50 msec, 70 msec), and 15N-1H HMQC-NOESY (
m = 150 msec, 200 msec, 400 msec) at 15, 28 and 35°C.
Structure calculation
NOE cross-peaks obtained from NOESY spectra taken at pH 3 and 15°C with 200 msec mixing time in H2O:D2O (9:1 by vol) and D2O were integrated and converted to distance constraints (strong, medium, and weak, corresponding to upper limits of 2.8, 3.4, and 5 Å. Pseudoatoms were defined for distances involving nonstereospecifically assigned protons, and upper limits were corrected appropriately. Dihedral angle constraints were deduced from 3JHNC
H coupling constants obtained from HNHA experiments. The
angles were constrained to -120 ± 30° for residues with 3JHNC
H > 8 Hz, to -60 ± 30° for turn residues with 3JHNC
H < 5 Hz, to -60 ± 50° for loop residues with 3JHNC
H < 6 Hz, and to 65 ± 30° for GI28 in the turn (since this position in a type I ß-turn favors 65 ±15°
angle and the observed 3JHNC
H = 5.2 and 4.7 were consistent with this value). Hydrogen bonds implied by H/D exchange experiments were also input as constraints.
Structures (total = 200) of BetaCore were calculated by the program X-Plor 3.851 (Brunger 1993) on Silicon Graphics Octane workstations. The files topallhdg.pro and parallhdg.pro were modified to account for the geometries of the cross-link and the unusual amino acids B and X (Fig. 1
), as described in the electronic supplemental material. Parameters for the standard simulated annealing protocol, sa.inp, were 16,000 steps of 1.5 fsec at 2000°K, followed by cooling to 300°K in 10,000 cooling steps of 1.5 fsec. The resulting structures were further refined using the protocol refine. inp a total of three times; this involved each time starting at 2000°K, and cooling to 300°K in 10,000 cooling steps of 1.5 fsec. Converged structures (total = 20) were selected on the basis of no constraint violations greater than 0.5 Å for NOEs and 5° for dihedrals, lowest total energy, and compatibility of Ramachandran plots. Structures were visualized using the program InsightII (Biosym) and analyzed with the program Procheck (Laskowski et al. 1993).
Accession numbers
The coordinates and constraint files for the ensemble of 20 structures (Fig. 5
) have been deposited in the Protein Data Bank (accession code: 1K09). Chemical shifts and coupling constants have been deposited in the BioMagResBank (accession code: 5183)
| Electronic supplemental material |
|---|
|
|
|---|
H region of TOCSY spectra taken at 5°C and 15°C and pH 3 and BetaCore C
HC
H region of NOESY spectrum taken at 15°C and pH 3.
| Acknowledgments |
|---|
The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.
| References |
|---|
|
|
|---|
Bai, Y., Milne, J.S., Mayne, L., and Englander, S.W. 1993. Primary structure effects on peptide group hydrogen exchange. Proteins: Structure, Function, and Genetics 17: 7586.
Barbar, E., Barany, G., and Woodward, C. 1995. Dynamic structure of a highly ordered ß-sheet molten globule: Multiple conformations with a stable core. Biochemistry 34: 1142311434.
Beasley, J.R. and Hecht, M.H. 1997. Protein design: the choice of de novo sequences. J. Biol. Chem. 272: 20312034.
Blanco, F., Ramirez-Alvarado, M., and Serrano, L. 1998. Formation and stability of ß-hairpin structures in polypeptides. Curr. Opin. Struct. Biol. 8: 107111.[CrossRef][Medline]
Brunger, A. 1993. X-Plor version 3.1: A system for X-ray crystallography and NMR, Yale University Press, New Haven, CT.
Carulla, N., Woodward, C., and Barany, G. 2000. Synthesis and characterization of a ß-hairpin peptide that represents a `core module' of Bovine Pancreatic Trypsin Inhibitor (BPTI). Biochemistry 39: 79277937.[CrossRef][Medline]
Carulla, N., Woodward, C., and Barany, G. 2001. Towards new designed proteins derived from Bovine Pancreatic Trypsin Inhibitor (BPTI): Covalent cross-linking of two `core modules' by oxime-forming ligation. Bioconjug. Chem. 12: 726741.[Medline]
Dahidat, B.I. and Mayo, S.L. 1997. De novo protein design: Fully automated sequence selection. Science 278: 8287.
Das, C., Nayak, V., Raghothama, S., and Balaram, P. 2000. Synthetic protein design: Construction of a four-stranded ß-sheet structure and evaluation of its integrity in methanol-water systems. J. Peptide Res. 56: 307317.[CrossRef][Medline]
Das, C., Raghothama, S., and Balaram, P. 1998. A designed three-stranded ß-sheet peptide as a multiple ß-hairpin model. J. Am. Chem. Soc. 120: 58125813.[CrossRef]
De Alba, E.D., Santoro, J., Rico, M., and Jimenez, M.A. 1999. De novo design of a monomeric three-stranded antiparallel ß-sheet. Protein Sci. 8: 854865.[Abstract]
DeGrado, W.F., Summa, C.M., Pavone, V., Nastri, F., and Lombardi, A. 1999. De novo design and structural characterization of proteins and metalloproteins. Annu. Rev. Biochem. 68: 779819.[CrossRef][Medline]
Delaglio, F., Grzesiek, S., Vuister, G.W., Zhu, G., Pfeifer, J., and Bax, A. 1995. NMRPipe: A multidimensional spectral processing system based on UNIX pipes. J. Biomol. NMR 6: 277293.[Medline]
Falzone, C.J., Wright, P.E., and Benkovic, S.J. 1991. Evidence for two interconverting protein isomers in the methotrexate complex of dihydrofolate reductase from Escherichia coli. Biochemistry 30: 21842191.[CrossRef][Medline]
Gallagher, W.H. and Woodward, C.K. 1989. The concentration dependence of the diffusion coefficient for Bovine Pancreatic Trypsin Inhibitor: A dynamic light scattering study of a small protein. Biopolymers 28: 20012024.[CrossRef][Medline]
Gellman, S.H. 1998. Minimal model systems for ß -sheet structure in proteins. Curr. Opin. Chem. Biol. 2: 717725.[CrossRef][Medline]
Griesinger, C., Otting, G., Wüthrich, K., and Ernst, R.R. 1988. Clean TOCSY for 1H spin system identification in macromolecules. J. Am. Chem. Soc. 110: 78707872.[CrossRef]
Hill, R.B., Raleigh, D.P., Lombardi, A., and DeGrado, W.F. 2000. De novo design of helical bundles as models for understanding protein folding and function. Acc. Chem. Res. 33: 745754.[CrossRef][Medline]
Hodges, R.S. 1996. De novo design of
-helical proteins: Basic research to medical applications. Biochem. Cell Biol. 74: 133154.[Medline]
Ilyina, E., Roongta, V., Pan, H., Woodward, C., and Mayo, K.H. 1997. A pulsed-field gradient NMR study of Bovine Pancreatic Trypsin Inhibitor self-association. Biochemistry 36: 33833388.[CrossRef][Medline]
Imperiali, B. and Ottesen, J.J. 1999. Uniquely folded mini-protein motifs. J. Peptide Res. 54: 177184.[Medline]
Jiang, X., Kowalski, J., and Kelly, J.W. 2001. Increasing protein stability using a rational approach combining sequence homology and structural alignment: Stabilizing the WW domain. Protein Sci. 10: 14541465.
Johnson, B.A., and Blevins, R.A. 1994. NMRView: A computer program for the visualization and analysis of NMR data. J. Biomol. NMR 4: 603614.[CrossRef]
Johnson, M.L., Correia, J.J., Yphantis, D.A., and Halvorson, R.R. 1981. Analysis of data from the analytical ultracentrifuge by nonlinear least squares techniques. Biophys. J. 36: 575588.
Kay, L.E., Keifer, P., and Saarinen, T. 1992. Pure absorption gradient enhanced heteronuclear single quantum correlation spectroscopy with improved sensitivity. J. Am. Chem. Soc 114: 1066310665.[CrossRef]
Koepf, E.K., Petrassi, H.M., Sudol, M., and Kelly, J.W. 1999. WW: An isolated three-stranded antiparallel ß-sheet domain that unfolds and refolds reversibly; evidence for a structured hydrophobic cluster in urea and GdnHCl and a disordered thermal unfolded state. Protein Sci. 8: 841853.[Abstract]
Kortemme, T., Ramirez-Alvarado, M., and Serrano, L. 1998. Design of a 20-amino acid, three-stranded ß-sheet protein. Science 281: 253256.
Kuboniwa, H., Grzesiek, S., Delaglio, F., and Bax, A. 1994. Measurement of HN-H
J couplings in calcium-free Calmodulin using new 2D and 3D water-flip-back methods. J. Biomol. NMR 4: 871878.[CrossRef][Medline]
Kumar, A., Ernst, R.R., and Wüthrich, K. 1980. A two-dimensional nuclear Overhauser enhancement (2D NOE) experiment for the elucidation of complete proton-proton cross-relaxation networks in biological macromolecules. Biochem. Biophys. Res. Comm. 95: 16.[CrossRef][Medline]
Laskowski, R.A., MacArthur, M.W., Moss, D.S., and Thornton, J.M. 1993. PROCHECK: A program to check the stereochemical quality of protein structures. J. Appl. Cryst. 26: 283291.
Laue, T.M., Shah, B.D., Ridgeway, B.D., and Pelletier, S.L. 1992. Analytical ultracentrifugation in biochemistry and polymer science, Royal Society of Chemistry, Cambridge.
Li, R. and Woodward, C. 1999. The hydrogen exchange core and protein folding. Protein Sci. 8: 15711590.[Abstract]
Maitra, S. and Nowick, J. S. 2000. ß-Sheet interactions between proteins. In The amide linkage: Structural significance in chemistry, biochemistry, and materials science, (eds. A. Greenberg, C. M. Breneman, and J. F. Liebman), pp. 495518. John Wiley & Sons, New York.