|
|
||||||||
1 Laboratory of Experimental and Computational Biology, National Cancer Institute, Frederick, Maryland 21702, USA
2 CRIBI Biotechnology Centre, University of Padua, 35121 Padua, Italy
3 Sackler Institute of Molecular Medicine, Department of Human Genetics and Molecular Medicine, School of Medicine, Tel Aviv University, Tel Aviv 69978, Israel
Reprint requests to: Ruth Nussinov, NCI-FCRDC, Bldg. 469, Room 151, Frederick, MD 21702, USA; e-mail: ruthn{at}ncifcrf.gov; fax: (301) 846-5598.
(RECEIVED October 3, 2001; FINAL REVISION April 17, 2002; ACCEPTED April 17, 2002)
Article and publication are at http://www.proteinscience.org/cgi/doi/10.1110/ps.4100102.
| Abstract |
|---|
|
|
|---|
-helices. Therefore, the targets for limited proteolysis are locally unfolded regions. In contrast, the computational cutting algorithm considers the compactness of the fragments, their nonpolar buried surface area, and their isolatedness, that is, the surface area which was buried prior to the cutting and becomes exposed subsequently. Despite the different criteria, there is an overall correspondence between sites or regions of limited proteolysis with those identified by computational cutting. The computational cutting method has been applied to several model proteins for which detailed limited proteolysis data are available, namely apomyoglobin, cytochrome c, ribonuclease A,
-lactalbumin, and thermolysin. As expected, more cuts are obtained computationally than experimentally and the agreement is better when a number of proteolytic enzymes are used. For example, cytochrome c is cleaved by thermolysin at 5657, 4546, and at 8081, and by proteinase K at 4849 and 5051. Incubation of the noncovalent and native-like complex of cytochrome c fragments 156 and 57104 with proteinase K yielded the gapped protein species 148/57104 and finally 140/57104. Computational cutting of cytochrome c reproduced the major experimental observations, with cuts at 47, 6465 or 6566 and 8081 and an unstable 3247 region not assigned to any building block. The next step, not addressed in this work, is to probe the ability of the generated fragments to fold independently. Since both the computational algorithm and limited proteolysis attempt to dissect the protein folding problem, the general agreement between the two procedures is gratifying. This consistency allows us to propose the use of limited proteolysis to produce protein fragments that can adopt an independent folding and, therefore, to study folding intermediates. The results of the present study appear to validate the building block folding model and are in line with the proposal that protein folding is a hierarchical process, where parts constituting local minima of energy fold first, with their subsequent association and mutual stabilization to finally yield the global fold. Keywords: Folding intermediates; limited proteolysis; protein anatomy; protein flexibility; protein folding; protein fragments
Abbreviations: ApoMb, apomyoglobin cyt c, cytochrome c RNase, bovine ribonuclease A
-LA,
-lactalbumin MG, molten globule A-state, partly folded state in acid solution PDB, Protein Data Bank TFE, trifluoroethanol CD, circular dichroism NMR, nuclear magnetic resonance
| Introduction |
|---|
|
|
|---|
One approach to simplify the protein folding problem is to produce protein fragments and determine whether they can form independent folding entities (Wetlaufer 1973, 1981; Peng and Kim 1994; Wu et al. 1994). Protein fragments can be produced experimentally or computationally. On the experimental side, the dissection strategy involves limited proteolytic or chemical cleavage of a protein, or alternatively synthesis of long enough peptides (Fontana et al. 1997a, 1999; Peng and Wu 2000). These are coupled with studies of the stabilities and conformational properties of the fragments and their comparisons with the corresponding native protein states. If the fragments constitute protein domains or subdomains with high enough population times, a correspondence between these and the native protein can be observed (Peng and Wu 2000). On the computational side, a reasonable strategy is to iteratively dissect the native protein fold, identifying the conformationally fluctuating building blocks. Through hierarchical assembly of these blocks, the independently folding hydrophobic units are obtained (Crippen 1978; Rose 1979; Wodak and Janin 1981; Zehfus and Rose 1986; Zehfus 1993; Panchenko et al. 1996, 1997; Tsai and Nussinov 1997; Tsai et al. 1998, 1999a, b, 2000).
Here our goal was to carry out an extensive comparison of limited proteolysis data (Fontana et al. 1997a,b; Polverino de Laureto et al. 1995, 1997, 1999, 2001) with the results of the computational cutting based on the building block folding model (Tsai et al. 2000). Overall, our results show that similar regions are cleaved by both approaches. We give an outline of the experimental strategy as well of the computational approach. We provide a case-by-case comparison of the experimental and computational results, and we conclude by speculating on the implications for protein folding schemes (Karplus and Weaver 1994).
| The limited proteolysis approach |
|---|
|
|
|---|
A modeling study by Hubbard (Hubbard et al. 1994; Hubbard 1998) showed that the site of limited proteolysis requires a conformational change of a stretch of up to 12 residues (Schechter and Berger 1967). That it is a region, rather than a specific site, is also illustrated by the finding that, if several proteases are used, one observes that cleavage takes place over a stretch of peptide bonds. Inspection of the cleavage sites on the globular protein substrate reveals that they never occur at the level of
-helices, but largely at loops (Fontana et al. 1993, 1999). Had proteolysis taken place at a helical segment, the helix is likely to have been destroyed by end-effects and loss of the cooperative hydrogen bonds that stabilize it. Furthermore, the newly created charged ends, if buried, might conceivably destabilize the protein core. Hence, limited proteolysis occurs preferentially at those loops which display inherent conformational flexibility, whereas the protein core remains quite rigid and thus resistant to proteolysis (Fontana et al. 1986, 1993, 1997a).
The limited proteolysis approach has also been used to probe the nonnative or molten globule (MG) (Ptitsyn 1995; Arai and Kuwajima 2000) state of proteins when exposed to a variety of solvent conditions (Fontana et al. 1997a). The acid-induced partly folded state (A-state) or the apo-state of
-lactalbumin at neutral pH (Polverino de Laureto et al. 1995, 1999, 2001), as well as the apo form of myoglobin (Fontana et al. 1997b) have been subjected to limited proteolysis, obtaining results consistent with those reached by NMR measurements (Arai and Kuwajima 2000). Moreover, the conformational state of proteins dissolved in aqueous trifluoroethanol (TFE) has been analyzed by using thermolysin as a proteolytic probe. Proteolysis in aqueous TFE yields specific and large protein fragments, suggesting that even under such conditions proteins retain partly rigid structures (Fontana et al. 1997a). Overall, the limited proteolysis data of several proteins in their MG state indicated a correlation between their native and MG states (Polverino de Laureto et al. 1995, 1998, 1999, 2001), in agreement with results obtained by other investigators using other physicochemical techniques, mostly NMR and hydrogen exchange measurements (Arai and Kuwajima 2000). Therefore, it appears that limited proteolysis can be used as a reliable probe of structure and dynamics of both native and partly folded proteins (Mihalyi 1978; Price and Johnson 1990; Fontana et al. 1993, 1999). Studies of the conformations of the protein fragments (Wu et al. 1994; Peng and Wu 2000) and of their complexes (Taniuchi et al. 1986; Spolaore et al. 2001) may yield information on intermediate states of proteins and their folding pathways.
| Computational dissection: The building block folding model |
|---|
|
|
|---|
According to the building block folding model, protein folding is a hierarchical event (Tsai et al. 1998, 1999a,b, 2000; Kumar et al. 1999, 2000; Ma et al. 1999, 2000). In the first step, building blocks form. Even if the native building block conformation is marginally stable and would exist in a relatively low population, the native conformer is more highly populated than all other conformers. Otherwise, it would not constitute a local minima in the building blocks fragment map (Tsai et al. 2000). In the next hierarchical step, the building blocks associate via combinatorial assembly (Tsai et al. 1998, 1999a). The only difference between the binding of building blocks and the binding of larger stable units, such as domains or subunits, or the binding of different molecules in a complex, is the shorter population time of the building block conformer. In this binding event, among the range of conformations present, the ones that bind are the most complementary. Hence, folding is largely a process of selection of building block conformations. With binding of the native conformers, the population would shift in their favor, further driving the folding reaction. Through their binding, they mutually stabilize each other, leading to the formation of stable hydrophobic folding units. In the next step, again through selection, the most complementary hydrophobic folding units within the population bind to form the domains. In each of the binding events, the changing conditions lead to shifts in the populations. Experimentally, complementing fragments provide a system for studying protein folding (Taniuchi et al. 1986; Fisher and Taniuchi 1992; Yang et al. 1998; Spolaore et al. 2001), consistent with the idea that intermolecular binding resembles intramolecular folding events (Tsai et al. 1998, 1999a). Figure 1
gives a flow chart of the cutting procedure, and its legend outlines the major steps.
|
For all of the protein cases described below, the tables list all building blocks minima. Here we provide only the table and the figures relating to the first protein case herewith examined (apomyoglobin, Table 1
, Figure 2ac
). The other tables are given in the Supplemental Material in the journal electronic site. These building blocks minima correspond to the horizontal (blue and red) lines given in the fragment maps. The building blocks relating to the major folding pathway of the protein are drawn in red, and those depicted in blue take part in alternate folding pathways. The x-axis represents the position of the building block and the y-axis its size. In general, which pathway is actually the major one is the outcome of external conditions. Hence, it is conceivable that the pathway we depict as the major one is not the most populated under a different set of physical conditions. Therefore, it is important to inspect all building block fragment minima, rather than only the major cuts.
|
|
Below we compare the experimental and the computational cutting. We stress that limited proteolysis reflects the local unfolding and hence relates to the unfolding of the protein. In contrast, the computational cutting reflects protein folding pathways.
| Results |
|---|
|
|
|---|
The results of computational cutting of native myoglobin (PDB 1wla; Bernstein et al. 1977) are presented in Figure 2
and in Table 1
. Figure 2A
depicts the fragment map, Figure 2B
the anatomy tree and Figure 2C
shows the cuttings pictorially on the protein 3D fold, illustrating the hierarchical cutting stages (top row) and the combinatorially assembled hydrophobic folding units (bottom row). There are two cuts at the second step, around positions 2224 and 6972. The resulting 72151 fragment is further split into fragments 7197, 91110, and 108136. This is the major folding pathway predicted by the algorithm. Comparison of these results of computational cutting with the limited proteolysis data and the NMR structure (Eliezer and Wright 1996; Eliezer et al. 1998; Cavagnero et al. 2001) shows a nice agreement. The 91110 building block is very unstable (a score of -4.20). The 7197 (or 7192) fragment also shows a marked instability score (-3.54 or -3.65, respectively). The cuts at around 91 and 108 are observed a number of times in the listing of building blocks minima in the table (for example, building block No. 6 in the table spans residues 282; No. 8 spans residues 86153; No. 10 stretches between 299; No. 11 starts at residue 101; No. 14 initiates at residue 92). Remembering the allowed seven-residue overlap in the cutting, these results indicate that the region starting at residue 84 is repeatedly cut by the algorithm. Furthermore, while we do not see a cut near residue 31, we observe cuts around residue 24 (fragment 224), thus within the seven-residue overlap allowed by the computational procedure (see above). Considering that proteases do not cleave in the middle of helices and that a conformational destabilization of 1012 residues appears to be required for proteolysis (Hubbard et al. 1994), we conclude that there is a consistency between experiments and computations. Indeed, comparison of the computationally derived fragments with those generated by proteolysis (Fontana et al. 1997b) yields the following picture. The major 188 and minor 132 thermolysin fragments are quite similar to fragment 282 (with a score of 0.88, see Table 1
) and fragment 224 (with a significantly lower score, -4.38). The 90153, 92153, and 94153 subtilisin fragments are quite similar to fragment 86153 (with a stability of 0.02). The 196, 97153, and 131 (minor) trypsin fragments are rather similar to 299 (with a stability of -0.93), 86153 (0.02), and 224 (-4.38), respectively.
Cytochrome c
Cytochrome c (cyt c) has been subjected to limited proteolysis by thermolysin in 50% aqueous (v/v) TFE at neutral pH (Fontana et al. 1995). A major cut has been observed at peptide bond 5657, generating the two fragments 156 and 57104. Additional but minor cleavages have been observed at peptide bonds 4546 and 8081. Wang et al. (1998) probed the heat-induced unfolding of cyt c using proteinase K as proteolytic probe, and they observed initial cuts at peptide bonds 4849 and 5051. Spolaore et al. (2001) used limited proteolysis by proteinase K at neutral pH on the noncovalent and native-like complex of fragments 156 and 57104. In the complex, the heme was covalently bound to fragment 156. Proteolysis of nicked cyt c yields a gapped protein complex given by fragments 148 and 57104. Further digestion leads to fragments 140 and 57104 in a still folded complex.
The results of the computational cutting of cyt c (PDB 1giw) are given in Figure 3
and Supplemental Material Table 1
. The table lists all building block fragments. There is a cut at position 47, with the region between residues 32 and 47 being an unassigned segment, implying that its stability is very low. This region includes the experimental minor 4546 cut (Fontana et al. 1995) and the chain segment digested by proteinase K in the complex 156/57104 (Spolaore et al. 2001). A cut is observed at residue 64 or 66. An additional cut at residue 85 is close (and within the seven-residue overlap) of the minor experimental cut at 8081. The 64 or 66 cut of the algorithm is at a distance of eight residues from the 5657 experimental cut. Hence, while our major cutting at residue 64 is eight residues removed from the major experimental 56 site, considering that the cutting is carried out in TFE and that a single protease is used rather than a battery of proteases, as well as the 12-residue region suggested to be distorted for proteolysis to occur (Hubbard et al. 1994), the results obtained by the experimental and computational cutting appear to be consistent. Proteolytic fragments include 156, 57104, 145, 5780, and 81104 (Fontana et al. 1995). In Table 1
of the Supplemental Material we see 1064 (with a stability of -1.16), 5595 (1.04), 1048 (very unstable, -3.58), 4780 (0.12), and 81104 (-3.61). The major computational cut observed at residue 47, with a low stability region preceding it, is in agreement with the Wang et al. (1998) cleavages at peptide bonds 4849 and 5051. Consistent with the results of Spolaore et al. (2001), as Figure 3
and Table 1
in the Supplemental Material show, cyt c consists of a single hydrophobic folding unit containing the entire sequence, with the 3247 region being unstable and hence not assigned to any building block.
|
-helix and reduced tertiary structure (Gast et al. 1999). Thermolysin cleaves the 124-residue chain of RNase, both in water upon moderate heating and in aqueous TFE, at peptide bond 3435, with a slower cleavage at 4546. In the absence of TFE, native RNase is resistant to cleavage by thermolysin. It was proposed that both TFE and heat induce a relaxed state of RNase, with a highly flexible 3046 segment, a favored substrate for proteolysis (Polverino de Laureto et al. 1997).
Figure 4
and Supplemental Material Table 2 present the results of the computational cutting of RNase (PDB 1a5p). No cut is observed near peptide bond 3435. This position falls in the middle of a stable building block and, if cleaved there, it would render its component parts unstable. No major cut is observed at 4546, whereas the major computational cut yields the 855 and 5777 building blocks. The data of Table 2 also show a number of cuts around positions 43, 46, and 49. A possible reason for the disagreement with the major thermolytic 3435 cut is that heat or TFE promote some conformational transition in this region of RNase. A cut is also observed at position 20 (building block No. 6 in Table 2, also seen in the Fig. 4
fragment map), in agreement with the limited proteolysis of RNase by subtilisin at neutral pH and ambient temperature (Richards 1958).
|
-Lactalbumin
-lactalbumin (
-LA), a 123-residue protein, have been probed using pepsin, chymotrypsin, and proteinase K (Polverino de Laureto et al. 1995, 1999, 2001). The conformational features of the A-state (acid-state) of
-LA were analyzed using pepsin as proteolytic probe at pH 2.0, while those of the TFE-state of the protein (Alexandrescu et al. 1994) were probed by thermolysin digestion (Polverino de Laureto et al. 1995). Chymotrypsin and proteinase K have been used at neutral pH to probe the apo-form of the protein obtained by EDTA-mediated removal of the calcium ion bound to the protein. Both states are considered to constitute the MG state of
-LA (Kuwajima 1996; Arai and Kuwajima 2000), a dynamic conformational state retaining most of the helices of the native structure, while the ß-sheet region is largely unstructured (Alexandrescu et al. 1993; Schulman et al. 1995, 1997; Wu et al. 1995). A time-course analysis of the limited proteolytic cleavages revealed that the fast, initial cuts by all three proteases occur at the same 3457 region, with the actual sites varying slightly with the different proteases (Polverino de Laureto et al. 1995, 1999, 2001). In the native structure, the 3457 region encompasses the ß-sheets of the protein. Subsequent cleavages took place at chain regions 3135 and 95105. Several of
-LA fragments have been isolated and studied. The single chain fragment 53103, containing the calcium binding sites and crosslinked by two disulfide bridges, in the presence of calcium ions appears to possess a native-like content of
-helix. For the two-chain species 140/104123 and 131/105123, where the two constituting fragments are connected by two disulfide bridges, retain some secondary structure. The gapped protein species 134/54,57123, given by fragment 134, connected to fragment 54123 or to 57123 by four disulfide bridges, has an
-helix content similar to that of the native protein (Polverino de Laureto et al. 2001). Moreover, it has been shown that MG excision of the ß-domain (chain region 3457) from the
-LA does not impair the formation of the MG state of the rest of the protein in acid solution.
-LA is a particularly good example for a comparison between experiment and computation, owing to the fact that several proteases have been employed in limited proteolysis experiments. Figure 5
and Supplemental Material Table 3 provide the results of the computational cutting of
-LA (PDB 1hfzA; see also Tsai et al. 2000). All obtained building block fragments are given in Table 3. In the first step of the cutting, the 1123 chain remains practically intact. Only two residues are removed from its amino tail. In the second step, cuts are performed at 3839 and 105106 peptide bonds. This procedure yields fragments 338, 39105, and 106123. Of note, the last fragment is highly unstable, with a stability score of -9.10. The first and the last fragments associate into a single hydrophobic folding unit (marked as B in Fig. 5B
). The central 39105 fragment is further split into fragments 3955, 5681, and 87108, and these constitute the hydrophobic folding unit A.
|
-LA, with or without Cys73 and Cys91 replaced by Ala, are monomeric and unstructured in solution. Consistently, we observe a score of -2.95 for the 87108 fragment and a score of -2.38 for the 70105 fragment, as given in Figure 5B
Thermolysin
Limited proteolysis of thermolysin has been carried out both using subtilisin and autolysis under different experimental conditions (upon heating or in the presence of 1 mM or 10 mM EDTA) (Fontana et al. 1986). Subtilisin was observed to cleave thermolysin to yield fragments 5224(225) and 225(226)316, which remain associated in a stable and functional complex at neutral pH (Vita et al. 1985). Thermal autolysis yielded fragments 1221, 1154(155), 155(156)221, and 224316, whereas autolysis in the presence of low EDTA concentration yielded fragments 1129, 130187, and 205316 and in the presence of a higher EDTA concentration 1196, 197204, and 205316 (Fassina et al. 1986). Dalzoppo et al. (1985) cut the C-terminal fragment 206316 of thermolysin with several proteolytic enzymes. Analysis of the kinetics of the proteolytic digestions and of the isolated subfragments provided evidence that the proteases degrade fragment 206316 in a stepwise manner proceeding from the amino terminus. The highly helical fragment 255316 was found to be rather resistant to further proteolysis.
The computational cutting yields the two equal-sized domains 3148 and 153316 (PDB 2tlx) (Fig. 6
, Supplemental Material Table 4), in agreement with fragments 1154(155) and 155(156)316 obtained by thermal autolysis of the protein. Although we do not observe a computational cut near residue 225 but do so at 233, this position is within the 1012 residue region considered to be distorted in order to achieve proteolysis at that region (Hubbard et al. 1994). Figure 6
shows a major cut to yield 3148 (with a high stability score, 6.68), in agreement with the identification of thermal autolysis 1154 fragment and 233316 (vs. 224316), also with a high stability score (3.79). We observe the computational fragments 203316, consistent with the EDTA autolysis (highly stable, with a score of 4.42, Table 4 in Supplemental Material), 213316 (also very stable, with a score of 4.05), and the largely helical 265310 building block (No. 39 in Table 4). A number of fragments initiate from around residue 205, from which the stepwise proteolysis initiates (Dalzoppo et al. 1985) (e.g., building blocks No. 11, 25, 28, or 69).
|
| Discussion |
|---|
|
|
|---|
In limited proteolysis, the site of cleavage should be on the protein surface, needs to be flexible, and cannot be in the middle of
-helices (Fontana et al. 1986). In contrast, in the computational cutting cleavages can be performed in the interior of protein cores. Indeed, some building blocks are buried, mediate interactions between building blocks, and play a critical role in reaching the correct three-dimensional fold of the protein (Ma et al. 2000; Kumar et al. 2001). Furthermore, the cleavages can take place in the middle of
-helices, if a cut at this site leads to a more compact and favorable building block (Tsai et al. 2000). The agreement between the two methods is clearly better when proteolysis data are obtained by the use of several proteolytic enzymes, rather than by a single one, since in this case a chain region rather than a single peptide bond is identified. Indeed, there is a correspondence between experiment and computation in the cases of apoMb and
-LA, where several proteases have been used. Further, the correspondence becomes closer for cyt c if the proteinase K cuts (Wang et al. 1998; Spolaore et al. 2001) are added to the thermolysin cuts (Fontana et al. 1995).
The computational approach suffers from limitations. First, a drawback of the stability function which is used is that it lacks an electrostatic component (Tsai et al. 2000). It is possible that had electrostatics been taken into account, a greater correspondence would have been observed. The second limitation is the fact that the computational algorithm is based solely on the native conformation (Tsai et al. 2000). Hence, if some nonnative interactions play a role at the site of the experimental cutting, such as for example in aqueous TFE or under other solvent conditions favoring partly folded states, it may lead to an inconsistency with the computational algorithm. Indeed, the fact that several proteins in their native state are not attacked by protease indicates that the population of species fitting the active site of the proteolytic enzyme is low. Additionally, the computational cutting yields more protein fragments than the experimental limited proteolysis.
Limited proteolysis is at sites of local unfolding, whereas the computational algorithm relates to folding. The general fair agreement between the two approaches is consistent with the hierarchical model of protein folding, which postulates that the polypeptide chain folds by parts. Several models have been proposed to describe protein folding, including (1) the framework model, (2) the nucleation and growth mechanism, (3) the diffusion-collision model, (4) the hydrophobic collapse, and (5) the hierarchical model. In the framework model (Kim and Baldwin 1982, 1990; Udgaonkar and Baldwin 1988), secondary structure formation is independent of formation of tertiary interactions and frequently occurs earlier. In the nucleation and growth (Wetlaufer 1973) or nucleation condensation mechanism (Shakhnovich et al. 1996; Fersht 1997), folding initiates by formation of a nucleus, followed by its extension. In the diffusion-collision model, secondary structure elements assemble into folds by a random diffusion and collision process and, if the assembly is favorable, they may lock into the native conformation (Karplus and Weaver 1994). In contrast to these models, the hydrophobic collapse model highlights the hydrophobic effect, the driving force of protein folding (Rackovsky and Scheraga 1977; Dill 1985, 1990). In this case, folding initiates with collapse of the molecule, consequent burial of extensive nonpolar surface area and, therefore, secondary structure formation and specific interactions follow. In the hierarchical model, protein folding initiates locally and the local folded elements assemble in a stepwise fashion to yield the final native protein fold (Baldwin and Rose 1999a, b).
The models of protein folding listed above are not necessarily exclusive of each other. The hierarchical model may include elements of hydrophobic collapse in the assembly of local folded elements. Such a hydrophobic assembly would constitute the first stage of the parts coming together, followed by the optimization of the specific (van der Waals, electrostatic, disulfide bonds, etc.) interactions. The hierarchical model may further include elements of the nucleation and growth or nucleation-condensation model. The nucleus can be a part of the polypeptide chain whose folded structure forms local minima. Such an element may subsequently act as a template for further folding of the protein. Similarly, with respect to the framework model, the formation of single secondary structure elements can be substituted by local building blocks. Thus, depending on the interpretation of these models, each may be viewed as a specific case of the more general hierarchical model (Baldwin and Rose 1999a,b).
| Conclusions |
|---|
|
|
|---|
-helices. Therefore, limited proteolysis occurs at regions which are locally unfolded (Fontana et al. 1986). In contrast, the building block folding model is based on protein folding, and the criteria on which the scoring function is based are compactness of the fragment, extent of the nonpolar surface area it buries, and its isolatedness, that is, the surface area which was buried prior to cutting and became exposed subsequently (Tsai et al. 2000). The overall agreement between the two methods allows us to propose that proteolytic enzymes can be used as reliable probes of protein structure, dynamics, and folding pathways. Furthermore, the fact that distinct fragments can be produced and their conformations analyzed leads us to suggest that the limited proteolysis approach can be used in studies of folding intermediates, complementing other methods in use for analyzing the transient intermediates along the folding reaction path (Chamberlain and Marqusee 2000). On the computational side, the consistency with the experimental cleavages appears to provide a validation of the building block folding model. Considering that it is increasingly becoming accepted that protein folding may initiate by folding of local fragments and proceed by their associations, fragments can be identified by computating and produced by proteolysis for further studies. This suggests a procedure of computational predictive folding; that is, initial folding of protein fragments rather than of the entire protein and subsequent fragment assembly. This approach may well simplify the prediction of the three-dimensional structure of proteins.
| Acknowledgments |
|---|
The content of this publication does not necessarily reflect the view or policies of the Department of Health and Human Services, nor does mention of trade names, commercial products, or organization imply endorsement by the U.S. government.
The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.
| References |
|---|
|
|
|---|
-lactalbumin: A two state dimensional NMR study. Biochemistry 32: 17071718.[CrossRef][Medline]
Alexandrescu, A.T., Ng, Y.-L., and Dobson, C.M. 1994. Characterization of a TFE-induced partially folded state of
-lactalbumin. J. Mol. Biol. 235: 587599.[CrossRef][Medline]
Arai, M. and Kuwajima, K. 2000. Role of the molten globule state in protein folding. Adv. Protein Chem. 53: 209282.[Medline]
Arnold, U., Rücknagel, K.P., Schierhorn A., and Ulbrich-Hofman, R. 1996. Thermal unfolding and proteolytic susceptibility of ribonuclease. Eur. J. Biochem. 237: 862869.[Medline]
Baldwin, R.L. and Rose, G.D. 1999a. Is protein folding hierarchic? I. Local structure and peptide folding. Trends Biochem. Sci. 24: 2633.[CrossRef][Medline]
Baldwin, R.L. and Rose, G.D. 1999b. Is protein folding hierarchic? II. Folding intermediates and transition states. Trends Biochem. Sci. 24: 7784.[CrossRef][Medline]
Bernstein, F.C., Koetzle, T.F., Williams, G.J.B., Meyer, E.F. Jr., Brice, M.D., Rodgers, J.R., Kennard, O., Shimanouchi, T., and Tasumi, M. 1977. The protein databank: A computer-based archival file for macromolecular structures. J. Mol. Biol. 112: 535542.[Medline]
Cavagnero, S., Nishimura, C., Schwarzinger, S., Dyson, H.J., and Wright, P.E. 2001. Conformational and dynamic characterization of the molten globule state of an apomyoglobin mutant with an altered folding pathway. Biochemistry 40: 1445914467.[CrossRef][Medline]
Chamberlain, A.K. and Marqusee, S. 2000. Comparison of equilibrium and kinetic approaches for determining protein folding mechanisms. Adv. Protein Chem. 53: 283328.[Medline]
Crippen, G.M. 1978. The tree structural organization of proteins. J. Mol. Biol. 126: 315332.[CrossRef][Medline]
Dalzoppo, D., Vita, C., and Fontana, A. 1985. Folding of thermolysin fragments: Identification of the minimum size of a carboxyl-terminal fragment that can fold into a stable native-like structure. J. Mol. Biol. 182: 331340.[CrossRef][Medline]
Dill, K.A. 1985. Theory for the folding and stability of globular proteins. Biochemistry 24: 15011509.[CrossRef][Medline]
Dill, K.A. 1990. Dominant forces in protein folding. Biochemistry 31: 71347155.[CrossRef]
Dill, K.A. and Chan, H.S. 1997. From Levinthal to pathways to funnels. Nat. Struct. Biol. 4: 1019.[CrossRef][Medline]
Eaton, W.A., Munioz, V., Thompson, P.A., Chan, C.K., and Hofrichter, J. 1997. Submillisecond kinetics of protein folding. Curr. Opin. Struct. Biol. 7: 1014.[CrossRef][Medline]
Eliezer, D. and Wright, P.E. 1996. Is apomyoglobin a molten globule? Structural characterization by NMR. J. Mol. Biol. 263: 531538.[CrossRef][Medline]
Eliezer, D., Yao, J., Dyson, H.J., and Wright, P.E. 1998. Structural and dynamic characterization of partially folded states of apomyoglobin and implications for protein folding. Nat. Struct. Biol. 5: 148155.[CrossRef][Medline]
Evans, P.A. and Radford, S.E. 1994. Probing the structure of folding intermediates. Curr. Opin. Struct. Biol. 4: 100106.
Fassina, G., Vita, C., Dalzoppo, D., Zamai, M., Zambonin, M., and Fontana, A. 1986. Autolysis of thermolysin: Isolation and characterization of a folded three-fragment complex. Eur. J. Biochem. 156: 221228.[Medline]
Fersht, A.R. 1997. Nucleation mechanism in protein folding. Curr. Opin. Struct. Biol. 7: 39.[CrossRef][Medline]
Fisher, A. and Taniuchi, H. 1992. A study of core domains and the core domain-domain interactions of cytochrome c fragment complex. Arch. Biochem. Biophys. 96: 116.[CrossRef]
Fontana, A., Fassima, G., Vita, C., Dalzoppo, D., Zamai, M., and Zambonin, M. 1986. Correlation between sites of limited proteolysis and segmental mobility in thermolysin. Biochemistry 25: 18471851.[CrossRef][Medline]
Fontana, A., Polverino de Laureto, P., and De Filippis, V. 1993. Molecular aspects of proteolysis of globular proteins. In: Protein stability and stabilization (eds. W. Van der Tweel, A. Harder, and M. Buitelaar), pp. 101110. Elsevier Sci. Publ., Amsterdam.
Fontana, A., Polverino de Laureto, P., De Filippis, V., Scaramella, E., and Zambonin, M. 1997a. Probing the partly folded states of proteins by limited proteolysis. Folding Des. 2: R17R26.[CrossRef][Medline]
Fontana, A., Polverino de Laureto, P., De Filippis, V., Scaramella, E., and Zambonin, M. 1999. Limited proteolysis in the study of protein conformation. In: Proteolytic enzymes: Tools and targets (eds. E.E. Sterchi, W. Stocker), pp. 257284. Springer-Verlag, Heidelberg.
Fontana, A., Zambonin, M., De Filippis, V., Bosco, M., and Polverino de Laureto, P. 1995. Limited proteolysis of cytochrome c in trifluoroethanol. FEBS Lett. 362: 266270.[CrossRef][Medline]
Fontana, A., Zambonin, M., Polverino de Laureto, P., De Filippis, V., Clementi, A., and Scaramella, E. 1997b. Probing the conformational state of apomyoglobin by limited proteolysis. J. Mol. Biol. 266: 223230.[CrossRef][Medline]
Gast, K., Zirwer, D., Müller-Frohne, M., and Damaschun, G. 1999. Trifluoroethanol-induced conformational transitions of proteins: Insights gained from the differences between
-lactalbumin and ribonuclease A. Protein Sci. 8: 625634.[Abstract]
Hirst, J.D. and Brooks, C.L. 1994. Helicity, circular dichroism and molecular dynamics of proteins. J. Mol. Biol. 243: 173178.[CrossRef][Medline]
Hubbard, S.J. 1998. The structural aspects of limited proteolysis of native proteins. Biochim. Biophys. Acta 1382: 191206.[CrossRef][Medline]
Hubbard, S.J., Eisenmenger, F., and Thornton, J.M. 1994. Modelling studies of the change in conformation required for cleavage of limited proteolytic sites. Protein Sci. 3: 757768.[Abstract]
Karplus, M. and Weaver, D.L. 1994. Protein folding dynamics: The diffusion-collision model and experimental data. Protein Sci. 3: 650668.[Abstract]
Kim, P.S. and Baldwin, R.L. 1982. Specific intermediates in the folding reactions of small proteins and the mechanism of protein folding. Annu. Rev. Biochem. 51: 459489.[CrossRef][Medline]
Kim, P.S. and Baldwin, R.L. 1990. Intermediates in the folding reactions of small proteins. Annu. Rev. Biochem. 59: 631660.[CrossRef][Medline]
Kuhlman, B., Boice, J.A., Wu, W.J., Fairman, R., and Raleigh, D.P. 1997. Calcium binding peptides from
-lactalbumin: Implications for protein folding and stability. Biochemistry 36: 46074615[CrossRef][Medline]
Kumar, S., Ma, B., Tsai, C.J., Sinha, N., and Nussinov, R. 2000. Folding and binding cascades: Dynamic landscapes and population shifts. Protein Sci. 9: 1019.[Abstract]
Kumar, S., Ma, B., Tsai, C.J., Wolfson, H., and Nussinov, R. 1999. Folding funnels and conformational transitions via hinge-bending motions. Cell. Biochem. Biophys. 31: 2346.
Kumar, S., Sham, Y.Y., Tsai, C.J., and Nussinov, R. 2001. Folding and function: The N-terminal building block in adenylate kinase. Biophys. J. 80: 24392454.
Kuwajima, K. 1996. The molten globule state of
-lactalbumin. FASEB J. 10: 7478.
Lecomte, J.T.J., Kao, Y.H., and Cocco, M.J. 1996. The native state of apomyoglobin described by proton NMR spectroscopy: The A-B-G-H interface of wild-type sperm whale apomyoglobin. Proteins: Struct. Funct. Genet. 25: 267285.[CrossRef][Medline]
Lin, L., Pinker, R.J., Forde, K., Rose, G.D., and Kallenbach, N.R. 1994. Molten globular characteristics of the native state of apomyoglobin. Nat. Struct. Biol. 1: 447451.[CrossRef][Medline]
Ma, B., Kumar, S., Tsai, C.-J., and Nussinov, R. 1999. Folding funnels and binding mechanisms. Protein Eng. 12: 713720.
Ma, B., Tsai, C.-J., and Nussinov, R. 2000. Binding and folding: In search of intramolecular chaperone-like building block fragments. Protein Eng. 13: 617627.
Matthews, C.R. 1995. Pathways of protein folding. Annu. Rev. Biochem. 62: 3642.
Mihalyi, E. 1978. Application of proteolytic enzymes to protein structure studies. CRC Press, Boca Raton, Florida.
Neurath, H. 1980. Limited proteolysis, protein folding and physiological regulation. In: Protein folding (ed. R. Jaenicke), pp. 501504. Elsevier/North Holland Biomedical Press, Amsterdam/New York.
Panchenko, A.R., Luthey-Schulten, Z., and Wolynes, P.G. 1996. Foldons, protein structural modules and exons. Proc. Natl. Acad. Sci. 93: 20082013.
Panchenko, A.R., Luthey-Schulten, Z., Cole, R., and Wolynes, P.G. 1997. The foldon universe: A survey of structural similarity and self-recognition of independently folding units. J. Mol. Biol. 272: 95105.[CrossRef][Medline]
Pande, V.S., Grosberg, A.Y., Tanaka, T., and Rokhsar, D.S. 1998. Pathways for protein folding: Is a new view needed? Curr. Opin. Struct. Biol. 8: 6679.
Peng, Z.-Y. and Kim, P.S. 1994. A protein dissection study of a molten globule. Biochemistry 33: 21362141.[CrossRef][Medline]