|
|
||||||||
Department of Chemistry, Princeton University, Princeton, New Jersey 08544, USA
Reprint requests to: Michael Hecht, Department of Chemistry, Princeton University, Princeton, NJ 08544, USA; e-mail: hecht{at}princeton.edu; fax: (609) 258-6746.
| Abstract |
|---|
|
|
|---|
-helical and
-sheet structures. The recently determined solution structure of a binary patterned four-helix bundle is well ordered, thereby demonstrating that sequences that have neither been selected by evolution (in vivo or in vitro) nor designed by computer can form nativelike proteins. Examples are presented demonstrating how binary patterned libraries have successfully produced well-ordered structures, cofactor binding, catalytic activity, self-assembled monolayers, amyloid-like nanofibrils, and protein-based biomaterials. Keywords: artificial proteins; binary patterning; combinatorial libraries; de novo protein design
1 Present address: Department of Biology, Brookhaven National Laboratory, Upton, NY 11973, USA. ![]()
Article and publication are at http://www.proteinscience.org/cgi/doi/10.1110/ps.04690804.
| Introduction |
|---|
|
|
|---|
Combinatorial methods are far more appealing. However, combinatorial libraries composed of random sequences will rarely yield proteins with desired properties. Therefore, to enhance the likelihood of success, combinatorial collections must be focused into those regions of sequence space most likely to produce well-folded proteins. The numerical power of the combinatorial approach must be tempered with elements of rational design (Beasley and Hecht 1997; Moffet and Hecht 2001).
| Designed combinatorial libraries of de novo proteins |
|---|
|
|
|---|
-helices and
-sheets) and (2) expose polar side chains to solvent while burying nonpolar side chains in the protein interior. Our strategy for protein design draws on these two features to rationally constrain the diversity of libraries of de novo sequences in ways that favor the formation of folded structures.
Sequences capable of forming abundant secondary structure while simultaneously exposing polar side chains and burying nonpolar side chains can be designed by constraining the periodicity of polar and nonpolar residues to match the structural periodicity of the desired secondary structure (Xiong et al. 1995). Thus, for an
-helical design, the sequence periodicity of polar and nonpolar residues must approximate the structural repeat of 3.6 residue/turn. For example, a sequence of polar (
) and nonpolar () residues with the pattern



has a nonpolar amino acid every three or four positions, consistent with the structural repeat of
-helices. Conversely, for a designed
-strand, polar and nonpolar residues would alternate every other residue. Thus, a designed sequence with the pattern
has a sequence periodicity of 2, which matches the structural repeat found in
-strands, with successive side chains pointing up-down-up-down and so forth (Fig. 1
).
|
Our initial test of the binary code strategy focused on the design of a library of four-helix bundles (Kamtekar et al. 1993). Each member of the library had a unique amino acid sequence, yet all sequences shared the identical pattern of polar and nonpolar residues (Fig. 2
). Combinatorial diversity was made possible by the organization of the genetic code (Fig. 3
). Six polar residues (Lys, His, Glu, Gln, Asp, Asn) were encoded by the degenerate DNA codon VAN, and five nonpolar residues (Met, Leu, Ile, Val, Phe) were encoded by the degenerate codon NTN (V = A, G, or C; N = A, G, C, or T).
|
|
-helical design indeed formed proteins that were soluble, stable, and had the expected
-helical secondary structure (Kamtekar et al. 1993). | Novel proteins and nativelike structures |
|---|
|
|
|---|
Well-folded structures versus molten globules
We developed several methods to rapidly screen libraries for sequences that successfully recapitulate the features of well-folded native proteins. These screens were based either on peak dispersion in NMR spectra (Roy et al. 1997a), or protection of hydrogen exchange measured by mass spectrometry (Rosenbaum et al. 1999). Our searches for well-folded proteins in the original 74-residue
-helical library uncovered several proteins that displayed some nativelike characteristics (Roy et al. 1997a,b; Rosenbaum et al. 1999; Roy and Hecht 2000). However, most proteins in the initial collection formed fluctuating structures, reminiscent of molten globule intermediates.
Why did most sequences from the original collection fail to form nativelike structures? One might postulate that fluctuating "molten" structures are exactly what should be expected from a combinatorial strategy that precludes explicit design of specific sequences with predetermined side-chain interactions. However, the alternative resultnativelike structureswould be predicted by numerous studies demonstrating that well-folded proteins can be specified by many different amino acid sequences (Dill 1985; Chothia and Lesk 1987; Bowie et al. 1990; Matthews 1993; Bromberg and Dill 1994; Axe et al. 1996; Gassner et al. 1996; Munson et al. 1996; Riddle et al. 1997). Comparisons of evolutionarily related sequences, theoretical studies using simplified models, and extensive mutagenesis experiments have all led to the realization that protein structures are robust, and explicit design of "jigsaw-puzzle" packing may not be necessary. These considerations led us to question whether the tendency of the original binary code proteins to form fluctuating structures might not be a failure of the binary code strategy per se, but rather a shortcoming of the designed structural scaffold used in its initial implementation.
In particular, we questioned whether the
-helices specified by our original scaffold might simply be too short. Previous workers had shown that for rationally designed
-helical bundles longer helices can enhance stability, and in some cases, favor more nativelike structures (Fairman et al. 1995; Betz and DeGrado 1996). We reasoned that in the context of the binary code strategy, which cannot specify side-chain packing a priori, it might be especially important to use a scaffold that encodes longer
-helices, and hence larger interhelical interfaces.
To test this reasoning, we designed a new structural scaffold to encode longer four-helix bundles (Fig. 4
), and constructed a second-generation library of 102-residue sequences (Wei et al. 2003b). This new library was not constructed from scratch. Instead, to stringently test whether the redesigned features are sufficient to convert a fluctional protein into a well-ordered structure, we chose a molten globule-like protein (sequence #86) from the original 74-residue library (Kamtekar et al. 1993) as the starting point for designing the second-generation elongated library.
|
|
-helical proteins were unsuccessful. This was not surprising given that structural flexibility is known both to disfavor crystal growth and to yield poorly resolved NMR spectra. In contrast with these first-generation sequences, proteins from the second-generation library yield high quality NMR spectra indicative of well-folded (nonfluctuating) structures (Wei et al. 2003b). Therefore the three-dimensional structures of these new proteins are accessible by NMR spectroscopy.
Recently, we solved the solution structure of a protein (S-824) from the second-generation library (Wei et al. 2003a). As shown in Figure 6
, the structure is indeed a four-helix bundle. In accordance with the binary code design, the polar and nonpolar residues segregate on the surface and the interior, respectively (Fig. 6B,C
). Moreover, the protein is not a molten globule: As shown in Figure 7
, the interior side chains are well orderedeven by the standards of natural proteins.
|
|
|
-helical and four of the five appear well ordered (Wei et al. 2003b). As shown in Figures 6| Libraries of functional de novo proteins |
|---|
|
|
|---|
In our efforts to devise functionally active proteins we have also used both approaches: We have developed collections of functional proteins both by incorporating cofactors and by pursuing the purist approach.
Functional heme proteins
Cofactors can be thought of as portable activity modules, which are used for different purposes by different proteins. Many cofactors, such as the porphyrin in heme, are preorganized structures. Moreover, the cofactors themselves often possess some level of activity even in the absence of protein. To make functional proteins, nature incorporates these portable activity modules into otherwise inactive proteins.
We have followed natures lead by using cofactor binding as a step towards the isolation of functional de novo proteins. We screened our binary patterned collections of
-helical proteins for heme binding. In an initial collection of 30
-helical sequences from the first-generation library (those with 74-residue sequences), we found that 15 bound heme (Rojas et al. 1997). More recent experiments showed that proteins from the second-generation library (102-residue sequences) also bind heme. The novel heme proteins are bright red (Fig. 9
) similar to hemoglobin or cytochrome c. The absorption and resonance Raman spectra of the de novo heme proteins (Rojas et al. 1997) resemble those of natural cytochromes. Because the design of the novel sequences was based solely on global features of polar/nonpolar patterning, our finding that approximately half of them bind heme demonstrates that isolating de novo heme proteins does not require explicit design of a cofactor-binding site. Apparently, heme binding is more permissive and more easily achieved than previously suspected.
|
In nature, heme proteins perform a number of functions. Among these are (1) catalysis of redox reactions, (2) binding and transport of small molecules (e.g., oxygen or carbon monoxide), and (3) electron transfer. We have probed our library of de novo heme proteins for all three of these functions (Moffet et al. 2000, 2001, 2003).
The capacity of the binary code proteins for catalytic activity was established by demonstrating that several of the de novo heme proteins are active as peroxidases, capable of catalyzing the two-electron reduction of hydrogen peroxide to water (Moffet et al. 2000). The most active protein in the collection had a turnover number of 17,000 min1significantly better that that of microperoxidase (1260 min1; Adams 1990), and only fourfold lower than the natural enzyme, horseradish peroxidase (60,000 min1; Hiner at al. 1996). Like natural enzymes, our de novo proteins are inactivated by high concentrations of peroxide. Hence the measured turnover numbers represent a lower limit of the catalytic potential of these proteins.
To assess the abilities of the de novo heme proteins to bind diatomic ligands, we measured affinity for CO, kinetics of CO binding and release, and resonance Raman spectra of the CO complexes for several heme proteins derived from our combinatorial libraries (Moffet et al. 2001). The CO binding affinities for all of the proteins were similar to that of myoglobin, with dissociation constants in the low nanomolar range. Overall, the CO binding properties of the de novo heme proteins span a narrow range of values that falls near the center of the range observed for natural heme proteins.
All the binary patterned heme proteins characterized thus far bound heme with 1:1 stoichiometry (Moffet et al. 2003). The availability of this collection enabled us to determine "default reduction potentials" for 1:1 heme proteins that have neither been selected by evolution nor explicitly designed for redox activity. We measured the midpoint reduction potentials for five first-generation and three second-generation binary patterned proteins. The potentials ranged from 112 mV to 176 mV (Moffet et al. 2003). These default reduction potentials can be compared with the reduction potentials of (1) heme alone, (2) naturally evolved heme proteins, and (3) rationally designed heme proteins. The midpoint reduction potential for unbound heme is 220 mV. Nature has altered this potential for particular redox functions by evolving heme proteins spanning a wide range of potentials from approximately 400 mV to +400 mV. Attempts to engineer the potentials of either natural proteins (Springs et al. 2000, 2002) or novel protein maquettes (Shifman et al. 2000) have produced narrower ranges.
Functional proteins without cofactors
Many proteins in nature are fully active without the aid of cofactors. Therefore we were also interested in probing the ability of the binary code strategy to generate functional proteins without cofactors.
Previous attempts by other groups to generate proteins with enzymelike activities have relied on a number of different approaches (Pollack et al. 1986; Tramontano et al. 1986; Stewart et al. 1994; Pinto et al. 1997; Broo et al. 1998; Benson et al. 2000; Hilvert 2000; Bolon and Mayo 2001; Keefe and Szostak 2001; Yamauchi et al. 2002; Looger et al. 2003). Some groups used rational design and/or computational methods to engineer active sites and catalytic function into either natural or novel protein sequences. Other groups selected or screened for desired activities by using the mammalian immune system (e.g., for catalytic antibodies), evolution in vitro, or other methods.
Recently, we probed whether any of our de novo
-helical proteins might catalyze ester hydrolysis (Wei and Hecht 2004). For our initial probes of catalytic activity, we focused on hydrolysis of p-nitrophenyl esters. We chose this activity because (1) hydrolysis of p-nitrophenyl esters is relatively easy to achieve (Menger and Ladika 1987), (2) it is straightforward to assay, and (3) there are precedents of novel esterases being isolated both from catalytic antibodies and from rational designs (Pollack et al. 1986; Tramontano et al. 1986; Menger and Ladika 1987; Stewart et al. 1994; Broo et al. 1998; Bolon and Mayo 2001; Yamauchi et al. 2002).
We measured the esterase activity of S-824, the binary patterned protein whose structure was solved at high resolution (see the section "Solution structure of a de novo protein from a designed combinatorial library" and Figs. 6
and 7
). Protein S-824 displayed a rate enhancement (kcat/kuncat) of 8700. The observed activity is similar to or better than that observed for several esterases designed previously using rational design and/or automated computational methods (Broo et al. 1998; Bolon and Mayo 2001). Moreover, the observed activity rivals those of the first catalytic antibodies (Pollack et al. 1986; Tramontano et al. 1986).
Because hydrolysis of p-nitrophenyl esters by protein S-824 is presumed to involve a histidine nucleophile (Wei and Hecht 2004), it was important to establish that the rate catalyzed by protein S-824 is above that catalyzed by free imidazole. At pH 7, protein S-824 hydrolyzed p-nitrophenyl acetate ~100-fold faster than does 4-methylimidazole (Wei and Hecht 2004). Even after correcting for the 11 histidines in the sequence of S-824, it is clear that the de novo protein catalyzes ester hydrolysis more effectively than a simple imidazole-based catalyst. Like natural enzymes, S-824 catalyzed multiple turnovers and remained active after prolonged exposure to excess substrate.
To assess whether the activity of S-824 is representative of other proteins in our binary patterned libraries, we measured the esterase activity of six additional binary patterned proteins. These proteins were from two libraries: the original library of 74-residue sequences and the second-generation library of 102-residue sequences described above. Both libraries were naïve in that they were neither designed to bind substrate nor subjected to high throughput screens for activity. All six of the additional proteins displayed esterase activity significantly above background (Wei and Hecht 2004).
These findings suggest that although the exquisite levels of activity and specificity typical of natural enzymes may have required eons of evolutionary selection, proteins with moderate levels of activity are surprisingly common in libraries of de novo sequences designed by binary patterning. The activities of these unselected proteins provide a reference state for the levels of activity that have been reported for proteins obtained by selection and/or computational design.
| Amyloid-like fibrils from binary patterned libraries |
|---|
|
|
|---|
peptide, whereas in spongiform encephalopathies, it is the prion protein. Despite substantial differences in both sequence and length, these diverse proteins assemble into amyloid structures that are remarkably similar to one another. They all form fibrils composed of
-strands running perpendicular to the fibril axis (cross-
structure). The similarity among the structures of the different amyloids suggests the various amyloids may share unifying structural determinants. However, the extreme dissimilarity between the various amyloidogenic sequences, coupled with the unavailability of a high-resolution structure, has limited understanding of the sequence determinants of amyloidogenesis (Wood et al. 1995; Selkoe and Podlisny 2002; Wurth et al. 2002; Williams et al. 2004). What are the molecular determinants of amyloidogenesis? Which features of an amino acid sequence cause it to assemble into amyloid? These questions can be addressed by two very different approaches: (1) One can probe the sequence determinants of natural amyloid proteins by screening randomly generated amino acid substitutions (mutations) to identify those that prevent amyloidogenesis or (2) one can test hypotheses about possible sequence determinants by using them as the basis for the design of de novo amyloid-like proteins.
We have used both approaches. Our work probing the sequence determinants of amyloidogenesis in a natural system (the Alzheimers A
peptide) has been described previously (Wurth et al. 2002) and will not be reviewed here. Our design of libraries of de novo amyloid-like fibrils is described in this section.
To probe the sequence determinants of amyloidogenesis by de novo protein design, we once again used the binary code strategy. We designed a binary patterned combinatorial library of de novo proteins (West et al. 1999) using the
-strand pattern shown in Figure 1
. All sequences in the library were constrained by the
alternating pattern consistent with amphiphilic
-structure. The precise identities of the side chains were not constrained and were varied combinatorially. Polar sites were allowed to be His, Lys, Asn, Asp, Gln, or Glu and nonpolar residues were allowed to be Leu, Ile, Val, or Phe. A schematic diagram of the binary pattern for proteins containing six
-strands punctuated by turns is shown in Figure 10
. The amino acid sequences of proteins from this library are shown in Figure 11
.
|
|
-structure. Electron microscopy demonstrated that they self-assembled into large oligomers resembling amyloid-like fibrils (Fig. 12
|
-helical proteins (see above). In both cases, combinatorial libraries were designed by specifying the binary pattern of polar and non-polar residues. Yet the resulting proteins display dramatically different properties: In the previous work, the sequences formed
-helices that folded intramolecularly into small globular domains. In contrast, the sequences described here form
-strands and self-assemble intermolecularly into high-order oligomers that assume fibrillar structures. What causes these dramatically different structures? The lengths of the sequences are not dramatically different, nor are their overall compositions. We propose that the determining difference is the binary patterning itself. In the earlier work, the library was constrained by the pattern



, consistent with the periodicity of
-helical structure. In contrast, this library is constrained by the pattern
, consistent with the periodicity of amphiphilic
-strands. If it is indeed true that alternating patterns of polar and nonpolar residues have an inherent propensity to form amyloid fibrils, then one might expect that these binary patterns would be disfavored by natural selection. We tested this possibility with a bioinformatics study: We analyzed a database of 250,514 natural protein sequences comprising 79,708,024 residues and calculated the frequencies of alternating patterns relative to other patterns with similar compositions. The results of this search revealed that alternating patterns occur in nature significantly less often than other patterns with similar compositions (Broome and Hecht 2000). The underrepresentation of alternating binary patterns in natural proteins, coupled with the observation that such patterns promote amyloid-like structures in de novo proteins, suggests that sequences of alternating polar and nonpolar amino acids are inherently prone to form amyloid-like structures and consequently have been disfavored by evolutionary selection.
Monomeric -sheet proteins
|
|---|
|
|
|---|
-sheet proteins described in the previous section were designed to encode proteins containing six amphiphilic
-strands separated by turns. Each
-strand was designed to be seven residues long, with polar and nonpolar amino acids arranged with an alternating periodicity (Figs. 10
) for all of the
-strands: No strand was explicitly designated to form the edges of the
-sheets. With all
-strands preferring to occupy interior (as opposed to edge) locations, intermolecular oligomerization was favored, and the proteins assembled into amyloid-like fibrils (shown in Fig. 12
|
-sheet proteins, we used a strategy suggested by Richardson and Richardson (2002) to redesign the first and/or last
-strands of several sequences from the original library. In the redesigned
-strands, the binary pattern was changed from
to
K
(where K denotes lysine). The presence of a lysine on the nonpolar face of a
-strand should disfavor fibrils because such structures would bury an uncompensated charge. The nonpolar-to-lysine mutations, therefore, would be expected to favor monomeric structures in which the
K
sequences form edge strands with the charged lysine side chain accessible to solvent (Fig. 13C,D
-strand, the C-terminal
-strand, or both was changed to lysine. Characterization of the redesigned proteins showed that they indeed formed monomeric
-sheet proteins (Wang and Hecht 2002). | Protein-based biomaterials |
|---|
|
|
|---|
Several proteins were probed for their abilities to form amphiphilic monolayers. The proteins were chosen from the binary patterned library described above, which was designed to form six
-strands punctuated by reverse turns (Figs. 10
, 11
). We used LangmuirBlodgett techniques to study the properties of the monolayers at the air/water interface, and spectroscopic methods to assess secondary structure. The results of these studies demonstrated that (1) the proteins self-assembled into monolayers at an air/water interface; (2) the monolayers were dominated by
-sheet secondary structure, as shown by both circular dichroism and infrared spectroscopies; (3) the measured area per protein molecule was approximately 500600 Å2. This matches the area expected for a model of an amphiphilic
-sheet shown in Figure 14
. If the polar turns project down into the aqueous solvent, as shown in the figure, then the measured area would comprise only the 42 residues in the six
-strands, which would indicate an area of approximately 1214 Å2 per residue (Xu et al. 2001).
|
-sheet monolayers can be encoded by the designed pattern of polar and nonpolar amino acids. Moreover, because the designed pattern is compatible with a wide variety of different sequences, it may be possible to fabricate
-sheet monolayers using combinations of side chains that are explicitly designed for particular applications of novel biomaterials.
Template-directed assembly of de novo designed proteins
A number of biological materials owe their unusual structural characteristics and mechanical properties to long-range order induced by the lamination of proteins between layers of inorganic mineral (Lowenstam and Weiner 1989; Heuer et al. 1992; Sarikaya and Aksay 1995; Aksay et al. 1996; Belcher et al. 1996; Falini et al. 1996; Weiner and Addadi 1997). A well-studied example of protein/mineral layering occurs in the nacre of mollusk shells (mother of pearl). These laminated structures are composed of alternating layers of a protein-rich matrix and aragonite, a crystal form of calcium carbonate. The proteins in the protein-rich layer are dominated by
-sheet secondary structure. In such composites, both the protein layer and the mineral layer adopt structures different from those they assume in isolation. Interactions between such layers and the ordered structures that result from these interactions enable nature to produce biomaterials that are simultaneously hard, strong, and tough.
With the long-term goal of constructing artificial biomaterials with laminated structures, we developed a biomimetic system using an ordered surface to template the assembly of a de novo designed
-sheet protein. The protein used in this study was chosen from the same combinatorial library used above (Fig. 11
) to form amyloid-like fibrils in a homogeneous aqueous environment or
-sheet monolayers at an air/water interface. Formation of facial amphiphiles is favored at an interface between polar and nonpolar phases. In the previous example, monolayers formed by self-assembly at the interface between water and air. In contrast, to study template-directed assembly, we chose the nonpolar phase to be the highly ordered surface of pyrolytic graphite. The graphite lattice is hexagonal; therefore structures templated by the graphite surface would be expected to show threefold symmetry.
To probe the ability of our de novo proteins to undergo template-directed assembly, we deposited protein onto a graphite surface and used atomic force microscopy (AFM) to image the resulting assemblies (Brown et al. 2002). As shown in Figure 15C
, the AFM images demonstrate that the protein assembles on the graphite surface into ordered fibers aligned in three orientations at 120° to each other. This symmetry indicates that the hexagonal lattice of graphite (Fig. 15B
) directs nucleation of fibers on the surface. The straightness of the fibers and their persistent length of several microns suggest that the template also influences the addition of protein monomers onto the growing fiber. The size of these structures indicates that the hexagonal lattice of the graphite surface templates assembly of millions of protein molecules into a highly ordered structure, reminiscent of those found in natural biomaterials.
|
| Conclusions and prospects for future work |
|---|
|
|
|---|
-helical proteins,
-sheet proteins, well-ordered structures, cofactor binding proteins, catalytically active enzymes, self-assembled monolayers, amyloid-like nanofibrils, and template-directed assemblies of two-dimensional biomaterials.
These initial successes notwithstanding, thus far the potential of the binary code for protein design has been explored only cursorily. To date, we have completed detailed studies of only several representative proteins from each collection. Although characterization of these few proteins has provided a proof-of-principle demonstration that the binary code strategy is an effective method for discovering novel proteins with interesting properties (e.g., well-ordered structures, cofactor binding, enzyme-like activity, etc.), the range of properties that might ultimately be uncovered will require detailed studies of many more proteins. For example, the first binary patterned protein whose three-dimensional structure was solved formed a four-helix bundle that is well ordered and displays nativelike interior packing (Figs. 6
, 7
). Will this be true for all members of this library? Or will there be a range of behaviors with some structures being less well packed? If it turns out that the majority of sequences from this library form highly ordered structures with nativelike packing (as suggested by Fig. 5
and described by Wei et al. 2003b), then this would imply that even when packing is not explicitly designed, protein sequences nonetheless "find a way" to achieve good packing. Such a result would have significant implications both for de novo protein design and for biological evolution. Our ability to address this question, however, will require detailed structural and dynamic studies of many more proteins form our libraries. Such studies are currently underway.
Future work will aim to merge the binary code strategy with high throughput screens and selections. In recent years, a number of powerful methods have been developed to facilitate the isolation of rare "winners" from vast collections of inactive candidates (Smith 1985; Hanes and Plückthun 1997; Chen et al. 2001; Keefe and Szostak 2001; Lin and Cornish 2002). These methods are often applied to libraries of randomly generated sequences. However, because the vast majority of sequence space is not likely to encode well-folded functionally active proteins (Mandecki 1990; Davidson and Sauer 1994; Davidson et al. 1995; Prijambada et al. 1996), application of these methods to randomly generated sequences typically requires very large libraries and yields "hits" only very rarely (Keefe and Szostak 2001). In contrast, application of screens or selections to focused libraries is likely to produce successful hits far more frequently. As described in this review, the binary code strategy can focus libraries of de novo sequences into regions of sequence space that favor folded protein structures. Therefore, we anticipate that application of high throughput screens and selections to binary patterned libraries will provide a rich source of novel proteins with interesting and useful activities.
| Acknowledgments |
|---|
| References |
|---|
|
|
|---|
Aksay, I.A., Trau, M., Manne, S., Honma, I., Yao, N., Zhou, L., Fenter, P., Eisenberger, P.M., and Gruner, S.M. 1996. Biomimetic pathways for assembling inorganic thin films. Science 273: 892898.[Abstract]
Axe, D.D., Foster, N.W., and Fersht, A.R. 1996. Active barnase variants with completely random hydrophobic cores. Proc. Natl. Acad. Sci. 93: 55905594.
Beasley, J.R. and Hecht, M.H. 1997. Protein design: The choice of de novo sequences. J. Biol. Chem. 272: 20312034.
Belcher, A.M., Wu, X.H., Christensen, R.J., Hansma, P.K., Stucky, G.D., and Morse, D.E. 1996. Control of crystal phase and orientation by soluble mollusk-shell proteins. Nature 381: 5658.[CrossRef]
Benson, D.E., Wisz, M.S., and Hellinga, H.W. 2000. Rational design of nascent metalloenzymes. Proc. Natl. Acad. Sci. 97: 62926297.
Betz, S.F. and DeGrado, W.F. 1996. Controlling topology and native-like behavior of de novo designed peptides: Design and characterization of anti-parallel four-stranded coiled coils. Biochemistry 35: 69556962.[CrossRef][Medline]
Bolon, D.N. and Mayo, S.L. 2001. Enzyme-like proteins by computational design. Proc. Natl. Acad. Sci. 98: 1427414279.
Bowie, J.U., Reidhaar-Olson, J.F., Lim, W.A., and Sauer, R.T. 1990. Deciphering the message in protein sequences: Tolerance to amino acid substitutions. Science 247: 13061310.
Bromberg, S. and Dill, K.A. 1994. Side-chain entropy and packing in proteins. Protein Sci. 3: 9971009.[Abstract]
Broo, K.S., Nilsson, H., Nilsson, J., and Baltzer, L. 1998. Substrate recognition and saturation kinetics in de novo designed histidine-based four-helix bundle catalysts. J. Am. Chem. Soc. 120: 1028710295.[CrossRef]
Broome, B.M. and Hecht, M.H. 2000. Nature disfavors sequences of alternating polar and nonpolar amino acids: Implications for amyloidogenesis. J. Mol. Biol. 296: 961968.[CrossRef][Medline]
Brown, C.L., Aksay, I.A., Saville, D.A., and Hecht, M.H. 2002. Template-directed assembly of a de novo designed protein. J. Am. Chem. Soc. 124: 68466848.[CrossRef][Medline]
Chen, G., Hayhurst, A., Thomas, J.G., Harvey, B.R., Iverson, B.L., and Georgiou, G. 2001. Isolation of high-affinity ligand-binding proteins by periplasmic expression with cytometric screening (PECS). Nature Biotechnol. 19: 537542.[CrossRef][Medline]
Chothia, C. and Lesk, A.M. 1987. The evolution of protein structures. Cold Spring Harbor Symp. Quant. Biol. 52: 399.[Medline]
Davidson, A.R. and Sauer, R.T. 1994. Folded proteins occur frequently in libraries of random amino acid sequences. Proc. Natl. Acad. Sci. 91: 21462150.
Davidson, A.R., Lumb, K.J., and Sauer, R.T. 1995. Cooperatively folded proteins in random sequence libraries. Nat. Struct. Biol. 2: 856864.[CrossRef][Medline]
Dill, K.A. 1985. Theory for the folding and stability of globular proteins. Biochemistry 24: 15011509.[CrossRef][Medline]