Protein Science
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS
 QUICK SEARCH:   [advanced]


     


This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Cazalis, R.
Right arrow Articles by Gibrat, J.-F.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Cazalis, R.
Right arrow Articles by Gibrat, J.-F.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?
Protein Science (2003), 12:34-43.
Copyright © 2003 The Protein Society

Homology modeling and molecular dynamics simulations of the N-terminal domain of wheat high molecular weight glutenin subunit 10

Roland Cazalis1, Thierry Aussenac2, Larbi Rhazi1, Antoine Marin3 and Jean-François Gibrat3

1 Laboratoire d’Agrophysiologie, UMR 1054 INRA, ESA Purpan, 31076 Toulouse cedex 3, France
2 ISA Beauvais, 60026 Beauvais cedex, France
3 Mathématique, Informatique et Génome, Centre de Recherche de Versailles, INRA, 78026 Versailles cedex, France

Reprint requests to: Jean-François Gibrat, Mathématique, Informatique et Génome, Centre de Recherche de Versailles, INRA, Route de St Cyr, 78026 Versailles cedex, France; e-mail: gibrat{at}versailles.inra.fr; fax: +33 (1) 30 83 33 59.

(RECEIVED August 28, 2002; ACCEPTED October 17, 2002)

Article and publication are at http://www.proteinscience.org/cgi/doi/10.1110/ps.0229803.


    Abstract
 TOP
 Abstract
 Introduction
 Results
 Discussion
 Conclusion
 Materials and methods
 References
 
High molecular weight glutenin subunits (HMW-GS) are of a particular interest because of their biomechanical properties, which are important in many food systems such as breadmaking. Using fold-recognition techniques, we identified a fold compatible with the N-terminal domain of HMW-GS Dy10. This fold corresponds to the one adopted by proteins belonging to the cereal inhibitor family. Starting from three known protein structures of this family as templates, we built three models for the N-terminal domain of HMW-GS Dy10. We analyzed these models, and we propose a number of hypotheses regarding the N-terminal domain properties that can be tested experimentally. In particular, we discuss two possible ways of interaction between the N-terminal domains of the y-type HMW glutenin subunits. The first way consists in the creation of interchain disulfide bridges. According to our models, we propose two plausible scenarios: (1) the existence of an intrachain disulfide bridge between cysteines 22 and 44, leaving the three other cysteines free of engaging in intermolecular bonds; and (2) the creation of two intrachain disulfide bridges (involving cysteines 22–44 and cysteines 10–55), leaving a single cysteine (45) for creating an intermolecular disulfide bridge. We discuss these scenarios in relation to contradictory experimental results. The second way, although less likely, is nevertheless worth considering. There might exist a possibility for the N-terminal domain of Dy10, Nt-Dy10, to create oligomers, because homologous cereal inhibitor proteins are known to exist as monomers, homodimers, and heterooligomers. We also discuss, in relation to the function of the cereal inhibitor proteins, the possibility that this N-terminal domain has retained similar inhibitory functions.

Keywords: Fold recognition; cereal inhibitor family; glutenin polymer; disulfide bridge pattern


    Introduction
 TOP
 Abstract
 Introduction
 Results
 Discussion
 Conclusion
 Materials and methods
 References
 
Wheat gluten proteins are critical for bread-making and other food uses as they confer unique rheological properties on doughs, a combination of elasticity and viscous flow. Wheat gluten is a complex mixture of proteins broadly divided into two groups, the monomeric gliadins and the glutenins, which form polymers linked by interchain disulfide bonds. Both groups contribute to the rheological properties of gluten and dough. However, a glutenin subgroup, the high molecular weight glutenin subunits (HMW-GS), is considered the most important determinants of these polymers (Payne 1987).

Early studies showed that the differences in the HMW-GS composition of bread wheat cultivars were associated with variation in breadmaking quality (Payne et al. 1979), whereas more recent studies have shown direct effects of HMW-GS composition on the proportion of high molecular weight glutenin polymers (Popineau et al. 1994; Gupta et al. 1995) and the level of gluten viscosity (Popineau et al. 1994). The structures and interactions of the HMW-GS are, therefore, of considerable interest in relation to understanding and manipulating gluten functionality.

HMW-GS proteins consist of two closely related subunit types, called x and y on the basis of their mobility in SDS-PAGE (Payne 1987). The x and y types share a similar primary structure, which consists of three domains: (1) a small N-terminal domain ranging from 80 to 120 amino acids with three to five cysteine residues (Shewry and Tatham 1997); (2) a large central domain made up of repeating sequences rich in glutamines, prolines, and glycines; (3) the C-terminus, a nonrepetitive domain of ~50 residues with one cysteine. The nonrepetitive domains, then, contain most or all of the cysteine residues, some of which form interchain disulfide bonds (Gupta et al. 1995) to stabilize the high molecular glutenin polymers.

However, x- and y-type subunits differ in some features. Sequence alignments show that the y-type subunit exhibits an insertion of 18 residues in the middle of the N-terminal domain, including two cysteine residues, which is not found in the x-type subunit. Therefore, y types have five cysteines in this domain, whereas x types usually have three. y types have an additional cysteine in the repetitive domain that is also characterized by differences in composition and order of the repeat motifs between the two types (Anderson and Greene 1989; Miles et al. 1991; Matsushima et al. 1992).

Physical studies (Miles et al. 1991; Matsushima et al. 1992; Thomson et al. 1999; Humphris et al. 2000) and molecular modeling (Matsushima et al. 1990; Kasarda et al. 1994; Parchment et al. 2001) of the central repeat domain indicate a ß-turn conformation organized in a ß-spiral structure that might be responsible for the elastic properties of the molecule.

It is known that cross-linking of the various glutenin subunits by intermolecular disulfide bridges plays a critical role in the formation of high molecular glutenin polymers. However, the experimental results, as to which cysteines of the different glutenin subunits are involved in these interchain bridges, are, so far, contradictory (Köhler et al. 1991, 1993; Keck et al. 1995; Shimoni et al. 1997; Kasarda 1999).

Recently the first 50 residues of both HMW-GS Dx5 and Bx7 were modeled by computer methods (Köhler et al. 1997). The resulting structure shows that the first two cysteines are located in a region of continuous {alpha}-helix in HMW-GS Dx5, whereas HMW-GS Bx7 shows an {alpha}-helix structure interrupted between the first two cysteines by an inverse {gamma}-turn. This break in the helix permits a possible intramolecular disulfide bond, in agreement with some of the experimental results (Köhler et al. 1993).

In the present work, we have investigated the structure of the N-terminal domain of HMW-GS Dy10 (Cheyenne cultivar, GenBank accession number X12929; Anderson et al. 1989), using fold-recognition and molecular modeling techniques. Note that, hereafter, this N-terminal domain will be referred to as Nt-Dy10. Nt-Dy10 is defined as encompassing the first 119 amino acids, excluding the signal peptide (the first 21 residues in the GenBank file). An analysis of the resulting models allows us to propose a number of hypotheses regarding the properties of Nt-Dy10 that can be tested experimentally and to discuss the cysteines of the HMW-GS Dy10 involved in intra- and interchain disulfide bridges.


    Results
 TOP
 Abstract
 Introduction
 Results
 Discussion
 Conclusion
 Materials and methods
 References
 
Sequence analysis
To obtain a more accurate partition of the sequence in terms of domains, we used the program SEG (Wootton and Federhen 1993), which is designed to segment amino acid sequences into subsequences of contrasting complexity. Figure 1Go shows the result of this analysis for the sequence of HMW-GS Dy10. It confirms well what is known experimentally about the structure of the protein (Shewry et al. 1992). HMW-GS Dy10 is made up of three domains, an N-terminal globular domain having 119 residues, a small C-terminal globular domain having ~50 residues, and a large, low-complexity domain connecting the two globular domains.



View larger version (35K):
[in this window]
[in a new window]
 
Figure 1. SEG analysis of the HMW-GS Dy10 sequence. The left column shows low complexity regions, the right column regular regions, that is, regions that are likely to correspond to globular structures. The middle column is the sequence numbering.

 
The presence of leucine and glutamate residues at the beginning of the sequence is reminiscent of a coiled-coil domain. There are several types of coiled-coil domains, for instance, the leucine zipper domain in transcriptional regulators or the coiled-coil domain observed in the hinge region of myosin. The leucine zipper motif allows the dimerization of DNA-binding proteins that are, except for this domain, regular globular proteins. Coiled-coil domains are often involved in protein oligomerization; for example, it is believed that they are involved in flagelin polymerization. To test for the existence of a coiled-coil domain, we used the program COIL (Lupas et al. 1991), which delineates coiled-coil domains in otherwise globular proteins. No coiled-coil domain was found in the sequence of HMW-GS Dy10.

We performed a secondary structure prediction of Nt-Dy10 using the program PSIPRED (McGuffin et al. 2000). On average, this program provides a percentage of correctly predicted residues of 75%, when three states are taken into consideration: {alpha}-helix, ß-strand, and coils. Figure 2Go shows the result of the secondary structure prediction. The structure of Nt-Dy10 appears to be mostly helical: There are three N-terminal helices and two C-terminal helices. A short ß-strand is predicted in between the N-terminal and C-terminal helices.



View larger version (27K):
[in this window]
[in a new window]
 
Figure 2. Result of the PSIPRED secondary structure prediction of Nt-Dy10.

 
Homology search in the databases
The next stage of our analysis was to search in databases for proteins that were homologous to Nt-Dy10. We started our analysis with sequence comparison techniques, using increasingly sensitive methods.

Thus, we first used BLAST (Altschul et al. 1990), then PSI-BLAST (Altschul et al. 1997), and finally HHMER, a program based on Hidden Markov Model techniques (Hughey and Krogh 1996) that performs a search against the PFAM database of domains (Bateman et al. 2002).

It has been shown (Brenner et al. 1998) that programs such as BLAST are not very effective for finding remote homologs whose amino acid sequences can have greatly diverged. More effective methods, instead of using a single sequence, make use of multiple aligned sequences (or profiles). Profiles contain more information than single sequences and therefore allow a more sensitive search. PSI-BLAST is a method based on BLAST that builds a profile iteratively. This program, as well as Hidden Markov Model techniques, has been shown to be three times more sensitive than BLAST (Park et al. 1998).

Using these three methods, only glutenins and D-hordeins were found (data not shown). We therefore turned toward fold-recognition methods. It is well known that the 3D structure of homologous proteins is better conserved than the amino acid sequence: remote homologs can have as few as 10%–15% conserved residues and yet still have very similar 3D structures and often similar functions. The 3D structure is thus a better means of characterizing remotely homologous proteins than the sequence.

We have developed a fold-recognition method, called FROST (Marin et al. 2002a, b), and have shown that this method is twice as sensitive as PSI-BLAST (Marin et al. 2002a).

Table 1Go shows the results of the search for the sequence of Nt-Dy10 against a database that contains representatives of all the known 3D structures. FROST consists of two filters called the 1D and 3D filters. The Core column corresponds to the code of known protein 3D structures of the Protein Databank (PDB). The Definition column summarizes the function of the corresponding protein, and the Id column indicates the percentage of conserved residues in the query and core sequences. The Sini column gives the alignment score. The most interesting column for our purpose is the column Ndist (normalized distance). Without going into details, let us say that this distance quantifies the degree of confidence we have in the fold recognition. We have shown that, for a normalized distance >3.2, there is a <1% chance to wrongly assign a fold to a sequence. In Table 1Go, three proteins with the same fold appear above this threshold: the bifunctional corn Hageman factor inhibitor: PDB code 1BEA (Behnke et al. 1998); the bifunctional {alpha}-amylase/trypsin inhibitor from ragi seed: PDB codes 1BIP (Strobl et al. 1995) and 1TMQ (Strobl et al. 1998); and the 0.19 {alpha}-amylase inhibitor from wheat kernel: PDB code 1HSS (Oda et al. 1997). 1BIP and 1TMQ correspond to the same protein, whose 3D structure has been solved by X-ray crystallography, file 1TMQ, and by NMR spectroscopy, file 1BIP. In fact, the structure in file 1TMQ describes the complex yellow meal worm {alpha}-amylase and bifunctional {alpha}-amylase/trypsin inhibitor from ragi seed. Note that the results of both filters concord.


View this table:
[in this window]
[in a new window]
 
Table 1. FROST results for Nt-Dy10
 
All these proteins belong to the cereal inhibitor family. In Release 40.7 of SWISS-PROT, we found 19 known members of this family (omitting short fragments). These proteins were detected with FASTA (Pearson 2000) using the sequence of 1BIP (IAAT_ELECO) as a query. Table 2Go shows the function of these proteins when known (Strobl et al. 1998).


View this table:
[in this window]
[in a new window]
 
Table 2. Function of the members of the cereal protease inhibitor family
 
Modeling of the 3D structure of Nt-Dy10
Figure 3AGo presents a structural alignment of proteins 1HSS (IAA1_WHEAT), 1BIP (IAAT_ELECO), and 1BEA (ITRF_ MAIZE) performed using the VAST program (Gibrat et al. 1996). The core of the structures consists of four conserved {alpha}-helices with a simple up-and-down topology. Other parts of the structure differ markedly.



View larger version (33K):
[in this window]
[in a new window]
 
Figure 3. (A) Structural alignment of 1BEA, 1BIP, and 1HSS chain A. 1BEA (bifunctional corn Hageman factor inhibitor) and 1BIP (bifunctional {alpha}-amylase/trypsin inhibitor from Ragi seed) share 67% identical residues. Their tertiary structures, and therefore their secondary structures, are very similar. (s.s.) Lines describe the secondary structures of 1BEA and 1HSS A. Upper-case residues indicate zones of the 3D structure that are superimposable. (x) Disordered residues that are not seen in the electron density. The core of the 3D structure is composed of four conserved {alpha}-helices. Other parts of the structure differ. (B) Sequence alignments between Nt-Dy10 and 1BEA, 1BIP, and 1HSS chain A. (s.s.) Observed secondary structure of the template protein; (H) helix; (E) strand; and (.) coils. These alignments constitute the basis of the modeling procedure.

 
The loop between helix 1 and helix 2 corresponds, for 1BIP and 1BEA, to the canonical inhibition site for trypsin serine proteases (Bode and Huber 1992). This loop protrudes into the active site of the corresponding serine proteases in a substrate-like fashion. The corresponding loop in 1HSS has a very different conformation, but apparently 1HSS inhibits only {alpha}-amylase. The loop between helices 3 and 4 is also very different in both types of structures.

Once homologous proteins whose 3D structure is known have been identified, it is possible to build a 3D model for Nt-Dy10 using these known structures as templates.

The first stage of homology modeling, that is, the query protein sequence alignment with the sequence of the template, is critical. If this alignment is wrong, the resulting model will be wrong. To obtain the best possible model, we make use of all the information at hand: cereal inhibitor family multiple sequence alignment (data not shown), threading alignment, structural alignment of known 3D structures, secondary structure prediction, and so on.

Because the 3D structures of 1BEA, 1BIP, and 1HSS show many differences, we built a model of Nt-Dy10 using these three proteins as a template. Alignments of the query sequence with the three template sequences are shown in Figure 3BGo. Notice that 1HSS is a homodimer. The model we built using 1HSS as a template is thus also a homodimer.

The 3 models were built with MODELLER (Sali and Blundell 1993). Hereafter, Mod_1BEA (Mod_1BIP and Mod_1HSS, respectively) refers to the 3D model of Nt-Dy10 using the structure of 1BEA as a template (1BIP and 1HSS, respectively).

Molecular dynamics simulations
To test the robustness of the three models, we submitted Mod_1BEA and Mod_1BIP to 300 psec and Mod_1HSS to 200 psec of molecular dynamics. The purpose of these simulations was twofold. First, we wished to verify that there was no serious flaw in the models, that is, that the modeled 3D structures would not collapse during the simulation, indicating a poor modeling of the 3D structure. Second, we wanted to explore locally the conformational space and let the conformation of the models relax in a natural way. Molecular dynamics simulations constitute a necessary but, unfortunately, not sufficient condition to obtain models of the 3D structure of the protein of interest that one can trust.

Results for molecular dynamics simulations (data not shown) indicate that there is no serious problem with the models—the corresponding conformations are stable during the whole trajectory. Values of the root-mean-square deviation from the initial minimized conformation vary from 2.5 Å for Mod_1BEA to ~4 Å for Mod_1BIP. The Mod_1HSS conformation is still drifting away from its starting conformation. It reaches the value of 3 Å after 200 psec.

The last conformation of each trajectory was then minimized. The resulting conformations were used in the following analyses as representatives of the models.

Analysis of the models
Strobl et al. (1995) list a number of residues that form the hydrophobic core of the 3D structure of 1BIP and that are strongly conserved in the cereal inhibitor family. These residues are shown in Table 3Go for 1BIP, 1HSS, and Nt-Dy10. Residues Arg 56 and Gln 90 are completely buried in the structure of 1BIP and form a salt bridge. Most of these residues are either conserved or replaced by similar residues. There is a discrepancy for residues Leu 97 and Val 98, two hydrophobic residues in 1BIP that are replaced by polar residues (Thr 98 and Ser 99) in Nt-Dy10. Notice, however, that the residue aligned with Val 98 in 1HSS is also a polar residue, Thr 96.


View this table:
[in this window]
[in a new window]
 
Table 3. Conserved residues that form the core of the 3D structures
 
Generally speaking, the alignment of the first half of the sequence, up to the end of the third helix, is relatively straightforward. The number of conserved residues, such as the abovementioned core residues, but also cysteines and arginines, is sufficient, we believe, to correctly anchor the alignment. The second half of the structure is more problematic, in particular the location of the fourth helix. There is no residue conserved in this helix, and thus its precise location is more difficult to assess. This helix, potentially, plays an important role. In the homologous structure of 1HSS, hydrophobic residues lie on the external face of this helix and are involved in the dimer interface. In addition, none of the cysteines characteristic of the family appears to be conserved in this part of the sequence.

In their paper about the modeling of the N-terminal regions of HMW-GS Dx5 and Bx7 (first 50 residues) based exclusively on secondary structure predictions, Köhler et al. (1997) pointed out that the models they built had to be considered highly speculative.

Here, in contrast, we emphasize that fold-recognition techniques are now well established and constitute increasingly reliable techniques (CASP4 2001).

Benchmark tests with our method have also revealed that above a normalized distance of 4, on average, 75% of the query residues are perfectly aligned (with a shift of 0 with respect to their position in the true structural alignment), and a remaining 20% are aligned with a shift of at most 4 (Marin et al. 2002a). Therefore, we believe that, although all atomic details of the models are unlikely to be right, general features of the 3D structure should be essentially correct.

Dimer interface
To check whether Nt-Dy10 is able to form a dimer in a way similar to its template 1HSS, we analyzed the accessible surface of the interface between the two monomers in the template protein and in the model Mod_1HSS.

For this purpose we used NACCESS (Hubbard et al. 1991) to calculate the accessible surface area of the dimer, on the one hand, and of the monomer, on the other hand. All residues showing a relative surface area decreasing by more than 10% between the monomer and the dimer were recorded. They are listed in Table 4Go.


View this table:
[in this window]
[in a new window]
 
Table 4. Dimerization interface
 
The distribution of residues for 1HSS and Mod_1HSS is strikingly different. 1HSS exhibits a peculiar distribution of residues: seven are charged (Table 4Go, boldface), seven are hydrophobic (Table 4Go, italic), and seven are regular surface residues.

Two facts must be noticed. First, hydrophobic residues usually do not tend to cluster on the protein surface unless they are involved in a dimerization interface. Second, it is not favorable to bury charged residues unless they are able to create salt bridges. Figure 4Go shows that this is precisely the case for 1HSS. The dimerization interface is made of three layers: a strip of regular surface residues sandwiched by hydrophobic residues (Val 83, Leu 86, Val 94, Val 101, Val 110), themselves surrounded by a crown of charged residues that all form salt bridges (Arg 96 of one chain with Glu 31 of the other one, Arg 80 with Asp 102 and Asp 106, Lys 112 with Glu 82).



View larger version (94K):
[in this window]
[in a new window]
 
Figure 4. Space-filling representation of the dimer interface between chains A and B of the 1HSS structure. (Blue) Charged residues (Table 4Go); (red) hydrophobic residues; and (yellow) regular residues. Although hardly apparent in this view, the interface is rather flat.

 
As is apparent from Table 4Go, residues at the dimer interface for Mod_1HSS down to Val 103 are residues that are expected to be located at the protein surface for a regular globular protein. These data, therefore, seem to rule out the formation of a dimer by Nt-Dy10, at least in a way similar to the 1HSS dimer.

Disulfide bridges
Proteins of the cereal inhibitor family contain, in general, 10 cysteines forming 5 disulfide bridges (sometimes a cysteine is missing, resulting in only 4 disulfide bridges).

Nt-Dy10 has 5 cysteines. According to the models, only one disulfide bridge can be created between Cys 22 and Cys 44. In the models, Cys 10 and Cys 55 are 15 Å apart for Mod_1BIP and Mod_1BEA and 10 Å for Mod_1HSS. Because the N terminus of the molecule is flexible, it might be possible for these two cysteines to get close and form a disulfide bridge.

The monitoring of the corresponding cysteine sulfur-atom distance along the molecular dynamics trajectories (data not shown) for Mod_1BIP and Mod_1BEA shows that this distance undergoes a sudden transition, down to 10 Å, after 250 psec for Mod_1BEA and after 280 psec for Mod_1BIP. The trajectory for Mod_1HSS is not long enough to observe a similar phenomenon.

Therefore, we cannot completely rule out the possibility of a disulfide bridge between Cys 10 and Cys 55. Two scenarios can be considered. In the first one there is only one intrachain disulfide bridge between Cys 22 and Cys 44. Three cysteines—10, 45, and 55—remain available to make interchain disulfide bridges. In the second scenario there are 2 intrachain disulfide bridges connecting, respectively, Cys 22 to Cys 44, and Cys 10 to Cys 55. Only Cys 45 is free to create intermolecular disulfide bridges.


    Discussion
 TOP
 Abstract
 Introduction
 Results
 Discussion
 Conclusion
 Materials and methods
 References
 
Common viscoelastic properties with CM proteins
Some wheat proteins (SWISS-PROT IDs IA01_WHEAT, IA02_WHEAT, IA03_WHEAT, IA16_WHEAT) of the cereal inhibitor family, known as CM proteins (they were initially characterized by their solubility in a mixture of chloroform and methanol, hence their name), have been shown to be involved in the technological quality of wheat products like pasta (Gautier et al. 1989; Kobrehel and Alary 1989). Kobrehel and Alary (1989) found a significant correlation between the sulphydryl-disulfide content of a CM-rich fraction of gluten and the cooking quality of pasta. They underlined the "important functional role of [CM] proteins in the expression of the visco-elastic properties of gluten." They proposed that during hydrothermic treatment, CM proteins can form disulfide bridges, thus creating a strong proteinaceous network that improves the cooking quality of pasta.

We think that it is not merely luck if CM proteins and the N-terminal domain of HMW glutenins share common viscoelastic properties, but can be readily explained by their underlying common fold, as we propose in this paper.

Possible modes of domain interactions
The principal function of models is to help us propose new experiments to test biological hypotheses. Here, regarding the interaction between HMW glutenin proteins that are responsible for mechanical properties of dough, we can consider two different possibilities of association (that are not necessarily exclusive).

Intra- and interchain disulfide bridges
The most obvious possibility, backed up by many experimental studies, is the creation of interchain disulfide bridges. However, different opinions prevail as to which cysteines are involved in intra- and interchain disulfide bridges.

According to Wieser and colleagues (Köhler et al. 1991, 1993; Keck et al. 1995), there exist two parallel intermolecular disulfide bridges between adjacent Cys 3 and Cys 4 with the same cysteines in another y-type subunit. Note that, in this section, for the sake of comparison with data of the literature, Nt-Dy10 cysteines are numbered as they appear in the sequence, that is, from 1 to 5. They also proposed, tentatively, the existence of an interchain disulfide bond between Cys 2 of a y-type HMW-GS and Cys 5 of an x-type HMW-GS, although for this latter disulfide bridge two other possible types of connection exist.

Kasarda and colleagues (Tao et al. 1992) and Galili and colleagues (Shimoni et al. 1997) found only one interchain disulfide bridge connecting a y-type to an x-type HMW-GS. Note that these two teams used different experimental techniques to reach this conclusion. These experiments did not allow them to identify which cysteine of the y type was involved in the interchain disulfide bridge. Kasarda (1999), summarizing the experimental results available so far, proposed that the y-type HMW-GS contains two intrachain and one interchain disulfide bridges. He also acknowledged the inconsistency of his proposal with the results of the Wieser group.

The second scenario we proposed above, namely, that Cys 1–Cys 5 and Cys 2–Cys 3 form intrachain disulfide bonds, leaving Cys 4 free to create an interchain disulfide bridge, is fully compatible with the proposal of Kasarda.

However, this scenario is built on the assumption that Cys 1 and Cys 5 are able to create an intrachain disulfide bond. From the data obtained during the simulations this is possible but not yet proven.

The first scenario suggests that only Cys 2 and Cys 3 can create an intrachain disulfide bridge, leaving three cysteines free from making interchain bridges. This scenario is not compatible with data obtained by Köhler et al. (1991), because the intrachain bond involves two cysteines that are found by them to be engaged in interchain disulfide bridges. Let us note that the involvement of Cys 2 in an interchain bond with Cys 5 of an x-type HMW-GS is only one of three, equally likely, possibilities. The question, then, arises, regarding the work that shows the existence of the two parallel disulfide bridges (Köhler et al. 1991), of the possibility of an experimental artifact. In this work, a peptide made of two identical fragments, CCQQL (residues 44–48 of HMW-GS Dy10), linked by the two abovementioned parallel disulfide bonds was isolated. One might well imagine that, if the intrachain bond is broken in both subunits during the trypsic digestion, a new, artifactual, disulfide bridge could be made between the two newly freed Cys 3s (Cys 44s in the sequence) that are maintained in a close vicinity in the peptidic fragment created by the experiment.

The models provide ambiguous data regarding intra- and interchain disulfide bridges. It appears, however, that these data are easier to reconcile with the proposal of the Kasarda and Galili groups than with Wieser and coworkers’ suggestions.

Domain dimerization
The second possibility concerns the direct association of chains to form oligomers. Proteins of the cereal inhibitor family are known to exist as monomers, homodimers, or heterooligomers. Our analysis of the dimer interface in Mod_1HSS seems to rule out this possibility. We have tried to find other possible oligomerization interfaces, but we could not come up with anything significant (data not shown). We must emphasize, however, that it is, in general, rather difficult to locate such an interface without a priori clues, and we might well have overlooked it. It is therefore interesting, we think, to study experimentally whether HMW glutenin proteins are capable of association (mediated by the N-terminal domain) in the absence of disulfide bridges, under reducing conditions.

Potential function of the N-terminal domain
As indicated by Strobl et al. (1998) and summarized in Table 2Go, proteins of the cereal inhibitor family inhibit two important digestive enzymes of animals (trypsin and {alpha}-amylase) and are thought to be involved, in vivo, in a plant defense mechanism against predatory insects.

Proteins of the cereal inhibitor family and the N-terminal domain of HMW-GS Dy10 are remote homologs. This implies that the present function of the N-terminal domain of HMW-GS Dy10 and the function of proteins of the cereal family are more likely to have diverged over time. In spite of these questions, we, nevertheless, think that it would be interesting to test experimentally whether this domain is capable of inhibiting trypsin and/or {alpha}-amylase, at least in vitro.

An analysis of the models and related alignments leads us to believe that it is unlikely that Nt-Dy10 exhibits an {alpha}-amylase or trypsin inhibitory function. However, some engineering of the sequence could restore these functions as described below.

The {alpha}-amylase inhibitory site of 1BIP has been shown to involve principally two segments, Ser 1–Ala 11 and Pro 52–Cys 55 (Strobl et al. 1998). The first residue, Ser 1, plays a critical role because it is capable of interacting with all three acidic residues of the {alpha}-amylase active site. It is known that N-terminal elongation of wheat {alpha}-amylase inhibitor (IAA2_WHEAT) diminishes or abolishes inhibition of {alpha}-amylase (Garcia et al. 1991). This is confirmed by the fact that most proteins, IAA_HORVU, RA17_ORYSA, RA14_ORYSA, RAG2_ORYSA, RA05_ORYSA, which have no known inhibitory function according to Table 2Go, exhibit an N-terminal elongation. Nt-Dy10 also has such an N-terminal elongation, and one might be able to recover a detectable {alpha}-amylase inhibitory effect only by trimming the first 4 residues of Nt-Dy10.

The trypsin-binding loop of 1BIP is located between helices 1 and 2 and corresponds to the sequence: Gly 32-Pro 33-Arg 34-Leu 35-Ala 36-Thr 37. It adopts the canonical conformation observed in other serine protease inhibitors (Bode and Huber 1992). In Nt-Dy10, on the other hand, the loop (Gly 32-Arg 33-Leu 34-Pro 35-Trp 36-Ser 37) is shorter and, in Mod_1BIP at least, adopts a different conformation. To test whether Nt-Dy10 is able to inhibit trypsin, it might be necessary to insert a proline in the loop between Gly 32 and Arg 33 and possibly replace Pro 35 and Trp 36 by the corresponding residues (Ala and Thr) in the 1BIP loop.

Of course, these engineering experiments would not provide a clue to the present function of Nt-Dy10 but would confirm experimentally that the overall 3D organization of this domain is correct.


    Conclusion
 TOP
 Abstract
 Introduction
 Results
 Discussion
 Conclusion
 Materials and methods
 References
 
Using a fold-recognition technique, FROST, we identified a fold compatible with the N-terminal domain of HMW-GS Dy10. This fold corresponds to the one adopted by proteins of the cereal inhibitor family. Starting from the known structure of three members of the family: the bifunctional {alpha}-amylase/trypsin inhibitor of ragi seed, the bifunctional corn Hageman factor inhibitor, and the 0.19 {alpha}-amylase inhibitor, we built three models of this N-terminal domain. We analyzed these models and discussed their properties in relation to the possible modes of interaction of y-type HMW-GSs in glutenin polymers.

We believe that our models can be a help in the design of new experiments aimed at the elucidation of the relationship between the protein structure and its (biomechanical) properties.

The three models of the N-terminal domain of HMW-GS Dy10 can be downloaded from our Website, http://www-mig.jouy.inra.fr/mig/downloads.html.


    Materials and methods
 TOP
 Abstract
 Introduction
 Results
 Discussion
 Conclusion
 Materials and methods
 References
 
Sequence analysis was performed using SEG (Wootton and Federhen 1993) for detecting segments of low complexity in the HMW-GSy10 sequence, COILS (Lupas et al. 1991) for searching for coiled-coils, and PSI-PRED (McGuffin et al. 2000) for predicting the secondary structure of the N-terminal domain. These three methods were used with default parameters.

Homology searches were carried out with BLAST (Altschul et al. 1990) and PSI-BLAST (Altschul et al. 1997) against the NCBI nonredundant database. Default parameters were used for BLAST and PSI-BLAST except that, for the latter, the threshold for including homologous proteins in the profile was set to 0.0005 and the number of iterations (runs) was set to 20, as recommended by Park et al. (1998). Domain search was performed with HHMER (Hughey and Krogh 1996) against the PFAM database of domains (Bateman et al. 2002) with default parameters.

Fold recognition was performed with FROST (Marin et al. 2002a,b) with a database containing ~1200 cores.

Structural alignments between 1BIP, 1BEA, and 1HSS were performed with VAST (Gibrat et al. 1996).

Models were built with MODELLER (Sali and Blundell 1993) starting from the alignments shown in Figure 3BGo. For each model, 10 structures were generated, and the one with the best objective function was kept.

Molecular dynamics simulations were carried out with AMBER6 (Cornell et al. 1995). Each model was solvated in a box of equilibrated water molecules, the size of the box being calculated such that there are at least 10 Å between any protein atom and the box edges. The whole system was then minimized. Starting from the minimized structure, a molecular dynamics simulation was run for 300 psec (200 psec for Mod_1HSS) using periodic boundary conditions at constant temperature (T = 298 K) and constant volume. We used a time step of 1 fsec. Conformations along the trajectory were saved every 0.1 psec.

Calculation of the accessible surface area for the 1HSS dimer was performed with NACCESS (Hubbard et al. 1991) using standard parameters.


    Acknowledgments
 
We thank M.-F. Gautier for helping us to trace back the cryptic comment in SWISS-PROT about the role of CM proteins in the technological quality of wheat products, to the relevant bibliography.

The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.


    References
 TOP
 Abstract
 Introduction
 Results
 Discussion
 Conclusion
 Materials and methods
 References
 
Altschul, S., Gish, W., Miller, W., Myers, E., and Lipman, D. 1990. Basic local alignment search tool. J. Mol. Biol. 215: 403–410.[CrossRef][Medline]

Altschul, S., Madden, T., Schaffer, A., Zhang, J., Zhang, Z., Miller, W., and Lipman, D. 1997. Gapped BLAST and PSI-BLAST: A new generation of protein database search programs. Nucleic Acids Res. 25: 3389–3402.[Abstract/Free Full Text]

Anderson, O. and Greene, F. 1989. The characterization and comparative analysis of high molecular weight glutenin genes from genomes A and B of a hexploid bread wheat. Theor. Appl. Genet. 77: 689–700.

Anderson, O., Greene, F., Yip, R., Halford, N., Shewry, P., and Malpica, R. 1989. Nucleotide sequences of the two high-molecular-weight glutenin genes from the D-genome of a hexaploid bread wheat, Triticum aestivum L. cv Cheyenne. Nucleic Acids Res. 17: 461–462.[Free Full Text]

Bateman, A., Birney, E., Cerruti, L., Durbin, R., Etwiller, L., Eddy, S., Griffiths, J., Howe, K., Marshall, M., and Sonnhammer, E. 2002. The Pfam protein families database. Nucleic Acids Res. 30: 276–280.[Abstract/Free Full Text]

Behnke, C., Yee, V., Trong, I., Pedersen, L., Stenkamp, R., Kim, S., Reeck, G., and Teller, D. 1998. Structural determinants of the bifunctional corn Hageman factor inhibitor: X-Ray crystal structure at 1.95 Å resolution. Biochemistry 37: 15277–15288.[CrossRef][Medline]

Bode, W. and Huber, R. 1992. Natural protein proteinase inhibitors and their interaction with proteinases. Eur. J. Biochem. 204: 433–451.[Medline]

Brenner, S., Chothia, C., and Hubbard, T. 1998. Assessing sequence comparison methods with reliable structurally identified distant evolutionary relationships. Proc. Natl. Acad. Sci. 95: 6073–6078.[Abstract/Free Full Text]

CASP4: Critical Assessment of Techniques for Protein Structure Prediction. 2001. Progress report on 4th meeting. Proteins Suppl. 5; http://predictioncenter.llnl.gov.

Cornell, W., Cieplak, P., Bayly, C., Gould, I., Merz, J., Ferguson, D., Spellmeyer, D., Fox, T., Caldwell, J., and Kollman, P. 1995. A second generation force field for the simulation of proteins and nucleic acids. J. Am. Chem. Soc. 117: 5179–5197.[CrossRef]

Garcia, M., Carbonero, P., and Garcia, O. 1991. Site-directed mutagenesis and expression in Escherichia coli of WMAI-1, a wheat monomeric inhibitor of insect {alpha}-amylase. Plant Mol. Biol. 17: 1005–1011.[CrossRef][Medline]

Gautier, M., Alary, R., Kobrehel, K., and Joudrier, P. 1989. Chloroform/methanol soluble proteins are the main components of Triticum durum sulfur-rich glutenin fractions. Cereal Chem. 66: 535.

Gibrat, J.-F., Madej, T., and Bryant, S.H. 1996. Surprising similarities in structure comparison. Curr. Opin. Struct. Biol. 6: 377–385.[CrossRef][Medline]

Gupta, R., Popineau, Y., Lefebvre, J., Cornec, M., Lawrence, G., and MacRitchie, F. 1995. Biochemical basis of flour properties in bread wheats. II. Changes in polymeric protein formation and dough gluten properties associated with the loss of low molecular or high molecular glutenin subunits. J. Cereal Sci. 21: 103–116.

Hubbard, S., Campbell, S., and Thornton, J. 1991. Molecular recognition. Conformational analysis of limited proteolytic sites and serine proteinase protein inhibitors. J. Mol. Biol. 220: 507–530.[CrossRef][Medline]

Hughey, R. and Krogh, A. 1996. Hidden Markov models for sequence analysis: Extension and analysis of the basic method. Comput. Appl. Biosci. 12: 95–107.[Abstract/Free Full Text]

Humphris, A., McMaster, T., Miles, M., Gilbert, S., Shewry, P., and Tatham, A. 2000. Atomic force microscopy (AFM) study of HMW subunits of wheat glutenin. Cereal Chem. 77: 107–110.

Kasarda, D.D. 1999. Glutenin polymers: The in vitro to in vivo transition. Cereal Foods World 44: 566–571.

Kasarda, D.D., King, G., and Kumosinski, T.F. 1994. Comparison of ß-spiral structures in wheat high molecular weight glutenin subunits and elastin, by molecular modeling. In Proceedings of the Symposium on Molecular Modeling (eds. T.F. Kumosiuski and M.N. Liebman). ACS, Washington, DC.

Keck, B., Köhler, P., and Wieser, H. 1995. Disulphide bonds in wheat gluten: Cystine peptides derived from gluten proteins following peptic and thermolytic digestion. Z. Lebensm. Unters. Forsch. 200: 432–439.[CrossRef][Medline]

Kobrehel, K. and Alary, R. 1989. The role of a low molecular weight glutenin fraction in the cooking quality of durum wheat pasta. J. Sci. Food Agric. 47: 487–500.[CrossRef]

Köhler, P., Belitz, H., and Wieser, H. 1991. Disulphide bonds in wheat gluten: Isolation of a cysteine peptide from glutenin. Z. Lebensm. Unters. Forsch. 192: 234–239.[CrossRef][Medline]

———. 1993. Disulphide bonds in wheat gluten: Further cysteine peptides from high molecular weight (HMW) and low molecular weight (LMW) subunits of glutenin and from {gamma}-gliadins. Z. Lebensm. Unters. Forsch. 196: 239–247.[CrossRef][Medline]

Köhler, P., Keck-Gassenmeier, B., Wieser, H., and Kasarda, D.D. 1997. Molecular modeling of the N-terminal regions of high molecular weight glutenin subunits 7 and 5 in relation to intramolecular disulfide bond formation. Cereal Chem. 74: 154–158.

Lupas, A., Van, D., and Stock, J. 1991. Predicting coiled coils from protein sequences. Science 252: 1162–1164.[CrossRef][Medline]

Marin, A., Pothier, J., Zimmermann, K., and Gibrat, J.-F. 2002a. FROST: A filter based fold recognition method. Proteins 49: 493–509.[CrossRef][Medline]

———. 2002b. Protein threading statistics: An attempt to assess the significance of a fold assignment to a sequence. In Protein structure prediction: Bioinformatics approach (ed. I.F. Tsigelny), Chap. 9, pp. 227–262. International University Line, La Jolla, CA.

Matsushima, N., Creutz, C., and Kretsinger, R. 1990. Polyproline, ß-turn helices. Novel secondary structures proposed for the tandem repeats within rhodopsin, synaptophysin, synexin, gliadin, RNA polymerase II, hordein, and gluten. Proteins 7: 125–155.[CrossRef][Medline]

Matsushima, N., Danno, G., Sasaki, N., and Izumi, Y. 1992. Small-angle X-ray scattering study by synchrotron orbital radiation reveals that high molecular weight subunit of glutenin is a very anisotropic molecule. Biochem. Biophys. Res. Commun. 186: 1057–1064.[CrossRef][Medline]

McGuffin, L., Bryson, K., and Jones, D. 2000. The PSIPRED protein structure prediction server. Bioinformatics 16: 404–405.[Abstract/Free Full Text]

Miles, M.J., Carr, H.J., McMaster, T.C., I’Anson, K., Belton, P.S., Morris, V.J., Field, J.M., Shewry, P.R., and Tatham, A.S. 1991. Scanning tunneling microscopy of a wheat seed storage protein reveals details of an unusual supersecondary structure. Proc. Natl. Acad. Sci. 88: 68–71.[Abstract/Free Full Text]

Oda, Y., Matsunaga, T., Fukuyama, K., Miyazaki, T., and Morimoto, T. 1997. Tertiary and quaternary structures of 0.19 {alpha}-amylase inhibitor from wheat kernel determined by X-ray analysis at 2.06 Å resolution. Biochemistry 36: 13503–13511.[CrossRef][Medline]

Parchment, O., Shewry, P., Tatham, A., and Osguthorpe, D. 2001. Molecular modeling of unusual structure in elastomeric wheat seed protein. Cereal Chem. 78: 658–662.

Park, J., Karplus, K., Barrett, C., Hughey, R., Haussler, D., Hubbard, T., and Chothia, C. 1998. Sequence comparisons using multiple sequences detect three times as many remote homologues as pairwise methods. J. Mol. Biol. 284: 1201–1210.[CrossRef][Medline]

Payne, P. 1987. Genetics and wheat storage protein and the effect of allelic variation on breadmaking quality. Ann. Rev. Plant Physiol. 38: 141–153.[CrossRef]

Payne, P., Corfield, K., and Blackman, J. 1979. Identification of a high-molecular-weight subunit of glutenin whose presence correlates with bread-making quality in wheats of related pedigree. Theor. Appl. Genet. 55: 153–159.[CrossRef]

Pearson, W. 2000. Flexible sequence similarity searching with the FASTA3 program package. Methods Mol. Biol. 132: 185–219.[Medline]

Popineau, Y., Cornec, M., Lefebvre, J., and Marchylo, B. 1994. Influence of high molecular glutenin subunits on glutenin polymers and rheological properties of glutens and gluten sub-fractions of near-isogenic lines of wheat Sicco. J. Cereal Sci. 19: 231–241.[CrossRef]

Sali, A. and Blundell, T. 1993. Comparative protein modelling by satisfaction of spatial restraints. J. Mol. Biol. 234: 779–815.[CrossRef][Medline]

Shewry, P. and Tatham, A. 1997. Disulphide bonds in wheat gluten proteins. J. Cereal Sci. 25: 207–227.[CrossRef]

Shewry, P., Halford, N., and Tatham, A. 1992. High molecular weight subunits of wheat glutenin. J. Cereal Sci. 15: 105–120.

Shimoni, Y., Blechl, A., Anderson, O., and Galili, G. 1997. A recombinant protein of two high molecular weight glutenins alters gluten polymer formation in transgenic wheat. J. Biol. Chem. 272: 15488–15495.[Abstract/Free Full Text]

Strobl, S., Muhlhahn, P., Bernstein, R., Wiltscheck, R., Maskos, K., Wunderlich, M., Huber, R., Glockshuber, R., and Holak, T. 1995. Determination of the three-dimensional structure of the bifunctional {alpha}-amylase/trypsin inhibitor from ragi seeds by NMR spectroscopy. Biochemistry 34: 8281–8293.[CrossRef][Medline]

Strobl, S., Maskos, K., Wiegand, G., Huber, R., Gomis, R., and Glockshuber, R. 1998. A novel strategy for inhibition of {alpha}-amylases: Yellow meal worm {alpha}-amylase in complex with the Ragi bifunctional inhibitor at 2.5 Å resolution. Structure 6: 911–921.[Medline]

Tao, H., Adalsteins, A., and Kasarda, D. 1992. Intermolecular disulfide bonds link specific high-molecular-weight glutenin subunits in wheat endosperm. Biochim. Biophys. Acta 1159: 13–21.[CrossRef][Medline]

Thomson, N., Miles, M., Popineau, Y., Harries, J., Shewry, P., and Tatham, A. 1999. Small angle X-ray scattering of wheat seed-storage proteins: {alpha}-, {gamma}- and {omega}-gliadins and the high molecular weight (HMW) subunits of glutenin. Biochim. Biophys. Acta 1430: 359–366.[CrossRef][Medline]

Wootton, J. and Federhen, S. 1993. Statistics of local complexity in amino acid sequences and sequence databases. Computers Chem. 17: 149–163.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?



This Article
Right arrow Abstract Freely available
Right arrow Full Text (PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Cazalis, R.
Right arrow Articles by Gibrat, J.-F.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Cazalis, R.
Right arrow Articles by Gibrat, J.-F.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?


HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH TABLE OF CONTENTS