|
|
||||||||
Laboratory of Molecular Biology, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bethesda, Maryland 20892-4255, USA
Reprint requests to: Dr. Byungkook Lee, Laboratory of Molecular Biology, Center for Cancer Research, National Cancer Institute, National Institutes of Health, Bldg. 37, Room 4B15, 37 Convent Drive MSC 4255, Bethesda, MD 20892-4255, USA; e-mail: bk{at}nih.gov ; fax:(301) 402-1344.
(RECEIVED February 7, 2001; FINAL REVISION June 14, 2001; ACCEPTED June 14, 2001)
Article and publication are at http://www.proteinscience.org/cgi/doi/10.1101/
| Abstract |
|---|
|
|
|---|
Keywords: Circular permutation; protein structure; structure alignment; gene duplication
| Introduction |
|---|
|
|
|---|
/ß proteins, but saposins (Ponting and Russell 1995; Liepinsh et al. 1997) are
-helical proteins. In most known cases, the N and C termini are close to each other (Thornton and Sibanda 1983), but we have found in this work many examples wherein the two termini are not close together. Detecting repeated sequence segments and circularly permuted proteins from a sequence database has been reported recently (Marcotte et al. 1999; Uliel et al. 1999). Here we report the results of a systematic search for protein pairs that have similar structures, but the structural alignment of which requires circular permutation of one of the sequences. | Results and Discussion |
|---|
|
|
|---|
The automatic procedure found most known CP relations, including those between plant lectins (Cunningham et al. 1979), bacterial glucanases (Heinemann and Hahn 1995b), (ß/
)8 barrel proteins (Sergeev and Lee 1994; Jia et al. 1996; Macgregor et al. 1996; Garcia-Vallve et al. 1998), the C2 domain proteins (Nalefski and Falke 1996), ferredoxins (Jung and Lee 2000), flavin-binding ß-barrel domains (Murzin 1998), the six-stranded double-
ß-barrels (Castillo et al. 1999), and the DNA and other methyltransferases (Jeltsch 1999). Some new examples of CP-related protein pairs are shown in Figure 1
. When a protein has a symmetric structure, it aligns to itself and to other structurally similar proteins both with and without circular permutation of its sequence. One can use this property to identify symmetric structures. Therefore, we operationally define a protein to be symmetric if it is related to another protein both with and without circular permutation and if the two alignments are judged to be distinct (see Materials and Methods). One feature that can be noted from the structures shown in Figure 1
is that the N and C termini are far apart in many of the structures. The proximity of the N and C termini are not a prerequisite condition for circular permutation.
|
-helical pieces or small
+ ß motifs, which resemble a part of many larger proteins. Most of the symmetric CP relations (green dots in Fig. 2
|
|
|
Some 71% of the circularly permuted proteins (1025 of 1433) have symmetric structures. The number of symmetric proteins detected here is therefore 34% of the total number of proteins. These structures might have arisen from ancient gene duplication events (Lang et al. 2000). Marcotte et al. (1999) reported that duplicated gene segments occur in 14% of all protein sequences and more than 20% of all eukaryotic proteins. These must reflect relatively recent gene duplication events because they were detected by sequence homology. In the case of the symmetric structural domains detected here, the sequence homology is generally low; only 91 of the 34,581 symmetric circularly permuted pairs have >30% sequence identity between them. If the symmetry has indeed arisen from gene duplication events, therefore, most of them must be ancient events. Alternatively, one cannot rule out the possibility that at least some of these structures arose without a gene duplication event (convergent evolution).
| Materials and methods |
|---|
|
|
|---|
nb. Circular permutation using this new cut position increases the number of matched residues in the structural superposition.
|
a of mab about ma was also computed. The match scores were then converted to z-score zab, which was defined as (mab - ma)/
a. For the straight structural alignment, a pair of proteins were considered to be structurally related when zab was >5.0. This z-score cutoff value is the same as that used previously for clustering protein structures into groups of similar structures (Jung and Lee 2000). This particular value was chosen primarily because the number of multimember clusters reached a plateau of maximum value at this cutoff value. Two proteins were considered to be related by circular permutation (CP related) if za`b is >5.0, where a` is the permuted protein, and if the number of matched residues of the C- and N-terminal parts of the permuted protein were both >10% of the total number of matched residues for the protein pair.
Criteria for distinct alignment
Two alignments were judged to be distinct if the mean alignment shift per residue,
r (Jung and Lee 2000), was greater than 5 positions between the two alignments.
| Acknowledgments |
|---|
The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.
| References |
|---|
|
|
|---|
Berman, H.M., Westbrook, J., Feng, Z., Gilliland, G., Bhat, T.N., Weissig, H., Shindyalov, I.N., and Bourne, P.E. 2000. The Protein Data Bank. Nucleic Acids Res. 28: 235242.
Bowles, D.J., Marcus, S.E., Pappin, D.J., Findlay, J.B., Eliopoulos, E., Maycox, P.R., and Burgess, J. 1986. Posttranslational processing of concanavalin A precursors in jackbean cotyledons. J. Cell Biol. 102: 12841297.
Carrington, D.M., Auffret, A., and Hanke, D.E. 1985. Polypeptide ligation occurs during post-translational modification of concanavalin A. Nature 313: 6467.[CrossRef][Medline]
Castillo, R.M., Mizuguchi, K., Dhanaraj, V., Albert, A., Blundell, T.L., and Murzin, A.G. 1999. A six-stranded double-
ß barrel is shared by several protein superfamilies. Structure Fold Des. 7: 227236.[Medline]
Cunningham, B.A., Hemperly, J.J., Hopp, T.H., and Edelman, G.E. 1979. Favin versus concanavalin A: Circularly permuted amino acid sequences. Proc. Natl. Acad. Sci. USA 76: 32183222.
Doolittle, W.F. 1987. What introns have to tell us: Hierarchy in genome evolution. Cold Spring Harbor Symp. Quant. Biol. 52: 907913.
Garcia-Vallve, S., Rojas, A., Palau, J., and Romeu, A. 1998. Circular permutants in ß-glucosidases (family 3) within a predicted double-domain topology that includes a (ß/
)8-barrel. Proteins 31: 214223.[CrossRef][Medline]
Gilbert, W. 1987. The exon theory of genes. Cold Spring Harbor Symp. Quant. Biol. 52: 901905.
Heinemann, U. and Hahn, M. 1995a. Circular permutation of polypeptide chains: Implications for protein folding and stability. Prog. Biophys. Mol. Biol. 64: 121143.[CrossRef][Medline]
. 1995b. Circular permutations of protein sequence: Not so rare? Trends Biochem. Sci. 20: 349350.[CrossRef][Medline]
Iwakura, M., Nakamura, T., Yamane, C., and Maki, K. 2000. Systematic circular permutation of an entire protein reveals essential folding elements. Nat. Struct. Biol. 7: 580585.[CrossRef][Medline]
Jeltsch, A. 1999. Circular permutations in the molecular evolution of DNA methyltransferases. J. Mol. Evol. 49: 161164.[CrossRef][Medline]
Jia, J., Huang, W., Schorken, U., Sahm, H., Sprenger, G.A., Lindqvist, Y., and Schneider, G. 1996. Crystal structure of transaldolase B from Escherichia coli suggests a circular permutation of the
/ß barrel within the class I aldolase family. Structure 4: 715724.[Medline]
Jung, J. and Lee, B. 2000. Protein structure alignment using environmental profiles. Protein Eng. 13: 535543.
Lang, D., Thoma, R., Henn-Sax, M., Sterner, R., and Wilmanns, M. 2000. Structural evidence for evolution of the ß/
barrel scaffold by gene duplication and fusion. Science 289: 15461550.
Liepinsh, E., Andersson, M., Ruysschaert, J.M., and Otting, G. 1997. Saposin fold revealed by the NMR structure of NK-lysin. Nat. Struct. Biol. 4: 793795.[CrossRef][Medline]
Lindqvist, Y. and Schneider, G. 1997. Circular permutations of natural protein sequences: Structural evidence. Curr. Opin. Struct. Biol. 7: 422427.[CrossRef][Medline]
Luger, K., Hommel, U., Herold, M., Hofsteenge, J., and Kirschner, K. 1989. Correct folding of circularly permuted variants of a ß
-barrel enzyme in vivo. Science 243: 206210.
Macgregor, E.A., Jespersen, H.M., and Svensson, B. 1996. A circularly permuted
-amylase-type
/ß-barrel structure in glucan-synthesizing glucosyltransferases. FEBS Lett. 378: 263266.[CrossRef][Medline]
Marcotte, E.M., Pellegrini, M., Yeates, T.O., and Eisenberg, D. 1999. A census of protein repeats. J. Mol. Biol. 293: 151160.[CrossRef][Medline]
McWherter, C.A., Feng, Y., Zurfluh, L.L., Klein, B.K., Baganoff, M.P., Polazzi, J.O., Hood, W.F., Paik, K., Abegg, A.L., Grabbe, E.S., et al. 1999. Circular permutation of the granulocyte colony-stimulating factor receptor agonist domain of myelopoietin. Biochemistry 38: 45644571.[CrossRef][Medline]
Murzin, A.G. 1998. Probable circular permutation in the flavin-binding domain. Nat. Struct. Biol. 5: 101.[CrossRef][Medline]
Murzin, A.G., Brenner, S.E., Hubbard, T., and Chothia, C. 1995. SCOP: A structural classification of proteins database for the investigation of sequence and structures. J. Mol. Biol. 247: 536540.[CrossRef][Medline]
Nakamura, T. and Iwakura, M. 1999. Circular permutation analysis as a method for distinction of functional elements in the M20 loop of Escherichia coli dihydrofolate. J. Biol. Chem. 274: 1904119047.
Nalefski, E.A. and Falke, J.J. 1996. The C2 domain calcium-binding motif: Structural and functional diversity. Protein Sci. 5: 23752390.[Abstract]
Patthy, L. 1993. Modular design of proteases of coagulation, fibrinolysis, and complement activation: Implications for protein engineering and structurefunction studies. Methods Enzymol. 222: 1021.[Medline]
Polekhina, G., Board, P.G., Gali, R.R., Rossjohn, J., and Parker, M.W. 1999. Molecular basis of glutathione synthetase deficiency and a rare gene permutation event. EMBO J. 18: 32043213.[CrossRef][Medline]
Ponting, C.P. and Russell, R.B. 1995. Swaposins: Circular permutations within genes encoding saposin homologues. Trends Biochem. Sci. 20: 179180.[CrossRef][Medline]
Sergeev, Y. and Lee, B. 1994. Alignment of ß-barrels in (ß/
)8 proteins using hydrogen bonding pattern. J. Mol. Biol. 244: 168182.[CrossRef][Medline]
Thornton, J.M. and Sibanda, B.L. 1983. Amino and carboxy-terminal regions in globular proteins. J. Mol. Biol. 167: 443460.[CrossRef][Medline]
Uliel, S., Fliess, A., Amir, A., and Unger, R. 1999. A simple algorithm for detecting circular permutations in proteins. Bioinformatics 15: 930936.
![]()
CiteULike
Connotea
Del.icio.us
Digg
Reddit
Technorati What's this?
This article has been cited by other articles:
![]() |
W.-C. Lo, C.-C. Lee, C.-Y. Lee, and P.-C. Lyu CPDB: a database of circular permutation in proteins Nucleic Acids Res., October 8, 2008; (2008) gkn679v1. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Viksna and D. Gilbert Assessment of the probabilities for evolutionary structural changes in protein folds Bioinformatics, April 1, 2007; 23(7): 832 - 841. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. N. Sarma, V. A. Manning, L. M. Ciuffetti, and P. A. Karplus Structure of Ptr ToxA: An RGD-Containing Host-Selective Toxin from Pyrenophora tritici-repentis PLANT CELL, November 1, 2005; 17(11): 3190 - 3202. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Weiner 3rd, G. Thomas, and E. Bornberg-Bauer Rapid motif-based prediction of circular permutations in multi-domain proteins Bioinformatics, April 1, 2005; 21(7): 932 - 937. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Yuan and C. Bystroff Non-sequential structure-based alignments reveal topology-independent core packing arrangements in proteins Bioinformatics, April 1, 2005; 21(7): 1010 - 1019. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. H. Shin, Y. Lou, J. Jancarik, H. Yokota, R. Kim, and S.-H. Kim Crystal structure of YjeQ from Thermotoga maritima contains a circularly permuted GTPase domain PNAS, September 7, 2004; 101(36): 13198 - 13203. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. HOANG and A. R. FERRE-D'AMARE Crystal structure of the highly divergent pseudouridine synthase TruD reveals a circular permutation of a conserved fold RNA, July 1, 2004; 10(7): 1026 - 1033. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Bulaj, R. E. Koehn, and D. P. Goldenberg Alteration of the disulfide-coupled folding pathway of BPTI by circular permutation Protein Sci., May 1, 2004; 13(5): 1182 - 1196. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. Phlippen, K. Hoffmann, R. Fischer, K. Wolf, and M. Zimmermann The Glutathione Synthetase of Schizosaccharomyces pombe Is Synthesized as a Homodimer but Retains Full Activity When Present as a Heterotetramer J. Biol. Chem., October 10, 2003; 278(41): 40152 - 40161. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. P. Williams Descent of a split RNA Nucleic Acids Res., May 1, 2002; 30(9): 2025 - 2030. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |