The Burnham Institute, La Jolla, California 92037, USA
Reprint requests to: Yuzhen Ye or Adam Godzik, Program in Bioinformatics and Systems Biology, The Burnham Institute, 10901 N. Torrey Pines Road, La Jolla, CA 92037, USA; e-mail: yye{at}burnham.org or adam{at}burnham.org; fax: (858) 646-3171.
(RECEIVED December 23, 2003; FINAL REVISION March 27, 2004; ACCEPTED March 29, 2004)
| Abstract |
|---|
Keywords: flexible structure comparison; significance analysis; structural database search
Article and publication are at http://www.proteinscience.org/cgi/doi/10.1110/ps.03602304.
Supplemental material: see http://fatcat.burnham.org
| Introduction |
|---|
Over 50 different structure comparison programs have been developed to answer the latter question (for review, for instance, see Guerra and Istrail 2000). Significantly fewer programs can also answer the first question, at least in a reasonable time and with verified accuracy. The most popular ones are DALI (Holm and Sander 1993), CE (Shindyalov and Bourne 1998), and VAST (Madej et al. 1995; Gibrat et al. 1996). Sensitivity and selectivity of the popular structure comparison programs have been compared recently (Sierk and Pearson 2004). Resources such as SCOP (Murzin et al. 1995) and CATH (Orengo et al. 1997) address the same question, but were created on the basis of both human analysis and automated comparison programs. For comparing two structures, we need to search for the optimal alignment between them, whereas for identifying a similar structure in a database, we need to answer a different and more difficult question: Is the similarity between proteins A and B more significant than the similarity between A and C? To answer this question, one has to develop an estimate of the statistical significance of the structural alignment, whereas most structure comparison programs simply report the length of the alignment and a similarity measure, such as root mean square deviation (RMSD). For instance, the most popular programs, DALI and CE, both report a Z-score as the similarity measurement, but they do not provide proof that the distribution of their raw scores is Gaussian. Only one algorithm (Gerstein and Levitt 1998) has been extensively analyzed in this respect (Levitt and Gerstein 1998).
A large number of existing structure comparison programs is the best proof that the structural comparison problem is far from being solved (Holm and Sander 1993; Boutonnet et al. 1995; Madej et al. 1995; Shindyalov and Bourne 1998; Eidhammer et al. 2001). There is no unique and objective measure for structural similarity between proteins (Yang and Honig 2000), and there is not even a unique way of aligning two protein structures (Godzik 1996). Existing structure comparison algorithms differ by minimization strategy and the similarity function being optimized (Holm and Sander 1993). But despite all the differences, most of them share a fundamental limitation: They compare proteins as rigid bodies and thus concentrate on the question of identifying the largest common substructure between two proteins. At the same time, it is well known that proteins are flexible and undergo significant structural changes as part of their normal function and as a result of mutations (Wuthrich and Wagner 1978; Schulz and Schirmer 1979; Bennett and Huber 1984; Jacobs et al. 2001). Only very few methods, such as least-squares fitting (Wriggers and Schulten 1997), multiple linkage clustering (Boutonnet et al. 1995; Ochagavia et al. 2002), FlexProt (Shatsky et al. 2002), and FATCAT (Ye and Godzik 2003) treat protein structures, true to their real nature, as flexible. Such programs ask: How can one of the structures be rearranged to make it more similar to the other one? instead of the usual, What is the largest similar part of two proteins? This seemingly simple change has profound consequences, for instance, in applying results of flexible structure alignments to modeling.
None of the current flexible protein structure comparison algorithms has been used for database searches because of a lack of an accurate statistical estimate of alignment significance. For instance, in FATCAT, the similarity score describes how well two structures are superimposed, but the score is strongly correlated with the length of the structures being compared. As a result, protein A may seem to be more similar to protein B than to protein C just because B is longer than C, whereas in fact the alignment between A and B may be worse. In this paper, we describe a method to evaluate the significance of structural similarity that is independent of the length of proteins, which makes database searching using FATCAT possible.
It is well known that the alignment score between random sequences follows the extreme value distribution (EVD; Pearson 1998). This was rigorously proven only for alignment without gaps (Altschul and Gish 1996), but empirically it was shown to hold for a variety of alignment algorithms and gap penalty functions (Pearson 1998). The EVD parameters can be fitted to an empirical distribution of random scores and used for a statistical estimate of the significance of a particular score. In two rigid-body structural alignment programs used for database searches, DALI (Holm and Sander 1993) and CE (Shindyalov and Bourne 1998), a Z-score is used for similarity evaluation, apparently assuming that the score distribution is Gaussian. Levitt and Gerstein (1998) have shown that the score of their simple structure comparison algorithm (Gerstein and Levitt 1998) behaves like a sequence similarity score, following the EVD, whereas RMSD follows a different distribution and does not perform as well as the structural alignment score. However, their method is a generalization of the Needleman-Wunsch sequence alignment, using repeated cycles of dynamic programming and least-squares superposition of two structures, in which dynamic programming is performed on the distance matrix derived from the least-squares superposition; it was not really tested against other genuine structure comparison methods (Gerstein and Levitt 1998). Recently, Dewey and colleagues (Y. Jia, G.T. Dewey, I.N. Shindyalov, and P.E. Bourne, in prep.) showed that in gapless alignments of structural fragments of the same length, the RMSD of their optimal superposition follows the EVD type of distribution. In this paper, we show that the FATCAT similarity score between two unrelated structures also follows the EVD. On the basis of this, a statistical significance evaluation method was developed for a FATCAT-based database-searching tool (FATCAT-search) and tested on a large benchmark.
Also, we show the results of applying FATCAT-search to several protein structures that were recently determined by Protein Structure Initiative Centers (Lesley et al. 2002). Despite successful structure determination, the functions of these proteins remain unknown. The advantages of flexible structural alignments in structure-similarity-based functional annotation are illustrated in these cases. FATCAT-search was able to identify similar structures and align active site residues for protein 1ufh (PDB code), which led to a reliable prediction of its function as an acetyltransferase. In addition, flexibility was detected, which we hypothesize correlates with the substrate specificity of acetyltransferase. In the second case, FATCAT-search results suggest that the original annotation of the structures with an unusual trefoil knot as an early prototype of a TIM barrel (Zarembinski et al. 2003) may be wrong; instead, the similarity between knot structures and TIM barrel proteins is probably caused by the similarity in the building blocks they share, because significant similarities (some even stronger) were also detected between knot structures and structures from other folds.
| Materials and methods |
|---|
Flexible structure alignment is formulated as the AFP chaining process (Gusfield 1999), allowing at most t twists, and the flexible structure alignment is reduced to a rigid structure alignment when t equals 0. Dynamic programming is used in the chaining process, which can combine gaps and twists between consecutive AFPs, each with its own score penalty. We denote S(k) as the best score ending at AFP k, which can be calculated from the best ending at previous AFPs that can be connected with AFP k subject to the constraints of the consecutive,
$$S(k) = a(k) + \max_{m}\,\bigl\{\, S(m) + c(m \rightarrow k),\; 0 \,\bigr\}, \quad \text{s.t. } T(k) \le t \tag{1}$$
where a(k) is the score of AFP k itself; c(m → k) is the score of introducing a connection between AFP m and AFP k; and T(k) is the number of twists required to connect the chain of AFPs leading up to S(k), which is calculated by
$$T(k) = T(m) + t(m \rightarrow k) \tag{2}$$
where t(m → k) is 1 if a twist is required to connect AFP m and AFP k and 0 if no twist is required. We denote the maximum S(k) as the FATCAT chaining score, cs, which is used below.
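The chaining recurrence of equations 1 and 2 can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the FATCAT implementation: the `AFP` tuple layout, the `connect` callback, and the toy rule `toy_connect` are hypothetical names introduced for the example. In FATCAT, whether a twist is needed follows from the 3D geometry of the two AFPs, which is not modeled here. The sketch also keeps one best score per twist count, which is one straightforward way to enforce T(k) ≤ t.

```python
from typing import Callable, List, Optional, Tuple

# Hypothetical AFP record: (start in protein 1, start in protein 2, length, score a(k)).
AFP = Tuple[int, int, int, float]
# connect(m, k) returns (c, twist) or None when AFP m cannot be followed by AFP k.
Connect = Callable[[AFP, AFP], Optional[Tuple[float, int]]]


def chain_afps(afps: List[AFP], connect: Connect, max_twists: int) -> float:
    """Return the best chaining score cs using at most max_twists twists."""
    order = sorted(afps, key=lambda f: (f[0], f[1]))
    neg_inf = float("-inf")
    # best[k][j]: best score of a chain ending at AFP k that uses exactly j twists.
    best = [[neg_inf] * (max_twists + 1) for _ in order]
    cs = 0.0
    for k, afp_k in enumerate(order):
        best[k][0] = afp_k[3]  # a chain may start at AFP k (the "0" branch of eq. 1)
        for m in range(k):
            link = connect(order[m], afp_k)
            if link is None:
                continue
            c, twist = link
            # Only extend chains whose twist count stays within the limit (eq. 2).
            for j in range(max_twists + 1 - twist):
                if best[m][j] == neg_inf:
                    continue
                candidate = best[m][j] + c + afp_k[3]
                if candidate > best[k][j + twist]:
                    best[k][j + twist] = candidate
        cs = max(cs, max(best[k]))
    return cs


def toy_connect(m: AFP, k: AFP) -> Optional[Tuple[float, int]]:
    """Toy rule (assumption): k must start after m ends in both proteins;
    a change of alignment diagonal is counted as a twist."""
    end1, end2 = m[0] + m[2], m[1] + m[2]
    if k[0] < end1 or k[1] < end2:
        return None
    gap = (k[0] - end1) + (k[1] - end2)
    twist = 1 if (k[0] - k[1]) != (m[0] - m[1]) else 0
    return (-0.5 * gap - 5.0 * twist, twist)


afps = [(0, 0, 8, 24.0), (10, 10, 8, 24.0), (20, 25, 8, 24.0)]
print(chain_afps(afps, toy_connect, 0))  # rigid mode: 46.0
print(chain_afps(afps, toy_connect, 1))  # one twist allowed: 60.5
```

With no twists allowed, only the first two AFPs can be chained; allowing one twist lets the third AFP join the chain at the cost of the twist penalty.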
Extreme value distribution
The EVD of a random variable s has the probability density of
$$f(s) = \frac{1}{\sigma} \exp\!\left(-\frac{s-\mu}{\sigma}\right) \exp\!\left[-\exp\!\left(-\frac{s-\mu}{\sigma}\right)\right] \tag{3}$$
where µ is the location parameter and σ is the scale parameter. These parameters can be determined by either the analytic formula or empirical simulation (Altschul and Gish 1996; Pearson 1998). Here we follow the latter route. Once the parameters are determined, they can be used to compute the probability that the variable X takes a value greater than s (survival function; Evans et al. 2000):
$$P(X > s) = 1 - \exp\!\left[-\exp\!\left(-\frac{s-\mu}{\sigma}\right)\right] \tag{4}$$
It has been reported that different fitting methods give almost identical results in determining the parameters (Altschul and Gish 1996). In this study, we use the fitting function based on the Nelder-Mead simplex method "fmins," implemented in Matlab (Nelder and Mead 1965).
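As a rough illustration of how the EVD density and survival function turn a score into a P-value, here is a small standard-library Python sketch. The function names are hypothetical. Note that the fit shown is a method-of-moments estimate, swapped in for brevity; it is not the Nelder-Mead simplex fit used in this study, although the text above notes that different fitting methods give almost identical parameters.

```python
import math
import random
from typing import List, Tuple

EULER_GAMMA = 0.5772156649015329  # Euler-Mascheroni constant


def evd_pdf(s: float, mu: float, sigma: float) -> float:
    """EVD probability density with location mu and scale sigma."""
    z = (s - mu) / sigma
    if z < -700.0:  # exp(-z) would overflow; the density is 0 to machine precision
        return 0.0
    return math.exp(-z - math.exp(-z)) / sigma


def evd_p_value(s: float, mu: float, sigma: float) -> float:
    """Survival function P(X > s) = 1 - exp(-exp(-(s - mu) / sigma))."""
    z = (s - mu) / sigma
    if z < -700.0:  # exp(-z) would overflow; the probability is 1 to machine precision
        return 1.0
    return -math.expm1(-math.exp(-z))


def fit_evd_moments(scores: List[float]) -> Tuple[float, float]:
    """Method-of-moments estimate of (mu, sigma); a stand-in for a simplex fit."""
    n = len(scores)
    mean = sum(scores) / n
    var = sum((x - mean) ** 2 for x in scores) / (n - 1)
    sigma = math.sqrt(6.0 * var) / math.pi
    mu = mean - EULER_GAMMA * sigma
    return mu, sigma


# Simulated "random" scores drawn from an EVD with mu = 5 and sigma = 2
# (made-up values), using the inverse of the cumulative distribution.
rng = random.Random(0)
sample = [5.0 - 2.0 * math.log(-math.log(rng.uniform(1e-12, 1.0 - 1e-12)))
          for _ in range(20000)]
mu_hat, sigma_hat = fit_evd_moments(sample)
print(mu_hat, sigma_hat)
print(evd_p_value(15.0, mu_hat, sigma_hat))  # P-value of a hypothetical score of 15
```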
Significance estimate of FATCAT score
We have developed the significance estimate for both FATCAT rigid-body mode (i.e., t = 0) and FATCAT flexible mode (i.e., t = 5). We did this in order to check the influence of introducing flexibility into structural comparison in terms of sensitivity and specificity in recognizing similar structures.
A good structural similarity measure should describe both the local structure similarity and the global structure similarity between two proteins. On the basis of this requirement, we have designed a similarity score of FATCAT in such a way that it takes into account factors such as the FATCAT chaining score (describing mainly the local and medium-range structure similarity; see equation 1), the global structure similarity between two structures described by the overall RMSD (which is defined as the root mean square deviation of all of the aligned Cα atoms, based on the rigid-body superposition after one structure is modified according to the FATCAT alignment), the number of equivalent positions in the alignment, and the number of twists. The similarity score is computed as
![]() | (5) |
where cs is the FATCAT chaining score; L is the number of equivalent positions in the alignment; RMSD is the overall RMSD between two structures when one structure is rearranged at the positions where twists are detected by FATCAT; N is the number of blocks in the alignment (number of twists + 1).
We collected a set of unrelated structure pairs for the simulation to determine the location and scale parameters of the EVD based on the SCOP classification (Murzin et al. 1995). We assume that any two structures from different SCOP folds constitute a pair of unrelated structures. Sometimes two structures from different SCOP folds could be similar (Shindyalov and Bourne 2000); a few such exceptions, however, will not significantly influence the parameter fitting. Because the parameters depend on the length of the structures, we need to fit the EVD parameters separately for random structures grouped by length. Although there are >20,000 structures in PDB (Berman et al. 2000), there are not enough nonredundant protein structures with the same length to fit distribution parameters for all lengths. To overcome this problem, we randomly cut regions of length m from structures that are longer than m, and used each region as an individual structure for the parameter simulation at length m. In this way, we prepared a set of structure libraries for lengths of 40, 70, 100, ..., 400 amino acids, with each library having up to 10,000 random structure pairs (the full lists are available at the FATCAT Web site).
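The library construction described above can be sketched as follows. This is only an illustration under assumptions: the `Entry` layout (a SCOP fold label plus a residue list) and both function names are hypothetical, and redundancy filtering is left out.

```python
import random
from typing import List, Sequence, Tuple

# Hypothetical entry layout: (SCOP fold label, list of residues or coordinates).
Entry = Tuple[str, Sequence]


def cut_random_region(residues: Sequence, m: int, rng: random.Random) -> Sequence:
    """Return a random contiguous region of length m from a structure longer than m."""
    if len(residues) <= m:
        raise ValueError("structure must be longer than m")
    start = rng.randrange(len(residues) - m + 1)
    return residues[start:start + m]


def build_pair_library(entries: List[Entry], m: int, n_pairs: int,
                       seed: int = 0) -> List[Tuple[Sequence, Sequence]]:
    """Build up to n_pairs unrelated fragment pairs of length m."""
    rng = random.Random(seed)
    usable = [e for e in entries if len(e[1]) > m]
    if len({fold for fold, _ in usable}) < 2:
        return []  # no pair from different folds can be formed
    pairs = []
    while len(pairs) < n_pairs:
        (fold_a, res_a), (fold_b, res_b) = rng.sample(usable, 2)
        if fold_a == fold_b:
            continue  # same SCOP fold: not treated as an unrelated pair
        pairs.append((cut_random_region(res_a, m, rng),
                      cut_random_region(res_b, m, rng)))
    return pairs


# Toy data: residues are represented by their indices only.
entries = [("a.1", list(range(120))), ("b.1", list(range(200))),
           ("a.1", list(range(90))), ("c.1", list(range(30)))]
library = build_pair_library(entries, m=40, n_pairs=50)
print(len(library), len(library[0][0]), len(library[0][1]))
```

One such library would be built for each length (40, 70, 100, and so on), and the scores of its pairs would then supply the empirical distribution for the EVD fit at that length.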
Database searching
We implemented the FATCAT-search, a FATCAT-based database searching tool to identify structural analogs for a given query structure. We used the 95% nonredundant SCOP database (Murzin et al. 1995) as the target database for searching instead of PDB (Berman et al. 2000) because SCOP provides a well-maintained annotation and classification of the structures and divides larger structures into domains. Although 0.05 is a typical significance level, we may also keep candidates with lower significance (for example, with P-value < 0.1) for further manual inspection to explore distantly similar structures.
| Results and Discussion |
|---|
The evaluation of the significance estimate here is focused on the ability of FATCAT to recognize similar structures in the benchmark described earlier as compared with CE and DALI. We adopted the ROC (Receiver Operating Characteristic) curve (Bradley 1997) to show the specificity and sensitivity of identifying similar structures for all of these programs. P-values of FATCAT and Z-scores of DALI and CE are used for ROC analysis. For each method, the protein pairs of both similar structures and dissimilar structures were sorted by the P-values or Z-scores, and then the true-positive fractions and false-positive fractions were calculated at different cutoffs. The performances of the three methods are shown in Figure 3, in which a random performance corresponds to the main diagonal, and the better the performance of a method, the farther the corresponding curve is from the main diagonal. FATCAT using the P-value clearly performed better than FATCAT using the raw chaining score (cs). Both rigid FATCAT and flexible FATCAT have better discriminating ability than CE, but they are worse than DALI, at least in this benchmark. We conclude that FATCAT works relatively well in database search, comparable to the two most popular structural comparison programs (DALI and CE). The introduction of flexibility in FATCAT results in only a small decrease in performance, laying to rest the worries that flexible alignment will significantly increase the number of false positives in recognition of similar structures.
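The ROC procedure described above (sort the pairs by score, then compute true-positive and false-positive fractions at each cutoff) can be sketched in a few lines of Python. The function names and the toy data are assumptions made for the example; the area under the curve is added only as a convenient one-number summary and is not a quantity reported in this section.

```python
from typing import List, Tuple


def roc_points(scored: List[Tuple[float, bool]],
               lower_is_better: bool = True) -> List[Tuple[float, float]]:
    """Return (false-positive fraction, true-positive fraction) points.

    scored holds one (score, is_similar) entry per protein pair. Use
    lower_is_better=True for P-values and False for Z-scores.
    """
    n_pos = sum(1 for _, is_similar in scored if is_similar)
    n_neg = len(scored) - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("need both similar and dissimilar pairs")
    ranked = sorted(scored, key=lambda item: item[0], reverse=not lower_is_better)
    points = [(0.0, 0.0)]
    tp = fp = 0
    for i, (score, is_similar) in enumerate(ranked):
        if is_similar:
            tp += 1
        else:
            fp += 1
        # Tied scores share a cutoff, so emit a point only after the last of them.
        if i + 1 == len(ranked) or ranked[i + 1][0] != score:
            points.append((fp / n_neg, tp / n_pos))
    return points


def area_under_curve(points: List[Tuple[float, float]]) -> float:
    """Trapezoidal area under the ROC curve (0.5 corresponds to the main diagonal)."""
    return sum((x1 - x0) * (y0 + y1) / 2.0
               for (x0, y0), (x1, y1) in zip(points, points[1:]))


# Toy P-values (made up): True marks a pair of similar structures.
pairs = [(1e-8, True), (1e-5, True), (0.01, False),
         (0.02, True), (0.3, False), (0.6, False)]
curve = roc_points(pairs, lower_is_better=True)
print(curve)
print(area_under_curve(curve))
```

For Z-scores from DALI or CE, the same function would be called with `lower_is_better=False`, since larger Z-scores indicate stronger similarity.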
≥ 2.0) and CE (Z-score ≥ 3.5), and another 122 pairs are confirmed by either DALI or CE. Selected examples are described below (see Table 1
Significant similarity between the structures in the examples shown earlier was detected by all three programs, FATCAT, CE, and DALI. In the 283 pairs that are found to be similar only by FATCAT, twists are present in 224 cases (detailed results are available at the FATCAT Web site). Selected examples are shown in Table 1 (E–H) and Figure 4 (E–H). Because these similarities are not very strong (P-value > 1e-4) and they are not confirmed by either DALI (i.e., Z-score < 2.0) or CE (i.e., Z-score < 3.5), we believe that they mostly reflect marginal similarity of analogous groups of secondary structure elements. For instance, similarities can often be found between two structures with the same number of helices even though their topologies are different, because helices themselves are similar and introducing twists easily transforms one into another. Indeed, such extreme cases are represented in our calculation. In 238 of 283 structure pairs, one or both structures are mainly composed of helices. In contrast, in only five cases, one or both structures are mainly composed of β-sheets. This suggests that, from a practical view, additional attention should be paid to low-significance similarity between helical structures, and that a P-value cutoff smaller than 0.05 may be needed if the detected structural similarity is also to be detectable by rigid-body comparison programs.
These and other related studies suggest that the current hierarchical classification of protein structures with strict division on separate fold-islands such as that used in SCOP (using a hierarchy of four levels; Murzin et al. 1995) and CATH (using a hierarchy of five levels; Orengo et al. 1997) does not necessarily reflect the actual hierarchy of protein structures (Ouzounis et al. 2003). Additional levels (for instance building blocks) between the fold and class may be necessary; neighborhoods of folds may share smaller structural units such as helical hairpins.
FATCAT-search examples
As mentioned in the introduction, a primary application of any protein structure similarity search procedure is to search for proteins that are similar to a newly determined protein structure. This type of analysis may provide helpful information for the functional study of the proteins, as distant homology can sometimes be recognized on the basis of structure similarity only. In this section, two cases are presented to demonstrate the FATCAT-search application for this task.
1ufh, a new acetyltransferase
The FATCAT-search shows that 1ufh (PDB code), a hypothetical protein in Bacillus subtilis, is similar to N-acetyl transferases (SCOP code d.108.1.1) from the acyl-CoA N-acyltransferases fold. The highest similarity is found between 1ufh and histone acetyltransferase HPA2 from Saccharomyces cerevisiae (SCOP code d1qsma_), with a FATCAT alignment of 133 positions with an RMSD of 3.08 Å (P-value 5.05e-10). Twists are found in comparing 1ufh with some of the acyltransferases. For instance, when comparing 1ufh with the tabtoxin resistance protein from Pseudomonas syringae (SCOP code d1ghea_), a twist was introduced by FATCAT to get an alignment of 139 aligned positions with an RMSD of 2.66 Å, which covers the whole structure (P-value 5.05e-08); otherwise, only the acyl coenzyme A (ACO) binding regions can be well superimposed because the remaining two helices (cap helices) have slightly different orientations in the two structures (Fig. 5). With the FATCAT alignments, the GANT motif (involved with ACO binding activity; He et al. 2003) and two of three putative active sites were confirmed to be conserved in 1ufh. The two conserved active sites are His 128 (could be Asp or His according to the known structures with acetyltransferase activity) and Tyr 97, and the site not conserved is Phe 90 (corresponding to Glu in the known acetyltransferases). We conclude that 1ufh may have acetyltransferase activity on the basis of these results. Furthermore, the conformational changes detected by FATCAT in comparing 1ufh with other acetyltransferases suggest that different acetyltransferases may have different substrate specificity, which is related to the different orientations of the cap helices; changing the packing between the two helices and the cofactor binding region provides an efficient way to change the size of the substrate binding pocket.
α/β knot fold. 1k3r, however, was described as a TIM-barrel-like structure in the original paper (Zarembinski et al. 2003). This annotation was based on the DALI comparisons, which show that 1k3r is significantly similar to TIM-barrel structures (1b5t with Z-score of 5.7, 1ezw with Z-score of 5.4, and 1a49 with Z-score of 4.6). The investigators further suggested that the knot structure of 1k3r may be an early prototype of a TIM barrel.
FATCAT-search results provide a different scenario. The knot domains of these three structures are very similar to each other and they are most similar to the two knot structures that were collected by SCOP version 1.63, d1ipaa1 and d1mxia_. More important, FATCAT-search shows that these structures are also very similar to structures from some other folds (Table 2). For instance, significant similarities are found by FATCAT between 1nxz and d1chd__ from fold c.40 and d1g8la3 from fold c.57 (Table 2, Fig. 6), which are even stronger than the similarities between 1nxz and some other knot structures, 1ns5, 1mxi, and 1ipa. Although low similarities were also found between these three structures and structures from the TIM-barrel fold (c.1), these similarities are less significant than those with the knot structures and with structures from folds c.40, c.57, and so forth. Twists were detected in some of the alignments (Table 2, Fig. 6). Inspection of these structural similarities shows that many of these folds share building blocks, in this case, / / modules (Fig. 6). We conclude that the knot structure is a type of gregarious fold (Harrison et al. 2002) that has significant overlap with structures from many other SCOP folds, and the similarities between knot structures and structures from other folds could be the result of their common ancestors or just a result of construction of protein structures from limited building blocks. However, we cannot derive a more detailed functional annotation for these gregarious knot structures when no dominant structural homologs/analogs with known functions can be reliably found.
The study of similarities between different structures is the most important application of protein comparison programs. To date, we still do not know much about the overall properties of the fold space even though there have been more than 20,000 structures in PDB and more than 700 folds defined in SCOP. For instance, it is not clear if the fold space is continuous or hierarchical; we do not know how many folds there are in the universe. Whatever the results, the rules of how proteins change and evolve have to start from describing fold flexibilities, such as provided by the FATCAT program.
| Electronic supplemental material |
|---|
| Acknowledgments |
|---|
The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.
| References |
|---|
Bennett, W. and Huber, R. 1984. Structural and functional aspects of domain motions in proteins. CRC Crit. Rev. Biochem. 15: 291–384.
Berman, H.M., Westbrook, J., Feng, Z., Gilliland, G., Bhat, T.N., Weissig, H., Shindyalov, I.N., and Bourne, P.E. 2000. The Protein Data Bank. Nucleic Acids Res. 28: 235–242.
Boutonnet, N.S., Rooman, M.J., Ochagavia, M.E., Richelle, J., and Wodak, S.J. 1995. Optimal protein structure alignments by multiple linkage clustering: Application to distantly related proteins. Protein Eng. 8: 647–662.
Bradley, A.P. 1997. The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recognition 30: 1145–1159.
Chambers, J.M., Cleveland, W.S., Kleiner, B., and Tukey, P.A. 1983. Graphical methods for data analysis. Chapman and Hall, New York.
Eidhammer, I., Jonassen, I., and Taylor, W.R. 2001. Structure comparison and structure patterns. J. Comput. Biol. 7: 685–716.
Evans, M., Hastings, N., and Peacock, B. 2000. Statistical distributions, 3rd ed. Wiley, New York.
Gerstein, M. and Levitt, M. 1998. Comprehensive assessment of automatic structural alignment against a manual standard, the scop classification of proteins. Protein Sci. 7: 445–456.
Gibrat, J.F., Madej, T., and Bryant, S.H. 1996. Surprising similarities in structure comparison. Curr. Opin. Struct. Biol. 6: 377–385.
Godzik, A. 1996. The structural alignment between two proteins: Is there a unique answer? Protein Sci. 5: 1325–1338.
Goldsmith-Fischman, S. and Honig, B. 2003. Structural genomics: Computational methods for structure analysis. Protein Sci. 12: 1813–1821.
Guerra, C. and Istrail, S. 2000. Mathematical methods for protein structure analysis and design. Springer Verlag, Berlin.
Gusfield, D. 1999. Algorithms on strings, trees and sequences: Computer science and computational biology, 2nd ed. Cambridge, New York.
Harrison, A., Pearl, F., Mott, R., Thornton, J., and Orengo, C. 2002. Quantifying the similarities within fold space. J. Mol. Biol. 323: 909–926.
He, H., Ding, Y., Bartlam, M., Sun, F., Le, Y., Qin, X., Tang, H., Zhang, R., Joachimiak, A., Liu, J., et al. 2003. Crystal structure of tabtoxin resistance protein complexed with acetyl coenzyme A reveals the mechanism for β-lactam acetylation. J. Mol. Biol. 325: 1019–1030.
Holm, L. and Sander, C. 1993. Protein structure comparison by alignment of distance matrices. J. Mol. Biol. 233: 123–138.
Jacobs, D.J., Rader, A.J., Kuhn, L.A., and Thorpe, M.F. 2001. Protein flexibility predictions using graph theory. Proteins 44: 150–165.
Lesley, S.A., Kuhn, P., Godzik, A., Deacon, A.M., Mathews, I., Kreusch, A., Spraggon, G., Klock, H.E., McMullan, D., Shin, T., et al. 2002. Structural genomics of the Thermotoga maritima proteome implemented in a high-throughput structure determination pipeline. Proc. Natl. Acad. Sci. 99: 11664–11669.
Levitt, M. and Gerstein, M. 1998. A unified statistical framework for sequence comparison and structure comparison. Proc. Natl. Acad. Sci. 95: 5913–5920.
Madej, T., Gibrat, J.F., and Bryant, S.H. 1995. Threading a database of protein cores. Proteins 23: 356–369.
Murzin, A.G., Brenner, S.E., Hubbard, T., and Chothia, C. 1995. SCOP: A structural classification of proteins database for the investigation of sequences and structures. J. Mol. Biol. 247: 536–540.
Nelder, J.A. and Mead, R. 1965. A simplex method for function minimization. Computer Journal 7: 308–313.
Nureki, O., Shirouzu, M., Hashimoto, K., Ishitani, R., Terada, T., Tamakoshi, M., Oshima, T., Chijimatsu, M., Takio, K., Vassylyev, D.G., et al. 2002. An enzyme with a deep trefoil knot for the active-site architecture. Acta Crystallogr. D Biol. Crystallogr. 58: 1129–1137.
Ochagavia, M.E., Richelle, J., and Wodak, S.J. 2002. Advanced pairwise structure alignments of proteins and analysis of conformational changes. Bioinformatics 18: 637–640.
Orengo, C.A., Michie, A.D., Jones, S., Jones, D.T., Swindells, M.B., and Thornton, J.M. 1997. CATH: A hierarchic classification of protein domain structures. Structure 5: 1093–1108.
Ouzounis, C.A., Coulson, R.M., Enright, A.J., Kunin, V., and Pereira-Leal, J.B. 2003. Classification schemes for protein structure and function. Nat. Rev. Genet. 4: 508–519.
Pearson, W.R. 1998. Empirical statistical estimates for sequence similarity searches. J. Mol. Biol. 276: 71–84.
Schulz, G.E. and Schirmer, R.H. 1979. Principles of protein structure. Springer, New York.
Shatsky, M., Nussinov, R., and Wolfson, H.J. 2002. Flexible protein alignment and hinge detection. Proteins 48: 242–256.
Shindyalov, I.N. and Bourne, P.E. 1998. Protein structure alignment by incremental combinatorial extension (CE) of the optimal path. Protein Eng. 11: 739–747.
Shindyalov, I.N. and Bourne, P.E. 2000. An alternative view of protein fold space. Proteins 38: 247–260.
Sierk, M.L. and Pearson, W.R. 2004. Sensitivity and selectivity in protein structure comparison. Protein Sci. 13: 773–785.
Wriggers, W. and Schulten, K. 1997. Protein domain movements: Detection of rigid domains and visualization of hinges in comparisons of atomic coordinates. Proteins 29: 1–14.
Wuthrich, K. and Wagner, G. 1978. Internal motion in globular proteins. Trends Biochem. Sci. 3: 227–230.
Yang, A.S. and Honig, B. 2000. An integrated approach to the analysis and modeling of protein sequences and structures. I. Protein structural alignment and a quantitative measure for protein structural distance. J. Mol. Biol. 301: 665–678.
Ye, Y. and Godzik, A. 2003. Flexible structure alignment by chaining aligned fragment pairs allowing twists. Bioinformatics 19: ii246–ii255.
Zarembinski, T.I., Kim, Y., Peterson, K., Christendat, D., Dharamsi, A., Arrowsmith, C.H., Edwards, A.M., and Joachimiak, A. 2003. Deep trefoil knot implicated in RNA binding found in an archaebacterial protein. Proteins 50: 177–183.
Zhang, C. and Kim, S.H. 2003. Overview of structural genomics: From structure to function. Curr. Opin. Chem. Biol. 7: 28–32.