|
|
||||||||
1 Department of Biochemistry and
2 Howard Hughes Medical Institute, University of Texas Southwestern Medical Center, Dallas, Texas 75390, USA
Reprint requests to: Jimin Pei, Department of Biochemistry, University of Texas Southwestern Medical Center, 5323 Harry Hines Blvd., Dallas, TX 75390, USA; e-mail: jpei{at}chop.swmed.edu; fax: (214) 648-9099.
(RECEIVED March 8, 2004; FINAL REVISION May 10, 2004; ACCEPTED May 14, 2004)
| Abstract |
|---|
|
|
|---|
Keywords: dsDNA bacteriophage; prohead protease; MEROPS; homology detection; structure prediction; gene organization; evolution
Abbreviations: PDB, Protein Data Bank PSI-BLAST, position-specific iterated basic local alignment search tool COG, clusters of orthologous groups 3D, three-dimensional dsDNA, double-stranded DNA PCMA, profile consistency multiple sequence alignment.
Supplemental material: see ftp://iole.swmed.edu/pub/cheng/prohead
Article and publication are at http://www.proteinscience.org/cgi/doi/10.1110/ps.04726004.
| Introduction |
|---|
|
|
|---|
Double-stranded DNA bacteriophages assemble their heads in two steps; first, capsid proteins gather around a scaffold to form a prohead (or procapsid); then, the prohead undergoes a maturation process to expand into the final, mature head (Dokland 1999). Interestingly, this two-stage assembly strategy is also exploited by herpesviruses (Lata et al. 2000). Prohead maturation in some viruses requires the activity of a virus-encoded prohead protease, whose major responsibilities are to cleave the scaffold proteins and/or process the capsid proteins (Dokland 1999; Lata et al. 2000). The peptidase classification database MEROPS (Barrett et al. 2001; Rawlings et al. 2002) defines two prohead protease families, U9 and U35. MEROPS names each peptidase family by its catalytic type, aspartic (A), cysteine (C), metallo (M), serine (S), threonine (T), and unknown (U). Thus, the catalytic mechanisms of U9 and U35 are still unknown. Family U9 is represented by bacteriophage T4 gp21 protease. Family U35 consists of two subgroups, U35.001 represented by bacteriophage HK97 gp4 protease and U35.002 represented by phage Mu gpI protein. In addition, the herpesvirus procapsid maturation protease, which is also called assemblin or UL26 protein (numbered according to herpes simplex virus type 1 or HSV-1; Homa and Brown 1997), is classified as family S21 in MEROPS.
Using computational methods, we gather evidence that dsDNA bacteriophage prohead protease families U35.001, U35.002, and U9 are homologs. Furthermore, we link these phage prohead protease families with herpesvirus protease and unify them into a procapsid protease superfamily. Because herpesvirus protease has a known 3D structure, the homology inferred here provides important clues to further our understanding of the structure and mechanism of the bacteriophage prohead proteases.
| Results and Discussion |
|---|
|
|
|---|
Most of these 78 sequences in the U35.001 family belong to either bacteriophages or bacteria. When used as queries in searching CDD (Conserved Domain Database; Marchler-Bauer et al. 2003), many of them (including bacterial sequences) readily found COG3740 (Tatusov et al. 2001), phage head maturation protease, with good e-values (<0.01). This close relationship suggests that the bacterial sequences in this family probably resulted from integrated phage genomes. The words prophage or phage frequently appear in the annotations of these bacterial sequences.
S21 family
Most interestingly, the above-mentioned transitive searches to expand the U35.001 family found a group of herpesvirus proteases (44 sequences in total) in addition to the 78 U35.001 family members from phages or bacteria, suggesting a homologous relationship between phage prohead protease and herpesvirus protease (MEROPS family S21). Regular PSI-BLAST searches provided further evidence for this putative homologous relationship. For example, we used the full-length HK97 gp4 protein (gi|9634157) as a query to run PSI-BLAST (Altschul et al. 1997) on the NCBI nonredundant (nr) database (December 29, 2003: 1,585,607 sequences; 519,349,222 total letters; e-value cutoff 0.01). The UL26 capsid maturation protease of Meleagrid herpesvirus 1 (gi|12084854) was found in the second iteration with a significant e-value 0.009.
U9 family
The U9 family in MEROPS is represented by bacteriophage T4 gp21 protease. Starting with this protein (gi|75965, residues 1212), we performed transitive PSI-BLAST (e-value cutoff 0.001) to detect possible U9 family members. These searches converged after the first round, yielding only eight sequences. All of these sequences are phage proteins, often annotated as gp21 prohead core scaffold protein and protease. The small number of sequences in the U9 family suggests that it is a singleton family in the current nr database, and that its sequence profile is not good enough for detecting its similarity to distant homologs. However, in the course of expanding the U35.001 family, we found statistical evidence that the U9 family and the U35.001 family are remote homologs. For example, starting with the U35.001 representative HK97 gp4 protease (gi|9634157, residues 1225), PSI-BLAST search found a close homolog gi|26988298 with e-value 2e-18 in the first iteration. Using this sequence as query to run PSI-BLAST (default parameters in NCBI Web site), we found a U9 family member (gi|30044105) in the third iteration with a significant e-value 0.003.
U35.002 family and COG3566
In MEROPS, the U35.002 family contains only the bacteriophage Mu gpI protein. Using this protein as a query (gi|9633523, residues 1361), we performed extensive transitive PSI-BLAST searches (e-value cutoff 0.001) to expand the U35.002 family. These searches converged after the third round, yielding a total of 69 sequences. (We manually inspected the hits in each round to remove false positives.) Much like the U35.001 family, most of these 69 sequences also come from either bacteriophages or bacteria. Many of the bacterial sequences probably resulted from integrated phages, as suggested by their annotations.
We clustered these 69 sequences into four groups on the basis of sequence conservation and the results of Euclidian distance mapping (see "Sequence clustering"): three U35.002 subfamilies (U35.002.a, U35.002.b, and U35.002.c) and a fourth group corresponding to an uncharacterized cluster of orthologs (COG3566;Tatusov et al. 2001). U35.002.a contains enterobacteria phage Mu gpI protein and its close homologs (10 sequences in total); U35.002.b contains enterobacteria phage P2 gpO protein and its close homologs (32 sequences in total); U35.002.c includes 10 hypothetical proteins (nine from bacteria, one from archaea); and COG3566 consists of Burkholderia cepacia phage Bcep1 gp15 protein and its close homologs (17 sequences in total).
Because U35.001 and U35.002 are in the same U35 family in MEROPS, and MEROPS families consist of sequences with significant statistical similarity (http://merops.sanger.ac.uk/), we expected U35.001 and U35.002 family members to be close homologs. Surprisingly, during the above-mentioned extensive transitive PSI-BLAST searches, the U35.001/S21/U9 family members could not find any of the U35.002/COG3566 family members and vice versa. To explore the possible link between U35.001/S21/U9 and U35.002/COG3566, we made a global multiple sequence alignment of U35.001 combined with S21 (122 sequences, only protease domains) to seed BLAST searches. A U35.002 family member (gi|34496934) was found with a significant e-value 3e-04 by the query gi|9634157. BLAST searches seeded with a multiple sequence alignment of U35.002/ COG3566 families (69 sequences, full length) found a S21 member, the human herpesvirus 8 protease (gi|2246545), with a higher e-value 0.038.
Sequence clustering: Euclidian distance mapping and average pairwise sequence identities
The transitive and alignment-seeded PSI-BLAST searches have expanded and linked several MEROPS families, including phage prohead protease families U35.001, U35.002, U9, and herpesvirus protease family S21. To better understand the relationships between these families, we used a manually adjusted PCMA (Pei et al. 2003) multiple sequence alignment of all the detected sequences (199 in total) for Euclidian distance mapping (Grishin and Grishin 2002) and sequence identity calculations.
In distance mapping, each sequence is represented by a point in a multidimensional Euclidian space and the distances between these points reflect the evolutionary distances between the sequences. A two-dimensional projection of this space is shown in Figure 2A
(below). This plot offers a visualization of the sequence clustering; each family or subfamily appears as a rather distinct group, except for U35.002.a and U35.002.c. These two subfamilies are divided on the basis of sequence conservation and insertion/ deletion patterns, for example, U35.002.c sequences have a two-residue insertion before the putative oxyanion-binding site highlighted in red in Figure 1
.
|
|
Fold and active site prediction for the phage prohead proteases
The inferred homology between phage prohead proteases and herpesvirus protease offers a tentative prediction that these two kinds of proteases may adopt a similar fold. To explore this possibility further, we submitted representative sequences from each of the phage prohead protease families to the fold-recognition Meta server (Ginalski et al. 2003). Some of the queries found herpesvirus protease fold among their top hits. For example, when HK97 gp4 (gi|9634157, residues 1225) was used as a query, all of the 3D-Jury hits are herpesvirus proteases with significant consensus scores ranging from 70 to 93 (scores above 50 are considered to be significant; Ginalski et al. 2003). The representative sequences for U35.002.c (gi|34496934, residues 1202) and COG3566 (gi|38638622, residues 1200) also found the herpesvirus protease fold with the best consensus scores of 50 and 24, respectively.
To validate the homologous relationship between phage prohead proteases and herpesvirus protease, we constructed a PCMA (Pei et al. 2003) multiple-sequence alignment of all of the detected sequences (199 in total) in the procapsid protease superfamily. This alignment was manually adjusted according to sequence conservation, secondary structure predictions by PSI-PRED (Jones 1999), and comparisons with PSI-BLAST local alignments. Figure 1
shows the representative sequences in each family. The secondary structure predictions of phage prohead protease families exhibit an overall agreement with the experimental structure of herpesvirus protease, which is also reflected by the conservation of hydrophobicity pattern and small residue positions (yellow and gray shadings in Fig. 1
). The structural core of herpesvirus protease fold is a seven-stranded
-barrel composed of two orthogonally packed, four-stranded
-sheets (
3
4
1
7 and
3
2
6
5) (Fig. 3
). A sharp bend near a conserved Gly enables
3 to participate in both sheets (Chen et al. 1996). This central, mainly antiparallel
-barrel is capped by
A at one end and
B
C at the other, whereas the remaining four C-terminal helices (
D
E
F
G) and the loops between them form a circle surrounding the barrel (Tong et al. 1996; Tong 2002). The active form of herpesvirus protease is a homodimer, in which the monomermonomer interaction is mediated mainly by
F and partially by
B and
C. The connection between the secondary structure elements is as follows:
1
A-
2
3
4
B-
C-
5
6
7
D-
E-
F-
G (Fig. 3
; Tong et al. 1996). The multiple sequence alignment in Figure 1
covers only
1
6, as the rest of the sequences are too divergent to ensure reliable alignment. However, the covered region constitutes the major part of the herpesvirus protease fold and includes all of the active site residues (catalytic residues and oxyanion-binding loop). Each of the aligned secondary structure elements (
1
6) finds its counterpart in phage prohead protease sequences except
B, which corresponds to a long gap in the phage families. The secondary structure prediction results indicate that there is only one helix between
4 and
5 in phage families. We aligned this predicted helix with
C in herpesvirus protease instead of
B, as this alignment gives a better sequence conservation pattern. Because
B is involved in homodimer formation in herpesvirus protease, the absence of this helix might affect the manner or even the capability of phage prohead protease dimerization.
|
2 and
3 and Ser 132 in the middle of
5 in human cytomegalovirus or HCMV protease; Tong et al. 1996) constitute the essential catalytic elements for serine proteases; Ser 132 is the nucleophile and His 63 is the general base (Chen et al. 1996). The invariance of these two residues in families U35.001, U35.002, U9, and COG3566 strongly suggests that these phage prohead proteases are serine proteases and share the same overall catalytic mechanism with herpesvirus protease. However, the third member of the herpesvirus protease catalytic triad (His 157 in HCMV protease; Chen et al. 1996) is not conserved in phage prohead proteases (shaded in green in Fig. 1
The oxyanion-binding loop in herpesvirus protease consists of residues Gly 164 to Thr 169 (residue number according to HCMV protease), with the backbone amide of Arg 165 (boxed in red in Fig. 1
) directly contributing to the oxyanion hole (Reiling et al. 2000). In herpesvirus protease family S21, this oxyanion-binding loop is highly conserved. However, in phage prohead protease families U35.001, U35.002, and U9, a different conservation pattern is observed in this region. For example, the position corresponding to Gly 164 is occupied instead by a highly conserved Pro. More importantly, the oxyanion hole residue Arg 165 in S21 family is substituted by a conserved small residue (mainly A, sometimes S or G). Because the backbone amide, but not the side chain, forms an H-bond with the oxyanion, we expect that these small residues in this position could also play the oxyanion-binding role.
Although the oxyanion loop region in family S21 differs from its corresponding regions in families U35.001, U35.002, and U9, it is surprisingly similar to that found in COG3566. The shared features include the oxyanion hole residue R165 and the conserved G164 and G168. Furthermore, COG3566 also has a conserved His in the position occupied by the His third member of herpesvirus protease catalytic triad, whereas the residues in other families vary in this position. These similarities make COG3566 the most similar group to herpesvirus proteases in this procapsid protease superfamily. This similarity is also reflected in the sequence identity analysis; S21 family shows the highest between-group identity with COG3566, and vice versa.
Gene organization of the phage head-assembly module
In phage genomes, the genes functioning in head assembly usually cluster together to form a module (Hendrix 2003). Important members in this module include a portal protein, a prohead protease, a scaffolding protein, and capsid protein(s) (Duda et al. 1995). During this study, we observed three different situations concerning the position of the prohead protease gene in this module relative to those of the scaffolding and capsid protein genes (Fig. 4
).
|
In the second scenario, the prohead protease and the scaffolding protein share the same gene (Fig. 4
; Mu). In phage Mu, the putative protease gpI has 361 residues. However, only the first half can be aligned with other prohead proteases (Fig. 1
; U35.002.a). The C-terminal 183 residues of gpI correspond to another protein, gpZ, which is experimentally confirmed to be the Mu scaffolding protein (Grimaud 1996; Morgan et al. 2002). Thus, the scaffolding protein gpZ uses the same reading frame and the same stop codon as the protease gpI; only the start codon is different (Morgan et al. 2002). This gene arrangement is strikingly similar to that observed in gpC and gpNu3 proteins of bacteriophage
(Fig. 4
;
; Morgan et al. 2002). However,
gpC is not a member in this procapsid protease superfamily. Instead, it is homologous to Escherichia coli protease IV (discussed below). Interestingly, herpesvirus protease and scaffolding protein are also arranged in a nested way similar to phage Mu (Fig. 4
; HSV-1; Sheaffer et al. 2000). The protease is encoded by the full-length version of UL26 gene, whereas the scaffolding protein is encoded by a truncated version (UL26.5). Thus, the scaffolding protein has exactly the same sequence as the C-terminal part of the protease.
In the last scenario, the prohead protease is fused to the capsid protein (Fig. 4
; c2). For example, the predicted lactococcal bacteriophage c2 prohead protease covers the N-terminal part of gene l5 (Fig. 1
; U35.001, gi|9628687), whereas the C-terminal part (residues 206480) is indicated by biochemical studies to be the c2 major capsid protein (Lubbers et al. 1995). The fusion of protease and capsid protein is also observed in Rhodobacter prophage
RcM1 (Smith et al. 1999).
The fusion of prohead protease to scaffolding protein or capsid protein suggests the possibility of autoproteolysis. In fact, autocleavage has been observed for herpesvirus protease (Homa and Brown 1997; Sheaffer et al. 2000).
It is also worth mentioning that many phages do not encode a protease in their head assembly gene module. For instance, phage T7 (Fig. 4
; T7) head assembly does not involve proteolysis (Duda et al. 1995).
Distribution of procapsid protease superfamily members
Altogether, we detected 199 sequences in this procapsid protease superfamily: 89 from bacteria, one from archaea, 65 from bacterial or archaeal phages, and 44 from herpes-viruses. No homologs were detected from other kinds of viruses or eukaryotes. As discussed above, the bacterial and archaeal sequences probably originate from integrated prophages.
The phage sequences come from 65 different phages, of which 48 are in the order Caudovirales (Maniloff and Ackermann 1998) and 17 are unclassified bacteriophages. Among the Caudovirales phages, 20 are from the family Myoviridae (e.g., Mu, T4, P2), 26 from Siphoviridae (e.g., HK97, c2), and two from Podoviridae (e.g., ST64B). All of these are bacteriophages, except for psiM2 and psiM100, which are closely related archaeophages. (In fact, psiM100 is a defective prophage found in archaeon Methanothermobacter wolfeii [Luo et al. 2001].)
Interestingly, some phages known to carry out head-maturation proteolysis do not possess a procapsid protease. Instead, another kind of protease is found in their head-assembly gene module at roughly the same position as the prohead protease, suggesting that this second type of protease mediates the maturation proteolysis. Bacteriophage
putative protease gpC is an example of this second type (Baird et al. 1991). In MEROPS, gpC is classified in the protease IV family S49. However, it also shows significant sequence similarity to the ClpP proteases (MEROPS S14). For example, CDD search using
gpC (gi|9626248, residues 1439) as query revealed COG0740 or ClpP with evalue 3e-06. Thus, we will refer to this second type of phage protease as protease IV/ClpP-type. In addition to
, we found a protease IV/ClpP-type protease in the head-assembly module in coliphage 21 (Smith and Feiss 1993), Streptococcus thermophilus phage Sfi21 (Desiere et al. 1999), Staphylococcus aureus phage
12 (Iandolo et al. 2002), and a few other phages. ClpP has a known 3D structure (Wang et al. 1997), which has a fold different from the herpesvirus protease (Tong et al. 1996).
Surprisingly, certain phages possess both types of proteases (procapsid proteases and protease IV/ClpP-type). For example, mycobacteriophage CJW1 gp11 is in the prohead protease family U35.001 (Fig. 1
, gi|29565890). However, its gp103 is homologous to ClpP protease (Pedulla et al. 2003). Because gp11 is located in the head-assembly module, but gp103 is far away downstream, we expect that the maturation proteolysis be carried out by gp11. Further experiments are needed to elucidate the function of gp103.
Evolutionary implications
In this study, we argue that herpesvirus protease (UL26 protein) is homologous to dsDNA phage prohead proteases. This relationship is consistent with the well-established morphological similarities shared between herpesviruses and dsDNA phages in their head-assembly pathways. These similarities mainly include the requirement and subsequent removal of scaffolding proteins; and the preassembly and subsequent maturation of prohead or procapsid (Homa and Brown 1997; Dokland 1999). Our results provide evidence at the molecular level that herpesvirus and dsDNA phage head-assembly pathways have a common origin. Actually, the head-assembly pathway also includes DNA packaging, a process to encapsulate the viral genome. A similar DNA packaging mechanism is also shared by herpesviruses and dsDNA phages, in which a concatemer DNA is cleaved to unit genomes and translocated into the head or capsid by a virus-encoded terminase (Catalano 2000; Newcomb et al. 2001). Recently, the large subunit of this terminase was found to be homologous in herpesviruses and dsDNA phages (Mitchell et al. 2002; Przech et al. 2003). In addition to procapsid protease and terminase large subunit, the herpesvirus alkaline exonuclease, which functions in DNA recombination and replication, also shares homology with its counterpart in dsDNA phages (Bujnicki and Rychlewski 2001; Reuven et al. 2003). Importantly, these three herpes-virus enzymes (protease, terminase, and exonuclease) do not have counterparts detected from eukaryotic cells to date (Bujnicki and Rychlewski 2001; Mitchell et al. 2002). Thus, it is unlikely that herpesviruses acquired these enzymes from their eukaryotic hosts. A parsimonious explanation is that herpesviruses and dsDNA phages are evolutionarily related (Dokland 1999; Newcomb et al. 2001; Mitchell et al. 2002), and these shared enzymes are inherited from an ancestral virus.
Conclusions
Through computational sequence and structure analysis, we infer homology between phage prohead proteases (MEROPS families U35.001, U35.002, and U9) and herpesvirus protease (MEROPS family S21), and unify them to form a procapsid protease superfamily. This homology offers a fold prediction for phage prohead proteases and implies that they are serine proteases. Members in this procapsid protease superfamily are found in bacteriophages, archaeophages, herpesviruses, bacteria, and archaea. Our study presents evidence that herpesvirus and dsDNA phages are evolutionarily related.
| Materials and methods |
|---|
|
|
|---|
Multiple sequence alignments were constructed using the PCMA program (Pei et al. 2003) followed by manual adjustment. PCMA first aligns similar sequences in a fast way by ClustalW (Thompson et al. 1994) to form prealigned groups. These pre-aligned sequence groups are then aligned by a consistency objective function (Notredame et al. 2000) to improve alignment accuracy for divergent sequences. The similarity threshold of PCMA is set to 50% in this study (Pei et al. 2003).
In seeded BLAST searches, a position-specific scoring matrix was generated according to the input PCMA alignment. Then, each sequence in the input alignment was used as a query to run one round of PSI-BLAST using the alignment-generated matrix.
Secondary structure prediction was performed using PSI-PRED (Jones 1999). Representative sequences from each family were submitted to the 3D-Jury system on the fold recognition Meta server (Ginalski et al. 2003; http://bioinfo.pl/Meta).
Euclidian distance mapping and average pairwise sequence identity
The global multiple alignment of all the detected sequences (199 in total) was used for distance mapping and pairwise identity calculations. Euclidian distance mapping was performed by the EESG program (Grishin and Grishin 2002). Columns with more than 40% gaps were removed from the input multiple sequence alignment. In calculating the pairwise percent identity between two sequences, the numerator was the number of identical residue pairs and the denominator was the number of aligned positions in which both sequences had an amino acid. The pairwise percent identities were calculated for all sequence pairs within a family and averaged to give the within-group sequence identity for that family. The between-group identity for two families was calculated in a similar way, only that the two sequences in a sequence pair came from different families.
| Electronic supplemental material |
|---|
|
|
|---|
| Acknowledgments |
|---|
The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.
| References |
|---|
|
|
|---|
Baird, L., Lipinska, B., Raina, S., and Georgopoulos, C. 1991. Identification of the Escherichia coli sohB gene, a multicopy suppressor of the HtrA (DegP) null phenotype. J. Bacteriol. 173: 57635770.
Barrett, A.J., Rawlings, N.D., and OBrien, E.A. 2001. The MEROPS database as a protease information system. J. Struct. Biol. 134: 95102.[CrossRef][Medline]
Bateman, A., Birney, E., Durbin, R., Eddy, S.R., Finn, R.D., and Sonnhammer, E.L. 1999. Pfam 3.1: 1313 multiple alignments and profile HMMs match the majority of proteins. Nucleic Acids Res. 27: 260262.
Bujnicki, J.M. and Rychlewski, L. 2001. The herpesvirus alkaline exonuclease belongs to the restriction endonuclease PD-(D/E)XK superfamily: Insight from molecular modeling and phylogenetic analysis. Virus Genes 22: 219230.[CrossRef][Medline]
Catalano, C.E. 2000. The terminase enzyme from bacteriophage
: A DNA-packaging machine. Cell. Mol. Life Sci. 57: 128148.[CrossRef][Medline]
Chen, P., Tsuge, H., Almassy, R.J., Gribskov, C.L., Katoh, S., Vanderpool, D.L., Margosiak, S.A., Pinko, C., Matthews, D.A., and Kan, C.C. 1996. Structure of the human cytomegalovirus protease catalytic domain reveals a novel serine protease fold and catalytic triad. Cell 86: 835843.[CrossRef][Medline]
Desiere, F., Lucchini, S., and Brussow, H. 1999. Comparative sequence analysis of the DNA packaging, head, and tail morphogenesis modules in the temperate cossite Streptococcus thermophilus bacteriophage Sfi21. Virology 260: 244253.[CrossRef][Medline]
Dokland, T. 1999. Scaffolding proteins and their role in viral assembly. Cell. Mol. Life Sci. 56: 580603.[CrossRef][Medline]
Duda, R.L., Martincic, K., Xie, Z., and Hendrix, R.W. 1995. Bacteriophage HK97 head assembly. FEMS Microbiol. Rev. 17: 4146.[CrossRef][Medline]
Fischer, D. 2003. 3D-SHOTGUN: A novel, cooperative, fold-recognition meta-predictor. Proteins 51: 434441.[CrossRef][Medline]
Ginalski, K., Elofsson, A., Fischer, D., and Rychlewski, L. 2003. 3D-Jury: A simple approach to improve protein structure predictions. Bioinformatics 19: 10151018.
Grimaud, R. 1996. Bacteriophage Mu head assembly. Virology 217: 200210.[CrossRef][Medline]
Grishin, V.N. and Grishin, N.V. 2002. Euclidian space and grouping of biological objects. Bioinformatics 18: 15231534.
Hendrix, R.W. 2003. Bacteriophage genomics. Curr. Opin. Microbiol. 6: 506511.[CrossRef][Medline]
Hendrix, R.W. and Duda, R.L. 1998. Bacteriophage HK97 head assembly: A protein ballet. Adv. Virus Res. 50: 235288.[Medline]
Homa, F.L. and Brown, J.C. 1997. Capsid assembly and DNA packaging in herpes simplex virus. Rev. Med. Virol. 7: 107122.[CrossRef][Medline]
Iandolo, J.J., Worrell, V., Groicher, K.H., Qian, Y., Tian, R., Kenton, S., Dorman, A., Ji, H., Lin, S., Loh, P., et al. 2002. Comparative analysis of the genomes of the temperate bacteriophages
11,
12 and
13 of Staphylococcus aureus 8325. Gene 289: 109118.[CrossRef][Medline]
Jones, D.T. 1999. Protein secondary structure prediction based on position-specific scoring matrices. J. Mol. Biol. 292: 195202.[CrossRef][Medline]
Khayat, R., Batra, R., Massariol, M.J., Lagace, L., and Tong, L. 2001. Investigating the role of histidine 157 in the catalytic activity of human cytomegalovirus protease. Biochemistry 40: 63446351.[CrossRef][Medline]
Kinch, L.N. and Grishin, N.V. 2002a. Evolution of protein structures and functions. Curr. Opin. Struct. Biol. 12: 400408.[CrossRef][Medline]
. 2002b. Expanding the nitrogen regulatory protein superfamily: Homology detection at below random sequence identity. Proteins 48: 7584.[CrossRef][Medline]
Kraulis, P.J. 1991. MOLSCRIPT: A program to produce both detailed and schematic plots of protein structures. J. Appl. Cryst. 24: 946950.
Krogh, A., Brown, M., Mian, I.S., Sjolander, K., and Haussler, D. 1994. Hidden Markov models in computational biology. Applications to protein modeling. J. Mol. Biol. 235: 15011531.[CrossRef][Medline]
Lata, R., Conway, J.F., Cheng, N., Duda, R.L., Hendrix, R.W., Wikoff, W.R., Johnson, J.E., Tsuruta, H., and Steven, A.C. 2000. Maturation dynamics of a viral capsid: Visualization of transitional intermediate states. Cell 100: 253263.[CrossRef][Medline]
Lubbers, M.W., Waterfield, N.R., Beresford, T.P., Le Page, R.W., and Jarvis, A.W. 1995. Sequencing and analysis of the prolate-headed lactococcal bacteriophage c2 genome and identification of the structural genes. Appl. Environ. Microbiol. 61: 43484356.[Abstract]
Lundstrom, J., Rychlewski, L., Bujnicki, J., and Elofsson, A. 2001. Pcons: A neural-network-based consensus predictor that improves fold recognition. Protein Sci. 10: 23542362.
Luo, Y., Pfister, P., Leisinger, T., and Wasserfallen, A. 2001. The genome of archaeal prophage PsiM100 encodes the lytic enzyme responsible for autolysis of Methanothermobacter wolfeii. J. Bacteriol. 183: 57885792.
Maniloff, J. and Ackermann, H.W. 1998. Taxonomy of bacterial viruses: Establishment of tailed virus genera and the order Caudovirales. Arch. Virol. 143: 20512063.[CrossRef][Medline]
Marchler-Bauer, A., Anderson, J.B., DeWeese-Scott, C., Fedorova, N.D., Geer, L.Y., He, S., Hurwitz, D.I., Jackson, J.D., Jacobs, A.R., Lanczycki, C.J., et al. 2003. CDD: A curated Entrez database of conserved domain alignments. Nucleic Acids Res. 31: 383387.
Mitchell, M.S., Matsuzaki, S., Imai, S., and Rao, V.B. 2002. Sequence analysis of bacteriophage T4 DNA packaging/terminase genes 16 and 17 reveals a common ATPase center in the large subunit of viral terminases. Nucleic Acids Res. 30: 40094021.
Morgan, G.J., Hatfull, G.F., Casjens, S., and Hendrix, R.W. 2002. Bacteriophage Mu genome sequence: Analysis and comparison with Mulike prophages in Haemophilus, Neisseria and Deinococcus. J. Mol. Biol. 317: 337359.[CrossRef][Medline]
Newcomb, W.W., Juhas, R.M., Thomsen, D.R., Homa, F.L., Burch, A.D., Weller, S.K., and Brown, J.C. 2001. The UL6 gene product forms the portal for entry of DNA into the herpes simplex virus capsid. J. Virol. 75: 1092310932.
Notredame, C., Higgins, D.G., and Heringa, J. 2000. T-Coffee: A novel method for fast and accurate multiple sequence alignment. J. Mol. Biol. 302: 205217.[CrossRef][Medline]
Pedulla, M.L., Ford, M.E., Houtz, J.M., Karthikeyan, T., Wadsworth, C., Lewis, J.A., Jacobs-Sera, D., Falbo, J., Gross, J., Pannunzio, N.R., et al. 2003. Origins of highly mosaic mycobacteriophage genomes. Cell 113: 171182.[CrossRef][Medline]
Pei, J. and Grishin, N.V. 2001. GGDEF domain is homologous to adenylyl cyclase. Proteins 42: 210216.[CrossRef][Medline]
. 2003. Peptidase family U34 belongs to the superfamily of N-terminal nucleophile hydrolases. Protein Sci. 12: 11311135.
Pei, J., Sadreyev, R., and Grishin, N.V. 2003. PCMA: Fast and accurate multiple sequence alignment based on profile consistency. Bioinformatics 19: 427428.
Perelygina, L., Zhu, L., Zurkuhlen, H., Mills, R., Borodovsky, M., and Hilliard, J.K. 2003. Complete sequence and comparative analysis of the genome of herpes B virus (Cercopithecine herpesvirus 1) from a rhesus monkey. J. Virol. 77: 61676177.
Przech, A.J., Yu, D., and Weller, S.K. 2003. Point mutations in exon I of the herpes simplex virus putative terminase subunit, UL15, indicate that the most conserved residues are essential for cleavage and packaging. J. Virol. 77: 96139621.
Rawlings, N.D., OBrien, E., and Barrett, A.J. 2002. MEROPS: The protease database. Nucleic Acids Res. 30: 343346.
Reiling, K.K., Pray, T.R., Craik, C.S., and Stroud, R.M. 2000. Functional consequences of the Kaposis sarcoma-associated herpesvirus protease structure: Regulation of activity and dimerization by conserved structural elements. Biochemistry 39: 1279612803.[CrossRef][Medline]
Reuven, N.B., Staire, A.E., Myers, R.S., and Weller, S.K. 2003. The herpes simplex virus type 1 alkaline nuclease and single-stranded DNA binding protein mediate strand exchange in vitro. J. Virol. 77: 74257433.
Sheaffer, A.K., Newcomb, W.W., Brown, J.C., Gao, M., Weller, S.K., and Tenney, D.J. 2000. Evidence for controlled incorporation of herpes simplex virus type 1 UL26 protease into capsids. J. Virol. 74: 68386848.
Smith, M.P. and Feiss, M. 1993. Sequence analysis of the phage 21 genes for prohead assembly and head completion. Gene 126: 17.[CrossRef][Medline]
Smith, M.C., Burns, R.N., Wilson, S.E., and Gregory, M.A. 1999. The complete genome sequence of the Streptomyces temperate phage straight phiC31: Evolutionary relationships to other viruses. Nucleic Acids Res. 27: 21452155.
Tatusov, R.L., Natale, D.A., Garkavtsev, I.V., Tatusova, T.A., Shankavaram, U.T., Rao, B.S., Kiryutin, B., Galperin, M.Y., Fedorova, N.D., and Koonin, E.V. 2001. The COG database: New developments in phylogenetic classification of proteins from complete genomes. Nucleic Acids Res. 29: 2228.
Thompson, J.D., Higgins, D.G., and Gibson, T.J. 1994. CLUSTAL W: Improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 22: 46734680.
Todd, A.E., Orengo, C.A., and Thornton, J.M. 2001. Evolution of function in protein superfamilies, from a structural perspective. J. Mol. Biol. 307: 11131143.[CrossRef][Medline]
Tong, L. 2002. Viral proteases. Chem. Rev. 102: 46094626.[CrossRef][Medline]
Tong, L., Qian, C., Massariol, M.J., Bonneau, P.R., Cordingley, M.G., and Lagace, L. 1996. A new serine-protease fold revealed by the crystal structure of human cytomegalovirus protease. Nature 383: 272275.[CrossRef][Medline]
Walker, D.R. and Koonin, E.V. 1997. SEALS: A system for easy analysis of lots of sequences. Proc. Int. Conf. Intell. Syst. Mol. Biol. 5: 333339.[Medline]
Wang, J., Hartling, J.A., and Flanagan, J.M. 1997. The structure of ClpP at 2.3 Å resolution suggests a model for ATP-dependent proteolysis. Cell 91: 447456.[CrossRef][Medline]
![]()
CiteULike
Connotea
Del.icio.us
Digg
Reddit
Technorati What's this?
This article has been cited by other articles:
![]() |
A. Wietzorrek, H. Schwarz, C. Herrmann, and V. Braun The Genome of the Novel Phage Rtp, with a Rosette-Like Tail Tip, Is Homologous to the Genome of Phage T1 J. Bacteriol., February 15, 2006; 188(4): 1419 - 1436. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. J. Summer, C. F. Gonzalez, M. Bomer, T. Carlile, A. Embry, A. M. Kucherka, J. Lee, L. Mebane, W. C. Morrison, L. Mark, et al. Divergence and Mosaicism among Virulent Soil Phages of the Burkholderia cepacia Complex J. Bacteriol., January 1, 2006; 188(1): 255 - 268. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Pei and N. V. Grishin The P5 protein from bacteriophage phi-6 is a distant homolog of lytic transglycosylases Protein Sci., May 1, 2005; 14(5): 1370 - 1374. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. R. Casjens, E. B. Gilcrease, D. A. Winn-Stapley, P. Schicklmaier, H. Schmieger, M. L. Pedulla, M. E. Ford, J. M. Houtz, G. F. Hatfull, and R. W. Hendrix The Generalized Transducing Salmonella Bacteriophage ES18: Complete Genome Sequence and DNA Packaging Strategy J. Bacteriol., February 1, 2005; 187(3): 1091 - 1104. [Abstract] [Full Text] [PDF] |
||||
| |||||||||||||||||||||