|
|
||||||||
Institute of Biological Chemistry, Academia Sinica, Taipei 115, Taiwan, Republic of China
Reprint requests to: Ting-Fang Wang, Institute of Biological Chemistry, Academia Sinica, Taipei 115, Taiwan, Republic of China; e-mail: tfwang{at}gate.sinica.edu.tw; fax: 886-2-27889759.
(RECEIVED February 18, 2002; FINAL REVISION April 9, 2002; ACCEPTED April 12, 2002)
1 These two authors contributed equally to this work. ![]()
Article and publication are at http://www.proteinscience.org/cgi/doi/10.1110/ps.0205202.
| Abstract |
|---|
|
|
|---|
1.5 mL) were subjected to high-speed centrifugation in a 96-tube format and analyzed by multiwell denaturing SDS-PAGE. Our results thus far show that 80% of the genes screened show high levels of expression of soluble products in at least one of the eight fusion protein constructs. The method is well suited for automation and is applicable for the production of large numbers of proteins for genome-wide analysis. Keywords: Structural genomics; functional genomics; proteomics; protein expression
Abbreviations: HTP, high throughput IPTG, isopropyl ß-D-thiogalactoside LB, Luria-Bertani RC, recombinational cloning PCR, polymerase chain reaction Trx, thioredoxin MBP, maltose-binding protein GST, glutathione S-transferase CBP, calmodulin-binding protein CAP, cellulose-associated protein SDS-PAGE, sodium dodecyl sulfate-polyacrylamide gel
| Introduction |
|---|
|
|
|---|
Several prototypes of HTP protein expression and purification systems have been initiated (Christendat et al. 2000; Edwards et al. 2000; Lesley 2001; Zhu et al. 2001). Cloning and expression in Escherichia coli are favored in many instances because E. coli has relatively simple genetics, is well characterized, has a relatively rapid growth rate, and has few post-translational protein modifications. One disadvantage, however, of expressing heterologous proteins in E. coli is that proteins are frequently expressed as insoluble aggregated folding intermediates, known as inclusion bodies (Paul et al. 1983). Although it may be possible to increase protein solubility by optimizing expression condition or by refolding the recombinant proteins, in the interests of throughput, only a single set of growth or folding conditions can be used.
Gene fusion is another approach that has been successfully used for producing soluble heterologous proteins in E. coli (Uhl'en and Moks 1990). Several carrier proteins are widely used in gene fusion, including thioredoxin (Trx), maltose-binding protein (MBP), glutathione S-transferase (GST), intein, calmodulin-binding protein (CBP), NusA, and cellulose-associated protein (CAP). Although the use of these carrier proteins has resulted in the successful overexpression of many heterologous proteins, each was tested empirically and certainly may not possess maximal solubilizing characteristics. Moreover, each expression scenario requires a specific vector. Recloning cDNA into each of these specific vectors is extremely labor intensive.
Recombinational cloning (RC) methodology was recently developed to minimize the effort required for alternative expression. It uses either cre-lox (Liu et al. 1999) or Int/Xis/IHF (Hartley et al. 2000) recombination to introduce the gene of interest into a recipient vector. In these systems, aberrant recombination or cointegrant products may result from faulty gene transfer to the expression vector. Another limitation is that translation fusions of the recombination att or lox sites and a few extra nucleotide sequences are required to ensure successful gene transfer. In some cases, such as protein crystallography, in which longer translation fusions are potentially more detrimental to the proteins, a conventional cloning approach with shorter translation fusions is more appropriate. In the present study, we established a new procedure for the parallel cloning of genes into multiple fusion expression vectors without restriction digestion. The main objective here was to rapidly screen for well-expressed soluble proteins that can be used in structural and functional genomics.
| Results |
|---|
|
|
|---|
25% of the final product carries two cohesive ends and is ready for ligation even without restriction digestion. Therefore, this method is suitable for cloning any gene, even genes with internal EcoRI or XhoI restriction sites. To optimize cloning efficiency, sticky-end PCR products were 5` phosphorylated with T4 polynucleotide kinase and the vectors were dephosphorylated by calf intestine alkaline phosphatase. Together, these procedures increase the efficiency of PCR products into multiple expression vectors. As shown in Figure 2
40 genes into these eight expression vectors (>300 cloning reactions) with a >95% success rate.
|
|
100% in a single cloning reaction), bacterial colonies were directly induced with isopropyl ß-D-thiogalactoside (IPTG) to produce proteins even without examining each individual clone by restriction mapping or colony PCR.
To identify well-expressed and highly soluble fusion proteins, 2 mL of culture was used for small-scale induction. Briefly, bacterial cultures in log phase (OD600
0.6) were induced with IPTG at 20°C for 24 hr. We found that low temperature and long induction time facilitate correct protein folding; for instance, the fusion protein of yeast Hop2 (encoded by open reading frame YGL033W; Table 1
) and Trx is soluble at 20°C but not at 37°C.
|
|
We have cloned and expressed more than 40 proteins from various organisms. The overall successful rate of obtaining soluble proteins, at least in one of the eight expression constructs tested, is >80% (Fig. 4A
). The soluble ratio of individual fusion protein is shown in Figure 4B
. Often, the larger fusion tags are superior for enhancing protein solubility; for example, the success ratio of soluble NusA (54 kD), MBP (42 kD), and GST (24 kD) fusion proteins are 60%, 60%, and 38%, respectively. Target fusion proteins that have been successfully expressed by this method are listed in Table 1
. These target proteins alone range from 9 kD to 100 kD, and the largest soluble fusion protein expressed in this study was
150 kD.
|
|
| Discussion |
|---|
|
|
|---|
Sticky-end PCR and directional cloning methods allow one to obtain multiple expression plasmids without restriction digestion. This is a somewhat conventional cloning approach, but it has several advantages compared with other methods, such as the RC approach. First of all, it is simpler. It allows direct cloning of PCR products into multiple expression vectors. RC methods require at least two cloning steps. Second, it is more accurate in theory and also in practice. With an RC approach, faulty gene transfer might occur because of aberrant recombination or cointegrant vector products. Third, it is less detrimental to proteins. This method introduces only two new amino acids encoded by the restriction sites, whereas RC methods include att or lox sites, as well as other extra sequences to achieve a precise gene transfer. Longer translation fusions introduced by a cloning procedure are usually more harmful to proteins.
All procedures in this study, including DNA cloning, plasmid preparation, protein induction, and cell lysis are based on using a standard 96-well format in an efficient and reproducible manner, making these procedures suitable for automation. All 96 samples could be subjected to ultracentrifugation fractionation in the Beckman Ti25 rotor. Therefore, it is possible to integrate the process of sample transfer from 96-well plate to 96 centrifugal tubes using a robotic protocol, although manual operation is still required afterward.
High speed centrifugation (90,000g) ensures the separation of highly soluble and properly folded proteins from insoluble or partially aggregated materials. Denaturing SDS-PAGE was applied to identify these soluble fusion proteins. Consequently, there is little chance of finding false positives in this screening procedure. After identifying clones expressing soluble fusion proteins, the rest of the cell lysates can be used for other purposes such as small-scale affinity purification.
With our protocols for rapid subcloning, solubility screening, and parallel protein purification, we will be able to provide a large number of high purity fusion proteins for structurefunction studies. For protein crystallography, carrier fusion protein domains can be proteolytically cleaved during or after the first step of affinity purification. The resulting His-tagged target proteins can be isolated by Ni2+-resin or other conventional chromatography methods. These proteins can also be used to make protein microarrays, allowing for the parallel characterization of diverse biochemical activities, such as enzymatic assays, proteinprotein, proteinnucleic acid, and receptorligand interactions. The protein chips may also be applied to screen for new drugs. We have succeeded in the biochemical characterization of at least three fusion proteins expressed in this study (data not shown), indicating that these fusion proteins retain a part of or even the full biochemical activity of the target proteins.
In summary, we have developed an HTP molecular cloning and protein expression system using E. coli. It allows us to screen effectively for well-expressed and highly soluble proteins. The same approach can be applied for alternate cloning of all potential target genes into vectors of different expression systems, including yeast, insect, and mammalian cells, as well as cell-free in vitro systems. Last, but not least, this method is well suited for automation and will be a useful tool for the production of proteins for use in structural and functional genomic studies.
| Materials and methods |
|---|
|
|
|---|
25% of the final products carried cohesive ends and were ready for ligation. Fusion protein expression vectors used in these studies were purchased from Novagen, New England Biolabs, or Amershan Pharmacia. We engineered two new universal cloning sites (EcoRI and XhoI) into those vectors. Briefly, the original vectors were cut with restriction enzymes as close to the 3` end of the N-terminal fusion genes. The appropriate DNA cassette was chosen to retain the reading frame of the fusion over EcoRI and XhoI restriction sites and to introduce 6 histidine amino acid residues between XhoI and stop codon. A specific cleavage sequence of protease (e.g., thrombin or factor Xa) was introduced immediately after the EcoRI site and before the coding sequence of target protein; this was achieved by stringent design of the sticky-end PCR primers. To prepare vectors for ligation reactions, the vectors were restriction digested with EcoRI and XhoI and then dephosphorylated with calf intestinal alkaline phosphotase (New England Biolabs).
Plasmid DNA purification was performed in a 96-well format using Millpore's Motage plasmid miniprep kit. Eight different expression vectors were used here for parallel cloning. Two independent clones were isolated and characterized from every cloning reaction. Therefore, soluble protein products of six different genes (48 cloning/96 protein induction) were screened simultaneously.
Small-scale protein induction
Host E. coli strain JM109(DE3) (Novagen) was chosen for plasmid preparation as well as protein induction. Host strains, BL21-Gold(DE3) or BL21-CondonPlus(DE3) (Stratagene), were also used for expression in the case of low-level protein induction in JM109(DE3). Single colonies were grown overnight in LB medium with ampicillin (50 µg/mL) or kanamycin (30 µg/mL) at 37°C. Two 18-µL overnight cultures were inoculated in 2 mL LB (with 1% glucose) and grown at 37°C for 3 hr (OD600
0.6). The cells were cooled in 20°C incubators, induced with or without 0.4 mM IPTG, and subsequently grown for an additional 20 hr. To harvest the cells, 500-µL cultures from each well of the 96-wells were transferred to a new 96-well plate. Culture medium was placed onto a Sorvall RTH750 microplate carrier and centrifuged for 10 min at 4000 rpm. Cell pellets were suspended in 1.5X SDS-PAGE sample buffer and boiled for 5 min.
Protein solubility test
For protein solubility assays, cell pellets from 1.5 mL of culture with IPTG induction were resuspended in 40 µL of ice-cold buffer B (250 mM sucrose, 25 mM Tris-HCl at pH 7.0, 1 mM EGTA, lysozyme 0.3 mg/mL) and incubated on ice for 20 min. The suspensions were mixed with 160 µL of ice-cold lysis buffer (0.1% Triton X-100, 150 mM NaCl, 0.1 unit Benzonase, 1 mM EGTA, 25 mM Tris-HCl at pH 7.0) followed by incubation at 4°C for another 20 min. Benzonase (Novagen, USA) was used here to digest bacteria genomic DNA and RNA. Insoluble materials were removed by centrifugation at 90,000g for 45 min in the Ti25 rotor (Beckman, USA). Soluble fractions (
100 µL) were then mixed with an equal volume of 3X SDS-sample buffer and boiled immediately for 5 min. Both total cell extracts and soluble fractions were analyzed on 8% to 12% denaturing SDS-PAGE. The proteins (gels) were visualized by Coomassie Blue staining.
Successful expression of soluble fusion protein was scored as follows: Eight different fusion constructs for each target protein were examined. At least one of these constructs must yield a high level of expression and also remain soluble after an ultracentrifugation fractionation procedure, as described previously. Successfully expressed soluble proteins were analyzed by SDS-PAGE and visually identified by Coomassie Blue staining.
| Acknowledgments |
|---|
The publication costs of this article were defrayed in part by payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 USC section 1734 solely to indicate this fact.
| References |
|---|
|
|
|---|
Edwards, A.M. Arrowsmith, C.H. Christendat, D. Dharamsi, A., Friesen, J.D. Greenblatt, J.F., and Vedadi, M. 2000. Protein production: Feeding the crystallographers, and NMR spectroscopies. Nat. Struct. Biol. [Suppl.] 7: 970972.
Fields, S. 2001. Proteomics: Proteomics in genomeland. Science 291: 12211224.
Hartley, J.L., Temple, G.F., and Brasch, M.A. 2000. DNA cloning using in vitro site-specific recombination. Genome Res. 10: 17881795.
Lesley, S.A. 2001. High-throughput proteomics: Protein expression and purification in the postgenomic world. Protein Expr. Purif. 22: 159164.[CrossRef][Medline]
Liu, Q., Li, M.Z., Leibham, D., Cortez, D., and Elledge, S.J. 1999. The univector plasmid-fusion system, a method for rapid construction of recombinant DNA without restriction enzymes. Curr. Biol. 8:13001309.
Paul, D.C., Van Frank, R.M., Muth, W.L., Ross J.W., and Williams, D.C. 1983. Immunocytochemical demonstration of human proinsulin chimeric polypeptide within cytoplasmic inclusion bodies of Escherichia coli. Eur. J. Cell Biol. 31: 171174.
Skolnick, J., Fetrow, J.S., and Kolinski, A. 2000. Structural genomics and its importance for gene function analysis. Nat. Biotechnol. 18:283287.[CrossRef][Medline]
Uhl'en, M. and Moks, T. 1990. Gene fusion for purposes of expression: An introduction. Methods Enzymol. 185:129143.[CrossRef][Medline]
Zeng, G. 1998. Sticky-end PCR: New method for subcloning. Biotechniques 25: 206208.[Medline]
Zhu, H., Bilgin, M., Bangham, R., Hall, D., Casamzyor, A., Bertone, P., Lan, N., Jansen, R., Bidlingmaier, S., Houfek, T., et al. 2001. Global analysis of protein activities using proteome chips. Science 293: 21012105.
![]()
CiteULike
Connotea
Del.icio.us
Digg
Reddit
Technorati What's this?
This article has been cited by other articles:
![]() |
S. Nallamsetty, B. P. Austin, K. J. Penrose, and D. S. Waugh Gateway vectors for the production of combinatorially-tagged His6-MBP fusion proteins in the cytoplasm and periplasm of Escherichia coli Protein Sci., December 1, 2005; 14(12): 2964 - 2971. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y.-P. Shih, H.-C. Wu, S.-M. Hu, T.-F. Wang, and A. H.-J. Wang Self-cleavage of fusion protein in vivo using TEV protease to yield native protein Protein Sci., April 1, 2005; 14(4): 936 - 941. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Idicula-Thomas and P. V. Balaji Understanding the relationship between the primary structure of proteins and its propensity to be soluble on overexpression in Escherichia coli Protein Sci., March 1, 2005; 14(3): 582 - 592. [Abstract] [Full Text] [PDF] |
||||
![]() |
J.-l. Chou, C.-L. Huang, H.-L. Lai, A. C. Hung, C.-L. Chien, Y.-Y. Kao, and Y. Chern Regulation of Type VI Adenylyl Cyclase by Snapin, a SNAP25-binding Protein J. Biol. Chem., October 29, 2004; 279(44): 46271 - 46279. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. H. Choi, J. M. Groarke, D. C. Young, M. G. Rossmann, D. C. Pevear, R. J. Kuhn, and J. L. Smith Design, expression, and purification of a Flaviviridae polymerase using a high-throughput approach to facilitate crystal structure determination Protein Sci., October 22, 2004; 13(10): 2685 - 2692. [Abstract] [Full Text] [PDF] |
||||
![]() |
C.-H. Luan, S. Qiu, J. B. Finley, M. Carson, R. J. Gray, W. Huang, D. Johnson, J. Tsao, J. Reboul, P. Vaglio, et al. High-Throughput Expression of C. elegans Proteins Genome Res., October 1, 2004; 14(10b): 2102 - 2110. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. L. Maxwell, D. Bona, C. Liu, C. H. Arrowsmith, and A. M. Edwards Refolding out of guanidine hydrochloride is an effective approach for high-throughput structural studies of small proteins Protein Sci., September 1, 2003; 12(9): 2073 - 2080. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. R. Smyth, M. K. Mrozkiewicz, W. J. McGrath, P. Listwan, and B. Kobe Crystal structures of fusion proteins with large-affinity tags Protein Sci., July 1, 2003; 12(7): 1313 - 1322. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. A. Berthold, P. Stenmark, and P. Nordlund Screening for functional expression and overexpression of a family of diiron-containing interfacial membrane proteins using the univector recombination system Protein Sci., January 1, 2003; 12(1): 124 - 134. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |