Lean-Proteome Strains – Next Step in Metabolic Engineering

Valgepea, Kaspar; Peebo, Karl; Adamberg, Kaarel; Vilu, Raivo

doi:10.3389/fbioe.2015.00011

OPINION article

Front. Bioeng. Biotechnol., 06 February 2015

Sec. Systems Biology Archive

Volume 3 - 2015 | https://doi.org/10.3389/fbioe.2015.00011

This article is part of the Research TopicQuantitative Systems Biology for Engineering Organisms and PathwaysView all 11 articles

Lean-proteome strains – next step in metabolic engineering

Kaspar Valgepea¹

Karl Peebo^1,2

Kaarel Adamberg^1,3

Raivo Vilu^1,2*

¹Competence Center of Food and Fermentation Technologies, Tallinn, Estonia
²Department of Chemistry, Tallinn University of Technology, Tallinn, Estonia
³Department of Food Processing, Tallinn University of Technology, Tallinn, Estonia

Rapid development of high-throughput -omics (e.g., proteomics) and genetic engineering technologies together with an array of new metabolic modeling tools during this century has led to the emergence of new fields of biological research termed systems biology and synthetic biology. The successful exploitation of these developments is evidenced by the creation of increasing number of genetically engineered recombinant cells with superior characteristics (Jantama et al., 2008; Becker et al., 2011) or totally novel functions (Nakamura and Whited, 2003; Yim et al., 2011; Paddon et al., 2013) for diverse sectors such as chemicals and healthcare (Huang et al., 2012; Lee et al., 2012; Sun and Alper, 2014). However, there exists a significant gap in bioprocess performance between studies of the literature and the requirements for an industrially feasible bioprocess for chemical production (Van Dien, 2013). Overall bioprocess performance [productivity (gram/liter/hour), titer (gram/liter) etc.] has to be increased further for successful industrial-scale commercialization to drive the shift from fossil fuel to bioprocess-based chemical production and cost-effective production of novel drugs (Van Dien, 2013). Hence, there is great need for novel approaches addressing these key challenges in chemical and healthcare sectors.

Potential of Proteome Optimization

With this opinion, we propose that a novel approach of proteome optimization carries a substantial potential for addressing the aforementioned challenges in bioprocess development. That potential arises from the fact that cells express proteins not essential (e.g., flagellar, heat or acid stress proteins) for growth under well-controlled optimal conditions, typically realized in biotechnological processes. This leads to non-efficient use of protein synthesis capacity (translation machinery) and energy for bioprocesses. As translation capacity is believed to be one of the growth-limiting factors, at least in the bacterium Escherichia coli (Klumpp et al., 2013), synthesis of non-essential proteins sequesters ribosomes potentially lowering the synthesis capacity of target molecule production. Thus removing the expression burden of non-essential proteins, i.e., creation of lean-proteome strains, could enable to specifically manipulate the allocation of ribosomes for higher synthesis of proteins leading to increased target molecule production. Optimization of the cellular proteome through experimental testing of strains with optimized expression of non-essential proteins and inclusion of protein synthesis capacity constraints in metabolic modeling could open a new avenue for the creation of superior cell factories.

Initial experimental confirmation of the potential of optimization of the layer of protein synthesis capacity for increasing the maximum specific growth rate (μ_max) of cells comes from two studies of E. coli investigating the effects of heterologous protein expression on μ (Scott et al., 2010; Bienick et al., 2014). Both studies show for several heterologous proteins (e.g., LacZ, eGFP) that increasing their expression has a linear negative effect on μ. Their data suggest that for expression of every 1% of heterologous protein per dry cell weight, μ decreases by ~3%. It would be sensible to assume that a similar correlation would exist for the opposite case – decreasing the fraction of non-essential proteins by 1% would lead to an increase in μ by ~3%. Our proposal is also supported by two studies of Bacillus subtilis showing that reducing the expression load of proteins non-essential under bioprocess conditions by ~9% fraction from the total proteome through the deletion of the flagellar/motility regulator gene sigD leads to a ~30% increase of both μ_max and biomass yield (Fischer and Sauer, 2005; Muntel et al., 2014). Further support comes from recent experiments of D’Souza et al. (2014), which show that deletion of single amino acid, vitamin, or nucleobase biosynthesis genes from E. coli results in higher μ_max compared to the wild-type strain when both strains are grown on medium containing the amino acid, vitamin, or nucleobase that the deletion strain was auxotrophic for. These observations are consistent with earlier chemostat studies with B. subtilis (Zamenhof and Eichhorn, 1967) and E. coli (Dykhuizen, 1978) where mutants impaired in tryptophan biosynthesis demonstrate significant fitness advantages in the presence of tryptophan relative to prototrophic cells. More importantly, D’Souza et al. (2014) show that deleting genes with higher protein expression cost leads to a greater growth advantage.

The results presented above suggest that proteome resource optimization through decreasing the fraction of non-essential proteins could lead to faster growth and thus also to better bioprocess performance. For instance, target molecule productivity could be increased in growth-coupled production processes by enabling faster growth at the same expression level(s) of target molecule production-related proteins. On the other hand, recombinant protein titers could be significantly elevated by allocating more proteome resources for target protein expression at the expense of lower synthesis of non-essential proteins even at the same μ and/or protein synthesis rate.

Reduced-Genome Approaches

A conceptually similar approach of creating reduced-genome strains for industrial purposes has been applied in few cases before (Pósfai et al., 2006; Mizoguchi et al., 2008; Unthan et al., 2014; Xue et al., 2014). However, these efforts concentrated on reducing the genome and neglected the effects of gene deletions on the cellular proteome. The approach of deleting large chunks of the genome, instead of specific genes, based on gene function and not on protein abundance was probably responsible for the observed minor positive effects on cellular growth and target molecule production. While the latter studies focused on large-scale genome reduction, experimental technologies enabling more targeted and accurate engineering of strains with reduced load of gene expression have recently emerged. Hence, now the successful execution of the concept of targeted optimization of the layer of protein synthesis capacity is feasible due to the recent rapid progress in proteome-wide absolute quantitative proteomics (Arike et al., 2012; Ahrné et al., 2013; Wiśniewski et al., 2014) and high-throughput genome engineering technologies [e.g., Multiplexed Automated Genome Engineering (MAGE; Wang et al., 2009), trackable multiplex recombineering (TRMR; Warner et al., 2010)]. Thus, the time is ripe to design and create lean-proteome strains possibly leading to superior bioprocess performance.

Challenges with Proteome Optimization

The main challenge with creating lean-proteome strains is hitting the correct genes/proteins, i.e., genes, which deletion does not lead to detrimental effects. This is a serious concern even in the most studied bacterium E. coli since functions for a third of its proteins are still unknown (Keseler et al., 2013) while only ~300 proteins are considered essential for E. coli (http://ecoliwiki.net/colipedia/index.php/Essential_genes). It is important to point out that knowing functions/essentiality for more proteins is not the objective per se – it is actually more important to know the functions/essentiality of the proteins with the biggest translational burden (abundance × length), as their deletion presumably leads to stronger effects. The good news here is that for many organisms, the proteome mass (a good proxy for length) distribution follows the Pareto principle – ~20% of proteins make up ~80% of the proteome mass (Ghaemmaghami et al., 2003; Maier et al., 2011; Schmidt et al., 2011; Valgepea et al., 2013). Thus, instead of targeting hundreds of genes/genome areas like in the reduced-genome approach described above, one could theoretically greatly increase the key metrics of bioprocess performance (titer, yield, productivity; Van Dien, 2013) by deleting as few as ~10 non-essential genes with the highest translational burden in E. coli (in total 7% of proteome; Valgepea et al., 2013) and substituting the “freed” 7% of the total proteome with target molecule-related proteins. Importantly, current mass-spectrometric techniques of absolute proteome quantification (Arike et al., 2012; Ahrné et al., 2013; Wiśniewski et al., 2014) are accurate enough to determine the proteins with the biggest translational burden on the whole-proteome level.

Strategies of Proteome Optimization for Creating Lean-Proteome Strains

The first and most important step toward creating lean-proteome strains is absolute quantitative proteome analysis of the initial recombinant strain. Accurate characterization of the full proteome is needed for the compilation of lists of non-essential target proteins with the biggest translational burden. We propose two strategies for creating superior lean-proteome strains by targeting proteins with the biggest translational burden, currently specifically for E. coli:

1. The first strategy targets proteins with known functions and presumably unnecessary under optimal bioprocess conditions, e.g., pH, temperature, oxygen tension control; defined substrate feed; stirring. These could be proteins involved in stress responses (acid, heat, and osmotic shock), alternative substrate transport and catabolism and cellular movement (flagellar).

2. The second strategy targets proteins with unknown functions with the biggest translational burden. Beneficial for both approaches is the growth screen of all the Keio collection single (Baba et al., 2006) and double deletion strains (personal communication with Prof. Hirotada Mori) that can be used to determine the genes/proteins, which should and should not be targeted.

Another important step is the experimental construction of lean-proteome strains and selection for better production strains. Instead of reducing the proteome one protein at a time, one should target tens of genes with an approach similar to MAGE (Wang et al., 2009), which constantly generates genetic heterogeneity in the pool of mutants allowing the generation of thousands of lean-proteome strains within a few days. The challenge of selecting for better production strains could be tackled by combining several screening methods. First, one could screen for fast growth as reduction of non-essential protein expression should lead to faster growth. Second, high-producing strains could be isolated using fluorescence activated cell sorting (FACS) using a sensor system based on a fluorescent readout corresponding to target molecule levels.

Potential of Metabolic Modeling

Lastly, one would greatly benefit from an in silico metabolic model, which would enable quantitative prediction of the effects of removing non-essential proteins on target molecule production. This should be a model, which incorporates the cellular proteome with the two central features of regulation of μ – cell geometry and cell cycle – and ties the latter to the fluxes of flux balance analysis (FBA)-type models for in silico analysis and design of lean-proteome strains. Recently, we have seen serious progress into this direction by the development of a novel single-cell model (Abner et al., 2013), next-generation FBA-type of genome-scale models of metabolism and gene expression (O’Brien et al., 2013; Liu et al., 2014), and a whole-cell model (Karr et al., 2012). Surely, these models will be advanced further and hopefully they will also be able to determine which genes/proteins to delete for creating superior lean-proteome strains.

Conclusion

Based on the recent rapid advances in high-throughput mutant generation and proteomics technologies together with the emerging novel whole-cell modeling approaches, we conclude that the time is ripe for the metabolic engineering community to directly focus on proteome optimization leading to the creation of lean-proteome strains with superior target molecule production characteristics.

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

The financial support for this work was provided by the European Regional Development Fund project EU29994 and institutional research (IUT 1927) and personal (G9192) funding of the Estonian Ministry of Education and Research.

References

Abner, K., Aaviksaar, T., Adamberg, K., and Vilu, R. (2013). Single-cell model of prokaryotic cell cycle. J. Theor. Biol. 341C, 78–87. doi: 10.1016/j.jtbi.2013.09.035

CrossRef Full Text | Google Scholar

Ahrné, E., Molzahn, L., Glatter, T., and Schmidt, A. (2013). Critical assessment of proteome-wide label-free absolute abundance estimation strategies. Proteomics 17, 2567–2578. doi:10.1002/pmic.201300135

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Arike, L., Valgepea, K., Peil, L., Nahku, R., Adamberg, K., and Vilu, R. (2012). Comparison and applications of label-free absolute proteome quantification methods on Escherichia coli. J. Proteomics 75, 5437–5448. doi:10.1016/j.jprot.2012.06.020

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Baba, T., Ara, T., Hasegawa, M., Takai, Y., Okumura, Y., Baba, M., et al. (2006). Construction of Escherichia coli K-12 in-frame, single-gene knockout mutants: the Keio collection. Mol. Syst. Biol. 2, 2006.0008. doi:10.1038/msb4100050

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Becker, J., Zelder, O., Häfner, S., Schröder, H., and Wittmann, C. (2011). From zero to hero – design-based systems metabolic engineering of Corynebacterium glutamicum for l-lysine production. Metab. Eng. 13, 159–168. doi:10.1016/j.ymben.2011.01.003

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Bienick, M. S., Young, K. W., Klesmith, J. R., Detwiler, E. E., Tomek, K. J., and Whitehead, T. A. (2014). The interrelationship between promoter strength, gene expression, and growth rate. PLoS ONE 9:e109105. doi:10.1371/journal.pone.0109105

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

D’Souza, G., Waschina, S., Pande, S., Bohl, K., Kaleta, C., and Kost, C. (2014). Less is more: selective advantages can explain the prevalent loss of biosynthetic genes in bacteria. Evolution 68, 2559–2570. doi:10.1111/evo.12468

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Dykhuizen, D. E. (1978). Selection for tryptophan auxotrophs of Escherichia coli in glucose-limited chemostats as a test of energy-conservation hypothesis. Evolution 32, 125–150. doi:10.2307/2407415

CrossRef Full Text | Google Scholar

Fischer, E., and Sauer, U. (2005). Large-scale in vivo flux analysis shows rigidity and suboptimal performance of Bacillus subtilis metabolism. Nat. Genet. 37, 636–640. doi:10.1038/ng1555

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Ghaemmaghami, S., Huh, W.-K., Bower, K., Howson, R. W., Belle, A., Dephoure, N., et al. (2003). Global analysis of protein expression in yeast. Nature 425, 737–741. doi:10.1038/nature02046

CrossRef Full Text | Google Scholar

Huang, C.-J., Lin, H., and Yang, X. (2012). Industrial production of recombinant therapeutics in Escherichia coli and its recent advancements. J. Ind. Microbiol. Biotechnol. 39, 383–399. doi:10.1007/s10295-011-1082-9

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Jantama, K., Zhang, X., Moore, J. C., Shanmugam, K. T., Svoronos, S. A., and Ingram, L. O. (2008). Eliminating side products and increasing succinate yields in engineered strains of Escherichia coli C. Biotechnol. Bioeng. 101, 881–893. doi:10.1002/bit.22005

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Karr, J. R., Sanghvi, J. C., Macklin, D. N., Gutschow, M. V., Jacobs, J. M., Bolival, B., et al. (2012). A whole-cell computational model predicts phenotype from genotype. Cell 150, 389–401. doi:10.1016/j.cell.2012.05.044

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Keseler, I. M., Mackie, A., Peralta-Gil, M., Santos-Zavaleta, A., Gama-Castro, S., Bonavides-Martínez, C., et al. (2013). EcoCyc: fusing model organism databases with systems biology. Nucleic Acids Res. 41, D605–D612. doi:10.1093/nar/gks1027

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Klumpp, S., Scott, M., Pedersen, S., and Hwa, T. (2013). Molecular crowding limits translation and cell growth. Proc. Natl. Acad. Sci. U.S.A. 110, 16754–16759. doi:10.1073/pnas.1310377110

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Lee, J. W., Na, D., Park, J. M., Lee, J., Choi, S., and Lee, S. Y. (2012). Systems metabolic engineering of microorganisms for natural and non-natural chemicals. Nat. Chem. Biol. 8, 536–546. doi:10.1038/nchembio.970

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Liu, J. K., O’Brien, E. J., Lerman, J. A., Zengler, K., Palsson, B. Ø, and Feist, A. M. (2014). Reconstruction and modeling protein translocation and compartmentalization in Escherichia coli at the genome-scale. BMC Syst. Biol. 8:110. doi:10.1186/s12918-014-0110-6

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Maier, T., Schmidt, A., Güell, M., Kühner, S., Gavin, A.-C., Aebersold, R., et al. (2011). Quantification of mRNA and protein and integration with protein turnover in a bacterium. Mol. Syst. Biol. 7, 511. doi:10.1038/msb.2011.38

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Mizoguchi, H., Sawano, Y., Kato, J., and Mori, H. (2008). Superpositioning of deletions promotes growth of Escherichia coli with a reduced genome. DNA Res. 15, 277–284. doi:10.1093/dnares/dsn019

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Muntel, J., Fromion, V., Goelzer, A., Maass, S., Mader, U., Buttner, K., et al. (2014). Comprehensive absolute quantification of the cytosolic proteome of Bacillus subtilis by data independent, parallel fragmentation in liquid chromatography/mass spectrometry (LC/MSE). Mol. Cell. Proteomics 13, 1008–1019. doi:10.1074/mcp.M113.032631

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Nakamura, C. E., and Whited, G. M. (2003). Metabolic engineering for the microbial production of 1,3-propanediol. Curr. Opin. Biotechnol. 14, 454–459. doi:10.1016/j.copbio.2003.08.005

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

O’Brien, E. J., Lerman, J. A., Chang, R. L., Hyduke, D. R., and Palsson, B. Ø (2013). Genome-scale models of metabolism and gene expression extend and refine growth phenotype prediction. Mol. Syst. Biol. 9, 693. doi:10.1038/msb.2013.52

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Paddon, C. J., Westfall, P. J., Pitera, D. J., Benjamin, K., Fisher, K., McPhee, D., et al. (2013). High-level semi-synthetic production of the potent antimalarial artemisinin. Nature 496, 528–532. doi:10.1038/nature12051

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Pósfai, G., Plunkett, G., Fehér, T., Frisch, D., Keil, G. M., Umenhoffer, K., et al. (2006). Emergent properties of reduced-genome Escherichia coli. Science 312, 1044–1046. doi:10.1126/science.1126439

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Schmidt, A., Beck, M., Malmström, J., Lam, H., Claassen, M., Campbell, D., et al. (2011). Absolute quantification of microbial proteomes at different states by directed mass spectrometry. Mol. Syst. Biol. 7, 510. doi:10.1038/msb.2011.37

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Scott, M., Gunderson, C. W., Mateescu, E. M., Zhang, Z., and Hwa, T. (2010). Interdependence of cell growth and gene expression: origins and consequences. Science 330, 1099–1102. doi:10.1126/science.1192588

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Sun, J., and Alper, H. S. (2014). Metabolic engineering of strains: from industrial-scale to lab-scale chemical production. J. Ind. Microbiol. Biotechnol. doi:10.1007/s10295-014-1539-8

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Unthan, S., Baumgart, M., Radek, A., Herbst, M., Siebert, D., Brühl, N., et al. (2014). Chassis organism from Corynebacterium glutamicum – a top-down approach to identify and delete irrelevant gene clusters. Biotechnol. J. doi:10.1002/biot.201400041

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Valgepea, K., Adamberg, K., Seiman, A., and Vilu, R. (2013). Escherichia coli achieves faster growth by increasing catalytic and translation rates of proteins. Mol. Biosyst. 2344–2358. doi:10.1039/c3mb70119k

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Van Dien, S. J. (2013). From the first drop to the first truckload: commercialization of microbial processes for renewable chemicals. Curr. Opin. Biotechnol. 24, 1061–1068. doi:10.1016/j.copbio.2013.03.002

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Wang, H. H., Isaacs, F. J., Carr, P. A., Sun, Z. Z., Xu, G., Forest, C. R., et al. (2009). Programming cells by multiplex genome engineering and accelerated evolution. Nature 460, 894–898. doi:10.1038/nature08187

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Warner, J. R., Reeder, P. J., Karimpour-Fard, A., Woodruff, L. B. A., and Gill, R. T. (2010). Rapid profiling of a microbial genome using mixtures of barcoded oligonucleotides. Nat. Biotechnol. 28, 856–862. doi:10.1038/nbt.1653

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Wiśniewski, J. R., Hein, M. Y., Cox, J., and Mann, M. (2014). A “proteomic ruler” for protein copy number and concentration estimation without spike-in standards. Mol. Cell. Proteomics 13, 3497–3506. doi:10.1074/mcp.M113.037309

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Xue, X., Wang, T., Jiang, P., Shao, Y., Zhou, M., Zhong, L., et al. (2014). The MEGA (Multiple Essential Genes Assembling) deletion and replacement method for genome reduction in Escherichia coli. ACS Synth. Biol. doi:10.1021/sb500324p

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Yim, H., Haselbeck, R., Niu, W., Pujol-Baxley, C., Burgard, A., Boldt, J., et al. (2011). Metabolic engineering of Escherichia coli for direct production of 1,4-butanediol. Nat. Chem. Biol. 7, 445–452. doi:10.1038/nchembio.580

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Zamenhof, S., and Eichhorn, H. H. (1967). Study of microbial evolution through loss of biosynthetic functions – establishment of defective mutants. Nature 26, 456–458. doi:10.1038/216456a0

CrossRef Full Text | Google Scholar

Keywords: strain engineering, recombinant cells, absolute quantitative proteomics, genome engineering, Escherichia coli, genome reduction, chassis cells, whole-cell model

Citation: Valgepea K, Peebo K, Adamberg K and Vilu R (2015) Lean-proteome strains – next step in metabolic engineering. Front. Bioeng. Biotechnol. 3:11. doi: 10.3389/fbioe.2015.00011

Received: 02 January 2015; Accepted: 22 January 2015;
Published online: 06 February 2015.

Edited by:

Hilal Taymaz Nikerel, Bogazici University, Turkey

Reviewed by:

Steve Van Dien, Genomatica, USA

Copyright: © 2015 Valgepea, Peebo, Adamberg and Vilu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence:cmFpdm9Aa2JmaS5lZQ==

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.