Abstract
Solving environmental and social challenges such as climate change requires a shift from our current non-renewable manufacturing model to a sustainable bioeconomy. To lower carbon emissions in the production of fuels and chemicals, plant biomass feedstocks can replace petroleum using microorganisms as biocatalysts. The anaerobic thermophile Clostridium thermocellum is a promising bacterium for bioconversion due to its capability to efficiently degrade lignocellulosic biomass. However, the complex metabolism of C. thermocellum is not fully understood, hindering metabolic engineering to achieve high titers, rates, and yields of targeted molecules. In this study, we developed an updated genome-scale metabolic model of C. thermocellum that accounts for recent metabolic findings, has improved prediction accuracy, and is standard-conformant to ensure easy reproducibility. We illustrated two applications of the developed model. We first formulated a multi-omics integration protocol and used it to understand redox metabolism and potential bottlenecks in biofuel (e.g., ethanol) production in C. thermocellum. Second, we used the metabolic model to design modular cells for efficient production of alcohols and esters with broad applications as flavors, fragrances, solvents, and fuels. The proposed designs not only feature intuitive push-and-pull metabolic engineering strategies, but also present novel manipulations around important central metabolic branch-points. We anticipate the developed genome-scale metabolic model will provide a useful tool for system analysis of C. thermocellum metabolism to fundamentally understand its physiology and guide metabolic engineering strategies to rapidly generate modular production strains for effective biosynthesis of biofuels and biochemicals from lignocellulosic biomass.
1. Introduction
Global oil reserves will be soon depleted (Shafiee and Topal, ), and climate change could become a major driver of civil conflict (Hsiang et al., ). These challenges to security and the environment need to be addressed by replacing our current non-renewable production of energy and materials for a renewable and carbon neutral approach (Ragauskas et al., ). The gram-positive, thermophilic, cellulolytic, strict anaerobe C. thermocellum is capable of efficient degradation of lignocellulosic biomass to produce biofuels and biomaterial precursors, making this organism an ideal candidate for consolidated bioprocessing (CBP), where production of lignocellulosic enzymes, saccharification, and fermentation take place in a single step (Olson et al., ). However, its complex and poorly understood metabolism remains the main roadblock to achieve industrially competitive titers, rates, and yields of biofuels such as ethanol (Tian et al., ) and isobutanol (Lin et al., ).
For the past decade, significant efforts have been dedicated to characterize and manipulate the central metabolism of C. thermocellum, due to increasing interest in developing this organism as a CBP manufacturing platform for biofuels production (Akinosho et al., ). C. thermocellum possesses atypical central metabolism, characterized by the important roles of pyrophosphate and ferredoxin (Zhou et al., ), which makes redirection of both carbon and electron flows for biofuel production challenging to achieve. Specifically, the metabolic network of C. thermocellum contains various reactions to regulate intracellular concentration levels of NADH, NADPH, and reduced ferredoxin. These cofactors are used as electron donors with high specificity throughout metabolism. To maintain redox balance, C. thermocellum also possesses several hydrogenases to oxidize these reduced cofactors to molecular hydrogen that is secreted by the cell. Removal of these hydrogenases through deletion of ech (encoding the ferredoxin-dependent hydrogenase, ECH) and hydG (associated with the bifurcating hydrogenase, BIF, and bidirectional hydrogenase, H2ASE_syn) was successfully applied to increase ethanol yield by electron rerouting (Biswas et al., ). Thompson et al. () characterized the ΔhydGΔech strain in depth by flux analysis of its core metabolism, concluding that the major driver for ethanol production was redox rather than carbon balancing. In particular, the conversion of reduced ferredoxin to NAD(P)H is likely the most rate limiting step. In a subsequent study, Lo et al. () over-expressed rnf (encoding the ferredoxin-NAD oxidoreductase, RNF) in the ΔhydGΔech strain that is expected to enhance NADH supply, but did not achieve improved ethanol yield.
In an attempt to redirect carbon and electron flows for enhanced ethanol production, Deng et al. () manipulated the pyruvate node and malate shunt of C. thermocellum. By converting phosphoenolpyruvate (pep) to oxaloacetate (oaa) and then to pyruvate (pyr), this shunt can interchange one mole of NADPH with one mol of NADH generated from glycolysis. Interestingly, the authors noted that replacement of the malate shunt by alternative pathways not linked to NADPH increased ethanol production and carbon recovery but reduced amino acid formation, confirming the role of the malate shunt as an NADPH source in C. thermocellum.
Sulfur metabolism also plays a key role in redox metabolism of C. thermocellum and has been investigated for its role in ethanol production. Sulfate, a component of C. thermocellum media, serves as an electron acceptor, which is capable of oxidizing sulfate to sulfite and then sulfide. Thompson et al. () demonstrated that the strain ΔhydGΔechΔpfl could not grow in a conventional defined medium due to its inability to secrete hydrogen or formate, but was able to rescue growth by sulfate supplementation to the culture medium. More recently, Biswas et al. () reported an increase in final sulfide concentration and over-expression of the associated sulfate uptake and reduction pathway in the ΔhydG strain, but did not observe a significant difference in final sulfide concentration in ΔhydGΔech. Remarkably, neither of the strains consumed cysteine from the medium, unlike the wild-type. Sulfide can be converted to cysteine by CYSS (cysteine synthase) or homocysteine and then methionine by SHSL2 (succinyl-homoserine succinate lyase) and METS (methionine synthase), but the connection between the cessation of cysteine uptake and sulfate metabolism remains unclear.
Overall, the complex interactions of C. thermocellum metabolic pathways remain challenging to understand and engineer with conventional methods, and hence require a quantitative systems biology approach to decipher. To this end, several genome-scale metabolic models (GSMs) of C. thermocellum have been developed. The first GSM, named iSR432, was constructed for the strain ATCC27405 and applied to identify gene deletion strategies for high ethanol yield (Roberts et al., ). This model was then further curated into iCth446 (Dash et al., ). More recently, Thompson et al. developed the iAT601 genome-scale model (Thompson et al., ) for the strain DSM1313, which is genetically tractable (Argyros et al., ). The iAT601 model was used to identify genetic manipulations for high ethanol, isobutanol, and hydrogen production (Thompson et al., ), and to understand growth cessation prior to substrate depletion observed under high-substrate loading fermentations that simulate industrial conditions (Thompson and Trinh, ). In addition to these core and genome-scale steady-state metabolic models, a kinetic model of central metabolism, k-ctherm118, was recently developed and used to elucidate the mechanisms of nitrogen limitation and ethanol stress (Dash et al., ). Due to the biotechnological relevance of the Clostridium genus, GSMs have also been developed for other species (Dash et al., ), including C. acetobutylicum (Senger and Papoutsakis, ; Salimi et al., ; McAnulty et al., ; Wallenius et al., ; Dash et al., ; Yoo et al., ; Lee and Trinh, ), C. beijerinckii (Milne et al., ), C. butyricum (Serrano-Bermúdez et al., ), C. cellulolyticum (Salimi et al., ), and C. ljungdahlii (Nagarajan et al., ).
In this study, we developed an updated genome-scale metabolic model of C. thermocellum, named iCBI655, with more comprehensive and precise metabolic coverage, enhanced prediction accuracy, and extensive documentation. This model is a human-curated database that coherently represents all the available genetic, genomic, and metabolic knowledge of C. thermocellum from both experimental literature and bioinformatic predictions. Furthermore, the model can be applied not only to enable metabolic flux simulation but also to provide a framework to contextualize disparate datasets at the system level. As a demonstration for the model application, we first developed a quantitative multi-omics integration protocol and used it to fundamentally study redox metabolism and potential redox bottlenecks critical for production of biofuels (e.g., ethanol) in C. thermocellum. Furthermore, we used the model, in combination with the previously developed ModCell tool (Garcia and Trinh, ), to design modular (chassis) cells (Garcia and Trinh, ) for alcohol and ester production.
2. Results
2.1. Development of an Upgraded C. thermocellum Genome-Scale Model Named iCBI655
The iCBI655 model was developed using the published iAT601 model (Thompson et al., ) as a starting point. The model improvements include updated metabolic pathways, new annotation, and new extensive documentation. A detailed account of these changes can be found in the Supplementary Datasheet 1. Here, we highlight the most relevant modifications.
2.1.1. Modeling Updates
To facilitate model usage and reduce human error, the identifiers of reactions and metabolites were converted from KEGG into BiGG human-readable form (King et al., ). Additionally, reaction and metabolite identifiers were linked to the modelSEED database (Henry et al., ) that enables analysis through the KBase web interface (Arkin et al., ). The gene identifiers and functional descriptions were updated to the most current annotation (NCBI Reference Sequence: NC_017304.1). Metabolite formulas and charges from the modelSEED database (Henry et al., ) were included in the model and reactions were systematically corrected for charge and mass balance by the addition of protons and water.
2.1.2. Metabolic Updates
The automated construction process used in the previous model introduced several inconsistencies that were corrected in the current model. We removed reactions that were blocked and non-gene-associated, apparently introduced during automated gap-filling. Two notable examples are (i) the blocked selenate pathway which lacks experimental evidence (e.g., selenoproteins have not been found in C. thermocellum), and (ii) blocked reactions involving molecular oxygen (e.g., oxidation of Fe2+ to Fe3+) that are not possible in strict anaerobes like C. thermocellum. Furthermore, tRNA cycling reactions were unblocked by including tRNA in the biomass reaction (Reimers et al., ). Metabolite isomers were examined and consolidated under the same metabolite identifier when possible, leading to the removal of duplicated reactions and the elimination of gaps. Transport and exchange reactions were updated to reflect the export of amino acids and uptake of pyruvate as observed during fermentation experiments (Holwerda et al., ).
In terms of specific reactions, oxaloaceate decarboxylase was eliminated from the model in accordance with recent findings (Olson et al., ). The stoichiometries of pentose-phospate reactions, including sedoheptulose 1,7-bisphosphate D-glyceraldehyde-3-phosphate-lyase (FBA3) and sedoheptulose 1,7-bisphosphate ppi-dependent phosphofructokinase (PFK3_ppi), were corrected (according to experimental evidence, Rydzak et al., ) from the previous model by ensuring mass balance and avoiding lumping multiple steps into one reaction. Transaldolase (TALA) was removed from the model due to lack of annotation for this gene in C. thermocellum.
Several modifications were also performed in key bioenergetic reactions. The reactions catalyzed by membrane-bound enzymes, including inorganic diphosphatase (PPA) (Zhou et al., ) and membrane-bound ferredoxin-dependent hydrogenase (ECH) (Calusinska et al., ), were corrected to capture proton translocation. Furthermore, hydrogenase reactions were updated to ensure ferredoxin association for all cases and remove those reactions that do not involve ferredoxin and only use NAD(P)H as a cofactor, based on our recent understanding of C. thermocellum metabolism (Biswas et al., ). Gene-protein-reaction associations were updated to represent experimental knowledge. For instance, the hydrogenases BIF (CLO11313_RS09060-09070) and H2ASE (CLO1313_RS12830, CLO1313_RS02840) require the maturase Hyd (CLO1313_RS07925, CLO1313_RS11095, CLO1313_RS12830) to be functional, and the maturase itself requires all of its subunits to operate, which enables accurate representations of hydG deletion genotypes (Biswas et al., ).
Two hypothetical reaction modifications were introduced to ensure consistency with reported phenotypes. First, to enable growth without the need for succinate secretion, as observed in experimental data (Supplementary Datasheet 2), the reaction homoserine-O-trans-acetylase (HSERTA) was added to enable methionine biosynthesis (essential for growth). Although this reaction is not currently known to be associated with any gene in C. thermocellum, it is present as a gene-associated reaction in other Clostridium GSMs (Nagarajan et al., ). Next, the reaction deoxyribose-phosphate aldolase (DRPA) was removed based on a systematic analysis (section 4.4) to ensure correct lethality prediction of the ΔhydGΔechΔpfl mutant strain as well as the correct prediction of growth recovery in this mutant by addition of external electron sinks such as sulfate or ketoisovalerate (Table 1). The correct prediction of ΔhydGΔechΔpfl-associated phenotypes is critical to successfully use the model for computational strain design (Long et al., ; Ng et al., ; Maranas and Zomorrodi, ; Wang and Maranas, ; Garcia and Trinh, ,,).
Table 1
| Gene deletions | Medium | Percent of W.T. growth rate (%) | ||
|---|---|---|---|---|
| iAT601 | iCBI655 | Experiment | ||
| hydg | MTC | 100 | 100 | 73 |
| hydg-ech | MTC | 85 | 85 | 67 |
| hydg-pta-ack | MTC | 100 | 100 | 48 |
| hydG-ech-pfl | MTC | 58 | 0 | 0 |
| hydG-ech-pfl | MTC + fumarate | 377 | 726 | 0 |
| hydG-ech-pfl | MTC + sulfate | 58 | 65 | + |
| hydG-ech-pfl | MTC + ketoisovalerate | 97 | 101 | + |
Comparison of mutant growth rates predicted by iAT601 and iCBI655.
Experimental values are taken from Thompson et al. (); for some mutants whose growth recovery, not growth rate, was reported, they are presented with “+”. W.T., wildtype; MTC, Medium for Thermophilic Clostridia.
2.2. Comparison of iCBI655 Against Other Genome-Scale Models
We compared iCBI655 with the previous GSMs of C. thermocellum and the highly-curated GSM iML1515 of the extensively studied bacterium Escherichia coli (Table 2). The increased number of genes in iCBI655 with respect to iAT601 cover a variety of functions, including hydrogenase chaperones, cellulosome and cellulase, ATP synthase, and transporters. Remarkably, iCBI655 has a smaller percentage of blocked reactions than iAT601, indicating higher biochemical consistency. The number of metabolites in iCBI655 is smaller than those in iAT601 mainly due to the removal of metabolites that did not appear in any reaction, duplicated metabolites (e.g., certain isomers), and blocked pathways added automatically during gap-filling without any gene association. C. thermocellum DSM1313 has 2911 protein coding genes, 22% of which is captured by iCB655, while E. coli MG1655 has 4240 genes, 35% of which is included in iML1515. Overall, iCBI655 has the increased coverage of the metabolic functionality of C. thermocellum but remains far from the well-studied E. coli.
Table 2
| iSR432 | iCth446 | iAT601 | iCBI665 | iML1515 | |
|---|---|---|---|---|---|
| Strain | ATCC27405 | ATCC27405 | DSM1313 | DSM1313 | MG1655 |
| Genes | 432 | 446 | 601 | 665 | 1515 |
| Metabolites | 583 | 599 | 903 | 795 | 1877 |
| Reactions | 632 | 660 | 872 | 854 | 2712 |
| Blocked reactions | 39.2% | 32.1% | 40.8% | 35.1% | 9.8% |
| Reference | Roberts et al., | Dash et al., | Thompson et al., | This study | Monk et al., |
Comparison of all genome-scale metabolic models of C. thermocellum and the latest E. coli model.
2.3. Training of Model Parameters Under Diverse Conditions
Growth and non-growth associated maintenance (GAM and NGAM) are parameters that capture the consumption of ATP toward cell division and homeostasis, respectively. These are known to be condition-specific; however, genome-scale models do not include a mechanistic description that allows to determine these ATP consumption rates as part of the simulation. Instead, GAM is incorporated into the biomass pseudo-reaction and NGAM has its own pseudo-reaction that hydrolyzes ATP at a rate tuned by the constraint parameters.
To increase model prediction accuracy for various conditions, we trained GAM and NGAM parameters of iCBI655 using an extensive dataset of 28 extracellular fluxes (Supplementary Datasheet 2) measured during the growth phase under different reactor configurations, carbon sources, and gene deletion mutants. This approach is based on the method used to train the iML1655 E. coli model (Monk et al., ). Remarkably, we observed highly linear trends under three different conditions, including chemostat reactor with cellobiose as a carbon source, chemostat reactor with cellulose as a carbon source, and batch reactor with either cellobiose or cellulose as a carbon source (Figure 1A). This model training has led to improved growth rate prediction of iCBI655 as compared to iAT601 that has previously been trained with only a smaller dataset (Figure 1B). Specifically, the iAT601 training dataset was limited to batch conditions; hence, the inaccurate predictions of iAT601 were observed for chemostat conditions (Figure 1C).
Figure 1
2.4. Assessment of Model Quality and Standard Compliance With Memote
The field of metabolic network modeling suffers from a lack of standard enforcement and quality control metrics that limit model reproducibility and applicability. To address this issue, Lieven et al. (
2.5. Model-Guided Analysis of Proteomics and Flux Datasets Sheds Light on Redox Metabolism Critical for Biofuel Production in C. thermocellum
For the first application of the genome-scale metabolic model, we aimed to understand the complex redox metabolism and potential redox bottlenecks critical for enhanced biofuel production in C. thermocellum. We used the model as a scaffold to analyze proteomics and metabolic flux data collected for the C. thermocellum wild-type and ΔhydGΔech strains. The ΔhydGΔech mutant was engineered to redirect electron flow from hydrogen to ethanol by removal of primary hydrogenases (Biswas et al.,
2.5.1. Development of Fold Change-Based Omics Integration Protocol
To perform the analysis, we formulated an omics integration protocol anchored at the quantification of fold change (FC) between case and control samples (Figure 2A). In this approach, we first compared FCs between simulated intracellular fluxes and measured omics data. Next, we identified consistent reactions with FCs of the same sign and different from zero in both measured proteomics and simulated fluxes for further analysis (section 4.6).
Figure 2

Multi-omics data integration procedure. (A) Fold change-based multi-omics data integration and analysis protocol. (B) Mapping of proteomic data for the ΔhydGΔech case study to model reactions. (C) Correlation between measured and simulated fold changes (pFBA in blue and FVA in orange) for all 70 consistent reactions of the ΔhydGΔ ech case study.
To start the FC-based omics integration, we obtained measured FCs by mapping the measured proteomics data to 510 out of the 856 reactions in the model through the gene-protein-reaction (GPR) associations (Figure 2B). Then, we identified 70 consistent reactions by comparing measured FCs with two types of simulated FCs: (i) parsimonious flux balance analysis (pFBA) that determines the most efficient flux distribution (assuming all enzymes are roughly as efficient) and (ii) flux variability analysis (FVA) that identifies the feasible flux range of each reaction.
The Pearson correlation coefficients between simulated and measured FCs for the consistent reactions were 0.26 and 0.09 for pFBA and FVA, respectively (Figure 2C). In general, the FVA reaction flux ranges remained mostly unchanged, suggesting that pFBA is a better representation of actual metabolic fluxes as previously observed (Machado and Herrgård,
Figure 3

Metabolic map visualization for (A) redox and hydrogenases pathway, (B) pyruvate node that links glycolysis, incomplete Krebs cycle, anapleurotic pathway, and fermentative pathway, and (C) sulfur metabolism using the Escher tool. Values next to reaction labels correspond to proteomics fold change between the ΔhydGΔech and wild-type strains only for the 70 consistent reactions identified by using the FC-based omics integration protocol (section 2.5). The labels of extracellular metabolites are appended with “_e.” Reactions marked with a red cross are deleted in ΔhydGΔech.
2.5.2. FC-Based Omics Integration Reveals Redirection of Electron Flow for NADPH Supply in ΔhydGΔech Strain
Our identification of the consistent reactions by using the FC-based omics integration protocol revealed coherent indications of increased NADPH biosynthesis in the ΔhydGΔech mutant with respect to the wild-type across three major metabolic areas: (i) an increased protein level of FRNDPR2r (also known as NFN) that converts one mol of reduced ferredoxin (fdxr_42) and one mole of NADH into two moles of NADPH (Figure 3A), (ii) an increased protein level of all three malate shunt enzymes and a decreased protein level of the alternative route PPDK (Figure 3B), and (iii) a decreased protein level of sulfur transporter and of HSOR that oxidizes sulfite into sulfide consuming NADPH (Figure 3C). These observations are consistent with the failure of rnf over-expression to enhance ethanol production (Lo et al.,
2.5.3. Analysis of Simulated Fluxes Reveals the Role of NADPH in Redox Balancing
The analysis based on consistent reactions strongly indicates that NADPH production is important in the ΔhydGΔech mutant to achieve redox balance. However, the pathways oxidizing NADPH remain unknown since not all reactions in the model could be mapped to proteomics measurements and carbon recovery was lower in the mutant strain (Thompson et al.,
Taken altogether, model-guided data analysis illustrates the power of the model as contextualization tool and provides new insights into the redox bottlenecks present in C. thermocellum that are critical in the production of reduced molecules. The integration of omics and fluxes led to the resolution of NADPH as the key cofactor in redox bottleneck of ΔhydGΔech. It helped identify specific pathways that undergo major changes in protein levels, providing interesting target reactions for further engineering. Generally, the developed FC-based omics integration protocol can be applied to different omics data types due to its simplicity. The method does not require one to formulate or assume a quantitative relationship between omics measurements and simulated fluxes. Furthermore, fold change in biomolecule concentrations implemented in the method is currently much easier to measure in a quantitatively reliable manner for many molecules than case-specific absolute concentrations.
2.6. Model-Guided Design of Modular Production Strains for Biofuel Synthesis
Another common application of genome-scale models is strain design (Long et al.,
To design C. thermocellum modular cells, we first evaluated a range of design parameters α and β with an increasing number of genetic manipulations (Figure 4A). As expected, increasing the number of deletions leads to more compatible designs, at the expense of more complexity in the implementation. We selected an intermediate point of α = 6, β = 0 for further analysis. This Pareto front is composed of 12 designs that can be clustered into two groups (Figure 4B). The first group (e.g., designs 3, 8, and 9) are compatible with all products except butanol and its derived esters, whereas the second group (e.g., designs 1, 2, 10, and 12) have high objective values for butanol and its derived esters.
Figure 4

Modular cell designs for biosynthesis of 12 alcohols and esters. (A) Module compatibility for various design parameters. (B) Pareto front for parameters α = 6, β = 0. (C) Pareto set for parameters α = 6, β = 0. Reaction names and formulas are included in Table 3. (D) Feasible phenotypic spaces for select designs.
To understand the characteristics of each group, we can inspect the deletions of each design (Figure 4C, Table 3). Designs 3, 8, and 9 all have in common H2ASE_syn, GLUDy, PPDK, and FRNDPR2r deletion, while the last two deletions never appear in design 1, 2, 10, or 12. The majority of deletion targets are central metabolic reactions (Table 3). The common targets include deletion of hydrogenases that appear in the cluster of designs 2, 4, 7, 10, 11, and 12 with the ΔhydGΔech genotype discussed earlier or removal of reactions that form fermentative byproducts such as ALCD2x and ACALD (ethanol), PFL (formate), LDH_L (lactate). Interestingly, ACKr or PTA (acetate) does not appear in this list, likely because acetate production can serve as a regulatory valve for redox metabolism, especially in a modular cell that must be compatible with products of diverse degrees of reduction.
Table 3
| ID | Name | Formula | Counts (%) |
|---|---|---|---|
| PGM | Phosphoglycerate mutase | 2pg_c ↔ 3pg_c | 75 |
| H2ASE_syn | Bidirectional [NiFe] Hydrogenase (Fe-H2) | h2_c + nadp_c ↔ h_c + nadph_c | 75 |
| ECH | (FeFe)-hydrogenase, ferredoxin dependent, membrane-bound | 2.0 fdxr_42_c + 3.0 h_c ↔ 2.0 fdxo_42_c + h2_c + h_e | 66.7 |
| BIF | Bifurcating Hydrogenase | 2.0 fdxr_42_c + 3.0 h_c + nadh_c ↔ 2.0 fdxo_42_c + 2.0 h2_c + nad_c | 66.7 |
| GLUDy | Glutamate dehydrogenase (NADP) | glu__L_c + h2o_c + nadp_c ↔ akg_c + h_c + nadph_c + nh4_c | 50 |
| FRNDPR2r | Ferredoxin: nadp reductase (NFN) | 2.0 fdxr_42_c + h_c + nadh_c + 2.0 nadp_c ↔ 2.0 fdxo_42_c + nad_c + 2.0 nadph_c | 41.7 |
| RNF | Ferredoxin:NAD oxidoreductase (membrane bound) | 2.0 fdxr_42_c + 2.0 h_c + nad_c ↔ 2.0 fdxo_42_c + h_e + nadh_c | 33.3 |
| PEPCK_re | Phosphoenolpyruvate carboxykinase (GTP) | co2_c + gdp_c + pep_c → gtp_c + oaa_c | 33.3 |
| ALCD2x | Alcohol dehydrogenase (ethanol) | acald_c + h_c + nadh_c → etoh_c + nad_c | 25 |
| ACALD | Acetaldehyde dehydrogenase (acetylating) | accoa_c + h_c + nadh_c → acald_c + coa_c + nad_c | 25 |
| PPDK | Pyruvate, phosphate dikinase | amp_c + 2.0 h_c + pep_c + ppi_c → atp_c + pi_c + pyr_c | 25 |
| GLUSy | Glutamate synthase (NADPH) | akg_c + gln__L_c + h_c + nadph_c → 2.0 glu__L_c + nadp_c | 16.7 |
| PFL | Pyruvate formate lyase | coa_c + pyr_c → accoa_c + for_c | 16.7 |
| LDH_L | L-lactate dehydrogenase | h_c + nadh_c + pyr_c → lac__L_c + nad_c | 16.7 |
| POR | Pyruvate-ferredoxin oxidoreductase | coa_c + 2.0 fdxo_42_c + pyr_c → accoa_c + co2_c + 2.0 fdxr_42_c + h_c | 8.3 |
| CEPA | Cellobiose phosphorylase | cellb_c + pi_c → g1p_c + glc__D_c | 8.3 |
| GMPS | GMP synthase | atp_c + nh4_c + xmp_c → amp_c + gmp_c + 3.0 h_c + ppi_c | 8.3 |
| AHSL | O-Acetyl-L-homoserine succinate-lyase | achms_c + cys__L_c ↔ ac_c + cyst_L_c + h_c | 8.3 |
Reaction deletions sorted by appearance frequency (counts) in the designs of the Pareto front for α = 6, β = 0.
More interestingly, we also found important branch-point deletion reactions (Stephanopoulos and Vallino,
Two representative designs from the groups mentioned earlier are 3 and 12. Their feasible growth and production phenotypes reveal a tight coupling between product formation and growth rate (Figure 4D). This phenotype enables pathway optimization through adaptive laboratory evolution, as previously done for ethanol (Tian et al.,
3. Conclusions
In this study, we developed a genome-scale metabolic model of the biotechnologically relevant organism C. thermocellum. Model development followed standards and best practices to ensure reproducibility and accessibility. We demonstrated the enhanced predictions of the model for diverse fermentation conditions and gene lethality. Genome-scale models have a broad range of applications in systems biology, including metabolic engineering, physiological discovery, phenotype interpretation, and studies of evolutionary processes (Feist and Palsson,
4. Methods
4.1. Model Curation
The genome scale model iCBI655 was constructed from iAT601 (Thompson et al.,
4.2. Metabolic Flux Simulations
Constraint-based metabolic network modeling (Palsson,
Here and are the sets of metabolites and reactions in the model, respectively, and vjk is the metabolic flux (mmol/gCDW/h) through reaction j in the simulation condition k. Constraint (1) enforces mass balance for all metabolites in the network, where Sij represents the stoichiometric coefficient of metabolite i in reaction j. Constraint (2) enforces lower and upper bounds ljk and ujk, respectively, for each reaction j in the network.
In different simulation conditions, k, Sij remains fixed given the structure of the network for all . However, certain bounds ujk and ljk are modified to represent specific metabolic constraints. For example, to apply measured reaction fluxes such as in the case of GAM and NGAM calculation or the omics integration protocol (section 4.6), ljk and ujk are specified using the experimentally measured average (μjk) and standard deviation (σjk), which for normally distributed samples with 3 replicates produces an interval with a confidence level above 90% (3-4). Similarly, to represent a certain gene deletion mutant k, the bounds are set to be ujk = ljk = 0 for the associated reaction j.
The feasible flux space Ωk can be explored in different ways; (Trinh et al.,
Here cj is the coefficient of reaction j in the linear objective function, which is changed according to the simulation context. For example, to train GAM and NGAM (Figure 1A) the objective was set to maximize flux through the ATP hydrolysis reaction, i.e., cj = 1 for j corresponding to ATP hydrolysis reaction, and 0 otherwise. To evaluate growth prediction accuracy (Figures 1B,C), the objective was set to maximize growth, i.e., cj = 1 for j corresponding to growth pseudo-reaction and 0 otherwise.
4.3. Simulation of Different Growth Environments
The model is configured to generally represent different medium and reactor conditions by modifying three features. The first feature involves model boundaries specifying which metabolites may enter the intracellular environment (i.e., present in the growth medium) or may exit the intracellular environment (i.e., secreted by C. thermocellum). This feature can be adjusted through ujk and ljk for exchange reactions. In our simulations, only essential metabolites required for in silico growth may be consumed and only commonly observed metabolites may be produced, unless otherwise noted. The second feature involves biomass objective function. iCBI655 contains 3 possible biomass reactions: (i) BIOMASS_CELLOBIOSE used for growth in cellobiose with cellulosan constituting 2% of cell dry weight (CDW) (Zhang and Lynd,
For growth on cellulose, the experimentally measured glucose-equivalent uptake was represented in the model through the following pseudo-reactions: 3 glceq_e → cell3_e; 4 glceq_e → cell4_e; 5 glceq_e → cell5_e; and 6 glceq_e → cell6_e. Here, cell3_e, cell4_e, cell5_e, and cell6_e are cellodextrin polymers with 3, 4, 5, and 6 glucose monomers, respectively. These polymers can be imported inside the cell through the oligo-cellulose transport ABC system. The model is free to use any cellodextrin length, although utilization of longer cellodextrins results in higher ATP yield (Zhang and Lynd,
4.4. Single-Reaction Deletion Analysis to Match Experimentally Observed Phenotype
A core model of C. thermocellum (Thompson et al.,
4.5. Model Comparison
The C. thermocellum and E. coli models were obtained from their respective publications in SBML format. Blocked reactions were calculated by allowing all exchange reactions to have an unconstrained flux (i.e., lbjk = −1, 000, ubjk = 1, 000∀j ∈ Exchange). This procedure enables the most general scenario which produces the smallest number of blocked reactions in each model. Additional details can be found in Supplementary Datasheet 1.
4.6. Omics Integration Protocol
The omics integration protocol developed in this study consists of three steps: (i) simulation of fold changes, (ii) mapping of measured gene fold changes to reactions, and (iii) comparison of measured and simulated fold changes.
4.6.1. Calculation of Simulated Fold Changes
To simulate metabolic fluxes, lower and upper bounds (2) are constrained according to experimental data as described in section 4.2. Then, for the pFBA method, a quadratic optimization problem (6) is solved, leading to a unique flux distribution .
For the FVA method, a sequence of linear programming problems is solved where each flux is minimized (7) and maximized (8):
Note that for computation we applied the loop-less FVA method (Schellenberger et al.,
FVA produces a flux range for each reaction . To compare between states k (e.g., wild-type and mutant), we define the FVA center, a scalar variable that generally indicates a change in this range (9).
The FVA center is a heuristic analysis with the main purpose of determining whether a reaction exhibits an upward shift (center increase) or a downward shift (center decrease) between two conditions k. It should be emphasized that the FVA center, , does not attempt to quantify the fraction of overlap between ranges nor to identify what type of shift might occur from all possible permutations. Unlike , does not necessarily represent a feasible flux distribution of Ωk. Furthermore, the FVA center could potentially fail to capture hypothetical permutations of fluxes. Despite these considerations, the FVA center remains a useful heuristic to analyze simulated fold changes.
Finally, to determine the fold change for either pFBA or FVA simulated fluxes, the conventional procedure for fold change calculation in omics data is emulated. First, values are floored to avoid very large (or infinite) fold changes in cases with very small magnitude change. This is accomplished through a flooring piece-wise function (10), where ϵ = 0.0001 is the minimum value and x is an arbitrary scalar variable.
Then, the fluxes are normalized to the substrate uptake rate vuptake, k and fold change is calculated in log2 space (11).
4.6.2. Calculation of Measured Fold Changes
Fold change between case and control samples, FCl, is calculated in log2 space for each gene , where is the set of genes in the model. These gene fold changes can be mapped to metabolic reaction fold changes using the gene-protein reaction associations (GPR), given as the set of genes with FCl ≠ 0 in the GPR of reaction j:
4.6.3. Identification of Consistent Fold Changes
A reaction j is said to have a consistent fold change if the measured fold change has the same sign of at least one of the simulated fold changes, more formally:
where is the set of consistent reactions which is considered for further analysis and the simulated fold changes are re-defined for brevity (14-15).
4.7. Software Implementation
Model development was performed using Python and Jupyter notebooks with open-source Python libraries including cobrapy (Ebrahim et al.,
4.8. Proteomics Data Collection
C. thermocellum wild-type and ΔhydGΔech strains were cultured in batch reactors and metabolic fluxes were calculated as previously described (Thompson et al.,
For each LC-MS/MS run, 25μg of peptides were loaded via pressure cell onto a biphasic MudPIT column for online 2D HPLC separation and concurrent analysis via nanospray MS/MS using a LTQ-Orbitrap XL mass spectrometer (Thermo Scientific) operating in data-dependent acquisition (one full scan at 15 k resolution followed by 10 MS/MS scans in the LTQ, all one μscan; monoisotopic precursor selection; rejection of analytes with an undecipherable charge; dynamic exclusion = 30 s) (Giannone et al.,
Eleven salt cuts (25, 30, 35, 40, 45, 50, 65, 80, 100, 175, and 500 mM ammonium acetate) were performed per sample run with each followed by 120 min organic gradient to separate peptides.
Resultant peptide fragmentation spectra (MS/MS) were searched against the C. thermocellum DSM1313 proteome database concatenated with common contaminants and reversed sequences to control false-discovery rates using MyriMatch v.2.1. (Tabb et al.,
All raw and database-searched LC-MS/MS data pertaining to this study have been deposited into the MassIVE proteomic data repository and have been assigned the following accession numbers: MSV000084488 (MassIVE) and PXD015973 (ProteomeXchange). Data files are available upon publication (ftp://massive.ucsd.edu/MSV000084488/).
4.9. Modular Cell Design
The ModCell formulation, computational algorithm, and implementation followed the previous reports (Garcia and Trinh,
Statements
Data availability statement
All raw and database-searched LC-MS/MS data pertaining to this study have been deposited into the MassIVE proteomic data repository and have been assigned the following accession numbers: MSV000084488 (MassIVE) and PXD015973 (ProteomeXchange). Data files are available upon publication (ftp://massive.ucsd.edu/MSV000084488/).
Author contributions
CT managed the project. SG and CT conceived the study, designed experiments, and analyzed the data. SG, RT, RG, and SD performed the experiments. SG prepared the draft with co-authors' inputs. All read, wrote, and approved the final draft.
Funding
This research was financially supported in part by the NSF CAREER award (NSF#1553250 to CT) and by The Center of Bioenergy Innovation (CBI), U.S. Department of Energy Bioenergy Research Center supported by the Office of Biological and Environmental Research in the DOE Office of Science (to CT and CM). The funders had no role in this study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fbioe.2020.00772/full#supplementary-material
Supplementary Datasheet 1Software used to develop, configure, and analyze iCBI655.
Supplementary Datasheet 2Flux dataset used to train the iCBI655 model and proteomics dataset for the wild-type and ΔhydGΔech strains.
Supplementary Datasheet 3Supplementary tables.
Supplementary Datasheet 4iCBI655 model in various formats for cellobiose growth conditions and map of central metabolic pathways in Escher format.
References
1
AkinoshoH.YeeK.CloseD.RagauskasA. (2014). The emergence of Clostridium thermocellum as a high utility candidate for consolidated bioprocessing applications. Front. Chem.2:66. 10.3389/fchem.2014.00066
2
ArgyrosD. A.TripathiS. A.BarrettT. F.RogersS. R.FeinbergL. F.OlsonD. G.et al. (2011). High ethanol titers from cellulose by using metabolically engineered thermophilic, anaerobic microbes. Appl. Environ. Microbiol. 77, 8288–8294. 10.1128/AEM.00646-11
3
ArkinA. P.CottinghamR. W.HenryC. S.HarrisN. L.StevensR. L.MaslovS.et al. (2018). KBase: the United States department of energy systems biology knowledgebase. Nat. Biotechnol. 36, 566–569. 10.1038/nbt.4163
4
BiswasR.WilsonC. M.GiannoneR. J.KlingemanD. M.RydzakT.ShahM. B.et al. (2017). Improved growth rate in Clostridium thermocellum hydrogenase mutant via perturbed sulfur metabolism. Biotechnol. Biofuels10:6. 10.1186/s13068-016-0684-x
5
BiswasR.ZhengT.OlsonD. G.LyndL. R.GussA. M. (2015). Elimination of hydrogenase active site assembly blocks h 2 production and increases ethanol yield in Clostridium thermocellum. Biotechnol. Biofuels8:20. 10.1186/s13068-015-0204-4
6
BlazeckJ.AlperH. (2010). Systems metabolic engineering: genome-scale models and beyond. Biotechnol. J. 5, 647–659. 10.1002/biot.200900247
7
BordelS.AgrenR.NielsenJ. (2010). Sampling the solution space in genome-scale metabolic networks reveals transcriptional regulation in key enzymes. PLoS Comput. Biol. 6:e1000859. 10.1371/journal.pcbi.1000859
8
CalusinskaM.HappeT.JorisB.WilmotteA. (2010). The surprising diversity of clostridial hydrogenases: a comparative genomic perspective. Microbiology156, 1575–1588. 10.1099/mic.0.032771-0
9
ChanS. H.WangL.DashS.MaranasC. D. (2018). Accelerating flux balance calculations in genome-scale metabolic models by localizing the application of loopless constraints. Bioinformatics34, 4248–4255. 10.1093/bioinformatics/bty446
10
DashS.KhodayariA.ZhouJ.HolwerdaE. K.OlsonD. G.LyndL. R.et al. (2017). Development of a core Clostridium thermocellum kinetic metabolic model consistent with multiple genetic perturbations. Biotechnol. Biofuels10:108. 10.1186/s13068-017-0792-2
11
DashS.MuellerT. J.VenkataramananK. P.PapoutsakisE. T.MaranasC. D. (2014). Capturing the response of Clostridium acetobutylicum to chemical stressors using a regulated genome-scale metabolic model. Biotechnol. Biofuels7:144. 10.1186/s13068-014-0144-4
12
DashS.NgC. Y.MaranasC. D. (2016). Metabolic modeling of clostridia: current developments and applications. FEMS Microbiol. Lett. 363, 1–10. 10.1093/femsle/fnw004
13
DashS.OlsonD. G.ChanS. H. J.Amador-NoguezD.LyndL. R.MaranasC. D. (2019). Thermodynamic analysis of the pathway for ethanol production from cellobiose in Clostridium thermocellum. Metab. Eng. 55, 161–169. 10.1016/j.ymben.2019.06.006
14
DengY.OlsonD. G.ZhouJ.HerringC. D.ShawA. J.LyndL. R. (2013). Redirecting carbon flux through exogenous pyruvate kinase to achieve high ethanol yields in Clostridium thermocellum. Metab. Eng. 15, 151–158. 10.1016/j.ymben.2012.11.006
15
EbrahimA.BrunkE.TanJ.O'brienE. J.KimD.SzubinR.et al. (2016). Multi-omic data integration enables discovery of hidden biological regularities. Nat. Commun. 7:13091. 10.1038/ncomms13091
16
EbrahimA.LermanJ. A.PalssonB. O.HydukeD. R. (2013). Cobrapy: constraints-based reconstruction and analysis for python. BMC Syst. Biol. 7:74. 10.1186/1752-0509-7-74
17
FeistA. M.PalssonB. Ø. (2008). The growing scope of applications of genome-scale metabolic reconstructions using Escherichia coli. Nat. Biotechnol. 26:659. 10.1038/nbt1401
18
GarciaS.TrinhC. T. (2019a). Comparison of multi-objective evolutionary algorithms to solve the modular cell design problem for novel biocatalysis. Processes7:361. 10.3390/pr7060361
19
GarciaS.TrinhC. T. (2019b). Modular design: implementing proven engineering principles in biotechnology. Biotechnol. Adv. 37:107403. 10.1016/j.biotechadv.2019.06.002
20
GarciaS.TrinhC. T. (2019c). Multiobjective strain design: a framework for modular cell engineering. Metab. Eng. 51, 110–120. 10.1016/j.ymben.2018.09.003
21
GarciaS.TrinhC. T. (2020). Harnessing natural modularity of cellular metabolism to design a modular chassis cell for a diverse class of products by using goal attainment optimization. ACS Synth. Biol.9, 1665–1681. 10.1021/acssynbio.9b00518
22
GiannoneR. J.WurchL. L.HeimerlT.MartinS.YangZ.HuberH.et al. (2015a). Life on the edge: functional genomic response of Ignicoccus hospitalis to the presence of Nanoarchaeum equitans. ISME J. 9:101. 10.1038/ismej.2014.112
23
GiannoneR. J.WurchL. L.PodarM.HettichR. L. (2015b). Rescuing those left behind: recovering and characterizing underdigested membrane and hydrophobic proteins to enhance proteome measurement depth. Anal. Chem. 87, 7720–7728. 10.1021/acs.analchem.5b01187
24
HenryC. S.DeJonghM.BestA. A.FrybargerP. M.LinsayB.StevensR. L. (2010). High-throughput generation, optimization and analysis of genome-scale metabolic models. Nat. Biotechnol. 28:977. 10.1038/nbt.1672
25
HolwerdaE. K.ThorneP. G.OlsonD. G.Amador-NoguezD.EngleN. L.TschaplinskiT. J.et al. (2014). The exometabolome of Clostridium thermocellum reveals overflow metabolism at high cellulose loading. Biotechnol. Biofuels7:155. 10.1186/s13068-014-0155-1
26
HsiangS. M.MengK. C.CaneM. A. (2011). Civil conflicts are associated with the global climate. Nature476:438. 10.1038/nature10311
27
KanehisaM.GotoS. (2000). KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 28, 27–30. 10.1093/nar/28.1.27
28
KingZ. A.LuJ.DrägerA.MillerP.FederowiczS.LermanJ. A.et al. (2015). Bigg models: a platform for integrating, standardizing and sharing genome-scale models. Nucleic Acids Res. 44, D515-D522. 10.1093/nar/gkv1049
29
KridelbaughD. M.NelsonJ.EngleN. L.TschaplinskiT. J.GrahamD. E. (2013). Nitrogen and sulfur requirements for Clostridium thermocellum and Caldicellulosiruptor bescii on cellulosic substrates in minimal nutrient media. Bioresour. Technol. 130, 125–135. 10.1016/j.biortech.2012.12.006
30
LeeJ.-W.TrinhC. T. (2019). Microbial biosynthesis of lactate esters. Biotechnol. Biofuels12:226. 10.1186/s13068-019-1563-z
31
LievenC.BeberM. E.OlivierB. G.BergmannF. T.AtamanM.BabaeiP.et al. (2020). Memote for standardized genome-scale metabolic model testing. Nat. Biotechnol. 38, 272–276. 10.1038/s41587-020-0446-y
32
LinP. P.MiL.MoriokaA. H.YoshinoK. M.KonishiS.XuS. C.et al. (2015). Consolidated bioprocessing of cellulose to isobutanol using Clostridium thermocellum. Metab. Eng. 31, 44–52. 10.1016/j.ymben.2015.07.001
33
LoJ.OlsonD. G.MurphyS. J.-L.TianL.HonS.LanahanA.et al. (2017). Engineering electron metabolism to increase ethanol production in Clostridium thermocellum. Metab. Eng. 39, 71–79. 10.1016/j.ymben.2016.10.018
34
LoderA. J.ZeldesB. M.GarrisonG. D.LipscombG. L.AdamsM. W.KellyR. M. (2015). Alcohol selectivity in a synthetic thermophilic n-butanol pathway is driven by biocatalytic and thermostability characteristics of constituent enzymes. Appl. Environ. Microbiol. 81, 7187–7200. 10.1128/AEM.02028-15
35
LongM. R.OngW. K.ReedJ. L. (2015). Computational methods in metabolic engineering for strain design. Curr. Opin. Biotechnol. 34, 135–141. 10.1016/j.copbio.2014.12.019
36
LuH.LiF.SánchezB. J.ZhuZ.LiG.DomenzainI.et al. (2019). A consensus S. cerevisiae metabolic model yeast8 and its ecosystem for comprehensively probing cellular metabolism. Nat. Commun. 10, 1–13. 10.1038/s41467-019-11581-3
37
MaZ.-Q.DasariS.ChambersM. C.LittonM. D.SobeckiS. M.ZimmermanL. J.et al. (2009). Idpicker 2.0: Improved protein assembly with high discrimination peptide identification filtering. J. Proteome Res. 8, 3872–3881. 10.1021/pr900360j
38
MachadoD.HerrgårdM. (2014). Systematic evaluation of methods for integration of transcriptomic data into constraint-based models of metabolism. PLoS Comput. Biol. 10:e1003580. 10.1371/journal.pcbi.1003580
39
MaranasC. D.ZomorrodiA. R. (2016). Optimization Methods in Metabolic Networks. Hoboken, NJ: John Wiley & Sons. 10.1002/9781119188902
40
McAnultyM. J.YenJ. Y.FreedmanB. G.SengerR. S. (2012). Genome-scale modeling using flux ratio constraints to enable metabolic engineering of clostridial metabolism in silico. BMC Syst. Biol. 6:42. 10.1186/1752-0509-6-42
41
MilneC. B.EddyJ. A.RajuR.ArdekaniS.KimP.-J.SengerR. S.et al. (2011). Metabolic network reconstruction and genome-scale model of butanol-producing strain Clostridium beijerinckii ncimb 8052. BMC Syst. Biol. 5:130. 10.1186/1752-0509-5-130
42
MonkJ. M.LloydC. J.BrunkE.MihN.SastryA.KingZ.et al. (2017). IML1515, a knowledgebase that computes Escherichia coli traits. Nat. Biotechnol. 35:904. 10.1038/nbt.3956
43
NagarajanH.SahinM.NogalesJ.LatifH.LovleyD. R.EbrahimA.et al. (2013). Characterizing acetogenic metabolism using a genome-scale metabolic reconstruction of Clostridium ljungdahlii. Microbial cell factories12:118. 10.1186/1475-2859-12-118
44
NgC. Y.KhodayariA.ChowdhuryA.MaranasC. D. (2015). Advances in de novo strain design using integrated systems and synthetic biology tools. Curr. Opin. Chem. Biol. 28, 105–114. 10.1016/j.cbpa.2015.06.026
45
OlsonD. G.HörlM.FuhrerT.CuiJ.ZhouJ.MaloneyM. I.et al. (2017). Glycolysis without pyruvate kinase in Clostridium thermocellum. Metab. Eng. 39, 169–180. 10.1016/j.ymben.2016.11.011
46
OlsonD. G.McBrideJ. E.ShawA. J.LyndL. R. (2012). Recent progress in consolidated bioprocessing. Curr. Opin. Biotechnol. 23, 396–405. 10.1016/j.copbio.2011.11.026
47
PalssonB. Ø. (2015). Systems Biology: Constraint-Based Reconstruction and Analysis. Cambridge: Cambridge University Press. 10.1017/CBO9781139854610
48
PapanekB.BiswasR.RydzakT.GussA. M. (2015). Elimination of metabolic pathways to all traditional fermentation products increases ethanol yields in Clostridium thermocellum. Metab. Eng. 32, 49–54. 10.1016/j.ymben.2015.09.002
49
PetersN. K. (2018). Bioenergy Research Centers. Technical report, USDOE Office of Science (SC), Washington, DC. 10.2172/1471709
50
RagauskasA. J.WilliamsC. K.DavisonB. H.BritovsekG.CairneyJ.EckertC. A.et al. (2006). The path forward for biofuels and biomaterials. Science311, 484–489. 10.1126/science.1114736
51
ReimersA.-M.LindhorstH.WaldherrS. (2017). A protocol for generating and exchanging (genome-scale) metabolic resource allocation models. Metabolites7:47. 10.3390/metabo7030047
52
RobertsS. B.GowenC. M.BrooksJ. P.FongS. S. (2010). Genome-scale metabolic analysis of clostridium thermocellum for bioethanol production. BMC Syst. Biol. 4:31. 10.1186/1752-0509-4-31
53
RydzakT.McQueenP. D.KrokhinO. V.SpicerV.EzzatiP.DwivediR. C.et al. (2012). Proteomic analysis of Clostridium thermocellum core metabolism: relative protein expression profiles and growth phase-dependent changes in protein expression. BMC Microbiol. 12:214. 10.1186/1471-2180-12-214
54
SalimiF.ZhuangK.MahadevanR. (2010). Genome-scale metabolic modeling of a clostridial co-culture for consolidated bioprocessing. Biotechnol. J. 5, 726–738. 10.1002/biot.201000159
55
SchellenbergerJ.LewisN. E.PalssonB. Ø. (2011). Elimination of thermodynamically infeasible loops in steady-state metabolic models. Biophys. J. 100, 544–553. 10.1016/j.bpj.2010.12.3707
56
SengerR. S.PapoutsakisE. T. (2008). Genome-scale model for Clostridium acetobutylicum: Part I. Metabolic network resolution and analysis. Biotechnol. Bioeng. 101, 1036–1052. 10.1002/bit.22010
57
SeoH.LeeJ.-W.GarciaS.TrinhC. T. (2019). Single mutation at a highly conserved region of chloramphenicol acetyltransferase enables isobutyl acetate production directly from cellulose by Clostridium thermocellum at elevated temperatures. Biotechnol. Biofuels12:245. 10.1186/s13068-019-1583-8
58
SeoH.NicelyP. N.TrinhC. T. (2020). Endogenous carbohydrate esterases of Clostridium thermocellum are identified and disrupted for enhanced isobutyl acetate production from cellulose. Biotechnol. Bioeng. 117, 2223–2236. 10.1002/bit.27360
59
Serrano-BermúdezL. M.BarriosA. F. G.MaranasC. D.MontoyaD. (2017). Clostridium butyricum maximizes growth while minimizing enzyme usage and ATP production: metabolic flux distribution of a strain cultured in glycerol. BMC Syst. Biol. 11:58. 10.1186/s12918-017-0434-0
60
ShafieeS.TopalE. (2009). When will fossil fuel reserves be diminished?Energy Policy37, 181–189. 10.1016/j.enpol.2008.08.016
61
StephanopoulosG.VallinoJ. J. (1991). Network rigidity and metabolic engineering in metabolite overproduction. Science252, 1675–1681. 10.1126/science.1904627
62
SzegezdiJ.CsizmadiaF. (2007). “Method for calculating the PKA values of small and large molecules,” in Abstracts of Papers of The American Chemical Society, Vol. 233 (Washington, DC: Amer Chemical Soc).
63
TabbD. L.FernandoC. G.ChambersM. C. (2007). Myrimatch: highly accurate tandem mass spectral peptide identification by multivariate hypergeometric analysis. J. Proteome Res. 6, 654–661. 10.1021/pr0604054
64
TavernerT.KarpievitchY. V.PolpitiyaA. D.BrownJ. N.DabneyA. R.AndersonG. A.et al. (2012). Danter: an extensible r-based tool for quantitative analysis of-omics data. Bioinformatics28, 2404–2406. 10.1093/bioinformatics/bts449
65
ThieleI.PalssonB. Ø. (2010). A protocol for generating a high-quality genome-scale metabolic reconstruction. Nat. Protoc. 5:93. 10.1038/nprot.2009.203
66
ThompsonR. A.DahalS.GarciaS.NookaewI.TrinhC. T. (2016). Exploring complex cellular phenotypes and model-guided strain design with a novel genome-scale metabolic model of Clostridium thermocellum DSM 1313 implementing an adjustable cellulosome. Biotechnol. Biofuels9:194. 10.1186/s13068-016-0607-x
67
ThompsonR. A.LaytonD. S.GussA. M.OlsonD. G.LyndL. R.TrinhC. T. (2015). Elucidating central metabolic redox obstacles hindering ethanol production in Clostridium thermocellum. Metab. Eng. 32, 207–219. 10.1016/j.ymben.2015.10.004
68
ThompsonR. A.TrinhC. T. (2017). Overflow metabolism and growth cessation in Clostridium thermocellum DSM1313 during high cellulose loading fermentations. Biotechnol. Bioeng. 114, 2592–2604. 10.1002/bit.26374
69
TianL.PapanekB.OlsonD. G.RydzakT.HolwerdaE. K.ZhengT.et al. (2016). Simultaneous achievement of high ethanol yield and titer in Clostridium thermocellum. Biotechnol. Biofuels9:116. 10.1186/s13068-016-0528-8
70
TrinhC. T. (2012). Elucidating and reprogramming Escherichia coli metabolisms for obligate anaerobic n-butanol and isobutanol production. Appl. Microbiol. Biotechnol. 95, 1083–1094. 10.1007/s00253-012-4197-7
71
TrinhC. T.LiuY.ConnerD. J. (2015). Rational design of efficient modular cells. Metab. Eng. 32, 220–231. 10.1016/j.ymben.2015.10.005
72
TrinhC. T.WlaschinA.SriencF. (2009). Elementary mode analysis: a useful metabolic pathway analysis tool for characterizing cellular metabolism. Appl. Microbiol. Biotechnol. 81, 813–826. 10.1007/s00253-008-1770-1
73
WalleniusJ.ViikiläM.SurvaseS.OjamoH.EerikäinenT. (2013). Constraint-based genome-scale metabolic modeling of Clostridium acetobutylicum behavior in an immobilized column. Bioresour. Technol. 142, 603–610. 10.1016/j.biortech.2013.05.085
74
WangL.MaranasC. D. (2018). Mingenome: an in silico top-down approach for the synthesis of minimized genomes. ACS Synth. Biol. 7, 462–473. 10.1021/acssynbio.7b00296
75
YimH.HaselbeckR.NiuW.Pujol-BaxleyC.BurgardA.BoldtJ.et al. (2011). Metabolic engineering of Escherichia coli for direct production of 1, 4-butanediol. Nat. Chem. Biol. 7:445. 10.1038/nchembio.580
76
YooM.Bestel-CorreG.CrouxC.RiviereA.Meynial-SallesI.SoucailleP. (2015). A quantitative system-scale characterization of the metabolism of Clostridium acetobutylicum. MBio6:e01808-15. 10.1128/mBio.01808-15
77
ZhangY.-H. P.LyndL. R. (2005). Cellulose utilization by Clostridium thermocellum: bioenergetics and hydrolysis product assimilation. Proc. Natl. Acad. Sci. U.S.A. 102, 7321–7325. 10.1073/pnas.0408734102
78
ZhouJ.OlsonD. G.ArgyrosD. A.DengY.van GulikW. M.van DijkenJ. P.et al. (2013). Atypical glycolysis in Clostridium thermocellum. Appl. Environ. Microbiol. 79, 3000–3008. 10.1128/AEM.04037-12
Summary
Keywords
Clostridium thermocellum, biofuels, genome-scale model, metabolic model, omics integration, modular cell design, ModCell
Citation
Garcia S, Thompson RA, Giannone RJ, Dash S, Maranas CD and Trinh CT (2020) Development of a Genome-Scale Metabolic Model of Clostridium thermocellum and Its Applications for Integration of Multi-Omics Datasets and Computational Strain Design. Front. Bioeng. Biotechnol. 8:772. doi: 10.3389/fbioe.2020.00772
Received
02 April 2020
Accepted
18 June 2020
Published
21 August 2020
Volume
8 - 2020
Edited by
Young-Mo Kim, Pacific Northwest National Laboratory (DOE), United States
Reviewed by
Karsten Zengler, University of California, San Diego, United States; Esteban Marcellin, The University of Queensland, Australia
Updates

Check for updates
Copyright
© 2020 Garcia, Thompson, Giannone, Dash, Maranas and Trinh.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Cong T. Trinh ctrinh@utk.edu
†Present address: R. Adam Thompson, Quantitative Translational Pharmacology, DMPK-BA, Abbvie Inc., North Chicago, IL, United States
This article was submitted to Synthetic Biology, a section of the journal Frontiers in Bioengineering and Biotechnology
Disclaimer
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.