Integration of a constraint-based metabolic model of Brassica napus developing seeds with 13C-metabolic flux analysis

Hay, Jordan O.; Shi, Hai; Heinzel, Nicolas; Hebbelmann, Inga; Rolletschek, Hardy; Schwender, Jorg

doi:10.3389/fpls.2014.00724

ORIGINAL RESEARCH article

Front. Plant Sci., 19 December 2014

Sec. Plant Systems and Synthetic Biology

Volume 5 - 2014 | https://doi.org/10.3389/fpls.2014.00724

This article is part of the Research TopicLarge-scale metabolic network models.View all 11 articles

Integration of a constraint-based metabolic model of Brassica napus developing seeds with ¹³C-metabolic flux analysis

Jordan O. Hay¹

Hai Shi¹

Nicolas Heinzel²

Inga Hebbelmann¹

Hardy Rolletschek²

Jorg Schwender¹^*

¹Biological, Environment and Climate Sciences Department, Brookhaven National Laboratory, Upton, NY, USA
²Department of Molecular Genetics, Leibniz-Institut für Pflanzengenetik und Kulturpflanzenforschung, Gatersleben, Germany

The use of large-scale or genome-scale metabolic reconstructions for modeling and simulation of plant metabolism and integration of those models with large-scale omics and experimental flux data is becoming increasingly important in plant metabolic research. Here we report an updated version of bna572, a bottom-up reconstruction of oilseed rape (Brassica napus L.; Brassicaceae) developing seeds with emphasis on representation of biomass-component biosynthesis. New features include additional seed-relevant pathways for isoprenoid, sterol, phenylpropanoid, flavonoid, and choline biosynthesis. Being now based on standardized data formats and procedures for model reconstruction, bna572+ is available as a COBRA-compliant Systems Biology Markup Language (SBML) model and conforms to the Minimum Information Requested in the Annotation of Biochemical Models (MIRIAM) standards for annotation of external data resources. Bna572+ contains 966 genes, 671 reactions, and 666 metabolites distributed among 11 subcellular compartments. It is referenced to the Arabidopsis thaliana genome, with gene-protein-reaction (GPR) associations resolving subcellular localization. Detailed mass and charge balancing and confidence scoring were applied to all reactions. Using B. napus seed specific transcriptome data, expression was verified for 78% of bna572+ genes and 97% of reactions. Alongside bna572+ we also present a revised carbon centric model for ¹³C-Metabolic Flux Analysis (¹³C-MFA) with all its reactions being referenced to bna572+ based on linear projections. By integration of flux ratio constraints obtained from ¹³C-MFA and by elimination of infinite flux bounds around thermodynamically infeasible loops based on COBRA loopless methods, we demonstrate improvements in predictive power of Flux Variability Analysis (FVA). Using this combined approach we characterize the difference in metabolic flux of developing seeds of two B. napus genotypes contrasting in starch and oil content.

Introduction

Large-scale metabolic network reconstruction, modeling, and simulation is becoming increasingly important in plant systems biology and metabolic engineering (Schwender, 2011; Sweetlove and Ratcliffe, 2011; Collakova et al., 2012; Seaver et al., 2012; De Oliveira Dal'molin and Nielsen, 2013; Sweetlove et al., 2014). Metabolic networks for constraints-based metabolic modeling have been reconstructed for plants including Arabidopsis (Poolman et al., 2009; Williams et al., 2010; Dal'molin et al., 2011; Mintz-Oron et al., 2012; Cheung et al., 2013; Arnold and Nikoloski, 2014), barley (Grafahrend-Belau et al., 2009, 2013; Rolletschek et al., 2011), Chlamydomonas (Boyle and Morgan, 2009; Manichaikul et al., 2009; Chang et al., 2011; Dal'molin et al., 2011; Kliphuis et al., 2012), corn (Saha et al., 2011), oilseed rape (Schwender, 2008; Hay and Schwender, 2011a,b; Pilalis et al., 2011; Schwender and Hay, 2012; Borisjuk et al., 2013), rice (Lakshmanan et al., 2013; Poolman et al., 2013), C4 plants (De Oliveira Dal'molin et al., 2010b), and CAM plants (Cheung et al., 2014). Draft models have been generated for seven additional plants with sequenced genomes (Seaver et al., 2014). The methods framework being generally used has been summarized as COnstraint-Based Reconstruction and Analysis (COBRA) (Thiele and Palsson, 2010). In constraint-based (CB) analysis, networks are simulated using mathematical constraints related to mass conservation (reaction stoichiometry), thermodynamics (reaction directionality) and various physiological/experimental data.

A metabolic reconstruction of an organism can be understood as a highly structured representation of biochemical, genomic, and physiological information which allows derivation of a mathematical model useful for constraint-based analysis (Thiele and Palsson, 2010). Standard operating procedures that describe the reconstruction process in detail and standards for quality control (e.g., confidence scores to quantify the level of biological evidence for reactions) and model testing have been developed (Thiele and Palsson, 2010). Together with model features like mass and charge balanced reactions and definition of gene-protein-reaction (GPR) associations a high-quality standard in model reconstruction is defined.

A continuously growing extensive suite of COBRA methods have gained rapid acceptance, providing diverse tools for model refinement, modeling of genetic perturbations, or characterization of the solution space of large-scale models (Lewis et al., 2012). Since CB modeling often results in a solution space with multiple possible flux distributions, methods to narrow down the flux solution space are in demand (Reed, 2012). For example, experimentally determined metabolic flux ratios can be imposed as additional constraints for flux balance analysis methods (Choi et al., 2007; Mcanulty et al., 2012). Imposition of loop-law constraints (Schellenberger et al., 2011a; Noor et al., 2012) shrinks the solution space by removing solutions that contain thermodynamically infeasible loops.

In recent years different methods have been developed aiming at using gene expression data to derive tissue specific models from generic organism specific networks, or to improve the predictive power of FBA (Blazier and Papin, 2012; Lewis et al., 2012; Machado and Herrgard, 2014). Chang et al. (2011) validated the reaction network in a metabolic reconstruction of Chlamydomonas based on transcritomic data. Mintz-Oron et al. (2012) used proteomic data to generate tissue specific models from their Arabidopsis thaliana metabolic reconstruction. In order to explore to what extent changes in flux in central metabolism can be related to changes in gene expression, a number of studies integrated differential gene expression data with metabolic flux predictions (Akesson et al., 2004; Daran-Lapujade et al., 2004; Bordel et al., 2010).

In this paper we report the revision of a CB model of seed development in the oilseed crop Brassica napus, bna572 (Hay and Schwender, 2011b), hereafter referred to as bna572+. We integrate bna572+ with transcriptome data to improve confidence in the model's ability to represent a developing seed. Carrying on with a biomass-guided, bottom-up approach to reconstruction, bna572+ has additional metabolic subsystems and seed-relevant biomass components. Bna572+ features standardized annotations of external data resources (Le Novere et al., 2005; Juty et al., 2012), quality assurance and control (Thiele and Palsson, 2010), COBRA toolbox compatibility (Schellenberger et al., 2011b; Ebrahim et al., 2013), mass and charge balanced reactions as well as GPR associations that account for subcellular compartmentalization. Furthermore, we compare two B. napus genotypes differing in seed oil accumulation based on ¹³C-Metabolic Flux Analysis (¹³C-MFA) of cultured developing embryos and derive two genotype specific CB-models. Using ¹³C-MFA derived flux constraints and the COBRA loopless algorithm (Schellenberger et al., 2011a), the solution space could be substantially reduced.

Materials and Methods

In-vitro Embryo Culture and Biomass Composition Measurements

Plants of oilseed rape accessions 3170 and 3231 (both maintained by the IPK Genebank) were grown in 30 cm plastic pots in a general purpose soil mix (Pro Mix BX, Premier Horticulture Inc., Quakertown, PA) under constant conditions (15°C nights, 20°C days; 16 h day, 400 μE m⁻² s⁻¹). Developing embryos were dissected aseptically about 20 days after flowering and grown in a liquid medium at 20°C under continuous light (50 μmol m⁻² s⁻¹). Cultures were kept in tissue culture flasks with vented seal cap (CytoOne T75 #CC7682-4875, USAScientific, Ocala, FL, USA), containing 13 ml of liquid medium and four embryos per flask. The liquid growth medium contained 20% (w/v) polyethylene glycol 4000, sucrose (80 mM), glucose (40 mM), Gln (35 mM), and Ala (10 mM). In labeling experiments, sucrose and glucose were partly substituted with ¹³C-labeled analogs so that [1-¹³C]- and [U-¹³C₆]- mono and disaccharide hexose moieties are overall at 8.125 and 10 mol% of total hexose moieties, respectively (sucrose, 59 mM; [1-¹³C_fructosyl]-sucrose, 6.5 mM; [1-¹³C_glucosyl]-sucrose, 6.5 mM; [U-¹³C₁₂]-sucrose, 8 mM; glucose, 32.75 mM; [1-¹³C]-glucose, 3.25 mM; [U-¹³C₆]-glucose, 4 mM). ¹³C-labeled substrates were purchased from Omicron Biochemicals (http://www.omicronbio.com/). Inorganic nutrients were as described in Schwender and Ohlrogge (2002) with KNO₃ and NH₄NO₃ being omitted. pH was adjusted to 5.8 using KOH. Growth medium was sterilized using 0.22-μm sterile vacuum filter units (Stericup; Millipore). After 10 days of culture, embryos were harvested, rapidly rinsed with NaCl solution (0.33 M), and after determination of fresh weight embryos were frozen in liquid nitrogen and stored at −80°C.

The biomass composition of embryo material was determined after organic solvent extraction and liquid:liquid fractionation into a chloroform soluble (lipid), methanol/water soluble (polar), and insoluble cell polymer fraction as reported earlier (Lonien and Schwender, 2009). The resulting biomass composition data and biomass derived fluxes used for ¹³C-MFA are listed in Table S2A.

In addition, extraction and analysis of soluble metabolites were done according to previously described protocols (Rolletschek et al., 2011). In brief, frozen embryos were ground to a powder with steel beads and extracted with precooled (−20°C) methanol/chloroform/water. Acephate was added during extraction as an internal standard. After centrifugation the supernatant was filtered with centrifugal filters and distributed in three aliquots.

To determine metabolite concentrations, we applied hydrophilic interaction chromatography (HILIC) using the instrument Ultimate 3000 (Dionex, Idstein, Germany) coupled to triple quadrupole mass spectrometer (MS; ABSciex, Darmstadt, Germany). The amino propyl column (Luna NH2, 250 × 2 mm, particle size 5 μm, Phenomenex, Torrance, CA) was used with varying eluent gradients, adjusted to either negative or positive scan mode of the MS. Some metabolites were detected via ion chromatography (ICS-3000-System, Dionex, Idstein, Germany) coupled to the same MS in negative ion mode. All chromatographic conditions, MS settings and ion traces used for multiple reaction monitoring in the various detection modes were exactly as listed in Rolletschek et al. (2011). Compound identities were verified by mass and retention time match to authenticated standards (Sigma, Germany). External calibration was applied using authenticated standards to determine metabolite concentrations (nmol/g dry weight of tissue).

Among the 57 analyzed free metabolites quantified by LC/MS, sucrose, Gln, malate and citrate are the most abundant ones, accounting for 80% (w/dw) of the free metabolites. In order to account for fluxes into these four major free metabolites, the weight of the methanol/water soluble fraction was divided among those metabolites.

¹³C-Metabolic Flux Analysis

For both genotypes ¹³C-labeling signatures were measured by mass spectrometric analysis as outlined before (Lonien and Schwender, 2009). Altogether fractional ¹³C-enrichment was determined by GC/MS in 18 biomass derived metabolites (33 molecular fragments, 163 peaks). Particular amino acids (derived from hydrolysis of cell pellet fraction), glycerol and fatty acids (derived from the lipid fraction), and glucose (derived from starch in cell pellet fraction) were analyzed (Table S2F).

Flux analysis was performed using the 13CFLUX2 computational toolbox (Weitzel et al., 2013) using the Brookhaven Linux Cluster (Scientific Linux 4). The network of central metabolism is defined by 25 free exchange fluxes, 13 free net fluxes as well as 21 biomass effluxes (Tables S2B–E) which are derived from biomass composition of the different genotypes (Table S2A). In addition, proxy reactions (Lonien and Schwender, 2009; Williams et al., 2010) were used to allow the model to adjust for the effect of isotopic disequilibrium in labeled polymers caused by preexisting unlabeled biomass. This adds one additional free net flux (Table S2E, net equalities section in model file). In order to find the global optimum 500 simulations were run with random start values for the free fluxes. The theoretical χ²-value was calculated to test the goodness of fit. Based on the general model information, the number of degrees of freedom is 91 (163 modeled MS signals + 21 biomass flux measurements; less 33 molecular fragments, 35 free net fluxes and 25 free exchange fluxes). With a 90% confidence level, χ²-value should be less than 108.65 (χ²_90%,91 = 108.65) to pass the statistical test. From the 500 random start optimizations, 28 and 43 solutions had a sum of squared residuals smaller than the χ²-critical value for genotype 3231 and 3170, respectively. For each genotype, the final flux estimates were derived by averaging the 10 best fit flux sets. Then statistical uncertainty in the best fit flux values (Table S2F) was determined with a Monte Carlo approach, by 100 times random re-sampling of the mass spectrometric data and flux measurements (Gaussian distribution; 13CFlux2 function “perturb”). Each time a best fit was selected from 20 random start optimizations.

The resulting flux distributions for genotypes 3231 and 3170 were used to constrain the flux space in the large scale stoichiometric metabolic model (bna572+) by transferring 9 out of 13 possible degrees of freedom. In the following we shortly summarize the necessary considerations about the definition of flux states, precision of flux estimates and the treatment of possible incompatibilities between the models based on network stoichiometries. Table S2B and C define mappings between reactions of both models, which we used to transfer flux values. A flux solution in the ¹³C-model is defined by a total of 35 free net fluxes (Table S2H). Twenty one of these don't need to be transferred since they are biomass dependent fluxes, which are derived independently in bna572+ based on the same biomass composition data (see below under paragraph 2.7). One free net flux (“vAla_bm”) in the ¹³C-model, used for adjustment to isotopic disequilibrium by preexisting unlabeled biomass, is excluded as well since it is not relevant for bna572+. This leaves 13 free net fluxes that define a flux state transferrable to bna572+. However, the flux solutions obtained in ¹³C-MFA have limited accuracy and precision, dependent on network structure, substrate labeling, measurement configuration and measurement errors. We therefore inspected the covariance matrix of the free net flux data generated by Monte Carlo statistics. This indicated that most of the statistical uncertainty is given by vGPT, vPPT, and vG6PDHp. We therefore excluded these three weakly determined free fluxes, leaving 10 free fluxes to be transferred to bna572+ (vuptAla, vupt0, vuptGln, vRub_p, vPEPC_c, vPyr_cp, vICDH, vPyr_cm, vME_c, vME_p). Two of them (vMEc, vMEp) were relatively close to zero and were therefore defined as knock-out constraints in bna572+ (Table 2).

Finally, we recognized that transfer of flux estimates from the ¹³C-model to bna572+ can cause unbalanced flux states in bna572+, in particular if absolute flux values are to be transferred. For example, in the ¹³C-model the uptakes of Ala and Gln are not derived based on tracing nitrogen, but based on unlabeled amino acid substrates isotopically diluting intermediates derived from ¹³C-labeled sugar substrates. While nitrogen is not balanced in the ¹³C-model, it is comprehensively balanced in bna572+. By transfer of Ala and Gln uptake rates from the ¹³C-model even very small numerical deviations from the nitrogen balance would render the bna572+ stochiometry non-solvable. As shown by this example, transfer of absolute net fluxes was avoided by formulation of flux states in the ¹³C-model by using flux ratios (Table 2; Table S2H).

Bna572+: Metabolic Network Reconstruction Revision and Availability

For metabolic network reconstruction of bna572+, we developed an Excel-based metabolic knowledge database. The core of the reconstruction is available in Excel format (Tables S1A–C), along with associated worksheets (Table S1D–F), COBRA-compatible Systems Biology Markup Language (SBML) (File S1), a demonstration of how the core Excel content is translated into SBML using LibSBML (Bornstein et al., 2008) and Python (File S2), and a Matlab COBRA test script (File S3).

Update of Biochemical Reaction Equations

Biochemical reactions and metabolites data are in Table S1B,C. Notation for subcellular compartmentalization is defined as “no compartment” (“u”), “cell wall apoplast” (“a”), “cytosol” (“c”), “mitochondrial intermembrane space” (“i”), “plastid thylakoid lumen” (“l”), “mitochondrial matrix” (“m”), “inner mitochondrial membrane” (“n”), “plastidic stroma” (“p”), “plastid thylakoid membrane” (“t”), “peroxisome” (“x”), and “external environment” (“e”). Charge balancing was carried out according to compartment pH and the chemicalize.org database. The cytosol was assumed to be at pH 7.5 and plastid stroma and mitochondrial matrix compartments more alkaline (pH 8.0 and 7.8, respectively). Reaction and metabolite data was written to SBML using a Python script (File S2).

Revision of Gene-Protein-Reaction Associations

Gene-Protein-Reaction associations (GPRs) are detailed in Table S1A. Bna572 GPRs (Hay and Schwender, 2011b) were revised to account for subcellular compartmentalization. To do this, we used the SUBA3 database (Tanz et al., 2013). Reconstruction of multiprotein complexes is based on Aracyc (Mueller et al., 2003) and additional literature (Table S1F).

In Table S1A a numerical association scheme is used to relate genes to reactions. Each association is encoded by up to three numbers separated by periods (e.g., 1.1.1). The first number specifies the reaction, the second a complex, and the third a subunit. Genes with the same numerical association code are related to each other by “or” Boolean logic. If genes associate with the same complex (i.e., second number is the same), but different subunits (i.e., third number different), then an “and” association is made. The numerical associations were translated into COBRA-compatible GPR strings and written to SBML using a Python script (File S2).

To compare Arabidopsis genes encoding for central metabolism enzymes to B. napus orthologs, we obtained predicted protein sequences of the diploid progenitor species of B. napus, Brassica rapa, and Brassica oleracea from genomic databases (http://www.ocri-genomics.org/bolbase/, Liu et al., 2014; ftp://ftp.jgi-psf.org/pub/compgen/phytozome/v9.0/Brapa/assembly/, Wang et al., 2011). Altogether, 86777 predicted protein sequences from the B. rapa and B. oleracea genomes were combined into a BLAST searchable database.

Confidence Scoring and Integration of Gene Expression Data

Confidence scores were used to justify the inclusion of reactions in the reconstruction. The scoring system (Table S1E) is based on literature, external databases, and published B. napus omics data sets referenced to Arabidopsis gene identifiers. The default score for a reaction was zero. A score of “1” was used if the reaction is included for modeling and simulation purposes only. “2” was given at the level of physiological evidence, for example, a spontaneous non-enzymatic reaction and for B. napus chloroplast proteome (Demartini et al., 2011). “3” was used for biochemical evidence in Brassicaceae species other than B. napus using BRENDA (Schomburg et al., 2013), or for developing B. napus embryo transcriptome (Troncoso-Ponce et al., 2011) and B. napus embryo plastid proteome data (Demartini et al., 2011). “4” was the highest score awarded, and used for biochemical evidence (e.g., enzyme activity assayed) in B. napus according to the literature or BRENDA.

GPRs were evaluated using the gene-reaction association rules in the COBRA model structure (field name “rules”) and a true/false presence/absence call for gene expressed based on expression profiling in developing B. napus embryos (Demartini et al., 2011; Troncoso-Ponce et al., 2011).

Genotype-Specific Constraints-Based Metabolic Modeling

Metabolic models of B. napus genotypes 3170 and 3231 were generated from the Systems Biology Markup Language (SBML) encoding of bna572+ (version: bna20140218T082916.xml; File S1). The COBRA toolbox 2.0.5 (Schellenberger et al., 2011b) for MATLAB (version R2013a, http://www.mathworks.com), libSBML (version libSBML-5.8.0-libxml2-x86) was used for metabolic modeling. We include a test script (File S3) for simulation in MATLAB using COBRA. The reader is referred to a protocol (Hay and Schwender, 2014) for more details on using the model with COBRA.

Flux ratios were constrained according to flux estimates from the ¹³C-MFA models. Flux ratios defined in the ¹³C-MFA model are formulated using bna572+ flux names given by reaction projections from the ¹³C-MFA model to bna572+ (Table S2C). The flux ratio equations were added as rows to the stoichiometric matrix (S) and right-hand side (b), as has been carried out in other studies (Hay and Schwender, 2011a,b; Mcanulty et al., 2012). To accomplish this, we modified the COBRA function addRatioReaction (Thiele and Palsson, 2010) to handle complex flux summations (see function addFluxRatioReaction in File S3).

The biomass equations are expressed in terms of millimole demands for the synthesis of one gram of biomass. They were written according to the biomass composition determined for each genotype. RNA (rna[u]) and DNA (dna[u]) contents (0.1% w/w each), nucleic acid composition, and amino acid composition were modeled after bna572 (Hay and Schwender, 2011b). New compounds found in bna572+ include alpha-tocopherol (atoco[p]; isoprenoid biosynthesis), beta-sitosterol (ss[c]; sterol biosynthesis), sinapine (sincho[c]; phenylpropanoid and choline biosynthesis), and two kaempferol triglucosides (ktg[c] and sinktg[c]; flavonoid biosynthesis). The abundances of these minor components in seed biomass were estimated from the literature. Phenylpropanoids/flavonoids were accounted as small fractions of the insoluble cell wall fraction: ktg[c] 0.01% (w/w), sinktg[c] 0.04% (w/w) and sincho[c] 0.62% (w/w) (Fang et al., 2012). Small amounts of alpha-tocopherol and beta-sitosterol were attributed to the lipid fraction: atoco[p] 0.01% (w/w oil) (Goffman and Mollers, 2000); ss[c] 0.1% (w/w oil) (Gunstone, 2001). Note that the genotype-specific biomass equations (Table S1Q) are not found in the SBML, but instead are implemented in the COBRA MATLAB script (File S3).

Flux Variability Analysis

The COBRA toolbox 2.0.5 (Schellenberger et al., 2011b) and TOMLAB CPLEX solver package (version 12.3, http://tomopt.com) were used for metabolic simulation. Flux variability analysis (FVA), a two level optimization, was used to quantify minimum and maximum values for all reactions. Minimization of the molar sum of all substrate uptakes and light flux has been introduced before as the primary linear objective applicable to various simulated substrate combinations (Hay and Schwender, 2011b). This objective was developed as a rationalization of photoheterotrophic metabolism of cultured embryos growing with multiple possible organic substrates (Hay and Schwender, 2011a,b). In short, our objective function is to be applied in the primary optimization and in effect determines total carbon uptakes and net CO₂ release in agreement with an experimentally given global carbon balance. In this study the carbon balance is given with the ¹³C-MFA data (total carbon substrate uptakes relative to CO₂ efflux, ratio 7, Table 2). In secondary optimizations solutions that gave 100% of the optimal value were obtained. The COBRA setting for tolerance of optimal value accuracy (the COBRA variable CBT_LP_PARAMS.objTol) was applied globally at 10⁻¹⁰ μmol/h. FVA was implemented by setting the “allowLoops” option of the “fluxVariability” function to “false.” To improve the performance of loopless FVA, it was performed only on loop reactions. Loopless FVA sometimes took longer than usual (usually no more than one loop reaction per FVA), and the solver logged the message “Consider using CPLEX node files to reduce memory usage.” Minimum and maximum fluxes returned by COBRA's loopless FVA algorithm had to be adjusted because “fluxVariability” underestimates them by a factor of 1000 for some unknown reason (see e.g., line 0218 of the source code). For some reason the maximum flux of reactions #552–553 for flux ratio-constrained 3170 was sensitive to CBT_LP_PARAMS.objTol; for these optimizations the tolerance was increased to 10⁻⁹ μmol/h.

The variability type (Hay and Schwender, 2011b) was used to qualitatively characterize the flux intervals predicted with FVA. For classification of reactions according to variability types, absolute flux values ≤ 10⁻⁷ μmol/h and those > 100 μmol/h were judged zero and unbounded/infinite (loop reaction), respectively.

Projected flux variability analysis was implemented in COBRA according to previously reported methods using toolboxes for metabolic modeling (Hay and Schwender, 2014). The only difference is that “allowLoops” was set to “false” when using “optimizeCbModel.” This method was also used as an independent approach to verify the output of loopless FVA using “fluxVariability.”

By comparing flux variability analyses between genotypes, reactions were classified according to responsiveness type. Responsiveness type characterizes the qualitative difference in a flux interval at high oil relative to low oil (Schwender and Hay, 2012). Each symbolic responsiveness type is four-partite (composed of the low-oil genotype variability type, high-oil genotype variability type, minimum flux response, and maximum flux response). A flux is termed oil responsive if its magnitude increases with oil content. If a flux magnitude decreases with oil content it is termed starch responsive. In the responsiveness type, an “r” was used to denote flux directionality reversal. For classification of reactions according to responsiveness types, absolute flux differences > 10⁻⁷ μmol/h were judged significant.

Using GPR associations of bna572+ as a scaffold, FVA was integrated with transcriptome data as detailed in the companion paper (Schwender et al., 2014).

Topological Analysis, Mapping and Random Sampling of Metabolic Network

Correlated reaction subsets were determined by using “sampleCbModel” and “identifyCorrelSets” in the COBRA toolbox. Blocked reactions were tested for using min/max optimization of each flux in the unconstrained network. For random sampling of the genotype-specific models, the COBRA function “sampleCbModel” was used with default settings. Sampling was performed after constraining lower and upper flux bounds of all reactions in the model according to the optimal flux space defined by loopless FVA. For each sampled vector the projections of bna572+ reactions onto the ¹³C-MFA network (Table S2C) were used to derive a sampled set of ¹³C-MFA fluxes. The Euclidian distance between the ¹³C-MFA results and the projected sampled flux set was determined in each case.

For integration of FVA data with a reaction map we used COBRA “readCbMap” and “drawFluxVariability.” The map file is provided as a text file and svg (Files S4, S5). Loops were identified by computing type III extreme pathways using the Expa toolbox (Bell and Palsson, 2005). Input and output files for Expa are provided (File S6) as well as the python/COBRApy script used to generate them (File S7). A loop reaction was deemed reversible if it could carry loop flux in both directions.

Results

History of Bna572

Bna572, a large-scale stoichiometric model with 572 reactions, has been developed to model storage compound synthesis in developing seeds of B. napus, using linear optimization and integrating physiological inputs generated with in-vitro cultivation of developing embryos (Hay and Schwender, 2011a,b). Using bna572 we modeled seed composition by in silico exploration of a tradeoff between two biomass components (Schwender and Hay, 2012), and explored the tissue specific metabolic heterogeneity that can be observed in the embryo while developing in planta (Borisjuk et al., 2013). Bna572 works with commonly used toolboxes for metabolic modeling and simulation (Hay and Schwender, 2014). Here we report revision and extension of bna572 (Hay and Schwender, 2011b) according to current standards for genome-scale metabolic reconstructions (bna572+). Using two genotype accessions of B. napus that differ in seed composition (Table 1), we demonstrate integration of flux information obtained from ¹³C-Metabolic Flux Analysis (Table 2) with loopless FVA. Our workflow is delineated as a flowchart shown in Figure 1.

TABLE 1

Table 1. Biomass composition of B. napus genotypes, measured for in-vitro cultured embryos of the two accessions 3170 and 3231.

TABLE 2

Table 2. Flux ratio- and flux constraints used to define constraint-based models for genotypes 3170 and 3231.

FIGURE 1

Figure 1. Illustration of procedures for reconstruction of bna572+ and integration of ¹³C-MFA data.

Bna572+ Database and Revision of Bna572

Following standards from current protocols for generating Biochemically, Genetically, and Genomically (BiGG) structured knowledgebases (Schellenberger et al., 2010; Thiele and Palsson, 2010), we use an Excel workbook with three core worksheets containing information about genes (Table S1A), reactions (Table S1B), and metabolites (Table S1C). Each section contains external data resource annotations. Since a fully annotated genome of B. napus is not available, Arabidopsis is used as the reference genome. The genes worksheet (Table S1A) lists 962 Arabidopsis genes grouped based on associations with reactions. Definition of sub-cellular compartmentation of enzymes and reactions is documented as a decision process based on literature and mining the SUBA3 database (Tanz et al., 2013). The reactions worksheet includes notes on the heuristic decision rules (Hay and Schwender, 2011b) used to define reaction directionality.

Table 3 summarizes updates that have been made by comparing bna572 to bna572+, the version we report here. To distinguish between reconstruction improvements and expansion, the Table also includes information for just the first 572 reactions of the revision (bna572 revised). Improvements include new literature annotations, revised gene identifiers and EC numbers (Tables S1G,H), and revised/expanded subsystem assignments. Following MIRIAM guidelines for external resource annotation (Le Novere et al., 2005), we now exclusively use Pubmed identifiers, Digital Object Identifier, or International Standard Book Number (ISBN) for literature references. Metabolites have KEGG compound ID annotations and identifiers including CHEBI, Pubchem, Inchi, and Smiles. Metabolites have charges, and charged and neutral formulae are distinguished. To this end, we now organize the metabolite worksheet of the database (Table S1C) by compartment-specific metabolite pools. Using the COBRA toolbox, we evaluated reactions for mass and charge balancing. Because of the lack of careful stoichiometric balancing of water and protons, bna572 has 130 mass imbalanced reactions. The revised bna572 has only 8 mass imbalances, which remain for photons and storage compounds for which limited data are available.

TABLE 3

Table 3. Content statistics for the revision and expansion of bna572.

Reference genome

The use of Arabidopsis as a reference genome for gene annotation is based on the high similarity in protein sequences between Arabidopsis and Brassica (Cheng et al., 2012). To further substantiate this assumption we performed an exploration of orthologous relationships. We aligned all bna572+ associated predicted protein sequences against all nuclear encoded predicted protein sequences for B. rapa (Liu et al., 2014) and B. oleracea (Wang et al., 2011), the two genetic progenitor species of B. napus (Protein BLAST 2.2.27+, e-value cut-off of 10⁻⁵⁰; Altschul et al., 1997). Among the blast hits were 4102 unique Brassica protein sequences, which were in turn aligned against the complete A. thaliana proteome (Protein BLAST 2.2.27+), resulting in 762 bidirectional best hits. Ninety five percentage of these alignments had >80% identity in amino acid residues. Using TargetP (version 1.1), localization prediction was identical between tentative orthologs in 88% of all cases and for about 5% the difference in prediction was between mitochondrial and plastidic localization. We conclude that proteins encoding for central metabolism are highly conserved between B. napus and A. thaliana not only on a protein sequence level but also with regards to subcellular localization.

Revision of gene associations resulted in 100 of the original reactions in bna572 receiving gene associations, while for four reactions existing gene associations were removed based on compartment specific information. An example of revisions made is the compartmentation of the Pentose Phosphate Pathway (PPP). Bna572 features a complete cytosolic PPP (reactions 512–514; 548–551; Table S1B) in parallel to the plastidic one. Yet, it has been reported that in the A. thaliana genome only plastid isoforms of transketolase and transaldolase are known (Eicks et al., 2002). So far no cytosolic isoforms have been found in A. thaliana and recently transketolase (AT2G45290, AT3G60750) has been localized to the chloroplast based on in vivo protein tagging (YFP) (Rocha et al., 2014). In our revision the reactions TKT1_c, TKT2_c, and TALDO_c have no genes associated. Reviewing gene associations of other relevant reactions suggests that in the cytosol ribulose 5-phosphate, produced by oxidative decarboxylation of glucose 6-phosphate, can be converted to xylulose 5-phosphate (ribulose-phosphate 3-epimerase, RPE_c) and then transported into the plastid by the xylulose 5-phosphate translocator (Pi_p_X5P_c_anti) where it can be further metabolized by the plastidic PPP (Table S1B). The lack of cytosolic transketolase and transaldolase isoforms seems to be confirmed in the genomes of B. oleracea and Brassica rapa. We found six complete sequences highly similar to the Arabidopsis transketolase sequences (Bol030169, Bol037547, Bol045625, Bra004898, Bra007555, Bra014473). All of them include conserved transit peptide sequences that are predicted to localize to the chloroplast (TargetP).

New pathways

Taking a pathway-by-pathway approach to the reconstruction process, the model was expanded to 669 reactions, 458 metabolites (672 compartmented metabolite pools), and 962 genes (Table 3). Bna572+ has five new Brassicaceae-relevant components in the biomass and nine new subsystems, including isoprenoid (methylerythritol phosphate and mevalonate pathways), alpha tocopherol, sterol, phenylpropanoid, flavonoid, and choline biosynthesis. The new subsystems are presented as a COBRA reaction map (File S5).

To assess whether the revision could impact any of our formerly reported simulations, stoichiometry and reaction directionality of bna572 were compared against bna572+. Disregarding the balances of waters and protons (except for proton motive force) and the new biomass equations, the first 572 reactions of bna572+ are identical to bna572. In other words, no changes were made to the stoichiometric and thermodynamic directionality constraints of the first 572 reactions of bna572+.

During expansion of the model, a flavonoid biosynthesis subnetwork was added including the biosynthesis of 3′5′ hydroxylated flavones (myricetin, dihydromyricetin, pentahydroxyflavanone) by reactions 635, 637, 638, 639, 641 (Table S1B). Literature indicates that Arabidopsis and Brassica do not synthesize 3′5′-hydroxylated anthocyanins (Sheahan et al., 1998; Falginella et al., 2010; Fang et al., 2012), and genes encoding for flavanoid 3′,5′-hydroxylase (EC 1.14.13.88) appear to be absent from the Arabidopsis genome (Falginella et al., 2010) and from B. rapa (Guo et al., 2014). Note that therefore the synthesis of 3′5′ hydroxylated flavones was inactivated in the genotype specific model simulations (Table 4), which is a modification to be implemented after import of the SBML model file into COBRA. Likewise, reactions reported to exist and to be critical for seed development, but not yet included in the SBML, are added directly to the COBRA model in the script. These are cytosolic NADP-dependent malic enzyme (Wheeler et al., 2005) and plastidic NAD-malate dehydrogenase (Scheibe, 2004), which has recently been found to be essential for embryo viability in Arabidopsis during seed development (Beeler et al., 2014; Selinski et al., 2014). Both of these reactions are expressed in B. napus developing seeds according to transcriptome data (Troncoso-Ponce et al., 2011).

TABLE 4

Table 4. Definition of genotype-specific metabolic models derived from bna572+.

Confidence scores

Confidence scores were derived to quantify the level of biological evidence for reactions in the network (Tables S1B,E). The scores include information e.g., on evidence by biochemical characterization of the enzyme complement in isolated plastids of developing rapeseed embryos as well as evaluate reactions and GPR associations against multi-omics datasets, databases, and other organism-specific literature (see Materials and Methods). The scores given ranged between zero and four. Eighty percentage of all reactions had a score of at least 3.

Availability of Bna572+ reconstruction and models

Bna572+ is under continuous development and the version presented here and being used in simulations is dated February 18, 2014 (see model ID in File S1). Bna572+ is available in COBRA-compatible SBML (Hucka et al., 2003). MIRIAM Uniform Resource Identifiers (identifiers.org) (Juty et al., 2012) were used to define annotations. For practical usability, we include a test script (File S3) for the COBRA toolbox (Schellenberger et al., 2011b). Prior to use, the user needs only to install MATLAB and LibSBML (Bornstein et al., 2008). We also provide a Python script which can be used to translate the database's Excel worksheet into SMBL (File S2) for use with alternative programs such as COBRApy (Ebrahim et al., 2013).

Analysis of Metabolic Network Structure

To assess the structure of the revised metabolic network reconstruction, we quantitatively analyzed some topological features. First, we assessed the degree of continuity of internal reactions with respect to inputs and outputs. In the unconstrained metabolic network, every reaction can carry flux at steady-state, i.e., there are not any blocked reactions (Table 3). This result reflects the fact that our pathway-by-pathway approach to reconstruction intentionally prevents dead ends in the metabolic network. Next, to analyze enzyme subsets, we randomly sampled the unconstrained network and identified correlated reactions sets, i.e., groups of reactions that always operate in fixed flux proportions. Using this approach, we identified 73 reaction subsets (Table S1B) ranging between 2 and 183 reactions large. Subset 1 is the largest and contains all the reactions going to biomass synthesis. Finally, we determined the loop topology of bna572+ (Figure 2, Table S1I). For these computations, instead of the unconstrained network, we used the photoheterotrophic-specific constraints with fixed light exchange flux (Table 4). Thirty nine reactions participate in loops (Figure 2B), i.e., thermodynamically infeasible internal cycles that result in unrealistic flux predictions. Extreme pathway analysis with Expa (Bell and Palsson, 2005) enumerated 51 loops (type III pathways) between 4 and 31 loop reactions in length (Figure 2A). Nine of these loops are reversible, i.e., they can carry flux in both directions. Loop reactions have membership with 15–43 loops. Due to these cycles the solution space was in part unbounded in FVA simulations (Table S1J). The loop reactions mostly belong to highly compartmentalized central intermediary metabolism. Mainly reactions of the subsystem categories of glycolysis, of citric acid cycle and carboxylic acid metabolism and intracellular transport reactions connecting the compartments cytosol, plastid, and mitochondrium are present. Hence, loops are a major problem predicting the feasible flux space in central carbon metabolism, motivating the use of the loopless algorithm.

FIGURE 2

Figure 2. Loop topology of bna572+. Distribution of loop size (A) and loop membership (B).

The loopless algorithm involves mixed integer linear programming which can be computationally demanding. Therefore, we evaluated the computational performance of the loopless algorithm (Table S1M). The time to complete loopless FVA across all network reactions was 19–27 min. In contrast, FVA without the loopless algorithm took less than 1 min. Hence, loopless FVA takes significantly longer to compute. This is especially true for reactions with loop bounds, which took 6–10 times longer to calculate (Table S1M). Importantly, since loopless FVA offers no advantage for non-loop reactions, it makes sense in terms of computation time, to limit the algorithm to loop reactions. This technique is particularly useful given that the solver can get stuck, even on non-loop reactions (Table S1N, genotype 3170 no ratios). Bypassing non-loop reactions, we could bring the computation time for loopless FVA down to 7–9 min.

Analysis of New Subsystems (Reaction 572+)

To characterize the predicted metabolic flux for the new subsystems of bna572+, we compared the newly reconstructed pathways and mapped flux variability analysis (File S5). Analysis of alternative biosynthetic pathways indicated which routes are predicted to carry flux with respect to the optimal flux space. For the vitamin E subsystem, tyrosine or phenylalanine contribute to the formation of the polar chromanol ring of alpha-tocopherol. For the isoprenoid pathways, the 5-C isoprenoid precursor isopentenyl diphosphate is produced by the non-mevalonate pathway, while the mevalonate pathway is predicted to not be used. Lastly, sinapate is predicted to be synthesized via conventional free phenylpropanoid acid biosynthesis, not the alternative route via shikimate O-hydroxycinnamoyltransferase (HCT) (#661). Since we showed that none of these reactions are blocked (Table 3), our analysis suggests that the unused pathways do not carry flux because they are suboptimal relative to alternative routes.

Verification of Gene Expression

To verify the expression of genes in bna572+, we integrated developing embryo-specific gene expression data. Of particular value are 11464 Arabidopsis gene orthologs reported to be expressed in developing B. napus embryos (Troncoso-Ponce et al., 2011). Using this data set, we can validate the expression of 744 of the 966 genes in bna572+ (77%). By using proteome data of embryo plastids (Demartini et al., 2011), 10 more genes in bna572+ are validated. Leveraging the Boolean gene rule functionality of in total 545 GPR's (GPR, Table S1B) with the COBRA toolbox, we evaluated the reported presence/absence of transcripts and proteins (Table S1B). Based on transcriptome data, 523 (96%) of gene-associated reactions are present. If we also include the proteome data, this number increases to 526 (97%), i.e., 19 reactions with GPR lack support by gene expression. While the lack of expression diminishes to the overall confidence score, the 19 reactions for which gene expression was not found remained active in all model simulations reported—anticipating future improvements of data quality and methods. Closer inspection revealed that eight active reactions that were judged “non-present” were protein complexes consisting of seven or more different subunits (Sdh_m, FQR_p, NADH_m, ATP_p, PSI_p, PSII_p, PET_p). By Boolean logic defined in the GPR associations (AND), the lack of expression of only one subunit will cause an entire reaction to be called non-present. Also, expression of asparaginase (Asnase_c) and of phosphoglycolate phosphatase (PGLP_p), a step in photorespiration, was lacking. Missing expression of these reactions does not limit the confidence in the flux results since neither reaction is used in any of our simulations (Table S1J). For 8 additional reactions among Trp, Cys, purine, and phenylpropanoid synthesis (FOL_p, hfasynth_c, kdgsynth_c, narsynth_c, PAI_p, PUR_p, SAT_p, sinaldsynth_c) as well as two steps of the methylerythritol phosphate pathway (MEPCT_p, CMPkin_p) lack of expression might be more crucial.

¹³C-Metabolic Flux Analysis and Mapping Bna572+ to a ¹³C-Model

Embryos of two B. napus accessions 3231 and 3170 were grown photoheterotrophically in culture under identical conditions with ¹³C-labeled glucose and sucrose as tracers for ¹³C-MFA (see Materials and Methods). Table S2B summarizes the network definition in relation to bna572+. In the ¹³C-MFA network, reactions are derived mainly by lumping successive or parallel reactions that can be represented by a common transition of carbon atoms and by lumping of metabolite pools (Schwender, 2011). The mapping relations are formalized in Table S2C. A ¹³C-MFA model with 128 reactions, comprising 647 carbon transitions in central metabolism was modeled (Materials and Methods). From the results of flux parameter fitting for the two genotypes, flux differed significantly between the genotypes for 13 reactions (Table S2G). Figure 3 gives an overview. As to be expected for the genotype higher in oil (3170), flux into lipid synthesis is increased. The increased demand in biosynthetic precursors is met by increased flux of plastidic pyruvate kinase (vKPp), pyruvate dehydrogenase (vPDHp), and efflux of triose-phosphate (synthesis of glycerol). The uptake of sugar substrate is increased as well, indicating an increased sugar to lipid conversion. An increase in citrate synthase (vCS) and ATP:citrate lyase (vACL) flux in accession 3170 can be explained by higher demands for cytosolic fatty acid elongation.

FIGURE 3

Figure 3. Flux map for accession 3231 (¹³C-MFA). Flux distribution map showing glycolysis, pentosephosphate pathway, the TCA cycle and biosynthetic effluxes into protein, lipid and free soluble metabolites for accession 3231. Arrow thickness indicates carbon flux. Significant higher (lower) values in accession 3170 are marked in blue (red). Gray arrows: high statistical uncertainty (SD > 50% of flux value). Abbreviations: Metabolites (subscripts “p,” chloroplast; “c,” cytosol): AcCoA, acetyl-Coenzyme A; aKIV, 2-keto-isovalerate; C1, 5,10-methylene- or 5-formyl -tetrahydrofolate; Cit, citrate; GOX, glyoxylate; Hex, hexose, representing free hexoses and sucrose; HP, hexose phosphate; Icit, isocitrate; KG, ketoglutarate; OAA, lumped pool of subcellular pools of oxaloacetate and malate; PEP, phosphoenol pyruvate; PGA, 3-phospho glycerate; Pyr, pyruvate; RP, ribose 5-phosphate; RuP, Ribulose 5-phosphate; Succ, succinate; TP, triose phosphate (dihydroxyacetone phosphate, glyceraldehyde 3-phosphate); XP, xylulose 5-phosphate. Reactions: vACL, ATP:citrate lyase; vCS, citrate synthase; vFM, fumarase, malate dehydrogenase; vICDH, isocitrate dehydrogenase; vPDH_p, plastidic pyruvate dehydrogenase; vPK_p, plastidic pyruvate kinase; vRub_p, ribulose bisphosphate carboxylase.

Next we determined flux ratio- and flux constraints to be applied to the large scale model. With the initial model configuration the flux state of free fluxes is defined by three free uptake fluxes and ten inner free fluxes. This configuration was replaced by one with seven flux ratios and five free net fluxes (vME_c, vME_p, vPPT, vG6PDH_p, vAla_bm) (Table S2H). Two free fluxes (vME_c, vME_p) were close to zero in both genotypes and thus the equivalent reactions in bna572+ (ME_c, ME_p) were constrained to zero. All resulting constraints are listed in Table 2.

Genotype-Specific Simulations of Bna572+

The biomass composition of the two B. napus accessions 3231 and 3170 differed principally in two components. Accession 3170 is 10% (w/dw) higher in lipid content and 3231 is by 6% higher in starch content (Table 1). These differences in biomass composition were used to derive genotype specific models from bna572+. Transformation of the bna572+ reconstruction into a metabolic model requires addition of physiological constraints, which include biomass composition-derived fluxes, biomass accumulation rate, fixed maintenance respiration rate, and inactivation of exchanges for external metabolites not available in the medium. Details on constraints imposed on bna572+ when simulated under COBRA can be found in Table 4. As before, flux balance analysis methods were carried out with photoheterotrophic constraints under the primary objective of minimization of total molar substrate uptakes (Table 4, Hay and Schwender, 2011b). Essentially, the primary optimization adjusts the model to a carbon balance that is derived from the ¹³C-tracer experiments performed under photoheterotrophic conditions (Table 2, r7). The light flux affects the total carbon balance since photosynthetic electron transport is a source for carbon reduction. The primary optimization serves to adjust the light flux accordingly. The light fluxes hereby obtained for both accessions (Table 4) are similar to the one formerly obtained based on constraining the model with independently measured carbon balances (Hay and Schwender, 2011a). In addition to the carbon balance constraint, we applied six genotype specific flux ratio- and two reaction knock-out constraints (Table 2), based on the flux distribution in central carbon metabolism as inferred by ¹³C-MFA (Table S2G). Flux states resulting from these model configurations were then assessed to characterize the effect of ¹³C-flux and flux ratio constraints on the size of the optimal flux space. First we assessed the variability type distribution. FVA produces bounds for all reactions which can be categorized according to being “variable” (difference between lower and upper bound), “non-variable” (both bounds are equal and non-zero) and “never used” (both bounds are zero) (Hay and Schwender, 2011b). One might expect that addition of ¹³C-constraints reduces the flux space by reducing the number of variable reactions. Remarkably, for both accessions the total number of 89 variable reactions is unchanging, with or without ¹³C-flux ratio constraints (Table S1K). Instead, reduction of flux space can be seen when assessing the total range of flux variability (sum of size of all flux intervals across bna572+). Accordingly, presence of ¹³C-derived flux constraints reduce the loopless solution space by 53 and 55% for accessions 3231 and 3170, respectively (Table S1M). We also applied random sampling of loopless flux states within the boundaries given by the flux variability intervals (Table S1O). In particular, the random sampling was done in projection of bna572+ onto the ¹³C-MFA network. For each of the sampled states, the flux ratios originally derived from 13C-MFA were reproduced (Table S1O). This verifies the computational process in-between the ¹³C-MFA and bna572+. In addition, the Euclidian distance between the random sampled flux states and the original ¹³C-MFA derived flux states reduces substantially by imposing the ¹³C-derived constraints (Figure 4). This again demonstrates the reduction in flux space. Closer inspection of FVA projections onto the ¹³C-MFA network shows that with ¹³C-MFA constraints 15 reactions have flux variability (Table S1L). The 15 reactions are part of the glycolysis and PPP reactions present in parallel in the cytosol and in the plastid compartment. These represent free fluxes in the ¹³C-MFA model that had relatively high statistical uncertainty and therefore were not transferred (see Materials and Methods).

FIGURE 4

Figure 4. Random sampling of loopless solutions showing the effect of flux- and flux ratio constraints. Genotype 3170. Euclidean distance between random sampled flux vectors, projected onto the ¹³C-MFA model, and ¹³C-MFA flux estimates.

Differential Flux Variability and Reaction Responsiveness in Brassica Napus Genotypes

To better understand the genotype-specific differences in carbon allocation to embryo biomass during development, we compared the metabolic phenotypes of B. napus accessions 3231 and 3170 predicted with FVA. The change in predicted flux between the high oil genotype 3170 and 3231 was qualitatively characterized by assigning each reaction a responsiveness type (Table S1J), the frequencies of which are tabulated in Table S1K and summarized in Table 5. Applying the loopless constraints, there are 27 more responsiveness predictions than without. This is because multiple reactions participate in loops in both genotype models and therefore have undefined (infinite) flux bounds, which become identifiable with added loopless constraints. The reactions that become identifiable are located mainly in central metabolism. Overall, with the loopless algorithm and ¹³C-MFA derived constraints, 351 oil- and 41 starch-responsive reactions were identified (Table 5). Via GPR associations, this information can be integrated with gene expression data, as explored in our companion paper (Schwender et al., 2014).

TABLE 5

Table 5. Distribution of predicted reaction responses to explain the difference between high oil/low starch genotype 3170 and low oil/high starch genotype 3231.

We further characterized the metabolic phenotypes by subsystems (Table S1P). Lipid biosynthetic pathways increase in flux in the high oil genotype, as indicated by their high enrichment for oil-responsiveness. The same was true for the light reactions of photosynthesis, Calvin cycle, Rubisco shunt, oxidative pentose phosphate pathway (OPPP), and various amino acid biosyntheses (the high-oil phenotype is also slightly higher in protein; Table 1). Unexpectedly, in a portion of the beta-oxidation subsystem flux was up upregulated in the high oil genotype (#rxns 413–437, 466–468, 519–20). This observed pattern did not involve lipase (#133), but constitutes a futile cycle of fatty acid synthesis and degradation of relatively small magnitude, which must be suboptimal. A different set of subsystems was highly enriched for starch-responsiveness. Up-regulated reactions predicted in the high-starch genotype include starch biosynthesis, pyrophosphate metabolism, and oxidative phosphorylation. Glycolysis and the citric acid cycle were divided between starch and oil responsiveness, with mitochondrial citrate synthase (#37) and pyruvate dehydrogenase (#541–543) correlated with starch, and the plastidic isoform of pyruvate dehydrogenase (#480–482) being oil-responsive. These relationships are further explored in our companion paper (Schwender et al., 2014).

Limitations

We have demonstrated here how the solution space in the large scale model bna572+ can be further constrained based on experimental data, namely ¹³C-MFA derived fluxes, together with elimination of unboundedness based on the loopless-COBRA algorithm. However, interpretation in particular of qualitative flux variability types and of flux responsiveness has to be made carefully, bearing in mind that small differences might not be statistically significant. While the ¹³C-MFA model results in flux estimates with statistical uncertainty measures, the constraint-based approach produces flux estimates without statistics. We address this problem in part by not including in the CB model any flux estimates from MFA that are statistically poorly defined. Yet, the biomass composition data for the two compared B. napus accessions were defined in the CB-model using the commonly used biomass equation - without statistical measures. Between the two accessions 3170 and 3231 there is a very small difference in protein content (Table 1). On the level of ¹³C-Flux analysis, this difference does translate into statistically insignificant differences in amino acid and protein biosynthesis fluxes, while in the CB-model, the same small differences are judged “TAG responsive” by the criteria used to determine quantitative responsiveness. Furthermore, some pattern of pathway usage being observed in response to imposition of ¹³C-constraints onto genotype specific bna572+ cannot easily be explained. Imposing genotype 3170 derived ¹³C-constraints resulted in a small scale energetically wasteful futile cycle of fatty acid synthesis and degradation. This is unexpected since the ¹³C-MFA model does not contain beta-oxidation. Altogether this means that interpretation of the small flux responses should be done with caution and a comprehensive statistical treatment of CB-predictions is warranted.

Discussion

Relevance of Bna572+

Bna572+ extends the previous predictive model of B. napus seed development, bna572 (Hay and Schwender, 2011a,b; Schwender and Hay, 2012; Borisjuk et al., 2013). Lacking a complete and fully annotated B. napus genome, our model is referenced to the Arabidopsis genome. Such hybrid models that are indirectly derived by orthology to Arabidopsis are not uncommon, including models of C4 and CAM plants (De Oliveira Dal'molin et al., 2010b; Cheung et al., 2014). In our case, using the Arabidopsis genome as a proxy is highly justified given the close relationship between B. napus and Arabidopsis within the Brassicaceae (Al-Shehbaz et al., 2006). In an accompanying paper (Schwender et al., 2014) we explore how, using the Arabidopsis genome as a proxy, B. napus hiseq transcriptome sequencing data can be integrated with bna572+.

Bna572+ is relatively small compared to reconstructions of Arabidopsis which were derived in a top-down manner starting with a set of genome predicted metabolic genes (Table 6). Given the complex subcellular compartmentation of metabolism in plants, model reconstruction aimed at comprehensive representation of genome encoded reactome is not necessarily the only meaningful strategy for network reconstruction, in particular since even the Arabidopsis genome is largely underexplored in terms of transport proteins in primary and secondary metabolism. It appears that in genome-scale Arabidopsis reconstructions a high percentage of reactions is associated with the cytosol (Table 6). This indicates that compartmentation of pathways tends to be a major challenge in model reconstruction, since the cytosol is typically the default localization in first draft reconstructions derived from uncompartmentalized pathway databases. An exeption to this retention of cytosolic localization is seen in the Mintz-Oron model (Table 6). However, in this case, all intracellular transport reactions are entirely computationally inferred based on a principle similar to gap filling (Mintz-Oron et al., 2012), which might be unrealistic. Here we favor an expandable bottom-up reconstruction process that is guided by adding metabolic functions according to major biomass components relevant to the studied tissue and by emphasizing compartmentation over comprehensiveness. Compartmentation of specific pathways can often be defined from original expert domain literature, which often implies which metabolites might be transported across membranes without necessarily identifying specific transporter proteins. Altogether, bna572+ is a valuable resource for computational and experimental scientists in the Brassicacea community and foreseeable community-driven efforts for Arabidopsis consensus models.

TABLE 6

Table 6. Basic characteristics of Arabidospis thaliana pathway databases and metabolic reconstructions.

Importance of Standardization

According to published guidelines (Thiele and Palsson, 2010), bottom-up genome-scale reconstruction is a laborious task that involves labor and time intensive manual curation of multi-omics data used in systems biology. Various software platforms exist to automate initial steps in the reconstruction process (Hamilton and Reed, 2014). For example, the SuBliMinaL (Swainston et al., 2011) toolbox of path2models (Buchel et al., 2013) in the BioModels database outputs a draft reconstruction of 1672 reactions and 2032 genes for Arabidopsis, leaving the arduous task of model refinement to the reconstructionist. Other platforms have been designed for model building and curation. For example, the rBioNet extension for COBRA offers a convenient graphical user interface for quality control and assurance (Thorleifsson and Thiele, 2011). While clearly there are many options for the reconstructionist, the workflow we report here offers the familiarity of Excel spreadsheets and hence knowledge database flexibility. It is along the lines of the xls2model function that was developed for COBRA (Schellenberger et al., 2011b). The difference is that our platform requires the user to write SBML as a model intermediate in a Python scripting environment. Importantly, COBRA-compliant SBML, of the kind we provide here (File S1), is the mainstay of metabolic systems biology by making predictive metabolic models publishable and distributable for actual use.

With the COBRA standard, a wealth of methods is now applicable to bna572+ for systems-level analysis and metabolic engineering. For example, gene deletions in the reaction network could be characterized as is commonly done with microbes (Monk and Palsson, 2014), or OptKnock (Burgard et al., 2003) could be used to predict non-intuitive knockout strategies for the redesign of biomass composition.

Tissue Specific vs. Genome Scale Model

Genome-scale models of higher organisms typically represent global metabolic reconstructions. To simulate cell-type or tissue specific metabolism, models are being integrated with information on tissue specific gene expression (Machado and Herrgard, 2014). For example, proteome data have been used to generate tissue specific models from an Arabidopsis genome scale model (Mintz-Oron et al., 2012). Bna572+ was derived as a bottom-up reconstruction already with a developing seed in mind, but now also allows integration of transcriptomic data. This means transcriptomic data can be used to validate the model or to further constrain the flux space. A next step would be to identify which of the several mathematical methods being available (Machado and Herrgard, 2014) might be best suited for our specific system.

Shrinking the Solution Space Using ¹³C-Constraints and Loopless Algorithm

Here we integrated ¹³C-MFA and CB-modeling. ¹³C-MFA results in empirical flux data based on the use of ¹³C-tracers, while CB-modeling is a rather predictive modeling approach. A principal problem commonly encountered for CB-modeling is that of the missing of constraints (Reed, 2012) and the ambiguity caused by prediction of multiple alternative flux distributions. This applies in particular to highly compartmentalized plant cells. The boundaries of this solution space can be explored by FVA (Mahadevan and Schilling, 2003) or by random sampling (Wiback et al., 2004). If objective functions are used that give unique solutions, internal predicted flux states can differ substantially from ¹³C-MFA-derived empirical flux data. In order to improve the predictive power of CB-modeling a variety of approaches have been developed in trying to further narrow down the most likely biologically relevant flux solutions (Park et al., 2010; Lerman et al., 2012; Reed, 2012). In plants, alternative solutions and ambiguity have been addressed for example by the choice of objective function(s) (Cheung et al., 2013), integration of gene expression data (Topfer et al., 2013), FVA (Hay and Schwender, 2011b) and simplification by network projection (Hay and Schwender, 2011a). Another resource for refining flux predictions based on experimental data is the integration of ¹³C-MFA data. Flux ratio constraints have been used to better predict experimentally determined fluxes (Hay and Schwender, 2011b; Cheung et al., 2013). However, only a few ratios were used in these studies. In this paper, most of the information about the flux determination in ¹³C-MFA has been transferred to the large scale model via seven flux ratios and two flux constraints (Table 2). As a result, the predictive potential of our model increased substantially (Figure 4).

Another major problem with FVA is the prediction of flux states involving net flux around a closed cycle of internal reactions. In this study, the predictive power of bna572+ was severely challenged by such thermodynamically infeasible loop flux (Figure 2, Table 5). The loopless COBRA algorithm (Schellenberger et al., 2011a) proved to be a good solution to eliminate these problematic cycles because, in contrast to other methods (Muller and Bockmayr, 2013), it did not require any information about metabolite concentrations or thermodynamic parameters. Surprisingly, so far there has been limited use of loopless COBRA in plant systems biology (Mintz-Oron et al., 2012), even though it is implemented in its namesake toolbox. As shown by the genotype response of FVA reported here, we expect the loopless constraints to improve the predictive power of in silico biomass component tradeoffs (Schwender and Hay, 2012).

Comparison of Two Genotypes

In this study we compared two B. napus accessions which differed mainly in lipid and starch content of in-vitro cultivated embryos. By using constraints derived from the flux estimation by ¹³C-MFA for two genotype specific FVA models, we expanded on the concept of in silico biomass component tradeoffs (Schwender and Hay, 2012) and could show that the flux solution space is being reduced. Starch biosynthesis responsiveness was predicted for the low-oil genotype as expected given the biomass composition. Likewise, for the high-oil genotype 3170, increase in flux through fatty acid synthesis, elongation, desaturation, triacylglyerol synthesis, and amino acid/protein biosynthesis were predicted. A few of the reactions of these subsystems can be validated with correlations of gene expression (Table 4 in Schwender et al., 2014).

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Acknowledgments

The authors like to thank the two reviewers for their helpful comments. We acknowledge funding by Bayer CropScience NV to Inga Hebbelmann, Nicolas Heinzel and Hardy Rolletschek for experimental work. This material (plant culture, in vitro embryo culture, biochemical analysis, ¹³C-Metabolic Flux Analysis, computational modeling of metabolism, data analysis and interpretation that was performed by Jorg Schwender, Jordan O. Hay, Hai Shi and Inga Hebbelmann) is based on work supported by the U.S. Department of Energy, Office of Science, Office of Basic Energy Sciences, Chemical Sciences, Geosciences, and Biosciences Division under contract number DEAC0298CH10886. Hardy Rolletschek gratefully acknowledges financial support by the Deutsche Forschungsgemeinschaft (BO-1917/4-1). Jordan O. Hay received a HHMI Postdoctoral Fellowship for work with Brookhaven National Laboratory's Office of Educational Programs, which further contributed to development of constraint-based modeling. We thank Griffin St. Clair, Biomedical Engineering Department, State University of New York at Stony Brook, Stony Brook, NY 11794, Graham Preston (Ward Melville High School, NY), and Dane Limjoco (Sachem High School East, NY) for their contributions to model extension during summer internships.

Supplementary Material

The Supplementary Material for this article can be found online at: http://journal.frontiersin.org/Journal/10.3389/fpls.2014.00724/abstract

References

Akesson, M., Forster, J., and Nielsen, J. (2004). Integration of gene expression data into genome-scale metabolic models. Metab. Eng. 6, 285–293. doi: 10.1016/j.ymben.2003.12.002

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Al-Shehbaz, I., Beilstein, M., and Kellogg, E. (2006). Systematics and phylogeny of the Brassicaceae (Cruciferae): an overview. Plant Syst. Evol. 259, 89–120. doi: 10.1007/s00606-006-0415-z

CrossRef Full Text | Google Scholar

Altschul, S. F., Madden, T. L., Schaffer, A. A., Zhang, J., Zhang, Z., Miller, W., et al. (1997). Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25, 3389–3402. doi: 10.1093/nar/25.17.3389

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Arnold, A., and Nikoloski, Z. (2014). Bottom-up metabolic reconstruction of Arabidopsis and its application to determining the metabolic costs of enzyme production. Plant Physiol. 165, 1380–1391. doi: 10.1104/pp.114.235358

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Beeler, S., Liu, H. C., Stadler, M., Schreier, T., Eicke, S., Lue, W. L., et al. (2014). Plastidial NAD-dependent malate dehydrogenase is critical for embryo development and heterotrophic metabolism in Arabidopsis. Plant Physiol. 164, 1175–1190. doi: 10.1104/pp.113.233866

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Bell, S. L., and Palsson, B. O. (2005). Expa: a program for calculating extreme pathways in biochemical reaction networks. Bioinformatics 21, 1739–1740. doi: 10.1093/bioinformatics/bti228

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Blazier, A. S., and Papin, J. A. (2012). Integration of expression data in genome-scale metabolic network reconstructions. Front. Physiol. 3, 299. doi: 10.3389/fphys.2012.00299

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Bordel, S., Agren, R., and Nielsen, J. (2010). Sampling the solution space in genome-scale metabolic networks reveals transcriptional regulation in key enzymes. PLoS Comput. Biol. 6:e1000859. doi: 10.1371/journal.pcbi.1000859

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Borisjuk, L., Neuberger, T., Schwender, J., Heinzel, N., Sunderhaus, S., Fuchs, J., et al. (2013). Seed architecture shapes embryo metabolism in oilseed rape. Plant Cell 25, 1625–1640. doi: 10.1105/tpc.113.111740

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Bornstein, B. J., Keating, S. M., Jouraku, A., and Hucka, M. (2008). LibSBML: an API library for SBML. Bioinformatics 24, 880–881. doi: 10.1093/bioinformatics/btn051

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Boyle, N. R., and Morgan, J. A. (2009). Flux balance analysis of primary metabolism in Chlamydomonas reinhardtii. BMC Syst. Biol. 3:4. doi: 10.1186/1752-0509-3-4

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Buchel, F., Rodriguez, N., Swainston, N., Wrzodek, C., Czauderna, T., Keller, R., et al. (2013). Path2Models: large-scale generation of computational models from biochemical pathway maps. BMC Syst. Biol. 7:116. doi: 10.1186/1752-0509-7-116

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Burgard, A. P., Pharkya, P., and Maranas, C. D. (2003). Optknock: a bilevel programming framework for identifying gene knockout strategies for microbial strain optimization. Biotechnol. Bioeng. 84, 647–657. doi: 10.1002/bit.10803

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Chang, R. L., Ghamsari, L., Manichaikul, A., Hom, E. F., Balaji, S., Fu, W., et al. (2011). Metabolic network reconstruction of Chlamydomonas offers insight into light-driven algal metabolism. Mol. Syst. Biol. 7, 518. doi: 10.1038/msb.2011.52

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Cheng, F., Wu, J., Fang, L., and Wang, X. (2012). Syntenic gene analysis between Brassica rapa and other Brassicaceae species. Front. Plant. Sci. 3:198. doi: 10.3389/fpls.2012.00198

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Cheung, C. Y., Poolman, M. G., Fell, D. A., Ratcliffe, R. G., and Sweetlove, L. J. (2014). A Diel flux balance model captures interactions between light and dark metabolism during day–night cycles in C3 and crassulacean acid metabolism leaves. Plant Physiol. 165, 917–929. doi: 10.1104/pp.113.234468

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Cheung, C. Y., Williams, T. C., Poolman, M. G., Fell, D. A., Ratcliffe, R. G., and Sweetlove, L. J. (2013). A method for accounting for maintenance costs in flux balance analysis improves the prediction of plant cell metabolic phenotypes under stress conditions. Plant J. 75, 1050–1061. doi: 10.1111/tpj.12252

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Choi, H. S., Kim, T. Y., Lee, D. Y., and Lee, S. Y. (2007). Incorporating metabolic flux ratios into constraint-based flux analysis by using artificial metabolites and converging ratio determinants. J. Biotechnol. 129, 696–705. doi: 10.1016/j.jbiotec.2007.02.026

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Collakova, E., Yen, J. Y., and Senger, R. S. (2012). Are we ready for genome-scale modeling in plants? Plant Sci. 191–192, 53–70. doi: 10.1016/j.plantsci.2012.04.010

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Dal'molin, C. G., Quek, L. E., Palfreyman, R. W., and Nielsen, L. K. (2011). AlgaGEM–a genome-scale metabolic reconstruction of algae based on the Chlamydomonas reinhardtii genome. BMC Genomics 12(Suppl. 4):S5. doi: 10.1186/1471-2164-12-S4-S5

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Daran-Lapujade, P., Jansen, M. L., Daran, J. M., Van Gulik, W., De Winde, J. H., and Pronk, J. T. (2004). Role of transcriptional regulation in controlling fluxes in central carbon metabolism of Saccharomyces cerevisiae. A chemostat culture study. J. Biol. Chem. 279, 9125–9138. doi: 10.1074/jbc.M309578200

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Demartini, D. R., Jain, R., Agrawal, G., and Thelen, J. J. (2011). Proteomic comparison of plastids from developing embryos and leaves of Brassica napus. J. Proteome Res. 10, 2226–2237. doi: 10.1021/pr101047y

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

De Oliveira Dal'molin, C. G., and Nielsen, L. K. (2013). Plant genome-scale metabolic reconstruction and modelling. Curr. Opin. Biotechnol. 24, 271–277. doi: 10.1016/j.copbio.2012.08.007

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

De Oliveira Dal'molin, C. G., Quek, L. E., Palfreyman, R. W., Brumbley, S. M., and Nielsen, L. K. (2010a). AraGEM, a genome-scale reconstruction of the primary metabolic network in Arabidopsis. Plant Physiol. 152, 579–589. doi: 10.1104/pp.109.148817

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

De Oliveira Dal'molin, C. G., Quek, L. E., Palfreyman, R. W., Brumbley, S. M., and Nielsen, L. K. (2010b). C4GEM, a genome-scale metabolic model to study C4 plant metabolism. Plant Physiol. 154, 1871–1885. doi: 10.1104/pp.110.166488

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Ebrahim, A., Lerman, J. A., Palsson, B. O., and Hyduke, D. R. (2013). COBRApy: COnstraints-Based Reconstruction and Analysis for Python. BMC Syst. Biol. 7:74. doi: 10.1186/1752-0509-7-74

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Eicks, M., Maurino, V., Knappe, S., Flugge, U. I., and Fischer, K. (2002). The plastidic pentose phosphate translocator represents a link between the cytosolic and the plastidic pentose phosphate pathways in plants. Plant Physiol. 128, 512–522. doi: 10.1104/pp.010576

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Falginella, L., Castellarin, S. D., Testolin, R., Gambetta, G. A., Morgante, M., and Di Gaspero, G. (2010). Expansion and subfunctionalisation of flavonoid 3′,5′-hydroxylases in the grapevine lineage. BMC Genomics 11:562. doi: 10.1186/1471-2164-11-562

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Fang, J., Reichelt, M., Hidalgo, W., Agnolet, S., and Schneider, B. (2012). Tissue-specific distribution of secondary metabolites in rapeseed (Brassica napus L.). PLoS ONE 7:e48006. doi: 10.1371/journal.pone.0048006

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Goffman, F. D., and Mollers, C. (2000). Changes in tocopherol and plastochromanol-8 contents in seeds and oil of oilseed rape (Brassica napus L.) during storage as influenced by temperature and air oxygen. J. Agric. Food Chem. 48, 1605–1609. doi: 10.1021/jf9912755

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Grafahrend-Belau, E., Junker, A., Eschenroder, A., Muller, J., Schreiber, F., and Junker, B. H. (2013). Multiscale metabolic modeling: dynamic flux balance analysis on a whole-plant scale. Plant Physiol. 163, 637–647. doi: 10.1104/pp.113.224006

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Grafahrend-Belau, E., Schreiber, F., Koschützki, D., and Junker, B. H. (2009). Flux balance analysis of barley seeds: a computational approach to study systemic properties of central metabolism. Plant Physiol. 149, 585–598. doi: 10.1104/pp.108.129635

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Gunstone, F. D. (2001). Production and consumption of rapeseed oil on a global scale. Eur. J. Lipid Sci. Technol. 103, 447–449. doi: 10.1002/1438-9312(200107)103:7<447::AID-EJLT447>3.0.CO;2-Q

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Guo, N., Cheng, F., Wu, J., Liu, B., Zheng, S., Liang, J., et al. (2014). Anthocyanin biosynthetic genes in Brassica rapa. BMC Genomics 15:426. doi: 10.1186/1471-2164-15-426

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Hamilton, J. J., and Reed, J. L. (2014). Software platforms to facilitate reconstructing genome-scale metabolic networks. Environ. Microbiol. 16, 49–59. doi: 10.1111/1462-2920.12312

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Hay, J. O., and Schwender, J. (2011a). Computational analysis of storage synthesis in developing Brassica napus L. (oilseed rape) embryos: flux variability analysis in relation to ¹³C metabolic flux analysis. Plant J. 67, 513–525. doi: 10.1111/j.1365-313X.2011.04611.x

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Hay, J. O., and Schwender, J. (2011b). Metabolic network reconstruction and flux variability analysis of storage synthesis in developing oilseed rape (Brassica napus L.) embryos. Plant J. 67, 526–541. doi: 10.1111/j.1365-313X.2011.04613.x

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Hay, J. O., and Schwender, J. (2014). Flux variability analysis: application to developing oilseed rape embryos using toolboxes for constraint-based modeling. Methods Mol. Biol. 1090, 301–316. doi: 10.1007/978-1-62703-688-7_18

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Hucka, M., Finney, A., Sauro, H. M., Bolouri, H., Doyle, J. C., Kitano, H., et al. (2003). The systems biology markup language (SBML): a medium for representation and exchange of biochemical network models. Bioinformatics 19, 524–531. doi: 10.1093/bioinformatics/btg015

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Juty, N., Le Novere, N., and Laibe, C. (2012). Identifiers.org and MIRIAM Registry: community resources to provide persistent identification. Nucleic Acids Res. 40, D580–D586. doi: 10.1093/nar/gkr1097

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Kliphuis, A. M., Klok, A. J., Martens, D. E., Lamers, P. P., Janssen, M., and Wijffels, R. H. (2012). Metabolic modeling of Chlamydomonas reinhardtii: energy requirements for photoautotrophic growth and maintenance. J. Appl. Phycol. 24, 253–266. doi: 10.1007/s10811-011-9674-3

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Lakshmanan, M., Zhang, Z., Mohanty, B., Kwon, J. Y., Choi, H. Y., Nam, H. J., et al. (2013). Elucidating rice cell metabolism under flooding and drought stresses using flux-based modeling and analysis. Plant Physiol. 162, 2140–2150. doi: 10.1104/pp.113.220178

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Le Novere, N., Finney, A., Hucka, M., Bhalla, U. S., Campagne, F., Collado-Vides, J., et al. (2005). Minimum information requested in the annotation of biochemical models (MIRIAM). Nat. Biotechnol. 23, 1509–1515. doi: 10.1038/nbt1156

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Lerman, J. A., Hyduke, D. R., Latif, H., Portnoy, V. A., Lewis, N. E., Orth, J. D., et al. (2012). In silico method for modelling metabolism and gene product expression at genome scale. Nat. Commun. 3, 929. doi: 10.1038/ncomms1928

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Lewis, N. E., Nagarajan, H., and Palsson, B. O. (2012). Constraining the metabolic genotype-phenotype relationship using a phylogeny of in silico methods. Nat. Rev. Microbiol. 10, 291–305. doi: 10.1038/nrmicro2737

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Liu, S., Liu, Y., Yang, X., Tong, C., Edwards, D., Parkin, I. A., et al. (2014). The Brassica oleracea genome reveals the asymmetrical evolution of polyploid genomes. Nat. Commun. 5, 3930. doi: 10.1038/ncomms4930

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Lonien, J., and Schwender, J. (2009). Analysis of metabolic flux phenotypes for two Arabidopsis mutants with severe impairment in seed storage lipid synthesis. Plant Physiol. 151, 1617–1634. doi: 10.1104/pp.109.144121

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Machado, D., and Herrgard, M. (2014). Systematic evaluation of methods for integration of transcriptomic data into constraint-based models of metabolism. PLoS Comput. Biol. 10:e1003580. doi: 10.1371/journal.pcbi.1003580

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Mahadevan, R., and Schilling, C. H. (2003). The effects of alternate optimal solutions in constraint-based genome-scale metabolic models. Metab. Eng. 5, 264–276. doi: 10.1016/j.ymben.2003.09.002

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Manichaikul, A., Ghamsari, L., Hom, E. F., Lin, C., Murray, R. R., Chang, R. L., et al. (2009). Metabolic network analysis integrated with transcript verification for sequenced genomes. Nat. Methods 6, 589–592. doi: 10.1038/nmeth.1348

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Mcanulty, M. J., Yen, J. Y., Freedman, B. G., and Senger, R. S. (2012). Genome-scale modeling using flux ratio constraints to enable metabolic engineering of clostridial metabolism in silico. BMC Syst. Biol. 6:42. doi: 10.1186/1752-0509-6-42

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Mintz-Oron, S., Meir, S., Malitsky, S., Ruppin, E., Aharoni, A., and Shlomi, T. (2012). Reconstruction of Arabidopsis metabolic network models accounting for subcellular compartmentalization and tissue-specificity. Proc. Natl. Acad. Sci. U.S.A. 109, 339–344. doi: 10.1073/pnas.1100358109

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Monk, J., and Palsson, B. O. (2014). Genetics. Predicting microbial growth. Science 344, 1448–1449. doi: 10.1126/science.1253388

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Mueller, L. A., Zhang, P., and Rhee, S. Y. (2003). AraCyc: a biochemical pathway database for Arabidopsis. Plant Physiol. 132, 453–460. doi: 10.1104/pp.102.017236

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Muller, A. C., and Bockmayr, A. (2013). Fast thermodynamically constrained flux variability analysis. Bioinformatics 29, 903–909. doi: 10.1093/bioinformatics/btt059

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Noor, E., Lewis, N. E., and Milo, R. (2012). A proof for loop-law constraints in stoichiometric metabolic networks. BMC Syst. Biol. 6:140. doi: 10.1186/1752-0509-6-140

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Park, J. M., Kim, T. Y., and Lee, S. Y. (2010). Prediction of metabolic fluxes by incorporating genomic context and flux-converging pattern analyses. Proc. Natl. Acad. Sci. U.S.A. 107, 14931–14936. doi: 10.1073/pnas.1003740107

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Pilalis, E., Chatziioannou, A., Thomasset, B., and Kolisis, F. (2011). An in silico compartmentalized metabolic model of Brassica napus enables the systemic study of regulatory aspects of plant central metabolism. Biotechnol. Bioeng. 108, 1673–1682. doi: 10.1002/bit.23107

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Poolman, M. G., Kundu, S., Shaw, R., and Fell, D. A. (2013). Responses to light intensity in a genome-scale model of rice metabolism. Plant Physiol. 162, 1060–1072. doi: 10.1104/pp.113.216762

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Poolman, M. G., Miguet, L., Sweetlove, L. J., and Fell, D. A. (2009). A genome-scale metabolic model of Arabidopsis and some of its properties. Plant Physiol. 151, 1570–1581. doi: 10.1104/pp.109.141267

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Reed, J. L. (2012). Shrinking the metabolic solution space using experimental datasets. PLoS Comput. Biol. 8:e1002662. doi: 10.1371/journal.pcbi.1002662

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Rocha, A. G., Mehlmer, N., Stael, S., Mair, A., Parvin, N., Chigri, F., et al. (2014). Phosphorylation of Arabidopsis transketolase at Ser428 provides a potential paradigm for the metabolic control of chloroplast carbon metabolism. Biochem. J. 458, 313–322. doi: 10.1042/BJ20130631

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Rolletschek, H., Melkus, G., Grafahrend-Belau, E., Fuchs, J., Heinzel, N., Schreiber, F., et al. (2011). Combined noninvasive imaging and modeling approaches reveal metabolic compartmentation in the barley endosperm. Plant Cell 23, 3041–3054. doi: 10.1105/tpc.111.087015

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Saha, R., Suthers, P. F., and Maranas, C. D. (2011). Zea mays iRS1563: a comprehensive genome-scale metabolic reconstruction of maize metabolism. PLoS ONE 6:e21784. doi: 10.1371/journal.pone.0021784

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Scheibe, R. (2004). Malate valves to balance cellular energy supply. Physiol. Plant. 120, 21–26. doi: 10.1111/j.0031-9317.2004.0222.x

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Schellenberger, J., Lewis, N. E., and Palsson, B. O. (2011a). Elimination of thermodynamically infeasible loops in steady-state metabolic models. Biophys. J. 100, 544–553. doi: 10.1016/j.bpj.2010.12.3707

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Schellenberger, J., Park, J. O., Conrad, T. M., and Palsson, B. O. (2010). BiGG: a Biochemical Genetic and Genomic knowledgebase of large scale metabolic reconstructions. BMC Bioinformatics 11:213. doi: 10.1186/1471-2105-11-213

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Schellenberger, J., Que, R., Fleming, R. M., Thiele, I., Orth, J. D., Feist, A. M., et al. (2011b). Quantitative prediction of cellular metabolism with constraint-based models: the COBRA Toolbox v2.0. Nat. Protoc. 6, 1290–1307. doi: 10.1038/nprot.2011.308

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Schomburg, I., Chang, A., Placzek, S., Sohngen, C., Rother, M., Lang, M., et al. (2013). BRENDA in 2013: integrated reactions, kinetic data, enzyme function data, improved disease classification: new options and contents in BRENDA. Nucleic Acids Res. 41, D764–D772. doi: 10.1093/nar/gks1049

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Schwender, J. (2008). Metabolic flux analysis as a tool in metabolic engineering of plants. Curr. Opin. Biotechnol. 19, 131–137. doi: 10.1016/j.copbio.2008.02.006

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Schwender, J. (2011). Experimental flux measurements on a network scale. Front. Plant Sci. 2:63. doi: 10.3389/fpls.2011.00063

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Schwender, J., and Hay, J. O. (2012). Predictive modeling of biomass component tradeoffs in Brassica napus developing oilseeds based on in silico manipulation of storage metabolism. Plant Physiol. 160, 1218–1236. doi: 10.1104/pp.112.203927

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Schwender, J., König, C., Klapperstück, M., Heinzel, N., Münz, E., Hebbelmann, I., et al. (2014). Transcript abundance on its own cannot be used to infer fluxes in central metabolism. Front. Plant Sci. 5:668. doi: 10.3389/fpls.2014.00668

CrossRef Full Text | Google Scholar

Schwender, J., and Ohlrogge, J. B. (2002). Probing in vivo metabolism by stable isotope labeling of storage lipids and proteins in developing Brassica napus embryos. Plant Physiol. 130, 347–361. doi: 10.1104/pp.004275

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Seaver, S. M., Gerdes, S., Frelin, O., Lerma-Ortiz, C., Bradbury, L. M., Zallot, R., et al. (2014). High-throughput comparison, functional annotation, and metabolic modeling of plant genomes using the PlantSEED resource. Proc. Natl. Acad. Sci. U.S.A. 111, 9645–9650. doi: 10.1073/pnas.1401329111

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Seaver, S. M., Henry, C. S., and Hanson, A. D. (2012). Frontiers in metabolic reconstruction and modeling of plant genomes. J. Exp. Bot. 63, 2247–2258. doi: 10.1093/jxb/err371

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Selinski, J., Konig, N., Wellmeyer, B., Hanke, G. T., Linke, V., Neuhaus, H. E., et al. (2014). The plastid-localized NAD-dependent malate dehydrogenase is crucial for energy homeostasis in developing Arabidopsis thaliana seeds. Mol. Plant 7, 170–186. doi: 10.1093/mp/sst151

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Sheahan, J., Cheong, H., and Rechnitz, G. (1998). The colorless flavonoids of Arabidopsis thaliana (Brassicaceae). I. A model system to study the orthodihydroxy structure. Am. J. Bot. 85, 467. doi: 10.2307/2446429

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Swainston, N., Smallbone, K., Mendes, P., Kell, D., and Paton, N. (2011). The SuBliMinaL Toolbox: automating steps in the reconstruction of metabolic networks. J. Integr. Bioinform. 8, 186. doi: 10.2390/biecoll-jib-2011-186

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Sweetlove, L. J., Obata, T., and Fernie, A. R. (2014). Systems analysis of metabolic phenotypes: what have we learnt? Trends Plant Sci. 19, 222–230. doi: 10.1016/j.tplants.2013.09.005

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Sweetlove, L. J., and Ratcliffe, R. G. (2011). Flux-balance modeling of plant metabolism. Front. Plant Sci. 2:38. doi: 10.3389/fpls.2011.00038

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Tanz, S. K., Castleden, I., Hooper, C. M., Vacher, M., Small, I., and Millar, H. A. (2013). SUBA3: a database for integrating experimentation and prediction to define the SUBcellular location of proteins in Arabidopsis. Nucleic Acids Res. 41, D1185–D1191. doi: 10.1093/nar/gks1151

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Thiele, I., and Palsson, B. O. (2010). A protocol for generating a high-quality genome-scale metabolic reconstruction. Nat. Protoc. 5, 93–121. doi: 10.1038/nprot.2009.203

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Thorleifsson, S. G., and Thiele, I. (2011). rBioNet: A COBRA toolbox extension for reconstructing high-quality biochemical networks. Bioinformatics 27, 2009–2010. doi: 10.1093/bioinformatics/btr308

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Topfer, N., Caldana, C., Grimbs, S., Willmitzer, L., Fernie, A. R., and Nikoloski, Z. (2013). Integration of genome-scale modeling and transcript profiling reveals metabolic pathways underlying light and temperature acclimation in Arabidopsis. Plant Cell 25, 1197–1211. doi: 10.1105/tpc.112.108852

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Troncoso-Ponce, M. A., Kilaru, A., Cao, X., Durrett, T. P., Fan, J., Jensen, J. K., et al. (2011). Comparative deep transcriptional profiling of four developing oilseeds. Plant J. 68, 1014–1027. doi: 10.1111/j.1365-313X.2011.04751.x

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text

Wang, X., Wang, H., Wang, J., Sun, R., Wu, J., Liu, S., et al. (2011). The genome of the mesopolyploid crop species Brassica rapa. Nat. Genet. 43, 1035–1039. doi: 10.1038/ng.919

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Weitzel, M., Noh, K., Dalman, T., Niedenfuhr, S., Stute, B., and Wiechert, W. (2013). 13CFLUX2–high-performance software suite for 13C-metabolic flux analysis. Bioinformatics 29, 143–145. doi: 10.1093/bioinformatics/bts646

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Wheeler, M. C., Tronconi, M. A., Drincovich, M. F., Andreo, C. S., Flügge, U. I., and Maurino, V. G. (2005). A comprehensive analysis of the NADP-malic enzyme gene family of Arabidopsis. Plant Physiol. 139, 39–51. doi: 10.1104/pp.105.065953

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Wiback, S. J., Famili, I., Greenberg, H. J., and Palsson, B. O. (2004). Monte Carlo sampling can be used to determine the size and shape of the steady-state flux space. J. Theor. Biol. 228, 437–447. doi: 10.1016/j.jtbi.2004.02.006

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Williams, T. C., Poolman, M. G., Howden, A. J., Schwarzlander, M., Fell, D. A., Ratcliffe, R. G., et al. (2010). A genome-scale metabolic model accurately predicts fluxes in central carbon metabolism under stress conditions. Plant Physiol. 154, 311–323. doi: 10.1104/pp.110.158535

Pubmed Abstract | Pubmed Full Text | CrossRef Full Text | Google Scholar

Keywords: loopless flux balance analysis, ¹³C-metabolic flux analysis, central metabolism, carbon partitioning, constraint-based reconstruction and analysis

Citation: Hay JO, Shi H, Heinzel N, Hebbelmann I, Rolletschek H and Schwender J (2014) Integration of a constraint-based metabolic model of Brassica napus developing seeds with ¹³C-metabolic flux analysis. Front. Plant Sci. 5:724. doi: 10.3389/fpls.2014.00724

Received: 18 September 2014; Accepted: 01 December 2014;
Published online: 19 December 2014.

Edited by:

Lee Sweetlove, University of Oxford, UK

Reviewed by:

Shyam K. Masakapalli, University of Bath, UK
Nicholas John Kruger, University of Oxford, UK

Copyright © 2014 Hay, Shi, Heinzel, Hebbelmann, Rolletschek and Schwender. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Jorg Schwender, Brookhaven National Laboratory, Biological, Environment and Climate Sciences Department, Building 463, Upton, NY 11973, USA e-mail:c2Nod2VuZEBibmwuZ292

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Integration of a constraint-based metabolic model of Brassica napus developing seeds with 13C-metabolic flux analysis

Introduction

Materials and Methods

In-vitro Embryo Culture and Biomass Composition Measurements

13C-Metabolic Flux Analysis

Bna572+: Metabolic Network Reconstruction Revision and Availability

Update of Biochemical Reaction Equations

Revision of Gene-Protein-Reaction Associations

Confidence Scoring and Integration of Gene Expression Data

Genotype-Specific Constraints-Based Metabolic Modeling

Flux Variability Analysis

Topological Analysis, Mapping and Random Sampling of Metabolic Network

Results

History of Bna572

Bna572+ Database and Revision of Bna572

Reference genome

New pathways

Confidence scores

Availability of Bna572+ reconstruction and models

Analysis of Metabolic Network Structure

Analysis of New Subsystems (Reaction 572+)

Verification of Gene Expression

13C-Metabolic Flux Analysis and Mapping Bna572+ to a 13C-Model

Genotype-Specific Simulations of Bna572+

Differential Flux Variability and Reaction Responsiveness in Brassica Napus Genotypes

Limitations

Discussion

Relevance of Bna572+

Importance of Standardization

Tissue Specific vs. Genome Scale Model

Shrinking the Solution Space Using 13C-Constraints and Loopless Algorithm

Comparison of Two Genotypes

Conflict of Interest Statement

Acknowledgments

Supplementary Material

References

Integration of a constraint-based metabolic model of Brassica napus developing seeds with ¹³C-metabolic flux analysis

¹³C-Metabolic Flux Analysis

¹³C-Metabolic Flux Analysis and Mapping Bna572+ to a ¹³C-Model

Shrinking the Solution Space Using ¹³C-Constraints and Loopless Algorithm