Identification of a Solanum pennellii Chromosome 4 Fruit Flavor and Nutritional Quality-Associated Metabolite QTL

A major resource for tomato quality improvement and gene discovery is the collection of introgression lines (ILs) of cultivated Solanum lycopersicum that contain different, defined chromosomal segments derived from the wild tomato relative, S. pennellii. Among these lines, IL4-4, in which the bottom of S. lycopersicum (cv. M82) chromosome 4 is replaced by the corresponding S. pennellii segment, is altered in many primary and secondary metabolites, including many related to fruit flavor and nutritional quality. Here, we provide a comprehensive profile of IL4-4 ripe fruit metabolites, the transcriptome and fine mapping of sub-ILs. Remarkably, out of 327 quantified metabolites, 185 were significantly changed in IL4-4 fruit, compared to the control. These altered metabolites include volatile organic compounds, primary and secondary metabolites. Partial least squares enhanced discriminant analysis of the metabolite levels among sub-ILs indicated that a genome region encompassing 20 putative open reading frames is responsible for most of the metabolic changes in IL4-4 fruit. This work provides comprehensive insights into IL4-4 fruit biochemistry, identifying a small region of the genome that has major effects on a large and diverse set of metabolites.


INTRODUCTION
Recent research has highlighted the importance of crop compositional quality for human health (Demmig-Adams and Adams, 2002;Spencer et al., 2005;Martin et al., 2011). Tomato, as one of the most important fruit crops worldwide, represents a valuable source of micronutrients including amino acids, vitamins and antioxidants. Despite its importance as a crop and a major component of the human diet, fruit quality has deteriorated in recent years. In particular, the flavor of modern commercial varieties is generally perceived as poor as breeders have focused most attention on yield. Natural variation in the wild relatives of tomato is a potential source of genetic and biochemical diversity for improvement of flavor and nutrition of the cultivated tomato (Fernie et al., 2006;Schauer et al., 2006;Klee and Tieman, 2013). Solanum pennellii, a wild South American relative, has a complete genome sequence ) and a well characterized collection of introgression lines (ILs) (Eshed and Zamir, 1995;Liu and Zamir, 1999;Chitwood et al., 2013). These ILs have been extensively exploited for discovery of fruit quality associated QTLs, including soluble solids content, volatile emissions, pigment biosynthesis as well as primary/secondary metabolite production (Causse et al., 2004;Baxter et al., 2005;Schauer et al., 2006;Tieman et al., 2006;Perez-Fons et al., 2014;Alseekh et al., 2015). However, only a few of the loci affecting fruit chemical composition have been mapped to high resolution using sub-isogenic lines (sub-ILs) (Fridman et al., 2004;Alseekh et al., 2015).
Previous studies of the S. pennellii IL population indicated that the bottom of chromosome 4 contains QTLs for many horticulturally important traits including soluble solids content, fruit shape, lycopene content and chemical composition (Yates et al., 2004;Schauer et al., 2006;Tieman et al., 2006;Mathieu et al., 2009;Alseekh et al., 2015). However, the epidermal reticulation phenotype of IL4-4 fruit controlled by CUTICULAR WATER PERMEABILITY 1 (CWP1) is a barrier to utilizing the genetic variation of wild relatives at the bottom of chromosome 4 for improving tomato fruit quality (Hovav et al., 2007). At a fundamental level, the molecular basis for the major effects on such a broad range of fruit metabolites in IL4-4 is an interesting and important challenge.
Identification of metabolite QTLs using metabolite profiling is a powerful tool to understand the complex mechanisms underlying regulation of metabolic pathways. In Arabidopsis, Keurentjes et al. (2006) identified more than 4000 metabolite QTLs using untargeted metabolomics methods in a recombinant inbred population derived from two most divergent Arabidopsis accessions. The combination of rice metabolic profiling with an ultrahigh-density genetic map greatly accelerated the gene identification and pathway elucidation for metabolites in rice (Gong et al., 2013). A refinement on metabolomic profiling to identify genes underlying important QTLs is construction of transcriptome-enabled regulatory networks, as illustrated in S. pennellii-derived ILs (Fei et al., 2011;Chitwood et al., 2013). Comparative analysis of RNA-Seq data from M82 and ILs may reveal the transcriptomic changes regulated by the introgressed genome segment. The combination of metabolite profiling and RNA-Seq can facilitate understanding points of metabolic regulation (Adato et al., 2009) and identification of the causative genes (Chambers et al., 2014).
Here, we performed a large-scale metabolic analysis with a fine-mapped sub-IL population derived from IL4-4 combined with RNA-Seq performed on ripe fruit tissue of IL4-4 and M82. Out of 327 quantified fruit metabolites, 185 were altered in the IL4-4 fruit. Further QTL mapping identified a locus encompassing twenty annotated genes near the bottom of chromosome 4 that is responsible for the overall metabolic changes associated with IL4-4.

Plant Material and Growth Conditions
Plants used for volatile analysis were grown in the field at Live Oak, FL, USA; Plants used for other metabolites analysis were grown in greenhouses at Golm, Germany. To generate a series of sub-ILs, a total of 688 F2 plants were derived from a cross between IL4-4 and M82. DNA was extracted from each individual and screened for the flanking ends of the introgression with markers C198103-1 and C10. This screen resulted in the isolation of 57 recombinant individuals within the interval. Points of recombination within these sub-ILs were further defined by the markers listed in Supplementary Table S1. Thirty sub-ILs were selected for propagating homozygous seeds in the F3 generation. Ripe fruits from F4 plants were harvested for metabolite profiling.
To validate consistency of chemical profiling across sites, we examined multiple metabolites quantified in both locations. Glucose, fructose, citric, and malic acids were expressed as a ratio of IL4-4 to M82. These numbers were consistent across both locations (Supplementary Table S2).

Volatile Collection and Analysis
Volatile organic compounds (VOCs) were collected from ripe fruits of sub-ILs R401, R2174, R2075, R100, R434 as well as the parent lines, IL4-4 and M82, grown in randomized, replicated plots. Each biological replicate was a mixture of five to six individual fruit at the red ripe stage. Collection of VOCs was performed as described previously (Tieman et al., 2006). In brief, VOCs were collected from chopped ripe fruits (peel and flesh) during a 1-h period. The VOCs were trapped on SuperQ resin and subsequently eluted with methylene chloride using nonyl acetate as an internal control. The samples were separated on a DB-5 column (Agilent) 1 and analyzed on an Agilent 6890N gas chromatograph equipped with a flame ionization detector. Retention times compared with known standards and identities of volatile peaks were confirmed by gas chromatography/mass spectrometry (GC/MS) (Agilent 5975 GC/MS) 1 . The list of quantified volatile compounds is presented in Supplementary  Table S3.

Primary Metabolites Profiling
The extraction method was performed as described by Giavalisco et al. (2011). In short, samples were extracted with 1 ml of methanol/methyl tert-butyl ether/water mixture. After incubation in 4 • C and sonication for 10 min in an ice-cooled sonic bath, 500 ml of water/methanol mixture was added. This led to the formation of two phases: a lipophilic phase and a polar phase. For the primary metabolites aliquot from the polar phase was collected and dried under vacuum, and the residue was derivatized for 120 min at 37 • C (in 50 µl of 20 mg ml-1 methoxyamine hydrochloride in pyridine) followed by a 30 min treatment at 37 • C with 50 µl of MSTFA. The GC-MS system used was a gas chromatograph coupled to a time-of-flight mass spectrometer (Pegasus III, Leco). An autosampler system (PAL) injected the samples. Helium was used as carrier gas at a constant flow rate of 2 ml/s and gas chromatography was performed on a 30 m DB-35 column. The injection temperature was 230 • C and the transfer line and ion source were set to 250 • C. The initial temperature of the oven (85 • C) increased at a rate of 15 • C/min up to a final temperature of 360 • C. After a solvent delay of 180 s mass spectra were recorded at 20 scans s-1 with FIGURE 1 | Heat map of IL4-4 fruit metabolites compared to M82. BCAA, branched-amino acids; FA, fatty acids; AAA, aromatic-amino acids; CAR, carotenoid; ACE, acetate; AA, amino acids; Ssa, Sugars and sugar alcohols; OA, organic acids; Vit, vitamins; Cyc, cyclitol; Alk, alkaloids; DAG, diacylglycerol; DGDG, digalactosyl-diacylglycerol; MGDG, monogalactosyldiacylglycerol; PC, phosphatidylcholine; PE, phosphatidylethanolamine; TAG, triacylglyceride; PG, phosphatidylglycerol; PI, phosphatidylinositol; PS, phosphatidylserine; SQDG, sulfoquinovosyldiacylglycerol.

Secondary Metabolites Profiling
For secondary metabolites the rest of polar phase (see above extraction protocol) was dried and residue was suspended in 200 µl of 80% methanol water (80:20). The extracts were then subjected to LC-MS analysis using a high-performance liquid chromatography (HPLC; Surveyor; Thermo Finnigan, USA), coupled to a Finnigan LTQ-XP system (Thermo Finnigan, USA), Metabolite identification and annotation were performed using a combination of standard compounds and tomato metabolomics databases (Moco et al., 2006;Iijima et al., 2008;Fernie, 2009, 2010;Rohrmann et al., 2011).

Lipid Extraction and Analysis
For lipid extraction, the lipophilic phase (see extraction protocol above) was collected and vacuum-dried. Samples were processed using ultra-performance liquid chromatography coupled with Fourier transform mass spectrometry (UPLC-FT-MS, Hummel et al., 2011), on a C8 reverse phase column coupled with an Exactive mass spectrometer (Thermo-Fisher) 2 in positive and negative ionization mode. Processing of chromatograms, peak detection and integration were performed using REFINER MS R 6.0 (GeneData) 3 . Processing of mass spectrometry data included the removal of the fragmentation information, isotopic peaks, as well as chemical noise. Obtained features (m/z at a certain retention time) were queried against an in-house lipid database for further annotation.

Brix Value Determination
Five ripe tomato fruits were homogenized in a blender for 30 s and frozen at −80 • C until analysis. Samples were thawed and 1.5 mL was centrifuged at 16 000 × g for 5 min. The supernatant was used to calculate • Brix with a handheld refractometer (ATAGO N-20, Japan).

QTL Mapping
To map the metabolite variation in the sub-ILs, a one-way analysis of variance (ANOVA; level of significance set as p < 0.01) was used to determine the QTL controlling metabolite content. All lines were compared with M82 and each other. If metabolite level of the line was significantly different from the M82 control (indicated by Dunnett's t-test p < 0.01), a line was considered as harboring a QTL. A list of QTLs within the sub-ILs can be found at Supplementary Table S4.

Heat Map
The heat map was generated with MutiExperiment Viewer 4.0. False color imaging was done on the log2-transformed metabolite data. Metabolite data were the average value of all replicates for each line.

Statistical Analysis
Unpaired Student's t-test was used for two-sample comparisons. For multiple comparisons, an ANOVA was performed followed

RNA Extraction and RNA-seq
Total RNA was extracted from frozen pericarp tissue of ripe fruits as described in Griffiths et al. (1999). Strand-specific RNA-Seq libraries were constructed using the protocol described in Zhong et al. (2011) and sequenced on the Illumina HiSeq 2000 platform using the single-end mode. Raw RNA-Seq reads were processed using Trimmomatic (Bolger A.M. et al., 2014) to remove adaptor and low quality sequences. RNA-Seq reads were then aligned to the ribosomal RNA database (Quast et al., 2013) using Bowtie allowing three mismatches (Langmead et al., 2009) and the mappable reads were discarded. The resulting high-quality cleaned reads were aligned to the tomato reference genome (The Tomato Genome Consortium, 2012) using TopHat allowing one segment mismatch ). Following alignments, raw counts for each tomato gene were derived and normalized to reads per kilobase of exon model per million mapped reads (RPKM). Raw counts were then fed to the DESeq package (Anders and Huber, 2010) for differential expression analysis. Genes with adjusted p values less than 0.05 and fold changes greater than or equal to 2 were identified as differentially expressed genes (DEGs) between IL4-4 and M82.

SIFT (Sorting Intolerant from Tolerant) Analyses
The deduced amino acid sequences from S. lycopersicum and S. pennellii were aligned using ClustalW 4 , and polymorphic sequences were submitted to SIFT 5 to predict the impact of amino acid substitutions on protein function (Kumar et al., 2009).

Gene Ontology (GO) Enrichment Analysis
Gene ontology enrichment analyses of DEGs were conducted with PANTHER 6 . GO terms for biological process and molecular function were retrieved with functions of this web-tool. The heatmap of metabolism overview was designed using Mapman software (Mapman version 3.0.0). A dot represents the log2 of the RPKM ratio of a transcript between IL4-4 and control M82.

Taste Panel Analysis
Twenty fruits harvested from R2174, R434, R2075, IL4-4, and M82 were subjected to sensory evaluation. Samples were cut into wedges and labeled with random numbers. Sample taste tests were performed on two occasions with two separate harvests. Each taste panel had 14 panelists. Each sample was rated on a 10-point hedonic scale (1-10, with 10 as like extremely). Taste preference scores are an averaged score of all panelists. Significant differences were calculated using Student's T-test.

Metabolite Profiling of IL4-4 Fruit
To obtain an overview of the IL4-4 fruit metabolome, we quantified a set of metabolites that included VOCs, hydrophilic primary, hydrophilic secondary and lipophilic metabolites by utilizing targeted gas chromatography (targeted GC), gas chromatography-mass spectrometry (GC-MS) and high/ultraperformance liquid chromatography mass spectrometry (HP/UPLC-MS). An overlay heat map shows the metabolite contents in IL4-4 with respect to M82. The fully annotated data set is provided in Supplementary Table S3. Out of 327 quantified fruit metabolites, 185 were altered in the IL4-4 fruit (p < 0.05) (Figure 1). Chemicals with altered contents included 21 VOCs, 44 hydrophilic primary metabolites, 31 hydrophilic secondary metabolites and 89 lipids ( Table 1). Consistent with previous results (Tieman et al., 2006), many VOCs were significantly elevated in IL4-4 fruit. These VOCs include chemicals synthesized from multiple independent  biosynthetic pathways derived from branched-chain amino acids (BCAAs), fatty acids (FA), and aromatic amino acids (AAA) as well as undefined pathways. These include multiple VOCs associated with consumer liking (Tieman et al., 2012), including 1-nitro-2-phenethane and 3-methyl-1-butanol. Altered hydrophilic primary metabolites included multiple amino acids, organic acids, vitamins and almost all of the measured sugars. Significantly decreased hydrophilic primary metabolites in IL4-4 included multiple amino acids, organic acids and one unknown sugar (Table 1). Interestingly, out of 18 quantified sugars and sugar alcohols, 12 were elevated, including the most important contributors to sweetness, glucose, and fructose, while only one unknown sugar was significantly decreased in IL4-4. As key precursors for phenylpropanoids biosynthesis, levels of all three AAA were increased in IL4-4, indicating potential regulators of the shikimate pathway may exist within the S. pennellii genomic segment included in IL4-4. In addition, 31 out of 49 hydrophilic secondary metabolites and 89 out of 170 lipids that were quantified were significantly altered (p < 0.05) ( Table 1).
These results together indicate a locus or loci at the bottom of S. pennellii chromosome 4 regulating multiple independent metabolic pathways during fruit ripening.

Transcriptomic Analysis of IL4-4
To gain insight into the molecular mechanisms underlying the observed metabolite changes, gene expression in ripe pericarp tissues of IL4-4 and M82 was determined using RNA-Seq. A total of 26,147 expressed genes were detected in IL4-4 and M82 (Supplementary Table S5). To identify DEGs, we determined the significance of difference in transcript contents. Using cutoff criteria of ≥2-fold difference in transcript and adjusted P-value < 0.05, we identified 2307 more abundant transcripts and 1635 less abundant transcripts in IL4-4 fruit tissue compared to M82 (Supplementary Table S6). Next, we performed GO enrichment analysis with the DEGs to identify the major gene groups whose transcript abundance was altered in IL4-4. Due to the major alterations in biochemical composition in IL4-4, we examined the GO term distribution within biological process and molecular function categories (Figures 2A,B). GO analysis showed that the largest groups in biological process and molecular function were metabolic process (47.4%) and catalytic activity (45%), respectively. In order to further investigate the potential roles of DEGs in metabolism, we generated a snapshot of these genes over the main metabolic pathways facilitated by MapMan. An overview of the map suggested a global transcriptional regulation of metabolismassociated genes by IL4-4 ( Figure 2C). The mapped DEGs are involved in main primary and secondary metabolic biosynthetic processes such as sugar, acid, amino acid, phenylpropanoid, and lipid metabolism. Notably, light reaction-associated genes were significantly more abundant. This pattern suggests a possible increase in photosynthesis in IL4-4 fruit, which in turn could provide precursors for multiple metabolic pathways.
To better understand the relationship between transcription and metabolite synthesis in IL4-4, we examined transcript content of genes encoding key enzymes related to FA metabolism and the shikimate-phenylpropanoid pathway. 13-lipoxygenases (13-LOXs) and hydroperoxide lyase (HPL) are the main enzymes catalyzing conversion of C18 polyunsaturated FAs to C5 and C6 volatiles in tomato fruit ( Figure 3A). We observed significantly higher levels of TomLOXB (Solyc01g099190), TomLOXF (Solyc01g006560) and HPL (Solyc07g049690) transcripts in IL4-4 fruit tissue while other members of the 13-LOX family remained unchanged ( Figure 3B). Increased transcript abundance of these genes is consistent with the higher levels of C5 and C6 volatiles in IL4-4 fruit ( Figure 3C).
We observed significant increases in AAAs as well as nutrition and flavor contributing phenylpropanoids and glycoalkaloids in IL4-4. Therefore, we examined the transcript levels of genes encoding key steps in the shikimate and phenylpropanoid pathways ( Figure 4A). The transcripts associated with most genes in the shikimate pathway were not significantly different while there was a significant decrease in shikimate dehydrogenase (Solyc10g038080) in IL4-4. Transcripts encoding three genes in the tryptophan synthesis pathway (PAI: anthranilate isomerase, Solyc06g051410; IGPS: indole-3-glycerol phosphate synthase, Solyc03g111850; TS: tryptophan synthase, Solyc07g064280) were elevated in IL4-4, while ADT (arogenate dehydratase: Solyc11g072520) in the phenylalanine pathway was reduced. In addition, we observed significantly higher transcript contents of PHENYLALANINE AMMONIA-LYASE (PAL: Solyc09g007910), which catalyzes the first step in phenylpropanoid synthesis from L-phenylalanine ( Figure 4B). A previous study suggested that expression of this PAL is the primary determinant of commitment to phenylpropanoid synthesis (Xie et al., 2016). Therefore, higher Solyc09g007910 transcript in IL4-4 could explain the higher levels of multiple phenylpropanoids as well as several Phe-derived volatiles in IL4-4 compared to M82 (guaiacol, methylsalicylate and eugenol) (Figures 4C,D). Taken together, these results suggest that genetic factors within IL4-4 regulate metabolite levels via their influence on transcription of genes involved in multiple biosynthetic pathways.

Fine Mapping of the QTLs Controlling Metabolic Changes of IL4-4 Fruit
IL4-4 includes a genomic region of ∼3.4Mb (Chitwood et al., 2013). To fine map the loci affecting metabolic traits in IL4-4 fruit, we generated a set of sub-ILs by screening an F2 population of IL4-4 crossed to M82. Mapping of the recombinant lines was accomplished using molecular markers developed from alignment of the S. lycopersicum and S. pennellii genomes ( Figure 5A). VOCs were collected from fruits of five selected sub-ILs (R2174, R401, R100, R2075, and R434) and the parents, M82 and IL4-4, while additional metabolite data were collected from all sub-ILs shown in Figure 5A. Partial least squares enhanced discriminant analysis (PLS-EDA) models were generated based on the metabolome data collected from the sub-ILs (Figures 5B-E). The distribution patterns of different lines with and without the M genomic region suggest that the overall metabolic traits of all lines can be divided to two main clades, IL4-4-like and M82-like. Sub-ILs that contain S. pennellii genomic region M (defined by marker C2147 and C200952) are closer to IL4-4, while lines without the M region resemble M82. Thus, the S. pennellii M genome fragment contains the major locus affecting a wide range of metabolic changes in tomato fruit.
As shown in Table 1, we identified 185 significantly altered metabolites (p < 0.05). To further understand how the M genomic region regulates fruit quality, we reexamined all of these metabolites in all of the lines for significant effects using ANOVA. Using a statistical threshold of p < 0.01, we identified 72 metabolite QTLs in IL4-4. Of these QTLs, 43 were located within the M region ( Supplementary Table S4; Figure 6). QTLs within this region affect metabolites synthesized from a wide range of both primary and secondary metabolism pathways. Interestingly, QTLs correlated with fructose, glucose, soluble solids (brix) and C5 volatile contents are located within this small segment, all traits likely to affect consumer taste preferences (Figures 7A-C).
There are also major effects on potential nutrition-associated metabolites, including flavonoids ( Figure 7D). To address the effects of these chemical alterations on flavor quality, we conducted a consumer panel where individuals rated fruits from M82, IL4-4, and sub-ILs. Consumers significantly preferred IL4-4 and a sub-IL containing the M region (R2174) over M82 or sub-ILs absent the M region (Supplementary Figure S1). There are 20 annotated genes within the M region of the S. lycopersicum and S. pennellii genomes (Table 2; Figure 6). In the red fruit pericarp tissue, transcript levels of only three of these genes were significantly altered between IL4-4 and M82: ORF2 (Serine/threonine protein kinase), ORF10 (Galactose oxidase/kelch repeat superfamily protein) and ORF17 (Palmitoyltransferase). There were no premature stop codons in any open reading frames of the genes in either ortholog. To investigate the possible effects of amino acid sequence substitutions on the metabolic phenotype in IL4-4, a comparison of protein sequences of the 20 candidate genes from S. pennellii and S. lycopersicum located in the mapped M genome region was performed (Supplementary Table S7). As predicted by the SIFT analysis, amino acid sequence polymorphisms of 13 candidates are not likely to be functionally significant, alteration of amino acids in five candidates likely confer a functional change, while the remainder did not show a clear effect. Notably, CWP1 was previously reported to cause a dehydration phenotype in ripe fruit (Hovav et al., 2007). That skin phenotype is manifested in sub-ILs containing the M region (Supplementary Figure S2). However, the water loss phenotype associated with CWP1 cannot explain the entire effect on metabolites in IL4-4 fruit. For example, while many metabolites are higher, as might be expected in a fruit with less water content, levels of multiple organic acids are reduced in IL4-4. Therefore, further screening of recombinant events in this M region combined with transgenic alterations will be necessary to map the gene(s) responsible for altered metabolite biosynthesis.

DISCUSSION
The Genetic Basis of Fruit Biochemical Diversity in IL4-4 This study provides a comprehensive analysis of the metabolite composition of fruit from M82 and the highly chemically variant line, IL4-4, as well as sub-ILs. Multiple studies have identified metabolite QTLs in introgression populations, including those derived from S. pennellii (Schauer et al., 2006;Tieman et al., 2006;Mathieu et al., 2009;Alseekh et al., 2015). Here, we describe an extensive metabolite analysis of IL4-4, identifying 44 hydrophilic primary metabolites, 89 lipids, 31 hydrophilic secondary metabolites and 21 VOCs that are significantly altered relative to M82. The effect on such a large set of metabolites indicates the potential of this small locus to have a major impact on tomato fruit quality.
High-resolution fine mapping studies with tomato populations to identify loci regulating multiple fruit quantitative traits have been performed (e.g., Frary et al., 2000;Fridman et al., 2004). Yates et al. (2004) mapped a series of phenotypes including brix, fruit weight, stem scar, lycopene content, reticulation, and fruit shape to a three centimorgan segment at the bottom of IL4-4 by analysis of a set of sub-ILs of IL4-4. Hovav et al. (2007) demonstrated that the "reticulation" phenotype is a result of mis-expression of a CWP1 gene with map-based cloning from S. habrochaites sub-ILs. Neither group provided information on an extensive set of metabolites. By fine mapping a set of sub-ILs derived from IL4-4, we identified a segment of 200 kb that significantly alters the contents of more than 40 chemicals (p < 0.01). Localization of the reticulated peel phenotype caused by CWP1 (Hovav et al., 2007) to the M region might explain some of the altered metabolites. Hovav et al. (2007) did not examine metabolites in fruit flesh and the transgenic lines reported are not available. Although CWP1 expression is limited to the cuticle of immature fruits (Hovav et al., 2007), higher metabolite content in mature fruit flesh could be the consequence of dehydration caused by the micro-fissured fruit cuticle. For example, contents of proline were significantly increased, possibly correlating with water loss. Water loss could also affect soluble solids content. However, water loss cannot be the sole explanation for the M region phenotype as levels of a large portion of organic and amino acids were decreased. Separately, we note the existence of a TREHALOSE-6-PHOSPHATE PHOSPHATASE (T6PP) gene in the M region. Trehalose, which is regulated by T6PP, has been proposed to be a central signaling sugar that impacts sucrose content (Nuccio et al., 2015). Altered sucrose metabolism, in turn, could have major effects on many fruit metabolites. There were three genes whose transcripts were significantly altered in IL4-4 ( Table 2). It is possible that altered expression of one or more of these genes could cause at least some of the observed metabolic changes.
In general, GO enrichment analysis of the DEGs suggested a major impact on genes involved in metabolic processes and catalytic activity in ripe IL4-4 fruit tissue. The DEGs were distributed among multiple distinct metabolic pathways. We were able to identify genes with important functions in FA and phenylpropanoid pathways whose transcripts were more abundant in IL4-4. In the case of FAs, multiple LOX transcripts as well as HPL were more abundant. For phenylpropanoids, the transcript encoding PAL (Solyc09g007910) was significantly higher in IL4-4. This transcript was previously shown to be significantly up-regulated in transgenic tomato fruits overexpressing the MYB transcription factor PhODO1 (Dal Cin et al., 2011). Those plants also contain substantially higher levels of many phenylpropanoid compounds (Xie et al., 2016). Notably, the S. pennellii M region defined here does not contain any transcription factors. Therefore, the effects on gene products involved in these metabolic pathways must be indirect.
Higher levels of transcripts in light reaction-associated genes could impact photosynthetic capacity, having an effect on many metabolites.
In summary, we have mapped a major metabolite QTL to a genome region encoding only 20 genes. This locus has a broad effect on multiple metabolic pathways, many contributing to overall fruit flavor and nutritional quality. Further work is necessary to identify the causative gene(s) and the mechanism of action. Although the skin reticulation phenotype is not itself desirable, it may be possible to eliminate or minimize this effect while maximizing the overall effects on fruit metabolite contents.