ORIGINAL RESEARCH article
Similarities and Differences of Photosynthesis Establishment Related mRNAs and Novel lncRNAs in Early Seedlings (Coleoptile/Cotyledon vs. True Leaf) of Rice and Arabidopsis
- State Key Laboratory of Hybrid Rice, College of Life Sciences, Wuhan University, Wuhan, China
Photosynthesis uses sunlight and carbon dioxide to produce biomass that is vital to all life on earth. In seed plants, leaf is the main organ for photosynthesis and production of organic nutrients. The seeds are mobilized to fuel post-germination seedling growth until seedling photosynthesis can be efficiently established. However, the photosynthesis and metabolism in the early growth and development have not been studied systematically and are still largely unknown. In this study, we used two model plants, rice (Oryza sativa L.; monocotyledonous) and Arabidopsis (Arabidopsis thaliana; dicotyledonous) to determine the similarities and differences in photosynthesis in cotyledons and true leaves during the early developmental stages. The photosynthesis-related genes and proteins, and chloroplast functions were determined through RNA-seq, real-time PCR, western blotting and chlorophyll fluorescence analysis. We found that in rice, the photosynthesis established gradually from coleoptile (cpt), incomplete leaf (icl) to first complete leaf (fcl); whereas, in Arabidopsis, photosynthesis well-developed in cotyledon, and the photosynthesis-related genes and proteins expressed comparably in cotyledon (cot), first true leaf (ftl) and second true leaf (stl). Additionally, we attempted to establish an mRNA-lncRNA signature to explore the similarities and differences in photosynthesis establishment between the two species, and found that DEGs, including encoding mRNAs and novel lncRNAs, related to photosynthesis in three stages have considerable differences between rice and Arabidopsis. Further GO and KEGG analysis systematically revealed the similarities and differences of expression styles of photosystem subunits and assembly factors, and starch and sucrose metabolisms between cotyledons and true leaves in the two species. Our results help to elucidate the gene functions of mRNA-lncRNA signatures.
Seeds provide energy and materials for the germination, early growth and development of plants. The size and storage of seeds vary considerably among species. In monocots, the endosperm constitutes the majority of the mature seed (Berger et al., 2006). Rice (Oryza sativa) endosperm consists of an outer aleurone layer and an inner starchy endosperm (Becraft and Asuncion-Crabb, 2000; Krishnan and Dayanandan, 2003). In maize (Zea mays), the endosperm typically consists of the vitreous endosperm and the starchy endosperm (Wang et al., 2012). In dicotyledons, such as the oilseed crop Brassica napus (rapeseed) and the model plant Arabidopsis, storage products consist of proteins and triacylglycerols (TAGs) synthesized mainly in the embryo (Baud et al., 2008). The endosperm and embryo are mobilized to fuel post-germination seedling growth until seedling photosynthesis can be efficiently established (Sela et al., 2020).
Rice, as a monocotyledonous model plant, is one of the most important crops in the world. Rice grain, as a cereal crop, is a food reserve to nourish growing seedlings during germination (Zheng and Wang, 2015; Wu et al., 2016). The starchy endosperm is the major energy source of rice and contains mostly starch and a small amount (8%) of proteins (Becraft and Yi, 2011). During rice seed germination, starch is decomposed into glucose, which is directly used as raw material for respiration, and cellulose is further synthesized to ensure the formation of cell walls in new cells (Leonova et al., 2010). Proteins stored in seeds are decomposed into soluble amino acids under the action of protease and transported to the growth site of embryos. Amino acids under the action of corresponding enzymes form structural proteins, which become cellular components in buds and roots. Rice seeds contain less fat, that is, approximately 1–2%. During seed germination, storage lipids are decomposed into fatty acids and glycerol by lipase and then converted into sugars, thereby providing raw materials for respiration. Arabidopsis seeds are considerably smaller than rice. Rice (Nipponbare) seed usually weighs 20 mg. By contrast, Arabidopsis (Col-0) seed weighs approximately 20 μg. Arabidopsis seeds contain 30–40% proteins (Baud et al., 2002), 38% model oil (O’Neill et al., 2003), and 2% oligosaccharides (Bentsink et al., 2000). The seed stored oil, TAGs, is metabolized to provide carbon skeleton and energy resources for germination and growth of young seedlings (Baud et al., 2002). At the stage of seeds maturation, the most abundant mRNAs in embryos are those encoding seed storage proteins, which are considered to be the source of seedling nitrogen for young seedlings (Pang et al., 1988).
The photosynthesis and carbon cycle of plants are the basis of all life activities. Photosynthesis plays a crucial role in biology and biogeochemistry, transforming sunlight energy and CO2 into metabolic products of life (Liu and Last, 2017). After germination, leaves are the main organs of plant photosynthesis. In dicotyledonous plants, such as Arabidopsis, cotyledons are components of seeds and the first pair of leaves; the first true leaves appeared several days later. Both cotyledons and true leaves contain chloroplasts and have photosynthetic activity; however, the differentiation of chloroplasts follows different pathways in these two organs (Deng and Gruissem, 1987; Mansfield and Briarty, 1996). Cotyledons are capable of storing nutrients and performing photosynthesis and provide a major proportion of matter needed for seedling growth and development until the first true leaf becomes a significant exporter of photosynthates (Zhang et al., 2008; Zheng et al., 2011). Cotyledon deficiency experiments result in growth arrest or even death of young seedlings (Garcia-Cebrian et al., 2003; Hanley and May, 2006). The cotyledon chloroplast biogenesis factor CYO1 (shi-yo-u means cotyledon in Japanese) associated with PSI/LHCI and PSII/LHCII complexes is crucial for the biogenesis of cotyledon chloroplasts in Arabidopsis (Shimada et al., 2007; Muranaka et al., 2012). In rice, the coleoptile is the first leaf-like organ formed around the embryo during embryogenesis (Hirano, 2008). It is the basic structure of the seedling and plays a protective role when the embryo passes through the soil layers (Xiong et al., 2017). The incomplete leaf is the first green organ with only a sheath but no blade. The complete leaves with blades and sheaths are from the second green leaf (Du et al., 2018). OsCYO1 is expressed in leaf but not in coleoptile, and functions in the accumulation and/or assembly of PSI (Tominaga et al., 2016). There are structural and functional differences between monocotyledonous and dicotyledonous plants in early growth and development. In rice, photosynthetic activity may differ from that in Arabidopsis. However, the similarities and differences in photosynthesis and metabolism in the early growth and development of the two model plants have not been studied systematically and are still largely unknown.
In recent years, deep sequencing of transcripts has been increasingly used in the identification of differential expression (Lu et al., 2010; Bhargava et al., 2013; Gao et al., 2013; Yuan et al., 2016). Long non-coding RNAs (lncRNAs) are RNAs with lengths greater than 200 bp that lack protein-encoding functions and play important roles in essential biological processes. lncRNA mostly functions in the nucleus, often in association with chromatin processes (Chekanova, 2015). lncRNAs can be divided into three categories: long intergenic ncRNAs (lincRNAs), intronic ncRNAs (incRNAs), and natural antisense transcripts (NATs) transcribed from the complementary DNA strand of the associated genes (Chekanova, 2015). Although lncRNAs do not have the ability to encode proteins, it is widely believed that lncRNAs can be transcribed and regulate gene expression at the transcriptional and post-transcriptional levels (Kim and Sung, 2012). Many plant lncRNAs are developmentally and environmentally regulated and likely represent functional components of the transcriptome (Liu et al., 2012). The regulation of coding genes mediated by lncRNA at the transcriptional level can be divided into cis-regulation and trans-regulation. Epigenetic regulation of lncRNAs plays a key role in flowering regulation by controlling the expression of FLC (flowering site C) in Arabidopsis. FLC plays a key role in regulating flowering process and regulates flowering independently of environmental signals (Amasino and Michaels, 2010). COLDAIR and COOLAIR are two different classes of lncRNAs transcribed from FLC that participate in epigenetic silencing of FLC (Liu et al., 2010; Heo and Sung, 2011; Tian et al., 2019). A lncRNA HID1 with a length of 236 nt length was identified by chain-specific transcriptome sequencing. This lncRNA participates in seed photomorphogenesis by mediating PIF3 expression in Arabidopsis (Wang et al., 2014b). In addition, some studies have found that lncRNAs can be transcribed from the antisense strands of genes. These lncRNAs are usually involved in post-translational splicing, editing, transport, translation, and degradation of mRNA (Liu et al., 2019).
In this study, high-throughput sequencing was used to analyze mRNA, novel lncRNAs and their target genes during the early growth and development of rice and Arabidopsis. We show that photosynthetic protein complexes gradually mature from cpt, icl to fcl in rice. However, the functions are fully developed during the cot stage of Arabidopsis. Photosynthesis and photosynthesis-related processes were clustered in the three stages of rice but not in Arabidopsis through Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analyses of differentially expressed genes (DEGs). Further analysis of DEGs, including mRNAs and lncRNAs, showed that the upregulated genes from icl to fcl were distributed in photosystem II (PSII), photosystem I (PSI), cytochrome b6/f complex, photosynthetic electron transport and F-type ATPase in rice but not in Arabidopsis. Additionally, assembling the individual subunits to form PSII and PSI are also different in the two plants and may not function at the transcriptional level.
Materials and Methods
Plant Materials and Growth Conditions
Rice (Nipponbare) were grown under 16 h of light at 30°C and 8 h of darkness at 28°C. Seedlings were grown on plates containing only water and 0.8% agar. Leaves were harvested at 68 h for coleoptile (cpt), 82 h for incomplete leaf (icl), and 127 h for first complete leaf (fcl). Arabidopsis (Columbia-0) was grown under 16 h of light at 22°C and 8 h of darkness at 20°C. Seedlings were grown on 1/2 MS medium containing 1.0% sucrose (pH 5.7) or only water and 0.8% agar. Leaves were harvested at 111 h for cotyledon (cot), 183 h for first true leaf (ftl), and 236 h for second true leaf (stl).
RNA Extraction and RNA-Seq Preparation
Approximately 60 rice seedlings and 600 Arabidopsis seedlings were collected from each stages and frozen in liquid nitrogen immediately. Three biological replicates were performed. Total RNA was extracted using a RNeasy Plant Mini Kit followed by DNase I treatment according to the manufacturer’s manual (Qiagen). The extracted RNA was analyzed by 1% (w/v) agarose gel for quality and completeness, and the concentration was measured on a NanoDrop 2000c spectrophotometer (NanoDrop Technologies, Wilmington, DE, United States). The RNA samples were sent to Novogene (Novogene, Co. Ltd.) for mRNA purification, cDNA library construction and sequencing using the platform Illumina. The original raw data from HiSeq X Ten was transformed to sequenced reads by base calling. Sequencing data analysis was completed in UniqueGene (Wuhan, China).
Reads qualities of the RNA sequencing (RNA-Seq) were evaluated using FastQC (Version 0.11.5). The clean data was evaluated by removing the adapter and low-quality reads of the raw data obtained from Illumina sequencing. The clean reads were searched against the rice and Arabidopsis genome Using HISAT2 (Version 2.0.4). The obtained transcripts were assembled using StringTie (Version 1.0.4). All downstream analyses were based on the high quality clean data.
Identification of Novel lncRNAs
Based on the structural characteristics and non-coding function of lncRNA, we established a five-step screening method to obtain high quality lncRNA: (1) Transcripts with one exon were removed while the transcripts with two or more exons were selected; (2) the transcripts with length < 200 bp were removed; (3) Filtered out transcripts with reads coverage < 5 in all samples; (4) Filtered out the mRNA and other non-coding RNA (rRNA, tRNA, snoRNA, snRNA, etc.) by comparing gffcompare (Version 0.10.4) with the annotation files; (5) The potential lincRNA, intronic lncRNA and anti-sense lncRNA were screened according to the class_code information (”u,” “i,” “x”) in the comparison results. Based on the length of the open reading frame, homology with known proteins and their coding potential, the CNCI (Coding-Non-Coding Index) (Sun et al., 2013), CPC (Coding Potential Calculator) (Kong et al., 2007), and CPAT (Coding Potential Assessment Tool), which rapidly recognizes coding and non-coding transcripts from a large pool of candidates (Wang et al., 2013) were combined to screen the lncRNAs. This analysis was combined with information from the Pfam protein database to ensure the predicted lncRNA transcripts did not contain protein-coding domains (Mistry et al., 2007). The transcripts that meet these four criteria were determined to be lncRNAs.
Differential Expression Analysis of mRNAs and lncRNAs
The clean reading was aligned with the reference genomes by HiSAT2 software, and then gene and transcript expression levels were calculated by performing RSEM. In order to make the data of gene expression comparable among samples, the level of gene expression was standardized. FPKM (fragments per kilobase of exon per million fragments mapped) values were used to analyze gene expression. Differential expression analysis of different stages was performed using the DESeq R package (1.10.1). DESeq provides a statistical program based on negative binomial distribution to determine differential expression in digital gene expression data. The resulting P-values were adjusted by Benjamini and Hochberg’s approach for controlling the false discovery rate (Benjamini and Hochberg, 1995). Genes with an adjusted P-value < 0.05 and Fold change ≥ 2.00 found by DESeq were assigned as differentially expressed.
Target Genes of Differentially Expressed lncRNAs
The differentially expressed lncRNAs were selected as targets by cis- or trans- regulation. In the prediction of cis pathway target genes, the genes transcribed in 10 kb upstream or downstream of coding genes. Trans-acting target genes were selected by RNAplex software (Tafer and Hofacker, 2008).
GO and KEGG Pathway Analyses of Differentially Expressed mRNAs and lncRNAs
The GO enrichment analysis of DEGs was implemented by Wallenius non-central hypergeometric distribution based on GOseq R package (Young et al., 2010), which can be adjusted according to the gene length bias in DEGs. GO terms P-value < 0.05 were considered significantly enriched by differentially expressed genes. KEGG (Kanehisa and Goto, 2000) is a database resource for understanding the high-level functions and utilities of biological systems, such as cells, organisms and ecosystems, from molecular-level information, especially large-scale molecular datasets generated by genome sequencing and other high-throughput experimental technologies1. The statistical enrichment of DEGs in KEGG pathway were tested using KOBAS (Mao et al., 2005) software. The significantly enriched KEGG pathways were determined using the corrected P-value < 0.05 as a threshold.
Expression Analysis by Quantitative Real-Time PCR (q-RT PCR) of DEGs
The DEG results were confirmed by q-RT PCR using the same RNA samples that were used for RNA library construction. The isolated RNA was reverse transcribed to cDNA using a PrimeScript RT reagent Kit (Takara, Dalian China) according to the manufacturer’s instructions. Real-time PCR analyses were performed on an Applied Biosystems 7300 Plus Real-Time PCR System with SYBR Green Mix (Takara, Dalian, China). The amplified PCR products were quantified and normalized using actin1 for rice and actin2 for Arabidopsis as a control (Supplementary Table S1). The correlation between RNA Seq and q-RT PCR results were analyzed through pearson’s correlation coefficient (r).
Protein was extracted from each tissue using extraction buffer [50 mM Tris-MES pH 7.5, 50 mM NaCl, 0.5% SDS (v/v)], and protease inhibitor cocktail P9599 (Sigma-Aldrich, St. Louise, MO, United States). The extract was centrifuged at 12000 rpm at 4°C for 30 min. For immunoblotting, protein samples (18 μg total protein) were separated on 12% SDS PAGE gels and transferred to nitrocellulose membranes (BioTraceTM NT nitrocellulose, Mexico) followed by western blot analysis. After blocking non-specific binding with 5% milk, the blot was subsequently incubated with antibodies generated against the indicated proteins and detected using the Super SignalTM West Pico PLUS Chemiluminescent Substrate kit (Thermo Scientific, United States).
Chlorophyll Fluorescence Measurements
Chlorophyll fluorescence was measured using FluorCam 800 MF (Photon Systems Instruments, spol. s r.o.). Plants were maintained in the dark for 30 min before the measurements. The intensities of the actinic light and saturating light were 300 and 2,100 μmol m–2 s–1 photosynthetic photon flux density for rice, and 80 and 1,600 μmol m–2 s–1 photosynthetic photon flux density for Arabidopsis, respectively. Three biological replicates for each treatment were performed with 10 plants.
All data were expressed as mean ± SD and analyzed using GraphPad Prism software (version 7.0, La Jolla, CA, United States). Student’s t-test was used to compare the different groups. And the significant differences between the three stages were calculated using Tukey’s test. The Pearson’s correlation coefficient (r) was used to analyze the correlation between qRT-PCR and RNA-seq results. Differences with |log2 ratio| ≥ 1 and P < 0.05 were selected for the DEGs. Cytoscape software was used to visualize the inferred gene networks (Shannon et al., 2003). Modules were visualized using Cytoscape version 3.6.1.
The sequence data generated in the current study has been uploaded to the NCBI Sequence Read Archive. The accession numbers are SRR10907568–SRR10907573 (coleoptile sequencing of rice), SRR10913120-SRR10913123 (incomplete leaf sequencing of rice), SRR10913307-SRR10913312 (first complete leaf sequencing of rice), SRR10914245-SRR10914250 (cotyledon sequencing of Arabidopsis), SRR10914757-SRR10914762 (first true leaf sequencing of Arabidopsis), SRR10915176-SRR10915181 (second true leaf sequencing of Arabidopsis)2.
Photosynthesis and Chloroplast Development of Early Growth and Development in Rice and Arabidopsis
To find the similarities and differences in photosynthesis and chloroplast development between monocotyledons and dicotyledons during early growth and development, photosynthesis was determined by measuring chlorophyll fluorescence in rice and Arabidopsis. We collected coleoptile (cpt) before the incomplete leaf (icl) appeared in rice and cotyledon (cot) before the first true leaf (ftl) appeared in Arabidopsis as stage 1 (S1), icl before the first complete leaf (fcl) appeared in rice and ftl before the second true leaf (stl) appeared in Arabidopsis as stage 2 (S2), fcl in rice and stl in Arabidopsis appeared fully as stage 3 (S3). The ratio of variable fluorescence to maximum fluorescence (Fv/Fm), which represents the maximum photochemical efficiency of PSII (Maxwell and Johnson, 2000; Hou et al., 2015), was significantly increased in the three stages of rice (Figure 1A). In contrast, unlike rice, Arabidopsis could germinate on 0.8% agar, but was arrested at the cotyledon stage without supplied nutrients (Figure 1B). In addition, the Fv/Fm was decreased during the growth (Figure 1B), suggesting that the nutrients stored in Arabidopsis seeds are insufficient to satisfy their normal growth and development. To ensure the normal growth of Arabidopsis, seedlings were grown on 1/2 MS medium containing 1% sucrose. No significant difference in Fv/Fm was found in the three stages of Arabidopsis (Figure 1C). Taken together, the results of the chlorophyll fluorescence assays suggested that differences exist in early growth and development between rice and Arabidopsis in PSII function.
Figure 1. Comparison of rice and Arabidopsis at the early developmental stages. (A) rice grown on 0.8% agar under 16 h of light at 30°C and 8 h of darkness at 28°C. (B) Arabidopsis plants grown on 0.8% agar under 16 h of light at 22°C and 8 h of darkness at 20°C conditions for different stages (C) Arabidopsis plants grown on 1/2MS under 16 h of light at 22°C and 8 h of darkness at 20°C at different stages. (A–C) Chlorophyll fluorescence analysis. F0, the minimum fluorescence; Fm, the maximum fluorescence; Fv/Fm, the maximum efficiency of PSII photochemistry. cpt, rice coleoptile; icl, rice incomplete leaf; fcl, rice first complete leaf; cot, Arabidopsis cotyledons; ftl, Arabidopsis first true leaf; stl, Arabidopsis second true leaf. (D) Analysis of thylakoid membrane proteins in rice and Arabidopsis at different stages. 16 μg total proteins were separated by 12% SDS PAGE. And immunoblots were performed using antibodies as indicated. CBS, Coomassie Blue Staining. (E) The relative expression levels of chloroplast proteins in (D) and the relative expression levels of relavent genes of rice and Arabidopsis. Data was presented as the means ± SD of three biological replicates. S1, cpt and cot; S2, icl and ftl; S3, fcl and stl. a, b, and c indicate significant differences between the three stages according to Tukey’s test.
To further visualize differences in the photosynthetic complexes in two model plants, we used qRT-PCR and immunoblotting to examine the abundance of subunits of the photosynthetic complexes including PSII: D2, CP47, PsbO, and PsbQ; and PSI: PsaD and PsaF. The Rubisco protein and Coomassie Blue staining were used as a loading control. The results showed that the protein expression of PSII subunits increased from cpt to fcl in rice. Conversely, there was no significant difference in the abundance of PSII and PSI subunits between the three stages in Arabidopsis (Figure 1D). The mRNA levels of these six genes were consisting with the protein level in both rice and Arabidopsis (Figure 1E). These findings suggest that chloroplast functional protein complexes gradually mature from cpt, icl to fcl during the early growth and development of rice. Photosynthesis of the chloroplast is also gradually improved in the above three stages. Conversely, chloroplast proteins and photosynthesis are fully developed at the cot stage of Arabidopsis. More importantly, we can use RNA expression levels to represent the protein expression levels of photosynthetic genes because their expression trend was consistent.
Sequencing Data Summary
To systematically study the differences between rice and Arabidopsis during early growth and development, gene expression analyses were conducted by high-throughput RNA sequencing. A total of 54.9 Gb raw data for rice and 67.9 Gb raw data for Arabidopsis were generated. RNA sequencing was performed on the HiSeq X Ten platform, and we detected the expression of mRNA in all tissue samples. In detail, 91,035,666, 90,874,512, and 95,792,352 raw reads were obtained for rice coleoptile (cpt-A, B, and C, respectively); 86,766,618 and 95,714,768 raw reads were obtained for rice incomplete leaf (icl-A and B, respectively); and 88,510,902, 115,795,104, and 85,884,202 raw reads were obtained for rice first complete leaf (fcl-A, B, and C, respectively). A total of 99,311,856, 100,974.596, and 100,566,922 raw reads were obtained for Arabidopsis cotyledon (cot-A, B, and C, respectively); 85,926,770, 92,493,690, and 98,031,334 raw reads were obtained for Arabidopsis first true leaf (ftl-A, B, and C, respectively); 89,000,366, 125,026,956, and 98,057,458 raw reads were obtained for Arabidopsis second true leaf (stl-A, B, and C, respectively). The results of the RNA-Seq reads mapped on the reference are shown in Supplementary Table S2. Of the 17 library alignments, the unique mapping rate between clean reads and the reference genome was 93.96–98.83% (Supplementary Table S3). The three functional elements of gene proportions were 55.00–75.31% exon, 5.83–8.19% intron, 18.86–37.50% intergenic in rice and 83.98–88.71% exon, 4.76–8.39% intron, and 6.52–7.43% intergenic in Arabidopsis (Supplementary Table S4), and the normalized FPKM values for all mRNAs and lncRNAs in all samples are shown in Supplementary Table S5. The above results show that our sequencing results were reliable.
Identification and Characterization of mRNA and Novel lncRNA in Rice and Arabidopsis
In the present study, we collected three typical growth periods of rice and Arabidopsis. Using high-throughput sequencing, DEGs were detected in the leaves at different developmental stages. To study the basic features of lncRNAs in rice and Arabidopsis, lncRNAs were identified and compared with mRNAs. As a result, 762 novel transcripts in rice and 219 novel transcripts in Arabidopsis were identified as novel lncRNAs (Figure 2A). In addition, 43,148 known transcripts in rice and 25,481 in Arabidopsis were identified as mRNAs. Most identified lncRNAs had only 2–3 exons in their transcripts in both rice (Supplementary Figures S1A,E) and Arabidopsis (Supplementary Figures S1B,F) compared with mRNAs. The number of lncRNAs was notably small compared to the number of coding RNAs with 200–2000 nt transcripts in both rice (Supplementary Figures S1C,G) and Arabidopsis (Supplementary Figures S1D,H) compared with mRNAs. According to RNA-seq, the significantly dysregulated mRNAs and lncRNAs (|log2 ratio| ≥ 1, q < 0.05), including upregulated and downregulated mRNAs and lncRNAs, were found to be differentially expressed between icl-vs.-cpt and fcl-vs.-icl in rice; ftl-vs.-cot, stl-vs.-ftl in Arabidopsis. In rice, we identified 10396 and 3834 DEGs in the icl-vs.-cpt and fcl-vs.-icl comparisons, respectively (Figure 2B). Similarly, 2519 and 3166 DEGs were identified in the ftl-vs.-cot and stl-vs.-ftl comparisons, respectively, in Arabidopsis (Figure 2B). Similarly, according to RNA-seq, the significantly upregulated and downregulated lncRNAs (| log2 ratio| ≥ 1, q < 0.05) were found to be differentially expressed between the different stages (Supplementary Figure S2). In rice, we identified 641 and 638 DEGs in the icl-vs.-cpt and fcl-vs.-icl comparisons, respectively (Figures 2C,D). Similarly, 177 and 183 DEGs were identified in the ftl-vs.-cot and stl-vs.-ftl comparisons (Figures 2E,F), respectively, in Arabidopsis.
Figure 2. Differentially expressed unigenes and corresponding genes in each early growth developmental stage. (A) Three software CPC, CNCI, CPAT and a protein database pfam were used to predict lncRNAs, and the transcript was determined when the four methods were consistent in rice and Arabidopsis. (B) Number of differentially expressed genes obtained by comparisons between different stages and different lncRNAs in different stages. The genes were selected with “| log2 ratio| ≥ 1” and “q < 0.05.” (C–F) Volcanic map of differentially expressed genes in rice icl-vs.-cpt (C), fcl-vs.-icl (D) and in Arabidopsis ftl-vs.-cot (E) and stl-vs.-ftl (F). Volcano plot of the log2 fold change of gene expression levels (X-axis) against the –log10 P-value (Y-axis) comparing the two stages in rice and Arabidopsis. Splashes represent different genes. Gray splashes means genes without significant different expression. Orange splashes mean significantly upregulated genes. Green splashes mean significantly downregulated genes. The genes were selected with “| log2 ratio| ≥ 1” and “q < 0.05.”
GO Analysis of Differential Expressed mRNAs and Novel lncRNAs in Rice and Arabidopsis
To better understand the developmental pattern of the photosynthetic characteristics of early growth and development in rice and Arabidopsis, GO cluster analysis was conducted to further explore the biological processes enriched by the candidate mRNAs and lncRNAs. In rice, GO analysis for mRNAs revealed up- and downregulated genes in biological pathways, and “photosynthesis,” “photosynthesis, light reaction,” “photosynthetic electron transport in PSI,” and “photosynthesis, light harvesting in PSI” were clearly enriched in icl-vs.-cpt. “Photosynthesis,” “photosynthesis, dark reaction,” “photosynthesis, light reaction,” and “photosynthesis, light harvesting” were also clearly enriched in fcl-vs.-icl. In contrast, in Arabidopsis, no photosynthesis-related biological pathways were enriched in the ftl-vs.-cot and stl-vs.-ftl comparisons. Similarly, GO analysis for lncRNAs revealed that up- and downregulated genes in biological pathways in rice were “photosynthesis,” “photosynthesis, light reaction,” “photosynthesis, light harvesting,” “phototransduction,” and “red, far-red light phototransduction,” which were clearly enriched in icl-vs.-cpt. “Photosynthesis,” “photosynthesis, light reaction,” “photorespiration,” “photosynthesis, light harvesting,” and “chlorophyll metabolic process” were also clearly enriched in fcl-vs.-icl. In Arabidopsis, similar to the mRNAs, no photosynthesis-related biological pathways were enriched in the ftl-vs.-cot and stl-vs.-ftl comparisons (Supplementary Table S6). This result suggests that chloroplast develop and mature gradually in the three stages of early growth and development in rice, unlike the fully developed chloroplast in cotyledon in Arabidopsis.
KEGG Analysis of Differential Expressed mRNAs and Novel lncRNAs in Rice and Arabidopsis
At the same time, KEGG pathway analysis showed that mRNAs in the network of photosynthesis were enriched in “Photosynthesis antenna proteins,” “Photosynthesis,” and “Starch and sucrose metabolism” in icl-vs.-cpt (Figure 3A). “Photosynthesis,” “starch and sucrose metabolism,” and “photosynthesis – antenna proteins” were also significantly enriched in fcl-vs.-icl (Figure 3B). Similar to the GO analysis, in Arabidopsis, no photosynthesis-related biological pathways were enriched in the ftl-vs.-cot and stl-vs.-ftl comparisons (Figures 3C,D). Interestingly, the “starch and sucrose metabolism” pathway was clearly enriched in the ftl-vs.-cot and stl-vs.-ftl comparisons (Figures 3C,D). Similarly, KEGG analysis for lncRNAs revealed that up- and downregulated photosynthesis-related genes in rice were “Photosynthesis – antenna proteins,” “Carbon fixation in photosynthetic organisms” in icl-vs.-cpt (Figure 4A). “Carbon fixation in photosynthetic organisms” and “porphyrin and chlorophyll metabolism” in fcl-vs.-icl (Figure 4B). Additionally, “Carbon metabolism” and “Citrate cycle (TCA cycle)” were clearly enriched in fcl-vs.-icl (Figure 4B). This finding suggests that the accumulation of photosynthesis products increased gradually from cpt to fcl. Similar to the GO analysis in Arabidopsis, no photosynthesis-related biological pathways were enriched in the ftl-vs.-cot and stl-vs.-ftl comparisons (Figures 4C,D).
Figure 3. The mRNAs KEGG classifications of predicted target genes in different comparisons. KEGG classification of the predicted target genes in different comparisons of icl-vs.-cpt (A) and fcl-vs.-icl (B) in rice. KEGG classification of the predicted target genes in different comparisons of ftl-vs.-cot (C) and stl-vs.-ftl (D) in Arabidopsis. In both (A,B), the y-axis shows the pathway category, and the x-axis shows the richness factor. A larger richness factor indicates greater enrichment. The size of the point represents the number of related DEGs., and the color of the bubble represents the range of q-values.
Figure 4. The lncRNAs KEGG classifications of predicted target genes in different comparisons. KEGG classification of the predicted target genes in different comparisons of icl-vs.-cpt (A) and fcl-vs.-icl (B) in rice. KEGG classification of the predicted target genes in different comparisons of ftl-vs.-cot (C) and stl-vs.-ftl (D) in Arabidopsis. In both (A,B), the y-axis shows the pathway category, and the x-axis shows the richness factor. A larger richness factor indicates greater enrichment. The size of the point represents the number of related DEGs, and the color of the bubble represents the range of q-values.
The KEGG analysis combined with GO analysis of DEGs indicated that there are great differences in the expression pattern of chloroplast-related genes and the function of chloroplast form integrity during early development between rice and Arabidopsis. Photosynthetic genes and functions develop and mature gradually in rice and fully develop when exposed to light in Arabidopsis in the three stages of early growth and development. More importantly, lncRNAs might play important roles in chloroplast development by regulating these coherent target genes. The difference in chloroplast development between rice and Arabidopsis is largely consistent with the mRNA level.
Different Patterns of Photosynthesis Metabolism in Early Development
To better describe the differences in gene expression at different stages between rice (from cpt to fcl) and Arabidopsis (from cot to stl). We studied DEGs related to photosynthesis in rice and Arabidopsis. The up-regulated genes in fcl compared to icl were distributed in PSII, PSI, cytochrome b6/f complex, photosynthetic electron transport and F-type ATPase (Figure 5). The functions of lncRNAs are executed on coding genes via cis- or trans-regulation. Different expression novel lncRNAs targeted photosynthetic genes in rice and Arabidopsis were showed in Figure 6 and Supplementary Table S7. As showed in Figure 6, one novel lncRNA was correlated with several mRNAs and one mRNA was co-expressed with more than one novel lncRNA, further supporting the notion of inter-regulatory relationships. The novel lncRNAs targeted photosynthetic genes also showed upregulated chloroplast-associated genes, including PsbA, PsbW, Psb27, PsaH, PsaO, PetC, PetF, PetH, ATPF1B, and ATPF1G in icl-vs.-cpt, and PsbA, Psb27, Psb28, PsaO, PetC, PetH, ATPF1B, ATPF1G, and ATPF1D in fcl-vs.-icl. In contrast, for Arabidopsis, KEGG pathway analysis of novel lncRNAs targeted photosynthetic genes showed no significant difference in chloroplast-associated genes in the three stages (Figure 7). To validate the RNA-Seq data, DEGs and differentially expressed mRNAs and lncRNAs related to photosynthesis were selected in rice and Arabidopsis at the three stages (Supplementary Table S8). The qRT-PCR expression patterns of mRNAs and lncRNAs genes were consistent with the sequencing results (Supplementary Figure S3). In addition, we performed qRT-PCR analysis of the genes mentioned in this paper, and further analysis of the correlation between RNA Seq and q-RT PCR results. As shown in Supplementary Figure S4 and Supplementary Table S9, the expression trends of all 17 genes from qRT-PCR and RNA-seq analyses including mRNAs and lncRNAs were largely consistent (Pearson’s correlation coefficient R2 = 0.83 in rice and 0.91 in Arabidopsis), indicating that our sequencing results were reliable.
Figure 5. The mRNAs KEGG classifications of photosynthetic genes in different comparisons. KEGG classification of the photosynthetic genes in different comparisons of icl-vs.-cpt and fcl-vs.-icl in rice, ftl-vs.-cot and stl-vs.-ftl in Arabidopsis. The change of expression amount was expressed by color change, the red color represented upregulated expression annotated to the KEGG Orthology (KO) system, the green color represented downregulated expression annotated to the KO system. The genes were selected with “| log2 ratio| ≥ 1” and “q < 0.05.”
Figure 6. The networks of novel lncRNAs and target mRNAs of DEGs related to photosynthesis. (A) Co-expression network analysis of novel lncRNAs and target mRNAs in rice. (B) Co-expression network analysis of novel lncRNAs and target mRNAs in Arabidopsis. The blue color indicated novel lncRNAs and the yellow color indicated target genes for novel lncRNAs.
Figure 7. The lncRNAs KEGG classifications of photosynthetic genes in different comparisons. KEGG classification of the photosynthetic genes in different comparisons of icl-vs.-cpt and fcl-vs.-icl in rice, ftl-vs.-cot, and stl-vs.-ftl in Arabidopsis. The change of expression amount was expressed by color change, the red color represented upregulated expression annotated to the KEGG Orthology (KO) system, the green color represented downregulated expression annotated to the KO system, blue color indicates that the target gene annotated to the KO system is not significantly up- and downregulated. The genes were selected with “| log2 ratio| ≥ 1” and “q < 0.05.”
The light-harvesting chlorophyll protein complex (LHC) is also enriched in mRNA and lncRNA DEGs in rice. For icl-vs.-cpt, LHCA2, LHCA5, LHCB1, LHCB2, LHCB4, LHCB5, and LHCB6 were upregulated by KEGG in both mRNA and lncRNA DEGs (Figures 5, 7). For fcl-vs.-icl, LHCA2, LHCA5, LHCB1, LHCB2, LHCB4, and LHCB5 were enriched in mRNA; LHCA5, LHCB2, LHCB4, and LHCB5 were upregulated in lncRNA (Figures 5, 7). In contrast, there was no significant difference in LHC genes in ftl-vs.-cot in either mRNA or lncRNA DEGs (Figures 5, 7); LHCB1 was upregulated and LHCB4 was downregulated in mRNA DEGs for stl-vs.-ftl, and there was no significant difference in LHC genes in lncRNAs for stl-vs.-ftl (Figures 5, 7).
Different Regulatory Styles of Assembly Factors for PSI and PSII in Early Development
The assembly of core subunits of PSII is regulated by the expression of nuclear genes and transported to plastids, and PSI assembly factors are encoded by both plastid and nuclear genes (Yang et al., 2015). We found that the expression values of the assembly factors of PSI and PSII were different between rice and Arabidopsis (Figures 8A,B and Supplementary Table S10). We used four genes as examples to explore the similarities and differences in PSI and PSII assembly during early development in rice and Arabidopsis (Figures 8C,D). The mRNA level of FKBP20-2 was upregulated in icl-vs.-cpt and downregulated in fcl-vs.-icl in rice based on the sequencing results. In contrast, the expression levels were upregulated in cot-vs.-ftl and stl-vs.-ftl in Arabidopsis. To further visualize changes in the protein expression level, we used immunoblotting to examine the abundance of the assembly factors of three stages in rice and Arabidopsis. The results showed upregulation in icl-vs.-cpt and downregulation in fcl-vs.-icl in rice with no significant difference in cot-vs.-ftl and stl-vs.-ftl in Arabidopsis. For FKBP20-2, the RNA expression levels were consistent with the protein expression levels in rice for three stages but did not appear in Arabidopsis. The mRNA levels of LQY1 were upregulated in icl-vs.-cpt and fcl-vs.-icl in rice, upregulated in cot-vs.-ftl and downregulated in stl-vs.-ftl in Arabidopsis. The immunoblots showed upregulation in icl-vs.-cpt and fcl-vs.-icl in rice, downregulation in cot-vs.-ftl and upregulation in stl-vs.-ftl in Arabidopsis. For LQY1, the RNA expression levels were consistent with the protein expression levels in rice and for three stages but did not appear in Arabidopsis. The mRNA levels of TLP18.3 were upregulated in icl-vs.-cpt and fcl-vs.-icl in rice, downregulated in cot-vs.-ftl and upregulated in stl-vs.-ftl in Arabidopsis. The immunoblots showed downregulation in icl-vs.-cpt and fcl-vs.-icl in rice and upregulation in cot-vs.-ftl and stl-vs.-ftl in Arabidopsis. For TLP18.3, the RNA expression levels were not consistent with protein expression levels in rice and for three stages, but they appeared in Arabidopsis. The mRNA level of Deg1 was not significantly different in icl-vs.-cpt and was upregulated in fcl-vs.-icl in rice with no significant difference in cot-vs.-ftl and upregulated in stl-vs.-ftl in Arabidopsis. The immunoblots showed downregulated icl-vs.-cpt and fcl-vs.-icl in rice, with no significant difference in cot-vs.-ftl and stl-vs.-ftl in Arabidopsis. For Deg1, the RNA expression levels were not consistent with the protein expression levels in rice and Arabidopsis for three stages. Taken together, these results indicate that assembling the individual subunits to form PSII and PSI may not occur at the transcriptional level.
Figure 8. Analysis of PSII and PSI assembly factors proteins and genes, starch and sucrose metabolism pathways marker genes of rice and Arabidopsis in three stages. Heatmap of the expression of photosystems assembly factors in rice (A) and Arabidopsis (B) at different stages. The gene expression levels are shown in different colors indicated by the scale bar. (C) Analysis of photosystems assembly factor proteins in rice and Arabidopsis at different stages. 16 μg total proteins were separated by 12% SDS PAGE. And immunoblots were performed using antibodies as indicated. CBS, Coomassie Blue Staining. (D) The density ratio levels of photosystems assembly factor proteins in (C) and RNA-seq-based gene expression values (FPKM) of relavent genes in rice and Arabidopsis. Data was presented as the means ± SD of three biological replicates. a, b, and c indicate significant differences between the three stages according to Tukey’s test. (E) The relative expression levels of starch and sucrose metabolism pathways marker genes measured by real-time RT-PCR and FPKM of relavent genes in rice and Arabidopsis. Data was presented as means ± SD of three biological replicates. a, b, and c indicate significant differences between the three stages according to Tukey’s test.
Different Patterns of Starch and Sucrose Metabolism in Early Development
Starch is a storage carbohydrate widely synthesized in plants. Starch serves as an important store of carbon that fuels plant metabolism and growth when they are unable to photosynthesize. The “starch and sucrose metabolism” terms were all clearly enriched among the DEGs in rice and Arabidopsis. To test the similarities and differences in expression patterns, four β-glucosidase (EC:18.104.22.168) genes, α-amylase (EC:22.214.171.124) gene and β-amylase (EC:126.96.36.199) gene were analyzed. Similarly, the qRT-PCR expression patterns of mRNA and lncRNA genes were consistent with the sequencing results (Figure 8E). These results indicate that although DEGs of starch and sucrose metabolism pathways are expressed both in rice and Arabidopsis, there are significant differences in the expression trend of key genes between the two species.
In this study, significant advances have been made in understanding the photosynthetic gene functions from coleoptile, incomplete leaf to first complete leaf in rice, from cotyledon, first true leaf to second true leaf in Arabidopsis. Many protein coding genes have been identified by traditional genetic and molecular analysis combined with recent genomic studies, which can act as positive or negative regulators of photomorphogenesis in seedlings under different light conditions (Jiao et al., 2007; Chen and Chory, 2011; Li et al., 2012). lncRNAs are an important class of non-coding RNAs (ncRNAs) which have been proved to have multiple functions in eukaryotes recently (Wang and Chang, 2011; Guttman and Rinn, 2012; Geisler and Coller, 2013). lncRNAs have been systematically identified or predicted in Arabidopsis, rice, wheat, and maize (De Lucia and Dean, 2011; Kim and Sung, 2012; Zhang and Chen, 2013). The non-coding RNA HIDDEN TREASURE 1 (HID1) is a positive regulator of photomorphogenesis (Wang et al., 2014a). lncRNAs can regulate photoperiod-sensitive male sterility (PSMS) in rice (Ding et al., 2012). Chloroplasts are green plastids found in land plants, algae, and some protists. Chloroplasts are the only site of photosynthesis in these cells. Therefore, chloroplasts are responsible for most of the world’s primary productivity (Jarvis, 2011). In our study, chlorophyll fluorescence (Fv/Fm) from icl, cpt to fcl increased progressively in rice, indicating that the development of rice chloroplasts matured, in turn, from coleoptile to true leaf. In addition, western blotting also indicated that the development and maturation process of rice chloroplasts exhibits gradual maturation. In contrast, in Arabidopsis, the Fv/Fm ratio from cot, first true leaf to second true leaf was almost unchanged, and proteins analysis showed the same result. Therefore, there were significant differences in the early development of chloroplasts between rice and Arabidopsis.
In this study, we investigated mRNA-dependent mRNA-lncRNA interactions in different stages for early growth and development using bioinformatics approaches. Chloroplast is an essential semiautonomous organelle, which plays an important role in photosynthesis, carbon fixation, oxygen production, energy conversion, starch production, amino acid synthesis, fatty acid synthesis, pigment synthesis, hormone synthesis and sulfur and nitrogen metabolism (Neuhaus and Emes, 2000; Jarvis and Lopez-Juez, 2013; Xu et al., 2016; Wang et al., 2018). For rice, it is essential to know the development of photosynthetic genes. The gene expression of rice coleoptile and incomplete chloroplasts was significantly lower than that of the first complete leaf. This finding indicates that the photosynthetic genes are fully developed during the first complete development period. In contrast, in Arabidopsis, the expression of photosynthetic genes in cotyledons was similar to that in true leaves. This finding indicates that the photosynthetic genes have been developed in the cotyledon stage. In our study, we found that the function of the first complete leaf of rice is similar to that of the cotyledon of Arabidopsis in chloroplast development.
GO cluster analysis of rice showed that in the three stages of early development growth and development, the DEGs in biological pathways were mainly related to photosynthesis both in mRNA and lncRNA. Additionally, KEGG pathway analysis showed that mRNAs in the photosynthesis network were enriched in “Photosynthesis antenna proteins,” “Photosynthesis,” and “Starch and sucrose metabolism” in icl-vs.-cpt. “Photosynthesis,” “Starch and sucrose metabolism,” and “Photosynthesis antenna proteins” in fcl-vs.-icl. In Arabidopsis, no photosynthesis-related biological pathways were enriched in the ftl-vs.-cot and stl-vs.-ftl comparisons. Interestingly, the “starch and sucrose metabolism” pathway was clearly enriched in the ftl-vs.-cot and stl-vs.-ftl systems. KEGG analysis of lncRNAs revealed that up- and downregulated photosynthesis-related genes in rice were “Photosynthesis – antenna proteins,” “Carbon fixation in photosynthetic organisms” in icl-vs.-cpt. “Carbon fixation in photosynthetic organisms” and “porphyrin and chlorophyll metabolism” in fcl-vs.- icl (Figure 4B). Additionally, “Carbon metabolism” and “Citrate cycle (TCA cycle)” were clearly enriched in fcl-vs.-icl. In Arabidopsis, no photosynthesis-related biological pathways were enriched in the ftl-vs.-cot and stl-vs.-ftl comparisons.
The main DEGs were chloroplast development and the carbon cycle pathway in early rice growth and development. Sucrose and starch contents due to alterations of the allocation pattern of photosynthetic fixed carbon (Higuchi et al., 2015; Pilkington et al., 2015). Temporary starch exists in cells with photosynthetic capacity, synthesized in the chloroplast during the day, and degraded at night to provide carbon for non-photosynthetic metabolism (Lunn, 2007). Leaf starch and sucrose allocation is a well-regulated process in carbohydrate metabolism. The amount of starch synthesis seems to be in harmony with the length of the dark period (Cordenunsi-Lysenko et al., 2019). Starch-sucrose distribution is affected by plant development and sink activity, as well as environmental factors such as light intensity, temperature and photoperiod length (Pilkington et al., 2015). This finding indicates the involvement of lncRNAs in the regulation of chloroplast-related biological processes in rice. In Arabidopsis, the KEGG analysis of differentially expressed mRNAs revealed that the main enrichments in ftl-vs.-cot were phenylpropanoid biosynthesis, starch and sucrose metabolism, phenylalanine metabolism and cutin, suberine and wax biosynthesis; phenylpropanoid biosynthesis, flavonoid biosynthesis, circadian rhythm-plant and phenylalanine metabolism in stl-vs.-ftl. The KEGG analysis of differentially expressed lncRNAs showed that the main enrichments were ribosome in ftl-vs.-cot and glycosylphosphatidylinositol (GPI)-anchor biosynthesis in stl-vs.-ftl. This finding suggests that lncRNAs may not participate in chloroplast development and related biological processes in Arabidopsis. These results indicate that there are important differences between monocotyledon rice and dicotyledon Arabidopsis in early growth and development. In addition, we found that the plant could not grow without supplied nutrients for Arabidopsis (Figure 1), but rice grew normally under these conditions. It has been proven that there are differences between rice and Arabidopsis in early growth and development.
The chloroplast of eukaryotic cells is developed from an ancient endosymbiont related to cyanobacteria, which gives plants the ability of oxygenic photosynthesis (Gould et al., 2008; Nickelsen and Rengstl, 2013). The chloroplast is a semiautonomous organelle. The chloroplast has retained some genes, and different subunits of organellar protein complexes are derived from different genetic systems. PSII assembly factors are all encoded by the nucleus and transported to chloroplasts, and PSI assembly factors are encoded by nuclear genes and photosynthetic genes. From the mRNA sequencing data, we concluded that the assembly processes of PSI and PSII are not exactly the same. The expression trends of the four assembly genes for PSI and PSII differed from those of rice and Arabidopsis. FKBP20-2 participates in the assembly of the PSII supercomplexes in the chloroplast thylakoid lumen (Lima et al., 2006). LQY1 protein is involved in PSII photo-repair and reassembly of PSII complexes (Lu et al., 2011). TLP18.3 protein was involved in PSII regulating repair, turnover of damaged D1 and reassembly of PSII (Sirpio et al., 2007). Deg1 assists the assembly of the PSII complex through interaction with the PSII reaction center D2 protein (Sun et al., 2010). Together with the western blot results, FKB20-2 and LQY1 had the same expression pattern, but TLP18.3 and DEG1 had the opposite expression pattern of the three stages in rice. Similarly, in Arabidopsis, except for DEG1, FKB20-2, LQY1, and TLP18.3 had the opposite expression pattern of the three stages. The qRT–PCR and western blot results indicated that the PSI and PSII assembly were not correlated with its transcription level in the two species. Moreover, different expression levels of assembly factors in rice and Arabidopsis lead to different functions of PSI and PSII.
We take starch and sucrose metabolism as an example. The four β-glucosidase (EC:188.8.131.52) genes, α-amylase (EC:184.108.40.206) gene and β-amylase (EC:220.127.116.11) gene were analyzed. BGLU25/BGLU23 represents an independent class of six thioglucoside glucohydrolases (Nakano et al., 2017), BGLU8 encodes a putative beta glucosidase (Wang et al., 2017), BGLU26 encodes an atypical myrosinase (Matern et al., 2019), BGLU11 encodes beta glucosidase 11. BAMY1 is involved in diurnal starch degradation in guard cells (Zanella et al., 2016), AMY2/BAMY2 was a mixed-function OSC capable of synthesizing both β-amyrin and lupeol (Iturbe-Ormaetxe et al., 2003). In rice, β-glucosidase (EC:18.104.22.168), as a carbohydrate with biological functions, participates in energy acquisition, cell wall extension and signal transduction through selective hydrolysis of glycoside bonds in plants and plays an important role in the defense of pathogens (Davies and Henrissat, 1995; Mizutani et al., 2002; Barleben et al., 2005; Lee et al., 2006; Morant et al., 2008); alpha-amylase (EC:22.214.171.124) acts on the alpha-1,4-glycoside bond of starch but cannot cut the alpha-1,6-glycoside bond at the branch of amylopectin, thereby hydrolyzing the substrate (Wang et al., 2006); glucan 1,3-β-glucosidase (EC:126.96.36.199) is a growth-specific cell wall polysaccharide that exists in the endosperm of members of the Gramineae family (grasses) and commercially important grains (such as barley, rice, and wheat) among the main components of the cell wall (Hwang du et al., 2007). Those biological processes were significantly downregulated; endoglucanase (EC:188.8.131.52) hydrolyzed cellulose randomly in amorphous regions, producing oligosaccharides of different lengths, cellobiose, and glucose (Klippel et al., 2019), and showed significant upregulation in icl-vs.-cpt; in addition, no significant difference in fcl-vs.-icl. In Arabidopsis, the main upregulated gene is β-glucosidase (EC:184.108.40.206), and the downregulated gene is α-amylase (EC:220.127.116.11) in ftl-vs.-cot. The upregulated gene is β-glucosidase (EC:18.104.22.168), and the downregulated gene is β-amylase (EC:22.214.171.124), which hydrolyzes every other α-1,4-glycoside bond from the non-reductive end of starch substrate once, and the hydrolysate is maltose unit (Wang et al., 2006) in stl-vs.-ftl. In addition, no lncRNA DEGs showed significant differences in the three stages of the starch and sucrose metabolism pathways.
mRNA and lncRNAs sequencing revealed that photosynthesis was gradually established from coleoptile, incomplete leaf to complete leaf in rice and was fully functional in cotyledon of Arabidopsis. In summary, we examined the gene expression patterns of mRNAs and lncRNAs at different developmental stages of early growth and development in rice and Arabidopsis. In our study, functional modes were revealed for mRNAs and lncRNAs, the main DEGs enriched in multiple biological processes, such as photosynthetic genes and photosynthetic carbon assimilation genes. We found that the chloroplast protein from the coleoptile, incomplete leaf and first complete leaf gradually matured in rice. In contrast, in Arabidopsis, the maturation of chloroplast protein was immediately complete at the cotyledon stage. The results showed that monocotyledons and dicotyledons (at least in rice and Arabidopsis) differed greatly in photosynthetic mechanisms. In conclusion, our studies revealed whole genome expression profiles of mRNAs and lncRNAs related to early growth and development in rice and Arabidopsis, thereby providing important evidence for further research on the molecular mechanisms of early growth and development.
Data Availability Statement
The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: https://www. ncbi.nlm.nih.gov/, SRR10907568–SRR10907573; https://www.ncbi.nlm.nih.gov/, SRR10913120-SRR10913123; https://www.ncbi. nlm.nih.gov/, SRR10913307–SRR10913312; https://www.ncbi.nlm.nih.gov/, SRR10914245–SRR10914250; https://www.ncbi.nlm.nih.gov/, SRR10914757–SRR10914762; https://www.ncbi.nlm.nih.gov/, SRR10915176–SRR10915181.
YS and XH designed the research and wrote the manuscript. YS and JC performed the research. All authors analyzed the data and read and approved the manuscript.
This work was supported by National Key Research and Development Program (2016YFD0100604).
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2020.565006/full#supplementary-material
FIGURE S1 | The qualitative analysis of novel lncRNAs and the relationship about lncRNAs and mRNAs. The qualitative analysis of novel lncRNAs and the relationship about lncRNAs and mRNAs. Novel lncRNA exon distribution in rice (A) and Arabidopsis (B); Novel lncRNA length distribution in rice (C) and Arabidopsis (D); Exon number contrast of lncRNA vs. mRNA in rice (E) and Arabidopsis (F); The length contrast of lncRNA vs. mRNA in rice (G) and Arabidopsis (H).
FIGURE S2 | Cluster map of differential genes rice and Arabidopsis. Cluster map of differential genes rice mRNAs (A), rice lncRNAs (B), Arabidopsis mRNAs (C), and Arabidopsis lncRNAs (D) The change of expression amount was expressed by color change, the yellow color represented higher expression, and the blue color represented lower expression.
FIGURE S3 | The RNA-seq data and qRT-PCR validation of lncRNAs and their target protein-coding genes. The relative expression levels and RNA-seq-based gene expression values (FPKM) of six lncRNAs and their target protein-coding genes were shown. A lncRNA can have one or more target protein-coding genes. The bars denote the standard deviation.
FIGURE S4 | Verification of the transcriptome results through the experiments of qRT-PCR. The relationships between qRT-PCR and RNA-seq. Values were the log2 ratio (FPKM ratio by RNA-Seq) for genes in rice (A) and Arabidopsis (B). The determine coefficient (R2) was indicated in the figure. All qRT-PCR reactions were performed in three biological replicates.
TABLE S1 | The primers used for quantitative real-time PCR.
TABLE S2 | Summary of reads mapping to rice and Arabidopsis genome in three developmental stages.
TABLE S3 | Summary of reads quality of rice and Arabidopsis in three developmental stages.
TABLE S4 | Number and proportion of unique alignment sequences on three functional elements of gene (exon, intron and intergenic) in rice and Arabidopsis.
TABLE S5 | The normalized FPKM values for all lncRNAs and mRNAs in all samples in rice and Arabidopsis.
TABLE S6 | Gene ontology of biological pathways enriched in the transgenic lines based on the up- or down-regulated genes.
TABLE S7 | Trans and cis target photosynthetic genes of novel lncRNAs in rice and Arabidopsis.
TABLE S8 | The coherent target genes of differentially expressed lncRNAs in all comparisons.
TABLE S9 | The relationships between qRT-PCR analysis for validation of RNA seq data analysis in rice and Arabidopsis.
TABLE S10 | Summary of described PSII assembly factors in all samples in rice and Arabidopsis.
CNCI, Coding-Non-Coding Index; cot, cotyledon; CPAT, Coding Potential Assessment Tool; CPC, Coding Potential Calculator; cpt, coleoptile; DEGs, differentially expressed genes; fcl, first complete leaf; ftl, first true leaf; GO, Gene Ontology; icl, incomplete leaf; KEGG, Kyoto Encyclopedia of Genes and Genomes; KO, KEGG Orthology; lncRNA, long non-coding RNA; mRNA, messenger RNA; PS I, photosystem I; PS II, photosystem II; stl, second true leaf.
Barleben, L., Ma, X., Koepke, J., Peng, G., Michel, H., and Stockigt, J. (2005). Expression, purification, crystallization and preliminary X-ray analysis of strictosidine glucosidase, an enzyme initiating biosynthetic pathways to a unique diversity of indole alkaloid skeletons. Biochim. Biophys. Acta 1747, 89–92. doi: 10.1016/j.bbapap.2004.09.026
Baud, S., Dubreucq, B., Miquel, M., Rochat, C., and Lepiniec, L. (2008). Storage reserve accumulation in Arabidopsis: metabolic and developmental control of seed filling. Arabidopsis Book 6:e0113. doi: 10.1199/tab.0113
Bentsink, L., Alonso-Blanco, C., Vreugdenhil, D., Tesnier, K., Groot, S. P., and Koornneef, M. (2000). Genetic analysis of seed-soluble oligosaccharides in relation to seed storability of Arabidopsis. Plant Physiol. 124, 1595–1604.
Bhargava, A., Clabaugh, I., To, J. P., Maxwell, B. B., Chiang, Y. H., Schaller, G. E., et al. (2013). Identification of cytokinin-responsive genes using microarray meta-analysis and RNA-Seq in Arabidopsis. Plant Physiol. 162, 272–294. doi: 10.1104/pp.113.217026
Cordenunsi-Lysenko, B. R., Nascimento, J. R. O., Castro-Alves, V. C., Purgatto, E., Fabi, J. P., and Peroni-Okyta, F. H. G. (2019). The starch is (not) just another brick in the wall: the primary metabolism of sugars during banana ripening. Front. Plant Sci. 10:391. doi: 10.3389/fpls.2019.00391
Ding, J. H., Lu, Q., Ouyang, Y. D., Mao, H. L., Zhang, P. B., Yao, J. L., et al. (2012). A long noncoding RNA regulates photoperiod-sensitive male sterility, an essential component of hybrid rice. Proc. Natl. Acad. Sci. U.S.A. 109, 2654–2659. doi: 10.1073/pnas.1121374109
Higuchi, K., Kanai, M., Tsuchiya, M., Ishii, H., Shibuya, N., Fujita, N., et al. (2015). Common reed accumulates starch in its stem by metabolic adaptation under Cd stress conditions. Front. Plant Sci. 6:138. doi: 10.3389/fpls.2015.00138
Hou, X., Fu, A., Garcia, V. J., Buchanan, B. B., and Luan, S. (2015). PSB27: a thylakoid protein enabling Arabidopsis to adapt to changing light intensity. Proc. Natl. Acad. Sci. U.S.A. 112, 1613–1618. doi: 10.1073/pnas.1424040112
Iturbe-Ormaetxe, I., Haralampidis, K., Papadopoulou, K., and Osbourn, A. E. (2003). Molecular cloning and characterization of triterpene synthases from Medicago truncatula and Lotus japonicus. Plant Mol. Biol. 51, 731–743. doi: 10.1023/a:1022519709298
Klippel, B., Blank, S., Janzer, V. A., Piascheck, H., Moccand, C., Bel-Rhlid, R., et al. (2019). Characterization of a thermoactive endoglucanase isolated from a biogas plant metagenome. Extremophiles 23, 479–486. doi: 10.1007/s00792-019-01099-3
Kong, L., Zhang, Y., Ye, Z. Q., Liu, X. Q., Zhao, S. Q., Wei, L., et al. (2007). CPC: assess the protein-coding potential of transcripts using sequence features and support vector machine. Nucleic Acids Res. 35, W345–W349. doi: 10.1093/nar/gkm391
Lee, K. H., Piao, H. L., Kim, H. Y., Choi, S. M., Jiang, F., Hartung, W., et al. (2006). Activation of glucosidase via stress-induced polymerization rapidly increases active pools of abscisic acid. Cell 126, 1109–1120. doi: 10.1016/j.cell.2006.07.034
Leonova, S., Grimberg, A., Marttila, S., Stymne, S., and Carlsson, A. S. (2010). Mobilization of lipid reserves during germination of oat (Avena sativa L.), a cereal rich in endosperm oil. J. Exp. Bot. 61, 3089–3099. doi: 10.1093/jxb/erq141
Lima, A., Lima, S., Wong, J. H., Phillips, R. S., Buchanan, B. B., and Luan, S. (2006). A redox-active FKBP-type immunophilin functions in accumulation of the photosystem II supercomplex in Arabidopsis thaliana. Proc. Natl. Acad. Sci. U.S.A. 103, 12631–12636. doi: 10.1073/pnas.0605452103
Liu, F., Marquardt, S., Lister, C., Swiezewski, S., and Dean, C. (2010). Targeted 3’ processing of antisense transcripts triggers Arabidopsis FLC chromatin silencing. Science 327, 94–97. doi: 10.1126/science.1180278
Liu, H., Wang, R., Mao, B., Zhao, B., and Wang, J. (2019). Identification of lncRNAs involved in rice ovule development and female gametophyte abortion by genome-wide screening and functional analysis. BMC Genomics 20:90. doi: 10.1186/s12864-019-5442-6
Liu, J., Jung, C., Xu, J., Wang, H., Deng, S., Bernad, L., et al. (2012). Genome-wide analysis uncovers regulation of long intergenic noncoding RNAs in Arabidopsis. Plant Cell 24, 4333–4345. doi: 10.1105/tpc.112.102855
Liu, J., and Last, R. L. (2017). A chloroplast thylakoid lumen protein is required for proper photosynthetic acclimation of plants under fluctuating light environments. Proc. Natl. Acad. Sci. U.S.A. 114, E8110–E8117. doi: 10.1073/pnas.1712206114
Lu, T., Lu, G., Fan, D., Zhu, C., Li, W., Zhao, Q., et al. (2010). Function annotation of the rice transcriptome at single-nucleotide resolution by RNA-seq. Genome Res. 20, 1238–1249. doi: 10.1101/gr.106120.110
Lu, Y., Hall, D. A., and Last, R. L. (2011). A small zinc finger thylakoid protein plays a role in maintenance of photosystem II in Arabidopsis thaliana. Plant Cell 23, 1861–1875. doi: 10.1105/tpc.111.085456
Mansfield, S. G., and Briarty, L. G. (1996). The dynamics of seedling and cotyledon cell development in Arabidopsis thaliana during reserve mobilization. Int. J. Plant Sci. 157, 280–295. doi: 10.1086/297347
Mao, X. Z., Cai, T., Olyarchuk, J. G., and Wei, L. P. (2005). Automated genome annotation and pathway identification using the KEGG Orthology (KO) as a controlled vocabulary. Bioinformatics 21, 3787–3793. doi: 10.1093/bioinformatics/bti430
Matern, A., Bottcher, C., Eschen-Lippold, L., Westermann, B., Smolka, U., Doll, S., et al. (2019). A substrate of the ABC transporter PEN3 stimulates bacterial flagellin (flg22)-induced callose deposition in Arabidopsis thaliana. J. Biol. Chem. 294, 6857–6870. doi: 10.1074/jbc.RA119.007676
Mizutani, M., Nakanishi, H., Ema, J., Ma, S. J., Noguchi, E., Inohara-Ochiai, M., et al. (2002). Cloning of beta-primeverosidase from tea leaves, a key enzyme in tea aroma formation. Plant Physiol. 130, 2164–2176. doi: 10.1104/pp.102.011023
Morant, A. V., Jorgensen, K., Jorgensen, C., Paquette, S. M., Sanchez-Perez, R., Moller, B. L., et al. (2008). beta-glucosidases as detonators of plant chemical defense. Phytochemistry 69, 1795–1813. doi: 10.1016/j.phytochem.2008.03.006
Muranaka, A., Watanabe, S., Sakamoto, A., and Shimada, H. (2012). Arabidopsis cotyledon chloroplast biogenesis factor CYO1 uses glutathione as an electron donor and interacts with PSI (A1 and A2) and PSII (CP43 and CP47) subunits. J. Plant Physiol. 169, 1212–1215. doi: 10.1016/j.jplph.2012.04.001
Nakano, R. T., Pislewska-Bednarek, M., Yamada, K., Edger, P. P., Miyahara, M., Kondo, M., et al. (2017). PYK10 myrosinase reveals a functional coordination between endoplasmic reticulum bodies and glucosinolates in Arabidopsis thaliana. Plant J. 89, 204–220. doi: 10.1111/tpj.13377
Pang, P. P., Pruitt, R. E., and Meyerowitz, E. M. (1988). Molecular-cloning, genomic organization, expression and evolution of 12s-seed storage protein genes of Arabidopsis thaliana. Plant Mol. Biol. 11, 805–820.
Pilkington, S. M., Encke, B., Krohn, N., Hohne, M., Stitt, M., and Pyl, E. T. (2015). Relationship between starch degradation and carbon demand for maintenance and growth in Arabidopsis thaliana in different irradiance and temperature regimes. Plant Cell Environ. 38, 157–171. doi: 10.1111/pce.12381
Sela, A., Piskurewicz, U., Megies, C., Mene-Saffrane, L., Finazzi, G., and Lopez-Molina, L. (2020). Embryonic photosynthesis affects post-germination plant growth. Plant Physiol. 182, 2166–2181. doi: 10.1104/pp.20.00043
Shannon, P., Markiel, A., Ozier, O., Baliga, N. S., Wang, J. T., Ramage, D., et al. (2003). Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 13, 2498–2504. doi: 10.1101/gr.1239303
Shimada, H., Mochizuki, M., Ogura, K., Froehlich, J. E., Osteryoung, K. W., Shirano, Y., et al. (2007). Arabidopsis cotyledon-specific chloroplast biogenesis factor CYO1 is a protein disulfide isomerase. Plant Cell 19, 3157–3169. doi: 10.1105/tpc.107.051714
Sirpio, S., Allahverdiyeva, Y., Suorsa, M., Paakkarinen, V., Vainonen, J., Battchikova, N., et al. (2007). TLP18.3, a novel thylakoid lumen protein regulating photosystem II repair cycle. Biochem. J. 406, 415–425. doi: 10.1042/BJ20070460
Sun, L., Luo, H., Bu, D., Zhao, G., Yu, K., Zhang, C., et al. (2013). Utilizing sequence intrinsic composition to classify protein-coding and long non-coding transcripts. Nucleic Acids Res. 41, e166. doi: 10.1093/nar/gkt646
Sun, X., Ouyang, M., Guo, J., Ma, J., Lu, C., Adam, Z., et al. (2010). The thylakoid protease Deg1 is involved in photosystem-II assembly in Arabidopsis thaliana. Plant J. 62, 240–249. doi: 10.1111/j.1365-313X.2010.04140.x
Tian, Y., Zheng, H., Zhang, F., Wang, S., Ji, X., Xu, C., et al. (2019). PRC2 recruitment and H3K27me3 deposition at FLC require FCA binding of COOLAIR. Sci. Adv. 5:eaau7246. doi: 10.1126/sciadv.aau7246
Tominaga, J., Mizutani, H., Horikawa, D., Nakahara, Y., Takami, T., Sakamoto, W., et al. (2016). Rice CYO1, an ortholog of Arabidopsis thaliana cotyledon chloroplast biogenesis factor AtCYO1, is expressed in leaves and involved in photosynthetic performance. J. Plant Physiol. 207, 78–83. doi: 10.1016/j.jplph.2016.10.005
Wang, G. F., Wang, F., Wang, G., Wang, F., Zhang, X. W., Zhong, M. Y., et al. (2012). Opaque1 encodes a myosin XI motor protein that is required for endoplasmic reticulum motility and protein body formation in maize endosperm. Plant Cell 24, 3447–3462. doi: 10.1105/tpc.112.101360
Wang, N., Zhang, Y., Wang, Q., Liu, J., Wang, H., Xue, Y., et al. (2006). Gene cloning and characterization of a novel alpha-amylase from alkaliphilic Alkalimonas amylolytica. Biotechnol. J. 1, 1258–1265. doi: 10.1002/biot.200600098
Wang, Y., Fan, X., Lin, F., He, G., Terzaghi, W., Zhu, D., et al. (2014a). Arabidopsis noncoding RNA mediates control of photomorphogenesis by red light. Proc. Natl. Acad. Sci. U.S.A. 111, 10359–10364. doi: 10.1073/pnas.1409457111
Wang, Y., Wang, X., Deng, W., Fan, X., Liu, T. T., He, G., et al. (2014b). Genomic features and regulatory roles of intermediate-sized non-coding RNAs in Arabidopsis. Mol Plant 7, 514–527. doi: 10.1093/mp/sst177
Wang, Z. W., Lv, J., Xie, S. Z., Zhang, Y., Qiu, Z. N., Chen, P., et al. (2018). OsSLA4 encodes a pentatricopeptide repeat protein essential for early chloroplast development and seedling growth in rice. Plant Growth Regul. 84, 249–260. doi: 10.1007/s10725-017-0336-6
Xiong, Q., Ma, B., Lu, X., Huang, Y. H., He, S. J., Yang, C., et al. (2017). Ethylene-inhibited jasmonic acid biosynthesis promotes mesocotyl/coleoptile elongation of etiolated rice seedlings. Plant Cell 29, 1053–1072. doi: 10.1105/tpc.16.00981
Xu, X., Zhang, X. B., Shi, Y. F., Wang, H. M., Feng, B. H., Li, X. H., et al. (2016). A point mutation in an F-Box domain-containing protein is responsible for brown hull phenotype in rice. Rice Sci. 23, 1–8. doi: 10.1016/j.rsci.2016.01.001
Yuan, J., Zhang, Y., Dong, J., Sun, Y., Lim, B. L., Liu, D., et al. (2016). Systematic characterization of novel lncRNAs responding to phosphate starvation in Arabidopsis thaliana. BMC Genomics 17:655. doi: 10.1186/s12864-016-2929-2
Zanella, M., Borghi, G. L., Pirone, C., Thalmann, M., Pazmino, D., Costa, A., et al. (2016). beta-amylase 1 (BAM1) degrades transitory starch to sustain proline biosynthesis during drought stress. J. Exp. Bot. 67, 1819–1826. doi: 10.1093/jxb/erv572
Zhang, H. X., Zhou, D. W., Matthew, C., Wang, P., and Zheng, W. (2008). Photosynthetic contribution of cotyledons to early seedling development in Cynoglossum divaricatum and Amaranthus retroflexus. New. Z. J. Bot. 46, 39–48.
Zheng, W., Yang, J. Y., Luan, Z. H., Wang, P., Zhang, H. X., and Zhou, D. W. (2011). Compensatory growth and photosynthetic responses of Pharbitis purpurea seedlings to clipped cotyledon and second leaf. Photosynthetica 49, 21–28. doi: 10.1007/s11099-011-0004-4
Keywords: rice, Arabidopsis, messenger RNA, long non-coding RNA, cotyledon, true leaf, photosynthesis
Citation: Shi Y, Chen J and Hou X (2020) Similarities and Differences of Photosynthesis Establishment Related mRNAs and Novel lncRNAs in Early Seedlings (Coleoptile/Cotyledon vs. True Leaf) of Rice and Arabidopsis. Front. Genet. 11:565006. doi: 10.3389/fgene.2020.565006
Received: 23 May 2020; Accepted: 17 August 2020;
Published: 08 September 2020.
Edited by:Weike Duan, Huaiyin Institute of Technology, China
Reviewed by:Himanshu Sharma, National Agri-Food Biotechnology Institute, India
Vijay Gahlaut, University of Delhi, India
Copyright © 2020 Shi, Chen and Hou. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Xin Hou, firstname.lastname@example.org