ORIGINAL RESEARCH article

Front. Genet., 04 June 2024

Sec. Genomics of Plants and Plant-Associated Organisms

Volume 15 - 2024 | https://doi.org/10.3389/fgene.2024.1395805

Mitochondrial genome complexity in Stemona sessilifolia: nanopore sequencing reveals chloroplast gene transfer and DNA rearrangements

  • 1. School of Public Health, North China University of Science and Technology, Tangshan, China

  • 2. Public Health Department, Affiliated Hospital of Shandong University of Traditional Chinese Medicine, Jinan, China

  • 3. College of Life Science, North China University of Science and Technology, Tangshan, China

Article metrics

View details

7

Citations

2k

Views

905

Downloads

Abstract

Mitochondria are semi-autonomous organelles in eukaryotic cells with their own genome. Plant mitogenomes differ from animal mitogenomes in size, structure, and repetitive DNA sequences. Despite larger sizes, plant mitogenomes do not have significantly more genes. They exhibit diverse structures due to variations in size, repetitive DNA, recombination frequencies, low gene densities, and reduced nucleotide substitution rates. In this study, we analyzed the mitochondrial genome of Stemona sessilifolia using Nanopore and Illumina sequencing. De-novo assembly and annotation were conducted using Unicycler, Geseq, tRNAscan-SE and BLASTN, followed by codon usage, repeat sequence, RNA-editing, synteny, and phylogenetic analyses. S. sessilifolia’s mitogenome consisted of one linear contig and six circular contigs totaling 724,751 bp. It had 39 protein-coding genes, 27 tRNA genes, and 3 rRNA genes. Transfer of chloroplast sequences accounted for 13.14% of the mitogenome. Various analyses provided insights into genetic characteristics, evolutionary dynamics, and phylogenetic placement. Further investigations can explore transferred genes’ functions and RNA-editing’s role in mitochondrial gene expression in S. sessilifolia.

1 Introduction

Mitochondria are widely distributed semi-autonomous organelles within eukaryotic cells that harbor their own genome, known as the mitochondrial genome which consists of circular double-helix DNA (Saccone et al., 2000). Mitochondrial genome may also comprise other small circular chromosomes or sub-genomes (Wanrooij et al., 2012; Sloan, 2013). Previous study illustrated mitogenomes using circular maps, suggesting their physical presence as circular monomeric DNA molecules (Manchekar et al., 2006); however, researchers also demonstrated that the physical structure of mitochondrial genomes is not exclusively confined to a circular monomeric form (Menger et al., 2021). For instance, to examine the mitogenomic structure of by fluorescence microscopy technique, Kozik and his colleagues discovered that the mitochondrial genome of Lactuca sativa exhibited linear, branched, and circular configurations (Kozik et al., 2019).

Plant mitogenomes are larger and more complex in nature in comparison to animal mitogenomes, which typically span around 15–17 kb in length. Plant mitogenomes vary in size, ranging from 200 kb to 11 Mb in length, but do not harbor much more genes (Zardoya, 2020). The structure of mitochondrial genomes (mitogenomes) in flowering plants displays remarkable diversity, including variations in size, repetitive DNA sequences, recombination frequencies of extensive repeats, low gene densities, and reduced rates of nucleotide substitution (Rose, 2021; Maliga, 2022). The expansion of plant mitogenomes primarily stems from the accumulation of repetitive sequences and the incorporation of DNA derived from chloroplast and nuclear genomes (Petit et al., 2005; Greiner and Bock, 2013; Li et al., 2016; Zhang et al., 2020b; Tan et al., 2022). Consequently, these intricate genomes pose challenges for assembly using short paired-end read sequencing.

Stemona sessilifolia (Miq.) Miq., commonly known as Zhili-Baibu, is a highly esteemed and extensively used in traditional Chinese medicine (Chanmahasathien et al., 2011). Stemonae Radix, derived from the roots of three species: S. sessilifolia (Yang et al., 2009), S. japonica (Tan et al., 2023), and S. tuberosa (Fan et al., 2015), has been widely used for a considerable time due to its antitussive and insecticidal properties. For S. sessilifolia, the nuclear and chloroplast genomes of have not been thoroughly annotated, and its mitogenome remains unexamined.

In this study, we present the first assembled and annotated mitogenome of S. sessilifolia, employing a combination of Nanopore and Illumina sequencing technologies with S. sessilifolia as the genetic material. These findings offer further substantiation for the existence of diverse conformations within plant mitogenomes, while also providing valuable insights for future investigations into the phylogenetic status of S. sessilifolia.

2 Materials and methods

2.1 Materials and sequencing

In June 2023, fresh leaves of S. sessilifolia were collected from Moyun Mountain, located in Jinan City, Shandong Province (coordinates: N 36°20′31.0308″, E 117°54′43.4772″). S. sessilifolia is not classified as an endangered or protected species, thus no specific permission was required for the collection. All samples were thoroughly rinsed, cleaned using DEPC water, and subsequently stored at −80°C. Total DNA was extracted using TIANamp Genomic DNA Kit (Tiangen, Beijing, China). To obtain comprehensive data, we sequenced the mitogenome of S. sessilifolia on both Nanopore GridION sequencing platform (Oxford Nanopore Technology, Oxford Science Park) and Illumina Novaseq 6000 platform (Illumina, San Diego, USA), which enabling the construction of libraries and the generation of raw data (Nanopore raw data: 23.69 Gb, Illumina raw data: 25.94 Gb). The data reported in this paper have been deposited in the GenBank of NCBI (Benson et al., 2013), under accession number PP692484, PP692485, PP692486, PP692487, PP692488, PP692489 and PP692490 that were publicly accessible at https://www.ncbi.nlm.nih.gov/genbank/.

2.2 Assembly and annotation of organelle genomes

The S. sessilifolia mitogenome was assembled using a comprehensive approach combining Illumina and Nanopore sequencing technologies. Initially, we employed Flye (Kolmogorov et al., 2019) software to conduct de novo assembly of long reads derived from S. sessilifolia obtained through Oxford Nanopore sequencing. Subsequently, the BLASTn (Chen et al., 2015) was utilized to identify the draft mitogenome of S. sessilifolia by comparing the assembled contigs. To facilitate this process, we created a database for the assembled sequences using makeblastdb and chose conserved mitochondrial genes from Arabidopsis thaliana (L.) Heynh. as our query sequence to pinpoint contigs that contain these conserved mitochondrial genes. The commonly parameters used for this assembly included '-evalue 1e-5 -outfmt 6 -max_hsps 10 -word_size 7 -task blastn-short’. Additionally, we conducted a hybrid assembly using Unicycler, intergrating both Illumina short reads and Nanopore long reads (Wick et al., 2017), while applying the default parameters. For the annotation of protein-coding genes (PCGs) in the mitogenome, we selected A. thaliana (NC_037304) and Liriodendron tulipifera (NC_021152.1) as reference genomes, using Geseq for the annotation process (Tillich et al., 2017). Annotation of tRNA and rRNA within the mitogenome was accomplished using tRNAscan-SE (Chan et al., 2021) and BLASTn (Chen et al., 2015), respectively. Manual correction of annotation errors in the mitogenome was performed using Apollo (Lewis et al., 2002).

2.3 Analysis of codon usage and repeated sequences

Protein-coding gene (PCG) sequences were extracted from the genome using Phylosuite (Zhang et al., 2020a), and the codon usage in mitochondrial PCGs was analyzed using Mega 7.0 (Kumar et al., 2016) and relative synonymous codon usage (RSCU) values were also calculated. To identify repeated sequences, including simple sequence repeats (SSRs), tandem repeats, and interspersed repeats, MISA (Beier et al., 2017) (https://webblast.ipk-gatersleben.de/misa/), TRF (Benson, 1999) (https://tandem.bu.edu/trf/trf.unix.help.html), and REPuter (Kurtz et al., 2001) (https://bibiserv.cebitec.uni-bielefeld.de/reputer/) were employed. The results were visualized using the RCircos (Zhang et al., 2013) package.

2.4 Prediction of RNA editing sites

Deepred-mt (Edera et al., 2021), a tool based on the convolutional neural network (CNN) model, was utilized for predicting C to U RNA editing sites. Mitochondrial protein-coding genes were extracted for prediction analysis, and only results with probability values exceeding 0.9 were selected for further consideration.

2.5 Chloroplast to mitochondrion DNA transformation

The chloroplast genome was assembled and annotated using GetOrganelle (Jin et al., 2020), and CPGAVAS2, respectively (Shi et al., 2019). The BLASTn (Chen et al., 2015) program was utilized to compare two organelle genomes of S. sessilifolia. In this process, the mitogenome was established as the database with makeblastdb, and the chloroplast genome was employed as the query sequence. All results were visualized using the Circos (Krzywinski et al., 2009) package.

2.6 Phylogenetic inference

Related species of S. sessilifolia were selected based on their genetic relationship, and their complete mitogenome sequences were downloaded from NCBI (https://www.ncbi.nlm.nih.gov) (Supplementary Table S3). PhyloSuite (Zhang et al., 2020a) was utilized to extract shared mitochondrial genes across these species. Multiple sequences alignment was carried out using MAFFT (Katoh et al., 2002; Katoh et al., 2019) with a bootstrap value of 1000. IQ-TREE (Minh et al., 2020) was used for phylogenetic analysis. The resulting phylogenetic analysis was visualized using iTOL (Letunic and Bork, 2021).

2.7 Synteny analysis

Using the BLASTn, we identified conserved homologous sequences, which are referred to as co-linear blocks, with commonly parameters '-value 1e-5, -word_size 9, -gapopen 5, -gapextend 2, -reward 2, -penalty -3' (Shan et al., 2023). As a result, only co-linear blocks longer than 500 bp were considered. Based on sequence similarity. The mitochondrial genome of S. sessilifolia was compared with multiple synteny regions from closely related species using MCscanX (Stotter et al., 1990).

3 Results

3.1 Characteristics of the mitochondrial genomes of S. sessilifolia

The mitochondrial genome of S. sessilifolia displays a branched structure (Supplementary Figure S1, Supplementary Data S1). After eliminating duplicated regions from the Nanopore data, we identified seven contigs, comprising one linear contig and six circular contigs. The total length of the mitogenome is 724,751 bp, with a GC content of 44.72% (Supplementary Table S1). We conducted annotation of the S. sessilifolia mitochondrial genome and identified a total of 39 protein-coding genes. Among these, 24 are unique mitochondrial core genes, while the remaining 15 are non-core genes. Additionally, there are 27 tRNA genes, with 11 of them being present in multiple copies, and 3 rRNA genes. The core genes consist of 5 ATP synthase genes (atp1, atp4, atp6, atp8, and atp9), 9 NADH dehydrogenase genes (nad1, nad2, nad3, nad4, nad4L, nad5, nad6, nad7, and nad9), 4 cytochrome C biogenesis genes (ccmB, ccmC, ccmFC, and ccmFN), 3 cytochrome C oxidase genes (cox1, cox2, and cox3), 1 protein transport subunit gene (mttB), 1 maturases gene (matR), and 1 ubiquinol-cytochrome C reductase gene (cob). Additionally, the non-core genes include 3 ribosomal large subunit genes (rpl2, rpl5, rpl16) and 11 ribosomal small subunit genes (rps1, rps2, rps3, rps4, rps7, rps10, rps11, rps12, rps13, rps14, and rps19). There is also 1 succinate dehydrogenase gene (sdh4) (Table 1); (Figure 1). Interestingly, several plastid genes were also annotated in the mtDNA, albeit mostly as fragments. These include rps7, rpoC2, rps15, accD, et al. This finding suggests a notable sequence migration was observed between the chloroplast DNA (cpDNA) and mtDNA of S. sessilifolia, which was accompanied by gene transfer, which will be discussed in detail below. Figure 2A presents the cpDNA genome map.

TABLE 1

Group of genesName of genes
ATP synthaseatp1, atp4 (×2), atp6, atp8, atp9
NADH dehydrogenasenad1, nad2, nad3, nad4, nad4L, nad5, nad6 (×2), nad7, nad9 (×2)
Cytochrome bcob (×2)
Cytochrome c biogenesiscox1, cox2, cox3
MaturasesmatR
Protein transport subunitmttB
Ribosomal protein large subunitrpl2, rpl5, rpl16
Ribosomal protein small subunitrps1, rps2, rps3, rps4, rps7, rps10, rps11, rps12, rps13, rps14, rps19
Succinate dehydrogenasesdh4
Ribosome RNArrn5, rrn18, rrn26
Transfer RNAtrnC-GCA(×2), trnD-GUC, trnE-UUC(×3), trnF-GAA(×2), trnfM-CAU, trnH-GUG, trnI-CAU (×2), trnK-UUU(×2), trnL-CAA, trnL-UAA, trnL-UAG, trnM-CAU(×2), trnN-GUU(×2), trnP-GGG, trnP-UGG(×2), trnQ-UUG (×2), trnR-ACG, trnR-CCG, trnR-UCU, trnS-GCU (×2), trnS-GGA, trnS-UGA, trnT-CGU, trnT- UGU, trnV-GAC, trnW-CCA (×2), trnY-GUA

Gene composition in the mitogenome of S. sessilifolia.

FIGURE 1

FIGURE 2

3.2 Chloroplast to mitochondrion DNA transformation

We observed significant sequence transfers from the chloroplast genome to the mitogenome in S. sessilifolia. Through sequence similarity analysis, we identified 42 homologous fragments between the mitogenome and chloroplast genome, totaling 95,262 base pairs. This fragment accounted for 13.14% of the entire mitogenome. Subsequently, we extracted and annotated these homologous sequences. Most of these fragments migrated from cpDNA to mtDNA, except for a few tRNA genes that exhibited high sequence similarity (Supplementary Table S6, Supplementary Data S2), making it challenging to determine the direction of migration (Figure 2B). Consequently, we denoted these sequences as mitochondrial plastid sequences (MTPTs), and detailed sequence information of MTPTs can be found in Supplementary Data S3. A total of 19 MTPTs with lengths exceeding 1,000 bp were identified: MTPT6, MTPT7, MTPT8, MTPT9, MTPT10, MTPT11, MTPT12, MTPT13, MTPT14, MTPT15, MTPT17, MTPT19, MTPT20, MTPT21, MTPT22, MTPT23, MTPT24, MTPT26, and MTPT27. The mapping results were visualized using the Tablet software (Milne et al., 2013), where a representative long read spanning the MTPT region was highlighted in deep blue (Supplementary Data S4), with the longest gene being MTPT27 spanning 20,203 bp. The annotation of these homologous sequences revealed the presence of 71 complete genes across the 42 homologous fragments. These included 52 protein-coding genes (PCGs) and 19 tRNA genes (trnC-GCA, trnF-GAA, trnG-UCC, trnH-GUG, trnI-CAU, trnK-UUU, trnL-CAA, trnL-UAA, trnL-UAG, trnM-CAU, trnM-CAU, trnN-GUU, trnP-UGG, trnQ-UUG, trnS-GCU, trnS-GGA, trnT-UGU, trnV-UAC, trnW-CCA). For detailed information on the transferred genes, please refer to Supplementary Table S2.

3.3 The prediction of RNA editing

Using Deepred-mt, we identified a total of 639 potential RNA editing sites on 32 unique protein-coding genes (PCGs) in the mitogenome, with cutoff value of 0.9 as the criterion. Interestingly, all identified edits were C-U conversions. Among these mitochondrial genes, the gene nad4 displays the highest number of editing sites with 59 occurrences, followed closely by the ccmC gene with 40 editing sites (Figure 3).

FIGURE 3

3.4 Synteny analysis and phylogenetic inference

To investigate the synteny relationship between S. sessilifolia and closely related species, we utilized MCscanXto generate multiple synteny plots based on the sequence similarity. Figure 4 illustrates that the co-linear blocks exhibit varying arrangements across individual mitochondrial genomes. Although a substantial number of blocks were detected between S. sessilifolia and Pandanus odorifer and Spirodela polyrhiza, these co-linear blocks appeared to be shorter in length. Additionally, unique sequences specific to S. sessilifolia were also identified, which was lacking homology with other species. These findings indicate extensive genomic rearrangements within the S. sessilifolia mitogenome, leading to an exceptionally unconserved mitochondrial structure.

FIGURE 4

To further explore the evolutionary relationships of S. sessilifolia mitochondria, we constructed a phylogenetic tree using the DNA sequences of 24 conserved mitochondrial PCGs (atp1, atp4, atp6, atp8, atp9, ccmB, ccmC, ccmFC, ccmFN, cob, cox1, cox2, cox3, matR, mttB, nad1, nad2, nad3, nad, nad4L, nad5, nad6, nad7, nad9) from 35 species across 6 orders of angiosperms. Two mitochondrial genomes from Ranunculales species were designated as outgroups (Supplementary Table S3). The topology of the phylogenetic tree based on mitochondrial DNA aligned with the current classification of the Angiosperm Phylogeny Group (APG). S. sessilifolia was classified within the family Stemonaceae, under the order Pandanales, illustrating its close evolutionary relationship with P. odorifer (Figure 5).

FIGURE 5

3.5 PCGs codon usage analysis

We conducted a codon usage analysis of the 39 unique protein-coding genes (PCGs) in S. sessilifolia. The codon usage for each amino acid is presented in Supplementary Table S4. Relative synonymous codon usage (RSCU) values greater than 1 signify a preference for specific amino acids. Among the mitochondrial PCGs, apart from the start codon AUG (Met) and UGG (Trp), there was a noticeable preference for certain codons (Figure 6). For example, alanine (Ala) showed a higher preference for the codon GCU, evidenced by an RSCU value of 1.61, which was the highest observed. Similarly, histidine (His) exhibited a preference for the codon CAU, with an RSCU value of 1.53.

FIGURE 6

3.6 S. sessilifolia mitogenome repeats analysis

Microsatellites, also known as simple repeat sequences (SSRs), were analyzed to determine the presence of repeat sequences in the mitogenome. The results indicated that chromosomes 1-7 of S. sessilifolia contained 85, 56, 51, 34, 36, 33, and 40 SSRs, respectively. Monomeric polymers constituted the largest proportion in almost all chromosomes (Supplementary Table S5). Additionally, there were 15 and 13 tandem repeats in chromosomes 1 and 2, respectively, while other chromosomes had fewer than 5 tandem repeats. Dispersed repeats with a length greater than or equal to 30 bp were detected in each chromosome except for chromosome 6. Among these dispersed repeats, chromosomes 1 through 5 and 7 contained palindromic repeats in the following numbers: 35, 5, 2, 2, 1, and 1, respectively. Additionally, they contained forward repeats in the quantities of 36, 23, 10, 1, 4, and 5, respectively. Chromosomes 1 and 2 carried 3 reverse repeats, 3 complementary repeats, along with 1 additional complementary repeat. Overall, the mitogenome of S. sessilifolia included 335 SSRs, 135 pairs of dispersed repeats, and 43 tandem repeats. A circular diagram in Figure 7 illustrates the different types of repeat sequences.

FIGURE 7

4 Discussion

In this study, we conducted an extensive characterization of the mitochondrial genome of S. sessilifolia and investigated a range of genetic features. Notably, we uncovered a phenomenon known as chloroplast-to-mitochondrion DNA transformation, wherein DNA sequences are transferred from the chloroplast genome to the mitogenome. This phenomenon has also been observed in other Chinese herbal medicines such as C. chinensis, C. deltoidei, C. omeiensis, and Saposhnikovia divaricate (Ni et al., 2022; Zhong et al., 2023). By examining mitogenome, we identified 42 homologous fragments between the mitogenome and chloroplast genome, accounting for 13.14% of the total mitogenome. These fragments, termed mitochondrial plastid sequences (MTPTs), encompass complete genes within them. The presence of similar transferred genes has been documented in numerous mitochondrial genomes. For instance, Quercus acutissima contains 15.7 kb MTPTs, representing 3.49% of its mitogenome (Liu et al., 2022), while S. miltiorrhiza exhibits sixteen fragments resembling the plastome in its mitogenome (Yang et al., 2022). Such transferred genes highlight the dynamic nature of plant mitochondrial genomes and suggest the inter-organelle genetic exchange.

Using Deepred-mt, we predicted RNA editing sites in the S. sessilifolia mitogenome. A total of 639 C-to-U RNA editing sites were identified across 32 unique mitochondrial PCGs. Among all genes, nad4 and ccmC had the highest number of editing sites. The Nad4 gene encodes mitochondrial respiratory chain complex I subunit IV and is highly conserved in maize mitochondria (Marienfeld and Newton, 1994). Prior studies have established a connection between DEK43 and cis-splicing of Nad4 gene in maize mitochondria (Ren et al., 2020), and the significance of Nad4 intron 3 in normal seed development has also been demonstrated (Zhu et al., 2019). CcmC plays a pivotal role in the maturation of cytochrome c and is translated as a long precursor with an N-extension. Our analysis identified 31 C-to-U RNA editing events in N-extension and cmC-homologous region (ccmC-core region), which contribute to the conservation of amino acid sequence (Kitazaki et al., 2009). RNA editing serves as a crucial post-transcriptional modification mechanism which greatly influence the expression and function of mitochondrial gene.

Through synteny analysis and phylogenetic inference, we explored the evolutionary of S. sessilifolia mitochondria. The co-linear blocks observed in the mitogenomes of closely related species did not exhibit the same arrangement, indicating extensive genomic rearrangements. Our phylogenetic analysis unveiled that S. sessilifolia belongs to the family Stemonaceae within the order Pandanales, showing a close relationship to P. odorifer.

An analysis of codon usage among the mitochondrial PCGs in S. sessilifolia demonstrated preferential codon usage for certain amino acids. For instance, alanine (Ala) displayed a higher preference for the codon GCU, whereas histidine (His) favored the codon CAU. Comprehending these codon usage patterns enhances our understanding of the molecular evolution and functional constraints associated with mitochondrial genes.

Repeat sequences played a significant role in the rearrangement of the mitogenome (Cole et al., 2018). Extensive gene repeat sequences have been observed in Stemonaceae species, including S. mairei (Lu et al., 2018a), C. japonica (Lu et al., 2018b), and S. parviflora (Wei and Li, 2022). The repeats analysis indicated the presence of microsatellites (SSRs), tandem repeats, and dispersed repeats within the S. sessilifolia mitogenome. SSRs were the most abundant repeat type, with monomeric polymers being the most prevalent. Dispersed repeats and tandem repeats exhibited variations in their distribution across different chromosomes.

In conclusion, our study provides a thorough analysis of the mitochondrial genome of S. sessilifolia. The identification of transferred genes from the chloroplast genome, RNA editing sites, evolutionary relationships, codon usage patterns, and repeat sequences enhances our understanding of the genetic features and evolutionary dynamics of S. sessilifolia. Future investigations could focus on functional analyses of the transferred genes and on unraveling the impact of RNA editing on mitochondrial gene expression in S. sessilifolia.

Statements

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: https://www.ncbi.nlm.nih.gov/, PP234475.1, https://www.ncbi.nlm.nih.gov/, PRJNA1073764, https://www.ncbi.nlm.nih.gov/, SAMN39839158, https://www.ncbi.nlm.nih.gov/, SRR27999910, https://www.ncbi.nlm.nih.gov/genbank/, PP692484, https://www.ncbi.nlm.nih.gov/genbank/, PP692485, https://www.ncbi.nlm.nih.gov/genbank/, PP692486, https://www.ncbi.nlm.nih.gov/genbank/, PP692487, https://www.ncbi.nlm.nih.gov/genbank/, PP692488, https://www.ncbi.nlm.nih.gov/genbank/, PP692489, https://www.ncbi.nlm.nih.gov/genbank/, PP692490.

Author contributions

YX: Conceptualization, Data curation, Formal Analysis, Software, Writing–original draft, Writing–review and editing. WL: Conceptualization, Data curation, Writing–original draft, Writing–review and editing. LG: Conceptualization, Formal Analysis, Investigation, Writing–original draft, Writing–review and editing. XZ: Conceptualization, Funding acquisition, Investigation, Resources, Supervision, Validation, Writing–original draft, Writing–review and editing.

Funding

The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. This work was supported by Key Project of Natural Science Foundation of Hebei province of China (No. H2017209233), Leader talent cultivation plan of innovation team in Tangshan city (No. 14130225B), Leader talent cultivation plan of innovation team in Hebei province (No. LJRC001), and National Natural Science Foundation of China (No. 81101483).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2024.1395805/full#supplementary-material

References

  • 1

    BeierS.ThielT.MünchT.ScholzU.MascherM. (2017). MISA-web: a web server for microsatellite prediction. Bioinformatics33 (16), 2583–2585. 10.1093/bioinformatics/btx198

  • 2

    BensonD. A.CavanaughM.ClarkK.Karsch-MizrachiI.LipmanD. J.OstellJ.et al (2013). GenBank. Nucleic Acids Res.41 (Database issue), D36–D42. 10.1093/nar/gks1195

  • 3

    BensonG. (1999). Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res.27 (2), 573–580. 10.1093/nar/27.2.573

  • 4

    ChanP. P.LinB. Y.MakA. J.LoweT. M. (2021). tRNAscan-SE 2.0: improved detection and functional classification of transfer RNA genes. Nucleic Acids Res.49 (16), 9077–9096. 10.1093/nar/gkab688

  • 5

    ChanmahasathienW.AmpasavateC.GregerH.LimtrakulP. (2011). Stemona alkaloids, from traditional Thai medicine, increase chemosensitivity via P-glycoprotein-mediated multidrug resistance. Phytomedicine18 (2-3), 199–204. 10.1016/j.phymed.2010.07.014

  • 6

    ChenY.YeW.ZhangY.XuY. (2015). High speed BLASTN: an accelerated MegaBLAST search tool. Nucleic Acids Res.43 (16), 7762–7768. 10.1093/nar/gkv784

  • 7

    ColeL. W.GuoW.MowerJ. P.PalmerJ. D. (2018). High and variable rates of repeat-mediated mitochondrial genome rearrangement in a genus of plants. Mol. Biol. Evol.35 (11), 2773–2785. 10.1093/molbev/msy176

  • 8

    EderaA. A.SmallI.MiloneD. H.Sanchez-PuertaM. V. (2021). Deepred-Mt: deep representation learning for predicting C-to-U RNA editing in plant mitochondria. Comput. Biol. Med.136, 104682. 10.1016/j.compbiomed.2021.104682

  • 9

    FanL. L.XuF.HuJ. P.YangD. H.ChenH. B.KomatsuK.et al (2015). Binary chromatographic fingerprint analysis of stemonae radix from three Stemona plants and its applications. J. Nat. Med.69 (3), 402–410. 10.1007/s11418-015-0887-7

  • 10

    GreinerS.BockR. (2013). Tuning a ménage à trois: co-evolution and co-adaptation of nuclear and organellar genomes in plants. Bioessays35 (4), 354–365. 10.1002/bies.201200137

  • 11

    JinJ. J.YuW. B.YangJ. B.SongY.dePamphilisC. W.YiT. S.et al (2020). GetOrganelle: a fast and versatile toolkit for accurate de novo assembly of organelle genomes. Genome Biol.21 (1), 241. 10.1186/s13059-020-02154-5

  • 12

    KatohK.MisawaK.KumaK.MiyataT. (2002). MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res.30 (14), 3059–3066. 10.1093/nar/gkf436

  • 13

    KatohK.RozewickiJ.YamadaK. D. (2019). MAFFT online service: multiple sequence alignment, interactive sequence choice and visualization. Brief. Bioinform20 (4), 1160–1166. 10.1093/bib/bbx108

  • 14

    KitazakiK.NomotoY.AoshimaA.MikamiT.KuboT. (2009). A mitochondrial gene involved in cytochrome c maturation (ccmC) is expressed as a precursor with a long NH2-terminal extension in sugar beet. J. Plant Physiol.166 (7), 775–780. 10.1016/j.jplph.2008.09.002

  • 15

    KolmogorovM.YuanJ.LinY.PevznerP. A. (2019). Assembly of long, error-prone reads using repeat graphs. Nat. Biotechnol.37 (5), 540–546. 10.1038/s41587-019-0072-8

  • 16

    KozikA.RowanB. A.LavelleD.BerkeL.SchranzM. E.MichelmoreR. W.et al (2019). The alternative reality of plant mitochondrial DNA: one ring does not rule them all. PLoS Genet.15 (8), e1008373. 10.1371/journal.pgen.1008373

  • 17

    KrzywinskiM.ScheinJ.BirolI.ConnorsJ.GascoyneR.HorsmanD.et al (2009). Circos: an information aesthetic for comparative genomics. Genome Res.19 (9), 1639–1645. 10.1101/gr.092759.109

  • 18

    KumarS.StecherG.TamuraK. (2016). MEGA7: molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol. Biol. Evol.33 (7), 1870–1874. 10.1093/molbev/msw054

  • 19

    KurtzS.ChoudhuriJ. V.OhlebuschE.SchleiermacherC.StoyeJ.GiegerichR. (2001). REPuter: the manifold applications of repeat analysis on a genomic scale. Nucleic Acids Res.29 (22), 4633–4642. 10.1093/nar/29.22.4633

  • 20

    LetunicI.BorkP. (2021). Interactive Tree of Life (iTOL) v5: an online tool for phylogenetic tree display and annotation. Nucleic Acids Res.49 (W1), W293–w296. 10.1093/nar/gkab301

  • 21

    LewisS. E.SearleS. M.HarrisN.GibsonM.LyerV.RichterJ.et al (2002). Apollo: a sequence annotation editor. Genome Biol.3 (12), Research0082. 10.1186/gb-2002-3-12-research0082

  • 22

    LiS. F.ZhangG. J.YuanJ. H.DengC. L.GaoW. J. (2016). Repetitive sequences and epigenetic modification: inseparable partners play important roles in the evolution of plant sex chromosomes. Planta243 (5), 1083–1095. 10.1007/s00425-016-2485-7

  • 23

    LiuD.GuoH.ZhuJ.QuK.ChenY.GuoY.et al (2022). Complex physical structure of complete mitochondrial genome of Quercus acutissima (fagaceae): a significant energy plant. Genes. (Basel)13 (8), 1321. 10.3390/genes13081321

  • 24

    LuQ.YeW.LuR.XuW.QiuY. (2018a). Phylogenomic and comparative analyses of complete plastomes of croomia and Stemona (Stemonaceae). Int. J. Mol. Sci.19 (8), 2383. 10.3390/ijms19082383

  • 25

    LuR.XuW.LuQ.LiP.LoshJ.HinaF.et al (2018b). Generation and classification of transcriptomes in two Croomia species and molecular evolution of CYC/TB1 genes in Stemonaceae. Plant divers.40 (6), 253–264. 10.1016/j.pld.2018.11.006

  • 26

    MaligaP. (2022). Engineering the plastid and mitochondrial genomes of flowering plants. Nat. Plants8 (9), 996–1006. 10.1038/s41477-022-01227-6

  • 27

    ManchekarM.Scissum-GunnK.SongD.KhaziF.McLeanS. L.NielsenB. L. (2006). DNA recombination activity in soybean mitochondria. J. Mol. Biol.356 (2), 288–299. 10.1016/j.jmb.2005.11.070

  • 28

    MarienfeldJ. R.NewtonK. J. (1994). The nad4 gene of maize mitochondria is highly conserved. Plant Physiol.104 (1), 301–302. 10.1104/pp.104.1.301

  • 29

    MengerK. E.Rodríguez-LuisA.ChapmanJ.NichollsT. J. (2021). Controlling the topology of mammalian mitochondrial DNA. Open Biol.11 (9), 210168. 10.1098/rsob.210168

  • 30

    MilneI.StephenG.BayerM.CockP. J.PritchardL.CardleL.et al (2013). Using Tablet for visual exploration of second-generation sequencing data. Brief. Bioinform14 (2), 193–202. 10.1093/bib/bbs012

  • 31

    MinhB. Q.SchmidtH. A.ChernomorO.SchrempfD.WoodhamsM. D.von HaeselerA.et al (2020). IQ-TREE 2: new models and efficient methods for phylogenetic inference in the genomic era. Mol. Biol. Evol.37 (5), 1530–1534. 10.1093/molbev/msaa015

  • 32

    NiY.LiJ.ChenH.YueJ.ChenP.LiuC. (2022). Comparative analysis of the chloroplast and mitochondrial genomes of Saposhnikovia divaricata revealed the possible transfer of plastome repeat regions into the mitogenome. BMC Genomics23 (1), 570. 10.1186/s12864-022-08821-0

  • 33

    PetitR. J.DuminilJ.FineschiS.HampeA.SalviniD.VendraminG. G. (2005). Comparative organization of chloroplast, mitochondrial and nuclear diversity in plant populations. Mol. Ecol.14 (3), 689–701. 10.1111/j.1365-294X.2004.02410.x

  • 34

    RenR. C.WangL. L.ZhangL.ZhaoY. J.WuJ. W.WeiY. M.et al (2020). DEK43 is a P-type pentatricopeptide repeat (PPR) protein responsible for the Cis-splicing of nad4 in maize mitochondria. J. Integr. Plant Biol.62 (3), 299–313. 10.1111/jipb.12843

  • 35

    RoseR. J. (2021). Contribution of massive mitochondrial fusion and subsequent fission in the plant life cycle to the integrity of the mitochondrion and its genome. Int. J. Mol. Sci.22 (11), 5429. 10.3390/ijms22115429

  • 36

    SacconeC.GissiC.LanaveC.LarizzaA.PesoleG.ReyesA. (2000). Evolution of the mitochondrial genetic system: an overview. Gene261 (1), 153–159. 10.1016/s0378-1119(00)00484-4

  • 37

    ShanY.LiJ.ZhangX.YuJ. (2023). The complete mitochondrial genome of Amorphophallus albus and development of molecular markers for five Amorphophallus species based on mitochondrial DNA. Front. Plant Sci.14, 1180417. 10.3389/fpls.2023.1180417

  • 38

    ShiL.ChenH.JiangM.WangL.WuX.HuangL.et al (2019). CPGAVAS2, an integrated plastome sequence annotator and analyzer. Nucleic Acids Res.47 (W1), W65-W73–w73. 10.1093/nar/gkz345

  • 39

    SloanD. B. (2013). One ring to rule them all? Genome sequencing provides new insights into the 'master circle' model of plant mitochondrial DNA structure. New Phytol.200 (4), 978–985. 10.1111/nph.12395

  • 40

    StotterA. T.VipondM. N.GuillouP. J. (1990). The response of general surgeons to HIV in England and Wales. Ann. R. Coll. Surg. Engl.72 (5), 281–286.

  • 41

    TanC. Y.ShiB. B.BaoM. F.CaiX. H. (2023). Anti-inflammatory maistemonine-class alkaloids of Stemona japonica. Nat. Prod. Bioprospect13 (1), 8. 10.1007/s13659-023-00372-5

  • 42

    TanJ.FornerJ.KarcherD.BockR. (2022). DNA base editing in nuclear and organellar genomes. Trends Genet.38 (11), 1147–1169. 10.1016/j.tig.2022.06.015

  • 43

    TillichM.LehwarkP.PellizzerT.Ulbricht-JonesE. S.FischerA.BockR.et al (2017). GeSeq - versatile and accurate annotation of organelle genomes. Nucleic Acids Res.45 (W1), W6-W11–w11. 10.1093/nar/gkx391

  • 44

    WanrooijP. H.UhlerJ. P.ShiY.WesterlundF.FalkenbergM.GustafssonC. M. (2012). A hybrid G-quadruplex structure formed between RNA and DNA explains the extraordinary stability of the mitochondrial R-loop. Nucleic Acids Res.40 (20), 10334–10344. 10.1093/nar/gks802

  • 45

    WeiR.LiQ. (2022). The complete chloroplast genome of endangered species Stemona parviflora: insight into the phylogenetic relationship and conservation implications. Genes. (Basel)13 (8), 1361. 10.3390/genes13081361

  • 46

    WickR. R.JuddL. M.GorrieC. L.HoltK. E. (2017). Unicycler: resolving bacterial genome assemblies from short and long sequencing reads. PLoS Comput. Biol.13 (6), e1005595. 10.1371/journal.pcbi.1005595

  • 47

    YangH.ChenH.NiY.LiJ.CaiY.MaB.et al (2022). De novo hybrid assembly of the salvia miltiorrhiza mitochondrial genome provides the first evidence of the multi-chromosomal mitochondrial DNA structure of salvia species. Int. J. Mol. Sci.23 (22), 14267. 10.3390/ijms232214267

  • 48

    YangX. Z.ZhuJ. Y.TangC. P.KeC. Q.LinG.ChengT. Y.et al (2009). Alkaloids from roots of Stemona sessilifolia and their antitussive activities. Planta Med.75 (2), 174–177. 10.1055/s-0028-1088345

  • 49

    ZardoyaR. (2020). Recent advances in understanding mitochondrial genome diversity. F1000Res9, 1000. 10.12688/f1000research.21490.1

  • 50

    ZhangD.GaoF.JakovlićI.ZouH.ZhangJ.LiW. X.et al (2020a). PhyloSuite: an integrated and scalable desktop platform for streamlined molecular sequence data management and evolutionary phylogenetics studies. Mol. Ecol. Resour.20 (1), 348–355. 10.1111/1755-0998.13096

  • 51

    ZhangG. J.DongR.LanL. N.LiS. F.GaoW. J.NiuH. X. (2020b). Nuclear integrants of organellar DNA contribute to genome structure and evolution in plants. Int. J. Mol. Sci.21 (3), 707. 10.3390/ijms21030707

  • 52

    ZhangH.MeltzerP.DavisS. (2013). RCircos: an R package for Circos 2D track plots. BMC Bioinforma.14, 244. 10.1186/1471-2105-14-244

  • 53

    ZhongF.KeW.LiY.ChenX.ZhouT.XuB.et al (2023). Comprehensive analysis of the complete mitochondrial genomes of three Coptis species (C. chinensis, C. deltoidea and C. omeiensis): the important medicinal plants in China. Front. Plant Sci.14, 1166420. 10.3389/fpls.2023.1166420

  • 54

    ZhuC.JinG.FangP.ZhangY.FengX.TangY.et al (2019). Maize pentatricopeptide repeat protein DEK41 affects cis-splicing of mitochondrial nad4 intron 3 and is required for normal seed development. J. Exp. Bot.70 (15), 3795–3808. 10.1093/jxb/erz193

Summary

Keywords

Stemona sessilifolia, mitochondrial genome, repeated sequences, phylogenetic relationship, RNA editing

Citation

Xie Y, Liu W, Guo L and Zhang X (2024) Mitochondrial genome complexity in Stemona sessilifolia: nanopore sequencing reveals chloroplast gene transfer and DNA rearrangements. Front. Genet. 15:1395805. doi: 10.3389/fgene.2024.1395805

Received

04 March 2024

Accepted

17 May 2024

Published

04 June 2024

Volume

15 - 2024

Edited by

Madhav P. Nepal, South Dakota State University, United States

Reviewed by

Yi Zou, Chinese Academy of Agricultural Sciences, China

Pedro Paulo Gattai Gomes, University of São Paulo, Brazil

Updates

Copyright

*Correspondence: Xuemei Zhang,

†These authors have contributed equally to this work and share first authorship

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics