ORIGINAL RESEARCH article

Front. Plant Sci., 18 April 2023

Sec. Plant Systematics and Evolution

Volume 14 - 2023 | https://doi.org/10.3389/fpls.2023.1126175

Novel insights into chloroplast genome evolution in the green macroalgal genus Ulva (Ulvophyceae, Chlorophyta)

  • 1. CAS Key Laboratory of Marine Ecology and Environmental Sciences, Institute of Oceanology, Chinese Academy of Sciences (IOCAS), Qingdao, Shandong, China

  • 2. Marine Ecology and Environmental Science Laboratory, Pilot National Laboratory for Marine Science and Technology (Qingdao), Qingdao, Shandong, China

  • 3. Center for Ocean Mega-Science, Chinese Academy of Sciences, Qingdao, Shandong, China

  • 4. Harbin University of Science and Technology, Weihai, Shandong, China

Article metrics

View details

8

Citations

3,5k

Views

1,4k

Downloads

Abstract

To understand the evolutionary driving forces of chloroplast (or plastid) genomes (plastomes) in the green macroalgal genus Ulva (Ulvophyceae, Chlorophyta), in this study, we sequenced and constructed seven complete chloroplast genomes from five Ulva species, and conducted comparative genomic analysis of Ulva plastomes in Ulvophyceae. Ulva plastome evolution reflects the strong selection pressure driving the compactness of genome organization and the decrease of overall GC composition. The overall plastome sequences including canonical genes, introns, derived foreign sequences and non-coding regions show a synergetic decrease in GC content at varying degrees. Fast degeneration of plastome sequences including non-core genes (minD and trnR3), derived foreign sequences, and noncoding spacer regions was accompanied by the marked decrease of their GC composition. Plastome introns preferentially resided in conserved housekeeping genes with high GC content and long length, as might be related to high GC content of target site sequences recognized by intron-encoded proteins (IEPs), and to more target sites contained by long GC-rich genes. Many foreign DNA sequences integrated into different intergenic regions contain some homologous specific orfs with high similarity, indicating that they could have been derived from the same origin. The invasion of foreign sequences seems to be an important driving force for plastome rearrangement in these IR-lacking Ulva cpDNAs. Gene partitioning pattern has changed and distribution range of gene clusters has expanded after the loss of IR, indicating that genome rearrangement was more extensive and more frequent in Ulva plastomes, which was markedly different from that in IR-containing ulvophycean plastomes. These new insights greatly enhance our understanding of plastome evolution in ecologically important Ulva seaweeds.

Introduction

The green algal class Ulvophyceae harbors at least 13 orders and more than 2700 species thus far, and ranks second in the number of species among Chlorophyta only next to the class Chlorophyceae (Guiry and Guiry, 2023). Species in the Ulvophyceae show great diversification of cytological types and morphological complexity, which varied from small unicellular species (e.g. Scotinosphaerales), to large multicellular thalli composed of uninucleate cells (e.g. Ulvales) or multinucleate cells (e.g. Cladophorales), to the gigantic single-celled coenocytic thalli (e.g. Bryopsidales and Dasycladales) (Cocquyt et al., 2010; Leliaert et al., 2012; Gulbrandsen et al., 2021). Meanwhile, their chloroplast or plastid genomes (plastomes, or cpDNAs, or ptDNAs) display miraculous variations in genome architecture, genome size, GC content, gene density, intron content and gene order (Lang and Nedelcu, 2012; Smith, 2017; Turmel and Lemieux, 2018), ranging from the circular 195.9-kb plastome with two inverted repeats (IRs) in Pseudendoclonium akinetum (Ulotrichales) (Pombert et al., 2005), which is the first ulvophycean cpDNA sequenced, to the 34 multiple hairpin cpDNA chromosomes in Boodlea composita (Cladophorales) with high GC content (average 57%) (Del Cortona et al., 2017), from the compact 74.5-kb IR-lacking plastome in Callipsygma wilsonis (Bryopsidales) (Cremen et al., 2018) to the approximately 2000-kb plastome in Acetabularia acetabulum (Dasycladales) with a high noncoding content (more than 85%) (de Vries et al., 2013).

The green macroalgal genus Ulva Linnaeus 1753 (Ulvophyceae, Chlorophyta) is the species-richest genus in the Ulvales. As more Ulva species have been accurately identified recently, 102 species names have been flagged as accepted taxonomically up to now (Guiry and Guiry, 2023). Globally, many Ulva seaweeds (e.g. Ulva prolifera, Ulva compressa, and Ulva meridionalis) are notorious for their rapid vegetative growth in eutrophic waters, leading to green tides formed by the accumulation of excess biomass (Wang et al., 2019; Liu et al., 2022b). Ulva simple morphologies show high similarity at the interspecific level, meanwhile cytological and morphological features could vary greatly at the intraspecific level, thus accurate identification of Ulva species has been challenging (e.g. Blomster et al., 2002; Hayden and Waaland, 2002). The use of molecular markers (e.g., ITS, rbcL, and tufA) for species identification has become the mainstream method to ensure the accuracy and credibility of identification results (e.g. Hofmann et al., 2010; Hughey et al., 2019; Steinhagen et al., 2019). However, due to the limited differentiation signals of these marker sequences, their resolution is inadequate for identifying closely related Ulva species. Organelle genomes (cpDNAs and mtDNAs) as super molecular markers have been proved to be powerful to understand the evolution and molecular species concepts in the genus Ulva, and are potential resources for developing specific high-resolution molecular markers (Mitsuhashi et al., 2020). Recently, phylogenomic analysis based on organelle genome data clearly depicted the evolutionary nature of double crown radiation in the phylogeny and speciation of Ulva species (Liu and Melton, 2021; Liu et al., 2022a; Liu et al., 2022b).

The data of Ulva plastomes have accumulated rapidly recently based on efficient high-throughput sequencing technology (Melton et al., 2015; Fort et al., 2021; Hughey et al., 2021), which makes it possible for more accurately understanding of the evolution trend of Ulva plastomes on a more detailed and specific sampling scale. A total of 33 plastomes from 17 Ulva species have been documented in the GenBank database thus far. The sequenced Ulva plastomes show many unique features when compared with the counterparts in other ulvophycean lineages. These Ulva plastomes belong to the compact circular IR-lacking plastomes with the smaller size (86.73 - 119.87 kb) and the lowest GC content (23.89 - 26.25%) within the Ulvophyceae (Table 1). Variations in the Ulva plastome size at interspecific and intraspecific level were mainly caused by differences in content of group I/II introns, integration of foreign DNA fragments and content of intergenic regions (Liu and Melton, 2021). The Ulva plastomes show high conservation in repertoire of canonical genes, and share the same set of 100 core genes including 71 protein-coding genes (PCGs), three ribosomal RNA (rRNA) genes and 26 transfer RNA (tRNA) genes (Wang et al., 2021). The organelle division inhibitor factor gene, minD, was observed to be present only in the plastome of Ulva aragoënsis (Liu and Melton, 2021), which used to be regarded as Ulva flexuosa (Cai et al., 2017). Only one group II (derived) intron (intron infA-62) were shared by all sequenced Ulva plastomes, and all other introns displayed highly variable and sporadic distribution pattern. Ulva plastome architectures were dynamic and plastic not conserved at the intrageneric level due to frequent genome rearrangements (Mitsuhashi et al., 2020; Liu and Melton, 2021).

Table 1

LineageSubcladeSpeciesAbbr.Accession numberSize (bp)GC (%)References
Ulva IIAUlva proliferaUpr1OP98512993,06624.78This study
Ulva proliferaUpr2OP98513093,06624.78This study
Ulva proliferaUpr3OP98513193,07224.78This study
Ulva proliferaUpr4KX34286793,06624.78Jiang et al., 2019
Ulva proliferaUpr5MZ57150899,72425.28GenBank
Ulva linzaUliKX05832386,72624.79Wang et al., 2017
Ulva tortaUto1OL684342112,03424.89This study
Ulva tortaUto2MZ703011105,42325.24Wen et al., 2022
Ulva californicaUcaMZ56147592,12624.71Lin et al., 2022
Ulva aragoënsisUar1OP98513287,17224.68This study
Ulva aragoënsis (Ulva flexuosa*)Uar2KX57994389,41424.97Cai et al., 2017
IBUlva giganteaUgiMT179350117,60625.73Fort et al., 2021
Ulva lactuca (syn. Ulva fasciata)Ula1KT88261496,00524.87Melton and Lopez-Bautista, 2017
Ulva lactucaUla2MH73097295,99724.87Hughey et al., 2019
Ulva ohnoiUohAP018696103,31325.44Suzuki et al., 2018
Ulva lacinulata (Ulva laetevirens*)Ulc1MT179351103,44425.40Fort et al., 2021
Ulva lacinulataUlc2MW543061107,24225.82Hughey et al., 2021
Ulva lacinulata (Ulva laetevirens*)Ulc3MW531676110,88925.63Wang et al., 2021
Ulva lacinulata (Ulva rigida*)Ulc4MN389525103,52325.40Hughey et al., 2021
Ulva sp. A AF-2021 (Ulva rigida*)Usp2MT17935296,67324.57Fort et al., 2021
ICUlva meridionalisUmeOP985133122,17224.86This study
Ulva sp. UNA00071828Usp1KP72061699,98325.30Melton et al., 2015
Ulva tepidaUteOL68434194,44924.49This study
Ulva sp. Q253Usp3MW69978888,801**23.89GenBank
Ulva sp. (Ulva prolifera*)Usp3MN85387988,80123.89GenBank
Ulva sp. (Ulva meridionalis*)Usp3MN88954088,65323.91Liu J. et al., 2020
Ulva IIIIAUlva compressaUco1MW548841114,29126.23Liu and Melton, 2021
Ulva compressaUco2MW34428791,18925.86Liu and Melton, 2021
Ulva compressaUco3MW35378196,82426.17Liu and Melton, 2021
Ulva compressa (syn. Ulva mutabilis)Uco4MK069584119,86626.24GenBank
Ulva compressaUco5MK069585>89,16426.25GenBank
Ulva compressaUco6MT91692994,22625.80Xia et al., 2021
Ulva compressaUco7KX59527596,80826.18GenBank
Ulva intestinalisUinMZ15870399,04124.97Wang et al., 2021
IIBUlva rigida (Ulva rotundata*)Uri1MT179353118,20626.12Fort et al., 2021
Ulva rigidaUri2MW543060117,99526.13Hughey et al., 2021
Ulva fenestrataUfeMT17934994,65425.27Fort et al., 2021
Ulva australis (syn. Ulva pertusa)Uau1MN853875104,38025.66Han et al., 2020
Ulva australisUau2LC507117102,89925.33Mitsuhashi et al., 2020
Ulva australisUau3MT17934899,820**25.21Fort et al., 2021

The sequenced 40 plastomes of Ulva species for comparative analysis.

* The Ulva plastomes with wrong species name assignment, which were deposited in the GenBank database, have been corrected. Ulva laetevirens (MT179351), Ulva rigida (MT179352), and Ulva rotundata (MT179353) have been corrected to Ulva lacinulata (MT179351), Ulva sp. A AF-2021 (MT179352), and Ulva rigida (MT179353), respectively (Fort et al., 2021).

In this study, we sequenced and constructed seven complete plastomes from five Ulva species including Ulva prolifera O.F.Müller, Ulva aragoënsis (Bliding) Maggs, Ulva torta (Mertens) Trevisan, Ulva tepida Y. Masakiyo & S. Shimada, and Ulva meridionalis R. Horimoto & S. Shimada, and conducted comparative plastomic analysis to understand the evolutionary driving forces in ecologically important Ulva seaweeds.

Materials and methods

Sample collection and DNA extraction

Three free-floating algal samples of Ulva prolifera O.F.Müller (LF001, LF002 and LF003) were collected on 2 Jul. 2021 at the First (N36°05’53’’, E120°34’17’’), Second (N36°04′96″, E120°34′83″) and Third (N36°05′03″, E120°36′82″) bathing beaches along the coast of Qingdao, Shandong, China, respectively. The sessile samples of Ulva aragoënsis (Bliding) Maggs (LF005) and Ulva tepida Y.Masakiyo & S.Shimada (LF006) were collected on 11 Aug. 2021 at the Trestle Bridge (N36°06′09″, E120°31′08″) and the First bathing beach (N36°05′34″, E120°33′81″) along the coast of Qingdao, Shandong, China, respectively. The free-floating algal thalli of Ulva torta (Mertens) Trevisan (LF007) and Ulva meridionalis R.Horimoto & S.Shimada (LF010) were sampled on 4 Aug. 2021 in the Sakura Lake (37°07’30’’-56’’N, 122°27’03’’-50’’E), Rongcheng, Shandong, China. These Ulva samples were stored in coolers (5-8°C) after collection and transported back to the laboratory within 48 hours. Algal thallus for each individual Ulva thallus was cultured in a 9-cm diameter Petri dish containing 25-mL L1 medium with 0.5‰ GeO2, 50 µg/mL Dipterex (Fengcheng Animal Medicine Co., Ltd, China) and a suite of antibiotics (per mL: 50 µg streptomycin, 66.6 µg gentamycin, 20 µg ciprofloxacin, 2.2 µg chloramphenicol, and 100 µg ampicillin) (Shibl et al., 2020). The culture was maintained at 18°C, 100 - 120 μmol photons m-2 s-1 in the photoperiod of 12 h light: 12 h darkness in a GXZ-380C temperature-controlled incubator (Ningbo Jiangnan, China). After at least one week of culture, fresh algal tissue from each Ulva thallus was used for DNA extraction using a Plant Genome DNA Kit (DP305, Tiangen Biotech, Beijing, China) according to the manufacturer’s instructions. Species identification was conducted based on phylogenetic analyses of two common DNA marker datasets, including the nuclear ITS region including the 5.8S rDNA gene, and the chloroplast rbcL gene (Hayden and Waaland, 2002; Liu F. et al., 2020).

DNA sequencing and plastome assembly

The quality and concentration of total genomic DNA extracted were checked using a NanoPhotometer spectrophotometer (Implen, CA, USA), and a Qubit 2.0 Flurometer (Life Technologies, CA, USA), respectively. Qualified DNA samples were fragmented into 350 bp by Covaris S220 ultrasonic crater for library construction. The libraries were sequenced on an Illumina NovaSeq platform (Illumina, USA) using paired-end sequencing, yielding about 10 Gb sequencing raw data of paired-end reads with 150 bp in length for each Ulva sample. Clean data were harvested by trimming sequencing adapters and removing short or low-quality reads from the raw data. Complete Ulva plastomes were constructed by the GetOrganelle v1.7.1 (Jin et al., 2020). The plastome of U. compressa (MW353781) was used as the reference genome for assembly. Plastome assemblies were re-examined by aligning reads against the assembled plastome sequence using the MEM algorithm of BWA v0.7.17 (Li and Durbin, 2010). VarScan v2.3.9 (Koboldt et al., 2009) and IGV v2.8.12 (Robinson et al., 2011) were employed to examine mutation sites and to verify assembly results, respectively.

Annotation of Ulva plastomes

Protein-coding genes (PCGs) were annotated by using the Open Reading Frame Finder at the National Center for Biotechnology Information (NCBI) website (https://www.ncbi.nlm.nih.gov/orffinder/), and by aligning homologous PCGs from Ulva cpDNAs deposited in the GenBank database with the newly sequenced Ulva plastomes. Transfer RNA genes (tRNAs) were searched for by reconstructing their cloverleaf structures using the tRNAscan-SE 2.0 software with default parameters (Chan et al., 2021). Ribosomal RNA genes (rRNAs) were identified by using the RNAweasel (https://megasun.bch.umontreal.ca/apps/rnaweasel/), and by aligning homologous rRNAs. The free-standing and intronic open reading frames (orfs) were found by using the Open Reading Frame Finder at the NCBI website. Intron insertion-sites were determined manually by aligning the intron-containing homologous genes, and corresponding genes in the U. compressa (MW353781) plastome were used as a reference (Liu and Melton, 2021). Intron name was defined as host gene plus insertion site. The class and core structure of all these introns were determined by using the RNAweasel and Mfold (Zuker, 2003). The core domains of intron-encoded proteins (IEPs) and free-standing specific ORFs were determined by significant Pfam-A matches (Bateman et al., 2000). To ensure the accuracy of comparative analysis, we have re-annotated the plastomes of Ulva species and Blidingia minima (MK408749 and MT948112) deposited in the GenBank database with the same method as above. All annotation results (including genes and introns) were manually verified. In some Ulva plastomes (e.g. MZ561475, MK069585, MT916929, and KX342867), incorrect annotations and abnormal sequence errors have been corrected in our subsequent comparative analysis.

Plastome comparison and phylogenetic analysis

Base composition of Ulva plastomes and other DNA sequences was determined by using MEGA 7.0 (Kumar et al., 2016). Tandem repeats were analyzed by using Tandem Repeats Finder with parameter settings of two for matches and seven for mismatches and indels (Benson, 1999). Differences and identity values of DNA sequences were calculated by use of BioEdit v7.1.9 (Hall, 1999). Synteny analysis of Ulva plastomes was executed by using Mauve v2.3.1 software with default parameters (Darling et al., 2010). A new class of specific orfs named Ucp-orf was found in Ulva plastomes. The thorough search for Ucp-orf-like sequences in Ulva plastomes was conducted against the NCBI nucleotide database. A total of 29 full-length Ucp-orfs were detected in 17 of 40 Ulva cpDNAs. Multiple sequence alignments of Ucp-ORFs were conducted by using ClustalX 1.83 with the default settings (Thompson et al., 1997). The structural domain or motif in Ucp-ORFs was searched on the HMMER website (https://www.ebi.ac.uk/Tools/hmmer/search/phmmer) and using InterProScan tool (Paysan-Lafosse et al., 2023). The phylogenetic relationships were inferred with the Maximum Likelihood (ML) method based on the JTT matrix-based model (Jones et al., 1992) by using MEGA 7.0 (Kumar et al., 2016). There was a total of 113 positions in the final dataset of Ucp-ORFs. Phylogenomic trees were constructed based on two plastome datasets including nucleotide (nt) sequences of 100 common genes and amino acid (aa) sequences of 71 common PCGs from 40 Ulva. The nt sequences of 100 genes and the aa sequences of 71 PCGs were individually aligned, checked and concatenated using ClustalX 1.83 with default settings (Thompson et al., 1997). Maximum-likelihood trees were constructed by IQ-TREE (Trifinopoulos et al., 2016) using default parameters with 1000 ultrafast bootstrap analysis (Minh et al., 2013). The substitution model conducted by IQ-TREE was GTR+G for nt dataset and cpREV+F+I+G4 for aa dataset, respectively. Blidingia minima (MK408749 and MT948112) was used as the outgroup.

Results and discussion

Ulva plastomes show a clear evolutionary trend of becoming smaller and more compact

These seven newly obtained plastomes from five Ulva species were successfully assembled as circular-mapping molecules, with sizes ranging from 87.2 kb in U. aragoënsis (Uar1) to 122.2 kb in U. meridionalis (Ume) (Table 1). The 122.2-kb cpDNA of Ume is the largest Ulva plastome sequenced thus far, and is 1.4 times the smallest one which is the 86.7-kb cpDNA of U. linza (Uli) (Wang et al., 2017). To clarify the unique evolutionary trend of Ulva plastomes, we built the Ulva plastome dataset composed of newly sequenced plastomes and the data deposited in the GenBank database (Table 1), with a total of 40 Ulva plastomes which represented 19 Ulva species from two independent Ulva evolutionary lineages (I and II) (Liu and Melton, 2021; Liu et al., 2022a), and compared Ulva plastomes with those in other lineages of Ulvophyceae (Supplementary Table 1).

On the whole, Ulva cpDNAs show a clear evolutionary trend of becoming smaller and more compact when compared with all circular complete counterparts in Ulvophyceae (Figure 1). The Ulva plastomes only encoded a total of 100 conserved canonical genes, including 71 protein-coding genes (PCGs), three rRNA genes and 26 tRNA genes, which are the least among the sequenced circular ulvophycean plastomes. The overall coding regions composed of these 100 canonical genes occupy approximately 71.2 - 72.5 kb in size. Only small repeat sequences were observed to be concentrated either in intergenic spacer regions or intronic regions, and some specific repeat sequences reside in several PCGs (e.g. rpoB, rpoC1, and rpoC2). Unlike those in Sykidiales (Pseudoneochloris marina), Ulotrichales, Oltmannsiellopsidales, Ignatiales and Trentepohliales (Pombert et al., 2005; Turmel and Lemieux, 2018; Kim et al., 2019; Fang et al., 2021), most of intergenic regions are relatively short and the rRNA operon-encoding inverted repeat (IR) has been completely lost in Ulva plastomes. The minimum size of overall non-coding intergenic regions was only approximately 11.6 kb, which was observed in U. californica (Uca) (Supplementary Table 2).

Figure 1

The ongoing gene loss or transfer can be clearly observed in Ulva plastomes. An intact organelle division inhibitor factor gene, minD (orf306), was observed to be present only in the U. aragoënsis (Uar) cpDNA (Liu and Melton, 2021). Our further comparative analysis shows that this gene or its residue exists in the trnL2-trnS2 intergenic region of cpDNAs in the U. aragoënsis-U. californica-U.torta (Uar-Uca-Uto) subclade, while it was completely lost in the other Ulva plastomes as well as the Blidingia cpDNAs (Gao et al., 2022). The minD was split into two parts (orf68 and orf209) in the Uca cpDNA, due to a 5-bp insertion mutation, whereas this gene has degenerated more seriously in the Uto cpDNAs, leaving only the residue orf44. The fracture and degeneration of minD, as well as the decreased GC content in its homologous sequences (Supplementary Table 2), indicated that there was no selective pressure to retain the integrity of this gene in Uca and Uto. Considering that homologues of minD are present in other lineages of core Ulvophyceae (Turmel and Lemieux, 2018), this gene is likely to have been transferred to the nuclear genome through horizontal transfer in Ulva species containing minD-lacking plastomes. However, we have not found this gene in the nuclear genome of U. mutabilis yet (De Clerck et al., 2018), but it cannot be determined that it does not exist in the nuclear genome, considering the incompleteness of the sequenced Ulva genome.

One specific trnR3(ccu) gene is present in plastomes of some Ulva species including U. prolifera (Upr1-5), U. gigantea (Ugi), U. rigida (Uri1-2), U. torta (Uto1), U. meridionalis (Ume), U. lacinulata (Ulc1-4) and Ulva sp. (Usp2) (Figure 2). This gene is conservatively located in the downstream region adjacent to psbC (Liu and Melton, 2021), but it is situated between psbA and trnT in the Uto1 cpDNA. Comparative analysis shows that this gene is highly similar with trnR2(ucu), indicating that it originated from the duplication of trnR2(ucu), and then its anti-codon mutated from UCU to CCU. More mutation sites were detected in trnR3(ccu) when compared with trnR2(ucu), and the latter maintains a highly conserved sequence in Ulva cpDNAs. A 5-bp (ACAAG) duplication mutation was detected in the dihydrouridine (DHU) stem-loop structure of trnR3(ccu) only in Upr5. The rates of sequence evolution for trnR3(ccu) appear to be dramatically higher than for trnR2(ucu), and the GC contents of trnR3(ccu) tend to decrease in varying degrees (Figure 2), indicating that trnR3(ccu) is subject to different selection pressures when compared with trnR2(ucu). We found that the 26 core tRNA genes are sufficient to meet all the requirements of protein synthesis in Ulva chloroplast genomes (Figure 3). Redundant trnR3(ccu) can be completely replaced by trnR2(ucu) in function, which should be the reason why it underwent significantly accelerated sequence evolution or a complete loss that had occurred.

Figure 2

Figure 3

In addition, these IR-lacking Ulva plastomes could sporadically incorporate foreign sequences in some specific intergenic regions, and accept group I/II introns in some housekeeping genes, which obviously increased their size. The current size of Ulva plastomes was the result of dynamic changes caused by several of the above factors. Especially, marked intraspecific differences in Ulva plastome sizes are common, involving gain or loss of introns, integration of foreign fragments and abundance of repetitive sequences (Liu and Melton, 2021).

Strong selection against GC in Ulva plastomes

The GC composition of ulvophycean plastomes varies significantly among different lineages (Turmel et al., 2017; Zhu et al., 2019; Fang et al., 2021), but particularly remarkable is that the GC content of Ulva cpDNAs was the lowest among the Ulvophyceae so far, ranging from 23.89 to 26.25% (Figure 1), indicating the strong selection against GC to shape the nucleotide composition in Ulva plastomes. Due to that GC content showed greatly heterogeneity in distinct regions of Ulva plastomes, we analyzed the differences in GC content of overall core-gene coding regions, intronic regions, foreign sequence regions, and non-coding intergenic spacer regions among these 40 Ulva cpDNAs.

The GC content is 26.53 - 27.71% in overall coding regions composed of 100 canonical genes (Supplementary Table 2). The GC composition of core-gene coding region as well as its total size showed very similar values at intraspecific level or among closely related species, but there are significant differences in the GC content and size of core-gene coding regions among different Ulva lineages (Table 1; Supplementary Table 2). The 26 tRNA regions have the highest GC content (51.32 - 51.88%), followed by the 3 rRNA regions (44.61 - 45.20%) and then the 71 PCG regions (24.50 - 25.82%) (Supplementary Table 2).

The GC composition is relatively stable in chloroplast rRNAs and tRNAs, but their variations in PCGs fluctuate greatly. Some PCGs related to photosynthesis have much higher GC content, e.g. psbA (40.96 ± 0.17%), psbB (38.60 ± 0.41%), psbC (38.20 ± 0.22%), psbD (38.76 ± 0.35%), and psaC (38.02 ± 0.91%), while some PCGs with large molecular weight, which are mainly involved in transcription and proteolysis, show very low GC content, e.g. ftsH (14.81 ± 0.68%), rpoA (16.96 ± 0.45%), rpoB (19.25 ± 0.65%), rpoC1 (16.24 ± 0.69%), and rpoC2 (14.76 ± 0.95%) (Figure 4). On the whole, codon usage pattern in Ulva chloroplast PCGs showed a much stronger preference for codons with A or T at the third position (Figure 3). The difference in GC content between chloroplast PCGs is mainly determined by their different amino acid composition and the different usage frequency of synonymous codons. The long PCGs with low GC content employ a large number of codons composed of only A and T. For example, the seven most frequently used codons in rpoC2 are AAU(N), AAA(K), UUA(L), UUU(F), UAU(Y), AUU(I), and AUA(I). However, GC-biased PCGs tend to prefer some codons with C at the third position. For example, UUC(F), AAC(N), AUC(I), UAC(Y) and CAC(H) were used more frequently than their synonymous codons in psbA.

Figure 4

The overall intronic regions show great difference in GC composition among different Ulva cpDNAs, ranging from 22.59% in U. linza (Uli) to 34.72% in U. compressa (Uco2) (Supplementary Table 2), as mainly depends on type and content of introns which cpDNA harbors. The GC content of overall foreign sequence regions range from 22.79 to 35.83% in nearly all Ulva cpDNAs with the exception of Usp3 cpDNAs which show much lower values (18.69 - 18.87%) (Supplementary Table 2). The GC content in noncoding intergenic regions is obviously the lowest in the range from 8.38% in U. californica (Uca) to 19.25% in U. australis (Uau1) (Supplementary Table 2).

Low GC content in Ulva plastomes is mainly attributed to strong selection pressure driving A + T richness at a genomic level. This selection pressure seems to act on the overall plastome sequences in the microenvironment of Ulva chloroplasts, including coding regions, introns, foreign sequences and noncoding regions. The GC composition of all these regions has been markedly reduced when compared with the counterparts in other ulvophycean plastomes (Leliaert and Lopez-Bautista, 2015; Turmel et al., 2016). Plastomes in order Bryopsidales also show a trend of decreasing in size (74.5 - 177.8 kb) and GC content (27.07 - 37.71%) (Figure 1), and some species in Bryopsidales adopt the strategy of increasing the number of overlapping regions to make plastomes compact. The lowest GC content in Bryopsidales was 27.07%, which was observed in Boodleopsis plastome (MH591104). Its GC content in overall coding region (28.15%) is more than twice that in non-coding region (12.82%), and it was higher than that in coding regions of Ulva plastomes (26.53 - 27.71%).

Considering the high energy consumption and high nitrogen demand for GTP and CTP synthesis and the shortening of the sequence length in most PCGs (Mann and Chen, 2010), the advantage conferred by selection against GC observed in Ulva plastomes not in other ulvophycean plastomes seems to be more effective in saving the energy cost and serving photosynthesis and biomass synthesis in Ulva species, which is more conducive to supporting rapid and abundant growth of Ulva species. Meanwhile, the GC content (32.17 - 38.84%) in mitogenomes of Ulva species (Liu et al., 2022a; Liu et al., 2022b) does not show a significant difference from those in Ulotrichales and Oltmannsiellopsidales (Turmel et al., 2016). This strong selection of A + T preference seen in plastomes does not appear in nuclear genomes of Ulva species (e.g. 57.2% in U. mutabilis and 57.3% in U. compressa) (De Clerck et al., 2018; Osorio et al., 2022).

Distribution and diversity of Ulva plastome introns

These newly sequenced Ulva plastomes harbor different intron contents ranging from two in U. aragoënsis (Uar1) to 16 in U. meridionalis (Ume), occupying 3.5 - 17.1% of cpDNAs, which are in the range of the reported Ulva plastomes (Liu and Melton, 2021). To further understand the evolutionary trend of Ulva chloroplast introns in distribution and diversity, we systematically excavated and compared introns at intrageneric level. A total of 34 intron families were found among these 40 known Ulva chloroplast genomes. Among them, 33 intron families were found by RNAweasel (Lang et al., 2007), and only one (intron rns-476) was detected by alignment of homologous gene sequences (Table 2).

Table 2

Intron names *Intron groupsIntron number
(n)
Intron-encoded proteins
(domain)
LAGLIDADG
motif
Avg. intron size
(SD, bp)
Avg. GC
(SD, %)
atpA-492IB (complete)17LAGLIDADGdouble1168 (16)23.24 (0.60)
atpB-537IIB1RTM235533.38
atpB-627IIB15RTM2225 (13)36.79 (0.93)
atpB-696IIB17RTM2372 (12)36.52 (0.52)
atpI-256IIB1RTM225236.23
infA-62II (derived)40 (40) **616 (68)22.69 (1.20)
petB-23IIB3RTM2316 (1)34.66 (0.52)
petB-69IIB20 (1) **RTM2207 (92)35.38 (0.94)
petB-169IIB2RTM245934.85
petB-277IIB6RTM2447 (13)36.37 (0.41)
petB-528IB (complete)14 (1) **LAGLIDADGdouble1265 (14)23.94 (0.51)
petD-87IIA4RTM2428 (11)36.23 (0.03)
psaA-1104IB (complete)3LAGLIDADGdouble1238 (1)19.43 (0.11)
psaA-1605IB (complete)2LAGLIDADGdouble109622.45
psaB-1050IB (complete)15LAGLIDADGdouble1123 (20)22.59 (0.45)
psbA-179I (derived, B1)1LAGLIDADGsingle69528.92
psbA-750I (derived, B1)3 (1) **T5orf172752 (399)32.27 (1.22)
psbB-489I (derived, A)8 (2) **GIY-YIG880 (250)24.61 (1.43)
psbB-600IB (complete)8LAGLIDADGdouble1301 (41)24.58 (1.00)
psbB-772I (derived, A)3 (3) **367 (35)27.25 (2.55)
psbB-1022I (derived, B1)3HNH959 (9)25.36 (0.69)
psbB-1352I (derived, B1)2HNH947 (10)24.19 (1.44)
psbC-496IIB1RTM244136.71
psbC-708IA4LAGLIDADGsingle986 (5)26.11 (1.22)
psbC-882I (derived, A)4GIY-YIG926 (4)24.72 (0.52)
psbD-740I (derived, A)11GIY-YIG1039 (21)24.64 (0.77)
psbD-1034I (derived, A)2GIY-YIG1006 (108)24.28 (1.13)
rnl-1893aIB (complete)19LAGLIDADGsingle765 (3)30.38 (1.01)
rnl-1893bIB (complete)3LAGLIDADGdouble1007 (5)25.03 (0.40)
rnl-2225IB (complete)22LAGLIDADGsingle966 (24)26.79 (0.26)
rnl-2463IB (complete)4LAGLIDADGsingle1013 (1)26.40 (0.22)
rnl-2556I (derived, B2)5LAGLIDADGsingle757 (24)27.66 (0.43)
rns-476I (unknown)4LAGLIDADGsingle1007 (3)24.39 (0.19)
rns-499IA35 (3) **LAGLIDADGdouble938 (779)27.63 (4.10)

General features of Ulva chloroplast introns detected among the 40 Ulva plastomes.

* Intron names were defined as host gene plus insertion site which was determined by comparing homologous genes relative to the plastome of U. compressa (MW353781) (Liu and Melton, 2021).

** Numbers in parentheses indicate the number of introns with severe IEP degradation or loss.

The Ulva chloroplast introns were detected at 33 insertion sites of 14 host genes including atpA (1 site), atpB (3), atpI (1), infA (1), petB (5), petD (1), psaA (2), psaB (1), psbA (2), psbB (5), psbC (3), psbD (2), rnl (4) and rns (2) (Table 2). Obviously, intron densities varied widely among chloroplast genes at interspecific and intraspecific levels (Supplementary Figure 1). These introns were mainly distributed in two rRNA genes (rnl, and rns) and some more conserved PCGs involving photosystem I and II, electron transport and ATP synthesis, indicating that different functional groups of genes have different propensities for intron insertion. Further comparative analysis showed that introns preferentially resided in conserved housekeeping genes with high GC content (usually more than 34%) and long size (usually more than 0.5 kb) (Figure 4). We speculate that this may be related to higher GC content in target site sequences required for IEP recognition in host genes and more target sites contained by long GC-rich genes. However, some PCGs (e.g. rbcL, and tufA) with the above similar characteristics have high expression in chloroplasts and tend to resist intron invasion to economically and effectively ensure unnecessary consumption and time cost in transcription and processing (Jeffares et al., 2006).

A total of 23 intron families were observed to belong to group I introns, and the remaining 11 were group II introns. Eight intron families including intron atpB-537, psaA-1605, psbA-179, psbB-1022, psbB-1352, psbC-708, psbD-1034 and rnl-1893b were found for the first time in Chlorophyta. The size and GC content of different intron families fluctuate markedly in Ulva plastomes (Table 2), which are largely determined by type of introns and degeneration degree of intron-encoded proteins (IEPs) (Figure 5A). Except for the degenerated group II (derived) intron infA-62 (Liu and Melton, 2021), the size of all other group II introns is significantly longer than that of group I introns, due to the different size of their IEP genes (Table 2). The GC content is positively correlated with group II intron size with the excellent coefficient of determination (R2 = 0.9809) (Figure 5B). Group IIA/IIB introns have the high GC content ranging from 33.38% in intron atp-537 to 36.79 ± 0.93% in intron atpB-627, while the GC content of group II (derived) intron infA-62 is only 22.69 ± 1.20% mainly due to the loss of the entire IEP (Table 2). Contrary to the group II introns, there is a weak negative correlation between GC content and size in group I intron (Figure 5B). The GC content in group I introns shows complex changes, which is mainly due to the diversity of types (related to secondary structure) and IEPs.

Figure 5

Group IIA/IIB introns harbor a reverse transcriptase/maturase (RTM) gene in Ulva plastomes. The vast majority (93.8%) of group I introns encode an intact IEP which is the member of the LAGLIDADG or GIY-YIG or HNH homing endonuclease (LHE or GHE or NHE) families. However, the IEPs from intron psbA-750 do not exhibit significant sequence similarity to common homing endonuclease families (e.g. LHE, GHE, and NHE), but contain a conserved T5orf172 domain which occurs in a stand-alone protein form in phage, virus and bacteria and is also found in DNA-binding regulatory proteins of bacterial and eukaryotic DNA viruses (Iyer et al., 2002). All chloroplast GHEs were encoded by group I (derived, A) introns, while the IEPs encoded by group I (derived, B1) introns showed diverse protein types including NHE, LHE, and T5orf172 domain-containing homing endonuclease (THE). All of chloroplast group IB introns and other group I introns encoded an LHE with one or two LAGLIDADG motifs.

Almost all introns displayed sporadic distribution pattern in Ulva plastomes, due to their nature of homing and mobility. Only the chloroplast intron infA-62 is an exception. This intron is shared by all Ulva plastomes, but absent in Blidingia cpDNAs, indicating that it might be acquired after its divergence from Blidingia. This intron has completely lost the ability to move and has been trapped in infA, because of its severe degeneration and the loss of IEP. This intron co-evolved with infA and showed a faster evolution rate than the host gene. Introns from the same insertion site were previously observed to be homologous among organelle genomes in Ulva (Liu and Melton, 2021; Liu et al., 2022a). However, two different intron families, intron rnl-1893a and rnl-1893b, were found to be present in the same insertion site. Although both of them belong to group IB intron and share similar secondary structure of ribozyme components, but their primary sequences and IEPs are markedly different. The IEPs in intron rnl-1893a were LHEs with only one LAGLIDADG motif, while those in intron rnl-1893b contained two LAGLIDADG motifs (Table 2). These facts indicated that these two different LHEs should recognize the same target site in rnl, although they can be a homodimer and a monomer (Haugen et al., 2005), respectively.

Novel insights into integration of foreign sequences and rearrangement of Ulva plastomes

Comparison of plastome intergenic regions shows that Ulva cpDNAs experienced frequent insertion of foreign DNA sequences which usually harbor some specific open reading frames (orfs), as were important indicators for insight into their source. The largest U. meridionalis (Ume) cpDNA contained 14.6-kb foreign sequence which encoded 15 specific orfs (Supplementary Figure 2), accounting for 11.9% of plastome. To elucidate the origin of exogenous sequences and their relationships, we systematically compared the sequence characteristics of large intergenic regions and the distribution of free-standing orfs among these 40 Ulva plastomes. A total of 154 specific free-standing orfs as well as many homologous residue DNA sequences of some specific orfs, which have no similarity to chloroplast canonical genes, were detected in intergenic regions of these 40 Ulva cpDNAs. These specific orfs were not randomly distributed but mainly located in some specific intergenic regions (e.g. psbA-psbB, trnT-psbA, psbB-psbD, psbC-psbB, trnS2-psbC, trnM3-psbD, psbC-trnM3, trnL2-psbD, trnL2-trnM3, trnM1-trnE, and trnW-psaJ) (Liu and Melton, 2021), indicating that these intergenic regions should be hot spots for the invasion of foreign sequences.

Some foreign DNA sequences integrated into different intergenic regions of Ulva plastomes harbor homologous orfs with significant high similarity (Supplementary Table 3). These facts indicated that these foreign sequences could have been derived from the same origin. It is very similar to the finding in Ulva mitogenomes where the derived foreign sequences mainly originated from mitochondrial plasmid DNA (Liu et al., 2022a; Liu et al., 2022b). Among these specific chloroplast homologous orfs, three classes of orfs show high similarities to the full length or partial sequences of putative bacterial tyrosine-type recombinase/integrase (tri), NAD-dependent DNA ligase (lig), and phage/plasmid DNA primase (Supplementary Table 3), respectively, based on tblastn search, which were also detected in cpDNAs of some siphonous green algae (Bryopsidales) (Leliaert and Lopez-Bautista, 2015; Cremen et al., 2018). It is worth noting that a new class of specific free-standing orfs was found only in Ulva plastomes, which we named Ucp-orf. A total of 29 full-length Ucp-orfs were detected in 17 of 40 Ulva plastomes, which belonged to different Ulva lineages (Figure 6), and none of such orf was found in other ulvophyceaen cpDNAs sequenced thus far. Interestingly, all of Ucp-orfs reside in several intergenic regions where genome rearrangement occurred. In the Ume cpDNA, there are five Ucp-orfs in three intergenic regions (i.e. psbA-psbB, psbC-trnS2, and trnL2-trnM3). One conserved domain was shared by all of these Ucp-orfs (Figure 7), but no information on its function can be obtained based on blastp search. In addition, the remaining orfs contain some recognizable protein domains acting on DNA or RNA, but their functions are still unknown (Supplementary Table 3).

Figure 6

Figure 7

The plastome architecture is not as conserved as that of mitogenomes in Ulva species (Supplementary Figure 2), but has experienced several rearrangement events to varying degrees (Liu and Melton, 2021). It is worth noting that the intergenic regions where foreign sequences frequently invade match well with the regions where plastome rearrangement occurs. Six conserved gene blocks can be detected in Ulva cpDNAs by comparing the plastome structure, and chloroplast genome recombination frequently occurs in the regions on both sides of the psbD-psbC gene block, the upstream region of psbB and the downstream region of trnT (Figure 8). These regions are exactly the regions where foreign DNA sequences are inserted most frequently. It seems that the invasion of foreign fragments causes the instability of genome architecture and triggered inversion of some gene blocks in Ulva cpDNAs. The Ulva chloroplast genomes belong to IR-lacking cpDNAs, and the IR was supposed to play an important role in stabilizing the architecture of cpDNAs (Turmel and Lemieux, 2018). The invasion of foreign sequences, especially in the context of IR loss, seems to be an important driving force for Ulva genome rearrangement.

Figure 8

These acquired orfs display a different evolutionary trend from the chloroplast canonical genes. The levels of sequence divergence among homologous specific orfs or foreign DNA sequences in Ulva cpDNAs greatly exceed those observed in chloroplast genes. Frequent insertion and deletion mutations lead to serious fracture and degeneration of these sequences, accompanied by reduced GC content (Supplementary Table 2, 3). These evidences show that their existence is not a necessary requirement of Ulva chloroplast genomes, and there is no selection pressure to maintain their existence. Differential GC content can be used as an indicator to distinguish the background genome and the non-self (or introduced) DNA sequence (Mann and Chen, 2010). Because of rapid evolution and high divergence of derived foreign sequences, their GC content decreased at varying levels, as depends on their evolution time and rate after their insertion into different Ulva plastomes. Their changes from heterogeneity to homogeneity caused by rapid evolution make it difficult to distinguish some foreign sequences that completely lose coding ability from non-coding intergenic regions. The fate of these integrated foreign sequences will most likely be accelerated evolution and eventually lose, which reminds us of the similar phenomena observed in the Ulva mitogenomes where the frequently inserted plasmid-derived sequences underwent multiple mutations and rapid degeneration (Liu et al., 2022a; Liu et al., 2022b).

Novel insights into plastome architecture and phylogenomic analysis

Due to the limited data at present, it is difficult to reconstruct the plastome structure of the common ancestor of Ulva species. Based on comparative analysis of architectures in sequenced Ulvales-Sykidiales-Ulotrichales cpDNAs to date, plastomes have completely lost the IR in Ulvales (e.g. Ulva species and B. minima) (Figures 8, 9), which is a remarkable difference from those in Sykidiales (e.g. P. marina) and Ulotrichales carrying identical or non-identical IR copies (Turmel et al., 2017; Kim et al., 2019). Gene order and gene distribution show some new characteristics in these IR-lacking plastomes. In Ulva plastomes, gene clusters show a staggered distribution pattern on two strands (Figure 8), while in B. minima plastomes (MT948112 and MK408749), gene clusters tend to accumulate on one main strand and only one gene cluster (rpl20-rps8-rps4-rps9-rpl12) and 13 tRNAs are transcribed on another strand (Figure 9). We found that gene partitioning pattern changed after the loss of IR and distribution range of gene clusters became larger, indicating that genome rearrangement was more extensive and more frequent in IR-lacking Ulvales plastomes. Many new gene clusters, e.g. rpl23-rpl2-rps19-rps3-rpl16-rpl14-rpl5-rps8-infA-rpl36-rps11-rpoA-petB-petD, psaC-ycf20-ccsA-trnR1-ycf1-psbA, rns-trnI-trnA-rnl-rrn5-psaA, trnV-rpoB-rpoC1-rpoC2, ycf3-psbI-psaI-psbN, and psbB-psbT-psbH, were observed to be shared only by plastomes of Ulva and Blidingia (Figure 9), but they did not appear in plastomes of P. marina and other ulvophycean species (e.g. Pombert et al., 2005; Pombert et al., 2006; Leliaert and Lopez-Bautista, 2015; Turmel et al., 2017; Fang et al., 2021), indicating they have been formed and maintained in their common ancestor after divergence from P. marina. These findings provide important clues for us to understand genome structure and gene order of early IR-lacking plastomes in Ulvales.

Figure 9

Phylogenomic analyses of two Ulva plastome datasets (nt sequences of 100 canonical genes and aa sequences of 71 PCGs) showed that the common ancestor of Ulva species had a very early internal divergence and split into two major evolutionary lineages (Ulva I and II) (Liu and Melton, 2021). Ulva lineage I has evolved into at least three independent clades (IA, IB and IC), and Ulva lineage II into at least two clades (IIA and IIB) (Figures 10, 11). The inversion of psbD-psbC gene cluster is the most significant difference between lineage I and II in gene order, but the continuous inversion of this gene cluster makes the gene order in the Ulva intestinalis (Uin) plastome completely consistent with those in IA and IB clades (Figure 8). Ulva IA clade contains the U. linza-prolifera (LP) complex which harbor the rps19 gene with GTG start code and the Uar-Uca-Uto subclade which is the only minD-containing Ulva lineage. Plastomes in IA and IB clades shared the identical gene order, but were different from those in IC clade. In the IC plastomes, two gene clusters composed of 45 genes from psbB to trnT(ugu) and six genes (trnM3-trnD-psaB-psbM-trnH-trnS2) were inverted respectively, and then the latter had a secondary inversion in Usp1 and the psbD-psbC gene block was inverted in Ume. Like plastomes in the IC clade, the large gene cluster containing 45 genes was also inverted in the IIB clade but not in IIA (Figure 8; Supplementary Figure 3). Because of frequent inversion of gene clusters in plastomes, the gene order cannot well reflect their evolutionary relationship between Ulva species (Wang et al., 2021).

Figure 10

Figure 11

Our results of phylogenomic analysis well supported taxonomic revisions of some species names at the genomic level (Figures 10, 11), e.g. U. mutabilis Föyn, U. pertusa Kjellman and U. fasciata Delile are taxonomic synonyms of U. compressa Linnaeus (Steinhagen et al., 2019), U. australis Areschoug (Couceiro et al., 2011) and U. lactuca Linnaeus (Hughey et al., 2019), respectively. It is worth emphasizing that our results show that eight of 40 Ulva plastomes were assigned wrong species names reported at first (Liu J. et al., 2020; Wang et al., 2020; Fort et al., 2021; Hughey et al., 2021). The fact that inaccurate species identification occurred frequently leads to incorrect Ulva species names matched by DNA sequences deposited in the GenBank database. Therefore, the way to fundamentally eliminate the mismatch between species names and sequences is to use molecular marker technology, especially the comparative analysis of organelle genomes, to more clearly reveal the genotype differences between individuals of Ulva species, especially the closely related species.

Conclusion

The accumulation of Ulva cpDNA data provides us with an opportunity to decipher the unique evolution of plastomes in these globally distributed green macroalgae. In this study, more new insights into plastome evolution of Ulva species have been gained. First, Ulva plastome evolution reflects the strong selection pressure driving the compactness of genome organization and the decrease of overall genomic GC content. The overall plastome sequences including canonical genes, introns, derived foreign sequences and non-coding regions show a synergetic decrease in GC content at the varying degree. Fast degeneration of plastome sequences including non-core genes (minD and trnR3), derived foreign sequences, and noncoding spacer regions was accompanied by the marked decrease of their GC composition. Second, introns preferentially resided in conserved housekeeping genes with high GC content and long length in Ulva plastomes. It might be related to high GC content in target site sequences required for IEP recognition and more target sites contained by long GC-rich genes. Third, many foreign DNA sequences integrated into different intergenic regions harbor some homologous specific orfs with high similarity, indicating that they could have been derived from the same origin. Fourth, the invasion of foreign sequences seems to be an important driving force for plastome rearrangement in these IR-lacking Ulva cpDNAs. It seems that the invasion of foreign fragments causes the instability of genome architecture and triggered inversion of some gene blocks in Ulva cpDNAs. Finally, gene partitioning pattern changed after the loss of IR, and genome rearrangement was more extensive and more frequent in IR-lacking Ulvales plastomes. Gene clusters show a staggered distribution pattern on two strands in Ulva plastomes. Our new findings have deepened our understanding of the evolutionary trend of the plastomes in ecologically important Ulva seaweeds.

Statements

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: GenBank OL684341, OL684342, and OP985129-OP985133.

Author contributions

FL designed the study. FL, NC, HW, JL, JW and FQ performed the experiments. FL and HW performed the analysis. FL wrote the manuscript. All authors contributed to the article and approved the submitted version.

Funding

This work was financially supported by the National Natural Science Foundation of China (No. 42276133/41876165), the Strategic Priority Research Program of Chinese Academy of Sciences (No. XDA23050302/XDA23050403), the Key Research Program of Frontier Sciences, Chinese Academy of Sciences (No. QYZDB-SSW-DQC023), the Science and Technology Basic Resources Investigation Program of China (No. 2018FY100200), the Major Scientific and Technological Innovation Project of Shandong Province (No. 2019JZZY020706), and Natural Science Foundation of Shandong Province for Young Scholars (No. ZR2022QD066).

Acknowledgments

We are thankful to reviewers for their valuable comments and all staffs of marine ecological environment genomics research group in Institute of Oceanology, Chinese Academy of Sciences.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2023.1126175/full#supplementary-material

References

  • 1

    BatemanA.BirneyE.DurbinR.EddyS. R.HoweK. L.SonnhammerE. L. (2000). The pfam protein families database. Nucleic Acids Res.28, 263266. doi: 10.1093/nar/28.1.263

  • 2

    BensonG. (1999). Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res.27, 573580. doi: 10.1093/nar/27.2.573

  • 3

    BlomsterJ.BäckS.FewerD. P.KiirikkiM.LehvoA.MaggsC. A.et al. (2002). Novel morphology in Enteromorpha (Ulvophyceae) forming green tides. Am. J. Bot.89, 17561763. doi: 10.3732/ajb.89.11.1756

  • 4

    CaiC.WangL.ZhouL.HeP.JiaoB. (2017). Complete chloroplast genome of green tide algae Ulva flexuosa (Ulvophyceae, chlorophyta) with comparative analysis. PloS One12 (9), e0184196. doi: 10.1371/journal.pone.0184196

  • 5

    ChanP. P.LinB. Y.MakA. J.LoweT. M. (2021). tRNAscan-SE 2.0: improved detection and functional classification of transfer RNA genes. Nucleic Acids Res.49 (16), 90779096. doi: 10.1101/614032

  • 6

    CocquytE.VerbruggenH.LeliaertF.De ClerckO. (2010). Evolution and cytological diversification of the green seaweeds (Ulvophyceae). Mol. Biol. Evol.27 (9), 20522061. doi: 10.1093/molbev/msq091

  • 7

    CouceiroL.CremadesJ.BarreiroR. (2011). Evidence for multiple introductions of the pacific green alga Ulva australis areschoug (Ulvales, chlorophyta) to the Iberian peninsula. Bot. Mar.54 (4), 391402. doi: 10.1515/bot.2011.044

  • 8

    CremenM. C. M.LeliaertF.MarcelinoV. R.VerbruggenH. (2018). Large Diversity of non-standard genes and dynamic evolution of chloroplast genomes in siphonous green algae (Bryopsidales, chlorophyta). Genome Biol. Evol.10 (4), 10481061. doi: 10.1093/gbe/evy063

  • 9

    DarlingA. E.MauB.PernaN. T. (2010). Progressivemauve: multiple genome alignment with gene gain, loss and rearrangement. PloS One5 (6), e11147. doi: 10.1371/journal.pone.0011147

  • 10

    De ClerckO.KaoS. M.BogaertK. A.BlommeJ.FoflonkerF.KwantesM.et al. (2018). Insights into the evolution of multicellularity from the Sea lettuce genome. Curr. Biol.28 (18), 29212933. doi: 10.1016/j.cub.2018.08.015

  • 11

    Del CortonaA.LeliaertF.BogaertK. A.TurmelM.BoedekerC.JanouškovecJ.et al. (2017). The plastid genome in cladophorales green algae is encoded by hairpin chromosomes. Curr. Biol.27, 37713782. doi: 10.1016/j.cub.2017.11.004

  • 12

    de VriesJ.HabichtJ.WoehleC.HuangC.ChristaG.WägeleH.et al. (2013). Is ftsH the key to plastid longevity in sacoglossan slugs? Genome Biol. Evol.5 (12), 25402548. doi: 10.1093/gbe/evt205

  • 13

    FangJ.LiuB.LiuG.VerbruggenH.ZhuH. (2021). Six newly sequenced chloroplast genomes from trentepohliales: the inflated genomes, alternative genetic code and dynamic evolution. Front. Plant Sci.12, 780054. doi: 10.3389/fpls.2021.780054

  • 14

    FortA.McHaleM.CascellaK.PotinP.UsadelB.GuiryM. D.et al. (2021). Foliose Ulva species show considerable inter-specific genetic diversity, low intra-specific genetic variation, and the rare occurrence of inter-specific hybrids in the wild. J. Phycol.57, 219233. doi: 10.1111/jpy.13079

  • 15

    GaoD.SunZ.BiG.ZhangX. (2022). The complete plastome of Blidingia marginata and comparative analysis with the relative species in ulvales. Aquat. Bot.183, 103568. doi: 10.1016/j.aquabot.2022.103568

  • 16

    GuiryM. D.GuiryG. M. (2023). AlgaeBase. world-wide electronic publication (Galway: National University of Ireland).

  • 17

    GulbrandsenØ.S.AndresenI. J.KrabberødA. K.BråteJ.Shalchian-TabriziK. (2021). Phylogenomic analysis restructures the ulvophyceae. J. Phycol.57, 12231233. doi: 10.1111/jpy.13168

  • 18

    HallT. A. (1999). BioEdit: a user-friendly biological sequence alignment editor and analysis program for windows 95/98/NT. Nucl. Acids Symp. Ser.41, 9598. doi: 10.1021/bk-1999-0734.ch008

  • 19

    HanH.YanL.SongW.WangZ.ZhangX. (2020). Complete chloroplast genome of Ulva pertusa, one of the causal species of green macroalgal blooms in the coastal waters of qinhuangdao, China. Mitochondrial. DNA Part B.5 (1), 10841086. doi: 10.1080/23802359.2020.1723448

  • 20

    HaugenP.SimonD. M.BhattacharyaD. (2005). The natural history of group I introns. Trends Genet.21, 111119. doi: 10.1016/j.tig.2004.12.007

  • 21

    HaydenH. S.WaalandJ. R. (2002). Phylogenetic systematics of the ulvaceae (Ulvales, ulvophyceae) using chloroplast and nuclear DNA sequences. J. Phycol.38 (6), 12001212. doi: 10.1046/j.1529-8817.2002.01167.x

  • 22

    HofmannL. C.NettletonJ. C.NeefusC. D.MathiesonA. C. (2010). Cryptic diversity of Ulva (Ulvales, chlorophyta) in the great bay estuarine system (Atlantic USA): introduced and indigenous distromatic species. Eur. J. Phycol.45 (3), 230239. doi: 10.1080/09670261003746201

  • 23

    HugheyJ. R.GabrielsonP. W.MaggsC. A.MineurF. (2021). Genomic analysis of the lectotype specimens of European Ulva rigida and Ulva lacinulata (Ulvaceae, chlorophyta) reveals the ongoing misapplication of names. Eur. J. Phycol.57 (2), 143153. doi: 10.1080/09670262.2021.1914862

  • 24

    HugheyJ. R.MaggsC. A.MineurF.JarvisC.MillerK. A.ShabakaS. H.et al. (2019). Genetic analysis of the linnaean Ulva lactuca (Ulvales, chlorophyta) holotype and related type specimens reveals name misapplications, unexpected origins, and new synonymies. J. Phycol.55, 503508. doi: 10.1111/jpy.12860

  • 25

    IyerL. M.KooninE. V.AravindL. (2002). Extensive domain shuffling in transcription regulators of DNA viruses and implications for the origin of fungal APSES transcription factors. Genome Biol.3 (3), research0012. doi: 10.1186/gb-2002-3-3-research0012

  • 26

    JeffaresD. C.MourierT.PennyD. (2006). The biology of intron gain and loss. Trends Genet.22 (1), 1622. doi: 10.1016/j.tig.2005.10.006

  • 27

    JiangT.GuK.WangL.LiuQ.ShiJ.LiuM.et al. (2019). Complete chloroplast genome of Ulva prolifera, the dominant species of green macroalgal blooms in yellow Sea, China. Mitochondrial. DNA Part B.4 (1), 19301931. doi: 10.1080/23802359.2019.1610090

  • 28

    JinJ.-J.YuW.-B.YangJ.-B.SongY.dePamphilisC. W.YiT.-S.et al. (2020). GetOrganelle: a fast and versatile toolkit for accurate de novo assembly of organelle genomes. Genome Biol.21, 241. doi: 10.1186/s13059-020-02154-5

  • 29

    JonesD. T.TaylorW. R.ThorntonJ. M. (1992). The rapid generation of mutation data matrices from protein sequences. Comput. Appl. Biosci.8, 275282. doi: 10.1093/bioinformatics/8.3.275

  • 30

    KimD.LeeJ. M.ChoiJ. W.YangJ. H. (2019). Flip-flop organization in the chloroplast genome of Capsosiphon fulvescens (Ulvophyceae, chlorophyta). J. Phycol.55, 214223. doi: 10.1111/jpy.12811

  • 31

    KoboldtD. C.ChenK.WylieT.LarsonD. E.McLellanM. D.MardisE. R.et al. (2009). VarScan: variant detection in massively parallel sequencing of individual and pooled samples. Bioinformatics25, 22832285. doi: 10.1093/bioinformatics/btp373

  • 32

    KumarS.StecherG.TamuraK. (2016). MEGA7: molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol. Biol. Evol.33, 18701874. doi: 10.1093/molbev/msw054

  • 33

    LangB. F.LaforestM.-J.BurgerG. (2007). Mitochondrial introns: a critical view. Trends Genet.23, 119125. doi: 10.1016/j.tig.2007.01.006

  • 34

    LangB. F.NedelcuA. M. (2012). “Plastid genomes of algae,” in Advances in photosynthesis and respiration including bioenergy and related processes: genomics of chloroplasts and mitochondria. Eds. BockR.KnoopV. (Dordrecht: Springer), 5987.

  • 35

    LeliaertF.Lopez-BautistaJ. M. (2015). The chloroplast genomes of Bryopsis plumosa and Tydemania expeditiones (Bryopsidales, chlorophyta): compact genomes and genes of bacterial origin. BMC Genomics16, 204. doi: 10.1186/s12864-015-1418-3

  • 36

    LeliaertF.SmithD. R.MoreauH.HerronM. D.VerbruggenH.DelwicheC. F.et al. (2012). Phylogeny and molecular evolution of the green algae. Crit. Rev. Plant Sci.31, 146. doi: 10.1080/07352689.2011.615705

  • 37

    LiH.DurbinR. (2010). Fast and accurate long-read alignment with burrows-wheeler transform. Bioinformatics26, 589595. doi: 10.1093/bioinformatics/btp698

  • 38

    LinX.LiuW.WeiX.JiangP. (2022). Complete chloroplast genome of an invasive marine macroalga Ulva californica (Ulvophyceae, chlorophyta). Mitochondrial. DNA Part B.7 (7), 13371339. doi: 10.1080/23802359.2022.2098854

  • 39

    LiuF.MeltonJ. T. (2021). Chloroplast genomes of the green-tide forming alga Ulva compressa: comparative chloroplast genomics in the genus Ulva (Ulvophyceae, chlorophyta). Front. Mar. Sci.8, 668542. doi: 10.3389/fmars.2021.668542

  • 40

    LiuF.MeltonJ. T.Lopez-BautistaJ. M.ChenN. (2020). Multiple intraspecific variations of mitochondrial genomes in the green-tide forming alga, Ulva compressa Linnaeus (Ulvophyceae, chlorophyta). Front. Mar. Sci.7, 714. doi: 10.3389/fmars.2020.00714

  • 41

    LiuF.MeltonJ. T.WangH.WangJ.Lopez-BautistaJ. M. (2022a). Understanding the evolution of mitochondrial genomes in the green macroalgal genus Ulva (Ulvophyceae, chlorophyta). Front. Mar. Sci.9, 850710. doi: 10.3389/fmars.2022.850710

  • 42

    LiuF.WangH.SongW. (2022b). Tandem integration of circular plasmid contributes significantly to the expanded mitochondrial genomes of the green-tide forming alga Ulva meridionalis (Ulvophyceae, chlorophyta). Front. Plant Sci.13, 937398. doi: 10.3389/fpls.2022.937398

  • 43

    LiuJ.YangX.CuiJ.ZhuangM.ZhaoL.LiJ.et al. (2020). Complete chloroplast genome of Ulva meridionalis (Ulvales: ulvaceae): an extremely fast-growing green macroalgae. Mitochondrial. DNA Part B.5 (2), 13901392. doi: 10.1080/23802359.2020.1735967

  • 44

    MannS.ChenY. P. (2010). Bacterial genomic G + c composition-eliciting environmental adaptation. Genomics95 (1), 715. doi: 10.1016/j.ygeno.2009.09.002

  • 45

    MeltonJ. T.LeliaertF.TronholmA.Lopez-BautistaJ. M. (2015). The complete chloroplast and mitochondrial genomes of the green macroalga ulva sp. UNA00071828 (Ulvophyceae, chlorophyta). PloS One10 (4), e0121020. doi: 10.1371/journal.pone.0121020

  • 46

    MeltonJ. T.Lopez-BautistaJ. M. (2017). The chloroplast genome of the marine green macroalga Ulva fasciata delile (Ulvophyceae, chlorophyta). Mitochondrial. DNA Part A.28 (1), 9395. doi: 10.3109/19401736.2015.1110814

  • 47

    MinhB. Q.NguyenM. A. T.von HaeselerA. (2013). Ultrafast approximation for phylogenetic bootstrap. Mol. Biol. Evol.30 (5), 11881195. doi: 10.1093/molbev/mst024

  • 48

    MitsuhashiC.TeramuraH.ShimadaH. (2020). Construction of genomic marker sets based on the chloroplast genome of a green alga, Ulva pertusa (syn. Ulva australis), leads to simple detection of Ulva species. Genes Genet. Systems.95 (2), 5563. doi: 10.1266/ggs.19-00054

  • 49

    OsorioH.Tapia-ReyesP.EspinozaD.LaporteD.GonzálezA.Castro-NallarE.et al. (2022). The genome of the marine alga Ulva compressa (Chlorophyta) reveals protein-coding genes with similarity to plants and green microalgae, but also to animal, bacterial, and fungal genes. Int. J. Mol. Sci.23, 7279. doi: 10.3390/ijms23137279

  • 50

    Paysan-LafosseT.BlumM.ChuguranskyS.GregoT.PintoB. L.SalazarG. A.et al. (2023). InterPro in 2022. Nucleic Acids Res.51 (D1), D418D427. doi: 10.1093/nar/gkac993

  • 51

    PombertJ.-F.LemieuxC.TurmelM. (2006). The complete chloroplast DNA sequence of the green alga Oltmannsiellopsis viridis reveals a distinctive quadripartite architecture in the chloroplast genome of early diverging ulvophytes. BMC Biol.4, 3. doi: 10.1186/1741-7007-4-3

  • 52

    PombertJ.-F.OtisC.LemieuxC.TurmelM. (2005). The chloroplast genome sequence of the green alga Pseudendoclonium akinetum (Ulvophyceae) reveals unusual structural features and new insights into the branching order of chlorophyte lineages. Mol. Biol. Evol.22 (9), 19031918. doi: 10.1093/molbev/msi182

  • 53

    RobinsonJ. T.ThorvaldsdóttirH.WincklerW.GuttmanM.LanderE. S.GetzG.et al. (2011). Integrative genomics viewer. Nat. Biotechnol.29, 2426. doi: 10.1038/nbt.1754

  • 54

    ShiblA. A.IsaacA.OchsenkühnM. A.CárdenasA.FeiC.BehringerG.et al. (2020). Diatom modulation of select bacteria through use of two unique secondary metabolites. P. Natl. Acad. Sci. U.S.A.117, 2744527455. doi: 10.1073/pnas.2012088117

  • 55

    SmithD. R. (2017). Evolution: in chloroplast genomes, anything goes. Curr. Biol.27 (24), R1305R1307. doi: 10.1016/j.cub.2017.10.049

  • 56

    SteinhagenS.WeinbergerF.KarezR. (2019). Molecular analysis of Ulva compressa (Chlorophyta, ulvales) reveals its morphological plasticity, distribution and potential invasiveness on German north Sea and Baltic Sea coasts. Eur. J. Phycol.54 (1), 102114. doi: 10.1080/09670262.2018.1513167

  • 57

    SuzukiS.YamaguchiH.HiraokaM.KawachiM. (2018). Mitochondrial and chloroplast genome sequences of Ulva ohnoi, a green-tide forming macroalga in the southern coastal regions of Japan. Mitochondrial. DNA Part B.3 (2), 765767. doi: 10.1080/23802359.2018.1483778

  • 58

    ThompsonJ. D.GibsonT. J.PlewniakF.JeanmouginF.HigginsD. G. (1997). The ClustalX windows interface flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res.25, 48764882. doi: 10.1093/nar/25.24.4876

  • 59

    TrifinopoulosJ.NguyenL. T.von HaeselerA.MinhB. Q. (2016). W-IQ-TREE: a fast online phylogenetic tool for maximum likelihood analysis. Nucleic Acids Res.44 (W1), W232W235. doi: 10.1093/nar/gkw256

  • 60

    TurmelM.LemieuxC. (2018). Evolution of the plastid genome in green algae. Adv. Bot. Res.85, 157193. doi: 10.1016/bs.abr.2017.11.010

  • 61

    TurmelM.OtisC.LemieuxC. (2016). Mitochondrion-to-chloroplast DNA transfers and intragenomic proliferation of chloroplast group II introns in Gloeotilopsis green algae (Ulotrichales, ulvophyceae). Genome Biol. Evol.8 (9), 27892805. doi: 10.1093/gbe/evw190

  • 62

    TurmelM.OtisC.LemieuxC. (2017). Divergent copies of the large inverted repeat in the chloroplast genomes of ulvophycean green algae. Sci. Rep.7, 994. doi: 10.1038/s41598-017-01144-1

  • 63

    WangL.CaiC.ZhouL.HeP.JiaoB. (2017). The complete chloroplast genome sequence of Ulva linza. conserv. Genet. Resour.9, 463466. doi: 10.1007/s12686-016-0682-0

  • 64

    WangY.LiuF.LiuX.ShiS.BiY.MoejesF. W. (2019). Comparative transcriptome analysis of four co-occurring Ulva species for understanding the dominance of Ulva prolifera in the yellow Sea green tides. J. Appl. Phycol.31, 33033316. doi: 10.1007/s10811-019-01810-z

  • 65

    WangH.LiuF.WangJ.ChenN. (2021). Phylogenomic analysis of the chloroplast genome of the green-tide forming macroalga Ulva intestinalis Linnaeus (Ulvophyceae, chlorophyta). Mitochondrial. DNA Part B.6 (10), 30523054. doi: 10.1080/23802359.2021.1978889

  • 66

    WangJ.WangY.WangH.LiuF.ChenN. (2020). Comparative chloroplast genome analysis of new-green tide causative species in China: Ulva laetevirens. Oceanol. Et Linnol. Sinica.52 (5), 113.

  • 67

    WenQ.YangW.LiJ.LiuJ.ZhaoS.GaoS.et al. (2022). Characterization of complete chloroplast genome of Ulva torta (Mertens) trevisa. Mitochondrial. DNA Part B.7 (6), 10411043. doi: 10.1080/23802359.2022.2081943

  • 68

    XiaL.QinY.LiuJ.ZhangH.WuL.GaoS.et al. (2021). Complete chloroplast genome of Ulva compressa (Ulvales: ulvaceae). Mitochondrial. DNA Part B.6 (3), 720722. doi: 10.1080/23802359.2020.1860696

  • 69

    ZhuH.HuY.LiuF.HuZ.LiuG. (2019). Characterization of the chloroplast genome of Trentepohlia odorata (Trentepohliales, chlorophyta), and discussion of its taxonomy. Int. J. Mol. Sci.20, 1774. doi: 10.3390/ijms20071774

  • 70

    ZukerM. (2003). Mfold web server for nucleic acid folding and hybridization prediction. Nucleic Acids Res.31, 34063415. doi: 10.1093/nar/gkg595

Summary

Keywords

chloroplast genome, Ulvophyceae, comparative genomics, GC content, group I/II intron, genome rearrangement

Citation

Liu F, Chen N, Wang H, Li J, Wang J and Qu F (2023) Novel insights into chloroplast genome evolution in the green macroalgal genus Ulva (Ulvophyceae, Chlorophyta). Front. Plant Sci. 14:1126175. doi: 10.3389/fpls.2023.1126175

Received

17 December 2022

Accepted

03 April 2023

Published

18 April 2023

Volume

14 - 2023

Edited by

Enrico Vito Perrino, International Centre for Advanced Mediterranean Agronomic Studies, Italy

Reviewed by

Zoltan Fussy, Charles University, Czechia; Eric Wade Linton, Central Michigan University, United States; Huan Zhu, Institute of Hydrobiology (CAS), China

Updates

Copyright

*Correspondence: Feng Liu, ;

This article was submitted to Plant Systematics and Evolution, a section of the journal Frontiers in Plant Science

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics