Abstract
To understand the evolutionary driving forces of chloroplast (or plastid) genomes (plastomes) in the green macroalgal genus Ulva (Ulvophyceae, Chlorophyta), in this study, we sequenced and constructed seven complete chloroplast genomes from five Ulva species, and conducted comparative genomic analysis of Ulva plastomes in Ulvophyceae. Ulva plastome evolution reflects the strong selection pressure driving the compactness of genome organization and the decrease of overall GC composition. The overall plastome sequences including canonical genes, introns, derived foreign sequences and non-coding regions show a synergetic decrease in GC content at varying degrees. Fast degeneration of plastome sequences including non-core genes (minD and trnR3), derived foreign sequences, and noncoding spacer regions was accompanied by the marked decrease of their GC composition. Plastome introns preferentially resided in conserved housekeeping genes with high GC content and long length, as might be related to high GC content of target site sequences recognized by intron-encoded proteins (IEPs), and to more target sites contained by long GC-rich genes. Many foreign DNA sequences integrated into different intergenic regions contain some homologous specific orfs with high similarity, indicating that they could have been derived from the same origin. The invasion of foreign sequences seems to be an important driving force for plastome rearrangement in these IR-lacking Ulva cpDNAs. Gene partitioning pattern has changed and distribution range of gene clusters has expanded after the loss of IR, indicating that genome rearrangement was more extensive and more frequent in Ulva plastomes, which was markedly different from that in IR-containing ulvophycean plastomes. These new insights greatly enhance our understanding of plastome evolution in ecologically important Ulva seaweeds.
Introduction
The green algal class Ulvophyceae harbors at least 13 orders and more than 2700 species thus far, and ranks second in the number of species among Chlorophyta only next to the class Chlorophyceae (). Species in the Ulvophyceae show great diversification of cytological types and morphological complexity, which varied from small unicellular species (e.g. Scotinosphaerales), to large multicellular thalli composed of uninucleate cells (e.g. Ulvales) or multinucleate cells (e.g. Cladophorales), to the gigantic single-celled coenocytic thalli (e.g. Bryopsidales and Dasycladales) (; ; ). Meanwhile, their chloroplast or plastid genomes (plastomes, or cpDNAs, or ptDNAs) display miraculous variations in genome architecture, genome size, GC content, gene density, intron content and gene order (; ; ), ranging from the circular 195.9-kb plastome with two inverted repeats (IRs) in Pseudendoclonium akinetum (Ulotrichales) (), which is the first ulvophycean cpDNA sequenced, to the 34 multiple hairpin cpDNA chromosomes in Boodlea composita (Cladophorales) with high GC content (average 57%) (), from the compact 74.5-kb IR-lacking plastome in Callipsygma wilsonis (Bryopsidales) () to the approximately 2000-kb plastome in Acetabularia acetabulum (Dasycladales) with a high noncoding content (more than 85%) ().
The green macroalgal genus Ulva Linnaeus 1753 (Ulvophyceae, Chlorophyta) is the species-richest genus in the Ulvales. As more Ulva species have been accurately identified recently, 102 species names have been flagged as accepted taxonomically up to now (). Globally, many Ulva seaweeds (e.g. Ulva prolifera, Ulva compressa, and Ulva meridionalis) are notorious for their rapid vegetative growth in eutrophic waters, leading to green tides formed by the accumulation of excess biomass (; ). Ulva simple morphologies show high similarity at the interspecific level, meanwhile cytological and morphological features could vary greatly at the intraspecific level, thus accurate identification of Ulva species has been challenging (e.g. ; ). The use of molecular markers (e.g., ITS, rbcL, and tufA) for species identification has become the mainstream method to ensure the accuracy and credibility of identification results (e.g. ; ; ). However, due to the limited differentiation signals of these marker sequences, their resolution is inadequate for identifying closely related Ulva species. Organelle genomes (cpDNAs and mtDNAs) as super molecular markers have been proved to be powerful to understand the evolution and molecular species concepts in the genus Ulva, and are potential resources for developing specific high-resolution molecular markers (). Recently, phylogenomic analysis based on organelle genome data clearly depicted the evolutionary nature of double crown radiation in the phylogeny and speciation of Ulva species (; ; ).
The data of Ulva plastomes have accumulated rapidly recently based on efficient high-throughput sequencing technology (; ; ), which makes it possible for more accurately understanding of the evolution trend of Ulva plastomes on a more detailed and specific sampling scale. A total of 33 plastomes from 17 Ulva species have been documented in the GenBank database thus far. The sequenced Ulva plastomes show many unique features when compared with the counterparts in other ulvophycean lineages. These Ulva plastomes belong to the compact circular IR-lacking plastomes with the smaller size (86.73 - 119.87 kb) and the lowest GC content (23.89 - 26.25%) within the Ulvophyceae (Table 1). Variations in the Ulva plastome size at interspecific and intraspecific level were mainly caused by differences in content of group I/II introns, integration of foreign DNA fragments and content of intergenic regions (). The Ulva plastomes show high conservation in repertoire of canonical genes, and share the same set of 100 core genes including 71 protein-coding genes (PCGs), three ribosomal RNA (rRNA) genes and 26 transfer RNA (tRNA) genes (). The organelle division inhibitor factor gene, minD, was observed to be present only in the plastome of Ulva aragoënsis (), which used to be regarded as Ulva flexuosa (). Only one group II (derived) intron (intron infA-62) were shared by all sequenced Ulva plastomes, and all other introns displayed highly variable and sporadic distribution pattern. Ulva plastome architectures were dynamic and plastic not conserved at the intrageneric level due to frequent genome rearrangements (; ).
Table 1
| Lineage | Subclade | Species | Abbr. | Accession number | Size (bp) | GC (%) | References |
|---|---|---|---|---|---|---|---|
| Ulva I | IA | Ulva prolifera | Upr1 | OP985129 | 93,066 | 24.78 | This study |
| Ulva prolifera | Upr2 | OP985130 | 93,066 | 24.78 | This study | ||
| Ulva prolifera | Upr3 | OP985131 | 93,072 | 24.78 | This study | ||
| Ulva prolifera | Upr4 | KX342867 | 93,066 | 24.78 | |||
| Ulva prolifera | Upr5 | MZ571508 | 99,724 | 25.28 | GenBank | ||
| Ulva linza | Uli | KX058323 | 86,726 | 24.79 | |||
| Ulva torta | Uto1 | OL684342 | 112,034 | 24.89 | This study | ||
| Ulva torta | Uto2 | MZ703011 | 105,423 | 25.24 | |||
| Ulva californica | Uca | MZ561475 | 92,126 | 24.71 | |||
| Ulva aragoënsis | Uar1 | OP985132 | 87,172 | 24.68 | This study | ||
| Ulva aragoënsis (Ulva flexuosa*) | Uar2 | KX579943 | 89,414 | 24.97 | |||
| IB | Ulva gigantea | Ugi | MT179350 | 117,606 | 25.73 | ||
| Ulva lactuca (syn. Ulva fasciata) | Ula1 | KT882614 | 96,005 | 24.87 | |||
| Ulva lactuca | Ula2 | MH730972 | 95,997 | 24.87 | |||
| Ulva ohnoi | Uoh | AP018696 | 103,313 | 25.44 | |||
| Ulva lacinulata (Ulva laetevirens*) | Ulc1 | MT179351 | 103,444 | 25.40 | |||
| Ulva lacinulata | Ulc2 | MW543061 | 107,242 | 25.82 | |||
| Ulva lacinulata (Ulva laetevirens*) | Ulc3 | MW531676 | 110,889 | 25.63 | |||
| Ulva lacinulata (Ulva rigida*) | Ulc4 | MN389525 | 103,523 | 25.40 | |||
| Ulva sp. A AF-2021 (Ulva rigida*) | Usp2 | MT179352 | 96,673 | 24.57 | |||
| IC | Ulva meridionalis | Ume | OP985133 | 122,172 | 24.86 | This study | |
| Ulva sp. UNA00071828 | Usp1 | KP720616 | 99,983 | 25.30 | |||
| Ulva tepida | Ute | OL684341 | 94,449 | 24.49 | This study | ||
| Ulva sp. Q253 | Usp3 | MW699788 | 88,801** | 23.89 | GenBank | ||
| Ulva sp. (Ulva prolifera*) | Usp3 | MN853879 | 88,801 | 23.89 | GenBank | ||
| Ulva sp. (Ulva meridionalis*) | Usp3 | MN889540 | 88,653 | 23.91 | |||
| Ulva II | IIA | Ulva compressa | Uco1 | MW548841 | 114,291 | 26.23 | |
| Ulva compressa | Uco2 | MW344287 | 91,189 | 25.86 | |||
| Ulva compressa | Uco3 | MW353781 | 96,824 | 26.17 | |||
| Ulva compressa (syn. Ulva mutabilis) | Uco4 | MK069584 | 119,866 | 26.24 | GenBank | ||
| Ulva compressa | Uco5 | MK069585 | >89,164 | 26.25 | GenBank | ||
| Ulva compressa | Uco6 | MT916929 | 94,226 | 25.80 | |||
| Ulva compressa | Uco7 | KX595275 | 96,808 | 26.18 | GenBank | ||
| Ulva intestinalis | Uin | MZ158703 | 99,041 | 24.97 | |||
| IIB | Ulva rigida (Ulva rotundata*) | Uri1 | MT179353 | 118,206 | 26.12 | ||
| Ulva rigida | Uri2 | MW543060 | 117,995 | 26.13 | |||
| Ulva fenestrata | Ufe | MT179349 | 94,654 | 25.27 | |||
| Ulva australis (syn. Ulva pertusa) | Uau1 | MN853875 | 104,380 | 25.66 | |||
| Ulva australis | Uau2 | LC507117 | 102,899 | 25.33 | |||
| Ulva australis | Uau3 | MT179348 | 99,820** | 25.21 |
The sequenced 40 plastomes of Ulva species for comparative analysis.
* The Ulva plastomes with wrong species name assignment, which were deposited in the GenBank database, have been corrected. Ulva laetevirens (MT179351), Ulva rigida (MT179352), and Ulva rotundata (MT179353) have been corrected to Ulva lacinulata (MT179351), Ulva sp. A AF-2021 (MT179352), and Ulva rigida (MT179353), respectively ().
In this study, we sequenced and constructed seven complete plastomes from five Ulva species including Ulva prolifera O.F.Müller, Ulva aragoënsis (Bliding) Maggs, Ulva torta (Mertens) Trevisan, Ulva tepida Y. Masakiyo & S. Shimada, and Ulva meridionalis R. Horimoto & S. Shimada, and conducted comparative plastomic analysis to understand the evolutionary driving forces in ecologically important Ulva seaweeds.
Materials and methods
Sample collection and DNA extraction
Three free-floating algal samples of Ulva prolifera O.F.Müller (LF001, LF002 and LF003) were collected on 2 Jul. 2021 at the First (N36°05’53’’, E120°34’17’’), Second (N36°04′96″, E120°34′83″) and Third (N36°05′03″, E120°36′82″) bathing beaches along the coast of Qingdao, Shandong, China, respectively. The sessile samples of Ulva aragoënsis (Bliding) Maggs (LF005) and Ulva tepida Y.Masakiyo & S.Shimada (LF006) were collected on 11 Aug. 2021 at the Trestle Bridge (N36°06′09″, E120°31′08″) and the First bathing beach (N36°05′34″, E120°33′81″) along the coast of Qingdao, Shandong, China, respectively. The free-floating algal thalli of Ulva torta (Mertens) Trevisan (LF007) and Ulva meridionalis R.Horimoto & S.Shimada (LF010) were sampled on 4 Aug. 2021 in the Sakura Lake (37°07’30’’-56’’N, 122°27’03’’-50’’E), Rongcheng, Shandong, China. These Ulva samples were stored in coolers (5-8°C) after collection and transported back to the laboratory within 48 hours. Algal thallus for each individual Ulva thallus was cultured in a 9-cm diameter Petri dish containing 25-mL L1 medium with 0.5‰ GeO2, 50 µg/mL Dipterex (Fengcheng Animal Medicine Co., Ltd, China) and a suite of antibiotics (per mL: 50 µg streptomycin, 66.6 µg gentamycin, 20 µg ciprofloxacin, 2.2 µg chloramphenicol, and 100 µg ampicillin) (). The culture was maintained at 18°C, 100 - 120 μmol photons m-2 s-1 in the photoperiod of 12 h light: 12 h darkness in a GXZ-380C temperature-controlled incubator (Ningbo Jiangnan, China). After at least one week of culture, fresh algal tissue from each Ulva thallus was used for DNA extraction using a Plant Genome DNA Kit (DP305, Tiangen Biotech, Beijing, China) according to the manufacturer’s instructions. Species identification was conducted based on phylogenetic analyses of two common DNA marker datasets, including the nuclear ITS region including the 5.8S rDNA gene, and the chloroplast rbcL gene (; ).
DNA sequencing and plastome assembly
The quality and concentration of total genomic DNA extracted were checked using a NanoPhotometer spectrophotometer (Implen, CA, USA), and a Qubit 2.0 Flurometer (Life Technologies, CA, USA), respectively. Qualified DNA samples were fragmented into 350 bp by Covaris S220 ultrasonic crater for library construction. The libraries were sequenced on an Illumina NovaSeq platform (Illumina, USA) using paired-end sequencing, yielding about 10 Gb sequencing raw data of paired-end reads with 150 bp in length for each Ulva sample. Clean data were harvested by trimming sequencing adapters and removing short or low-quality reads from the raw data. Complete Ulva plastomes were constructed by the GetOrganelle v1.7.1 (). The plastome of U. compressa (MW353781) was used as the reference genome for assembly. Plastome assemblies were re-examined by aligning reads against the assembled plastome sequence using the MEM algorithm of BWA v0.7.17 (). VarScan v2.3.9 () and IGV v2.8.12 () were employed to examine mutation sites and to verify assembly results, respectively.
Annotation of Ulva plastomes
Protein-coding genes (PCGs) were annotated by using the Open Reading Frame Finder at the National Center for Biotechnology Information (NCBI) website (https://www.ncbi.nlm.nih.gov/orffinder/), and by aligning homologous PCGs from Ulva cpDNAs deposited in the GenBank database with the newly sequenced Ulva plastomes. Transfer RNA genes (tRNAs) were searched for by reconstructing their cloverleaf structures using the tRNAscan-SE 2.0 software with default parameters (). Ribosomal RNA genes (rRNAs) were identified by using the RNAweasel (https://megasun.bch.umontreal.ca/apps/rnaweasel/), and by aligning homologous rRNAs. The free-standing and intronic open reading frames (orfs) were found by using the Open Reading Frame Finder at the NCBI website. Intron insertion-sites were determined manually by aligning the intron-containing homologous genes, and corresponding genes in the U. compressa (MW353781) plastome were used as a reference (). Intron name was defined as host gene plus insertion site. The class and core structure of all these introns were determined by using the RNAweasel and Mfold (). The core domains of intron-encoded proteins (IEPs) and free-standing specific ORFs were determined by significant Pfam-A matches (). To ensure the accuracy of comparative analysis, we have re-annotated the plastomes of Ulva species and Blidingia minima (MK408749 and MT948112) deposited in the GenBank database with the same method as above. All annotation results (including genes and introns) were manually verified. In some Ulva plastomes (e.g. MZ561475, MK069585, MT916929, and KX342867), incorrect annotations and abnormal sequence errors have been corrected in our subsequent comparative analysis.
Plastome comparison and phylogenetic analysis
Base composition of Ulva plastomes and other DNA sequences was determined by using MEGA 7.0 (). Tandem repeats were analyzed by using Tandem Repeats Finder with parameter settings of two for matches and seven for mismatches and indels (). Differences and identity values of DNA sequences were calculated by use of BioEdit v7.1.9 (). Synteny analysis of Ulva plastomes was executed by using Mauve v2.3.1 software with default parameters (). A new class of specific orfs named Ucp-orf was found in Ulva plastomes. The thorough search for Ucp-orf-like sequences in Ulva plastomes was conducted against the NCBI nucleotide database. A total of 29 full-length Ucp-orfs were detected in 17 of 40 Ulva cpDNAs. Multiple sequence alignments of Ucp-ORFs were conducted by using ClustalX 1.83 with the default settings (). The structural domain or motif in Ucp-ORFs was searched on the HMMER website (https://www.ebi.ac.uk/Tools/hmmer/search/phmmer) and using InterProScan tool (). The phylogenetic relationships were inferred with the Maximum Likelihood (ML) method based on the JTT matrix-based model () by using MEGA 7.0 (). There was a total of 113 positions in the final dataset of Ucp-ORFs. Phylogenomic trees were constructed based on two plastome datasets including nucleotide (nt) sequences of 100 common genes and amino acid (aa) sequences of 71 common PCGs from 40 Ulva. The nt sequences of 100 genes and the aa sequences of 71 PCGs were individually aligned, checked and concatenated using ClustalX 1.83 with default settings (). Maximum-likelihood trees were constructed by IQ-TREE () using default parameters with 1000 ultrafast bootstrap analysis (). The substitution model conducted by IQ-TREE was GTR+G for nt dataset and cpREV+F+I+G4 for aa dataset, respectively. Blidingia minima (MK408749 and MT948112) was used as the outgroup.
Results and discussion
Ulva plastomes show a clear evolutionary trend of becoming smaller and more compact
These seven newly obtained plastomes from five Ulva species were successfully assembled as circular-mapping molecules, with sizes ranging from 87.2 kb in U. aragoënsis (Uar1) to 122.2 kb in U. meridionalis (Ume) (Table 1). The 122.2-kb cpDNA of Ume is the largest Ulva plastome sequenced thus far, and is 1.4 times the smallest one which is the 86.7-kb cpDNA of U. linza (Uli) (). To clarify the unique evolutionary trend of Ulva plastomes, we built the Ulva plastome dataset composed of newly sequenced plastomes and the data deposited in the GenBank database (Table 1), with a total of 40 Ulva plastomes which represented 19 Ulva species from two independent Ulva evolutionary lineages (I and II) (; ), and compared Ulva plastomes with those in other lineages of Ulvophyceae (Supplementary Table 1).
On the whole, Ulva cpDNAs show a clear evolutionary trend of becoming smaller and more compact when compared with all circular complete counterparts in Ulvophyceae (Figure 1). The Ulva plastomes only encoded a total of 100 conserved canonical genes, including 71 protein-coding genes (PCGs), three rRNA genes and 26 tRNA genes, which are the least among the sequenced circular ulvophycean plastomes. The overall coding regions composed of these 100 canonical genes occupy approximately 71.2 - 72.5 kb in size. Only small repeat sequences were observed to be concentrated either in intergenic spacer regions or intronic regions, and some specific repeat sequences reside in several PCGs (e.g. rpoB, rpoC1, and rpoC2). Unlike those in Sykidiales (Pseudoneochloris marina), Ulotrichales, Oltmannsiellopsidales, Ignatiales and Trentepohliales (; ; ; ), most of intergenic regions are relatively short and the rRNA operon-encoding inverted repeat (IR) has been completely lost in Ulva plastomes. The minimum size of overall non-coding intergenic regions was only approximately 11.6 kb, which was observed in U. californica (Uca) (Supplementary Table 2).
Figure 1
The ongoing gene loss or transfer can be clearly observed in Ulva plastomes. An intact organelle division inhibitor factor gene, minD (orf306), was observed to be present only in the U. aragoënsis (Uar) cpDNA (). Our further comparative analysis shows that this gene or its residue exists in the trnL2-trnS2 intergenic region of cpDNAs in the U. aragoënsis-U. californica-U.torta (Uar-Uca-Uto) subclade, while it was completely lost in the other Ulva plastomes as well as the Blidingia cpDNAs (). The minD was split into two parts (orf68 and orf209) in the Uca cpDNA, due to a 5-bp insertion mutation, whereas this gene has degenerated more seriously in the Uto cpDNAs, leaving only the residue orf44. The fracture and degeneration of minD, as well as the decreased GC content in its homologous sequences (Supplementary Table 2), indicated that there was no selective pressure to retain the integrity of this gene in Uca and Uto. Considering that homologues of minD are present in other lineages of core Ulvophyceae (), this gene is likely to have been transferred to the nuclear genome through horizontal transfer in Ulva species containing minD-lacking plastomes. However, we have not found this gene in the nuclear genome of U. mutabilis yet (), but it cannot be determined that it does not exist in the nuclear genome, considering the incompleteness of the sequenced Ulva genome.
One specific trnR3(ccu) gene is present in plastomes of some Ulva species including U. prolifera (Upr1-5), U. gigantea (Ugi), U. rigida (Uri1-2), U. torta (Uto1), U. meridionalis (Ume), U. lacinulata (Ulc1-4) and Ulva sp. (Usp2) (Figure 2). This gene is conservatively located in the downstream region adjacent to psbC (), but it is situated between psbA and trnT in the Uto1 cpDNA. Comparative analysis shows that this gene is highly similar with trnR2(ucu), indicating that it originated from the duplication of trnR2(ucu), and then its anti-codon mutated from UCU to CCU. More mutation sites were detected in trnR3(ccu) when compared with trnR2(ucu), and the latter maintains a highly conserved sequence in Ulva cpDNAs. A 5-bp (ACAAG) duplication mutation was detected in the dihydrouridine (DHU) stem-loop structure of trnR3(ccu) only in Upr5. The rates of sequence evolution for trnR3(ccu) appear to be dramatically higher than for trnR2(ucu), and the GC contents of trnR3(ccu) tend to decrease in varying degrees (Figure 2), indicating that trnR3(ccu) is subject to different selection pressures when compared with trnR2(ucu). We found that the 26 core tRNA genes are sufficient to meet all the requirements of protein synthesis in Ulva chloroplast genomes (Figure 3). Redundant trnR3(ccu) can be completely replaced by trnR2(ucu) in function, which should be the reason why it underwent significantly accelerated sequence evolution or a complete loss that had occurred.
Figure 2
Figure 3
In addition, these IR-lacking Ulva plastomes could sporadically incorporate foreign sequences in some specific intergenic regions, and accept group I/II introns in some housekeeping genes, which obviously increased their size. The current size of Ulva plastomes was the result of dynamic changes caused by several of the above factors. Especially, marked intraspecific differences in Ulva plastome sizes are common, involving gain or loss of introns, integration of foreign fragments and abundance of repetitive sequences (
Strong selection against GC in Ulva plastomes
The GC composition of ulvophycean plastomes varies significantly among different lineages (
The GC content is 26.53 - 27.71% in overall coding regions composed of 100 canonical genes (Supplementary Table 2). The GC composition of core-gene coding region as well as its total size showed very similar values at intraspecific level or among closely related species, but there are significant differences in the GC content and size of core-gene coding regions among different Ulva lineages (Table 1; Supplementary Table 2). The 26 tRNA regions have the highest GC content (51.32 - 51.88%), followed by the 3 rRNA regions (44.61 - 45.20%) and then the 71 PCG regions (24.50 - 25.82%) (Supplementary Table 2).
The GC composition is relatively stable in chloroplast rRNAs and tRNAs, but their variations in PCGs fluctuate greatly. Some PCGs related to photosynthesis have much higher GC content, e.g. psbA (40.96 ± 0.17%), psbB (38.60 ± 0.41%), psbC (38.20 ± 0.22%), psbD (38.76 ± 0.35%), and psaC (38.02 ± 0.91%), while some PCGs with large molecular weight, which are mainly involved in transcription and proteolysis, show very low GC content, e.g. ftsH (14.81 ± 0.68%), rpoA (16.96 ± 0.45%), rpoB (19.25 ± 0.65%), rpoC1 (16.24 ± 0.69%), and rpoC2 (14.76 ± 0.95%) (Figure 4). On the whole, codon usage pattern in Ulva chloroplast PCGs showed a much stronger preference for codons with A or T at the third position (Figure 3). The difference in GC content between chloroplast PCGs is mainly determined by their different amino acid composition and the different usage frequency of synonymous codons. The long PCGs with low GC content employ a large number of codons composed of only A and T. For example, the seven most frequently used codons in rpoC2 are AAU(N), AAA(K), UUA(L), UUU(F), UAU(Y), AUU(I), and AUA(I). However, GC-biased PCGs tend to prefer some codons with C at the third position. For example, UUC(F), AAC(N), AUC(I), UAC(Y) and CAC(H) were used more frequently than their synonymous codons in psbA.
Figure 4

Comparison of GC composition and gene size among 71 PCGs and three rRNAs of Ulva plastomes. Error bars represent the standard deviation (SD). Purple box represents the distribution area of most intron-containing genes.
The overall intronic regions show great difference in GC composition among different Ulva cpDNAs, ranging from 22.59% in U. linza (Uli) to 34.72% in U. compressa (Uco2) (Supplementary Table 2), as mainly depends on type and content of introns which cpDNA harbors. The GC content of overall foreign sequence regions range from 22.79 to 35.83% in nearly all Ulva cpDNAs with the exception of Usp3 cpDNAs which show much lower values (18.69 - 18.87%) (Supplementary Table 2). The GC content in noncoding intergenic regions is obviously the lowest in the range from 8.38% in U. californica (Uca) to 19.25% in U. australis (Uau1) (Supplementary Table 2).
Low GC content in Ulva plastomes is mainly attributed to strong selection pressure driving A + T richness at a genomic level. This selection pressure seems to act on the overall plastome sequences in the microenvironment of Ulva chloroplasts, including coding regions, introns, foreign sequences and noncoding regions. The GC composition of all these regions has been markedly reduced when compared with the counterparts in other ulvophycean plastomes (
Considering the high energy consumption and high nitrogen demand for GTP and CTP synthesis and the shortening of the sequence length in most PCGs (
Distribution and diversity of Ulva plastome introns
These newly sequenced Ulva plastomes harbor different intron contents ranging from two in U. aragoënsis (Uar1) to 16 in U. meridionalis (Ume), occupying 3.5 - 17.1% of cpDNAs, which are in the range of the reported Ulva plastomes (
Table 2
| Intron names * | Intron groups | Intron number (n) | Intron-encoded proteins (domain) | LAGLIDADG motif | Avg. intron size (SD, bp) | Avg. GC (SD, %) |
|---|---|---|---|---|---|---|
| atpA-492 | IB (complete) | 17 | LAGLIDADG | double | 1168 (16) | 23.24 (0.60) |
| atpB-537 | IIB | 1 | RTM | – | 2355 | 33.38 |
| atpB-627 | IIB | 15 | RTM | – | 2225 (13) | 36.79 (0.93) |
| atpB-696 | IIB | 17 | RTM | – | 2372 (12) | 36.52 (0.52) |
| atpI-256 | IIB | 1 | RTM | – | 2252 | 36.23 |
| infA-62 | II (derived) | 40 (40) ** | – | – | 616 (68) | 22.69 (1.20) |
| petB-23 | IIB | 3 | RTM | – | 2316 (1) | 34.66 (0.52) |
| petB-69 | IIB | 20 (1) ** | RTM | – | 2207 (92) | 35.38 (0.94) |
| petB-169 | IIB | 2 | RTM | – | 2459 | 34.85 |
| petB-277 | IIB | 6 | RTM | – | 2447 (13) | 36.37 (0.41) |
| petB-528 | IB (complete) | 14 (1) ** | LAGLIDADG | double | 1265 (14) | 23.94 (0.51) |
| petD-87 | IIA | 4 | RTM | – | 2428 (11) | 36.23 (0.03) |
| psaA-1104 | IB (complete) | 3 | LAGLIDADG | double | 1238 (1) | 19.43 (0.11) |
| psaA-1605 | IB (complete) | 2 | LAGLIDADG | double | 1096 | 22.45 |
| psaB-1050 | IB (complete) | 15 | LAGLIDADG | double | 1123 (20) | 22.59 (0.45) |
| psbA-179 | I (derived, B1) | 1 | LAGLIDADG | single | 695 | 28.92 |
| psbA-750 | I (derived, B1) | 3 (1) ** | T5orf172 | – | 752 (399) | 32.27 (1.22) |
| psbB-489 | I (derived, A) | 8 (2) ** | GIY-YIG | – | 880 (250) | 24.61 (1.43) |
| psbB-600 | IB (complete) | 8 | LAGLIDADG | double | 1301 (41) | 24.58 (1.00) |
| psbB-772 | I (derived, A) | 3 (3) ** | – | – | 367 (35) | 27.25 (2.55) |
| psbB-1022 | I (derived, B1) | 3 | HNH | – | 959 (9) | 25.36 (0.69) |
| psbB-1352 | I (derived, B1) | 2 | HNH | – | 947 (10) | 24.19 (1.44) |
| psbC-496 | IIB | 1 | RTM | – | 2441 | 36.71 |
| psbC-708 | IA | 4 | LAGLIDADG | single | 986 (5) | 26.11 (1.22) |
| psbC-882 | I (derived, A) | 4 | GIY-YIG | – | 926 (4) | 24.72 (0.52) |
| psbD-740 | I (derived, A) | 11 | GIY-YIG | – | 1039 (21) | 24.64 (0.77) |
| psbD-1034 | I (derived, A) | 2 | GIY-YIG | – | 1006 (108) | 24.28 (1.13) |
| rnl-1893a | IB (complete) | 19 | LAGLIDADG | single | 765 (3) | 30.38 (1.01) |
| rnl-1893b | IB (complete) | 3 | LAGLIDADG | double | 1007 (5) | 25.03 (0.40) |
| rnl-2225 | IB (complete) | 22 | LAGLIDADG | single | 966 (24) | 26.79 (0.26) |
| rnl-2463 | IB (complete) | 4 | LAGLIDADG | single | 1013 (1) | 26.40 (0.22) |
| rnl-2556 | I (derived, B2) | 5 | LAGLIDADG | single | 757 (24) | 27.66 (0.43) |
| rns-476 | I (unknown) | 4 | LAGLIDADG | single | 1007 (3) | 24.39 (0.19) |
| rns-499 | IA3 | 5 (3) ** | LAGLIDADG | double | 938 (779) | 27.63 (4.10) |
General features of Ulva chloroplast introns detected among the 40 Ulva plastomes.
* Intron names were defined as host gene plus insertion site which was determined by comparing homologous genes relative to the plastome of U. compressa (MW353781) (
** Numbers in parentheses indicate the number of introns with severe IEP degradation or loss.
The Ulva chloroplast introns were detected at 33 insertion sites of 14 host genes including atpA (1 site), atpB (3), atpI (1), infA (1), petB (5), petD (1), psaA (2), psaB (1), psbA (2), psbB (5), psbC (3), psbD (2), rnl (4) and rns (2) (Table 2). Obviously, intron densities varied widely among chloroplast genes at interspecific and intraspecific levels (Supplementary Figure 1). These introns were mainly distributed in two rRNA genes (rnl, and rns) and some more conserved PCGs involving photosystem I and II, electron transport and ATP synthesis, indicating that different functional groups of genes have different propensities for intron insertion. Further comparative analysis showed that introns preferentially resided in conserved housekeeping genes with high GC content (usually more than 34%) and long size (usually more than 0.5 kb) (Figure 4). We speculate that this may be related to higher GC content in target site sequences required for IEP recognition in host genes and more target sites contained by long GC-rich genes. However, some PCGs (e.g. rbcL, and tufA) with the above similar characteristics have high expression in chloroplasts and tend to resist intron invasion to economically and effectively ensure unnecessary consumption and time cost in transcription and processing (
A total of 23 intron families were observed to belong to group I introns, and the remaining 11 were group II introns. Eight intron families including intron atpB-537, psaA-1605, psbA-179, psbB-1022, psbB-1352, psbC-708, psbD-1034 and rnl-1893b were found for the first time in Chlorophyta. The size and GC content of different intron families fluctuate markedly in Ulva plastomes (Table 2), which are largely determined by type of introns and degeneration degree of intron-encoded proteins (IEPs) (Figure 5A). Except for the degenerated group II (derived) intron infA-62 (
Figure 5

The GC composition and size of introns detected among 40 Ulva plastomes. (A) Comparison of GC content and size among different types of introns. (B) Comparison of GC content and size between group I and group II introns.
Group IIA/IIB introns harbor a reverse transcriptase/maturase (RTM) gene in Ulva plastomes. The vast majority (93.8%) of group I introns encode an intact IEP which is the member of the LAGLIDADG or GIY-YIG or HNH homing endonuclease (LHE or GHE or NHE) families. However, the IEPs from intron psbA-750 do not exhibit significant sequence similarity to common homing endonuclease families (e.g. LHE, GHE, and NHE), but contain a conserved T5orf172 domain which occurs in a stand-alone protein form in phage, virus and bacteria and is also found in DNA-binding regulatory proteins of bacterial and eukaryotic DNA viruses (
Almost all introns displayed sporadic distribution pattern in Ulva plastomes, due to their nature of homing and mobility. Only the chloroplast intron infA-62 is an exception. This intron is shared by all Ulva plastomes, but absent in Blidingia cpDNAs, indicating that it might be acquired after its divergence from Blidingia. This intron has completely lost the ability to move and has been trapped in infA, because of its severe degeneration and the loss of IEP. This intron co-evolved with infA and showed a faster evolution rate than the host gene. Introns from the same insertion site were previously observed to be homologous among organelle genomes in Ulva (
Novel insights into integration of foreign sequences and rearrangement of Ulva plastomes
Comparison of plastome intergenic regions shows that Ulva cpDNAs experienced frequent insertion of foreign DNA sequences which usually harbor some specific open reading frames (orfs), as were important indicators for insight into their source. The largest U. meridionalis (Ume) cpDNA contained 14.6-kb foreign sequence which encoded 15 specific orfs (Supplementary Figure 2), accounting for 11.9% of plastome. To elucidate the origin of exogenous sequences and their relationships, we systematically compared the sequence characteristics of large intergenic regions and the distribution of free-standing orfs among these 40 Ulva plastomes. A total of 154 specific free-standing orfs as well as many homologous residue DNA sequences of some specific orfs, which have no similarity to chloroplast canonical genes, were detected in intergenic regions of these 40 Ulva cpDNAs. These specific orfs were not randomly distributed but mainly located in some specific intergenic regions (e.g. psbA-psbB, trnT-psbA, psbB-psbD, psbC-psbB, trnS2-psbC, trnM3-psbD, psbC-trnM3, trnL2-psbD, trnL2-trnM3, trnM1-trnE, and trnW-psaJ) (
Some foreign DNA sequences integrated into different intergenic regions of Ulva plastomes harbor homologous orfs with significant high similarity (Supplementary Table 3). These facts indicated that these foreign sequences could have been derived from the same origin. It is very similar to the finding in Ulva mitogenomes where the derived foreign sequences mainly originated from mitochondrial plasmid DNA (
Figure 6

Phylogenetic analysis of the 29 full-length free-standing Ucp-ORFs found in Ulva plastomes. The bootstrap support values greater than 70% were displayed at branches. Branch lengths were proportional to the amount of sequence change, which were indicated by the scale bar below the trees. Different colored circles represent different Ulva lineages.
Figure 7

The conserved domain shared by the 29 Ucp-ORFs detected in Ulva plastomes.
The plastome architecture is not as conserved as that of mitogenomes in Ulva species (Supplementary Figure 2), but has experienced several rearrangement events to varying degrees (
Figure 8

Comparison of plastome organization and gene order between Ulva lineage I and II. Thick lines with different colors on the left represented different gene blocks. Thick lines with different colors on the top represented different Ulva lineages. The arrows indicated the direction of gene transcription. Only canonical genes were shown as filled boxes in different colors representing different gene types.
These acquired orfs display a different evolutionary trend from the chloroplast canonical genes. The levels of sequence divergence among homologous specific orfs or foreign DNA sequences in Ulva cpDNAs greatly exceed those observed in chloroplast genes. Frequent insertion and deletion mutations lead to serious fracture and degeneration of these sequences, accompanied by reduced GC content (Supplementary Table 2, 3). These evidences show that their existence is not a necessary requirement of Ulva chloroplast genomes, and there is no selection pressure to maintain their existence. Differential GC content can be used as an indicator to distinguish the background genome and the non-self (or introduced) DNA sequence (
Novel insights into plastome architecture and phylogenomic analysis
Due to the limited data at present, it is difficult to reconstruct the plastome structure of the common ancestor of Ulva species. Based on comparative analysis of architectures in sequenced Ulvales-Sykidiales-Ulotrichales cpDNAs to date, plastomes have completely lost the IR in Ulvales (e.g. Ulva species and B. minima) (Figures 8, 9), which is a remarkable difference from those in Sykidiales (e.g. P. marina) and Ulotrichales carrying identical or non-identical IR copies (
Figure 9

Comparison of plastome organization and gene order between Ulva meridionalis (OP985133) and Blidingia minima (MK408749 and MT948112). Thick lines with different colors between two plastomes represented different gene blocks. The arrows indicated the direction of gene transcription. Only canonical genes were shown as filled boxes in different colors representing different gene types.
Phylogenomic analyses of two Ulva plastome datasets (nt sequences of 100 canonical genes and aa sequences of 71 PCGs) showed that the common ancestor of Ulva species had a very early internal divergence and split into two major evolutionary lineages (Ulva I and II) (
Figure 10

Unrooted phylogenomic tree based on Maximum Likelihood (ML) analysis of the nucleotide (nt) sequences of the 100 common genes in the 40 Ulva plastomes. The bootstrap support values greater than 70% were displayed at branches. Branch lengths are proportional to the amount of sequence change, which are indicated by the scale bar below the trees. The asterisk indicates that the species name has been corrected.
Figure 11

Phylogenomic tree based on Maximum Likelihood (ML) analysis of the amino acid (aa) sequences of the 71 common PCGs in the 40 Ulva plastomes. The bootstrap support values greater than 70% were displayed at branches. Branch lengths are proportional to the amount of sequence change, which are indicated by the scale bar below the trees. The tree was rooted with Blidingia minima as the outgroup. The asterisk indicates that the species name has been corrected.
Our results of phylogenomic analysis well supported taxonomic revisions of some species names at the genomic level (Figures 10, 11), e.g. U. mutabilis Föyn, U. pertusa Kjellman and U. fasciata Delile are taxonomic synonyms of U. compressa Linnaeus (
Conclusion
The accumulation of Ulva cpDNA data provides us with an opportunity to decipher the unique evolution of plastomes in these globally distributed green macroalgae. In this study, more new insights into plastome evolution of Ulva species have been gained. First, Ulva plastome evolution reflects the strong selection pressure driving the compactness of genome organization and the decrease of overall genomic GC content. The overall plastome sequences including canonical genes, introns, derived foreign sequences and non-coding regions show a synergetic decrease in GC content at the varying degree. Fast degeneration of plastome sequences including non-core genes (minD and trnR3), derived foreign sequences, and noncoding spacer regions was accompanied by the marked decrease of their GC composition. Second, introns preferentially resided in conserved housekeeping genes with high GC content and long length in Ulva plastomes. It might be related to high GC content in target site sequences required for IEP recognition and more target sites contained by long GC-rich genes. Third, many foreign DNA sequences integrated into different intergenic regions harbor some homologous specific orfs with high similarity, indicating that they could have been derived from the same origin. Fourth, the invasion of foreign sequences seems to be an important driving force for plastome rearrangement in these IR-lacking Ulva cpDNAs. It seems that the invasion of foreign fragments causes the instability of genome architecture and triggered inversion of some gene blocks in Ulva cpDNAs. Finally, gene partitioning pattern changed after the loss of IR, and genome rearrangement was more extensive and more frequent in IR-lacking Ulvales plastomes. Gene clusters show a staggered distribution pattern on two strands in Ulva plastomes. Our new findings have deepened our understanding of the evolutionary trend of the plastomes in ecologically important Ulva seaweeds.
Statements
Data availability statement
The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: GenBank OL684341, OL684342, and OP985129-OP985133.
Author contributions
FL designed the study. FL, NC, HW, JL, JW and FQ performed the experiments. FL and HW performed the analysis. FL wrote the manuscript. All authors contributed to the article and approved the submitted version.
Funding
This work was financially supported by the National Natural Science Foundation of China (No. 42276133/41876165), the Strategic Priority Research Program of Chinese Academy of Sciences (No. XDA23050302/XDA23050403), the Key Research Program of Frontier Sciences, Chinese Academy of Sciences (No. QYZDB-SSW-DQC023), the Science and Technology Basic Resources Investigation Program of China (No. 2018FY100200), the Major Scientific and Technological Innovation Project of Shandong Province (No. 2019JZZY020706), and Natural Science Foundation of Shandong Province for Young Scholars (No. ZR2022QD066).
Acknowledgments
We are thankful to reviewers for their valuable comments and all staffs of marine ecological environment genomics research group in Institute of Oceanology, Chinese Academy of Sciences.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2023.1126175/full#supplementary-material
References
1
BatemanA.BirneyE.DurbinR.EddyS. R.HoweK. L.SonnhammerE. L. (2000). The pfam protein families database. Nucleic Acids Res.28, 263–266. doi: 10.1093/nar/28.1.263
2
BensonG. (1999). Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res.27, 573–580. doi: 10.1093/nar/27.2.573
3
BlomsterJ.BäckS.FewerD. P.KiirikkiM.LehvoA.MaggsC. A.et al. (2002). Novel morphology in Enteromorpha (Ulvophyceae) forming green tides. Am. J. Bot.89, 1756–1763. doi: 10.3732/ajb.89.11.1756
4
CaiC.WangL.ZhouL.HeP.JiaoB. (2017). Complete chloroplast genome of green tide algae Ulva flexuosa (Ulvophyceae, chlorophyta) with comparative analysis. PloS One12 (9), e0184196. doi: 10.1371/journal.pone.0184196
5
ChanP. P.LinB. Y.MakA. J.LoweT. M. (2021). tRNAscan-SE 2.0: improved detection and functional classification of transfer RNA genes. Nucleic Acids Res.49 (16), 9077–9096. doi: 10.1101/614032
6
CocquytE.VerbruggenH.LeliaertF.De ClerckO. (2010). Evolution and cytological diversification of the green seaweeds (Ulvophyceae). Mol. Biol. Evol.27 (9), 2052–2061. doi: 10.1093/molbev/msq091
7
CouceiroL.CremadesJ.BarreiroR. (2011). Evidence for multiple introductions of the pacific green alga Ulva australis areschoug (Ulvales, chlorophyta) to the Iberian peninsula. Bot. Mar.54 (4), 391–402. doi: 10.1515/bot.2011.044
8
CremenM. C. M.LeliaertF.MarcelinoV. R.VerbruggenH. (2018). Large Diversity of non-standard genes and dynamic evolution of chloroplast genomes in siphonous green algae (Bryopsidales, chlorophyta). Genome Biol. Evol.10 (4), 1048–1061. doi: 10.1093/gbe/evy063
9
DarlingA. E.MauB.PernaN. T. (2010). Progressivemauve: multiple genome alignment with gene gain, loss and rearrangement. PloS One5 (6), e11147. doi: 10.1371/journal.pone.0011147
10
De ClerckO.KaoS. M.BogaertK. A.BlommeJ.FoflonkerF.KwantesM.et al. (2018). Insights into the evolution of multicellularity from the Sea lettuce genome. Curr. Biol.28 (18), 2921–2933. doi: 10.1016/j.cub.2018.08.015
11
Del CortonaA.LeliaertF.BogaertK. A.TurmelM.BoedekerC.JanouškovecJ.et al. (2017). The plastid genome in cladophorales green algae is encoded by hairpin chromosomes. Curr. Biol.27, 3771–3782. doi: 10.1016/j.cub.2017.11.004
12
de VriesJ.HabichtJ.WoehleC.HuangC.ChristaG.WägeleH.et al. (2013). Is ftsH the key to plastid longevity in sacoglossan slugs? Genome Biol. Evol.5 (12), 2540–2548. doi: 10.1093/gbe/evt205
13
FangJ.LiuB.LiuG.VerbruggenH.ZhuH. (2021). Six newly sequenced chloroplast genomes from trentepohliales: the inflated genomes, alternative genetic code and dynamic evolution. Front. Plant Sci.12, 780054. doi: 10.3389/fpls.2021.780054
14
FortA.McHaleM.CascellaK.PotinP.UsadelB.GuiryM. D.et al. (2021). Foliose Ulva species show considerable inter-specific genetic diversity, low intra-specific genetic variation, and the rare occurrence of inter-specific hybrids in the wild. J. Phycol.57, 219–233. doi: 10.1111/jpy.13079
15
GaoD.SunZ.BiG.ZhangX. (2022). The complete plastome of Blidingia marginata and comparative analysis with the relative species in ulvales. Aquat. Bot.183, 103568. doi: 10.1016/j.aquabot.2022.103568
16
GuiryM. D.GuiryG. M. (2023). AlgaeBase. world-wide electronic publication (Galway: National University of Ireland).
17
GulbrandsenØ.S.AndresenI. J.KrabberødA. K.BråteJ.Shalchian-TabriziK. (2021). Phylogenomic analysis restructures the ulvophyceae. J. Phycol.57, 1223–1233. doi: 10.1111/jpy.13168
18
HallT. A. (1999). BioEdit: a user-friendly biological sequence alignment editor and analysis program for windows 95/98/NT. Nucl. Acids Symp. Ser.41, 95–98. doi: 10.1021/bk-1999-0734.ch008
19
HanH.YanL.SongW.WangZ.ZhangX. (2020). Complete chloroplast genome of Ulva pertusa, one of the causal species of green macroalgal blooms in the coastal waters of qinhuangdao, China. Mitochondrial. DNA Part B.5 (1), 1084–1086. doi: 10.1080/23802359.2020.1723448
20
HaugenP.SimonD. M.BhattacharyaD. (2005). The natural history of group I introns. Trends Genet.21, 111–119. doi: 10.1016/j.tig.2004.12.007
21
HaydenH. S.WaalandJ. R. (2002). Phylogenetic systematics of the ulvaceae (Ulvales, ulvophyceae) using chloroplast and nuclear DNA sequences. J. Phycol.38 (6), 1200–1212. doi: 10.1046/j.1529-8817.2002.01167.x
22
HofmannL. C.NettletonJ. C.NeefusC. D.MathiesonA. C. (2010). Cryptic diversity of Ulva (Ulvales, chlorophyta) in the great bay estuarine system (Atlantic USA): introduced and indigenous distromatic species. Eur. J. Phycol.45 (3), 230–239. doi: 10.1080/09670261003746201
23
HugheyJ. R.GabrielsonP. W.MaggsC. A.MineurF. (2021). Genomic analysis of the lectotype specimens of European Ulva rigida and Ulva lacinulata (Ulvaceae, chlorophyta) reveals the ongoing misapplication of names. Eur. J. Phycol.57 (2), 143–153. doi: 10.1080/09670262.2021.1914862
24
HugheyJ. R.MaggsC. A.MineurF.JarvisC.MillerK. A.ShabakaS. H.et al. (2019). Genetic analysis of the linnaean Ulva lactuca (Ulvales, chlorophyta) holotype and related type specimens reveals name misapplications, unexpected origins, and new synonymies. J. Phycol.55, 503–508. doi: 10.1111/jpy.12860
25
IyerL. M.KooninE. V.AravindL. (2002). Extensive domain shuffling in transcription regulators of DNA viruses and implications for the origin of fungal APSES transcription factors. Genome Biol.3 (3), research0012. doi: 10.1186/gb-2002-3-3-research0012
26
JeffaresD. C.MourierT.PennyD. (2006). The biology of intron gain and loss. Trends Genet.22 (1), 16–22. doi: 10.1016/j.tig.2005.10.006
27
JiangT.GuK.WangL.LiuQ.ShiJ.LiuM.et al. (2019). Complete chloroplast genome of Ulva prolifera, the dominant species of green macroalgal blooms in yellow Sea, China. Mitochondrial. DNA Part B.4 (1), 1930–1931. doi: 10.1080/23802359.2019.1610090
28
JinJ.-J.YuW.-B.YangJ.-B.SongY.dePamphilisC. W.YiT.-S.et al. (2020). GetOrganelle: a fast and versatile toolkit for accurate de novo assembly of organelle genomes. Genome Biol.21, 241. doi: 10.1186/s13059-020-02154-5
29
JonesD. T.TaylorW. R.ThorntonJ. M. (1992). The rapid generation of mutation data matrices from protein sequences. Comput. Appl. Biosci.8, 275–282. doi: 10.1093/bioinformatics/8.3.275
30
KimD.LeeJ. M.ChoiJ. W.YangJ. H. (2019). Flip-flop organization in the chloroplast genome of Capsosiphon fulvescens (Ulvophyceae, chlorophyta). J. Phycol.55, 214–223. doi: 10.1111/jpy.12811
31
KoboldtD. C.ChenK.WylieT.LarsonD. E.McLellanM. D.MardisE. R.et al. (2009). VarScan: variant detection in massively parallel sequencing of individual and pooled samples. Bioinformatics25, 2283–2285. doi: 10.1093/bioinformatics/btp373
32
KumarS.StecherG.TamuraK. (2016). MEGA7: molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol. Biol. Evol.33, 1870–1874. doi: 10.1093/molbev/msw054
33
LangB. F.LaforestM.-J.BurgerG. (2007). Mitochondrial introns: a critical view. Trends Genet.23, 119–125. doi: 10.1016/j.tig.2007.01.006
34
LangB. F.NedelcuA. M. (2012). “Plastid genomes of algae,” in Advances in photosynthesis and respiration including bioenergy and related processes: genomics of chloroplasts and mitochondria. Eds. BockR.KnoopV. (Dordrecht: Springer), 59–87.
35
LeliaertF.Lopez-BautistaJ. M. (2015). The chloroplast genomes of Bryopsis plumosa and Tydemania expeditiones (Bryopsidales, chlorophyta): compact genomes and genes of bacterial origin. BMC Genomics16, 204. doi: 10.1186/s12864-015-1418-3
36
LeliaertF.SmithD. R.MoreauH.HerronM. D.VerbruggenH.DelwicheC. F.et al. (2012). Phylogeny and molecular evolution of the green algae. Crit. Rev. Plant Sci.31, 1–46. doi: 10.1080/07352689.2011.615705
37
LiH.DurbinR. (2010). Fast and accurate long-read alignment with burrows-wheeler transform. Bioinformatics26, 589–595. doi: 10.1093/bioinformatics/btp698
38
LinX.LiuW.WeiX.JiangP. (2022). Complete chloroplast genome of an invasive marine macroalga Ulva californica (Ulvophyceae, chlorophyta). Mitochondrial. DNA Part B.7 (7), 1337–1339. doi: 10.1080/23802359.2022.2098854
39
LiuF.MeltonJ. T. (2021). Chloroplast genomes of the green-tide forming alga Ulva compressa: comparative chloroplast genomics in the genus Ulva (Ulvophyceae, chlorophyta). Front. Mar. Sci.8, 668542. doi: 10.3389/fmars.2021.668542
40
LiuF.MeltonJ. T.Lopez-BautistaJ. M.ChenN. (2020). Multiple intraspecific variations of mitochondrial genomes in the green-tide forming alga, Ulva compressa Linnaeus (Ulvophyceae, chlorophyta). Front. Mar. Sci.7, 714. doi: 10.3389/fmars.2020.00714
41
LiuF.MeltonJ. T.WangH.WangJ.Lopez-BautistaJ. M. (2022a). Understanding the evolution of mitochondrial genomes in the green macroalgal genus Ulva (Ulvophyceae, chlorophyta). Front. Mar. Sci.9, 850710. doi: 10.3389/fmars.2022.850710
42
LiuF.WangH.SongW. (2022b). Tandem integration of circular plasmid contributes significantly to the expanded mitochondrial genomes of the green-tide forming alga Ulva meridionalis (Ulvophyceae, chlorophyta). Front. Plant Sci.13, 937398. doi: 10.3389/fpls.2022.937398
43
LiuJ.YangX.CuiJ.ZhuangM.ZhaoL.LiJ.et al. (2020). Complete chloroplast genome of Ulva meridionalis (Ulvales: ulvaceae): an extremely fast-growing green macroalgae. Mitochondrial. DNA Part B.5 (2), 1390–1392. doi: 10.1080/23802359.2020.1735967
44
MannS.ChenY. P. (2010). Bacterial genomic G + c composition-eliciting environmental adaptation. Genomics95 (1), 7–15. doi: 10.1016/j.ygeno.2009.09.002
45
MeltonJ. T.LeliaertF.TronholmA.Lopez-BautistaJ. M. (2015). The complete chloroplast and mitochondrial genomes of the green macroalga ulva sp. UNA00071828 (Ulvophyceae, chlorophyta). PloS One10 (4), e0121020. doi: 10.1371/journal.pone.0121020
46
MeltonJ. T.Lopez-BautistaJ. M. (2017). The chloroplast genome of the marine green macroalga Ulva fasciata delile (Ulvophyceae, chlorophyta). Mitochondrial. DNA Part A.28 (1), 93–95. doi: 10.3109/19401736.2015.1110814
47
MinhB. Q.NguyenM. A. T.von HaeselerA. (2013). Ultrafast approximation for phylogenetic bootstrap. Mol. Biol. Evol.30 (5), 1188–1195. doi: 10.1093/molbev/mst024
48
MitsuhashiC.TeramuraH.ShimadaH. (2020). Construction of genomic marker sets based on the chloroplast genome of a green alga, Ulva pertusa (syn. Ulva australis), leads to simple detection of Ulva species. Genes Genet. Systems.95 (2), 55–63. doi: 10.1266/ggs.19-00054
49
OsorioH.Tapia-ReyesP.EspinozaD.LaporteD.GonzálezA.Castro-NallarE.et al. (2022). The genome of the marine alga Ulva compressa (Chlorophyta) reveals protein-coding genes with similarity to plants and green microalgae, but also to animal, bacterial, and fungal genes. Int. J. Mol. Sci.23, 7279. doi: 10.3390/ijms23137279
50
Paysan-LafosseT.BlumM.ChuguranskyS.GregoT.PintoB. L.SalazarG. A.et al. (2023). InterPro in 2022. Nucleic Acids Res.51 (D1), D418–D427. doi: 10.1093/nar/gkac993
51
PombertJ.-F.LemieuxC.TurmelM. (2006). The complete chloroplast DNA sequence of the green alga Oltmannsiellopsis viridis reveals a distinctive quadripartite architecture in the chloroplast genome of early diverging ulvophytes. BMC Biol.4, 3. doi: 10.1186/1741-7007-4-3
52
PombertJ.-F.OtisC.LemieuxC.TurmelM. (2005). The chloroplast genome sequence of the green alga Pseudendoclonium akinetum (Ulvophyceae) reveals unusual structural features and new insights into the branching order of chlorophyte lineages. Mol. Biol. Evol.22 (9), 1903–1918. doi: 10.1093/molbev/msi182
53
RobinsonJ. T.ThorvaldsdóttirH.WincklerW.GuttmanM.LanderE. S.GetzG.et al. (2011). Integrative genomics viewer. Nat. Biotechnol.29, 24–26. doi: 10.1038/nbt.1754
54
ShiblA. A.IsaacA.OchsenkühnM. A.CárdenasA.FeiC.BehringerG.et al. (2020). Diatom modulation of select bacteria through use of two unique secondary metabolites. P. Natl. Acad. Sci. U.S.A.117, 27445–27455. doi: 10.1073/pnas.2012088117
55
SmithD. R. (2017). Evolution: in chloroplast genomes, anything goes. Curr. Biol.27 (24), R1305–R1307. doi: 10.1016/j.cub.2017.10.049
56
SteinhagenS.WeinbergerF.KarezR. (2019). Molecular analysis of Ulva compressa (Chlorophyta, ulvales) reveals its morphological plasticity, distribution and potential invasiveness on German north Sea and Baltic Sea coasts. Eur. J. Phycol.54 (1), 102–114. doi: 10.1080/09670262.2018.1513167
57
SuzukiS.YamaguchiH.HiraokaM.KawachiM. (2018). Mitochondrial and chloroplast genome sequences of Ulva ohnoi, a green-tide forming macroalga in the southern coastal regions of Japan. Mitochondrial. DNA Part B.3 (2), 765–767. doi: 10.1080/23802359.2018.1483778
58
ThompsonJ. D.GibsonT. J.PlewniakF.JeanmouginF.HigginsD. G. (1997). The ClustalX windows interface flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res.25, 4876–4882. doi: 10.1093/nar/25.24.4876
59
TrifinopoulosJ.NguyenL. T.von HaeselerA.MinhB. Q. (2016). W-IQ-TREE: a fast online phylogenetic tool for maximum likelihood analysis. Nucleic Acids Res.44 (W1), W232–W235. doi: 10.1093/nar/gkw256
60
TurmelM.LemieuxC. (2018). Evolution of the plastid genome in green algae. Adv. Bot. Res.85, 157–193. doi: 10.1016/bs.abr.2017.11.010
61
TurmelM.OtisC.LemieuxC. (2016). Mitochondrion-to-chloroplast DNA transfers and intragenomic proliferation of chloroplast group II introns in Gloeotilopsis green algae (Ulotrichales, ulvophyceae). Genome Biol. Evol.8 (9), 2789–2805. doi: 10.1093/gbe/evw190
62
TurmelM.OtisC.LemieuxC. (2017). Divergent copies of the large inverted repeat in the chloroplast genomes of ulvophycean green algae. Sci. Rep.7, 994. doi: 10.1038/s41598-017-01144-1
63
WangL.CaiC.ZhouL.HeP.JiaoB. (2017). The complete chloroplast genome sequence of Ulva linza. conserv. Genet. Resour.9, 463–466. doi: 10.1007/s12686-016-0682-0
64
WangY.LiuF.LiuX.ShiS.BiY.MoejesF. W. (2019). Comparative transcriptome analysis of four co-occurring Ulva species for understanding the dominance of Ulva prolifera in the yellow Sea green tides. J. Appl. Phycol.31, 3303–3316. doi: 10.1007/s10811-019-01810-z
65
WangH.LiuF.WangJ.ChenN. (2021). Phylogenomic analysis of the chloroplast genome of the green-tide forming macroalga Ulva intestinalis Linnaeus (Ulvophyceae, chlorophyta). Mitochondrial. DNA Part B.6 (10), 3052–3054. doi: 10.1080/23802359.2021.1978889
66
WangJ.WangY.WangH.LiuF.ChenN. (2020). Comparative chloroplast genome analysis of new-green tide causative species in China: Ulva laetevirens. Oceanol. Et Linnol. Sinica.52 (5), 1–13.
67
WenQ.YangW.LiJ.LiuJ.ZhaoS.GaoS.et al. (2022). Characterization of complete chloroplast genome of Ulva torta (Mertens) trevisa. Mitochondrial. DNA Part B.7 (6), 1041–1043. doi: 10.1080/23802359.2022.2081943
68
XiaL.QinY.LiuJ.ZhangH.WuL.GaoS.et al. (2021). Complete chloroplast genome of Ulva compressa (Ulvales: ulvaceae). Mitochondrial. DNA Part B.6 (3), 720–722. doi: 10.1080/23802359.2020.1860696
69
ZhuH.HuY.LiuF.HuZ.LiuG. (2019). Characterization of the chloroplast genome of Trentepohlia odorata (Trentepohliales, chlorophyta), and discussion of its taxonomy. Int. J. Mol. Sci.20, 1774. doi: 10.3390/ijms20071774
70
ZukerM. (2003). Mfold web server for nucleic acid folding and hybridization prediction. Nucleic Acids Res.31, 3406–3415. doi: 10.1093/nar/gkg595
Summary
Keywords
chloroplast genome, Ulvophyceae, comparative genomics, GC content, group I/II intron, genome rearrangement
Citation
Liu F, Chen N, Wang H, Li J, Wang J and Qu F (2023) Novel insights into chloroplast genome evolution in the green macroalgal genus Ulva (Ulvophyceae, Chlorophyta). Front. Plant Sci. 14:1126175. doi: 10.3389/fpls.2023.1126175
Received
17 December 2022
Accepted
03 April 2023
Published
18 April 2023
Volume
14 - 2023
Edited by
Enrico Vito Perrino, International Centre for Advanced Mediterranean Agronomic Studies, Italy
Reviewed by
Zoltan Fussy, Charles University, Czechia; Eric Wade Linton, Central Michigan University, United States; Huan Zhu, Institute of Hydrobiology (CAS), China
Updates

Check for updates
Copyright
© 2023 Liu, Chen, Wang, Li, Wang and Qu.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Feng Liu, liufeng@qdio.ac.cn; prcliufeng@sina.cn
This article was submitted to Plant Systematics and Evolution, a section of the journal Frontiers in Plant Science
Disclaimer
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.