ORIGINAL RESEARCH article

Front. Plant Sci., 20 December 2024

Sec. Plant Systematics and Evolution

Volume 15 - 2024 | https://doi.org/10.3389/fpls.2024.1498543

Complete chloroplast genome data reveal the existence of the Solidago canadensis L. complex and its potential introduction pathways into China

  • 1. Zhejiang Provincial Key Laboratory of Plant Evolutionary Ecology and Conservation, Taizhou University, Taizhou, China

  • 2. School of Electronics and Information Engineering, Taizhou University, Taizhou, China

  • 3. School of Advances Study, Taizhou University, Taizhou, China

  • 4. Environmental and Life Sciences Graduate Program, Trent University, Peterborough, ON, Canada

  • 5. Department of Biology, University of Konstanz, Konstanz, Germany

Abstract

Solidago canadensis, native to North America, is an invasive species in many areas of the world, where it causes serious damage to natural ecosystems and economic losses. However, a dearth of genetic resources and molecular markers has hampered our understanding of its invasion history. Here, we de novo assembled 40 complete chloroplast genomes of Solidago species, including 21 S. canadensis individuals, 15 S. altissima individuals, and four S. decurrens individuals, the sizes of which ranged from 152,412 bp to 153,170 bp. The phylogenetic trees based on the complete chloroplast genome sequences and nuclear genome-wide SNP data showed that S. canadensis and S. altissima cluster together and form a monophyletic pair, as sister to S. decurrens, indicating the existence of the S. canadensis L. complex in China. Three potential introduction pathways were identified. The chloroplast-genome structure and gene contents are conservative in the genomes of the S. canadensis L. complex and S. decurrens. The analysis of sequence divergence indicated five variable regions, and 10 chloroplast protein-coding genes that underwent positive selection were identified. Our findings shed new light on the invasion history of S. canadensis and the data sets generated in this study will facilitate future research on its chloroplast genome evolution.

1 Introduction

The genus Solidago L. (Asteraceae) has currently ca. 140 recognized species (; ). The majority of Solidago species are native to North America, while a few species are native to South America or Eurasia (; ). However, at least 50 of the North American Solidago species have been introduced as ornamental plants to Europe in the 17th and 18th century (), and most likely also elsewhere. Some of those species have become naturalized in Europe, as well as in Asia and other parts of the world (; ). One of them, Solidago canadensis L., is now one of the most aggressive invasive plant species in Europe and in China (; ). However, due to differences in morphology and ploidy, there is uncertainty about whether the plants in China and Europe belong to the same species, and thus whether they both belong to S. canadensis.

The taxonomy of S. canadensis is complicated, and due to its wide variability in leaf sizes, pubescence, rhizome lengths, flowerhead sizes, and ploidy levels, multiple varieties are recognized (; ; ). Some of the taxa that some taxonomists consider to be varieties of S. canadensis are considered to be separate species by others. Consequently, uncertainty exists around the true identity of what is now usually called Solidago canadensis in the invaded ranges. For example, a plant-morphological study by concluded that the taxon in Europe is highly variable (), and that, particularly due to its characteristic nodding shoot tips, it has to be closely related to S. canadensis var. scabra (Muhl.) Torr. & Gray, which is considered a synonym of S. altissima L (; ). Consequently, many older publications refer to this invasive species in Europe as S. altissima (). However, S. altissima is hexaploid, whereas the taxon in Europe is diploid. Since the early 2000s, most studies use the name S. canadensis (). However, reported the first evidence for an established hexaploid population of S. altissima in Europe (). In Asia, a highly invasive hexaploid Solidago has been identified in Japan as S. altissima (https://www.nies.go.jp/biodiversity/invasive/DB/detail/80600e.html), but S. canadensis has also been identified as naturalized in Japan. In China, however, an highly invasive hexaploid Solidago has been identified as S. canadensis, but S. altissima has also been recorded as naturalized in China (). So, both in Europe and Asia, S. altissima and S. canadensis have been identified, but doubts about the exact taxonomic status remain.

Based on botanical identification literature, S. altissima should distinguish itself from S. canadensis by having fewer leaf serrations, more disk florets, longer ray florets, a higher involucral height, larger pollen, and a nodding shoot tip, and by being hexaploid instead of diploid (; ; ). Based on an extensive comparison of morphological characteristics, concluded that S. altissima and S. canadensis should be treated as separate species (). However, some plants, like S. canadensis in Europe, have characteristics of both species, and therefore molecular marker data might give more precise information on its identity (; ). Recently, used nuclear ribosomal ITS and chroloplast trnH-psbA intergenic spacer sequences on specimens from North America, Europe and China, and found that specimens morphologically identified to be S. canadensis and S. altissima were >99% identical, and should be considered as one species complex (). However, other molecular markers might provide more conclusive evidence.

Here, we used next-generation sequencing to obtain complete chloroplast genome sequences of 40 Solidago individuals from the native range in the US, and the non-native ranges in Asia (China and Japan) and Europe. An advantage of the chloroplast genome, compared to the nuclear genome, is that it is smaller and has more copy numbers (). Furthermore, the chloroplast genome is usually maternally inherited and does not undergo gene recombination (). Therefore, complete chloroplast genomes have frequently been used to study population genetics and species evolution, and to determine phylogenetic relationships among plant species (; ). The aims of our study were to (i) obtain complete chloroplast genomes of specimens morphologically identified to be S. canadensis and S. altissima, to evaluate the usefulness of the chloroplast genome in species identification and phylogenetics, and (ii) to obtain insight into whether the invasive Solidago in Europe and Asia are most likely S. canadensis or S. altissima. Furthermore, we searched for highly variable genome regions that could be used to develop barcode markers, and we tested for genes with positive selection to identify genes that may play a role in evolutionary adaptation.

2 Materials and methods

2.1 Plant materials and genome data sources

For S. canadensis and S. altissima, seeds and leaf samples of a total of 36 individuals were collected from the native range (11 from the United States and 10 from Canada) and the introduced range (10 from China, 4 from Europe, and 1 from Japan). In detail, seeds of reference native S. canadensis L. and S. altissima L. (Asteraceae) were obtained from The Germplasm Resources Information Network of the United States Department of Agriculture (GRIN/USDA) with accession numbers W6 52837 and W6 57335, respectively. The seeds were sown and leaf material of one offspring plant from each of the S. canadenisis and S. altissima accessions were used for chloroplast-genomic analysis. Additionally, leaf samples of S. altissima herbarium specimens were obtained: three samples from the Wuhan Institute of Botany (HIB; accession numbers HIB0188587, HIB0188588, and HIB0188589) and one sample from the Chinese Field Herbarium (CFH; accession number CSH0115776). We also obtained leaf material of one S. canadensis specimen from the Specimen Museum of Zhejiang University (accession number HZU60102855; Supplementary Table S1). We also used 29 leaf samples of S. canadensis collected in the field, including nine samples from the United States, six samples from China, and four samples from Europe. In addition, we used 10 samples of S. altissima from Peterborough, Canada. The healthy leaf samples were washed and then dried in silica gel for preservation until DNA extraction. In total, we had 21 S. canadensis samples and 15 S. altissima samples. For the samples collected in the field, S. canadensis and S. altissima were mainly distinguished by the length and arrangement of epidermal hairs on stems and leaves, and the shape and margins of leaves. One individual each of S. canadensis, S. altissima and S. decurrens are shown in Supplementary Figure S1.

As a control to show that the chloroplast genomic analysis can distinguish Solidago species, we also used samples of S. decurrens, which is native in China. We had one S. decurrens herbarium specimen from the Specimen Museum of Zhejiang University (accession number HZU60120963), and three leaf samples of S. decurrens collected in the field in China (Supplementary Tables S1, S2). In addition, we also obtained complete chloroplast genome sequence data of S. decurrens (NC_053705.1) from the National Center for Biotechnology Information (NCBI, https://www.ncbi.nlm.nih.gov/). As additional outgroups, we also obtained from NCBI complete chloroplast genome sequence data of six other Asteraceae species, including Aster flaccidus (MN122101.1) (), Conyza bonariensis (NC_035884.1) (), Erigeron canadensis (MT806101.1), Heteroplexis incana (NC_048508.1), Lagenophora cuchumatanica (NC_034819.1) (), and Symphyotrichum subulatum (NC_050667.1) ().

2.2 Total DNA extraction

Approximately 1 g of each of the dried leaf samples was used for genomic DNA extraction with the modified CTAB method (). A library with insertion sizes of 300-500 bp was constructed for paired-end sequencing using the Illumina Novaseq 6000 platform (Illumina, San Diego, CA, United States).

2.3 Chloroplast genome assembly and annotation

Chloroplast genomes were de novo assembled using NOVOPlasty version 4.2 with default parameters (). The GeSeq tool was used to annotate the assembled chloroplast genome sequences (). Results of software based annotation were verified by manual inspection using Geneious Prime 2021.1.1 (Biomatters Ltd., Auckland, New Zealand). Chloroplast gene circular maps were drawn with the OrganellarGenomeDRAW tool (OGDRAW) ().

2.4 Nuclear genome SNP data calling

For the nuclear phylogenetic tree construction, genome-wide SNP data was obtained according to the following steps. First, the raw sequencing data were cleaned using Trimmomatic (version 0.39) () to remove adapters and low-quality sequences (the reads were scanned with a 4-base wide sliding window, cut when the average quality per base dropped below 30, and dropped when they were less than 120 bases long). Then the cleaned reads were mapped to the nuclear genome of S. canadensis using the software Burrows–Wheeler Alignment (BWA, version 0.7.17) with default parameters (). The variant calling was performed using the Genome Analysis Toolkit (GATK) Best Practices Pipeline (version 4.2.0.0) (), and variants were filtered with the VariantFiltration tool from GATK (-Window 4, -filter “QD < 2.0 || MQ<40.0 || DP <10 || FS > 60.0). Finally, the filtered SNPs of the Solidago species were merged using the software BCFtools (version 1.11) with default parameters ().

2.5 Phylogenetic tree construction

In the phylogenetic tree construction, the six non-Solidago species that belong to the same Asteraceae Tribe (A. flaccidus, C. bonariensis, E. canadensis, H. incana, L. cuchumatanica, and S. subulatum), served as outgroups for the chloroplast genome tree. In the nuclear genome tree, S. subulatum was served as outgroup. The whole-chloroplast-genome sequences were aligned using MAFFT v7.471 () and manually adjusted. To determine the phylogenetic relationships, a maximum-likelihood tree of the chloroplast genome was constructed based on the manually adjusted aligned chloroplast genomes. The nuclear genome tree was built based on the genome-wide SNP data. The best-fit substitution model was selected by the software MrModeltest v2.4, which selects general time-reversible models with invariant sites and the gamma-rate heterogeneity (GTR+G+I) as the best-fitting substitution model. The maximum likelihood tree was constructed using IQ-TREE version 1.6.12 () with 1000 bootstrap replicates.

For principal component analysis (PCA), eigenvalues for consensus genotyping from SNP data of chloroplast genomes in Solidago species were calculated in plink v1.9 (). Then PCA plots were created by plotting the first two eigenvalues by ggplot2 package () in R.

2.6 Comparative genome analysis

Chloroplast genome comparisons were carried out and visualized with the online software mVISTA () based on annotation information of the sample Solidago canadensis 21 (Supplementary Table S2) in Shuffle LAGAN mode. DNAsp v5.10 software () was used to perform nucleotide diversity (Pi) and sequence-polymorphism analysis, and the settings were a step size of 200 bp and a window length of 600 bp.

Positive selection analysis was performed by the software EasyCodeML () across the chloroplast genomes, using a site-specific model with five site models (M0, M1a & M2a, M7 & M8) to calculate the synonymous (dS) and non-synonymous (dN) substitution rates, and their ratio (ω = dN/dS). EasyCodeML uses CodeML of the PAML software (). The site-specific model allowed the ω ratio to vary among sites while maintaining a fixed ω ratio in all the branches (). Specifically, M1a (nearly neutral) vs M2a (positive selection), and M7 (β) vs M8 (β & ω) were applied to find sequences that had undergone positive selection (). Likelihood ratio tests (LRT) were used for selection-strength evaluation with the comparison of M1a vs M2a and M7 vs M8. A p value of the chi-square statistic smaller than 0.05 was considered to be significant. The site models of M2a and M8 were implemented using Bayes Empirical Bayes (BEB) inference () in order to estimate the posterior probabilities and positive selection pressures on the selected genes.

3 Results

3.1 Chloroplast-genome features of S. canadensis, S. altissima, and S. decurrens

There was a large overlap in the lengths of the complete circular chloroplast genomes of S. canadensis (152,412 to 153,170 bp; n = 21), S. altissima (152,957 to 153,126 bp; n = 15) and S. decurrens (152,728 to 152,863 bp; n = 5). Each chloroplast genome had a typical quadripartite structure, including a large single-copy region (LSC, 84,214 bp to 85,061 bp, 84,809 bp to 85,026 bp, and 84,599 bp to 84,736 bp, respectively for S. canadensis, S. altissima, and S. decurrens) and a small single-copy region (SSC, 18,066 bp to 18,143 bp, 18,066 bp to 18,165 bp, and 18,089 bp to 18,095 bp, respectively, for S. canadensis, S. altissima, and S. decurrens), separated by two inverted repeats (IRA and IRB, 25,015 bp to 25,134 bp, 25013 bp to 25024 bp, and 25,019 bp to 25,020 bp, respectively for S. canadensis, S. altissima, and S. decurrens) (Supplementary Tables S1, S2 and Figure 1).

Figure 1

The chloroplast-genome features of the sampled S. canadensis, S. altissima and S. decurrens plants were largely identical. All the chloroplast genomes of S. canadensis, S. altissima, and S. decurrens showed similar GC contents ranging from 37.16% to 37.29%. The gene numbers and gene orders of the chloroplast genomes were identical. They contained a total of 128 genes, including 85 protein-coding genes, 35 tRNA genes, and 8 rRNA genes (Supplementary Table S2). Of the 128 genes, six protein-coding genes (rpl2, rpl23, ycf2, ndhB, rps7, and ycf1), seven tRNA genes (trnI-CAU, trnL-CAA, trnV-GAC, trnI-GAU, trnA-UGC, trnR-ACG, and trnN-GUU), and 4 rRNA genes (rrn16, rrn23, rrn4.5, and rrn5) were duplicated in the IR regions. The majority of encoded chloroplast genes are involved in photosynthesis-related metabolic processes (Table 1). There were 15 genes harboring a single intron (trnK-UUU, rps16, rpoC1, atpF, trnG-UCC, trnL-UAA, trnV-UAC, petB, petD, rpl16, rpl2, ndhB, trnI-GAU, trnA-UGC, and ndhA), and tree genes containing two introns (ycf3, rps12, and clpP1) (Supplementary Table S3).

Table 1

Group of genesGene names
Photosystem IpsaA, psaB, psaC, psaI, psaJ
Photosystem IIpsbA, psbB, psbC, psbD, psbE, psbF, psbH, psbI, psbJ, psbK, psbL, psbM, psbT, psbZ, pbf1
Cytochrome b/f complexpetA, petB, petD, petG, petL, petN
ATP synthaseatpA, atpB, atpE, atpF, atpH, atpI
NADP dehydrogenasendhA, ndhB, ndhC, ndhD, ndhE, ndhF, ndhG, ndhH, ndhI, ndhJ, ndhK
RubisCO large subunitrbcL
RNA polymeraserpoA, rpoB, rpoC1, rpoC2
Ribosomal proteins (SSU)rps2, rps3, rps4, rps7, rps8, rps11, rps12, rps14, rps15, rps16, rps18, rps19
Ribosomal proteins (LSU)rpl2, rpl14, rpl16, rpl20, rpl22, rpl23, rpl32, rpl33, rpl36
Hypothetical chloroplast reading frames (ycf)ycf1, ycf2, ycf3, ycf4
Other genesaccD, ccsA, cemA, clpP1, infA, matK
Ribosomal RNAsrrn16, rrn23, rrn4.5, rrn5
Transfer RNAstrnH-GUG, trnK-UUU, trnQ-UUG, trnS-GCU, trnC-GCA, trnD-GUC, trnY-GUA, trnR-UCU, trnG-UCC, trnS-UGA, trnG-UCC, trnfM-CAU, trnS-GCU, trnT-UGU, trnL-UAA, trnF-GAA, trnV-UAC, trnM-CAU, trnW-CCA, trnP-UGG, trnI-CAU, trnL-CAA, trnV-GAC, trnI-GAU, trnA-UGC, trnR-ACG, trnN-GUU, trnL-UAG, trnN-GUU, trnR-ACG, trnA-UGC, trnI-GAU, trnV-GAC, trnL-CAA, trnI-CAU

List of annotated genes in the chloroplast genome of S. canadensis L. complex and S. decurrens.

3.2 Phylogenetic analyses

The topology of the phylogenetic trees based on the complete chloroplast genome sequences and nuclear genome-wide SNP data both had strong support (100% bootstrap value) and showed that all S. canadensis and S. altissima samples formed a monophyletic group, distinct from S. decurrens (Figure 2). Principal component analysis (PCA) based on the chloroplast genome sequences also significantly distinguished S. decurrens from S. canadensis and S. altissima, while S. canadensis and S. altissima were grouped together (Figure 3). These results suggest that S. canadensis and S. altissima are one and the same species or belong to one species complex.

Figure 2

Figure 3

Generally, the phylogenetic trees reconstructed from the chloroplast genome data and nuclear genome data showed a largely congruent invasion history of native and invasive S. canadensis L. complex (Figure 2). In the monophyletic group, individuals of the S. canadensis L. complex in China showed a close relationship with individuals from North America. In addition, one Japanese individual showed close affinities to the Chinese individuals. The European individuals were grouped together and also showed a close relationship with North American individuals. Taken together, the results of the phylogenetic analysis indicate multiple origins and complex invasive routes of the S. canadensis L. complex into China.

3.3 Comparative analyses of chloroplast genomes

To investigate the divergence of chloroplast genomes among the accessions of S. altissima, S. canadensis, and S. decurrens, multiple alignments of chloroplast genomes were compared by mVISTA, with the annotated accession S. canadensis 21 as reference (Supplementary Figure S2). The results revealed a high similarity of chloroplast genome sequences, but the LSC (from trnH-GUG to rps19) and SSC (from ycf1 to ndhF) regions showed relatively less conservation compared to the IRA (from ycf1 to rpl2) and IRB (from rps19 to ycf1) regions (Supplementary Figure S2). Moreover, higher divergence was also observed in non-coding regions when compared to coding regions.

Nucleotide diversity (Pi) values were calculated within 600 bp windows to discover hotspots of sequence divergence. The Pi values of the whole chloroplast genomes varied from 0 to 0.00594, and there was a higher diversity in the LSC and SSC regions than in the IR regions (Figure 4). A total of five highly variable regions with Pi values higher than 0.002 were identified, including trnG-UCC - psbD, trnT-UGUtrnL-UAA and petB - petD in the LSC region, and ycf1 - ndhA and trnL-UAG-ndhF in the SSC region (Figure 4). The nucleotide diversity of individuals of the S. canadensis L. complex between China and Europe was higher than between individuals from China and North America (Supplementary Figure S3), which may be due to a limited sample size for Europe.

Figure 4

The non-synonymous (dN) and synonymous (dS) substitution rates of all the protein coding genes were analyzed across chloroplast genomes in the S. canadensis L. complex and S. decurrens. Most of the genes showed evidence of purifying selection. A total of 10 protein coding genes with significant posterior probabilities suggest that some sites of these genes were under positive selection (Supplementary Table S4). The gene ycf1 showed the highest number (n = 8) of amino acid sites with evidence for positive selection (Supplementary Table S4). Interestingly, in the gene rps19, the 46th amino acid was alanine in the S. altissima and S. canadensis accessions, while it was serine in the S. decurrens accessions (Supplementary Figure S4).

4 Discussion

In this study, we found that the genome size (152,412-153,170 bp), GC content (37.16-37.29%), the quadripartite structure and gene composition (Supplementary Table S2) of the 40 Solidago accessions were consistent with those of other Tribe. Astereae Cass chloroplast genomes, and showed a highly conserved structure with minor differences among the Solidago species. The chloroplast genome features of the sampled S. canadensis L. complex and S. decurrens plants were largely identical (Table 1 and Supplementary Tables S2, S3). Our results confirm the results of , who used both nuclear molecular markers and morphological traits, that S. decurrens is different from the other accessions (). More importantly, our whole chloroplast genome sequences revealed that there were no differences between the putative S. canadensis and S. altissima accessions, and between their native and non-native origins (Figures 2 and 3).

Solidago canadensis has spread worldwide owing to its high competitive ability, which allows it to dominate plant communities (). Solidago altissima has also been reported as invasive in different parts of the world (). The taxonomic status of S. canadensis and S. altissima is still controversial, as it is not clear yet whether they are two separate species or belong to a single species or species complex (; ; ). The main morphological traits that have been used to distinguish S. canadensis and S. altissima are the height and width of inflorescences (), the numbers of disk and ray florets (), pollen size () and the length and arrangement of the epidermal hairs of the stems and leaves (). However, some studies showed that S. canadensis and S. altissima do not differ in functional traits, biomass production and allocation (, ; ). In addition, based on both molecular methods and morphological characteristics concluded that S. canadensis and S. altissima should be considered as a single species complex (). As pointed out by others, next generation sequencing may help to reconstruct the phylogenetic relationships of Solidago species (; ; ). Indeed sequence data of chloroplast genomes and nuclear genomes have served as the technology of choice for phylogenetic investigations (; ). Similar to what earlier studies implied (), our analyses of whole-chloroplast genome sequences and genome-wide SNP data (Figures 2 and 3) showed S. canadensis and S. altissima formed one monophyletic group as sister to S. decurrens. This supports the conclusion that S. canadensis and S. altissima belong to one species complex (i.e. the S. canadensis L. complex).

Accessions from China appeared in two different clades and one of these clades also included the accession from Japan and Europe (Figure 2). This is in line with the idea that there have been multiple introduction events of S. canadensis from its native North America range into China. The morphological analysis of based on 11 phenotypic traits also indicated that both the Chinese and European individuals in their study showed close affinity with North America individuals of S. canadensis L. complex. The Chinese accessions may have been introduced directly from North America or they may have used Japan or Europe as stepping stones (; ). In any case, the results indicate that the S. canadensis L. complex came to China via multiple routes, as has also been suggested by other studies (; ). For Europe, historical records indicate that S. canadensis was introduced as an ornamental plant in the 17th century (). This was several centuries before the first plants were introduced to China (in the 1930s) (; ). Therefore, it is likely that the invasive accessions of S. canadensis in Europe came directly from North America and were not introduced via China or Japan. So, despite some limitations regarding the number of studied individuals, our results still provide new insights into the complicated invasion history of the S. canadensis L. complex in Europe and Asia.

DNA barcode markers, which are usually located in highly variable genome regions, are widely used for species identification. However, as some of the classical DNA barcodes are insufficient to distinguish between different Solidago species, it is crucial to find more highly variable genome regions that could be developed as potential barcode markers for identification of Solidago species. We found that most variable regions were located in the LSC and SSC regions, and that non-coding regions were more variable than coding regions (Supplementary Figure S2). This is a common phenomenon in the chloroplast genomes of most angiosperms (). In this study, five highly variable chloroplast-genome regions that are located in the intergenic region of S. canadensis L. complex and S. decurrens were identified based on mVISTA and nucleotide diversity. These regions include trnG-UCC - psbD, trnT-UGG - trnL-UAA, petB - petD, ycf1 - ndhA, and trnL-UAG - ndhF (Figure 4), and could serve as potential DNA barcodes for identification and phylogenetic analysis of Solidago species. The most divergent regions of ycf1 - ndhA, trnL-UAG - ndhF, trnT-UGU - trnL-UAA, and trnG-UCC - psbD as shown in Figure 4 were consistent with results of previous studies (; ), indicating that these regions indeed evolve rapidly in Solidago.

Recently, S. canadensis has been added to the updated list of 33 key invasive alien species in China (http://www.moa.gov.cn/govpublic/KJJYS/202211/t20221109_6415160.htm; accessed on 1 March 2023). This indicates that the management and effective control of this species remains of pressing concern in China. Our study adds further evidence (see also ; ) that there may have been multiple introduction events of S. canadensis into China, and possibly via multiple pathways. To prevent invasions by more S. canadensis accessions, either from North America or other invaded regions, border quarantine measures should be strengthened. This does not only apply to S. canadensis but also to other alien species that may adversely affect local biodiversity. In addition, chemical, mechanical, and biological control approaches should be integrated to minimize the harm of S. canadensis to the local ecological environment (; ; ). Furthermore, with the application of next generation sequencing, the use of species-specific markers will be a very useful fast and cheap molecular tool for early detection of new alien species or accessions of alien species that are already invasive (). The molecular markers detected by our study may help to establish a rapid and effective early warning monitoring system to promptly detect S. canadensis so that it can be eradicated before it becomes invasive.

Positive selection is considered to play an important role in adaptive evolution, while negative (purifying) selection is a ubiquitous evolutionary force responsible for conservation of the genome across long evolutionary timescales (; ). To detect signs of adaptive evolution among species, the ratio of substitution rates at synonymous and non-synonymous sites is frequently used (; ). For our Solidago accessions, we identified 10 genes with sites that showed signals of positive selection. Among them, rpoB, rbcL, accD, psbB, rps3, rps19, ndhH, ndhD, and ccsA genes have also been found in other plant species (; ; ). Moreover, these genes could be used for identification and phylogenetic research of Solidago species. The gene ycf1 had numerous such sites, indicating that this gene may play a key role in adaptive evolution of Solidago (Supplementary Figure S4 and Supplementary Table S4). Furthermore, the genes rps3 and rps19, encoding ribosomal subunit proteins that are considered to be essential for chloroplast biogenesis and function, showed signs of positive selection. This suggests that the studied Solidago species may increase evolutionary adaptability by regulating encoding ribosomal subunit proteins in chloroplast genomes. In addition, the likelihood ratio test (LRT) results showed that the p-values of rpoB, rps19, and ndhD genes were below 0.05, especially for the rpoB (encoding the beta-subunit of RNA polymerase) and ndhD gene (encoding NADP dehydrogenase), corroborating that these sites in the S. canadensis L. complex species have been under positive selection. As an important modulator of photosynthetic electron transport, recent study has revealed that positive selection of the ndhD gene was fairly common in all the main lineages of land plants (). Moreover, the positive selection site of the ndhD gene in five S. canadensis L. complex individuals from China (Solidago canadensis 04 and 06) or three from North America (Solidago altissima 10, 13, and 14) contain same amino acid, indicating the Chinese individuals of the S. canadensis 04 and 06 may be directly introduced from North America. For the rpoB gene, Solidago canadensis 17 and 19 collected from Europe showed same amino acid in the 569th positive selection site, inferring the direct introduction route from North America to Europe. In general, positive selection would possibly contribute to various invasive environments for this introduced species. Therefore, positive selection of these chloroplast genes may promote environmental adaptation of Solidago, but unravelling details of the adaptation mechanisms will need further in-depth research.

Statements

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/Supplementary Material.

Author contributions

Y-TT: Writing – original draft, Formal analysis, Writing – review & editing. L-XC: Data curation, Writing – review & editing. MJ: Data curation, Writing – review & editing. JJ: Methodology, Writing – review & editing. Z-SS: Data curation, Writing – review & editing. C-NC: Writing – review & editing. H-YL: Data curation, Writing – review & editing. AK: Data curation, Writing – review & editing. J-ML: Conceptualization, Writing – review & editing. MK: Conceptualization, Writing – review & editing.

Funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This work was supported by the National Key R&D Program of China (2016YFC1201100), Ten Thousand Talent Program of Zhejiang Province (No. 2019R52043) and the Taizhou Science and Technology Project (No. 21ywb76).

Acknowledgments

We thank the Taizhou Bigdata AI Research Center for providing computing resources.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2024.1498543/full#supplementary-material

Supplementary Figure 1

Solidago canadensis in the field (A), an herbarium specimen of S. altissima(B), and a potted plant of S. decurrens(C).

Supplementary Figure 2

Sequence alignment of chloroplast genomes with accession Solidago canadensis 21 as a reference. The y-scale indicates sequence identity from 50% to 100%. Gray arrows indicate the positions and directions of each gene. Red indicates non-coding sequences (CNS); Purple indicates the exons of protein-coding genes (exon); lime green indicates tRNAs and rRNAs.

Supplementary Figure 3

Nucleotide diversity analysis between Chinese and European/North American individuals of the S. canadensis L. complex.

Supplementary Figure 4

Partial alignment of three of the 10 positively selected genes with posterior probabilities higher than 0.95. (A–C) Partial aligned amino acid sequences of the rpoB, rps19, and ndhD genes, respectively. The red blocks represent amino acids with a high posterior probability.

References

  • 1

    ArduraA. (2019). Species-specific markers for early detection of marine invertebrate invaders through eDNA methods: Gaps and priorities in GenBank as database example. J. Nat. Conserv.47, 5157. doi: 10.1016/j.jnc.2018.11.005

  • 2

    AshworthV. (2017). Revisiting phylogenetic relationships in Phoradendreae (Viscaceae): utility of the trnL-F region of chloroplast DNA and presence of a homoplasious inversion in the intergenic spacer1. Botany95, 247258. doi: 10.1139/cjb-2016-0241

  • 3

    BeaudryJ. R.ChabotD. L. (1957). Studies on Solidago L. I. S. altissima L. and S. canadensis L. Contr. Inst. Bot. Univ.Montréal70, 65–72.

  • 4

    BeckJ. B.SempleJ. C. (2015). Next-generation sampling: pairing genomics with herbarium specimens provides species-level signal in Solidago (Asteraceae). Appl. Plant Sci.3, 1500014. doi: 10.3732/apps.1500014

  • 5

    BolgerA. M.LohseM.UsadelB. (2014). Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics30, 21142120. doi: 10.1093/bioinformatics/btu170

  • 6

    CroatT. (1972). Solidago canadensis complex of the great plains. Brittonia24, 317326. doi: 10.2307/2805669

  • 7

    CvijovicI.GoodB. H.DesaiM. M. (2018). The effect of strong purifying selection on genetic diversity. Genetics209, 12351278. doi: 10.1534/genetics.118.301058

  • 8

    DanecekP.BonfieldJ. K.LiddleJ.MarshallJ.OhanV.PollardM. O.et al. (2021). Twelve years of SAMtools and BCFtools. Gigascience10, giab008. doi: 10.1093/gigascience/giab008

  • 9

    DierckxsensN.MardulynP.SmitsG. (2017). NOVOPlasty: de novo assembly of organelle genomes from whole genome data. Nucleic Acids Res.45, e18. doi: 10.1093/nar/gkw955

  • 10

    DongM.LuB.-R.ZhangH.-B.ChenJ.-K.LiB. (2006). Role of sexual reproduction in the spread of an invasive clonal plant Solidago canadensis revealed using intersimple sequence repeat markers. Plant Spec. Biol.21, 1318. doi: 10.1111/j.1442-1984.2006.00146.x

  • 11

    DongW.XuC.LiD.JinX.LiR.LiuQ.et al. (2016). Comparative analysis of the complete chloroplast genome sequences in psammophytic Haloxylon species (Amaranthaceae). PeerJ4, e2699. doi: 10.7717/peerj.2699

  • 12

    DoyleJ. J.DoyleJ. L. (1987). A rapid DNA isolation procedure for small quantities of fresh leaf tissue. Phytochem. Bull.19, 1115.

  • 13

    FernaldM. L. (1950). Gray’s manual of botany. 8th edition (New York: American Book Co).

  • 14

    FrazerK. A.PachterL.PoliakovA.RubinE. M.DubchakI. (2004). VISTA: computational tools for comparative genomics. Nucleic Acids Res.32, W273W279. doi: 10.1093/nar/gkh458

  • 15

    GaoF.ChenC.ArabD. A.DuZ.HeY.HoS. Y. W. (2019). EasyCodeML: a visual tool for analysis of selection using CodeML. Ecol. Evol.9, 38913898. doi: 10.1002/ece3.2019.9.issue-7

  • 16

    GreinerS.LehwarkP.BockR. (2019). OrganellarGenomeDRAW (OGDRAW) version 1.3.1: expanded toolkit for the graphical visualization of organellar genomes. Nucleic Acids Rese.47, W59W64. doi: 10.1093/nar/gkz238

  • 17

    GuoY.-Y.YangJ.-X.BaiM.-Z.ZhangG.-Q.LiuZ.-J. (2021). The chloroplast genome evolution of Venus slipper (Paphiopedilum): IR expansion, SSC contraction, and highly rearranged SSC regions. BMC Plant Biol.21, 248. doi: 10.1186/s12870-021-03053-y

  • 18

    HerewardJ. P.WerthJ. A.ThornbyD. F.KeenanM.ChauhanB. S.WalterG. H. (2017). Complete chloroplast genome of glyphosate resistant Conyza bonariensis (L.) Cronquist from Australia. Mitochondrial DNA Part B2, 444445. doi: 10.1080/23802359.2017.1357441

  • 19

    HuX. (2020). Characterization of the complete chloroplast genome of Aster subulatus Michx. Mitochondrial DNA Part B5, 597598. doi: 10.1080/23802359.2019.1710599

  • 20

    HuangH.GuoS. (2004). Review on ecological studies on three invasive species of European genus Solidago. Guangxi Sci.11, 6974.

  • 21

    JiangD.CaiX.GongM.XiaM.XingH.DongS.et al. (2023). Complete chloroplast genomes provide insights into evolution and phylogeny of Zingiber (Zingiberaceae). BMC Genomics24, 30. doi: 10.1186/s12864-023-09115-9

  • 22

    JiangH.FangF.SuoS. (2008). Influences of parasitism by Cuscuta japonica plants on eco-physiological characteristics of Solidago canadensis. Acta Ecol. Sin.1, 399406.

  • 23

    KatohK.StandleyD. M. (2013). MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol. Biol. Evol.30, 772780. doi: 10.1093/molbev/mst010

  • 24

    KryazhimskiyS.PlotkinJ. (2008). The population genetics of dN/dS. PloS Genet.4, e1000304. doi: 10.1371/journal.pgen.1000304

  • 25

    LiC.CaiC.TaoY.SunZ.JiangM.ChenL.et al. (2021). Variation and evolution of the whole chloroplast genomes of Fragaria spp. (Rosaceae). Front. Plant Sci.12, 754209. doi: 10.3389/fpls.2021.754209

  • 26

    LiD.-M.LiJ.WangD.-R.XuY.-C.ZhuG.-F. (2021). Molecular evolution of chloroplast genomes in subfamily Zingiberoideae (Zingiberaceae). BMC Plant Biol.21, 558. doi: 10.1186/s12870-021-03315-9

  • 27

    LiH.DurbinR. (2009). Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics25, 17541760. doi: 10.1093/bioinformatics/btp324

  • 28

    LibradoP.RozasJ. (2009). DnaSP v5: a software for comprehensive analysis of DNA polymorphism data. Bioinformatics25, 14511452. doi: 10.1093/bioinformatics/btp187

  • 29

    LikhanovA.OliinykM.PashkevychN.ChurilovA.KozyrM. (2021). The role of flavonoids in invasion strategy of Solidago canadensis L. Plants10, 1748. doi: 10.3390/plants10081748

  • 30

    LinH.ChenL.LiJ. (2023). Multiple introductions and distinct genetic groups of Canada goldenrod (Solidago canadensis) in China revealed by genomic single-nucleotide polymorphisms. Plants12, 1734. doi: 10.3390/plants12091734

  • 31

    LoudonJ. C. (1850). An Encyclopaedia of gardening. 2nd edition. Longman publishing.

  • 32

    MaL.QiangS. (2007). Identification and classification of cut-flower Huangyinghua using DNA marker techniques. Acat Phytotaxonomica Sin.45, 497504. doi: 10.1360/aps06135

  • 33

    McNeilJ. (1976). Flora Europaea. Eds. SolidagoL.TutinT. G.HeywoodV. H.BurgesN. A.ValentineD. H.WaltersS. M.WebbD. A. (Cambridge: Cambridge University Press).

  • 34

    MelvilleM. R.MortonJ. K. (1982). A biosystematic study of the Solidago canadensis (Compositae) complex. I. The ontario populations. Can. J. Bot.60, 976997. doi: 10.1139/b82-123

  • 35

    MoseleyR. C.MewalalR.MottaF.TuskanG. A.HaaseS.YangX. (2018). Conservation and diversification of circadian rhythmicity between a model crassulacean acid metabolism plant Kalanchoë fedtschenkoi and a model C3 photosynthesis plant Arabidopsis thaliana. Front. Plant Sci.9, 1757. doi: 10.3389/fpls.2018.01757

  • 36

    NguyenL.-T.SchmidtH. A.von HaeselerA.MinhB. Q. (2015). IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol. Biol. Evol.32, 268274. doi: 10.1093/molbev/msu300

  • 37

    PeirsonJ. A.DickC. W.ReznicekA. A. (2013). Phylogeography and polyploid evolution of North American goldenrods (Solidago subsect. Humiles, Asteraceae). J. Biogeogr.40, 18871898. doi: 10.1111/jbi.2013.40.issue-10

  • 38

    PurcellS.NealeB.Todd-BrownK.ThomasL.FerreiraM. A. R.BenderD.et al. (2007). PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet.81, 559575. doi: 10.1086/519795

  • 39

    SempleJ. C.CookR. E. (2006). “Solidago linnaeus,” (Oxford Univ. Press, New York), 107166.

  • 40

    SempleL. C.McMinn-SauderH.StoverM.LemmonA. R.LemmonE. M.BeckJ. B. (2023). Goldenrod herbariomics: Hybrid-sequence capture reveals the phylogeny of Solidago. Am. J. Bot.110, e16164. doi: 10.1002/ajb2.v110.7

  • 41

    SempleJ. C.RahmanH.BzovskyS.SorourM. K.KomobisK.Lopez LaphitzR.et al. (2015). A multivariate morphometric study of the Solidago altissima complex and S. canadensis (Asteraceae: Astereae). Phytoneuron2015, 131.

  • 42

    ShotaS.TakumaK.RyutaK.MasayukiM.TakakoN.NaokoI.et al. (2018). Phylogeographic analysis of the East Asian goldenrod (Solidago virgaurea complex, Asteraceae) reveals hidden ecological diversification with recurrent formation of ecotypes. Ann. Bot.121, 489500. doi: 10.1093/aob/mcx182

  • 43

    SzymuraM.SzymuraT. H. (2013). Soil preferences and morphological diversity of goldenrods (Solidago L.) from south-western Poland. Acta Soc Bot. Pol.82, 107115. doi: 10.5586/asbp.2013.005

  • 44

    SzymuraM.SzymuraT. H. (2015). Growth, phenology, and biomass allocation of alien Solidago species in central Europe. Plant Spec. Biol.30, 245256. doi: 10.1111/psbi.2015.30.issue-4

  • 45

    SzymuraM.SzymuraT. H. (2016). Historical contingency and spatial processes rather than ecological niche differentiation explain the distribution of invasive goldenrods (Solidago and Euthamia). Plant Ecol.217, 565582. doi: 10.1007/s11258-016-0601-1

  • 46

    SzymuraM.SzymuraT. H.KreitschitzA. (2015). Morphological and cytological diversity of goldenrods (Solidago L. and Euthamia Nutt.) from south-western Poland. Biodiv. Res. Conserv.38, 4149. doi: 10.1515/biorc-2015-0010

  • 47

    TaoL.DuanH.TaoK.LuoY.LiQ.LiL. (2023). Complete chloroplast genome structural characterization of two Phalaenopsis (Orchidaceae) species and comparative analysis with their alliance. BMC Genomics24, 359. doi: 10.1186/s12864-023-09448-5

  • 48

    TianZ.ChengJ.XuJ.FengD.ZhongJ.YuanX.et al. (2023). Cytogeography of naturalized Solidago canadensis populations in Europe. Plants12, 1113. doi: 10.3390/plants12051113

  • 49

    TillichM.LehwarkP.PellizzerT.Ulbricht-JonesE. S.FischerA.BockR.et al. (2017). GeSeq – versatile and accurate annotation of organelle genomes. Nucleic Acids Res.45, W6W11. doi: 10.1093/nar/gkx391

  • 50

    Van der AuweraG.CarneiroM. O.HartlC.PoplinR.AngelG. D.Levy-MoonshineA.et al. (2013). From fastq data to high-confidence variant calls: The genome analysis toolkit best practices pipeline. Curr. Protoc. Bioinf.43, 11.10.1-11.10.33. doi: 10.1002/0471250953.2013.43.issue-1

  • 51

    Van KleunenM.PyšekP.DawsonW.EsslF.KreftH.PerglJ.et al. (2019). The global naturalized alien Flora (Glo NAF) database. Ecology100, e02542. doi: 10.1002/ecy.2542

  • 52

    Van KleunenM.SchmidB. (2003). No evidence for an evolutionary increased competitive ability in an invasive plant. Ecology84, 28162823. doi: 10.1890/02-0494

  • 53

    VargasO. M.OrtizE. M.SimpsonB. B. (2017). Conflicting phylogenomic signals reveal a pattern of reticulate evolution in a recent high-Andean diversification (Asteraceae: Astereae: Diplostephium). New Phytol.214, 17361750. doi: 10.1111/nph.2017.214.issue-4

  • 54

    VerlooveF.ZonneveldB. J. M.SempleJ. C. (2017). First evidence for the presence of invasive Solidago altissima (Asteraceae) in Europe. Willdenowia47, 6975. doi: 10.3372/wi.47.47107

  • 55

    WangX.BaiS.ZhangZ.ZhengF.SongL.WenL.et al. (2023). Comparative analysis of chloroplast genomes of 29 tomato germplasms: genome structures, phylogenetic relationships, and adaptive evolution. Front. Plant Sci.14, 1179009. doi: 10.3389/fpls.2023.1179009

  • 56

    WeberE. (1997). Morphological variation of the introduced perennial Solidago canadensis L. sensu lato (Astera ceae) in Europe. Bot. J. Linn. Soc123, 197210. doi: 10.1111/j.1095-8339.1997.tb01413.x

  • 57

    WeberE. (2000). Biological flora of central Europe: Solidago altissima L. Flora195, 123134. doi: 10.1016/S0367-2530(17)30960-X

  • 58

    WeberE. (2017). Invasive plant species of the world: a reference guide to environmental weeds. 2nd ed. (Wallingford: CABI publishing).

  • 59

    WeberE.SchmidB. (1998). Latitudinal population differentiation in two species of Solidago (Asteraceae) introduced into Europe. Am. J. Bot.85, 1110. doi: 10.2307/2446344

  • 60

    WenF.WuX.LiT.JiaM.LiuX.LiaoL. (2021). The complete chloroplast genome of Stauntonia chinensis and compared analysis revealed adaptive evolution of subfamily Lardizabaloideae species in China. BMC Genomics22, 161. doi: 10.1186/s12864-021-07484-7

  • 61

    WickhamH. (2016). ggplot2: Elegant Graphics for Data Analysis (Switzerland Springer: Springer-Verlag New York).

  • 62

    WilliamsM.ZapataL.WernerB.BarnesC.SottorivaA.GrahamT. (2020). Measuring the distribution of fitness effects in somatic evolution by combining clonal dynamics with dN/dS ratios. Elife9, e48714. doi: 10.7554/eLife.48714

  • 63

    WuH.QiangS. (2005). Bio-ecological characteristics of Solidago canadensis L. and its control. Weed Sci.1, 5256.

  • 64

    XiaX.PengJ.YangL.ZhaoX.DuanA.WangD. (2022). Comparative analysis of the complete chloroplast genomes of eight Ficus species and insights into the phylogenetic relationships of Ficus. Life12, 848. doi: 10.3390/life12060848

  • 65

    XuH.QiangS.GenovesiP.DingH.WuJ.MengL.et al. (2012). An inventory of invasive alien species in China. NeoBiota15, 126. doi: 10.3897/neobiota.15.3575

  • 66

    YangZ.NielsenR. (2002). Codon-substitution models for detecting molecular adaptation at individual sites along specific lineages. Mol. Biol. Evol.19, 908917. doi: 10.1093/oxfordjournals.molbev.a004148

  • 67

    YangZ.WongW. S. W.NielsenR. (2005). Bayes empirical bayes inference of amino acid sites under positive selection. Mol. Biol. Evol.22, 11071118. doi: 10.1093/molbev/msi097

  • 68

    YangR.ZanS.TangJ.ChenX. (2011). Invasion mechanisms of Solidago canadensis L.: a review. Acta Ecol. Sin.31, 11841195.

  • 69

    ZhangM.ChenN. (2022). Comparative analysis of Thalassionema chloroplast genomes revealed hidden biodiversity. BMC Genomics23, 327. doi: 10.1186/s12864-022-08532-6

  • 70

    ZhaoS. Y.SunS. G.DaiC.GituruR. W.ChenJ. M.WangQ. F. (2015). Genetic variation and structure in native and invasive Solidago canadensis populations. Weed Res.55, 163172. doi: 10.1111/wre.2015.55.issue-2

  • 71

    ZhouH.SheR.XuY.ZhaoP.ZhangS. (2019). Characterization of the complete chloroplast genome of sunflower family species Aster Flaccidus (Compositae). Mitochondrial DNA Part B4, 40964097. doi: 10.1080/23802359.2019.1692710

Summary

Keywords

Solidago, chloroplast genome, phylogenetic analysis, introduction pathways, molecular marker

Citation

Tao Y-T, Chen L-X, Jiang M, Jin J, Sun Z-S, Cai C-N, Lin H-Y, Kwok A, Li J-M and van Kleunen M (2024) Complete chloroplast genome data reveal the existence of the Solidago canadensis L. complex and its potential introduction pathways into China. Front. Plant Sci. 15:1498543. doi: 10.3389/fpls.2024.1498543

Received

19 September 2024

Accepted

02 December 2024

Published

20 December 2024

Volume

15 - 2024

Edited by

Marcial Escudero, Sevilla University, Spain

Reviewed by

Kyong-Sook Chung, Jungwon University, Republic of Korea

Jose Planta, University of the Philippines Diliman, Philippines

Updates

Copyright

*Correspondence: Jun-Min Li,

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics