Invasive Chloroplast Population Genetics of Mikania micrantha in China: No Local Adaptation and Negative Correlation between Diversity and Geographic Distance

Two fundamental questions on how invasive species are able to rapidly colonize novel habitat have emerged. One asks whether a negative correlation exists between the genetic diversity of invasive populations and their geographic distance from the origin of introduction. The other is whether selection on the chloroplast genome is important driver of adaptation to novel soil environments. Here, we addressed these questions in a study of the noxious invasive weed, Mikania micrantha, which has rapidly expanded in to southern China after being introduced to Hong Kong in 1884. Seven chloroplast simple sequence repeats (cpSSRs) were used to investigate population genetics in 28 populations of M. micrantha, which produced 39 loci. The soil compositions for these populations, including Mg abundance, were measured. The results showed that M. micrantha possessed relatively high cpSSR variation and differentiation among populations. Multiple diversity indices were quantified, and none was significantly correlated with distance from the origin of introduction. No evidence for “isolation by distance,” significant spatial structure, bottlenecks, nor linkage disequilibrium was detected. We also were unable to identify loci on the chloroplast genome that exhibited patterns of differentiation that would suggest adaptive evolution in response to soil attributes. Soil Mg had only a genome-wide effect instead of being a selective factor, which highlighted the association between Mg and the successful invasion. This study characterizes the role of the chloroplast genome of M. micrantha during its recent invasion of southern China.


INTRODUCTION
Mikania micrantha H. B. K. (Asteraceae) represents one of the world's 100 worst weeds (Holm et al., 1977). It is a perennial vine with both sexual and vegetative reproduction. The invasion history of M. micrantha in southern China can be clearly traced with voucher records. Its first appearance was documented at Hong Kong Zoological and Botanical Gardens in 1884 (Wang et al., 2003).
After becoming naturalized by 1919, the weed started to spread at an alarming rate (Wang et al., 2003). In 1984, M. micrantha reached near Shenzhen (Wang et al., 2003), from where it rapidly spread throughout Guangdong, and then on to other southern and central China areas (Zhang L.Y. et al., 2004).
During colonization, invasive species are expected to experience sharp decreases in effective population size during founder events, which may have an important genetic impact (Dlugosch and Parker, 2008). Indeed, many invasive species are found to have reduced genetic variation in their introduced ranges (Dlugosch and Parker, 2008). In principle, a negative correlation is expected between the genetic diversity of a given invasive population and its distance from the origin of introduction in the invaded areas (Prugnolle et al., 2005). However, empirical results demonstrate that population genetic patterns can be complex and deviate from this expectation. For instance, a high level of genetic variation was revealed in the introduced populations of M. micrantha by using AFLP markers . Theoretical analyses have shown that factors, including high levels of growth rate, migration (gene flow), and dispersal distance, all can reduce the predicted effects of successive founder events and limit the loss of genetic diversity (Austerlitz et al., 1997). Besides, multiple introductions from a single or multiple source populations may also buffer the loss of genetic diversity (Dlugosch and Parker, 2008;Sakata et al., 2015). As the introduction of M. micrantha in southern China has an explicit origin, it provides an ideal subject to exam how genetic diversity evolves and is structured during the colonization process. Moreover, we have also conducted spatial autocorrelation analysis to estimate the spread potential of this invasive weed (Fitzpatrick et al., 2012).
Of diverse ecological factors that impact the population genetic patterns of invasive plants, soil is a crucial one that should be given special consideration. Soil provides essential nutrients for plants and has a decisive influence on their local adaptation (Hancock et al., 2011). Changes in soil metal concentrations therefore have the potential to drive local genetic adaptation (Alberto et al., 2010). Indeed, genetic differentiation in response to soil factors has been well documented in a wide range of plant taxa (Anacker et al., 2011;Misiewicz and Fine, 2014). M. micrantha has exhibited different soil preferences in invasive areas (Zhang L.Y. et al., 2004). Unlike in its native range, the weed grows on both dry soils and shady, more mesic sites in southern China (Zhang L.Y. et al., 2004), whose favorable soil moisture has consequently increased to higher than 15% (Huang et al., 2000). More importantly, the expansion of M. micrantha can also induce changes in the soil nutrient content, which in turn promotes growth (Liu et al., 2012). Therefore, the initial differences and subsequent changes of soil composition in invasive regions may act as selective factors, driving adaptive genetic differentiation. We have identified adaptive AFLP loci in the invasive populations of M. micrantha .
Chloroplasts are essential cellular organelles, which have a decisive effect on the growth, development, and biotic defense of plants (Nie et al., 2012). Tolerances of environmental stress, including drought, high salinity, extremes of temperature, heavy metals, and high light, are all known associated with the function of chloroplast (cp) genomes (Fitzgerald et al., 2011). For example, it has been revealed that high levels of chloroplast genomic diversity are important in allowing populations of weeping ricegrass to adapt to both warm and dry climatic conditions (Fitzgerald et al., 2011). Nonetheless, whether the cp genome is involved in the adaptation of invasive plants to novel environments remains unclear. Chloroplast DNA variation can be effectively detected by chloroplast simple sequence repeats (cpSSRs). These cpSSRs are uniparentally inherited and nonrecombinant. They are particularly efficient for the detection of bottleneck effects, because the cp genome is haploid (Rodriguez et al., 2013). To complement the analysis of nuclear genome for understanding the response of M. micrantha to local edaphic conditions , we have identified variable cpSSRs and evaluated associations among haplotypes and soil characteristics.
In the present study, we have measured the soil composition and conducted a cpSSR assay for the invasive populations of M. micrantha in southern China. We focus to test the following hypotheses: (i) whether a negative correlation exists between the genetic diversity of a population and its geographic distance from the origin of introduction; (ii) are there any adaptive cpSSR loci that are significantly associated with soil attributes? This study will provide additional information to improve our ability to predict the invasion of M. micrantha in southern China and mitigate its impact.

Plant Materials and Soil Sampling
Twenty-eight M. micrantha populations covering its entire introduced range in southern China were sampled as described by Wang et al. (2012). The plant materials were collected and processed as described in the previous study ; Figure 1). To obtain environmental data, three soil samples per population were randomly selected and analyzed. Each soil sample, of about 0.5 g, was taken from a depth of 10-15 cm.

Soil Composition Analysis
Soil samples were air dried and ground fine enough to pass through a 1.0 mm sieve. The measurement of soil pH was performed in suspension at a ratio of 2:5 (W/V) of soil to deionized water using a pH meter (PHSJ3F, INESA, Shanghai, China). Electrical conductivity was measured using a conductivity meter (model DDS-307A, INESA, Shanghai, China), soil moisture was determined by oven drying for 6 h at 105 • C, and soil organic matter content was measured by the wet combustion method. Total nitrogen and total phosphate were digested with H 2 SO 4 + HClO 4 at a ratio of 5:1 (W/V) and then determined using the indophenol-blue and molybdenum-blue method, respectively. Total soil Al, Ca, Cu, Fe, K, Mg, Mn, Na, Si, and Zn concentrations were determined by using inductively coupled plasma atomic emission spectrometry (ICP-AES) after acid digestion.

DNA Extraction and cpSSR Protocols
DNA extraction methods were presented in detail in a previous study . To optimize the cpSSR conditions, a total of 34 cp microsatellite loci were preliminarily screened. We chose eight polymorphic cp microsatellites ( Table 1). Only cpSS 02 was an anonymous marker. It is unclear whether cpSS 02 is located in a coding or non-coding region of the cp genome. Polymerase chain reaction (PCR) was performed in a 20 µL final volume containing 1 × buffer, 25 Mm MgCl 2 , 10 Mm each of dNTP, 0.5 U Taq polymerase, 5 mm of each primer, and 100 ng template DNA. The PCR program was as follows: an initial denaturation of 5 min at 94 • C, followed by 35 cycles of denaturation at 94 • C for 60 s, annealing at 55 • C for 60 s, and elongation at 72 • C for 120 s, and a final elongation of 72 • C for 5 min. PCRs were conducted on a PTC-100 Peltier Thermal Cycler (MJ Research). The PCR products were sized on an ABI 377 automated sequencer (Applied Biosystems) using ROX-500 as the size marker.

Statistical Analysis
Raw fluorescent cpSSR data were collected and scored using software GeneScan 3.7 (Applied Biosystems) and genographer (version 1.6.0) 1 . GenALEx 6.3 was used to calculate population genetic parameters including percentage of polymorphic loci, observed number of alleles, effective number of alleles, Nei's gene diversity, and Shannon's Information index. The occurrence of null alleles in cpSSR data was determined using MICRO-CHECKER version 2.2.3.
Based on cpSSR data, we measured the population differentiation among and within populations using ARLEQUIN 3.0. Analysis of molecular variance (AMOVA) based on one thousand random permutations were performed at three levels: regional, among-population, and within-population. Also using the same software, isolation by distance, which is the correlation between geographical distance (km) and genetic differentiation among populations, was investigated using the Mantel test. The pairwise θ B estimates, which can be obtained by HICKORY v1.0, comprised the matrix of genetic differentiation. To investigate correlation between genetic variation and dissimilarity in soil composition, a partial Mantel test was also performed to evaluate dependence between these variables implemented in GenAlEx (9999 permutations), controlling for the effect of geographic distance.
We used SPAGeDi v. 1.3 (Hardy and Vekemans, 2002) to estimate Moran's I, which quantifies the correlation between allelic frequency and spatial distance classes. The distance classes were 0.1, 0.2, 0.3, 0.4, 0.5, 0.6 and 0.7 km in order to obtain a high spatial resolution and to assure a sufficient number of individual pairs per distance class. Tests of significance were carried out against 9999 random permutations and 95% confidence intervals (CIs) were estimated.
Pearson correlation and linear regression analyses were performed to estimate each diversity parameter against the distance from Hong Kong Zoological and Botanical Gardens, the inferred original invasive site of M. micrantha in Southern China (Wang et al., 2003). STRUCTURE 2.2.3 was used to further investigate the Bayesian clustering and assignment within the entire sample studied. Total 100 000 MCMC iterations and a 10 000 burnin period were performed with admixture model to search for best number of clusters based on 30 independent runs. The maximum-likelihood estimate method suggested by Evanno et al. (2005) was applied to determine the number of clusters.
Linkage disequilibrium (LD) across loci may bias the estimation of the population genetic variation and differentiation (Volis et al., 2005). In order to remove the effects of LD, the extent of LD between all pairs of cpSSR loci was examined by using TASSEL (Trait Analysis by aSSociation, Evolution, and Linkage) 3.0 (Bradbury et al., 2007). The evaluated statistics included the standardized disequilibrium coefficient (D') and the squared correlation coefficient (r 2 ). Their significance was determined by a two-tailed Fisher's exact test.
The program BOTTLENECK was used to test whether populations have suffered a bottleneck in the cp genome. The heterozygosity (Heq) expected at mutation-drift equilibrium was calculated based on both the stepwise mutation model (SMM) and the infinite allele model (IAM). The significance of heterozygosity excess was determined by the sign test.
We used two population genetic approaches to detect outlier loci, i.e., those that show a higher than expected differentiation between populations. First, the software Dfdist a frequentist method based on summary statistics of a symmetrical island model (i.e., drift-migration equilibrium). We used Dfdist to test for the neutrality of cpSSR markers. The 4 Nµ parameter value was set to 0.04 in all simulations. All 28 populations were used for a global analysis of M. micrantha. The outlier threshold was delimited by more restrictive significance levels of 0.005. In addition, to cross-check the reliability of the outlier loci detected by Dfdist, we also ran Bayescan analyses. Bayescan was run with 10 pilot runs of length 5000, followed by 50,000 iterations each, 5000 sample sizes, and 20 thinning intervals. Generally, outliers identified by both Dfdist and Bayescan are likely to be truly adaptive regions of the genome.
Program GESTE 2.0, a hierarchical Bayesian method was used to analyze the relationship between genetic structure and environmental factors at each population site. F ST values for each population and association of F ST to environmental factors were estimated using a generalized linear model. The posterior probabilities for each model were provided using a reversible jump MCMC approach based on a sample size of 30,000. The model with the highest posterior probability was the one that best explained the data. We used a burn in of 100,000 iterations with 10 pilot runs of 5,000 iterations and a thinning interval of 50.

Genetic Variation of cpSSR
Of the 34 chloroplast microsatellites screened, seven loci (RCt8, RCt9, WCt2, WCt11, WCt12, WCt14, and WCt22) were polymorphic and contributed to 39 alleles in M. micrantha. CpSS 02 was determined to be null alleles using the program Micro-Checker. We excluded this locus from further analyses. Genetic variation revealed with the cpSSR markers was high in M. micrantha. Across all of the populations, the percentage of polymorphic loci was 97.44%, and Nei's gene diversity and Shannon's Information Index were estimated to be 0.1686 and 0.2827, respectively.
At the population level, all but one measure (the number of polymorphic loci) of cpSSR genetic diversity were highest in population NLD26. Measures that were highest in NLD26 were: the observed number of alleles, the effective number of alleles, Nei's gene diversity, and Shannon's Information Index ( Table 2). In contrast, all estimates except effective number of alleles and Nei's gene diversity were lowest in population HK87 ( Table 2). At the region level, all estimates except effective number of alleles showed that region NLD had the highest level of genetic diversity. No region was found to have the least genetic variation across a majority of the estimates calculated.
At the region level, G ST values were 0.2625, 0.2172, 0.2150, 0.2061, 0.1510, and 0.0795 for regions NLD, MA, HK, SZ, ZH, and DG, respectively. That is, NLD had the most strongly differentiated populations, while DG populations were the least differentiated. Overall G ST among the 28 populations was 0.2880.
Analysis of molecular variance indicated that most (84.53%) of the variance was attributable to the differences within populations, 7.85% was accounted for by differences among populations within regions, while 7.62% was attributable to differences among regions ( Table 3). A random permutation test revealed that these variance partitions were all significant (P < 0.001). Wright's F ST was 0.1547.  In the STRUCTURE analysis, the clustering level, K = 2, yielded the largest delta-K-value ( K = 136.77835). Both of the clusters exhibited substantial admixture, which is visualized in Figure 2.
The matrix of genetic distances among populations was not significantly correlated with the corresponding matrix of geographical distances (Mantel test; r = 0.0711, P = 0.293). That is, no evidence of "isolation by distance" was detected. Because genetic diversity is expected to decrease during the invasion of new regions, regression of diversity on geographic distance from an inferred ancestral source population would reveal a negative correlation coefficient. However, we found that each diversity FIGURE 2 | Chloroplast genetic structure of M. micrantha populations based on structure. Each color represents a different cluster. Black lines separate populations. For population codes see Table 2. parameter had no significant correlation with increasing distance from Hong Kong Zoological and Botanical Gardens (Pearson's r: P > 0.05), which followed the linear regression models reported in Figure 3. From Hong Kong to Guangdong, the genetic diversity of M. micrantha decreased. However, genetic variation in NLD 26 reaches a maximum, then it gradually decreased in other areas. In addition, we also compared the dissimilarity of soil attributes among populations with their genetic distances with the Mantel test; there was no significant relation between these variables.
Moran's I was insignificant for all distance classes, indicating that the overall spatial pattern of genetic variation is not differentiable from random across the entire study region (Figure 4). Similarly, at a finer spatial scale, no significant value of I was detected across different distance classes, ranging from 0 to 0.8 km (Figure 4).

Linkage Disequilibrium and Bottleneck Test
All 39 cpSSR alleles genotyped on the 28 populations were tested for linkage disequilibrium (LD). No allelic pairs were found to be in significant LD based on D' and r 2 statistics ( Table 4).
When either the SMM or the IAM was assumed, no significant excess of heterozygosity was detected for any population up to and including at the level of the entire range ( Table 2).

Detection of Signatures of Positive Selection
We used global F ST simulations in an infinite-alleles model based on an island demographic model in Dfdist. We employed the 30% trimmed mean, which removes the highest and lowest 30% of F ST values. This trimmed mean served as the target mean F ST for coalescent simulations. In this study, no adaptive loci were identified at the 99.5% confidence level ( Figure 5A). Only loci under negative selection were detected. We adopted log 10 PO > 2.0 as a threshold for decisive evidence for accepting a model under selection, which corresponded to a posterior probability greater than 0.91. No locus was found to exhibit a divergence pattern that deviated from neutral expectation ( Figure 5B).

Measurement of Soil Components
Site pH values ranged from 4.07 to 8.13, and organic matter content ranged from 0.70 to 35.65%. Si concentration varied from 0.02 to 2.68 mg/g. In terms of metal concentrations, Mg, Zn, and Al concentrations varied greatly, ranging from 0.048 FIGURE 4 | Spatial autocorrelation analysis of genetic and geographical distance in M. micrantha using SPAGeDi. Dashed lines indicate 95% confidence intervals, assuming the null hypothesis that no autocorrelation exists, based on 9999 permutations.

Association with Environmental Variables
GESTE analyses were carried out on all cpSSR loci and environmental variables. Generalized linear models were run using all seven non-metal factors (soil pH, electrical conductivity, fresh water content in soil, air dried soil water content, soil organic matter, total nitrogen, and total phosphate), resulting in a total of 128 models. For 10 metal factors, a total of 1024 models were produced. Mg was found to have the highest probability model, 513 (Pr = 0.00390). Due to the fact that no adaptive loci were identified in the cp genome, Mg appeared to have only a genome-wide effect instead of being a selective factor.

DISCUSSION
Our results showed that M. micrantha in southern China possesses relatively high cpSSR variation and differentiation (Wright, 1978;Powell et al., 1995;Hamrick and Godt, 1996). The genetic diversity is highest in NLD in Shenzhen. Genetic diversity is expected to decrease from the origin of introduction to newly invaded areas (Fontaine et al., 2013). However, M. micrantha chloroplast variation did not conform to this hypothesis. Compared to other regions, the population genetic variation at Hong Kong Zoological and Botanical Gardens, the site documented as the origin of introduction, is the lowest. It seems that with naturalization in southern China, the genetic variation of M. micrantha has gradually increased in the novel invasive areas. The Mantel test detected no correlation between genetic and geographic distances. Moreover, the Moran's I correlogram also showed no spatial genetic structure. Both analyses suggest that the genetic variation is randomly distributed across the introduced populations of M. micrantha. Similar results have been obtained in an ISSR analysis of M. micrantha (Wang et al., 2008). The pattern of population genetic variation observed here may be closely linked with the ecological characteristics of M. micrantha. The weed produces enormous numbers of small and light seeds (1.7 × 10 5 /m 2 ; Zhang L.Y. et al., 2004). Although its seed dispersal takes place by wind, water and animals, dispersal by wind is thought to be the chief means of invasion into disturbed environments (Zhang L.Y. et al., 2004). In particular, violent winds have the potential to uplift and transport the seeds several hundred kilometers away (Zhang L.Y. et al., 2004). We infer that extensive gene flow mediated by wind-dispersed seeds is likely the primary factor causing the random distribution of genetic variation across the introduced region.
Besides seeds, M. micrantha can also reproduce vegetatively by producing shoots from stem fragments and rosettes (Zhang L.Y. et al., 2004). Vegetative reproduction tends to increase the occurrence of patches of genetically identical individuals (ramets), which will yield significant positive values of Moran's I for small distance classes in the correlogram. However, this is not the case for M. micrantha (Figure 3). Moreover, lack of spatial genetic structure also indicates that populations of M. micrantha are not subdivided into local demes or subpopulations that are consisted of interbreeding, closely related plants. These results, which suggest that dispersal and establishment via seed prevail over vegetative expansion, may be helpful for developing an efficient control program.
Invasive species are generally characterized with genetic bottlenecks (Garbelotto et al., 2013). In the previous, we have detected evidence for bottlenecks in the nuclear genome of M. micrantha . Compared with nuclear markers, cpSSRs are particularly efficient for the detection of bottleneck effects due to their haploid nature (Rodriguez et al., 2013). Surprisingly, however, no bottleneck was found in the M. micrantha cp genome, possibly attributing to the lack of the accumulation of mutations or recombinations (Garbelotto et al., 2013). In addition, gene flow through seed dispersal, which causes mixing of divergent chloroplasts, can also erase bottleneck effects. The lack of linkage disequilibrium in the M. micrantha populations may also be associated with the mixing of chloroplasts.
We also performed a scan of the M. micrantha cp genome by using cpSSRs. No adaptive loci were detected in this study. By contrast, 14 adaptive loci were identified in the previous AFLP genome scan . It seems that the local adaptation of M. micrantha is driven by nuclear rather than cp genome since its introduction to southern China. The likely underlying causes are as follows. First, angiosperm cp genomes are evolutionarily conserved and have lower rates of nucleotide substitution than nuclear genomes (Guisinger et al., 2008). Asteraceae cp genomes are known especially conservative (Nie et al., 2012). Their low levels of substitution input may limit the adaptability, although this supposition remains controversial (Harris et al., 2012;Bock et al., 2014). Second, most chloroplast genes of photosynthetic plants undergo strong negative rather than positive selection due to functional constraints (Guisinger et al., 2008). Adaptive evolution has only been detected in a few chloroplast genes (Erixon and Oxelman, 2008;Iida et al., 2009;Sen et al., 2012). Third, chloroplast is semi-autonomous. A chloroplast gene-encoding subunit often needs to assemble with a nuclear gene-encoding counterpart to form a functional complex (Kellogg and Juliano, 1997). Thus, if a substitution happens to occur in a chloroplast gene, a compensatory substitution(s) in its interacting nuclear gene would be needed to maintain the function of the whole complex (Jansen et al., 2007). This will impose extra selective pressure to restrain the occurrence of adaptive evolution. To better understand the issue, we have sequenced the entire cp genome from multiple accessions of M. micrantha (Huang et al., 2016).
In this study, we observed that soil Magnesium (Mg), instead of just being a selective factor, has a genome-wide effect on cpDNA. Mg is an essential element for plant growth and development, whose biological functions include key roles in photosynthesis, protein synthesis, and nucleotide metabolism (Gransee and Führs, 2013). In particular, Mg 2+ acts as an important signal in the regulation of key enzymes involved in the fixation of carbon in chloroplasts (Shaul, 2002). Small variation in Mg 2+ levels at the cytosol and the chloroplast can strongly affect the activity of photosynthetic enzymes (Shaul, 2002). More importantly, Mg is the central atom in the chlorophyll molecule. Its deficiency will result in mild to severe etiolation in leaves (Gransee and Führs, 2013). Our finding of the effect of Mg on the M. micrantha cp genome highlights the association between soil Mg and the successful invasion of the weed in southern China.

CONCLUSION
The present study is the first to report on chloroplast genomic diversity of the invasive weed M. micrantha. High genetic variability and differentiation, no linkage disequilibrium, and no severe bottlenecks were observed in the introduced populations of M. micrantha by using cpSSRs. Each genetic diversity parameter showed no significant correlation with increasing distance from the origin of introduction. Soil appeared not associated with the cpDNA variation of M. micrantha. Our results provide a framework for further exploring the cp genome evolution of M. micrantha.

AUTHOR CONTRIBUTIONS
TW designed and performed the experiments, and wrote the manuscript; ZW conducted data analysis and checked English grammar; GC performed the cpSSR experiment and soil analysis; CW performed data analysis; YS contributed to the supervision of the work and wrote the manuscript. All authors read and approved the final version of the manuscript.