Comparisons of Natural and Cultivated Populations of Corydalis yanhusuo Indicate Divergent Patterns of Genetic and Epigenetic Variation

Epigenetic variation may contribute to traits that are important in domestication, but how patterns of genetic and epigenetic variation differ between cultivated and wild plants remains poorly understood. In particular, we know little about how selection may shape epigenetic variation in natural and cultivated populations. In this study, we investigated 11 natural populations and 6 major cultivated populations using amplified fragment length polymorphism (AFLP) and methylation-sensitive AFLP (MS-AFLP or MSAP) markers to identify patterns of genetic and epigenetic diversity among Corydalis yanhusuo populations. We further explored correlations among genetic, epigenetic, alkaloidal, and climatic factors in natural and cultivated C. yanhusuo. We found support for a single origin for all cultivated populations, from a natural population which was differentiated from the other natural populations. The magnitude of F ST based on AFLP was significantly correlated with that for MSAP in pairwise comparisons in both natural and cultivated populations, suggesting a relationship between genetic and epigenetic variation in C. yanhusuo. This relationship was further supported by dbRDA (distance-based redundancy analyses) where some of the epigenetic variation could be explained by genetic variation in natural and cultivated populations. Genetic variation was slightly higher in natural than cultivated populations, and exceeded epigenetic variation in both types of populations. However, epigenetic differentiation exceeded that of genetic differentiation among cultivated populations, while the reverse was observed among natural populations. The differences between wild and cultivated plants may be partly due to processes inherent to cultivation and in particular the differences in mode of reproduction. The importance of epigenetic compared to genetic modifications is thought to vary depending on reproductive strategies, and C. yanhusuo usually reproduces sexually in natural environments, while the cultivated C. yanhusuo are propagated clonally. In addition, alkaloid content of C. yanhusuo varied across cultivated populations, and alkaloid content was significantly correlated to climatic variation, but also to genetic (6.89%) and even more so to epigenetic (14.09%) variation in cultivated populations. Our study demonstrates that epigenetic variation could be important in cultivation of C. yanhusuo and serve as a source of variation for response to environmental conditions.

Epigenetic variation may contribute to traits that are important in domestication, but how patterns of genetic and epigenetic variation differ between cultivated and wild plants remains poorly understood. In particular, we know little about how selection may shape epigenetic variation in natural and cultivated populations. In this study, we investigated 11 natural populations and 6 major cultivated populations using amplified fragment length polymorphism (AFLP) and methylation-sensitive AFLP (MS-AFLP or MSAP) markers to identify patterns of genetic and epigenetic diversity among Corydalis yanhusuo populations. We further explored correlations among genetic, epigenetic, alkaloidal, and climatic factors in natural and cultivated C. yanhusuo. We found support for a single origin for all cultivated populations, from a natural population which was differentiated from the other natural populations. The magnitude of F ST based on AFLP was significantly correlated with that for MSAP in pairwise comparisons in both natural and cultivated populations, suggesting a relationship between genetic and epigenetic variation in C. yanhusuo. This relationship was further supported by dbRDA (distance-based redundancy analyses) where some of the epigenetic variation could be explained by genetic variation in natural and cultivated populations. Genetic variation was slightly higher in natural than cultivated populations, and exceeded epigenetic variation in both types of populations. However, epigenetic differentiation exceeded that of genetic differentiation among cultivated populations, while the reverse was observed among natural populations. The differences between wild and cultivated plants may be partly due to processes inherent to cultivation and in particular the differences in mode of reproduction. The importance of epigenetic compared to genetic modifications is thought to vary depending on reproductive strategies, and C. yanhusuo usually reproduces sexually in natural environments, while the cultivated C. yanhusuo are propagated clonally. In addition, alkaloid content of C. yanhusuo varied across cultivated populations, and INTRODUCTION Cultivated plants possess major morphological and physiological changes compared to their wild progenitors (Darwin, 1868). Moreover, many quantitative traits in cultivated plants can become differentiated across the cultivated range due to selection in new environments and in response to different human preferences (Purugganan and Fuller, 2009;Fuller et al., 2011;Meyer and Purugganan, 2013). The wide range of possible phenotypes results from continuous rounds of artificial selection despite the fact that cultivated plants are presumed to have genetic bottlenecks inherent to strong artificial selection and genetic drift (Hyten et al., 2006;Berger et al., 2012;Meyer and Purugganan, 2013;Verde et al., 2013). While most studies of the inheritance of phenotypic differences in cultivated species have focused on DNA sequence differences (Caicedo et al., 2007;Hufford et al., 2012;Meyer and Purugganan, 2013;Lin et al., 2014;Zhou et al., 2015), recent work suggests that some of this increased phenotypic diversity could be epigenetically based (Hauben et al., 2009;Ji et al., 2015;Gallusci et al., 2017;Shen et al., 2018).
DNA methylation is one epigenetic mechanism that could contribute to organismal response to environmental conditions since DNA methylation modifications can affect transcription, and lead to phenotypic variation (Nicotra et al., 2010;Wibowo et al., 2016;Richards et al., 2017). Studies in both plant and animal species with low levels of genetic diversity have found that variation in DNA methylation was correlated to habitat type, exposure to stress, and shifts in species range suggesting that environmentally induced epigenetic changes may contribute to the ability to cope with environmental variation (Richards et al., 2012;Preite, 2014: Liebl et al., 2015;Xie et al., 2015). Further, several studies have found that epigenetic variation is not entirely explained by genetic variation, and that the amount of epigenetic variation often exceeds that of sequence variation among natural populations (e.g., Viola cazorlensis, Herrera and Bazaga, 2010;Laguncularia racemose, Lira-Medeiros et al., 2010;Fallopia spp., Richards et al., 2012;Viola elatior, Schulz et al., 2014). However, deciphering the relationship between genetic and epigenetic variation remains a significant challenge (Schulz et al., 2014;Alonso et al., 2016;Foust et al., 2016;Gugger et al., 2016;Herrera et al., 2017;Richards et al., 2017), and some studies have found that DNA methylation polymorphisms are explained by underlying genetic differences (Becker et al., 2011;Li et al., 2014;Dubin et al., 2015), and therefore do not provide any additional sources of phenotypic variation. Although epigenetic variation among wild plant populations has been well studied, the changes in epigenetic variation involved in cultivation of plants remain poorly understood (Piperno, 2017;Lu et al., 2020). In addition to studies comparing domesticated species to their wild progenitors (Li et al., 2012;Sauvage et al., 2017), studies of crop plants with both natural and cultivated populations will shed light on the importance of epigenetic and genetic variation in domestication.
Vegetative propagation has been a customary technique in modern cultivation, including for economically important crops like manioc, potato, sugarcane, sweet potato, taro, and yams (Denham et al., 2020). Many of the economically important crops that are vegetatively propagated, are derived from wild type progenitors that largely reproduce sexually (Denham et al., 2020). While recent evidence is equivocal about the loss of genetic diversity through domestication for outcrossing grain crops (Meyer and Purugganan, 2013;Allaby et al., 2019), genetic diversity in vegetatively propagated plants can be much lower and many vegetatively propagated plants lose the capacity for sexual reproduction (Denham et al., 2020). As for many clonal species, phenotypic plasticity may also be important for clonally propagated crops (Denham et al., 2020), and such plasticity may be mediated by epigenetic effects (Herman et al., 2014;Herman and Sultan, 2016;Richards et al., 2017;Banta and Richards, 2018). In fact, previous studies on a variety of species have revealed that the dynamics of plant responses that are mediated by DNA methylation may differ between sexual and asexual plants (Verhoeven and Preite, 2014). This could be partly due to the fact that much of DNA methylation is reset during sexual reproduction of plants (Jullien and Berger, 2010;Gehring, 2019), which may restrict the transmission to the next generation of epigenetic variation caused by environmental induction (Feng and Jacobsen, 2011;Heard and Martienssen, 2014;Tricker, 2015;Wibowo et al., 2018). However, such reprogramming is bypassed in plants that reproduce asexually, suggesting that epigenetic changes induced by environmental conditions could be maintained in asexual plants over generations (Richards et al., 2012;Verhoeven and Preite, 2014;Dodd and Douhovnikoff, 2016;Gonzaĺez et al., 2018). This maintenance of epigenetic variation through asexual reproduction may be important in the process of adaptation, contributing to the short-term evolution of asexual plants (Verhoeven and Preite, 2014;Dodd and Douhovnikoff, 2016). Thus, the role of DNA methylation in adaptation may differ between wild and clonally propagated crops, but the comparison of genetic and epigenetic variation among sexual and asexual crops has not yet been fully investigated.
Corydalis yanhusuo W.T. Wang ex Z.Y. Su et C.Y. Wu is a diploid (2n = 32), perennial herb endemic to China (Xu and Wei, 1994;Qiu et al., 2009). The well-known traditional Chinese medicine, Rhizoma corydalis is made from the dried rhizomes of C. yanhusuo which contains several important alkaloids that promote blood activation and pain relief (Pharmacopoeia Committee of P. R. China, 2010). The concentrations of the alkaloids tetrahydropalmatine, protopine, palmatinem, and berberine are the most likely phenotypes under selection for these medicinal purposes. At present, C. yanhusuo is planted widely across China, with several major C. yanhusuo-producing regions in Zhejiang, Shanxi, and Jiangsu provinces. The cultivation history of C. yanhusuo can be traced back to the Ming Dynasty when it was originally planted in Zhejiang Province and then gradually introduced to other regions of China (Qiu et al., 2009). In the production of cultivated C. yanhusuo, the tubers are usually planted in October, propagated through asexual reproduction, and harvested in May before flowering or fruit set. On the other hand, in natural environments C. yanhusuo usually reproduces sexually (Qiu et al., 2009). Due to human overexploitation and environmental degradation, natural C. yanhusuo populations are decreasing, and its range is shrinking. At present, the natural distribution of C. yanhusuo is limited to hilly areas in the middle and lower reaches of the Yangtze River. In this study, we investigated natural and cultivated populations of C. yanhusuo to examine how genetic and epigenetic variation is correlated to climatic differences in the two types of populations. Using epigenetic, genetic, environmental, and phenotypic (alkaloids) data we: (1) examined the differences between natural and cultivated C. yanhusuo in genetic and epigenetic variation and climatic conditions; and (2) explored the association between genetic, epigenetic, and climatic variation and alkaloid content in cultivated C. yanhusuo. With these data, we tested the hypothesis that genetic differentiation is lower and epigenetic differentiation is higher in cultivated than natural populations. We also tested the hypothesis that epigenetic variation is predicted partly by climatic conditions and highly correlated to alkaloid production in cultivated populations supporting the idea that epigenetic variation is an important source of phenotypic variation in cultivated plants.

Plant Materials and DNA Isolation
According to the Flora of China (http://www.efloras.org/index. aspx), C. yanhusuo has two synonyms: C. turtschaninovii Besser f. yanhusuo Y. H. Chou & C. C. Hsu and C. tertici (Nakai) Nakai f. yanhusuo (Y. H. Chou & C. C. Hsu) Y. C. Zhu. In this study, we collected leaves of approximately 15 to 20 individuals of wild C. yanhusuo from the 11 known natural populations ( Figure 1; Table 1). The distances between neighboring plants exceeded 15 m. In addition, we obtained leaves and rhizomes from 14 to 20 individuals of C. yanhusuo from 6 cultivated populations representing the major C. yanhusuo-producing regions in China (CG, GZ, HQ, HZ, JS, and QX; Figure 1, Table 1). Leaf tissues were ground in liquid nitrogen, and genomic DNA was extracted using DNA Plantzol Reagent (Invitrogen, Carlsbad, CA, USA) according to the manufacturer's protocol. The rhizomes from cultivated populations were dried at 60°C to a constant weight and prepared for alkaloid analysis.

Molecular Analysis
We screened 201 individuals of C. yanhusuo from 11 natural populations and 112 individuals from 6 cultivated populations for analysis of genetic and epigenetic variation. We used the amplified fragment length polymorphism (AFLP) technique of Vos et al. (1995) to analyze genetic variation. Genomic DNA was digested with EcoRI/MseI, followed by pre-and selective amplification. Eight fluorescently labeled primer pairs were used for selective amplification (Table S2), and the products were resolved by Genewiz, Inc. (Suzhou, China) on an ABI3730 instrument. Fragments of 69 to 450 bp were identified using GENEMARKER ver. 2.2.0 (SoftGenetics, State College, PA, USA), and transformed into a binary character matrix, with "1" indicating presence and "0" indicating absence.
We used the methylation-sensitive AFLP (MS-AFLP or MSAP) method of Reyna-López et al. (1997) with slight modification (Shi et al., 2017) to analyze epigenetic variation. MSAP is similar to AFLP except for double digestion with EcoRI/ MspI or EcoRI/HpaII in place of EcoRI/MseI as the frequent cutter. MspI and HpaII are isoschizomers, but have different sensitivities to methylation of cytosine within the tetranucleotide recognition sequence 5′-CCGG-3′. Cleavage by MspI is blocked if the outer cytosines are hemi-or fully methylated, and cleavage by HpaII is blocked if the outer cytosine is hemi-methylated or the inner cytosine is methylated on both strands. If both cytosines are methylated, cleavage by both enzymes is blocked. Eight fluorescently labeled primer pairs were used for selective amplification (Table S3). Similar to AFLP, we scored the presence or absence of epigenetic fragments of 69 to 450 bp for each enzyme combination. Next, the epigenetic fragments were transformed into a binary character matrix. Four types of methylation can be defined: type 1 when a fragment is present in both MspI and HpaII reactions (1,1) or there is no methylation; type 2 when a fragment is present in MspI but absent in HpaII (1,0), indicating full methylation of internal cytosines; type 3 when a fragment is absent in MspI, but present in HpaII (0,1) indicating hemimethylation of the external cytosine and type 4 when there is no fragment in either MspI or HpaII reactions (0,0) due to either full methylation of cytosines or sequence polymorphism at the recognition site (Schulz et al., 2013). We pooled these methylation states into two categories: methylated (type 2 and type 3 coded as 1) and unmethylated (type 1 coded as 0) to create a new data matrix and used this matrix to compute epigenetic parameters. We treated type 4 as missing data, because the methylation state cannot be distinguished from sequence polymorphism (Foust et al., 2016).

Meteorological Data
We collected meteorological data for 10 of the 11 naturalpopulation regions (excluding TT) and the 6 cultivation regions from the National Climate Center in China (https:// www.ncc-cma.net/cn/) for the period of October 2013 to May 2014 based on the annual planting time. The parameters were as follows: average (TEM_Avg,°C), lowest (TEM_Min,°C) and highest daily temperatures (TEM_Max,°C); average daily relative humidity (RHU_Avg, %); average atmospheric pressure (PRS_Avg, hpa); total daily sunshine time (SSH, hour); and total daily precipitation (PRE_Time, mm). We used the monthly means as weather parameters in subsequent calculations ( Table S4).

Analysis of Alkaloids
The major active components of interest in the rhizome of C. yanhusuo are alkaloids. Tetrahydropalmatine is the most important alkaloid in C. yanhusuo, and three other alkaloids, i.e., protopine, palmatinem, and berberine, are frequently used to assess the medicinal quality of C. yanhusuo. Therefore, we determined the contents of these four alkaloids in this study. We mashed the dried rhizomes and passed them through a 40-mesh sieve. Next, 1.0 g of sample powder was ultrasonically extracted in 30 mL of 70% ethanol for 30 min and centrifuged for 1.5 min at 1000 r/min. The supernatant was filtered through a 0.45-mm membrane and used for high-performance liquid chromatography (HPLC) analysis. Chromatograms were generated on an Agilent 1100 series HPLC system (Agilent technologies, Palo Alto, CA USA). Chromatographic separation was performed on a Zorbax Extend C18 column using a gradient of A (acetonitrile) and B (0.06% triethylamine and 0.6% acetic acid), following the protocol of Zhao et al. (2013). After screening, a sufficiently large number of detectable peaks were visualized at a wavelength of 280 nm. The chromatographic peaks detected by HPLC were characterized by liquid chromatography-mass spectrometry. The mass spectra were acquired in the positive-ion mode using a Finnigan LCQDECA XP System (Thermo LC/MS Division, San Jose, CA, USA). The mass spectrometry detector (MSD) parameters were set according to Zhao et al. (2013). The contents of protopine, coptisine, tetrahydropalmatine, and palmatine were calculated using a calibration curve, which was plotted as peak area against the concentration determined from duplicate injections of six methanol-diluted working solutions of the reference compounds.

Data Analysis
We evaluated both genetic and epigenetic diversity by calculating Shannon's diversity index (H′) with GENALEX software (ver. 6.5; Peakall and Smouse, 2012). In addition, we used the "msap" package in R to calculate the frequency of each methylation type (Schulz et al., 2013).
Using the genetic data only, we performed the Bayesian clustering model implemented in STRUCTURE (ver. 2.3.4; Pritchard et al., 2000) to assess the genetic structure. This model infers population structure by clustering individual genotypes into a given number of populations (K) by minimizing deviation from Hardy-Weinberg equilibrium. The number of K was set to vary from 1 to 17, and 10 independent iterations were performed for each K. K was determined using the method of Evanno et al. (2005) based on the second-order rate of change of the likelihood (DK). Subsequently, we estimated genetic and epigenetic structure using principal coordinates analysis (PCoA) based on Nei's distance matrix by GENALEX software. In addition, neighbor-joining (NJ) analysis (bootstraps = 1,000) was performed based on Nei's distance using the PHYLIP Package (Felsenstein, 1993) for both AFLP and MSAP data. We calculated the pairwise epigenetic and genetic differentiation coefficient (F ST ) (Tables S5 and S6) and Nei's distance (Tables S7 and S8) with GENALEX software.
We further tested the assumption of population structure for both genetic and epigenetic loci using hierarchical analysis of molecular variance (AMOVA) to assess the overall differences among populations within natural compared to cultivated types (ARLEQUIN software ver. 3.5; Excoffier and Lischer, 2010). We also analyzed natural and cultivated populations separately to better understand how molecular variation was partitioned within and among populations of natural compared to cultivated plants. We tested significance with 10,000 permutations. We also calculated pairwise genetic differences among populations (F ST ) in ARLEQUIN.
To explore patterns of spatial autocorrelation in natural and cultivated populations, we used simple linear regression of pairwise comparisons to calculate the correlations between genetic and epigenetic differentiation (F ST /1-F ST ) and geographic distance (log-transformed) (Diniz-Filho et al., 2013) with the "lm" function in R. To estimate the relationship between genetic and epigenetic variation, we also analyzed the correlation between the pairwise genetic and epigenetic distances (F ST ) and between the respective Shannon's diversities (H') with the "lm" function in R using the following formula: lm (x~y) where x = F ST or H′ for AFLP, y = F ST or H′ for MSAP.
To test the relationship between genetic, epigenetic, environmental, and phenotypic variation, we used distancebased redundancy analysis (dbRDA) with the "capscale" function of the VEGAN package in R (Oksanen et al., 2019). First, we evaluated the correlation between genetic and epigenetic variation separately for cultivated and natural populations using the following formula: capscale (x~y) where x = the Euclidean distance matrix for MSAP as the dependent variable, and y = the first three principal component analysis (PCA) axes of AFLP (AFLP_PC1, AFLP_PC2, AFLP_PC3) as predictors of epigenetic variation. We derived the PCs using the "principal" function of the psych package in R to run PCA with the following formula: principal (r, nfactors = 3) where r = the binary character matrix for AFLP. Then, we tested and quantified the overall contribution of environmental variables to epigenetic, genetic, and phenotypic divergence among populations in three separate dbRDA for natural populations and 3 separate dbRDA for cultivated populations using the formula: capscale (x~y) where x = the Euclidean distance matrix for AFLP or MSAP or alkaloids, y = meteorological factors (TEM_Avg, TEM_Min, TEM_Max, RHU_Avg, PRS_Avg, SSH, PRE_Time). Our environmental conditions were the seven meteorological factors and phenotypes were the contents of four alkaloids. Finally, we tested the correlations between alkaloid content and genetic or epigenetic variation in separate dbRDA of cultivated populations only using the following formula: capscale (x~y) where x = the Euclidean distance matrix for AFLP or MSAP across the six cultivated populations, and y = the content of four alkaloids (tetrahydropalmatine, protopine, palmatinem, berberine). Because we did not collect meteorological data from the TT natural-population region, we excluded TT from dbRDA analysis. To further assess the significance of the associations, we conducted permutation tests using the "envfit" function of the VEGAN package in R with 999 permutations.

Genetic and Epigenetic Diversity
The AFLP analysis of 313 individuals yielded 515 genetic loci, and the MSAP analysis resulted in 1337 epigenetic loci. We found that the mean genetic diversity was slightly higher in natural than in cultivated populations (H′ AFLP : 0.3497 ± 0.0036 in natural compared to 0.3037 ± 0.0049 in cultivated; Table 1; Figure 2A), while epigenetic diversity was lower in natural populations compared to cultivated populations (H′ MSAP : 0.1088 ± 0.0020 compared to 0.1470 ± 0.0029; Table 1; Figure  2A). Compared to natural populations, the frequency of unmethylated loci (type 1) was significantly higher in cultivated populations ( Figure 2B; Table S1). The epigenetic diversities were generally lower than the genetic diversities in both the natural (H′ AFLP : 0.3497 ± 0.0036; H′ MSAP : 0.1088 ± 0.0020) and cultivated (H′ AFLP : 0.3037 ± 0.0049; H′ MSAP : 0.1470 ± 0.0029) populations (Table 1), but on average, epigenetic diversity was higher in cultivated populations. The mean pairwise genetic and epigenetic Nei's distances were higher in the natural than in the cultivated populations ( Figure 2C, Tables S7 and S8).

Genetic and Epigenetic Structure
Structure identified only two genetic groups: the DK criterion reached its nodal value for K = 2 ( Figure S1), suggesting that the uppermost level of the genetic structure has two distinct clusters. The results for K = 2 and K = 3 are presented to illustrate the formation of populations ( Figure 3A). At K = 2, all populations were divided into two clusters. Cluster I (green) was mainly present in the cultivated populations, whereas most individuals from the natural populations were assigned to cluster II (red), except for the ZJ population, which was assigned to both cluster I (0.54) and cluster II (0.46). At K = 3, cluster II was further separated into two clusters (red and blue), and the blue cluster was found in some individuals belonging to cultivated populations. The PCoA analysis showed that the first two principal coordinates account for 57.48% of the genetic variation of C. yanhusuo population, which mainly reflected the division between natural and cultivated populations, while the ZJ natural population nested within the cultivated cluster ( Figure 3B). In the PCoA analysis for epigenetic data, the first two principal coordinates account for 39.27% of the epigenetic variation of C. yanhusuo populations, and natural and cultivated populations formed two clusters apart from each other ( Figure  3C). In the NJ tree based on genetic data ( Figure 4A), the cultivated and natural populations were grouped into two distinct clades, except ZJ, which clustered within the cultivated populations. In the NJ tree based on epigenetic data ( Figure 4B), all of the natural populations formed a strongly supported clade (100% bootstrap support), whereas all cultivated populations clustered into the other clade (94% bootstrap support).
The results of the hierarchical AMOVA showed that 23.24% of the genetic variation and 20.89% of the epigenetic variation were explained by differences between cultivated and natural populations, while 25.12% of the genetic and 19.16% of the epigenetic variation was attributed to differences among populations within groups (Table 2A). Separate AMOVAs on natural and cultivated populations showed that more of the genetic variation (39.18%) than epigenetic variation (28.74%) was explained by differences among natural populations. On the contrary, less of the genetic variation (10.27%) in cultivated populations was explained by differences among populations then the amount of epigenetic variation (15.60%, Table 2B), indicating that most of the genetic and epigenetic variation in cultivated plants was found within each population. The genetic and epigenetic differences (F ST 0.103 and 0.156) among cultivated populations were lower than those among natural populations (F ST 0.392 and 0.287) ( Table 2). We did not detect a correlation between pairwise comparisons of genetic (H′ AFLP ) and epigenetic diversity (H′ MSAP ) in either cultivated or natural populations (cultivated: r = −0.148, P = 0.779; natural: r = −0.274, P = 0.416; Figure 5A). However, we found that the genetic and epigenetic pairwise F ST were significantly related in cultivated and natural populations (cultivated: r = 0.634, P = 0.011; natural: r = 0.641, P = 0.000; Figure 5B). We also found a spatial autocorrelation pattern for epigenetic variation in the cultivated populations (r = 0.541, P = 0.037; Figure 5C) but not in the natural populations (r = 0.054, P = 0.696; Figure 5D). Relationships among Epigenetic, Genetic,

Alkaloids, and Environmental Variation
We analyzed the relationship between genetic, epigenetic, and environmental variation by separate dbRDA. We found that genetic variation explained 12.60% and 23.02% of the epigenetic variation in the natural and cultivated populations, respectively ( Figures 6A, E). The environmental descriptors explained 33.41% of the genetic variation in natural populations, but only 12.87% of the genetic variation in cultivated populations ( Figures 6B, F). Similarly, environmental factors explained 22.68% and 17.43% of the epigenetic variation in the natural and cultivated populations ( Figures 6C, G). Both epigenetic and genetic variation were also correlated to almost every single climate variable in cultivated and natural populations ( Table S9).
We detected alkaloid content in cultivated populations and found that the contents of four alkaloids differed significantly among the six cultivated populations ( Table 3), even though genetic differentiation among the cultivated populations is limited. We further explore the relationships among epigenetic, genetic, and alkaloids variation by dbRDA, which suggested that alkaloid content was significantly correlated to both genetic (6.89%) and epigenetic (14.09%) variation in cultivated populations ( Figure S1). All four alkaloids were significantly correlated with epigenetic variation and three (except protopine) were correlated with genetic variation (Table 3). Additionally, the environmental descriptors explained 69.03% of the variation in alkaloid content, and five environmental factors were significantly correlated with the alkaloid variation (the exception being PRS_Avg and SSH) ( Figure 6D, Table S9).

DISCUSSION
In this study, we investigated genetic and epigenetic variation among all 11 known natural populations of C. yanhusuo and compared these patterns of variation to 6 major cultivated populations. Despite evidence for a single origin of the cultivated plants (ZJ population), genetic diversity was only slightly higher in natural populations (H′ AFLP : 0.3497 ± 0.0036) compared to cultivated populations (H′ AFLP : 0.3037 ± 0.0049). However, genetic diversity is more evenly represented among populations in cultivation: only about 10% of the genetic variation was explained by differences among populations in cultivation compared to 38% in wild populations. The lower genetic differentiation for the cultivated plants is consistent with our hypothesis based on the differences in reproductive patterns between natural and cultivated populations of C. yanhusuo. In the wild, natural C. yanhusuo is mainly outcrossed by bee (Maloof, 2000;Qiu et al., 2009), which limits long-distance gene flow among natural populations. On the other hand, cultivated populations originated from a single source and have been propagated asexually so we expected lower genetic variation within and among populations. The hierarchical analysis showed that 23% of the overall genetic variation was due to differences between cultivated and wild populations suggesting that a large portion of the genetic variation within the cultivated populations was not present in the natural populations. This unexplained genetic variation in cultivated plants could be partly due to somatic mutation since several studies have reported that high rates of somatic mutation may allow asexual species to maintain abundant genetic variation and adapt to changing environmental conditions (Lynch, 1984;Gill et al., 1995;Schoen and Schultz, 2019). The modular nature of clonal plants allows for detrimental mutations to be terminated in specific cell lines while benign or beneficial mutations can be maintained (reviewed in Schoen and Schultz, 2019). However, we cannot exclude the possibility that cultivars were also introduced from other areas. Understanding the sources of genetic variation in the cultivated populations will require more detailed genetic investigations. In contrast to the patterns of genetic variation, we detected that epigenetic variation among cultivated populations (H′ MSAP : 0.1470 ± 0.0029) exceeded that of natural populations (H′ MSAP : 0.1088 ± 0.0020). We predicted that epigenetic differentiation would also be higher in cultivated than natural populations but we found the opposite (F ST = 0.156 compared to F ST = 0.287). These results suggest higher levels of epigenetic diversity that are more evenly represented among cultivated populations. Therefore, it appears that cultivated populations have relatively less genetic than epigenetic differentiation, while the opposite is true for natural populations. Combined, these results suggest that processes involved in cultivation could have an influence on the diversity and distribution of genetic and epigenetic modifications.
Understanding the causes and consequences of natural epigenetic variation is not straightforward, considering that epigenetic variation is affected by many factors, including genetic and environmental variation (Richards et al., 2017). In our study, we showed that epigenetic variation was significantly correlated to genetic variation with pairwise F ST comparisons and a significant association in dbRDA. However, more of the epigenetic diversity was explained by genetic diversity in the cultivated (23.02%) than in the natural (12.60%) populations. We also found that more of the genetic (33.41% compared to 12.87%) and epigenetic (22.68% compared to 17.43%) variation was explained by climatic factors in natural populations than in cultivated populations of C. yanhusuo. This could be partly due to the greater number of natural populations exposed to a wider range of environments. On the other hand, epigenetic modification can mediate phenotypic plasticity (Herman et al., 2014;Herman and Sultan, 2016;Smith et al., 2016;Kilvitis et al., 2017;Richards et al., 2017;Banta and Richards, 2018), which can be important for response of domestication traits to novel or less than optimal conditions (Piperno, 2017), and impact which individuals are selected for cultivation. Selecting individuals that can tolerate a variety of new environments could be selecting for individuals with the ability to express more epigenetic variation (Herman et al., 2014;Herman and Sultan, 2016;Richards et al., 2017;Banta and Richards, 2018). In fact, Hauben et al. (2009) were able to artificially select on epigenetic variation to create developmental differences among individuals from an isogenic canola population. More recently, Shen et al. (2018) found 5,412 differentially methylated regions (DMRs) that corresponded with soybean domestication and improvement. Several DMRs were associated with genes involved in carbohydrate metabolism and were not associated with genetic variation, indicating that functional changes in methylation may be independent from genetic variation.
In our study, we showed that the alkaloid content varied among cultivated populations, and was more highly correlated yanhusuo using genetic data as predictor; (B) genetic, (C) epigenetic, and (D) alkaloids variation in cultivated populations of C. yanhusuo using environmental factors as predictor; (E) the epigenetic variation in natural populations of C. yanhusuo using genetic data as predictor; (F) genetic and (G) epigenetic variation in cultivated populations of C. yanhusuo using environmental factors as predictor. For genetic data used as predictors of epigenetic variation, the first three PCA axes of the AFLP datasets were adopted. Environmental data include average (TEM_Avg), lowest (TEM_Min), and highest daily temperatures (TEM_Max), average daily relative humidity (RHU_Avg), average air pressure (PRS_Avg), total daily sunshine time (SSH) and total daily precipitation (PRE_Time). The percentages on the axes indicated the proportion of eigenvalues for each constrained axe, and the percentage in the bottom right corner of the graph indicated the proportion of eigenvalues for all constrained axes.
with epigenetic (14.09%) than genetic (6.89%) variation. This could result from selection of genotypes with a greater potential for epigenetic diversity during the cultivation process, or from the clonal propagation which is thought to limit methylation reprogramming (Richards et al., 2012;Dodd and Douhovnikoff, 2016;Gonzaĺez et al., 2018). However, epigenetic variation was not as strongly explained by the climate factors we analyzed in the cultivated plants as in the natural plants. The increased epigenetic variation in cultivated plants could be a response to an unmeasured environmental factor or to the process of domestication. Climatic variation was the strongest predictor of alkaloid content (69%), suggesting processes other than DNA methylation are involved in translating environmental variation into phenotypic variation (e.g., processes that contribute to transcription variation; Sauvage et al., 2017;Liu et al., 2019), or that our methods have not captured the causal methylation patterns (Richards et al., 2017;Paun et al., 2019). Still, these results indicate that persistent epigenetic variation could be contributing to differences in phenotypes and genetic background is less important. However, our study cannot specifically address the possibility that the selected cultivars were those that were more prone to increase epigenetic variation or if the cultivars under selection have increased levels of these alkaloids compared to natural populations. Further studies are required to evaluate the contribution of genetic, epigenetic, and environmental variation to the production of these alkaloids, and response to environmental challenges in asexual crop plants (Verhoeven and Preite, 2014;Dodd and Douhovnikoff, 2016).

CONCLUSION
We used both natural and cultivated populations to identify slightly higher levels of genetic variation in natural than in cultivated populations and the opposite pattern for epigenetic variation. Epigenetic variation was significantly correlated to genetic variation but more strongly so in cultivated populations. We found higher epigenetic but lower genetic differentiation in cultivated populations while the reverse was true in natural populations. Such differences between cultivated and wild populations might be caused by the process of cultivation and/ or the different modes of reproductive between the two types of C. yanhusuo. Epigenetic variation explained more of the variation in alkaloid content than genetic variation, suggesting that epigenetic variation could be particularly important in the process of domestication and provide an important source of phenotypic variation in cultivated (asexual) C. yanhusuo.

DATA AVAILABILITY STATEMENT
The datasets generated for this study are available on request to the corresponding authors.

AUTHOR CONTRIBUTIONS
CC, ZS, and CF planned and designed the research. ZZ, YB, and HZ performed AFLP and MSAP experiments. YZ and JL performed alkaloids analysis. YC conducted fieldwork. CC and ZZ analyzed data. CC and CR wrote the manuscript.