Original Research ARTICLE
Efficient Identification of the Forest Tree Species in Aceraceae Using DNA Barcodes
- 1Key Laboratory of Resource Biology and Biotechnology in Western China, Ministry of Education, College of Life Sciences, Northwest University, Xi'an, China
- 2State Key Laboratory of Cotton Biology, Institute of Cotton Research, Chinese Academy of Agricultural Sciences, Anyang, China
Aceraceae is a large forest tree family that comprises many economically and ecologically important species. However, because interspecific and/or intraspecific morphological variations result from frequent interspecific hybridization and introgression, it is challenging for non-taxonomists to accurately recognize and identify the tree species in Aceraceae based on a traditional approach. DNA barcoding is a powerful tool that has been proposed to accurately distinguish between species. In this study, we assessed the effectiveness of three core standard markers (matK, rbcL and ITS) plus the chloroplast locus trnS-trnG as Aceraceae barcodes. A total of 231 sequences representing 85 species in this forest family were collected. Of these four barcode markers, the discrimination power was highest for the ITS (I) region (50%) and was progressively reduced in the other three chloroplast barcodes matK (M), trnS-trnG (T) and rbcL (R); the discrimination efficiency of the ITS marker was also greater than any two-locus combination of chloroplast barcodes. However, the combinations of ITS plus single or combined chloroplast barcodes could improve species resolution significantly; T+I (90.5% resolution) and R+M+T+I (90.5% resolution) differentiated the highest portion of species in Aceraceae. Our current results show that the nuclear ITS fragment represents a more promising DNA barcode marker than the maternally inherited chloroplast barcodes. The most efficient and economical method to identify tree species in Aceraceae among single or combined DNA barcodes is the combination of T+I (90.5% resolution).
The forest plant family Aceraceae is the largest family of broad-leaved deciduous trees in the Northern Hemisphere (Wolfe and Tanai, 1987). This forest family includes two genera, Acer L. and Dipteronia Oliv., with approximately 140 species. Maple species are widespread and play key roles in local forest ecosystems. In addition, because of their rapid growth rates, great value for landscaping and adaptability to various harsh environmental conditions, maples are widely cultivated and exploited (Saeki and Murakami, 2009). Maples have become a model organism due to their fascinating sex systems (e.g., duodichogamy, protandry, and protogyny) and ease of use in biological research (Shang et al., 2012). Simultaneously, wild maple species provide an important breeding resource for the future from an economic view point. However, the recognition and identification of wild maple species based on traditional morphological approaches is difficult due to their extensive interspecific and/or intraspecific hybridization and introgression (Li et al., 2006; Li, 2011), which causes transitional intraspecific morphological characteristics in Aceraceae. Nevertheless, the recently proposed DNA barcoding method is one possible way to resolve this problem (Hebert et al., 2003).
DNA barcoding aims to achieve rapid and accurate species recognition by sequencing a short DNA sequence or a few DNA regions (Hebert et al., 2003; Hebert and Gregory, 2005; Kress et al., 2005; Hollingsworth et al., 2009; Li et al., 2011). This technology was first developed to diagnose animal species, and “the mitochondrial gene cytochrome c oxidase I (COI) can be served as the core of a global bio-identification system for animals” (Hebert et al., 2003). Increasingly, studies have demonstrated that the COI gene fragment is highly efficient for discrimination among animal species, including amphibians (Vences et al., 2005), birds (Hebert et al., 2004), fish (Ward et al., 2009), and insects (Pons et al., 2006). In plants, however, frequent recombination and low mutation rates are the main restrictions in finding efficient mitochondrial barcode markers (Cho et al., 1998, 2004; Hollingsworth et al., 2011), and the search for suitable candidates has focused on chloroplast and nuclear DNA markers (Kress et al., 2005; CBOL Plant Working Group, 2009; Li et al., 2011; Yan et al., 2015), although it is not easy to amplify and sequence universal nuclear gene primers across different angiosperm taxa. Numerous studies have suggested that four standard barcodes, matK, rbcL, trnH-psbA, and ITS, should be used as core barcode markers for the molecular identification of land plants (Hollingsworth et al., 2011; Li et al., 2011). In addition to choosing DNA barcoding fragments, collecting many individuals from different populations within a species is also important for establishing a reference database for universal application (Bolson et al., 2015; Guo et al., 2016).
Maples are important ornamental and horticultural woody species in the Northern Hemisphere. They are widely distributed in the temperate and subtropical regions of northern Africa, eastern Asia, Europe, and eastern North America (van Gelderen et al., 1994). China is a modern center of geographic distribution and diversification because approximately 70% of the species in Aceraceae occur in this country. According to leaf shapes, inflorescence types and elongated winged fruits (Figure 1), different classification systems for Aceraceae have been proposed. Pax divided the genus Acer into 13 sections in his classification system of Acer (Pax, 1902). In Murray (1970), the genus Acer was classified into 24 sections. More recently, Xu (1996) recognized 23 sections and discussed the phylogenetic relationships among species of Aceraceae.
Figure 1. The variations and similarities in the morphology of maple species. a, Acer ginnala; b, A. negundo var. variegatum; c, A. davidii; d, A. negundo; e, A. truncatum; f, A. circinatum: from http://oregonstate.edu/trees/broadleaf_genera/species/maple_spp.html.
In recent years, the identification of closely related maple species has been a process of continuous development. Previous studies have demonstrated that restriction fragment length polymorphism (RFLP) and amplified fragment-length polymorphism (AFLP) markers are useful for differentiation of the sections Arguta, Cissifolia, Lithocarpa, and Spicata in Aceraceae (Hasebe et al., 1998). A phylogenetic study has shown that two chloroplast DNA loci, psbM-trnD and trnD-trnT, could be used to discriminate between the genera Acer and Dipteronia (Li et al., 2006), and studies based on the combined datasets of the nuclear ITS and the chloroplast noncoding trnL-trnF region (Tian et al., 2002) suggest that the genus Dipteronia might be paraphyletic with D. deyerana embedded in Acer. Furthermore, a recent phylogenetic reconstruction suggested that the ITS sequence in Aceraceae evolved rapidly, and this might lead to high species resolution in this forest family (Grimm et al., 2006).
However, many maple species have complex sex systems, and since seeds and pollen are dispersed by wind and birds, interspecific hybridization and introgression occur extensively between closely related species, causing transitional intraspecific morphological variations, such as similar shapes of leaves, inflorescences and samaras (Guo et al., 2014; Liu C. et al., 2014). Therefore, it is difficult to recognize and delimitate maple species accurately by available morphological characteristics (Tian et al., 2002). In the current study, we used four DNA candidate barcodes that had been proposed previously, three chloroplast markers (rbcL, matK, and trnS-trnG), and one nuclear ITS fragment, to differentiate 231 samples representing 85 Aceraceae species. Our primary objectives were to: (1) test the universality of the four DNA markers in Aceraceae; (2) examine the resolution of these four markers alone or in combination and evaluate the most appropriate marker among these barcodes to distinguish between species within the family Aceraceae; and (3) establish a reference database to facilitate future identification of forest trees.
Materials and Methods
A total of 231 sequences representing 85 species in Aceraceae were collected and analyzed. Hybridization and/or introgression are very common in Aceraceae, which may have affected the performance of the tested DNA barcodes. In the current study, we did not sample the complicated introgression species in our analysis of DNA barcoding. Xu (1996) included 22 sections in the genus Acer: Acer, Distyla, Ginnala, Glabra, Goniocarpa, Hyptiocarpa, Integrifolia, Macrantha, Macrophylla, Microcarpa, Palmata, Parviflora, Pentanphylla, Platanoidea, Trifoliata, Rubra, Saccarodendron, Lithocarpa, Carpinifolia, Arguta, Cissifolia, and Negundo. We sampled the leaves of 85 trees representing 21 maple species from Shaanxi, Gansu, Taiwan, and Yunnan, in China (Table 1). In total, 2–5 individuals per species were sampled from 1 to 4 populations in the field. Fresh leaves were dried and stored in silica gel, and the longitude, latitude and altitude of each collection site (population) were recorded using a GIS unit (Garmin, Taiwan). The voucher specimens were stored in the Key Laboratory of Resource Biology and Biotechnology in Western China (KLRBBWC), Northwest University. To thoroughly assess the efficiency of species identification by single or combined DNA barcodes, we also downloaded 108, 81 and 146 sequences of rbcL, matK and ITS fragments that represented 45, 33, and 47 Aceraceae species, respectively, from GenBank (Table S1).
DNA Extraction, PCR Amplification, and Sequencing
Total genomic DNA of the collected samples was extracted from approximately 20 mg of silica-dried leaves using a modified cetyltrimethylammonium bromide (CTAB) method (Doyle, 1987). Amplification of DNA regions was performed by standard polymerase chain reaction (PCR). The primer sequences and thermocycling conditions of PCR amplification are listed in Table 2. The PCR reactions were conducted in a 25 μL mixture system containing 12.5 μL 2 × Taq PCR mix (Runde, Xi'an, China), 1.0 μL of each primer (5 μmol/L), 10.5 μL ddH2O, and 1.0 μL template DNA (30–50 ng).
All sequence assemblies and adjustment were performed using MEGA software version 6.0 (Tamura et al., 2013). For the alignments of each gene and the combinations of multiple loci, the parameters were: (1) multiple alignment, gap opening penalty 15, gap extension penalty 6.66; 2) DNA weight matrix IUB; (3) transition weight 0.5; and (4) delay divergent cutoff 30%. In total, for the rbcL (R) region, 193 sequences represented 66 species; for matK (M), 166 sequences represented 54 species; for trnS-trnG (T), 85 sequences represented 21 species; for ITS (I), 231 sequences represented 68 species; for the R+M regions, 134 sequences represented 47 species; for the M+T, R+T, T+I, R+M+T, and R+M+T+I regions, 85 sequences represented 21 species; for the R+I regions, 121 sequences represented 46 species; and for the M+I regions, 119 sequences represented 41 species (Table S1; Supplementary Material Data Sheet). In particular, insertions/deletions (indels) and single nucleotide polymorphisms (SNPs) were calculated using DnaSP version 5.10.01 (Librado and Rozas, 2009). To evaluate species discrimination success, we applied two different methods of analysis for single markers and all possible combinations of markers.
The barcoding gap was a significant factor used to test for appropriate barcode markers, which show high interspecific, but low intraspecific genetic divergence. The CBOL Plant Working Group (2009) recommended the PWG-Distance method to calculate distances from pairwise alignments by counting unambiguous base substitutions, and pairwise nucleotide genetic distances were based on the Kimura 2-parameter (K2P) nucleotide evolutionary model obtained from the MEGA 6.0 program (Tamura et al., 2013). We suggest that successful species discrimination was confirmed if the minimum uncorrected interspecific p-distance involving a species was larger than its maximum intraspecific distance (CBOL Plant Working Group, 2009).
Tree-Building is a method that uses the program MEGA 6.0 (Tamura et al., 2013) to construct a Neighbor-Joining (NJ) tree for each single marker and the different combinations of barcode markers under the Kimura 2-parameter substitution model (Hall, 2013). The bootstrap support of the NJ tree was assessed using 1000 replicates. The sampled species was considered successfully discriminated if all individuals of a species formed a monophyletic group in the phylogenetic tree (Hollingsworth et al., 2009). The ratio of successfully identified species to all sampled species was calculated as the proportion of species that were discriminated.
PCR amplification succeeded for all DNA samples for the four DNA barcoding regions, ITS, matK, rbcL, and trnS-trnG. However, there were two difficulties in sequencing and alignment. First, problems were encountered in the sequencing of the trnS-trnG region, where a mono-nucleotide repeat (poly-A) fragment in the middle of this marker led to sequencing failure for the latter half of this region in most of the individuals. Therefore, this relatively short chloroplast region was sequenced with both forward and reverse primers and then assembled. Second, several sequences of ITS were extremely hard to align due to its considerable number of variable sites, which required us to carefully check and manually edit the sequences. Eventually, the four regions were successfully amplified and sequenced in 21 sampled species. Eighty-five new Aceraceae sequences were obtained (KX264925-KX264991), and we also downloaded 45 other sequences of Aceraceae species from GenBank (Table S1). In total, 231 sequences of 85 species in Aceraceae were analyzed. The aligned lengths of the four DNA barcodes ranged from 346 bp (ITS) to 636 bp (matK), and mutation sites (SNPs) were lowest for rbcL (27) and highest for ITS (137). Compared to the other markers, rbcL was the most highly conserved, with the fewest mutation sites and indels (0). A comparison of four single barcoding regions and one combination of all chloroplast regions is shown in Table 3.
Interspecific and Intraspecific Variability
We investigated variability of the four DNA markers for all Aceraceae species, and all DNA regions showed higher genetic variability between than within species (Table 3). The nuclear ITS region showed the highest interspecific sequence divergence (4.74%), followed by trnS-trnG (2.68%), matK (0.86%) and rbcL (0.44%). In addition, the intraspecific sequence divergence was also higher for ITS (0.62%) and lower for trnS-trnG (0%) between all the detected single-locus barcodes. The barcoding gap between interspecific and intraspecific distances was graphed based on the K2P model for each candidate barcode marker and the combinations of markers (Figure 2). Our results demonstrated that the combination of all four markers (rbcL+matK+trnS-trnG+ITS, RMTI) showed the highest interspecific gap (divergence), and light overlaps without distinct gaps were found for single markers and/or combinations of the candidate loci (Figure 2).
Figure 2. Histograms of the frequencies (y-axes) of pairwise intraspecific (black bars) and interspecific (gray bars) divergences based on the K2P distance (x-axes) for individual and combined rbcL (R), matK (M), trnS-trnG (T), and ITS (I) markers.
To evaluate the species discrimination power of the four single barcoding markers and the eight combinations of two or more markers, two different analyses were used. In the Tree-Building and PWG-Distance analyses, there were consistent results in the species discrimination power for all markers (Figure 3). Generally, species forming separate clusters in the tree with bootstrap support >50% were considered to be discriminated. In the single chloroplast marker analysis, matK had a relatively higher percentage of successful species discrimination (33.33%), followed by trnS-trnG (23.81%) and rbcL (8.11%). The percentage of species discrimination ranged from 23.81% (R+T) to 49.14% (M+T) for combinations of two chloroplast markers, and R+M provided successful species discrimination of 38.71%. Furthermore, the discrimination rate was 57.14% for combinations of three chloroplast markers (R+M+T) (Figure 3). The combinations of ITS plus one or three chloroplast barcodes significantly improved the species resolution (Figures 4, 5). The combinations T+I (90.5% resolution) and R+M+T+I (90.5% resolution) showed the highest percentage of successful species identification in the sampled Aceraceae species (Figure 3).
Figure 3. Species discrimination rate of all tested single- and multi-locus barcodes in Aceraceae. R, rbcL; M, matK; T, trnS-trnG; I, ITS.
Figure 4. Neighbor-Joining (NJ) tree of 21 Aceraceae species based on the combination of all three chloroplast (rbcL, matK, and trnS-trnG) regions.
Figure 5. Neighbor-Joining (NJ) tree of 21 Aceraceae species based on the combination of all single barcodes (rbcL, matK, trnS-trnG, and ITS).
In addition, we evaluated the species discrimination power at the section level within Aceraceae (Figure S1). In the analysis of single barcode markers, the nuclear ITS fragment provided the highest resolution power (71.43%), followed by the three chloroplast DNA barcodes matK (42.42%), trnS-trnG (37.5%) and rbcL (10.26%). Combinations of ITS plus one or three chloroplast DNA markers significantly improved species discrimination. The discrimination rate was 89.47% for the combination M+I. The combination R+M+T+I (90%) showed the highest percent identification at the section level within Aceraceae.
Evaluation of Barcode Power in Aceraceae
According to the CBOL Plant Working Group (2009), three main criteria should be satisfied in evaluating markers for plant DNA barcoding: (i) the presence of sufficient flanking sequences for developing appropriate universal barcoding primer markers, (ii) a relatively short nucleotide sequence to facilitate PCR amplification and sequencing, and (iii) significant genetic divergence at the species level. In our study, all samples of Aceraceae were successfully amplified and sequenced for the four DNA barcode candidates, with the exception of the slightly short sequences of ITS, which were manually edited. This result indicates a high universality for the DNA markers that we used here.
An ideal DNA barcode could provide significant species discrimination and identification (Feng et al., 2014; Gong et al., 2016; Nigro et al., 2016). We used two different analyses to evaluate the discrimination rates between species and sections within Aceraceae. The Tree-Building method is an analytical method used to generate a graphical representation of the results, which is useful for determining the power of a given combination of markers to discriminate between closely related species (Zhang et al., 2013). In addition, we used the PWG-Distance method to compute intra- and interspecific pairwise distances based on the Kimura two-parameter (K2P) model. Using these two approaches, we obtained similar results for species and section identification in Aceraceae.
Low Efficacy of the Three Chloroplast DNA Markers
In the current study, the chloroplast barcoding markers or different combinations of the chloroplast barcoding markers (2-loci) had low identification power in the sampled Aceraceae species (Table 3). Among the single chloroplast markers, for two coding regions, rbcL had lower interspecific variation (0.0044) and species resolution (10.26%), but matK had low interspecific variation (0.0086) and relatively high species resolution (42.42%). This result was similar to previous results, which showed very low divergence and low power to discriminate between species (Feng et al., 2013) for the rbcL region. These results imply that the two coding spacers, rbcL and matK, are unsuitable for distinguishing the forest species in Aceraceae. The non-coding spacer trnS-trnG provided more interspecific distance (0.0268) than the two coding fragments (rbcL and matK), but trnS-trnG provided lower species discrimination (33.3%). However, the debate regarding the use of non-coding regions in DNA barcoding is ongoing due to unreliable alignment of sequences with very high variation (Särkinen and George, 2013). In addition, some authors suggest that indel regions of non-coding DNA sequences provide important phylogenetic information and species discrimination power for some of the group, e.g., Taxus (Liu et al., 2011).
Recently, some studies have found that combinations of barcoding markers improved species delimitation power more than single candidate DNA barcodes (Feng et al., 2013; Zhang et al., 2014; Yan et al., 2015). The 2-locus combination of matK+rbcL has been officially proposed as the core barcode for land plants (Clement and Donoghue, 2012; Wirta et al., 2016) based on the straightforward recovery of the rbcL region and the discriminatory power of the matK region (CBOL Plant Working Group, 2009). In our study, this combination resulted in successful identification 38.71% of the time, which was higher than the combination of rbcL+trnS-trnG (23.81%) but lower than matK+trnS-trnG (49.14%). From the above results, the core barcode region (matK+rbcL) had higher species resolution than all single chloroplast barcoding markers. However, the identification power of the core barcode region is still lower than that of the ITS fragment. Our results indicate that the main barcode marker suggested by the CBOL Plant Working Group (2009) is not suitable for molecular identification of Aceraceae species and needs further investigation.
The Nuclear ITS Region is a more Promising Barcode in Aceraceae
In our study, the ITS marker showed the highest interspecific and intraspecific divergences (0.0474 and 0.0062, respectively) and the highest species resolution (50%) of all single DNA barcode candidates. This high resolution was also found in previous research (e.g., Li et al., 2011; Liu et al., 2016; Wirta et al., 2016), for example, in Rhododendron (Yan et al., 2015), Venus Slipper (Guo et al., 2016) and in Rosewood (Hartvig et al., 2015). These previous results showed that the ITS marker performed well as a single barcode for species identification, although the ITS marker was considered to have drawbacks, including incomplete lineage sorting and homogeneous concerted evolution (Liu et al., 2011, 2016; Wang et al., 2011; Wirta et al., 2016). However, in a recent study, Li et al. (2011) found that the nuclear ITS barcode had high species discrimination power, and they proposed that it should be incorporated into the core barcode marker for land plants.
In this study, the nuclear ITS fragment was shorter and had a higher evolutionary rate than the chloroplast DNA regions (ITS: 61.7, 137 mutation sites out of 346 bp; three chloroplast markers combined: 71.4%, 93 mutation sites out of 1499 bp; Table 3). Compared to the mode of uniparental inheritance of chloroplast DNA regions, the nuclear ITS region was a biparentally inherited marker, and its genetic information was transferred by pollen and seed. Thus, the ITS fragment has a larger effective population size than the chloroplast DNA regions, which makes the ITS marker is not easily quicker to complete lineage sorting than the cpDNA markers (Wachowiak et al., 2011). However, the nuclear ITS marker showed a higher evolutionary rate and higher inter-species differentiation than the chloroplast regions, and the nuclear ITS region in plants comprises multiple (reiterated) copies and usually undergoes concerted evolution (Alvarez and Wendel, 2003; Liu et al., 2016). During this process, different copies become homogenized to the same sequence type (becoming almost identical types) as a result of mechanisms such as high-frequency unequal crossing over or gene conversion (Alvarez and Wendel, 2003; Hartvig et al., 2015; Wirta et al., 2016). Therefore, this region may have accumulated more interspecific differentiation, comparable even to that experienced by speciation genes (Alvarez and Wendel, 2003; Guo et al., 2016; Wirta et al., 2016). Considering all characteristics of the nuclear ITS fragment, such as high mutation rate and rapidly concerted evolution, this nuclear gene fragment facilitates higher species discrimination power than chloroplast DNA regions in Aceraceae.
A Combined Nuclear and Chloroplast DNA Barcode Is an Effective Method of Species Delimitation of Aceraceae
In the present study, we found that among sampled Aceraceae species, using a combination of the ITS fragment and a single chloroplast region usually significantly improved the discrimination efficiency (Figure 3), T+I (90.5% resolution) and R+M+T+I (90.5% resolution) differentiated the highest portion of species in Aceraceae (Figures 4, 5). This is in agreement with previous works on DNA barcoding of angiosperm taxa, e.g., Alnus (Betulaceae; Ren et al., 2010) and Lamium (Lamiaceae; Krawczyk et al., 2014). This result may have been caused by the uniparentally inherited chloroplast DNA fragment and the biparentally inherited nuclear ITS marker having different evolutionary modes and tracking different evolutionary histories. A combination of these markers improves species identification power and strengthens our understanding of evolutionary dynamics in plants (Saeki et al., 2011). However, in the present study, for the analysis of the discrimination rate of DNA barcoding combinations T+I and R+M+T+I, the number of sampled Aceraceae species was only 21, and the small number of collected individuals and species may have caused the high estimate of species resolution. In addition, we did not exclude the effect of biological characters in Aceraceae, e.g., complex sex systems and the dispersal of shorter distances among individuals within species, and these may have caused the high level of species identification (Liao et al., 2010; Zhang et al., 2010; Saeki et al., 2011). More samples and species should be collected for further assessment of phylogenetic relationships and the performance of DNA barcoding in the future.
Implications for the Phylogenetic Relationships of Aceraceae Species
Phylogenetic identification and recognition of species are the keystones of biology (Moritz, 1994; Wirta et al., 2016). In the past, biologists identified phylogenetic relationships and positions of biological specimens mainly based on morphological features, such as leaves, flower shape, size, and color. However, traditional taxonomy has at least two drawbacks: first, if the specimens are damaged or lack sufficient diagnostic characters, the specialists may be unable to make accurate identifications (Yan et al., 2015); second, morphologically and taxonomically difficult groups sometimes require an experienced professional taxonomist to address species identification (Li et al., 2011; Liu J. et al., 2014). Such taxonomists need training and experience, which require much time, effort and money. With the development of molecular biology, the rise of DNA barcoding is expected to mitigate this dilemma, at least in part. DNA barcoding is one method that uses standard molecular techniques to achieve rapid and accurate species identification by a short DNA sequence (Hebert et al., 2003; Hebert and Gregory, 2005; Liu et al., 2013; Krawczyk et al., 2014). This method to remedy the limitations of a traditional morphology-based identification system will allow for more rapid progress in traditional taxonomic work (Kress et al., 2005; Liu J. et al., 2014; Nigro et al., 2016).
In the present study, we constructed the phylogenetic tree of the family Aceraceae by combining different DNA barcoding markers (Figures 4, 5). For the combination of all four DNA fragments (R+M+T+I), greater resolution in the NJ tree was achieved with higher bootstrap supports than with the combination of three chloroplast DNA markers (R+M+T). In the phylogenetic analyses, Aceraceae formed a monophyletic group with moderate bootstrap support. Many sections were resolved as monophyletic, with the exception of sect. Trifoliata. The monophyly formed by sect. Palmata and sect. Microcarpa was strongly supported with a high bootstrap value (100%). These two sections share many morphological characters, such as 4-paired bud scales, typically palmate leaves, and corymbose inflorescences. de Jong (1994) and Ogata (1967) combined them into one section, sect. Palmata. This result was confirmed by a previous study of nucleotide variations of chloroplast trnL-trnF and nuclear ITS regions (Tian et al., 2002). In addition, the close relationship between sect. Trifolia and sect. Integrifolia was supported with a high bootstrap value (99%) in the NJ tree. However, some species of the sections Cissifolia, Negundo, Trifoliata did not cluster together; A. henryi of section Cissifolia was closely related to A. mandshuricum subsp. kansuense of section Trifoliata, and the two species formed a clade with high bootstrap support, and A. negundo of section Negundo and A. tetramerum Pax var. betulifolium of section Arguta were clustered together (Figures 4, 5). We speculate that the hybridization and/or introgression among species from different sections may have caused the non-monophyletic clade within Aceraceae. Frequent inter-hybridization has been reported in Aceraceae (Liao et al., 2010; Zhang et al., 2010; Saeki et al., 2011). These characteristics might have caused the low resolution at the section level (Figure S1). We found the resolutions of the combinations M+I (89.47%) and R+M+T+I (90%) at the section level were lower than at the species level (M+ I: 90.5%, R+M+T+I: 90.5%).
In conclusion, we support the use of the ITS marker as a supplementary barcode in plants, while the performance of ITS should be evaluated in extensive trials in different plant groups (Liu et al., 2013; Liu J. et al., 2014; Wirta et al., 2016). The pragmatic solution to a complex trade-off between universality, sequence quality, discrimination, and cost, is the combination of T+I (90.5% species resolution), the most efficient and economical marker among single or combined DNA barcodes when identifying tree species in Aceraceae. More samples and species should be collected for further assessment of phylogenetic relationships and the performance of DNA barcoding.
ZHL conceived and designed the research. YH, DD performed the experiments. YJ, ZHL, GZ, YH and ZL contributed materials/analysis tools. YH, ZHL wrote the paper. DD, XM and ZHL revised the paper. All authors read and approved the final manuscript.
Conflict of Interest Statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
We are very grateful to the editor Tian Tang, and Dr. Kangshan Mao for their valuable comments and suggestions. The present work was supported by the National Natural Science Foundation of China (31470400, J1210063), the National Key R & D Program for Crop Breeding (2016YFD0100300) and the Program for Changjiang Scholars and Innovative Research Team in University (PCSIRT, No. IRT1174).
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/article/10.3389/fpls.2016.01707/full#supplementary-material
Figure S1. Species discrimination rate of all tested single- and multi-locus barcodes at the section level in Aceraceae. R, rbcL; M, matK; T, trnS-trnG; I, ITS.
Table S1. Samples for testing potential barcodes and accession numbers in GenBank.
Supplementary Material Data Sheet. Alignments of DNA sequences of all tested 12 single- and multi-locus barcodes. Sheet1, ITS; Sheet2, matK+ITS (M+I); Sheet3, matK+trnS-trnG (M+T); Sheet4, matK; Sheet5, rbcL+ ITS (R+I); Sheet6, rbcL+ matK (R+M); Sheet7, rbcL+matK+trnS-trnG (R+M+T); Sheet8, rbcL+matK+trnS-trnG+ITS (R+M+T+I); Sheet9, rbcL++trnS-trnG (R +T); Sheet10, rbcL; Sheet11, trnS-trnG+ITS (T+I); Sheet12, trnS-trnG.
Bolson, M., Smidt, E. C., Brotto, M. L., and Pereira, V. S. (2015). ITS and trnH-psbA as efficient DNA barcodes to identify threatened commercial woody angiosperms from southern Brazilian Atlantic rainforests. PLoS ONE 10:e0143049. doi: 10.1371/journal.pone.0143049
Cho, Y., Mower, J. P., Qiu, Y. L., and Palmer, J. D. (2004). Mitochondrial substitution rates are extraordinarily elevated and variable in a genus of flowering plants. Proc. Natl. Acad. Sci. U.S.A. 101, 17741–17746. doi: 10.1073/pnas.0408302101
Feng, S., Jiang, Y., Wang, S., Jiang, M., Chen, Z., Ying, Q., et al. (2014). Molecular identification of Dendrobium Species (Orchidaceae) based on the DNA barcode ITS2 region and its application for phylogenetic study. Int. J. Mol. Sci. 16, 21975–21988. doi: 10.3390/ijms160921975
Gong, W., Liu, Y., Chen, J., Hong, Y., and Kong, H. H. (2016). DNA barcodes identify Chinese medicinal plants and detect geographical patterns of Sinosenecio (Asteraceae). J. Syst. Evol. 54, 83–91. doi: 10.1111/jse.12166
Grimm, G. W., Renner, S. S., Stamatakis, A., and Hemleben, V. (2006). A nuclear ribosomal DNA phylogeny of Acer inferred with maximum likelihood, splits graphs, and motif analysis of 606 sequences. Evol. Bioinform. 2, 7–22.
Guo, X. D., Wang, H. F., Bao, L., Wang, T. M., Bai, W. N., Ye, J. W., et al. (2014). Evolutionary history of a widespread tree species Acer mono in East Asia. Ecol. Evol. 4, 4332–4345. doi: 10.1002/ece3.1278
Hartvig, I., Czako, M., Kjær, E. D., Nielsen, L. R., and Theilade, I. (2015). The use of DNA barcoding in identification and conservation of rosewood (Dalbergia spp.). PLoS ONE 10:e0138231. doi: 10.1371/journal.pone.0138231
Hasebe, M., Ando, T., and Iwatsuki, K. (1998). Intrageneric relationships of maple trees based on the chloroplast DNA restriction fragment length polymorphisms. J. Plant Res. 111, 441–451. doi: 10.1007/BF02507809
Hollingsworth, M. L., Clark, A. A., Forrest, L. L., Richardson, J., Pennington, R. T., Long, D. G., et al. (2009). Selecting barcoding loci for plants: evaluation of seven candidate loci with species-level sampling in three divergent groups of land plants. Mol. Ecol. Resour. 9, 439–457. doi: 10.1111/j.1755-0998.2008.02439.x
Krawczyk, K., Szczecińska, M., and Sawicki, J. (2014). Evaluation of 11 single-locus and seven multilocus DNA barcodes in Lamium L. (Lamiaceae). Mol. Ecol. Resour. 14, 272–285. doi: 10.1111/1755-0998.12175
Kress, W. J., Wurdack, K. J., Zimmer, E. A., Weigt, L. A., and Janzen, D. H. (2005). Use of DNA barcodes to identify flowering plants. Proc. Natl. Acad. Sci. U.S.A. 102, 8369–8374. doi: 10.1073/pnas.0503123102
Li, D. Z., Gao, L. M., Li, H. T., Wang, H., Ge, X. J., Liu, J. Q., et al. (2011). Comparative analysis of a large dataset indicates that internal transcribed spacer (ITS) should be incorporated into the core barcode for seed plants. Proc. Natl. Acad. Sci. U.S.A. 108, 19641–19646. doi: 10.1073/pnas.1104551108
Li, J. (2011). Phylogenetic evaluation of series delimitations in section Palmata (Acer, Aceroideae, Sapindaceae) based on sequences of nuclear and chloroplast genes. Aliso 29, 43–49. doi: 10.5642/aliso.20112901.05
Li, J. H., Yue, J. P., and Shoup, S. (2006). Phylogenetics of Acer (Aceroideae, Sapindaceae) based on nucleotide sequences of two chloroplast non-coding regions. Harv. Pap. Bot. 11, 101–115. doi: 10.3100/1043-4534
Liao, P. C., Shih, H. C., Yen, T. B., Lu, S. Y., Cheng, Y. P., and Chiang, Y. C. (2010). Molecular evaluation of interspecific hybrids between Acer albopurpurascens and A. buergerianum var. formosanum. Bot. Stud. 51, 413–420.
Liu, C., Tsuda, Y., Shen, H., Hu, L. J., Saito, Y., and Ide, Y. (2014). Genetic structure and hierarchical population divergence history of Acer mono var. mono in south and northeast China. PloS ONE 9:e87187. doi: 10.1371/journal.pone.0087187
Liu, J., Möller, M., Gao, L. M., Zhang, D. Q., and Li, D. Z. (2011). DNA barcoding for the discrimination of Eurasian yews (Taxus L., Taxaceae) and the discovery of cryptic species. Mol. Ecol. Resour. 11, 89–100. doi: 10.1111/j.1755-0998.2010.02907.x
Liu, J., Shi, L., Han, J., Geng, L., Li, G., Lu, H., et al. (2014). Identification of species in the angiosperm family Apiaceae using DNA barcodes. Mol. Ecol. Resour. 14, 1231–1238. doi: 10.1111/1755-0998.12262
Liu, Z. W., Zhao, Q. R., and Zhou, J. (2013). A test of four candidate barcoding markers for the identification of geographically widespread Chimaphila species (Pyroleae, Ericaceae). Acta Bot. Gallica 160, 11–17. doi: 10.1080/12538078.2013.773462
Nigro, L. M., Angel, M. V., Blachowiak-Samolyk, K., Hopcroft, R. R., and Bucklin, A. (2016). Identification, discrimination, and discovery of species of marine planktonic ostracods using DNA barcodes. PloS ONE 11:e0146327. doi: 10.1371/journal.pone.0146327
Pons, J., Barraclough, T. G., Gomez-Zurita, J., Cardoso, A., Duran, D. P., Hazell, S., et al. (2006). Sequence-based species delimitation for the DNA taxonomy of undescribed insects. Syst. Biol. 55, 595–609. doi: 10.1080/10635150600852011
Ren, B. Q., Xiang, X. G., and Chen, Z. D. (2010). Species identification of Alnus (Betulaceae) using nrDNA and cpDNA genetic markers. Mol. Ecol. Resour. 10, 594–605. doi: 10.1111/j.1755-0998.2009.02815.x
Saeki, I., Dick, C. W., Barnes, B. V., and Murakami, N. (2011). Comparative phylogeography of red maple (Acer rubrum L.) and silver maple (Acer saccharinum L.): impacts of habitat specialization, hybridization and glacial history. J. Biogeogr. 38, 992–1005. doi: 10.1111/j.1365-2699.2010.02462.x
Saeki, I., and Murakami, N. (2009). Chloroplast DNA phylogeography of the endangered Japanese red maple (Acer pycnanthum): the spatial configuration of wetlands shapes genetic diversity. Divers. Distrib. 15, 917–927. doi: 10.1111/j.1472-4642.2009.00609.x
Shang, H., Luo, Y. B., and Bai, W. N. (2012). Influence of asymmetrical mating patterns and male reproductive success on the maintenance of sexual polymorphism in Acer pictum subsp. mono (Aceraceae). Mol. Ecol. 21, 3869–3878. doi: 10.1111/j.1365-294X.2012.05555.x
Vences, M., Thomas, M., Bonett, R. M., and Vieites, D. R. (2005). Deciphering amphibian diversity through DNA barcoding: chances and challenges. Phil. Trans. R. Soc. B. 360, 1859–1868. doi: 10.1098/rstb.2005.1717
Wachowiak, W., Palme, A. E., and Savolainen, O. (2011). Speciation history of three closely related pines Pinus mugo (T.), P. uliginosa (N.) and P. sylvestris (L.). Mol. Ecol. 20, 1729–1743. doi: 10.1111/j.1365-294X.2011.05037.x
Wang, Q., Yu, Q. S., and Liu, J. Q. (2011). Are nuclear loci ideal for barcoding plants? A case study of genetic delimitation of two sister species using multiple loci and multiple intraspecitic individuals. J. Syst. Evol. 49, 182–188. doi: 10.1111/j.1759-6831.2011.00135.x
Wirta, H., Várkonyi, G., Rasmussen, C., Kaartinen, R., Schmidt, N. M., Hebert, P. D. N., et al. (2016). Establishing a community-wide DNA barcode library as a new tool for arctic research. Mol. Ecol. Resour. 16, 809–822. doi: 10.1111/1755-0998.12489
Yan, L. J., Liu, J., Möller, M., Zhang, L., Zhang, X. M., Li, D. Z., et al. (2015). DNA barcoding of Rhododendron (Ericaceae), the largest Chinese plant genus in biodiversity hotspots of the Himalaya-Hengduan Mountains. Mol. Ecol. Resour. 15, 932–944. doi: 10.1111/1755-0998.12353
Zhang, C. Y., Wang, F. Y., Yan, H. F., Hao, G., Hu, C. M., and Ge, X. J. (2013). Testing DNA barcoding in closely related groups of Lysimachia L. (Myrsinaceae). Mol. Ecol. Resour. 12, 98–108. doi: 10.1111/j.1755-0998.2011.03076.x
Zhang, J., Chen, M., Dong, X., Lin, R., Fan, J., and Chen, Z. (2014). Evaluation of four commonly used DNA barcoding loci for Chinese medicinal plants of the family Schisandraceae. PloS ONE 10:e0125574. doi: 10.1371/journal.pone.0125574
Keywords: Aceraceae, chloroplast DNA, barcoding markers, species identification, ITS
Citation: Han Y-W, Duan D, Ma X-F, Jia Y, Liu Z-L, Zhao G-F and Li Z-H (2016) Efficient Identification of the Forest Tree Species in Aceraceae Using DNA Barcodes. Front. Plant Sci. 7:1707. doi: 10.3389/fpls.2016.01707
Received: 14 June 2016; Accepted: 31 October 2016;
Published: 16 November 2016.
Edited by:Tian Tang, Sun Yat-sen University, China
Reviewed by:Xue-jun Ge, South China Institute of Botany (Chinese Academy of Science), China
Jie Liu, Kunming Institute of Botany, China
Copyright © 2016 Han, Duan, Ma, Jia, Liu, Zhao and Li. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Zhong-Hu Li, email@example.com
†These authors have contributed equally to this work.