ORIGINAL RESEARCH article

Front. Plant Sci., 12 May 2023

Sec. Functional and Applied Plant Genomics

Volume 14 - 2023 | https://doi.org/10.3389/fpls.2023.1149682

Genetic and morphological characterization of United States tea (Camellia sinensis): insights into crop history, breeding strategies, and regional adaptability

  • 1. Horticultural Sciences Department, University of Florida, Gainesville, FL, United States

  • 2. Plant Pathology Department, University of Florida, Gainesville, FL, United States

Article metrics

View details

5

Citations

6,1k

Views

1,4k

Downloads

Abstract

Multiple introductions of tea (Camellia sinensis) to the United States since the 1850s have resulted in US tea germplasm that are currently poorly characterized. To resolve questions concerning the relatedness and regional adaptability of US tea germplasm, 32 domestic individuals were evaluated using 10 InDel markers, and compared with a background population of 30 named and registered Chinese varieties of tea. The marker data were analyzed via a neighbor-joining cladistic tree derived from Nei’s genetic distance, STRUCTURE, and Discriminant Analysis of Principal Components, which revealed four genetic groups. Nineteen individuals selected from the four groups were assessed for seven leaf traits, two floral descriptors, and leaf yield, to identify plants best adapted to Florida field conditions. Our analyses compared with available historical records led us to estimate the most likely provenance of some of the US individuals, to precisely identify tea plant material and to choose most diverse accessions for breeding tea improved for adaptability, yield and quality.

1 Introduction

Tea (Camellia sinensis) is one of the most-consumed beverages globally. Its popularity is due to both cultural precedent and documented health benefits (Chen and Lin, 2015). Despite a growing market for tea in the United States (US), almost all tea sold in the US is imported (FAO, 2018; Zhang et al., 2020). A few small (<100 acre) tea growing operations exist in the Southeast and Pacific Northwest regions, with only one extant large-scale (>100 acre) tea farm operating in coastal South Carolina. Currently, the market for loose leaf and specialty tea is growing (Goggi, 2022). At the same time, the number of small-scale tea farms in the Southeastern US, which would be more likely to supply this market than the commodity tea market, is growing as well (Price, 2021). The nursery stock for tea that is domestically available in the US has a complex history (D’Auria et al., 2022) which includes almost two hundred years of interbreeding among historical stocks and two introductions of new cohorts of germplasm in the past sixty years (Walcott, 1999). Available germplasm is not well-characterized, presenting difficulties both for farmers attempting to choose the best plants for their environment, and for plant breeders attempting to develop optimal breeding strategies.

Tea is an evergreen perennial that was likely first domesticated in southwestern China around 3000 BCE, according to historical evidence (Wambulwa et al., 2021). Tea prefers well-drained, acidic soils and temperatures between 12 and 30°C during the growing season (Ahmed and Stepp, 2013). Most tea is produced in humid tropical or humid subtropical climate regimes; however, different varieties appear to be adapted to local climatic conditions (Xia et al., 2020).

Since tea’s first domestication resulting in China-type teas, the crop has undergone possibly two more domestication events in southwestern China and Assam, India, resulting in Chinese Assam-types and Indian Assam-types (Meegahakumbura et al., 2016). The vast spread of tea in antiquity and long history of human cultivation have resulted in area-specific landraces and clonal varieties. These generally fall into one of two recognized subspecies, China-type, Camellia sinensis var. sinensis (CSS) and Assam-type, Camellia sinensis var. assamica (CSA). However, cross pollination followed by continuous selection has resulted in many varieties exhibiting varying levels of admixture between the two. Generally, CSS is characterized by small leaves and a spreading habit, while CSA has larger leaves and a semi-arboreal habit (Carr, 2018). However, identification based on morphological traits has limited utility because of the continuous nature of the characteristics (Wang et al., 2020).

Historical records reveal a complex past of tea cultivation in the US, particularly in the Southeast, and with regard to the provenance of plants. Junius Smith’s 1849 attempt at tea cultivation in South Carolina used seeds and plants sourced from India with some possible admixture of CSS plants from Guandong, China (Rose, 2010; Walcott, 2012). Much of the tea propagated in the eastern US in the 1850s and 1860s originated from Anhui, Fujian, and Zhejiang provinces in China, sourced by Robert Fortune for the Royal Horticultural Society (Gardener, 1971). The Pinehurst tea farm in South Carolina, established in the 1880s, included an Assam-Chinese hybrid that likely predates Fortune’s expedition; this plantation also included plants from Darjeeling, as well as an unnamed Japanese variety, and ‘Dragon’s Pool’ seed from Zhejiang (Walcott, 1999). The full extent of tea germplasm exchange from East to West cannot be quantified; however, given the federal support of tea cultivation, it is likely that in the 19th century, tea germplasm was imported to the US from China, Japan and India (Klose, 1950). In the 1960s, Lipton installed experimental stations for tea production around the US Southeast and West. Though Lipton had largely abandoned US tea production by the 1980s, some of the tea lines they introduced are currently being revived. In addition to the historical varieties, more recent (<40 years) efforts to produce tea by nursery growers and hobbyists have led to new introductions from China, India, Russia, Georgia, Japan, Korea, and Nepal. These manifold geographical origins, together with up to 200 years of hybridization, and the inclusion of varied recent additions, complicates the questions of 1) selecting area-appropriate germplasm for growers and 2) optimizing breeding strategies for researchers.

Where pedigrees are lacking, investigating the genetics of tea plants is important for making decisions about controlled crosses in a breeding program. DNA-based methods of identification have been shown to produce adequate conclusions about ancestry, admixture, and phylogenetic relationships (Liu et al., 2017; Jin et al., 2022). Availability of the genome sequence for tea (Wei et al., 2018; An et al., 2020) facilitates the generation of DNA-based markers. Research performed on Chinese tea has shown that local varieties retain genetic signatures related to the region to which they are adapted (Liu et al., 2019). Supporting this, phylogenetic studies performed using plastid genomes (Wambulwa et al., 2017; Meegahakumbura et al., 2018), SSR markers (Tan et al., 2016; Liu et al., 2018), SNP and InDel markers (Liu et al., 2019) have found a strong correlation between clades and geographical origins of plants.

Given the limited nature of information regarding provenance and performance of US tea, this research was performed to characterize the genetic diversity and potential geographic origins of 32 tea accessions collected from sources in the southeastern US. Ten InDel markers developed by Liu et al. (2019) were used to estimate the genetic diversity of the tea accessions. The study included 30 of Liu et al. (2019)’s reported Chinese varieties as a background population, against which the US accessions were compared, thus deciphering geographic origin for some of the University of Florida tea accessions. Morphological descriptors and yield were analyzed for 19 of the accessions to test relationships between markers and yield phenotype.

2 Materials and methods

In previous studies from our research group (Orrock et al., 2017; Orrock et al., 2020; Orrock et al., 2021), we have used plant names provided by the nursery or other sources as names of “accessions.” However, recognizing extent of variability within commercially sourced plants sold under a single trade name, here we revise our terminology on germplasm to better reflect the diversity. “Named groups” are tea plants obtained with a trade name provided by the commercial source. Genetically unique individuals within a named group are considered “accessions.” Wherever the derivation of a named group (clonally or sexually propagated) is unclear, single plants will be referred to as “individuals.”

2.1 Tea plants

Thirty-two domestically sourced individuals of tea were included in this study (Supplementary Table 1). Nineteen individuals from eight named groups were sampled from the genetic diversity and yield study plots at Plant Science Research and Education Unit in Citra, FL (29°24’27N, 82°14’11W). Sampled individuals were chosen for the genetic evaluation based on annual yield totals for 2021 (Table 1). Within each named group, the highest-yielding and lowest-yielding individuals were included. For the three highest-yielding named groups, ‘Small Leaf,’ ‘Fairhope,’ and Miwa’s Garden,’ a median-yield individual was also sampled for the genetic diversity study. Seven named groups (‘Assamica,’ ‘Big Leaf,’ ‘China Seed,’ ‘Fairhope,’ ‘Georgian,’ ‘Large Leaf,’ and ‘Small Leaf’) were installed in 2016, and one (‘Miwa’s Garden’) was installed in 2020. The site is maintained with drip irrigation and weed barrier fabric, and plants are fertigated weekly with 6 lbs of N, 8.5 lbs of S, and 7 lbs of K per acre. The nineteen individuals representing eight named groups were also used for measuring morphological descriptors. Leaf samples from three individuals growing at the Great Mississippi Tea Company in Brookhaven, MS and an additional ten container-grown accessions maintained in a greenhouse in Gainesville, FL were included in the genetic diversity study (Supplementary Table 1). The greenhouse plants were used to determine timing of anthesis. Thirty tea accessions sourced from Anhui Agricultural University in Hefei, China were included in the genetic diversity study as a background population against which the unknown accessions were compared (Supplementary Table 2). Each of these accessions is a representative from a registered Chinese variety.

Table 1

Named GroupHigh yield (g/plant/year)IndividualLow yield (g/plant/year)IndividualMedian yield (g/plant/year)Individual
‘Assamica’22.3Assamica 29.9Assamica 1NANA
‘Big Leaf’91.1Big Leaf 12.2Big Leaf 2NANA
‘China Seed’27.7China Seed 10.1China Seed 2NANA
‘Fairhope’142.6Fairhope 33.7Fairhope 132.8Fairhope 2
‘Georgian’64.2Georgian 24.7Georgian 1NANA
‘Large Leaf’90.6Large Leaf 23.4Large Leaf 1NANA
‘Small Leaf’135.1Small Leaf 25.1Small Leaf 354.6Small Leaf 1
‘Miwa’s Garden’158.5Miwa’s Garden 210.9Miwa’s Garden 347.0Miwa’s Garden 1

The field-grown tea plants used in the genetic diversity study and their respective total annual yield (g/plant/year).

Within each named group, individuals were chosen to be representative of the highest and lowest yield from the mean of 2020 and 2021, marked as high yield and low yield respectively. ‘Miwa’s Garden’ idividuals were chosen from 2021 yield data, as this named group was not harvested in 2020. For the three highest-yielding accessions, individuals with a median yield is also included (NA, not applicable).

2.2 Yield

Yield was assessed over 3 harvests in 2020 and 4 harvests in 2021. Yield measurements included all plants within each of 8 named groups on the site. Plants were harvested by plucking the apical meristem and first two expanded leaves, commonly referred to as a “two and a bud” pluck. Yield in fresh weight was measured per-plant immediately after harvest to prevent loss of mass due to transpiration.

2.3 Morphological measurements and anthesis characterization

Morphological descriptors were measured on field-grown plants to describe leaf and flower characteristics as outlined in the IPGRI descriptor tool (IPGRI, 1997). Seven leaf characters (Internode length, mature leaf color, leaf margin, leaf length, leaf width, leaf length, and leaf pose) and two floral characters (relative height between pistil and stamen, and style splitting) were measured using calipers and visual inspection, as appropriate, in January 2022. Mature leaf descriptors were recorded on the first fully expanded leaf occurring on mature wood (Figure 1). Anthesis was measured from February through April of 2021 on greenhouse plants using time-lapse photography. Photographs of the floral bud were taken in 10-minute increments over a 24–48-hour period, starting at balloon stage and ending after the completion of anthesis. Photographs were time-stamped to accurately determine beginning and end times of anthesis.

Figure 1

2.4 DNA extraction and amplification

Young leaves were sampled and frozen at -20°C until DNA extraction. Genomic DNA was extracted using the DNEasy Plant Kit (Qiagen) and quantified using NanoDrop (Thermo Fisher Scientific, Waltham, MA, USA). Concentration of genomic DNA was adjusted to 10 ng.µl-1 prior to PCR amplification. Fluorophore 6-FAM-tagged primers were used for PCR amplification (Supplementary Table 3). PCR amplifications were performed in 20 uL reactions each containing: 10 µL Plant Phire DNA polymerase master mix (Thermo Fisher Scientific), 1 µL forward primer (0.5 µmol), 1 µL reverse primer (0.5 µmol), 1 µL genomic DNA (10 to 20 ng), and 7 µL water. Cycling conditions were as follows: 5 minutes initial denaturation at 98°C; 35 cycles of 5 seconds denaturation at 98°C, 5 seconds annealing at primer-specific temperature, and 20 seconds extension at 72°C; and 5 minutes final extension at 72°C. Products were verified on 2% (wt/vol) agarose gel, and fragment size was determined using capillary electrophoresis (Applied Biosystems 3730 Genetic Analyzer, Waltham, MA, USA). Alleles were called in PeakScanner version 1.0. Missing alleles were coded as “0” for null alleles. Five fragments amplified from CsIndel17 were sequenced with Sanger sequencing to confirm the priming and polymorphism of that InDel marker (Supplementary File 1).

2.5 Genetic statistics and analysis

Expected heterozygosity (He), observed heterozygosity (Ho) and fixation index (Fst) for 10 InDels were calculated in GenAlEx (Peakall and Smouse, 2012). Polymorphic information content (PIC) for 10 InDels was calculated with the ‘pic_calc’ function in Rstudio (version 2022.2.3.492) using package ‘PopGenUtils (R Studio, 2022).’ A distance matrix, calculated using Nei’s genetic distance, was used to create a neighbor-joining (NJ) tree with the ‘bionjs’ algorithm in Rstudio using the ‘ape’ package (Paradis and Schliep, 2019). The NJ tree was visualized using MEGA 11 (Tamura et al., 2021). Population structure was evaluated in STRUCTURE using the admixture, sampling locations, and correlated allele frequencies model with a burn-in of 100,000 and 10,000 MCMC repetitions (Pritchard et al., 2000). The best k was determined using the Evanno method (Evanno et al., 2005) via Structure Harvester (Earl and vonHoldt, 2012). The best Q-plot, determined using CluMPaK, was visualized using the R package ‘StuctuRly’ (Criscuolo and Angelini, 2020). Genetic groupings were further evaluated using Discriminant Analysis of Principal Components (DAPC) in the R package ‘adegenet’ (Jombart, 2008). Comparison of clusters produced by the NJ method and DAPC was performed by analyzing intersection of sets; sets were assigned by visual examination of clusters produced by NJ algorithm, and posterior DAPC group assignments. Set intersections were calculated and visualized using ‘Intervene’ (Supplementary Table 4) (Khan and Mathelier, 2017).

3 Results

3.1 Yield

Total annual yield of individual plants in 2020 and 2021 ranged from 0.1 g/plant/year (‘China Seed’) to 158.5 g/plant/year (‘Miwa’s Garden’) (Table 1). Among the named groups of tea plants, those that recorded the highest total annual yield per plant in both 2020 and 2021 were ‘Small Leaf’ and ‘Fairhope’ (Table 2). These groups also had the highest variation. Lowest variance of total annual yield per plant in both 2020 and 2021 was observed in ‘Assamica’ (Table 2). In 2020, annual yields for ‘Small Leaf,’ ‘Fairhope,’ ‘Georgian,’ ‘Large Leaf,’ ‘Big Leaf,’ and ‘Assamica,’ were not significantly different at α=0.05. ‘Miwa’s Garden’ was not harvested during that year. In 2021, ‘Small Leaf,’ ‘Fairhope,’ and ‘Miwa’s Garden’ were the highest yielding named groups, with mean yields above 50 grams per plant.

Table 2

Named Group20202021Number of plants (n)
Yield (g/plant)VarianceYield (g/plant)Variance
‘Assamica’15.21 ab2.0615.72 d1.845
‘Big Leaf’19.31 ab7.0636.29 bc29.2140
‘China Seed’11.56 b4.0810.83 d4.1327
‘Fairhope’20.72 a29.3850.28 ab77.4025
‘Georgian’20.00 a21.0918.71 cd9.0520
‘Large Leaf’14.77 ab4.4726.39 cd19.9940
‘Small Leaf’20.76 a17.6863.90 a94.1112
‘Miwa’s Garden’NANA54.35 ab83.2725

Mean annual yield by named group of tea plants, variance and the number of plants in each group at PSREU in Citra, FL.

Different lowercase letters following the mean values within each column indicate significant difference (p<0.05) by Duncan’s Multiple Range Test for 2020 and 2021. Plants were harvested in July, September, and November of 2020, and April, June, August, and September of 2021. Mean annual yield was determined by averaging the yield for all harvests within a year by named group. NA, Not available.

3.2 Leaf and floral morphology

Multiple morphological traits varied between named groups and some varied among the individual plants within a named group (Figure 1). When specific traits were considered individually, the 19 plants observed could be grouped into up to four groups (Table 3). Leaf characteristics varied by named group. The ‘Large Leaf’ and ‘Big Leaf’ accessions had the longest and widest leaves, while ‘Small Leaf’ had the shortest and narrowest leaves (Figure 1). Leaf colors were assessed to be light green, green, greyed-green, greyed-yellow or yellow green. Leaf color varied within named groups, with only ‘Large Leaf’ and ‘Big Leaf’ showing uniform leaf color. Leaf margins were largely serrulate or biserrate, with ‘Big Leaf’ showing denticulate margins. Leaf pose was moderately upright at between 40 to 75° relative to the stem. Leaf pose was above 57° for all individuals except ‘China Seed 1,’ (40°), ‘Small Leaf 1’ (53°) and ‘Small Leaf 2’ (53°) (Table 3). The relative height of pistil and stamen varied among and within the named groups, with only ‘Small Leaf’ individuals showing a uniform characteristic of equal height between pistil and stamen. The style splitting pattern was more uniform within named groups, with ‘Big Leaf’ and ‘Large Leaf’ having the style separating at the base and ascending freely, while ‘Small Leaf,’ ‘Miwa’s Garden,’ and ‘Georgian,’ had a united style that split near the top. Both ‘China Seed’ individuals showed a style that split at the middle of its length. ‘Fairhope’ mostly shared this united style, though one individual, ‘Fairhope 1,’ showed a style that split around the middle of its length (Supplementary Figure 1).

Table 3

IndividualInternode Length (mm)Mature Leaf ColorLeaf MarginLeaf Length (mm)Leaf Width (mm)Leaf Length: Width RatioLeaf Pose (°)Relative height between pistil and stamenStyle splitting
Assamica 112.9 ± 2.1 cdefg2379.0 ± 3.8 bcde30.2 ± 1.2 defg2.657 ± 4.6 de33
Assamica 28.0 ± 2.0 fg1377.0 ± 7.0 bcdef32.9 ± 2.3 cd2.366.4 ± 1.6 abcd11
BigLeaf 18.7 ± 1.6 efg2588.2 ± 2.2 ab42.5 ± 1.4 a2.168.0 ± 3.4 abcd11
BigLeaf 27.3 ± 1.1 fg2571.1 ± 6.3 defg32.3 ± 3.2 de2.269 ± 1.9 abc31
ChinaSeed 110.0 ± 6.3 fg5371.5 ± 4.7 def29.9 ± 2.3 defg2.440.0 ± 3.9 f22
ChinaSeed 227.3 ± 4 a1395.4 ± 3.4 a38.5 ± 1.6 ab2.561 ± 2.4 bcde12
Fairhope 111.8 ± 1.6 defg5351.9 ± 2.1 i18.1 ± 0.7 j2.959.6 ± 2.4 bcde22
Fairhope 218.8 ± 3.2 bc1364.9 ± 2.1 fgh30.3 ± 0.9 defg2.175 ± 2.7 a13
Fairhope 312.8 ± 1.8 cdefg2474.7 ± 4.9 cdef29.5 ± 1.7 defg2.561.0 ± 1.9 bcde33
Georgian 115.6 ± 1.9 bcde5356.7 ± 2.2 hi22 ± 0.9 hij2.671 ± 1.3 ab13
Georgian 27.0 ± 0.7 fg1369.9 ± 4.9 efg25.3 ± 2.0 ghi2.860.0 ± 4.2 bcde33
LargeLeaf 16.4 ± 0.6 g2365.6 ± 2.8 fgh30.8 ± 2.0 def2.169.4 ± 8.3 abc11
LargeLeaf 220.0 ± 3.5 b23.585.5 ± 4.4 abc37.6 ± 1.8 bc2.359.0 ± 1.9 cde31
MiwasGarden 19.4 ± 1.1 efg5383.2 ± 2.1 bcd28.7 ± 1.1 defg2.957 ± 2.5 de33
MiwasGarden 214.2 ± 2.4 bcdef23.583.1 ± 3.2 bcd33.6 ± 0.6 cd2.568.0 ± 2.5 abcd33
MiwasGarden 311.4 ± 1.2 defg2365.8 ± 3.5 fgh26.6 ± 1.1 fgh2.561 ± 2.9 bcde23
SmallLeaf 118.2 ± 1.5 bcd2364.2 ± 2.5 fgh27.1 ± 0.8 efgh2.453 ± 2.5 e23
SmallLeaf 218.9 ± 2.0 bc5358.8 ± 3.7 ghi26.7 ± 1.6 fgh2.253.0 ± 4.6 e23
SmallLeaf 311.2 ± 1.5 defg2350.3 ± 3.0 i21.2 ± 1.5 ij2.470 ± 2.2 abc23

Morphological characteristics from IPGRI descriptor tool for 19 individual plants belonging to 8 named groups of tea.

Individuals within each named group are ordered from highest to lowest 2021 yield. Mature leaf color (1= Light green, 2 = Green, 3 = Greyed-green, 4 = Greyed-yellow, 5 = Yellow green). Leaf margin (1 = Entire, 2 = Wavy, 3 = Serrulate, 4 = Biserrate, 5 = Denticulate). Relative height between pistil and stamen (1 = same height, 2 = stamen higher than pistil, 3 = pistil higher than stamen). Splitting of style (1 = geniculate, free for greater part of their length, 2 = ascending, free for about half their length, 3 = united for greater part of the length, the free part short, more or less horizontal and terminal). Letters within a column indicate significantly different (p<0.05) groups identified by Duncan’s multiple range test.

3.3 Anthesis characterization

Timelapse photography revealed that tea flowers largely began anthesis in the early- to mid-morning, typically commencing between 5:00 and 11:00 am (Figure 2). Fewer than 10% of flowers began anthesis in the late afternoon or evening. Duration of anthesis was typically within 8 hours of the onset. When anthesis took more than 12 hours, it typically began in the afternoon and paused overnight.

Figure 2

3.4 Genetic markers

Analysis of 10 InDel markers across 62 individuals revealed a total of 96 unique alleles (Table 4). Fragment sizes ranged from 138 to 358 base pairs. Three loci, CsInDel11, CsInDel17, and CsInDel38, showed notably lower observed heterozygosity (Ho) compared to expected heterozygosity (He). These loci were largely homozygous at the individual level but showed a high degree of variation among individuals. The polymorphism information content (PIC) was similar to the He for all loci. All PIC values were above 0.45, and 7 of 10 InDels showed PIC values at or above 0.50, indicating the InDel markers are highly informative (Botstein et al., 1980). The mean PIC value was 0.67. The fixation index (Fst) was measured between the US individuals as one population (n=32) and the Chinese individuals as a second population (n=30). At all loci, Fst was <0.05, indicating that there is not sufficient genetic structure to consider the US and Chinese individuals as separate populations.

Table 4

LocusNaFragment Size (bp)HeHoPICFst
CsInDel049213-2360.80 ± 0.020.82 ± 0.060.800.024
CsInDel099203-2440.71 ± 0.000.69 ± 0.090.700.031
CsInDel1110286-3220.83 ± 0.000.38 ± 0.050.840.035
CsInDel1720307-3580.89 ± 0.000.35 ± 0.110.900.020
CsInDel1812283-3180.74 ± 0.010.76 ± 0.150.720.011
CsInDel197176-2070.52 ± 0.010.32 ± 0.100.490.009
CsInDel204282-2920.55 ± 0.030.62 ± 0.100.460.005
CsInDel287216-2330.54 ± 0.020.38 ± 0.000.500.007
CsInDel3814138-3500.82 ± 0.050.43 ± 0.120.840.034
CsInDel434221-2370.55 ± 0.000.42 ± 0.070.470.007
10 InDels: US Individuals73NA0.701 ± 0.040.562 ± 0.08NANA
10 InDels: Chinese Individuals82NA0.695 ± 0.050.483 ± 0.05NANA
10 InDels: US and Chinese Individuals96NA0.712 ± 0.050.522 ± 0.060.67NA

Details concerning the apparent number of alleles (Na), range of fragment sizes in base pairs (bp), and observed and expected heterozygosity (Ho and He, respectively) at 10 InDel loci for 62 individual samples of tea and the apparent number of alleles.

Loci where the Ho was less than the He by >0.25 are indicated in bold. Ho and He were calculated for all loci in the entire study population (n=62) and two subpopulations: US Individuals (n=32), and Chinese Individuals (n=30). Polymorphism information content (PIC) was calculated at 10 InDel loci for the entire population (n=62) and a mean PIC value for all loci was also calculated. Fixation index (Fst) was calculated for each locus as a comparison between the two subpopulations. NA, Not applicable.

3.5 Genetic diversity and population structure analysis

3.5.1 Genetic distance

The NJ tree revealed four distinct clusters (Figure 3; Supplementary Table 4). ‘Shuchazao’ and ‘Yunkang 10,’ the reference genome varieties for CSS and CSA respectively, clustered on separate clades. The Chinese variety groupings showed some homology with their previously published groupings according to Liu et al. (2019), notably in the proximity among ‘Shuchazao,’ ‘Guyuxian,’ Ziyan,’ and ‘Xiaoxianghong’ in one clade, ‘Yunkang 10,’ ‘Dahong,’ and ‘Shancha 1’ in another, and ‘Anjibaicha,’ ‘Baihaizhao,’ ‘Chuanmu 28,’ and ‘Fudingdabai’ in a third. Geographical origin of the Chinese varieties was not strongly indicated in the tree. While varieties listed from northern and eastern China (an area encompassing Anhui, Shanxi, and Zhejiang provinces) sometimes occurred close to one another at the tips, there was no overall topology suggesting a strong geographical correlation. Among US tea, several of the seed lot-derived individuals showed considerable distance within named groups, with one ‘Fairhope’ clustering closer to ‘Yunkang 10’ and two others grouping near ‘Shuchazao’ (Figure 3). Both ‘China Seed’ individuals grouped next to one another, indicating they were derived from a narrow gene pool. ‘Big Leaf’ and ‘Large Leaf’ individuals had close genetic distance, indicating that these named groups may be derived from the same or very similar original plants. ‘Small Leaf’ individuals showed a broader distance, indicating more genetic diversity within the named group. Notably, ‘Small Leaf 3’ and ‘MS Oolong,’ which are purported to come from the same population (Jason MacDonald, personal communication, October 3, 2021), grouped on the same node. ‘Sochi,’ ‘Gangwang-do,’ ‘Georgian 2’ and ‘Chestnut Hill,’ all of which are named groups known to come from cold regions, grouped together in one node. ‘Black Sea Tea,’ another cold region named group, clustered with ‘Georgian 1’ on a separate node (Figure 3).

Figure 3

3.5.2 Population structure

STRUCTURE analysis indicated 4 ancestral populations (Figure 4). Each method included in Evanno’s determination of best k indicated k=4. Proximity between individuals seen in the NJ tree are supported by this analysis. ‘Big Leaf’ and ‘Large Leaf’ derive most of their ancestry from a single source, Group 1. ‘Small Leaf 3’ and ‘MS Oolong’ appear to share most of their ancestry from a second source, Group 2. Among the Chinese background population, ‘Tieguanyin’ and ‘Zimudan,’ both from Fujian province, show the highest proportion of ancestry from this group.

Figure 4

Seed lot-derived named groups ‘Fairhope,’ and ‘Georgian’ vary in their ancestral proportions more than ‘Small Leaf,’ also a seed lot-derived named group. ‘Baihaozhao,’ ‘Echa 1,’ ‘Fudingdabai,’ ‘Chuanmu 28,’ ‘Zhenong,’ ‘Anjibaicha,’ ‘Fuzao 2,’ and ‘Bedou,’ which appear together in one node on the NJ tree, are shown in series on the Q-plot with continuous decrease in ancestry from cluster 3 and increasing ancestry from cluster 4 (Figure 4). Among these varieties, all except ‘Anjibaicha’ and ‘Fudingdabai’ originate in northeast China. Proximal to this group are ‘Shuchazao,’ ‘Guyuxian,’ and ‘Zhenong 108,’ also from northeast China. ‘Yunkang 10’ shows 85% of its genome from cluster 4, while ‘Shuchazao’ displays more admixture, with 59% of its genome from cluster 3 and 36% from cluster 4. The most admixed individuals are ‘Tieguanyin’ and ‘Zimudan,’ which share a geographical provenance of Fujian province. Among the Chinese varieties, this population structure analysis reflects geographical provenance more strongly than the clusters identified using the NJ method, although both methods used the same set of multilocus genotype data. Varieties originating from northeast China have a higher proportion of ancestry from cluster 3, while those from southwest China largely show a higher proportion of ancestry from cluster 4. ‘Yunkang 10’ and ‘Yinghong 9,’ both CSA varieties, show nearly the same proportion of ancestry from cluster 4 and cluster 2, while ‘Yinghong 9’ shows proportionally more ancestry from cluster 3 (Figure 4).

3.5.3 Discriminant analysis of principal components

Principal components analysis anterior to DAPC indicated 4 groups based on the Bayesian information criterion (BIC) (Figure 5). Posterior assignments indicate cluster 1 is the largest with 23 individuals, or 37% of the study population. ‘Yunkang 10’ and ‘Yinghong 9,’ both CSA varieties, appear in Cluster 1. Cluster 3 includes ‘Anjibaicha,’ ‘Baihaozhao,’ ‘Bedou,’ ‘Chuanmu 28,’ ‘Echa 1,’ ‘Fudingdabai,’ ‘Fuzao 2,’ ‘Zhenong,’ and ‘Guyuxiang.’ As previously stated, most of these varieties come from northeast China. This group shows homology with the NJ tree node containing these varieties. ‘Small Leaf 3’ is again associated closely with ‘MS Oolong,’ appearing at the same coordinates on the graph. Accessions from ‘Large Leaf’ and ‘Big Leaf’ appear closely grouped in cluster 1. Individuals from seed lot-derived named groups ‘Fairhope,’ ‘Georgian,’ and ‘Miwa’s Garden’ are split between the clusters, while ‘China Seed’ individuals appear in cluster 2 and ‘Small Leaf’ individuals appear in cluster 4 (Figure 5; Supplementary Table 4).

Figure 5

3.5.4 Comparison of clusters

Clusters identified by the Neighbor-joining method (Figure 3) and by DAPC (Figure 5) were compared using Intersection of Sets (Figure 6; Supplementary Table 4). The groups identified by the two different methods did not show high rates of intersection, suggesting that the two clustering methods identified different groupings. ‘NJ 2’ is the smallest set with only 8 accessions, 6 of which appear in ‘DAPC 1.’ The next highest rates of intersection were between ‘DAPC 3’ and ‘NJ 1,’ where ‘DAPC 3’ comprises 69% of accessions shared with ‘NJ 1’; and ‘DAPC 4’ and ‘NJ 1,’ where ‘DAPC 4’ comprises 61% of accessions shared with ‘NJ 1’. ‘NJ 1’ is largely split between ‘DAPC 3’ and ‘DAPC 4,’ while the remaining groups show broader dispersal of intersections (Figure 6).

Figure 6

4 Discussion

The plants included in this study are derived from clonal propagation, selected cuttings from seed-derived plants, or from seed lots (Orrock et al., 2017). Levels of genetic diversity within a named group varied according to propagation method, with clonal varieties ‘Large Leaf’ and ‘Big Leaf,’ suspected to be highly similar germplasm under different trade names, clustering tightly together in the analysis of genetic distance and both analyses of population structure. Individuals from ‘Big Leaf’ and ‘Large Leaf’ were, however, differentiated at two loci. At CsInDel09, ‘Big Leaf 1’ and ‘Large Leaf 1’ had an allele of 212 bp while ‘Big Leaf 2’ and ‘Large Leaf 2’ had an allele of 221 bp. At CsInDel11, ‘Large Leaf 2’ was homozygous for an allele of 301 bp while the other individuals were heterozygous with alleles of 298/301 bp (Supplementary File 2). This indicates that these named groups may derive from several selections of a closely related population, instead of purely clonal propagation from a single individual. Congruent to this, the leaf margin trait significantly differed between ‘Large Leaf’ and ‘Big Leaf’ (Table 3). The named group ‘Large Leaf’ was reported to be tetraploid (Hembree et al., 2019), and as such the tetraploids may be incompatible with the diploid accessions. The field-tested and greenhouse individuals from both ‘Big Leaf’ and ‘Large Leaf’ show low rates of seed development and maturation, usually aborting the seed by about four months (data not shown).

Though it is a woody perennial crop, tea is not cultivated for a mast of fruit or nuts, but rather for vegetative growth. For this crop, yield and biomass accumulation are closely related parameters (Orrock et al., 2021), with higher yields also indicating that an individual is better adapted to local environmental and climatic conditions. Given the wide array of factors that influence commercial yields, it is difficult to define a “typical yield” for a tea plant, and yield data from varietal garden plots not managed for commercial production may not be predictive of yields for the same germplasm under different conditions. Nevertheless, yield comparisons are necessary to fully investigate poorly described germplasm and make breeding decisions regarding controlled crosses.

The 19 field-grown individuals used in this study were selected based on yield, and all three analyses were able to distinguish at least some high- versus low-yielding genotypes. Cluster analysis of these field-grown accessions may indicate genetic variations related to adaptability in Florida conditions. Among the low-performing named groups, ‘Georgian’ and ‘China Seed,’ occurrence of a few high-yielding individuals may indicate genetic attributes that enable them to perform well despite the majority of the named group showing poor adaptability to the region. They warrant further investigation to preserve advantageous genetic traits that may be present in these individuals and missing from others in the named group. All four individuals representing ‘Big Leaf’ and ‘Large Leaf’ grouped together, which is unsurprising given the other results presented in this study. Since individuals within these named groups show low genetic diversity, poor performance from low yielding ‘Big Leaf’ and ‘Large Leaf’ individuals can possibly be explained by competition, micro-environmental conditions, or patchy distribution of pests and diseases within the field. In the NJ tree of the full 62-member population, the high- and median-yielding ‘Fairhope’ and ‘Miwa’s Garden’ individuals clustered together. The low-yielding individuals from ‘Fairhope’ and ‘Miwa’s Garden’ appeared in a distant cluster from the other individuals in their named group. This division was preserved for ‘Fairhope’ in the population structure analysis, and for ‘Miwa’s Garden’ in the DAPC.

Seed lot-derived named groups ‘Fairhope,’ ‘Small Leaf,’ and ‘Miwa’s Garden’ showed higher genetic diversity within the groups, as well as higher variance in yields. As a self-incompatible species with long-term allogamy, the tea genome shows high heterozygosity and diversity (Chen et al., 2012). Even within a seed lot-derived named group, high genetic diversity is reflected in the cladistic analysis. However, having prior knowledge of the named groups being derived from either seed lot or clonal propagation is not sufficient to predict genetic distance, since some seed lot-derived named groups may have more closely related progenitors, as seen in ‘China Seed’. Cladistic analysis is therefore ideal for estimating genetic distance, both between and within seed lot-derived named groups.

The cluster analysis using the NJ tree may be used to direct choices in breeding, especially of controlled crosses. In an outcrossing organism, inbreeding depression is mitigated by self-incompatibility. Individuals that are closely related may fail to produce viable offspring, or, if offspring survive past the juvenile stage, they may be weak and underperforming. Seed lot-derived individuals from ‘China Seed’ do not show the high variance of yield seen in other seed lot-derived named groups ‘Fairhope’ and ‘Small Leaf.’ Both individuals from ‘China Seed’ clustered nearby each other on the cladistic analysis of genetic distance, indicating low genetic distance, or high genetic similarity, between those individuals. This suggests that ‘China Seed’ has lower genetic variability compared to ‘Fairhope’ and ‘Small Leaf.’ ‘Fairhope,’ ‘Small Leaf,’ ‘Miwa’s Garden,’ and ‘Georgian,’ all seed lot-derived named groups, showed more genetic dissimilarity between individuals included in the study, suggesting these named groups will show more diversity in field production settings and could be useful to identify and select superior performers.

The dendrogram clusters are supported by the population structure results, which estimates four ancestral populations. Long-lived perennials tend to show weak population structure, even among distinct geographical populations; indeed, the likelihood that all individuals studied ultimately belong to a single population was supported by the low Fst scores for each InDel locus between US individuals and the Chinese background population (Table 4). Therefore, the STRUCTURE analysis was performed with sampling locations included using the LOCPRIOR function (Miller and Gross, 2011). The results for the Chinese background population indicate a different ancestral population for accessions sourced from southwest China compared to those sourced from northeast China. Also separated in this analysis are ‘Shuchazao,’ and ‘Yunkang 10,’ the reference genome specimens for C. sinensis var. sinensis and C. sinensis var. assamica, respectively. The population structure analysis suggests that 26 of the 32 US genotypes tested share ancestry with Chinese varieties, a conclusion that is consistent with the historical record of anthropogenically mediated tea movement from East to West. The population structure analysis also shows two subpopulations poorly represented in the Chinese background population but present in the US domestic tea germplasm.

In the population structure analysis, ‘Big Leaf,’ ‘Large Leaf,’ and ‘Red Leaf’ individuals showed a high proportion of ancestry from Group 1. Among the Chinese individuals ‘Hongyang 12’ and ‘Xiaoxianghong’ showed the highest proportion from Group 1. These seven individuals also clustered together in the NJ tree and DAPC. The geographical origins of ‘Big Leaf’ and ‘Large Leaf’ are unconfirmed, but they are thought to derive from a US Department of Agriculture introduction made around 1970 (Jason MacDonald, personal communication, September 8, 2022). ‘Red Leaf,’ is distinguished morphologically by the red color of new shoots and pink flowers (Supplementary Figure 1). It has been reported to exist in Japan as far back as 1895 (Makino, 1905; Kitamura, 1950). Kitamura (1950) also notes the large leaf size of red-leaf tea, a trait shared by ‘Big Leaf’ and ‘Large Leaf.’ The historical and genetic evidence suggest that ‘Big Leaf,’ ‘Large Leaf,’ and ‘Red Leaf’ share ancestry from a Japanese population of tea. The genetic evidence further suggests that the progenitors of this population contributed some genetic material to Chinese individuals ‘Hongyang 12,’ ‘Xiaxianghong,’ and Korean individual ‘Gangwang-do’ (Figure 3).

‘Small Leaf 3’ and ‘MS Oolong’ showed much of their ancestry from Group 2, another group that is not strongly represented in the Chinese background population. The University of Florida ‘Small Leaf’ named group came from the same nursery stock as the ‘MS Oolong’ individual. This named group is thought to originate from clonal propagules sourced from Charles Shepard’s Pinehurst tea plantation in South Carolina (Jason McDonald, personal communication, September 8, 2022). There is a stark division in ancestry between ‘MS Oolong’ and ‘Small Leaf 3’ compared to ‘Small Leaf 1’ and Small Leaf 2,’ the latter of which share more ancestry with individuals from northeast China (Figure 3) and are morphologically comparable (Table 3). The genetic evidence suggests a number of cuttings were taken from genetically dissimilar plants and sold under a single trade name; a conjecture consistent with the historical record showing tea from several different regions were planted at Pinehurst (Walcott, 1999). The putative origin of ‘MS Oolong’ and ‘Small Leaf 3’ is more difficult to ascertain. Of the recorded plants at Pinehurst, an Assam hybrid seems unlikely because of the small leaf size. A Chinese variety also seems unlikely, since the ancestral Group 2 would be better represented in the genetic structure analysis of the Chinese population. These individuals may have therefore originated from a Japanese tea population that was discrete from the ‘Big Leaf,’ ‘Large Leaf,’ and ‘Red Leaf’ progenitors. Though the genetic diversity among commercial Japanese tea varieties is low compared to other tea-growing regions (Ni et al., 2008), this is due to the high density of ‘Yabukita’ plantings and its heavy use as a breeding parent. ‘Yabukita’ was not selected until the early 20th century, about 50 years after the establishment of Pinehurst tea plantation. Japanese tea varieties from the Shizuoka and Uji regions, the two main sites of tea’s introduction into Japan from China around the 13th century, were likely sourced from distinct populations within China (Yamashita et al., 2019). Additionally, recent genetic analyses using SNP markers have shown these varieties share little genetic similarity with extant Chinese varieties (Yamashita et al., 2019). However, further studies involving genetic comparison between Japanese tea varieties and US-sourced tea accessions are necessary to confirm this hypothesis.

DAPC, by minimizing variation within groups, can be useful for balancing breeding decisions based on NJ or similar algorithms, which are designed to show genetic distance. DAPC is used in this study instead of another commonly used measure of genetic variation, the PCA. DAPC does in fact begin by transforming the data using PCA, which allows for an informed selection of the number of groups using the BIC to determine the best k. While PCA helps to interpret total genetic variation, DAPC is more suited for analyzing variance among groups, while minimizing within-group variation. (Jombart and Collins, 2015). DAPC is not based on a distance matrix like the NJ algorithm, but rather on allele frequencies. Reviewing the results using intersection of sets provides insight into the utility of the different clustering analyses. The DAPC posterior assignment groupings showed homology with node tip assignments seen in the NJ clusters, and ancestry estimates produced by the population structure analysis. However, group assignments differed between the DAPC and NJ algorithm, to some extent; this divergence is explored in the Intersection of Sets analysis (Figure 6). Notable shared assignments between the DAPC and NJ method included named groups shown to be closely related such as ‘Big Leaf’ and ‘Large Leaf.’ Since DAPC is designed to minimize variation within groups, some of the seed lot-derived named groups appeared in the same cluster, while on the NJ tree, they appeared in different clades. The DAPC results are in keeping with the population structure analysis, which shows ‘Small Leaf’ and ‘Miwa’s Garden’ in the same DAPC groups, and also having similar proportions of ancestry to one another on the population structure analysis.

The investigation into timing and duration of anthesis presents opportunities to cross pairs that would otherwise be temporally isolated. Finally, it should be noted that the breeding recommendations presented here are based on yield and do not take into account questions of quality. Further studies examining the tea produced from named groups and individuals under Florida field conditions are required to investigate horticultural and genetic effects on quality of Florida-grown tea.

As an allogamous plant with a highly heterozygous genome and broad geographic range of cultivation, the potential for tea germplasm to possess adaptations to specific environmental conditions is high. Inferring regional suitability may be possible using genetic analysis and observations of regional viability. ‘Chestnut Hill’ was selected from Morris Arboretum near Philadelphia, which is USDA Plant Hardiness Zone 7a and experiences winter low temperatures from -17.8 to -15°C (USDA, 2012). This specimen was obtained by the Arboretum in 1953 (Goff, 2021). Given the warming trend of the last 50 years, it has likely survived even colder temperatures. ‘Gangwang-do,’ ‘Sochi,’ and ‘Georgian 2’ clustered together with ‘Chestnut Hill’ on the NJ tree, indicating that they, too, may be suited for relatively colder winter temperatures. This analysis is consistent with the reported origins of these named groups in the colder parts of the tea growing range: mountains near the Korean DMZ, and the region around Sochi, Russia.

The poor yield performance of ‘Assamica’ in Florida field conditions is worth noting, especially because this named group was predicted to be one of the better performers for this area due to CSA being mostly cultivated in the relatively warmer tea growing regions (Wei et al., 2018). Knowing that CSA subspecies available in the US likely originated from the Indian Himalayas, it is possible that the field-tested ‘Assamica’ individuals are either shade-adapted or require higher altitudes in areas reaching above 30°C, if not both (Carr, 2018).

Based on yield, ‘Fairhope’ individuals perform well in Florida conditions, as do ‘Miwa’s Garden,’ ‘Big Leaf’ and ‘Small Leaf.’ The polyploidy of ‘Big Leaf’ and ‘Large Leaf’ is consistent with broader environmental adaptability (Ramsey, 2011). One of the ‘Small Leaf’ accessions, ‘Small Leaf 3,’ has had low yields in Florida, but shows high homology with ‘MS Oolong’ in the population structure analysis, NJ tree, and DAPC. The ‘MS Oolong’ individual is an established plant at a working tea farm in Mississippi and was sourced from the same tea seed farm as the named group ‘Small Leaf.’ The ‘MS Oolong’ plants are productive in an area with cooler summer and winter temperatures than Florida, among other environmental differences. Given the high genetic similarity between these two individuals, and the dichotomy in ancestry within ‘Small Leaf’ demonstrated by the population structure analysis, it is likely that some ‘Small Leaf’ plants are generally more cold-hardy and less likely to be heat tolerant. Indeed, this highlights some possibly rare attributes of the surviving ‘Small Leaf’ plants field-grown in Florida.

Statements

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding author.

Author contributions

BR and BSR designed the outline of this study and collected the resources needed. CC completed the experimental research, computational data analyses and prepared the manuscript. All authors contributed to the article and approved the submitted version.

Funding

CC was supported by a research fellowship from the Graduate School, University of Florida, Gainesville, FL.

Acknowledgments

The authors extend their gratitude to Dr. Chaoling Wei, Anhui Agricultural University, Heifei, China, for the generous donation of tea genomic DNA samples and Mr. Jason MacDonald, Great Mississippi Tea Company, Brookhaven, MS for the sharing of plant materials.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2023.1149682/full#supplementary-material

References

  • 1

    AhmedS.SteppJ. R. (2013). “Green tea,” in Tea in health and disease prevention. Ed. PreedyV. R. 1st ed. ( London, UK: Elsevier), 1931. doi: 10.1016/B978-0-12-384937-3.00002-1

  • 2

    AnY.MiX.ZhaoS.GuoR.XiaX.LiuS.et al. (2020). Revealing distinctions in genetic diversity and adaptive evolution between two varieties of Camellia sinensis by whole-genome resequencing. Front. Plant Sci.11. doi: 10.3389/fpls.2020.603819

  • 3

    BotsteinD.WhiteR. L.SkolnickM.DavisR. W. (1980). Construction of a genetic linkage map in man using restriction fragment length polymorphisms. Am. J. Hum. Genet.32:3, 314331.

  • 4

    CarrM. K. V. (2018). Advances in tea agronomy (Cambridge: Cambridge University Press).

  • 5

    ChenX.HaoS.WangL.FangW.WangY.LiX. (2012). Late-acting self-incompatibility in tea plant (Camellia sinensis). Biologia67, 2. doi: 10.2478/s11756-012-0018-9

  • 6

    ChenZ.LinZ. (2015). Tea and human health: biomedical functions of tea active components and current issues. J. Zhejiang Univ. Sci. B16:2, 87102. doi: 10.1631/jzus.B1500001

  • 7

    CriscuoloN. G.AngeliniC. (2020). StructuRly: a novel shiny app to produce comprehensive, detailed and interactive plots for population genetic analysis. PloS One15, 2. doi: 10.1371/journal.pone.0229330

  • 8

    D’AuriaJ. C.CohenS. P.LeungJ.GlockzinK.GlockzinK. M.Gervay-HagueJ.et al. (2022). United states tea: a synopsis of ongoing tea research and solutions to united states tea production issues. Front. Plant Sci.13. doi: 10.3389/fpls.2022.934651

  • 9

    EarlD. A.vonHoldtB. M. (2012). STRUCTURE HARVESTER: a website and program for visualizing STRUCTURE output and implementing the evanno method. Conserv. Genet. Resour.4:2, 359361. doi: 10.1007/s12686-011-9548-7

  • 10

    EvannoG.RegnautS.GoudetJ. (2005). Detecting the number of clusters of individuals using the software structure: a simulation study. Mol. Ecol.14:8, 26112620. doi: 10.1111/j.1365-294X.2005.02553.x

  • 11

    FAO (2018) Global tea consumption and production driven by robust demand in China and India. Available at: http://www.fao.org/news/story/en/item/1136255/icode/.

  • 12

    GardenerR. (1971). Robert Fortune and the cultivation of tea in the united states. Arnoldia31, 119.

  • 13

    GoffE. (2021). Catalogue of plants in the living collection January 2021 (Philaelphia, PA: Morris ArborteumAugust 18, 2022. Available at: https://www.morrisarboretum.org/pdf/PlantCatalogue-21-1.pdf.

  • 14

    GoggiP. (2022) State of the U.S. tea industry: review of 2021 and 2022 predictions. world tea news. Available at: https://www.worldteanews.com/issues-trends/state-us-tea-industry-review-2021-and-2022-predictions.

  • 15

    HembreeW. G.RanneyT. G.JacksonB. E.WeathingtonM. (2019). Cytogenetics, ploidy, and genome sizes of Camellia and related genera. HortScience54 (7), 11241142. doi: 10.21273/HORTSCI13923-19

  • 16

    IPGRI (1997). Descriptors for tea (Camellia sinensis) (Rome, Italy: International Plant Genetic Resource Institute).

  • 17

    JinJ. Q.DaiW. D.ZhangC. Y.LinZ.ChenL. (2022). Genetic, morphological, and chemical discrepancies between Camellia sinensis (L.) o. kuntze and its close relatives. J. Food Composition Anal.108, 104417. doi: 10.1016/j.jfca.2022.104417

  • 18

    JombartT. (2008). Adegenet: a r package for the multivariate analysis of genetic markers. Bioinformatics24, 14031405. doi: 10.1093/bioinformatics/btn129

  • 19

    JombartT.CollinsC. (2015) A tutorial for discriminant analysis of principal components (DAPC) using adegenet 2.0.0.43. Available at: https://adegenet.r-forge.r-project.org/files/tutorial-dapc.pdf (Accessed July 20, 2022).

  • 20

    KhanA.MathelierA. (2017). Intervene: a tool for intersection and visualization of multiple gene or genomic region sets. BMC Bioinf.18, 287. doi: 10.1186/s12859-017-1708-7

  • 21

    KitamuraS. (1950). On tea and camellias. Acta Phytotaxon. Geobot. 14, 2, 5662. doi: 10.18942/bunruichiri.KJ00001077611

  • 22

    KloseN. (1950). Experiments in tea production in the united states. Agric. History24 (3), 156161.

  • 23

    LiuS.AnY.LiF.LiS.LiuL.ZhouQ.et al. (2018). Genome-wide identification of simple sequence repeats and development of polymorphic SSR markers for genetic studies in tea plant (Camellia sinensis). Mol. Breed.38(59) 5. doi: 10.1007/s11032-018-0824-z

  • 24

    LiuS.AnY.TongW.QinX.SamarinaL.GuoR.et al. (2019). Characterization of genome-wide genetic variations between two varieties of tea plant (Camellia sinensis) and development of InDel markers for genetic research. BMC Genomics20 (1), 935. doi: 10.1186/s12864-019-6347-0

  • 25

    LiuS.LiuH.WuA.HouY.AnY.WeiC. (2017). Construction of fingerprinting for tea plant (Camellia sinensis) accessions using new genomic SSR markers. Mol. Breed.37, 93. doi: 10.1007/s11032-017-0692-y

  • 26

    MakinoT. (1905). Observations of the flora of japan. Tokyo Shokubutsugaka Zasshi, Tokyo, Japan, Vol. 19. 135.

  • 27

    MeegahakumburaM. K.WambulwaM. C.LiM. M.ThapaK. K.SunY. S.MöllerM.et al. (2018). Domestication origin and breeding history of the tea plant (Camellia sinensis) in China and India based on nuclear microsatellites and cpDNA sequence data. Front. Plant Sci.8. doi: 10.3389/fpls.2017.02270

  • 28

    MeegahakumburaM. K.WambulwaM. C.ThapaK. K.LiM. M.MöllerM.XuJ. C.et al. (2016). Indications for three independent domestication events for the tea plant (Camellia sinensis (L.) o. kuntze) and new insights into the origin of tea germplasm in China and India revealed by nuclear microsatellites. PloS One11 (5). doi: 10.1371/journal.pone.0155369

  • 29

    MillerA. J.GrossB. L. (2011). From forest to field: perennial fruit crop domestication. Am. J. Bot.98 (9), 13891414. doi: 10.3732/ajb.1000522

  • 30

    NiS.YaoM.ChenL.ZhaoL.WangX. (2008). Germplasm and breeding research of tea plant based on DNA marker approaches. Front. Agric. China2 (2), 200207. doi: 10.1007/s11703-008-0043-1

  • 31

    OrrockJ.RathinasabapathiB.RichterB. S. (2020). Anthracnose in U.S. tea: pathogen characterization and susceptibility among six tea accessions. Plant Dis.104, 10551059. doi: 10.1094/PDIS-07-19-1518-RE

  • 32

    OrrockJ.RichterB. S.RathinasabapathiB. (2017). Tea as a novel crop for Florida: field establishment of eight accessions. Proc. Florida State Hortic. Soc.130, 3536.

  • 33

    OrrockJ. M.RichterB. S.RathinasabapathiB. (2021). Performance of seven tea accessions in north-central Florida: correlations between potential yield and growth parameters over 2 years. HortTechnology31 (6), 846851. doi: 10.21273/HORTTECH04908-21

  • 34

    ParadisE.SchliepK. (2019). Ape 5.0: an environment for modern phylogenetics and evolutionary analyses in rBioinformatics35, 526528. doi: 10.1093/bioinformatics/bty633

  • 35

    PeakallR.SmouseP. E. (2012). GenAlEx 6.5: genetic analysis in excel. population genetic software for teaching and research-an update. Bioinformatics28, 25372539. doi: 10.1093/bioinformatics/bts460

  • 36

    PriceT. A. (2021) Is tea the south’s next lucrative crop? these family farms have mastered the centuries-old art. Available at: https://www.clarionledger.com/in-depth/news/american-south/2021/10/07/mississippi-tea-farms-growing-business-american-south/5408502001/.

  • 37

    PritchardJ. K.StephensM.DonnellyP. (2000). Inference of population structure using multilocus genotype data. Genetics155 (2), 945959. doi: 10.1093/genetics/155.2.945

  • 38

    RamseyJ. (2011). Polyploidy and ecological adaptation in wild yarrow. Proc. Natl. Acad. Sci.108 (17), 70967101. doi: 10.1073/pnas.1016631108

  • 39

    RoseS. (2010). For all the tea in China: how England stole the world’s favorite drink and changed history (New York, USA: Penguin Group (USA) Inc.).

  • 40

    R Studio Team (2022). RStudio: integrated development environment for r (Boston, MA: RStudio, PBC). Available at: http://www.rstudio.com/.

  • 41

    TamuraK.StecherG.KumarS. (2021). MEGA11: molecular evolutionary genetics analysis version 11. Mol. Biol. Evol.38 (7), 30223027. doi: 10.1093/molbev/msab120

  • 42

    TanL. Q.WangL. Y.XuL. Y.WuL. Y.PengM.ZhangC. C.et al. (2016). SSR-based genetic mapping and QTL analysis for timing of spring bud flush, young shoot color, and mature leaf size in tea plant (Camellia sinensis). Tree Genet. Genomes12 (3), 52. doi: 10.1007/s11295-016-1008-9

  • 43

    USDA Plant Hardiness Zone Map (2012). Agricultural research service, U.S (Washington DC, USA: Department of Agriculture). Available at: https://planthardiness.ars.usda.gov/.

  • 44

    WalcottS. M. (1999). Tea production in south Carolina. Southeastern Geographer39 (1), 6174. doi: 10.1353/sgo.1999.0019

  • 45

    WalcottS. M. (2012). Brewing a new American tea industry. Geographical Rev.102 (3), 350363. doi: 10.1111/j.1931-0846.2012.00156.x

  • 46

    WambulwaM. C.MeegahakumburaM. K.KamunyaS.MuchugiA.MöllerM.LiuJ.et al. (2017). Multiple origins and a narrow genepool characterise the African tea germplasm: concordant patterns revealed by nuclear and plastid DNA markers. Sci. Rep.7. doi: 10.1038/s41598-017-04228-0

  • 47

    WambulwaM. C.MeegahakumburaM. K.KamunyaS.WachiraF. N. (2021). From the wild to the cup: tracking footprints of the tea species in time and space. Front. Nutr.8, 706770. doi: 10.3389/fnut.2021.706770

  • 48

    WangX.FengH.ChangY.MaC.WangL.HaoX.et al. (2020). Population sequencing enhances understanding of tea plant evolution. Nat. Commun.11:1, 4447. doi: 10.1038/s41467-020-18228-8

  • 49

    WeiC.YangH.WangS.ZhaoJ.LiuC.GaoL.et al. (2018). Draft genome sequence of camellia sinensis var. sinensis provides insights into the evolution of the tea genome and tea quality. Proc. Natl. Acad. Sci.115:18, E4151E4158. doi: 10.1073/pnas.1719622115

  • 50

    XiaE.TongW.HouY.AnY.ChenL.WuQ.et al. (2020). The reference genome of tea plant and resequencing of 81 diverse accessions provide insights into its genome evolution and adaptation. Mol. Plant13:7, 10131026. doi: 10.1016/j.molp.2020.04.010

  • 51

    YamashitaH.KataiH.KawaguchiL.NaganoA. J.NakamuraY.MoritaA.et al. (2019). Analyses of single nucleotide polymorphisms identified by ddRAD-seq reveal genetic structure of tea germplasm and Japanese landraces for tea breeding. PloS One14:8, e0220981. doi: 10.1371/journal.pone.0220981

  • 52

    ZhangQ.LiT.WangQ.LeCompteJ.HarkessR. L.BiG. (2020). Screening tea cultivars for novel climates: plant growth and leaf quality of Camellia sinensis cultivars grown in Mississippi, United States. Front. Plant Sci.11. doi: 10.3389/fpls.2020.00280

Summary

Keywords

InDel markers, Nei’s genetic distance, Camellia sinensis, tea yield, biohistory

Citation

Clarke C, Richter BS and Rathinasabapathi B (2023) Genetic and morphological characterization of United States tea (Camellia sinensis): insights into crop history, breeding strategies, and regional adaptability. Front. Plant Sci. 14:1149682. doi: 10.3389/fpls.2023.1149682

Received

22 January 2023

Accepted

06 April 2023

Published

12 May 2023

Volume

14 - 2023

Edited by

Qiangqiang Xiong, Yangzhou University, China

Reviewed by

Romit Seth, North Carolina State University, United States; Mehdi Rahimi, Graduate University of Advanced Technology, Iran

Updates

Copyright

*Correspondence: Bala Rathinasabapathi,

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics