Fine mapping of the flavonoid 3’,5’-hydroxylase gene controlling anthocyanin biosynthesis in pepper anthers and stems

Pepper (Capsicum annuum L) is one of the most important vegetables grown worldwide. Nevertheless, the key structural and regulatory genes involved in anthocyanin accumulation in pepper have not been well understood or fine mapped yet. In this study, F1, F2, BC1P1, and BC1P2 pepper populations were analyzed and these populations were derived from a cross between line 14-Z4, which has yellow anthers and green stems, and line 14-Z5, which has purple anthers and stems. The results showed that the yellow anthers and green stems were determined by a single recessive locus called to as ayw. While, using preliminary and fine mapping techniques, ayw locus was located between markers aywSNP120 and aywSNP124, with physical distance of 0.2 Mb. The CA11g18550 gene was identified as promising candidate for the ayw locus, as it co-segregated with the yellow anthers and green stems phenotypes. CA11g18550 encodes a homolog of the F3’5’H (flavonoid 3’,5’-hydroxylase) anthocyanin synthesis structure gene. The missense mutation of CA11g18550 possibly resulted in a loss-of-function. The expression analysis showed that CA11g18550 was significantly expressed in the stems, leaves, anthers and petals in 14-Z5, and it’s silencing caused the stems changing from purple to green. This study provides a theoretical basis for using yellow anthers and green stems in pepper breeding and helps to advance the understanding of anthocyanin synthesis.


Introduction
Pepper is an important vegetable because of its pungency and high nutritional value with industrial applications around the world. In 2020, the total harvested area of pepper was 2.07 million hectares, with a production of 36.14 million tonnes (FAO, https://www.fao.org/home/zh/). Many countries rely on male sterile lines to produce hybrid pepper seeds (Dash et al., 2001).
Hybrid breeding uses heterosis to produce varieties that possess both high yield and good quality characteristics (Zhang et al., 2016a). Compared to the creation of hybrid seeds by hand emasculation and pollination, the hybrid system based on male sterility reduces the risk of producing false hybrid seeds resulting from self-pollination (Cheema and Dhaliwal, 2005;Atanassova, 2007). In particular, the release of genic male sterility (GMS) lines is much easier than that of cytoplasmic male sterile (CMS) lines and GMS is less susceptible to ambient temperature compared to CMS (Wei et al., 2019). However, 50% fertile plants need to be pulled out in the field during hybrid seed production. Therefore, it is convenient and economical to differentiate male sterile plants by using Marker-assisted breeding (MAS). Identifying and developing morphological markers associated with traits of interest in breeding is a prerequisite for the successful use of Marker-assisted breeding (MAS) (Zhang et al., 2016a). Such as, the trichome density on the main stem of pepper plants could be used as a morphological marker for assessing PepMoV resistance (Kim et al., 2011), yellow leaves in tomato can be used as a morphological marker for resistance to ToMV (Zhang et al., 2016a). The green hypocotyl is another useful morphological marker in the selection of male sterile 10 (ms10) during seedling evaluation (Zhang et al., 2016a).
Currently, pepper fruit color is the primary focus of study on the anthocyanin production of peppers. The A locus controlling anthocyanin accumulation in pepper was mapped on chromosome 10, encoding MYB transcription factor, which was a homologous gene of petunia anthocyanin2 (an2) (Borovsky et al., 2004). Another gene, anthocyanidin 3-O-glucosyltransferase, was fine-mapped on chromosome 10 and controls anthocyanin synthesis in pepper fruit (Liu et al., 2020). Eight recessive genes (al-1 to al-8) were found to control yellow anthers and green stems, while they have not been cloned yet (Wang and Bosland, 2006). Potato purple pigment generation locus P was necessary for blue/purple anthocyanin production, which was localized to chromosomes 11 and codes for F3'5'H (Jung et al., 2005). Several genes that control controlling anthocyanin synthesis have been found in tomato. Anthocyanin free (af), anthocyanin reduced (are), anthocyanin without (aw), hoffman' s Anthocyaninless (ah), and anthocyanin fruit (aft) were mapped by map-based cloning, encoding CHI, F3H, DFR, bHLH, and R2R2-MYB, respectively (Goldsbrough et al., 1994;Kang et al., 2014;Maloney et al., 2014;Qiu et al., 2016;Colanero et al., 2019).
In this study, a pepper line 14-Z4, which has yellow anthers and green stems, was selected from over 1000 pepper accessions. It was then crossed with the pepper line 14-Z5, which has purple anthers and stems, to obtain P 1 , P 2 , F 1 , F 2 , BC 1 P 1 and BC 1 P 2 generations for studying the inheritance of anthers and stems color in pepper. After fine mapping, the gene that controls yellow color of anthers and green color of stems was mapped between markers aywSNP120 and aywSNP124, with six candidate genes, among them CA11g18550 encoded F3'5'H. Furthermore, missense mutations were found in the exon area, and the marker aywSNP550 obtained from CA11g18550 properly co-segregated with the yellow color of anthers and the green color of stems. Therefore, it can be presumed that CA11g18550 was a strong candidate gene for controlling the yellow anthers and green stems color.

Experimental materials
The experimental materials used in the present study included the Capsicum annuum L. inbred parental pepper lines 14-Z4 and 14-Z5 ( Figure 1) and the F 1 , F 2 , BC 1 P 1 , and BC 1 P 2 populations derived from reciprocal crosses between the parental lines. Sweet pepper inbred line 14-Z4, which has blocky fruits, yellow anthers and green stems, was provided by the Beijing Vegetable Research Center at the Beijing Academy of Agriculture and Forestry Sciences. The inbred line 14-Z5 was introduced to China from Turin, Italy. This inbred bears hornshaped fruits in clusters, and has purple anthers and stems. During spring of 2015, the P 1 , P 2 , F 1 , BC 1 P 1 , BC 1 P 2 , and F 2 generations included 25,26,30,126,130, and 253 individuals, respectively. During the fall of 2015, the P 1 , P 2 , F 1 , BC 1 P 1 , BC 1 P 2 , and F 2 generations included 21, 25, 24, 84, 86, and 244 individuals, respectively. During the spring of 2018, the F 2 generation included 1059 individuals.
A total of 2561 pairs of SSR primers and 185 pairs of InDel primers were developed by our research group using publicly available genomic sequence information for pepper (Zhang et al., 2016b).

Transcriptome library construction and analysis
Anthers were selected on the day of flowering from 14-Z4 and 14-Z5 and immediately frozen in liquid nitrogen. Total RNA was extracted with RNAprep Pure Kit (for plant) (Tiangen, Beijing, China) as described by the manufacturer. The integrity and concentration of RNA were verified using an Agilent 2100 Bioanalyzer (Agilent Technologies, Inc., Santa Clara, CA, USA). Sequencing libraries were constructed following the manufacturer's instructions of NEBNext ® Ultra ™ RNA Library Prep Kit for Illumina ® (NEB, Ipswich, MA, USA). Sequencing was performed on an Illumina HiSeq 2500 platform (Illumina, USA). Low quality reads were removed. Clean reads were filtered from the raw reads and were mapped to the CM334 genome using Tophat2 software and bowtie2 software (Langmead, 2010;Trapnell et al., 2010;Kim et al., 2013;Kim et al., 2014). Gene expression levels were estimated using FPKM values (fragments per kilobase of exon per million fragments mapped) by RSEM software package (Li and Dewey, 2011). The raw sequencing data generated in this study are available in the NCBI (PRJNA987024).
DESeq and EdgeR were employed and used to evaluate differentially expressed genes (Anders and Huber, 2010). Subsequently, the gene abundance differences between those samples were calculated based on the ratio of the FPKM values. The false discovery rate (FDR) control method was used to identify the threshold of the P-value in multiple tests to calculate the significance of the differences. Here, only genes with an absolute value of log2FC ≥ 1 and FDR < 0.05 were identified as differentially expressed gene (DEGs). DEGs were subjected to Gene Ontology (GO, http://www.geneontology.org/) and Kyoto Encyclopedia of Genes and Genomes (KEGG, https://www.genome.jp/ kegg) analysis.

Phenotyping and segregation analysis
Anther colors (yellow or purple) and stem colors (green and purple) were observed in the first flower of each plant at the beginning of flowering. Two different individual researchers recorded anther colors and stem colors in the P 1 , P 2 , F 1 , BC 1 P 1 , BC 1 P 2 , and F 2 populations at the same time to ensure accurate results. The data were statistically analyzed using Excel 2003, and Chi-squared tests were performed using SAS 8.1 (SAS Institute Inc., Cary, NC, USA).

Chromosomal mapping of the gene controlling anthers and stems coloration
Total genomic DNAs of P 1 , P 2 , F 1 and F 2 populations were extracted from young leaves of each plant using the Plant Genomic DNA Kit (Tiangen, Beijing, China). DNAs from 10 individual F 2 plants with yellow anthers and green stems were pooled and DNAs from 10 individual F 2 plants with purple anthers and stems were pooled, separately. Before searching for primers to amplify polymorphic products between DNA pools. The reactions for amplifying SSR and InDel markers (10 mL) contained 3 mL of DNA (2.5 ng/mL), 1 mL of each forward and reverse primer (50 ng/ mL), and 5 mL of Go Taq ® Green Master Mix (Promega, WI, USA). The polymerase chain reaction (PCR) cycling conditions were as follows: pre-denaturation at 94°C for 4 min; 35 cycles of denaturation at 94°C for 15 s, annealing at 55°C for 15 s, and extension at 72°C for 30 s; followed by a final temperature hold at 72°C for 10 min. The amplification products were separated by electrophoresis on 6.0% non-denaturing polyacrylamide gels for 1 h at constant 150 V. The polymorphic primers were used to analyze the genotypes of 253 individuals the F 2 population at the spring of 2015, and a linkage map was generated using JoinMap 4.0.

Development of co-separation KASPar markers of the ayw locus
After mapping the gene to a narrow region, sanger sequencing was used to found the difference sequence between parents in the mapping region, and design KASPar markers based on SNPs. The KASPar platform (LGC Genomics, UK) was used for SNP genotyping individuals from the segregating populations. The conditions for touchdown PCR were as follows: 95°C for 15 min; followed by 10 cycles of denaturation at 94°C for 20 s, and annealing at 61°C (−0.6°C/ cycle) for 60 s; and then 26 cycles of denaturation at 94°C for 20 s and annealing at 55°C for 60 s.

Gene expression analysis
Total RNA was extracted using the RNAprep Pure Kit (For Plant) (Tiangen, Beijing, China). First-strand cDNA was synthesized with a PrimeScript ™ RT Kit (Takara, Japan). Primers used for the qRT-PCR were designed in Primer5, and primers with product length of 100-300 bp were selected. UBI-3 (CA06g03040) was selected as internal control (Wan et al., 2011) (Table S1). The qRT-PCR was performed with a TB Green ® Premix Ex Taq ™ II (Takara, Japan), on an LightCycler 480 II system. The qRT-PCR was performed with three biological replications and three technical replicates, and relative expression values were calculated using the 2 −DDCt method.

Phylogenetic analysis
A phylogenetic tree was constructed with the amino acid sequences of ayw and its homologs in other species. All sequences, except ayw, were all obtained from the NCBI database. Sequence alignment and tree construction were performed via ClustalW software (version 2.1) and a neighbor-joining tree was constructed using MEGA 7 software (version 7.0.26) with 1000 bootstrap replications.

Virus-induced gene silencing
Virus−induced gene silencing was used to investigate gene function (Liu et al., 2002). Primers were designed using Primer 5 software (forward primer: ACCGAATTCTCTAGATGTTGCTTCTAC TCCTAATGCAGCT, and reverse primer: CGTGAGCTCGGT ACCTTTTGTGTAATTTTTTCATCCCTCT). The ayw fragment (500 bp) was PCR amplified using Primer STAR MAX DNA Polymerase 045A (Takara, Japan) from pepper cDNA. The constructs consisting of pTRV1, pTRV2, pTRV2-PDS, pTRV2-ayw were transformed into Agrobacterium tumefaciens GV3101, respectively. The transformed cells were cultured in the Luria-Bertani (LB) medium until the OD600 was 1.0-1.2 and then harvested and re-suspended in the MS buffer (200 mM acetosyringone, 10 mM MES, 10 mM MgCl2, pH = 5.8) to a final OD600 of approximately 1.0. The pTRV1 and the pTRV2, pTRV2-PDS and pTRV2-ayw were mixed in a 1:1 (v/v) ratio, respectively. The mixtures were accomplished using 1 mL syringe without the needle into cotyledons of 3-week-old seedlings. The infiltrated plants were grown in a growth chamber at 22°C, 16 h light/8 h darkness.

Analysis of the transcriptome sequencing date
To determine the cause of purple deficit in 14-Z4, transcriptome sequencing was performed on anthers from both 14-Z4 and 14-Z5 lines. A total of six samples generated 57769436-71338670 clean reads with an average 41.70% to 42.25% GC content for all libraries. The Q20 and Q30 values exceeded 98.00% and 95.15%, showing the high throughput and quality of the RNA-seq data, respectively (Table 1).

GO classification statistics and KEGG enrichment analysis
GO classification statistics and KEGG enrichment analysis were performed to classify the key terms and pathways related to yellow anthers and green stems between 14-Z4 vs 14-Z5. In the 14-Z4 vs 14-Z5 comparisons, 3287 DEGs showed GO annotations. Metabolic process (1723 DEGs), cellular process (1438 DEGs), and singleorganizational process (1145 DEGs) were the main terms in the biological process; cell (1366 DEGs), cell part (1366 DEGs), and membrane (1272 DEGs) were the main terms in the cellular component; category activity (1658 DEGs), binding (1600 DEGs), and transporter activity (267 DEGs) were the main terms in the molecular function ( Figure 3A). To further elaborate the biological interpretation, all DEGs were mapped to the KEGG database ( Figure 3B). According to KEGG pathway enrichment analysis we obtained 12 significantly enriched pathways, among them photosynthesis-antenna proteins, phenylpropanoid biosynthesis, and flavonoid biosynthesis were three predominant KEGG pathways. Phenylpropanoid biosynthesis was an upstream pathway of anthocyanin synthesis, and anthocyanin synthesis was derived from branches of the flavonoid pathway ( Figure 3B). Therefore, the color difference between 14-Z4 and 14-Z5 was due to the anthocyanin synthesis.

Genetic analysis of yellow anthers and green stems trait in pepper
The F 1 plants crossed by "14-Z5" (yellow anthers and green stems) and "14-Z5" (purple anthers and stems), the segregation of purple anthers and stems phenotype. In the F 2 populations observed during the spring of 2015, a total of 184 plants with purple anthers and stems and 69 plants with yellow anthers and green stems were observed, and the ratio of plants with purple anthers and stems to those with yellow anthers and green stems was consistent with the 3:1 ratio expected for segregation of a single gene, according to a Chi-squared test (Table 2). Similarly, during the fall of 2015, a total of 186 plants with purple anthers and stems and 57 plants with yellow anthers and green stems were observed, and the ratio of plants with purple anthers and stems to those with yellow anthers and green stems in the F 2 was also consistent with the expected ratio of 3:1 according to a Chi-squared test (Table 2). In the BC 1 P 1 population, for both spring and fall 2015, the  proportion of plants with yellow anthers and green stems to those with purple anthers and stems was 1:1. In the BC 1 P 2 population in the spring and fall of 2015, the anthers and stems of all plants were purple (Table 2). These findings suggest that yellow anthers and green stems co-segregated, and are controlled by a recessive nuclear gene called ayw.

Preliminary chromosomal mapping of the pepper ayw locus
To distinguish SSR and InDel alleles, 2561 pairs of primers were used to screen parental lines 14-Z4 and 14-Z5 for polymorphisms, resulting in a selection of 357 pairs of polymorphic primers with a polymorphism rate of 13.00%. The pooled DNAs of plants with yellow anthers and green stems or purple anthers and stems were each analyzed using these 357 pairs of polymorphic primers, from which nine pairs of polymorphic primers (including eight pairs of SSR primers and one pair of InDel primers), with a polymorphism rate of 2.52%, were identified (Table S2). These nine pairs of polymorphic primers were then used for marker analysis and to construct a linkage map in F 2 population comprised of 253 individual plants. A linkage map of 6.6 cM was constructed and the ayw locus was mapped on pepper chromosome 11 between marker genSSR5929 and marker genSSR5955, which were 0.4 cM and 1.4 cM from ayw locus, respectively (Figure 4).

Validation of the marker cosegregating with the pepper ayw locus
In the spring of 2018, the F 2 population of 1059 plants was employed for fine mapping. Following sanger sequencing of PCR results, sequence variations were detected between markers SSR5929 and SSR5955 in the two parental lines, resulting in the identification of three SNPs. Corresponding to these SNPs, 3 KASPar markers were designed and they were polymorphic between the parental lines and F 2 populations (Table S3). Further linkage analysis was conducted on F 2 populations it was determined that the ayw locus was located between marker aywSNP120 and marker aywSNP124, which were physically separated by a distance of 0.2 Mb (Figure 4).
According to the gene annotation of the CM334 reference genome, a total of 6 putative genes (CA11g18510, CA11g18520, CA11g18530, CA11g18540, CA11g18550, and CA11g18560) were located in the 0.2 Mb region ( Figure 4). According to the annotation, a gene named CA11g18550 was annotated as F3'5'H, which was a structural gene in the anthocyanin synthesis pathway (Table 3). Sequencing results showed that the sequence of CA11g18550 was different between 14-Z4 and 14-Z5 ( Figure 5). Based on the SNP differences of sequencing at positions + 835 bp, KASPar marker aywSNP550 was designed and the marker was identified to co-segregated with phenotype (Table S3).

Evolution and expression analysis
CA11g18550 encodes for F3'5'H, a structural gene in the anthocyanin synthesis pathway. A BLAST search (NCBI) performed with the coding sequence was performed using the coding sequence and a phylogenetic tree was generated from the protein sequences from the NCBI database ( Figure 6A). The results showed that CA11g18550 had a high level of similarity with Lycianthes rantonnei and Petunia x hybrida, with the percent identity of 92.58% and 90.31%, respectively. Genetic and physical maps of ayw locus and analysis of candidate genes. To further investigate the expression pattern of CA11g18550, stems, leaves, petals and anthers of the two parents were analyzed by qRT-PCR. The results showed that CA11g18550 was expressed at a significantly higher level (>37 times) in anthers of 14-Z5 than in 14-Z4 ( Figure 6B). Additionally, the expression of other genes involved in anthocyanin synthesis (F3'H, F3'5'H, DFR, ANS, and UFGT) in 14-Z5 was significantly more than twice as high as in 14-Z4 in stems, leaves, anthers, and petals. However, the expression of C4H, 4CL, CHS, and CHI genes did not show any relation to purple color ( Figure 6B).

VIGS experimental verification
Virus induced gene silencing was used to observe whether the ayw locus was related to anthocyanin accumulation. The positive control, pTRV1 + pTRV2-CaPDS was employed to confirm the success of gene silencing (Figures 7A, C). The expression level of ayw in pTRV1 + pTRV2 plants was found to be 5.97 times higher compared to pTRV1 + pTRV2-ayw plants, indicating that VIGS could effectively silence ayw ( Figure 7D). The stem of pTRV1 + pTRV2 plant has obvious purple, while the stem of pTRV1 + pTRV2-ayw plant appeared green ( Figure 7B), indicating that the down-regulation of ayw could reduce the accumulation of anthocyanins in pepper.

Discussion
Anthocyanin synthesis and regulation have been extensively documented, and some genes affecting fruit color in pepper have been uncovered (Borovsky et al., 2004;Liu et al., 2020). However, the regulation of anthers color and stems color in pepper is still unclear. In this work, a molecular genetic investigation of pepper anthers and stems was carried. The research was conducted in reciprocal cross F 1 , BC 1 P 1 , BC 1 P 2 , and F 2 populations developed from parental pepper lines 14-Z4, which has yellow anthers and green stems, and 14-Z5, which has purple anthers and stems. According to the data from three seasons, the yellow anthers and green stems were identified to be cosegregated. Through map-based cloning, a recessive gene ayw, which encodes F3'5'H, was identified as the major candidate gene influencing the yellow color of the anthers and the green color of the stems after preliminary and fine mapping. Our research provides a useful reference for clarifying the synthesis of anthocyanins in pepper, and also provides important theoretical support for the application of green stem as a seedling marker in pepper GMS breeding.

Delphinidin makes pepper purple
Plant pigments such as betalains, carotenoids, and anthocyanins are responsible for the diverse color found in plants (Grotewold, 2006;Tanaka et al., 2008;Zhao et al., 2022). Red violet betacyanins and the The cDNA sequence alignment of CA11g18550 between 14-Z4 and 14-Z5. Wang et al. 10.3389/fpls.2023.1232755 Frontiers in Plant Science frontiersin.org yellow betaxanthins are immonium conjugates of betalamic acid with cyclo-dopa and amino acids or amines, respectively (Strack et al., 2003). Carotenoids mainly provide yellow to orange colors for plants, and anthocyanins provide the majority of the orange, red, purple, and blue colors for plants (Grotewold, 2006). In this study, we analyzed the transcriptomes of anthers from the pepper lines 14-Z4 and 14-Z5. The results showed that flavonoid biosynthesis and phenylpropanoid biosynthesis were significantly enriched. These two KEGG pathways are related to anthocyanin synthesis and anthocyanins were derived from a branch of the flavonoid pathway. The results showed that the difference in color between 14-Z4 and 14-Z5 is caused by the difference in anthocyanin content. Finally, we were able to map the ayw locus between aywSNP120 and aywSNP124 on chromosome 11, which included six candidate genes. Among these genes, one gene named CA11g18550 attracted our great attention based on annotation. Additionally, a marker aywSNP550, which was derived from CA11g18550, perfectly cosegregated with the yellow color of the anthers and green color of the stems. CA11g18550 encodes F3'5'H, which is member of the cytochromes P450 family. It converts dihydrokaempferol and dihydroquercetin into dihydromyricetin, which is is the precursor of delphinidin (Peng et al., 2019). In carnations, chrysanthemums and other flowers, blue/purple flower colors could not be produced due to the lack of delphinidin. The purple tubers of potatoes and purple flowers and colors of petunias could not be separated from delphinidin. Delphinidin is the main anthocyanin in the flowers, stems and fruits of purple pepper. Studies have shown that delphinidin derivatives are the only anthocyanins present in purple/black pepper fruits. The most common anthocyanin structure in pepper fruit was delphine-3-p-coumarinyl-rutinoside-5-glucoside (Lightbourn et al., 2008;Stommel et al., 2009). Therefore, the synthetic pathway of delphinidin is the key pathway of purple color in pepper. The hydroxylation of F3'5'H at the 5' -position is a particularly important step, which determines the synthesis of delphinidin in plants (Wang et al., 2014). Only plants containing delphinidin could display blue/ purple color (Jung et al., 2005;Sato et al., 2011;Brugliera et al., 2013). Many studies have shown that introducing the F3'5'H could improve flower color and cultivate blue carnations, chrysanthemums and other flowers (Mori et al., 2004;Brugliera et al., 2013;Chandler et al., 2013). When the F3'5'H of pansy was transferred, the pink chrysanthemum lacking F3'5'H was transformed into the transgenic blue chrysanthemum which could produce anthocyanins accumulating delphinidin (Brugliera et al., 2013). When the F3'5'H gene cloned from Vinca major was expressed in transgenic Petunia hybrida, some transgenic plants show dramatic color changes from red to dark purple (Mori et al., 2004). In addition, the blue/purple color of other plant organs was also usually due to the accumulation of delphinidin. It has been proven that P gene of potato and the F3'5'H gene of tomato were located in the same region in potato and tomato genomes, and F3'5'H gene was co-segregated with tuber purple character in F 1 subpopulation in potato (Jung et al., 2005). Further transgenic results showed that overexpression of F3'5'H gene could make potato have purple stems and purple tubers (Jung et al., 2005).
In addition, there were four missense mutations in exon of CA11g18550 between 14-Z4 and 14-Z5, and the KASPar marker aywSNP550 was designed according to the SNP mutation on +835bp of CA11g18550, which co-segregated with the color of anthers and stems of the F 2 population. The expression level of A B
CA11g18550 in purple anthers and stems was significantly higher than in yellow anthers and green stems. At the same time, silencing CA11g18550 through VIGS causes purple stems to turn green ( Figure 7B). Therefore, based on all the evidence above, CA11g18550 was identified as a strong candidate gene for ayw locus in pepper.

Stem color controlled by single gene is a useful phenotypic marker in plant breeding
Previous studies have shown that recessive traits controlled by single genes could be useful for crop breeding (Cheema and Dhaliwal, 2005;Gardner and Panthee, 2010;Saxena et al., 2011;Su et al., 2012;Panthee and Gardner, 2013;Wan et al., 2015;Zhang et al., 2016a). The homozygosity of the inbred parents of F 1 hybrid seeds has an immediate impact on yield and quality. The use of male sterile lines throughout the hybrid seed production process can help avoid false hybrid seeds produced by self-pollination, increase seed quality, and lower production costs. However, CMS needs three lines of sterile line, maintainer line and restorer line, which difficult to breed and are susceptible to ambient temperature. GMS can avoid the defects of CMS, while needs 50% fertile plants need to be pulled out in the field during hybrid seed production. Morphological markers inherited as a single recessive gene, which linked to these traits could be used to distinguish selfed seedlings from true hybrids, to test the genetic composition of parental lines and hybrid seeds, and to identify sterile males during early developmental stages (Cheema and Dhaliwal, 2005;Gardner and Panthee, 2010;Saxena et al., 2011;Su et al., 2012;Panthee and Gardner, 2013;Wan et al., 2015;Zhang et al., 2016a). For example, in rice, the phenotype of the gry79 mutant was controlled by a single recessive nuclear gene, and its mutant phenotype was useful for discriminating false hybrids (Wan et al., 2015). Zhang et al. (2016a) developed a seedling morphological marker from the single recessive gene aa for the MAS of the male sterile 10 gene in tomato. However, identifying pepper hybrids, as well as male sterile and maintainer lines can be complicated, time consuming, and costly. Anther color of pepper could be used as a marker character to distinguish male sterility. A nuclear male sterile line (MS-12) was associated with 3 morphological markers-taller plant height, erect plant growth habit, and dark purple anthers. Male sterile plants with reduced plant height and moderate growth behaviour could be selected during the early growth stage. However, individual variances in plant height and growth habits that were easily influenced by the environment, this could result in the escape of composite male viable plants with tall height during the initial monitoring in the hybrid seed production field. Thus, the anther color was used for the second monitoring for further identification in the later stage (Dash et al., 2001). Selection of male sterile plants at the flowering stage was expensive and timeconsuming and too late. In contrast, selection of male-sterile plants at the seedling stage allows only male sterile plants to be transplanted in hybrid seed production block economizing area and labor (Cheema and Dhaliwal, 2005). Our study showed that the ayw locus controls yellow anthers and green stems. The anthocyaninless mutants (al-1 to al-8) also showed that yellow anthers and green stems were controlled by the same recessive genes, even though there was no clear gene location (Wang and Bosland, 2006). Therefore, green stems, a readily identifiable recessive marker characteristic, could be transmitted to pepper male sterile lines, allowing male sterile lines to be distinguished by stem color at the seedling stage and avoiding the removal of 50% fertile lines after field transplanting. As a result, it is possible to distinguish male sterile lines by stem color at the seedling stage and avoid the removal of 50% fertile lines after transplanting in the field.
In summary, in this study identified CA11g18550 as a strong candidate gene of ayw through various methods including mapping, RNA-seq, and gene expression analysis. This provides a useful reference for the future research on anthocyanins, and lays a foundation for using green stem as morphological marker in pepper assisted breeding.

Data availability statement
The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: NCBI BioProject, accession PRJNA987024. The names of the repository and accession number can be found in the article.

Author contributions
XZ, HD and SG contributed to the experimental design. YW and GW performed experiments, analyzed results. QW and BC participated in experimental design and statistical analysis. YW and ZW participated in manuscript writing. All authors read and approved the final manuscript.