Abstract
With a great diversity of species, Orchidaceae stands out as an essential component of plant biodiversity, making it a primary resource for studying angiosperms evolution and genomics. This study focuses on 13 published orchid genomes to identify and analyze the CYP75 gene family belonging to the cytochrome P450 superfamily, which is closely related to flavonoid biosynthetic enzymes and pigment regulation. We found 72 CYP75s in the 13 orchid genomes and further classified them into two classes: CYP75A and CYP75B subfamily, the former synthesizes blue anthocyanins, while the latter is involved in the production of red anthocyanins. Furthermore, the amount of CYP75Bs (53/72) greatly exceeds the amount of CYP75As (19/72) in orchids. Our findings suggest that CYP75B genes have a more important evolutionary role, as red plants are more common in nature than blue plants. We also discovered unique conserved motifs in each subfamily that serve as specific recognition features (motif 19 belong to CYP75A; motif 17 belong to CYP75B). Two diverse-colored varieties of C. goeringii were selected for qRT-PCR experiments. The expression of CgCYP75B1 was significantly higher in the purple-red variant compared to the yellow-green variant, while CgCYP75A1 showed no significant difference. Based on transcriptomic expression analysis, CYP75Bs are more highly expressed than CYP75As in floral organs, especially in colorful petals and lips. These results provide valuable information for future studies on CYP75s in orchids and other angiosperms.
Introduction
CYP450 (cytochrome P450, also called CYP) is named because it can bind to ferrous carbon monoxide and has a peak light absorption value of 450 nm (). In 1962, “CYP450” was first named as a colored substance in the cell, and it exists as an enzyme (). It is one of the largest protein superfamilies in nature that almost exists in all prokaryotes and eukaryotes, especially with a large number in plants (; ). They are a series of self-oxidating heme enzymes involved in various primary and secondary metabolic activities in plants and participate in biosynthetic pathways in flavonoids, aliphatic acids, sterols, phytohormones, terpenes, lignins and other biomolecules (; ). In general, CYPs can be divided into a gene family with amino acid sequence identity greater than 40% and a subfamily with amino acid sequence identity greater than 55% (). CYP71−CYP99 and CYP701−CYP999 and found in plants (). CYP75 is a crucial member that regulates flavonoid biosynthesis that determines pigmentation of plant tissues (). CYP75 gene family includes two subfamilies, CYP75A and CYP75B, regulating two key enzymes in the anthocyanin synthesis pathways, including Flavonoid 3’,5’-Hydroxylase (F3’5’H) and Flavonoid 3’-Hydroxylase (F3’H) which are precursors of blue and red anthocyanins, respectively ().
Recent studies have discussed the function of CYP75s in several plants. They were first identified in the Petunia hybrida, wherein the introduction of clonal F3’H cDNA caused an unusual pink color of petunia and their pollen produced paeoniflorin (; ). In Arabidopsis thaliana, a TT7 gene was identified, which could encode F3’H and its function was validated in flavonoid and anthocyanins synthesis (). The sF3’H1 of Glycine max is mainly responsible for the alteration of pubescent color from brown to gray (). In Vitis vinifera, CYP75 genes encoding F3’H and F3’5’H are highly expressed in all tissues of the plant, especially in the epidermis of mature red berries that mainly synthesize anthocyanins (). In Solanum lycopersicum, CYP75A31 encodes a F3’5H, which accepts flavones, flavanones, dihydroflavonols and flavonols as substrates (). In Epimedium sagittatum, EsF3’H and EsF3’5H genes are highly expressed in colored tissues and their expressions are positively correlated with the pattern of anthocyanin accumulation in leaves (). The CsF3’5’H in Camellia sinensis acts as a key agent controlling trihydroxyflavone-3-alcohol synthesis and effectively converts 4’-hydroxylated flavonoids into 3’4’5’- and/or 3’4’-hydroxylated products (). The F3’H-1 and F3’H-2 in Hordeum vulgare regulates the accumulation of magenta pigments in peels and stems, respectively, and the F3’5’H-1 is closely related to the accumulation of blue pigments in the barley grain paste layers (). With the continuous advancement of molecular experimental research, it is now possible to change plants color by controlling regulatory genes, such as CYP75. For example, the up-regulated of CYP75A and CYP75B genes which encode a critical dihydroflavonol 4-reductase (DFR) in Dendrobium officinale, may lead to anthocyanins accumulation, promoting color change from green to red (). However, the studies confirmed that CYP75B can code F3’H instead of CYP75A, and some species in Asteraceae have F3’5’H with the same amino acid sequences as F3’H which suggests that the F3’5’H in these plants is regulated by CYP75B instead of CYP75A (). Similar to Callistephus chinensis (F3’5’H: CYP75B5), Osteospermum hybrida (F3’5’H: CYP75B17)and Pericallis cruenta (F3’5’H: CYP75B18), they all lost CYP75A genes to code F3’5’H then reacquired by duplication and neofunctionalization of CYP75B genes (). This is further proved by partial amino acid exchanges between F3’5’H and F3’H ().
Orchidaceae is one of the largest monocotyledon families, boasting significant ornamental and economic importance due to its diverse range of colors and distinctive flower shapes (). There are more than 28,000 species and 850 genera in Orchidaceae, represents approximately 10% of all flowering plants worldwide and has the largest number of species (). Orchids are remarkable for shedding light on plant evolution, with more complete orchid genomes now available, researchers have gained significant insight into the genetic foundations of orchid biology (). Extensive research has been conducted on CYP75s in model plants, but there is currently limited knowledge about the characteristics of these genes in the Orchidaceae. In this study, we performed genome-wide identification, classification, characterization, and expression pattern analysis of CYP75s in 13 orchids, representing four subfamilies of Orchidaceae with considerable diversity to elucidate the evolution of CYP75s in orchids. The findings could provide new insights into the underlying mechanisms that drive the evolution and diversification of organ morphology in orchids and other flowering plants.
Materials and methods
Data sources
To investigate the features of the CYP75 gene family in Orchidaceae, 13 orchids with completed whole-genome sequencing were selected. They include C. goeringii (; ), C. sinense (), C. ensifolium (), Gastrodia elata (; ), D. catenatum (; ), D. chrysotoxum (), D. huoshanense (), Phalaenopsis aphrodite (), P. equestris () of Epidendroideae, Platanthera guangdongensis (), Pl. zijinensis () of Orchidoideae, Vanilla planifolia () of Vanilloideae, Apostasia shenzhenica () of Apostasioideae. The genome download urls for 13 orchids can be found in Table S1. And four CYP75A (AUB13331.1 from Horgeum vulgare; ABI95365.1 and AAZ79451.1 from D. hybrid; AEB96145.1 from D. moniliforme) and four CYP75B (BAJ93256.1 from H. vulgare; AT5G07990.1 from A. thaliana; XP015613041.1 from Oryza sativa; AF155332.1 from Petunia hybrida) proteins were downloaded from the NCBI (https://www.ncbi.nlm.nih.gov/). All the protein sequences can be found in the Table S2.
Identification and physicochemical properties of CYP75s
It is complicated to identify CYPs due to their extremely high sequence conservation (; ; ; ). Therefore, eight CYP75 proteins were used as queries to perform a blast search (built-in TBtools; ) against 13 orchid genomes with an E-value threshold of e-20 and a requirement for amino acid sequence similarity greater than 50% (). Pfam verification was performed on all protein sequences in the blast results. The CYP domains (PF00067) built on the hidden Markov model were downloaded from Sanger center (http://pfam.sanger.ac.uk/). The identification of all alternative orchid CYPs was conducted using the Hmmsearch program (built-in Tbtools; ). The subsequent screening study included all protein sequences containing CYP domains. To facilitate identification, 14 A. thaliana CYP proteins (AtCYP71A12, AtCYP71A13, AtCYP71B15, AtCYP73A5, AtCYP76C1, AtCYP77B1, AtCYP77A6, AtCYP78A10, AtCYP82G1, AtCYP84A1, AtCYP89A2, AtCYP98A3, AtCYP701A3, AtCYP706A1) belonging to subfamilies other than CYP75 were also downloaded from the TAIR (https://www.arabidopsis.org/). Then, ML phylogenetic trees were constructed for further screening of CYP75 genes based on all CYP genes of each orchid and other species. The phylogenetic analysis was conducted using the maximum likelihood (ML) approach, and the ML tree was constructed using the RAxML on the CIPRES Science Gateway web server (RAxML-HPC2 on XSEDE; ) with 1,000 bootstrap iterations. The phylogenetic tree of CYPs for each orchid is detailed in the Figure S1. The target genes were clustered with eight CYP75 query genes. Ultimately, we aligned all candidate CYP75 protein sequences of 13 orchids and further eliminated the incomplete gene annotation sequences (Figure 1). The completed protein sequences of orchid CYP75s can be found in Table S2. The physicochemical properties of CYP75 proteins were predicted by ExPASy database (https://www.expasy.org/) (). Subcellular localization was predicted by Plant-mPloc (http://www.csbio.sjtu.edu.cn/bioinf/plant-multi/#) ().
Figure 1
Phylogenetic analyses
We performed multiple sequence alignment of CYP75 proteins from 13 orchids and other species using MEGA 7.0 software (
Motif and gene structure analysis
To illustrate the variations of motifs among orchid CYP75 proteins, the MEME motif search tool was employed to detect the conserved motifs in all CYP75 proteins (
Collinearity and location analysis on chromosome
Utilizing chromosome-level genome assemblies of C. goeringii, C. ensifolium, and D. chrysotoxum, genomic FASTA files were merged pairwise to generate a database for BLASTp queries. The merged BLAST files and modified GFF3 files for each species were analyzed using MCscanX (
We selected six orchids with chromosome-level genomes to analyze the distribution of CYP75 genes on their chromosomes. According to the genome data and annotation data of C. goeringii, C. ensifolium, D. huoshanense, D. chrysotoxum, Pl. guangdongensis and Pl. zijinensis, the chromosomal localization of five orchid CYP75 genes were visualized through TBtools (
Prediction of Cis-acting elements
A total of 2,000bp upstream and downstream of CgCYP75s was extracted via TBtools, respectively (
Transcriptome data and expression analysis
For transcriptome analysis, RSEM (
Gene ontology analysis
EggNOG-mapper v2 (http://eggnog-mapper.embl.de/) was used to perform a search against the eggNOG5.0 database for gene ontology (GO) functional annotation (
Results
Identification and protein features of orchid CYP75s
A total of 72 CYP75s were identified from 13 orchids, with the number of CYP75s in each orchid ranging from two to ten (Table 1). Our study reveals that CYP75B subfamily has a significantly higher number (53/72) compared to CYP75A subfamily (19/72). Each of the 13 orchids contains one−four CYP75A genes and between two−six CYP75B genes. Among them, V. planifolia has the largest number of CYP75 genes, with a total of ten (four CYP75A and six CYP75B), while Pl. guangdongensis has the smallest number with only two CYP75B genes and lacks CYP75A gene.
Table 1
| Species | Gene ID | Name | AAa(aa) | pIb | Mwc(kDa) | IId | AIe | GRAVYf | Localizationg |
|---|---|---|---|---|---|---|---|---|---|
| P. aphrodite | PAXXG087010 | PapCYP75A1 | 506 | 9.14 | 56.74 | 49.87 | 99.05 | -0.069 | Endoplasmic reticulum |
| PAXXG006510 | PapCYP75B1 | 551 | 6.69 | 61.37 | 39.40 | 97.64 | -0.056 | Endoplasmic reticulum | |
| PAXXG079820 | PapCYP75B2 | 513 | 6.54 | 57.33 | 44.20 | 101.15 | -0.123 | Endoplasmic reticulum | |
| PAXXG116530 | PapCYP75B3 | 506 | 7.73 | 56.38 | 37.35 | 103.70 | 0.021 | Endoplasmic reticulum | |
| PAXXG116560 | PapCYP75B4 | 503 | 6.68 | 55.90 | 38.79 | 102.45 | -0.004 | Endoplasmic reticulum | |
| PAXXG116590 | PapCYP75B5 | 269 | 6.14 | 30.49 | 45.01 | 93.57 | -0.138 | Endoplasmic reticulum | |
| PAXXG172030 | PapCYP75B6 | 522 | 6.62 | 57.28 | 32.68 | 101.65 | 0.042 | Endoplasmic reticulum | |
| P. equestris | Peq013982 | PeqCYP75A1 | 491 | 7.65 | 54.77 | 43.57 | 94.75 | -0.099 | Endoplasmic reticulum |
| Peq002928 | PeqCYP75B1 | 489 | 6.69 | 53.89 | 34.32 | 99.12 | -0.038 | Endoplasmic reticulum | |
| Peq006329 | PeqCYP75B2 | 531 | 6.62 | 59.14 | 39.48 | 98.93 | -0.053 | Endoplasmic reticulum | |
| Peq008143 | PeqCYP75B3 | 271 | 5.59 | 30.79 | 36.80 | 98.97 | -0.084 | Endoplasmic reticulum | |
| Peq008146 | PeqCYP75B4 | 511 | 8.39 | 57.02 | 39.79 | 100.80 | 0.015 | Endoplasmic reticulum | |
| Peq013868 | PeqCYP75B5 | 516 | 6.54 | 57.48 | 45.36 | 102.07 | -0.111 | Endoplasmic reticulum | |
| C. goeringii | GL07540 | CgCYP75A1 | 503 | 8.75 | 55.82 | 48.29 | 98.53 | -0.030 | Endoplasmic reticulum |
| GL10771 | CgCYP75A2 | 235 | 4.83 | 26.10 | 41.03 | 93.83 | -0.076 | Endoplasmic reticulum | |
| GL07339 | CgCYP75B1 | 533 | 9.24 | 58.66 | 36.56 | 98.26 | -0.042 | Endoplasmic reticulum | |
| GL13941 | CgCYP75B2 | 292 | 5.70 | 32.28 | 35.72 | 102.91 | -0.092 | Endoplasmic reticulum | |
| GL26796 | CgCYP75B3 | 507 | 8.19 | 55.97 | 39.78 | 101.60 | 0.017 | Endoplasmic reticulum | |
| GL26797 | CgCYP75B4 | 272 | 5.99 | 30.47 | 39.98 | 96.18 | -0.153 | Endoplasmic reticulum | |
| GL27961 | CgCYP75B5 | 517 | 6.91 | 57.31 | 46.28 | 103.23 | -0.048 | Endoplasmic reticulum | |
| GL27962 | CgCYP75B6 | 290 | 5.63 | 32.20 | 41.23 | 96.59 | -0.192 | Endoplasmic reticulum | |
| C. ensifolium | JL017476 | CeCYP75A1 | 503 | 8.75 | 55.83 | 49.33 | 98.73 | -0.026 | Endoplasmic reticulum |
| JL011638 | CeCYP75B1 | 519 | 7.13 | 57.69 | 38.75 | 103.03 | 0.022 | Endoplasmic reticulum | |
| JL016547 | CeCYP75B2 | 517 | 6.94 | 57.38 | 45.00 | 104.16 | -0.036 | Endoplasmic reticulum | |
| JL021317 | CeCYP75B3 | 523 | 8.41 | 57.82 | 39.78 | 100.36 | -0.021 | Endoplasmic reticulum | |
| C. sinense | Mol022332 | CsCYP75A1 | 503 | 8.64 | 55.95 | 48.61 | 98.33 | -0.040 | Endoplasmic reticulum |
| Mol004997 | CsCYP75B1 | 530 | 9.24 | 58.30 | 35.56 | 96.23 | -0.070 | Endoplasmic reticulum | |
| Mol005994 | CsCYP75B2 | 531 | 8.47 | 58.95 | 38.51 | 102.17 | -0.014 | Endoplasmic reticulum | |
| Mol021869 | CsCYP75B3 | 515 | 6.75 | 57.14 | 43.98 | 104.19 | -0.036 | Endoplasmic reticulum | |
| V. planifolia | Vpla_KAG0447236.1 | VplCYP75A1 | 507 | 8.45 | 56.24 | 43.53 | 100.89 | 0.053 | Endoplasmic reticulum |
| Vpla_KAG0447237.1 | VplCYP75A2 | 270 | 5.13 | 29.74 | 32.81 | 91.85 | -0.002 | Endoplasmic reticulum | |
| Vpla_KAG0495957.1 | VplCYP75A3 | 435 | 6.11 | 48.33 | 43.07 | 88.85 | -0.116 | Endoplasmic reticulum | |
| Vpla_KAG0496007.1 | VplCYP75A4 | 503 | 8.10 | 55.92 | 47.05 | 93.70 | -0.031 | Endoplasmic reticulum | |
| Vpla_KAG0451884.1 | VplCYP75B1 | 523 | 6.48 | 58.05 | 35.05 | 101.99 | -0.030 | Endoplasmic reticulum | |
| Vpla_KAG0455627.1 | VplCYP75B2 | 506 | 7.00 | 56.21 | 36.62 | 98.08 | -0.043 | Endoplasmic reticulum | |
| Vpla_KAG0455628.1 | VplCYP75B3 | 501 | 8.51 | 55.63 | 38.04 | 102.38 | 0.025 | Endoplasmic reticulum | |
| Vpla_KAG0466647.1 | VplCYP75B4 | 533 | 8.70 | 59.75 | 41.10 | 91.82 | -0.238 | Endoplasmic reticulum | |
| Vpla_KAG0466648.1 | VplCYP75B5 | 481 | 8.58 | 53.86 | 38.34 | 93.06 | -0.289 | Endoplasmic reticulum | |
| Vpla_KAG0497903.1 | VplCYP75B6 | 542 | 6.63 | 60.13 | 38.66 | 99.48 | -0.052 | Endoplasmic reticulum | |
| D. chrysotoxum | Maker09736 | DchCYP75A1 | 504 | 8.59 | 55.84 | 47.46 | 99.09 | -0.037 | Endoplasmic reticulum |
| Maker93966 | DchCYP75A2 | 504 | 8.39 | 55.86 | 47.46 | 99.27 | -0.023 | Endoplasmic reticulum | |
| Maker59722 | DchCYP75B1 | 531 | 8.12 | 59.23 | 41.61 | 105.63 | -0.019 | Endoplasmic reticulum | |
| Maker83891 | DchCYP75B2 | 519 | 6.75 | 57.49 | 43.90 | 100.39 | -0.054 | Endoplasmic reticulum | |
| Maker109076 | DchCYP75B3 | 668 | 8.79 | 76.21 | 31.78 | 100.70 | -0.086 | Endoplasmic reticulum | |
| Maker118630 | DchCYP75B4 | 552 | 7.28 | 61.45 | 37.57 | 94.71 | -0.121 | Endoplasmic reticulum | |
| Maker118636 | DchCYP75B5 | 518 | 7.28 | 57.17 | 39.23 | 98.82 | -0.027 | Endoplasmic reticulum | |
| D. catenatum | Dca000941 | DcaCYP75A1 | 504 | 8.08 | 55.84 | 43.51 | 100.26 | 0.008 | Endoplasmic reticulum |
| Dca000430 | DcaCYP75B1 | 523 | 7.80 | 57.90 | 40.36 | 101.68 | -0.029 | Endoplasmic reticulum | |
| Dca008242 | DcaCYP75B2 | 512 | 6.86 | 56.69 | 31.44 | 98.11 | -0.036 | Endoplasmic reticulum | |
| Dca013687 | DcaCYP75B3 | 525 | 7.75 | 59.56 | 39.05 | 103.07 | -0.067 | Endoplasmic reticulum | |
| Dca013688 | DcaCYP75B4 | 512 | 8.04 | 57.41 | 39.50 | 103.03 | 0.006 | Endoplasmic reticulum | |
| Dca020470 | DcaCYP75B5 | 539 | 8.12 | 60.32 | 38.82 | 105.14 | -0.027 | Endoplasmic reticulum | |
| D. huoshanense | Dhu000016471 | DhuCYP75A1 | 504 | 7.63 | 55.75 | 43.84 | 99.48 | -0.002 | Endoplasmic reticulum |
| Dhu000016482 | DhuCYP75A2 | 504 | 8.66 | 55.83 | 45.45 | 99.70 | -0.012 | Endoplasmic reticulum | |
| Dhu000012876 | DhuCYP75B1 | 521 | 7.29 | 57.71 | 40.21 | 101.69 | -0.030 | Endoplasmic reticulum | |
| Dhu000016330 | DhuCYP75B2 | 517 | 6.31 | 57.35 | 33.97 | 96.79 | -0.052 | Endoplasmic reticulum | |
| Dhu000019542 | DhuCYP75B3 | 539 | 8.39 | 60.33 | 38.66 | 104.42 | -0.040 | Endoplasmic reticulum | |
| Dhu000020018 | DhuCYP75B4 | 531 | 8.12 | 59.23 | 41.61 | 105.63 | -0.019 | Endoplasmic reticulum | |
| Pl. guangdongensis | PGU007087 | PguCYP75B1 | 520 | 8.62 | 57.76 | 40.37 | 98.65 | -0.086 | Endoplasmic reticulum |
| PGU010950 | PguCYP75B2 | 522 | 8.96 | 58.06 | 52.92 | 101.69 | -0.071 | Endoplasmic reticulum | |
| Pl. zijinensis | PZI001224 | PziCYP75A1 | 504 | 9.41 | 55.86 | 47.87 | 101.61 | 0.031 | Endoplasmic reticulum |
| PZI012105 | PziCYP75B1 | 520 | 8.32 | 57.64 | 40.20 | 99.04 | -0.077 | Endoplasmic reticulum | |
| PZI015225 | PziCYP75B2 | 514 | 8.80 | 56.46 | 43.19 | 94.36 | -0.047 | Endoplasmic reticulum | |
| A. shenzhenica | Ash019093 | AshCYP75A1 | 508 | 6.63 | 56.67 | 47.90 | 98.17 | -0.026 | Endoplasmic reticulum |
| Ash001251 | AshCYP75B1 | 521 | 7.71 | 57.18 | 34.43 | 100.56 | -0.016 | Endoplasmic reticulum | |
| Ash004102 | AshCYP75B2 | 520 | 7.26 | 57.78 | 42.60 | 95.44 | -0.061 | Endoplasmic reticulum | |
| Ash007554 | AshCYP75B3 | 520 | 7.31 | 56.94 | 46.91 | 98.71 | -0.036 | Endoplasmic reticulum | |
| Ash015225 | AshCYP75B4 | 508 | 6.22 | 56.16 | 45.84 | 95.61 | -0.011 | Endoplasmic reticulum | |
| G. elata | Gel004675 | GelCYP75A1 | 514 | 7.14 | 57.25 | 43.06 | 101.01 | -0.064 | Endoplasmic reticulum |
| Gel010620 | GelCYP75B1 | 512 | 8.79 | 56.24 | 49.58 | 96.45 | -0.081 | Endoplasmic reticulum | |
| Gel016567 | GelCYP75B2 | 517 | 9.26 | 57.50 | 47.94 | 100.97 | -0.064 | Endoplasmic reticulum | |
| Gel005429 | GelCYP75B3 | 504 | 8.37 | 56.27 | 42.41 | 102.64 | 0.013 | Endoplasmic reticulum |
A list of CYP75 genes in 13 Orchids, their characteristics, and subcellular localization of proteins.
AAa, Amino acid number; pIb, Theoretical isoelectric point; Mwc (kDa), Molecular weight; IId, Instability index; AIe, Aliphatic index; GRAVYf, Grand average of hydrophobicity; Localizationg, Subcellular localization predicted by Plant-mPloc (
The 72 CYP75 protein sequences range from 235−668 amino acids, with a mean of 493. The molecular weight ranges from 26.10−76.21 kDa, with a mean of 54.80 kDa. Around 88.89% (64/72) of the CYP75 proteins have high isoelectric points (pI>7), with an average of 7.57. The average instability index (II) is 41.39, and 39 CYP75 proteins are below this index, indicating good protein stability. The average aliphatic index (AI) for the 72 CYP75 proteins is 99.40, indicating high thermal stability. Moreover, the calculated mean hydrophilic index (GRAVY) of CYP75 proteins in all orchids is negative, indicating a high degree of hydrophilicity. All CYP75 proteins are localized within the endoplasmic reticulum (ER), as evidenced by subcellular localization results. This localization pattern is consistent with the majority of CYP proteins, which are known to primarily function in the ER (
Gene ontology analysis was performed to delineate gene functional classifications of orchid CYP75s and investigate the important biological processes they might be involved in. As a result, GO terms “response to stimulus,” “response to auxin”, “response to organic substance”, “response to endogenous stimulus”, “response to chemical”, “response to hormone”, “secondary metabolic process”, “secondary metabolite biosynthetic process”, “obsolete oxidation-reduction process”, “membrane” and “oxidoreductase activity” constituted the greatest number of genes for GO ontologies “Biological Process”, “Cellular Component,” and “Molecular Function”, respectively (Figure S2; Table S7). The data suggests that the gene ontology of orchid CYP75s is significantly enriched in the “Biological Process”, which is strongly linked to the production of plant metabolites. Additionally, it is heavily concentrated within response elements that relate to the plant’s reaction to external environmental factors.
Phylogenetic analysis of orchid CYP75s
A phylogenetic tree was constructed to analyze the evolutionary patterns of orchid CYP75s (Figure 2). 72 CYP75 proteins were used, and eight query protein sequences were used. The maximum likelihood (ML) method was used for the phylogenetic tree, which was constructed with RAxML on the CIPRES Science Gateway web server (RAxML-HPC2 on XSEDE;
Figure 2

Phylogenetic tree of CYP75 genes based on the CYP75 protein sequences of 13 orchids. The CYP75 gene family was classified into two classes: CYP75A and CYP75B. CYP75 protein sequences of all species and query sequences are available in Table S2.
Motif variation and gene structure of orchid CYP75s
Motifs of CYP75 proteins in 13 orchids were examined using the online analysis tool MEME, and 20 motifs were set as upper bound (Figure 3A). There are four conserved motifs commonly found in cytochrome P450 enzymes: PERF motif, K-helix region, I-helix region and the consensus sequences of the heme-binding region (Figures 1, 3B), also referred to as the “P450 signature” (
Figure 3

Conserved motifs of CYP75 proteins. (A) Predicted motifs with the phylogenetic tree of orchid CYP75s. (B) Sequence logo of motif 1, 4, 11, 12, 17, 19 which encoded the K-helix region, Heme-binding region, PERF motif, I-helix region, the CYP75B unique motif and the CYP75A unique motif, respectively.
To further explore the characteristics of CYP75 genes in orchids, intron-exon structure is analyzed as shown by Figure S3. The results show that the orchid CYP75 family is composed of one−eight exons and one−seven introns, and in DchCYP75B3 and DhuCYP75B4, the exons are split into many small fragments by introns. Most CYP75 protein sequences have relatively long intron regions, while all CYP75A genes contain only one intron, which is a unique feature of the CYP75A subfamily.
Chromosomal localization of orchid CYP75s
As shown in Figure 4A, both CYP75A genes of C. goeringii are located on chromosome 02, while the remaining CgCYP75B genes are scattered on chromosomes 01, 08, 11, and 14. Among them, chromosomes 02, 08 and 11 all have two CgCYP75s located at the same site. The four CYP75s of C. ensifolium are scattered on chromosomes 01, 02, 09 and 11, respectively (Figure 4B). The CYP75 genes of D. huoshanense are evenly distributed across five chromosomes, specifically chromosomes 01, 06, 10, 16, and 18. The two DhuCYP75A genes are located together on chromosome 06 (Figure 4C). In D. chrysotoxum, only DchCYP75B4 and DchCYP75B5 are co-located on chromosome 07, while the remaining DchCYP75 are separately distributed on various chromosomes. Among them, DchCYP75A1 has not been assembled onto the chromosome, but is located on scaffold 787 (Figure 4D). For both Platanthera species, two PguCYP75Bs are co-located on chromosome 02 and 08 of Pl. guangdongensis, while three PziCYP75s are individually located on chromosome 01, 10 and 13 of Pl. zijinensis (Figures 4E, F).
Figure 4

Chromosome distribution in orchids CYP75s. (A)C. goeringii. (B)C. ensifolium. (C)D. huoshanense. (D)D. chrysotoxum. (E)Pl. guangdongensis. (F)Pl. zijinensis.
Collinearity analysis of CYP75 gene family in three orchids
To investigate the evolution of CYP75 genes in orchids, the collinear relationship among CYP75s in C. goeringii, C. ensifolium, and D. chrysotoxum was analyzed. Our collinear analysis revealed a one-to-one correspondence among all CYP75 genes in the three orchids, indicating limited reshuffling of CYP75 orthologs and significant genomic rearrangements following the divergence of Dendrobium and Cymbidium lineages (Figure 5). Furthermore, we also examined that CYP75 gene tandem duplication occurred on the chromosomes of both C. goeringii and D. chrysotoxum, which was consistent with the results of chromosome localization (Figure 4). Our results suggest that a small-scale tandem duplication may have led to the expansion of CYP75 gene family in orchids.
Figure 5

The collinearity of CYP75 genes between C. goeringii and C. ensifolium, C. goeringii and D. chrysotoxum.
Cis-acting regulatory elements of CgCYP75s
To explore the regulatory roles of CYP75s, we retrieved the 2,000 bp upstream and downstream regions of CYP75 genes in C. goeringii to identify potential cis-elements. We identified a total of 1,257 cis-acting elements, including 38 types and ten responsive functions (Figure 6; Table S4). Among these elements, TATA-box made up the most common elements (46.38%), followed by CAAT-box (22.83%) (Table S5). Cis-element functions included phytohormone responsiveness for gibberellin, auxin, methyl jasmonate (MeJA), salicylic acid, and abscisic acid (ABA); stress responses, such as anoxic, anaerobic, low-temperature, and defense; and growth and development elements, such as light response and MYB binding site (Figure 6). Each CgCYP75 gene contained multiple types of elements with light responsiveness as the most occurring element function (Figure 6), supporting that light is one of the most important environmental factors affecting flavonoid biosynthesis in plants (
Figure 6

Cis-acting elements in the 2k bp of upstream and downstream regions of CgCYP75 genes. (A) Elements with similar regulatory functions are displayed in the same color. (1) The 2k bp of upstream of CgCYP75 genes. (2) The 2k bp of downstream of CgCYP75 genes. (B) Numbers of each type of element.
Expression patterns of orchid CYP75s and qRT-PCR verification of CgCYP75s
The orchid transcriptomic data from nine orchids out of 13 were visualized as heatmaps via TBtools software as shown in Figure 7. Expression analysis was conducted using transcriptome data from nine orchids, including various plant organs, such as sepals, petals, labellums (also known as lips), the gynostemium, inflorescence, stems, tubers, pollinia, and complete flowers. The expression profile showed that CYP75B genes were expressed broadly in flower organs, while the expression of CYP75A genes was more confined to C. ensifolium, D. catenatum and G. elata, particularly in the gynostemiums (Figure 7). In C. goeringii, CYP75A genes showed little expression in all tissues of two varieties, while CgCYP75B1 exhibited an exclusive expression in four organs of ‘PR’ C. goeringii and gynostemium of ‘GY’ C. goeringii (Figure 7A). Furthermore, CeCYP75B1 and CeCYP75B2 exhibited elevated expression levels across all four floral organs (Figure 7B). In D. chrysotoxum, the expression of the DchCYP75B4 gene was significantly higher on the lip than other parts, due to the predominant distribution of its anthocyanin in the red macula of the lip (Figure 7C). A similar situation existed in D. catenatum, DcaCYP75B1 and DcaCYP75B3 were highly expressed in the lip with purple-red spots (Figure 7D). PeqCYP75B1 was expressed prominently in various tissues of P. equestris, with higher expression in the darker colored lip region. On the other hand, PeqCYP75B5 showed significant expression levels in the sepals. (Figure 7E). AshCYP75B3 of A. shenzhenica was expressed in all parts except for the pollinium, with the highest expression in the inflorescence (Figure 7F). In both of the Platanthera species, PziCYP75B1 and PguCYP75B4 exhibited the highest expression levels in fleshy underground tubers (Figure 7G), possibly indicating their involvement in non-biological stress response (
Figure 7

The expression patterns of CYP75 genes among different tissues in nine orchids. Se, sepal; pe, petal; lip, labellum; gy, gynostemium; in, inflorescence; st, stem; tu, tuber; po, pollinium; fl, whole flower. (A)C. goeringii. GY, green-yellow flower; PR, purple-red flower. (B)C. ensifolium. (C)D. chrysotoxum. (D)D. catenatum. (E)P. equestris.(F)A. shenzhenica. (G)Pl. guangdongensis and Pl. zijinensis. (H)G. elata. The FPKM values of orchid CYP75s in different flower organs are listed in Table S6.
To further investigate the specific roles of CYP75 gene expression in two C. goeringii varieties, their sepals, petals, lips and gynostemiums were analyzed by qRT-PCR (Figure 8). In two C. goeringii varieties (Figure 8A), the CgCYP75B1 showed extremely high expression in ‘PR’ C. goeringii, especially in petal which matched the transcriptome data, but were barely detected in ‘GY’ C. goeringii, further verifying that CYP75B genes have an anthocyanin-specific expression in the flower organs. As for CgCYP75A1, its expression level was relatively low in four types of tissues of the two C. goeringii plants (Figure 8B). Interestingly, CgCYP75A1 and CgCYP75B1 showed a certain degree of stable expression in the gynostemiums of both C. goeringii plants (Figure 8B). The function of these CYP75 genes may not be related to anthocyanin synthesis, but rather to attracting pollinators to promote reproduction (
Figure 8

Expression profiles of different tissues of CgCYP75 genes by real-time reverse transcription quantitative PCR (RT-qPCR). (A) Two flower colour types. GY, green-yellow flower; PR, purple-red flower; se, sepal; pe, petal; lip, labellum; gy, gynostemium. (B) RT-qPCR validation of transcriptomic data of the CgCYP75A1 and CgCYP75B1 at four flower organs. The error bars indicate three RT-qPCR biological replicates. The asterisk indicates the P value in the significance test (** p < 0.01, *** p < 0.001).
Discussion
Flower color is one of the key criteria for evaluating the quality of ornamental plants, as well as their horticultural and economic values. Plant flower color is also a major factor in attracting pollinators, which helps to increase the success rate of pollination and plays an important role in the evolution of plants (
Phylogenetic relationships and protein structure analyses support the division of the CYP75 gene family into two branches, CYP75A and CYP75B (Figure 2) which is in agreement with previous studies (
All members of the CYP gene family have four identifiable motifs: the PERF motif, K-helix region, I-helix region, and heme-binding region, as shown in Figure 1 (
Whole-genome sequencing has enabled researchers to uncover variations in the gene structure of gene families across different species. While gene structure tends to be conserved within the same clade, peculiarities have been observed in the CYP75 gene family of orchids, particularly in subfamilies CYP75A and CYP75B. In contrast to the longer introns typically found in orchids, CYP75As has a single intron (Figure S3), which is a unique characteristic. Longer introns are believed to be favored during gene evolution as they increase recombination between adjacent exons, thus promoting natural selection efficiency (
Gene expression is primarily regulated by cis-acting elements of the transcription start site (
Studies have shown that the expression level of the CYP75 genes is positively related to the accumulation of anthocyanins. Based on the RT-qPCR and transcriptome expression analysis in this study, this point of view can be supported. In two C. goeringii varieties with significantly different colors, the expression of the CYP75B genes which regulate the purple-red anthocyanins is generally much higher in ‘PR’ C. goeringii compared to ‘YG’ C. goeringii. Moreover, the expression level of CgCYP75B is higher than that of CgCYP75A in any of the two varieties. Furthermore, in the lip of ‘YG’ C. goeringii without anthocyanin, the CYP75 gene shows relatively stable expression. This suggests that the function of the CYP75 genes is not mainly to regulate anthocyanin synthesis. The two F3’H and F3’5’H enzymes regulated by the CYP75 genes are both flavonoids, and flavonoids are involved in almost all metabolic processes in plants, including attracting pollinators (
Conclusion
In this study, we identified 72 members of the CYP75 gene family from the 13 orchid genomes. We analyzed their protein physicochemical properties, subcellular localization, motifs, intron-exon structures, chromosome distribution, promoter elements, expression patterns and gene ontology classification. We found that the CYP75 gene family in Orchidaceae is a low-copy gene family with high conservation. V. planifolia has the highest count of CYP75 genes, with ten, while Pl. guangdongensis has only two genes, the CYP75A gene is missing and only two CYP75B genes are present. We observed a notable discrepancy between the number of genes in the CYP75A and CYP75B subfamilies, which could potentially be attributed to differences in their evolutionary statuses. The characteristic motifs of CYP450 are present in all CYP75 proteins, with a specific motif found in two subfamilies, respectively. We demonstrate that the distinct roles of cis-elements in light response and MYB binding sites are working together with the crucial function of CYP75s in the biosynthesis of anthocyanins. In addition, the expression patterns generated by transcriptomic and RT-qPCR data supported a color-specific expression of CYP75Bs in the flower organs. Our study presents a comprehensive analysis of the functions and expression patterns of CYP75 genes in Orchidaceae. These results build a foundation for deeper understanding regarding the role of CYP75 genes in plant anthocyanin biosynthesis of plants, offering insights into the flexibility of plant pigmentation. A crucial subsequent task will entail conducting functional analysis of CYP75 in non-model plants, in order to discern the further functions of CYP75 in the context of angiosperms evolution.
Statements
Data availability statement
The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/Supplementary Material.
Author contributions
Z-JL and SL conceived and designed the research. XZ and M-MZ prepared the original draft. XH and YH performed the data analysis. YL and SA wrote and edited the language. All authors contributed to the article and approved the submitted version.
Funding
This work was supported by the Forestry Peak Discipline Construction Project of Fujian Agriculture and Forestry University (72202200205).
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2023.1243828/full#supplementary-material
References
1
AiY.LiZ.SunW.-H.ChenJ.ZhangD.MaL.et al. (2021). The Cymbidium genome reveals the evolution of unique morphological traits. Hortic. Res.8, 255. doi: 10.1038/s41438-021-00683-z
2
ArtimoP.JonnalageddaM.ArnoldK.BaratinD.CsardiG.de CastroE.et al. (2012). ExPASy: SIB bioinformatics resource portal. Nucleic Acids Res.40, W597–W603. doi: 10.1093/nar/gks400
3
AyabeS.AkashiT. (2006). Cytochrome P450s in flavonoid metabolism. Phytochem. Rev.5, 271–282. doi: 10.1007/s11101-006-9007-3
4
BabuP. R.RaoK. V.ReddyV. D. (2013). Structural organization and classification of cytochrome P450 genes in flax (Linum usitatissimum L.). Gene513, 156–162. doi: 10.1016/j.gene.2012.10.040
5
BaileyT. L.BodenM.BuskeF. A.FrithM.GrantC. E.ClementiL.et al. (2009). MEME SUITE: Tools for motif discovery and searching. Nucleic Acids Res.37, W202–W208. doi: 10.1093/nar/gkp335
6
Brignac-HuberL. M.ParkJ. W.ReedJ. R.BackesW. L. (2016). Cytochrome P450 organization and function are modulated by endoplasmic reticulum phospholipid heterogeneity. Drug Metab. Dispos44 (12), 1859–1866. doi: 10.1124/dmd.115.068981
7
BruglieraF.Barri-RewellG.HoltonT. A.MasonJ. G. (1999). Isolation and characterization of a flavonoid 3’-hydroxylase cDNA clone corresponding to the Ht1 locus of Petunia hybrida. Plant J.19, 441–451. doi: 10.1046/j.1365-313X.1999.00539.x
8
CaiJ.LiuX.VannesteK.ProostS.TsaiW.-C.LiuK.-W.et al. (2015). The genome sequence of the orchid Phalaenopsis equestris. Nat. Genet.47, 65–72. doi: 10.1038/ng.3149
9
CederbaumA. I. (2015). Molecular mechanisms of the microsomal mixed function oxidases and biological and pathological implications. Redox Biol.4, 60–73. doi: 10.1016/j.redox.2014.11.008
10
ChaoY.-T.ChenW.-C.ChenC.-Y.HoH.-Y.YehC.-H.KuoY.-T.et al. (2018). Chromosome-level assembly, genetic and physical mapping of Phalaenopsis aphrodite genome provides new insights into species adaptation and resources for orchid breeding. Plant Biotechnol. J.16, 2027–2041. doi: 10.1111/pbi.12936
11
ChaseM. W.CameronK. M.FreudensteinJ. V.PridgeonA. M.SalazarG.van den BergC.et al. (2015). An updated classification of Orchidaceae: Updated Classification of Orchidaceae. Bot. J. Linn Soc.177, 151–174. doi: 10.1111/boj.12234
12
ChenC.ChenH.HeY.XiaR. (2018). TBtools, a toolkit for biologists integrating various biological data handling tools with a user-friendly interface. bioRxiv [Preprint]. doi: 10.1101/289660
13
ChouK.-C.ShenH.-B. (2010). Plant-mPLoc: a top-down strategy to augment the power for predicting plant protein subcellular localization. PLoS One5, e11335. doi: 10.1371/journal.pone.0011335
14
ChungO.KimJ.BolserD.KimH. M.JunJ. H.ChoiJ. P.et al. (2022). A chromosome-scale genome assembly and annotation of the spring orchid (Cymbidium goeringii). Mol Ecol Resour22, 1168–1177. doi: 10.1111/1755-0998.13537
15
CrooksG. E.HonG.ChandoniaJ.-M.BrennerS. E. (2004). WebLogo: a sequence logo generator. Genome Res.14, 1188–1190. doi: 10.1101/gr.849004
16
DarribaD.PosadaD.KozlovA. M.StamatakisA.MorelB.FlouriT. (2019). ModelTest-NG: a new and scalable tool for the selection of DNA and protein evolutionary models. Mol. Biol. Evol.37, 291–294. doi: 10.1093/molbev/msz189
17
DarribaD.PosadaD.KozlovA. M.StamatakisA.MorelB.FlouriT. (2004). ModelTest-NG: a new and scalable tool for the selection of DNA and protein evolutionary models. Mol. Biol. Evol.37, 291–294. doi: 10.1093/molbev/msz189
18
FalginellaL.CastellarinS. D.TestolinR.GambettaG. A.MorganteM.Di GasperoG. (2010). Expansion and subfunctionalisation of flavonoid 3’,5’-hydroxylases in the grapevine lineage. BMC Genomics11, 562. doi: 10.1186/1471-2164-11-562
19
HanB.JingY.DaiJ.ZhengT.GuF.ZhaoQ.et al. (2020). A chromosome-level genome assembly of Dendrobium huoshanense using long reads and Hi-C data. Genome Biol. Evol.12, 2486–2490. doi: 10.1093/gbe/evaa215
20
HasingT.TangH.BrymM.KhaziF.HuangT.ChambersA. H. (2020). A phased Vanilla planifolia genome enables genetic improvement of flavour and production. Nat. Food1, 811–819. doi: 10.1038/s43016-020-00197-2
21
HeZ.ZhangH.GaoS.LercherM. J.ChenW.-H.HuS. (2016). Evolview v2: An online visualization and management tool for customized and annotated phylogenetic trees. Nucleic Acids Res.44, W236–W241. doi: 10.1093/nar/gkw370
22
Hernandez-GarciaC. M.FinerJ. J. (2014). Identification and validation of promoters and cis-acting regulatory elements. Plant Sci.217–218, 109–119. doi: 10.1016/j.plantsci.2013.12.007
23
HoltonT. A.BruglieraF.LesterD. R.TanakaY.HylandC. D.MentingJ. G.et al. (1993). Cloning and expression of cytochrome P450 genes controlling flower colour. Nature366 (6452), 276–279. doi: 10.1038/366276a0
24
HopkinsR.RausherM. D. (2012). Pollinator-mediated selection on flower color allele drives reinforcement. Science335, 1090–1092. doi: 10.1126/science.1215198
25
HuW.HouX.HuangC.YanY.TieW.DingZ.et al. (2015). Genome-wide identification and expression analyses of Aquaporin gene family during development and abiotic stress in banana. IJMS16, 19728–19751. doi: 10.3390/ijms160819728
26
HuangW.SunW.WangY. (2012). Isolation and molecular characterisation of flavonoid 3’-hydroxylase and flavonoid 3’, 5’-hydroxylase genes from a traditional Chinese medicinal plant, Epimedium sagittatum. Gene497, 125–130. doi: 10.1016/j.gene.2011.11.029
27
Huerta-CepasJ.SzklarczykD.HellerD.Hernández-PlazaA.ForslundS. K.CookH.et al. (2019). eggNOG 5.0: A hierarchical, functionally and phylogenetically annotated orthology resource based on 5090 organisms and 2502 viruses. Nucleic Acids Res.47, D309–D314. doi: 10.1093/nar/gky1085
28
IwashinaT. (2003). Flavonoid function and activity to plants and other organisms. Biol. Sci. Space17, 24–44. doi: 10.2187/bss.17.24
29
JoB.-S.ChoiS. S. (2015). Introns: The functional benefits of introns in genomes. Genomics Inform13, 112. doi: 10.5808/GI.2015.13.4.112
30
KianiY. S.JabeenI. (2019). Lipophilic metabolic efficiency (LipMetE) and drug efficiency indices to explore the metabolic properties of the substrates of selected cytochrome P450 isoforms. ACS Omega5, 179–188. doi: 10.1021/acsomega.9b02344
31
KumarM. S.BabuP. R.RaoK. V.ReddyV. D. (2014). Organization and classification of cytochrome P450 genes in Castor (Ricinus communis L.). Proc. Natl. Acad. Sci. India Sect. B Biol. Sci.84, 131–143. doi: 10.1007/s40011-013-0192-8
32
KumarS.StecherG.TamuraK. (2016). MEGA7: Molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol. Biol. Evol.33, 1870–1874. doi: 10.1093/molbev/msw054
33
LamP. Y.LiuH.LoC. (2015). Completion of tricin biosynthesis pathway in rice: cytochrome P450 75B4 is a unique Chrysoeriol 5’-Hydroxylase. Plant Physiol.168, 1527–1536. doi: 10.1104/pp.15.00566
34
LescotM. (2002). PlantCARE, a database of plant cis-acting regulatory elements and a portal to tools for in silico analysis of promoter sequences. Nucleic Acids Res.30, 325–327. doi: 10.1093/nar/30.1.325
35
LiB.DeweyC. N. (2011). RSEM: Accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinf.12, 323–323. doi: 10.1186/1471-2105-12-323
36
LiC.DongN.ZhaoY.WuS.LiuZ.ZhaiJ. (2021). A review for the breeding of orchids: Current achievements and prospects. Hortic. Plant J.7, 380–392. doi: 10.1016/j.hpj.2021.02.006
37
LiM.-H.LiuK.-W.LiZ.LuH.-C.YeQ.-L.ZhangD.et al. (2022). Genomes of leafy and leafless Platanthera orchids illuminate the evolution of mycoheterotrophy. Nat. Plants8, 373–388. doi: 10.1038/s41477-022-01127-9
38
LiY.WeiK. (2020). Comparative functional genomics analysis of cytochrome P450 gene superfamily in wheat and maize. BMC Plant Biol.20, 93. doi: 10.1186/s12870-020-2288-7
39
LiuX.ZhuX.WangH.LiuT.ChengJ.JiangH. (2020). Discovery and modification of cytochrome P450 for plant natural products biosynthesis. SSB5, 187–199. doi: 10.1016/j.synbio.2020.06.008
40
MaD.TangH.ReicheltM.PiirtolaE.-M.SalminenJ.-P.GershenzonJ.et al. (2021). Poplar MYB117 promotes anthocyanin synthesis and enhances flavonoid B-ring hydroxylation by up-regulating the flavonoid 3’,5’-hydroxylase gene. J. Exp. Bot.72, 3864–3880. doi: 10.1093/jxb/erab116
41
MillerM. A.SchwartzT.PickettB. E.HeS.KlemE. B.ScheuermannR. H.et al. (2015). A restful API for access to phylogenetic tools via the CIPRES Science Gateway. Evol. Bioinform. Online11, EBO.S21501. doi: 10.4137/EBO.S21501
42
MizutaniM.OhtaD. (2010). Diversification of P450 genes during land plant evolution. Annu. Rev. Plant Biol.61, 291–315. doi: 10.1146/annurev-arplant-042809-112305
43
MuJ.YangY.LuoY.SuR.NiklasK. J. (2017). Pollinator preference and pollen viability mediated by flower color synergistically determine seed set in an Alpine annual herb. Ecol. Evol.7, 2947–2955. doi: 10.1002/ece3.2899
44
NeveE. P.Ingelman-SundbergM. (2010). Cytochrome P450 proteins: Retention and distribution from the endoplasmic reticulum. Curr. Opin. Drug Discovery Dev.13, 78–85.
45
NelsonD. R.KoymansL.KamatakiT.StegemanJ. J.FeyereisenR.WaxmanD. J.et al. (1999). P450 superfamily: update on new sequences, gene mapping, accession numbers and nomenclature. Pharmacogenet Genomics6, 1–42. doi: 10.1097/00008571-199602000-00002
46
NiuZ.ZhuF.FanY.LiC.ZhangB.ZhuS.et al. (2021). The chromosome-level reference genome assembly for Dendrobium officinale and its utility of functional genomics research and molecular breeding study. Acta Pharm. Sin. B11, 2080–2092. doi: 10.1016/j.apsb.2021.01.019
47
OlsenK. M.HehnA.JugdéH.SlimestadR.LarbatR.BourgaudF.et al. (2010). Identification and characterisation of CYP75A31, a new flavonoid 3’ 5’-hydroxylase, isolated from Solanum lycopersicum. BMC Plant Biol.10, 21. doi: 10.1186/1471-2229-10-21
48
OmuraT.SatoR. (1962). A new cytochrome in liver microsomes. J. Biol. Chem.237, 1375–1376.
49
PourcelL.RoutaboulJ.CheynierV.LepiniecL.DebeaujonI. (2007). Flavonoid oxidation in plants: From biochemical properties to physiological functions. Trends Plant Sci.12, 29–36. doi: 10.1016/j.tplants.2006.11.006
50
QiX.YuX.XuD.FangH.DongK.LiW.et al. (2017). Identification and analysis of CYP450 genes from transcriptome of Lonicera japonica and expression analysis of chlorogenic acid biosynthesis related CYP450s. PeerJ5, e3781. doi: 10.7717/peerj.3781
51
RausherM. D. (2006). “The evolution of flavonoids and their genes,” in The science of flavonoids (Berlin: Springer), 175–211. doi: 10.1007/978-0-387-28822-2_7
52
RausherM. D. (2008). Evolutionary transitions in floral color. Int. J. Plant Sci.169, 7–21. doi: 10.1086/523358
53
ReddyS. M.KumarM. S.RaoK. V.ReddyV. D. (2014). Structural organization, classification and phylogenetic relationship of cytochrome P450 genes in Ctrus cleemtina and Citrus sinensis. Tree Genet Genomes10, 399–409. doi: 10.1007/s11295-013-0695-8
54
RobertsD. L.DixonK. W. (2008). Orchids. Curr Biol18, R325–R329. doi: 10.1016/j.cub.2008.02.026
55
RozewickiJ.LiS.AmadaK. M.StandleyD. M.KatohK. (2019). MAFFT-DASH: integrated protein sequence and structural alignment. Nucleic Acids Res.47, W5–W10. doi: 10.1093/nar/gkz342
56
SamantaA.DasG.DasS. K. (2011). Roles of flavonoids in plants. Int J Pharm Sci Tech6, 12–35.
57
SchoenbohmC.MartensS.EderC.ForkmannG.WeisshaarB. (2000). Identification of the Arabidopsis thaliana flavonoid 3’-hydroxylase gene and functional expression of the encoded P450 enzyme. Biol. Chem.381, 749–753. doi: 10.1515/BC.2000.095
58
SchulerM. A.Werck-ReichhartD. (2003). Functional genomics of P450s. Annu. Rev. Plant Biol.54, 629–667. doi: 10.1146/annurev.arplant.54.031902.134840
59
SeitzC.AmeresS.ForkmannG. (2007). Identification of the molecular basis for the functional difference between flavonoid 3’-hydroxylase and flavonoid 3’,5’-hydroxylase. FEBS Lett.581, 3429–3434. doi: 10.1016/j.febslet.2007.06.045
60
SeitzC.EderC.DeimlB.KellnerS.MartensS.ForkmannG. (2006). Cloning, functional identification and sequence analysis of flavonoid 3’-hydroxylase and flavonoid 3’,5’-hydroxylase cDNAs reveals independent evolution of flavonoid 3’,5’-hydroxylase in the Asteraceae family. Plant Mol. Biol.61, 365–381. doi: 10.1007/s11103-006-0012-0
61
SimoF. A.WaterhouseR. M.IoannidisP.KriventsevaE. V.ZdobnovE. M. (2015). Busco: Assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics31, 3210–3212. doi: 10.1093/bioinformatics/btv351
62
SunY.ChenG.-Z.HuangJ.LiuD.-K.XueF.ChenX.-L.et al. (2021). The Cymbidium goeringii genome provides insight into organ development and adaptive evolution in orchids. Ornamental Plant Res.1, 1–13. doi: 10.48130/OPR-2021-0010
63
TanakaY.BruglieraF. (2013). Flower colour and cytochromes P450. Phil. Trans. R. Soc B368, 20120432. doi: 10.1098/rstb.2012.0432
64
TodaK.YangD.YamanakaN.WatanabeS.HaradaK. (2002). A single-base deletion in soybean flavonoid 3-hydroxylase gene is associated with gray pubescence color. Plant Mol. Biol.50 (2), 187–196. doi: 10.1023/a:1016087221334
65
TroshinP. V.ProcterJ. B.BartonG. J. (2011). Java bioinformatics analysis web services for multiple sequence alignment-JABAWS:MSA. Bioinformatics27, 2001–2002. doi: 10.1093/bioinformatics/btr304
66
VikhorevA. V.StryginaK. V.KhlestkinaE. K. (2019). Duplicated flavonoid 3’-hydroxylase and flavonoid 3’, 5’-hydroxylase genes in barley genome. PeerJ7, e6266. doi: 10.7717/peerj.6266
67
WangY.-S.XuY.-J.GaoL.-P.YuO.WangX.-Z.HeX.-J.et al. (2014). Functional analysis of flavonoid 3’,5’-hydroxylase from tea plant (Camellia sinensis): Critical role in the accumulation of catechins. BMC Plant Biol.14, 347. doi: 10.1186/s12870-014-0347-7
68
WangQ.-Q.ZhuM.-J.YuX.BiY.-Y.ZhouZ.ChenM.-K.et al. (2021). Genome-wide identification and expression analysis of terpene synthase genes in cymbidium faberi. Front. Plant Sci.12. doi: 10.3389/fpls.2021.751853
69
WangY. P.TangH. B.DebarryJ. DTanX.LiJ. P.WangX. Y.et al. (2012). MCScanX: a toolkit for detection and evolutionary analysis of gene synteny and collinearity. Nucleic Acids Res.40, e49. doi: 10.1093/nar/gkr1293
70
WenW.AlseekhS.FernieA. R. (2012). Conservation and diversification of flavonoid metabolism in the plant kingdom. Curr Opin Plant Biol.55, 100–108. doi: 10.1016/j.pbi.2020.04.004
71
WhibleyA. C.LangladeN. B.AndaloC.HannaA. I.BanghamA.ThébaudC.et al. (2006). Evolutionary paths underlying flower color variation in Antirrhinum. Science313, 963–966. doi: 10.1126/science.1129161
72
XiaoY.WenJ.MengR.MengY.ZhouQ.NieZ.-L. (2021). The expansion and diversity of the CYP75 gene family in Vitaceae. PeerJ9, e12174. doi: 10.7717/peerj.12174
73
XuY.LeiY.SuZ.ZhaoM.ZhangJ.ShenG.et al. (2021). A chromosome-scale Gastrodia elata genome and large-scale comparative genomic analysis indicate convergent evolution by gene loss in mycoheterotrophic and parasitic plants. Plant J.108, 1609–1623. doi: 10.1111/tpj.15528
74
YangF.GaoJ.WeiY.RenR.ZhangG.LuC.et al. (2021). The genome of Cymbidium sinense revealed the evolution of orchid traits. Plant Biotechnol. J.19, 2501–2516. doi: 10.1111/pbi.13676
75
YuanY.JinX.LiuJ.ZhaoX.ZhouJ.WangX.et al. (2018). The Gastrodia elata genome provides insights into plant adaptation to heterotrophy. Nat. Commun.9, 1615. doi: 10.1038/s41467-018-03423-5
76
ZhangP.DuH.WangJ.PuY.YangC.YanR.et al. (2020). Multiplex CRISPR/Cas9-mediated metabolic engineering increases soya bean isoflavone content and resistance to soya bean mosaic virus. Plant Biotechnol. J.18, 1384–1395. doi: 10.1111/pbi.13302
77
ZhangT.LiuC.HuangX.ZhangH.YuanZ. (2019). Land-plant phylogenomic and pomegranate transcriptomic analyses reveal an evolutionary scenario of CYP75 genes subsequent to whole genome duplications. J. Plant Biol.62, 48–60. doi: 10.1007/s12374-018-0319-9
78
ZhangG.-Q.LiuK.-W.LiZ.LohausR.HsiaoY.-Y.NiuS.-C.et al. (2017). The Apostasia genome and the evolution of orchids. Nature549, 379–383. doi: 10.1038/nature23897
79
ZhangG.-Q.XuQ.BianC.TsaiW.-C.YehC.-M.LiuK.-W.et al. (2016). The Dendrobium catenatum Lindl. genome sequence provides insights into polysaccharide synthase, floral development and adaptive evolution. Sci. Rep.6, 19029. doi: 10.1038/srep19029
80
ZhangW.ZhangG.ZengP.ZhangY.HuH.LiuZ.et al. (2021a). Genome sequence of Apostasia ramifera provides insights into the adaptive evolution in orchids. BMC Genomics22, 536. doi: 10.1186/s12864-021-07852-3
81
ZhangY.ZhangG.-Q.ZhangD.LiuX.-D.XuX.-Y.SunW.-H.et al. (2021b). Chromosome-scale assembly of the Dendrobium chrysotoxum genome enhances the understanding of orchid evolution. Hortic. Res.8, 183. doi: 10.1038/s41438-021-00621-z
82
ZhaoX.LiuD.-K.WangQ.-Q.KeS.LiY.ZhangD.et al. (2022). Genome-wide identification and expression analysis of the GRAS gene family in Dendrobium chrysotoxum. Front. Plant Sci.13. doi: 10.3389/fpls.2022.1058287
83
ZhaoX.ZhangM.HeX.ZhengQ.HuangY.LiY.et al. (2023). Genome-wide identification and expression analysis of the SPL gene family in three orchids. IJMS24, 10039. doi: 10.3390/ijms241210039
84
ZorattiL.KarppinenK.Luengo EscobarA.HäggmanH.JaakolaL. (2014). Light-controlled flavonoid biosynthesis in fruits. Front. Plant Sci.5. doi: 10.3389/fpls.2014.00534
Summary
Keywords
Orchidaceae, cytochrome P450, CYP75 gene family, expression analysis, Cymbidium goeringii
Citation
Li Y, Zhao X, Zhang M-M, He X, Huang Y, Ahmad S, Liu Z-J and Lan S (2023) Genome-based identification of the CYP75 gene family in Orchidaceae and its expression patterns in Cymbidium goeringii. Front. Plant Sci. 14:1243828. doi: 10.3389/fpls.2023.1243828
Received
22 June 2023
Accepted
11 September 2023
Published
27 September 2023
Volume
14 - 2023
Edited by
Dianella G. Howarth, St. John’s University, United States
Reviewed by
Cao Deng, DNA Stories Bioinformatics Center, China; Zhichao Xu, Northeast Forestry University, China
Updates

Check for updates
Copyright
© 2023 Li, Zhao, Zhang, He, Huang, Ahmad, Liu and Lan.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Zhong-Jian Liu, zjliu@fafu.edu.cn; Siren Lan, lkzx@fafu.edu.cn
Disclaimer
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.