ORIGINAL RESEARCH article

Front. Plant Sci., 13 July 2015

Sec. Plant Genetics and Genomics

Volume 6 - 2015 | https://doi.org/10.3389/fpls.2015.00515

Comparative analysis of the phytocyanin gene family in 10 plant species: a focus on Zea mays

  • Institute of Life Sciences, Jiangsu University Zhenjiang, China

Abstract

Phytocyanins (PCs) are plant-specific blue copper proteins that play essential roles in electron transport. However, the origin and expansion of this gene family have not been well investigated in plants. Here, we investigated their evolution through a genome-wide identification and comparison in 10 plants: Arabidopsis, rice, poplar, tomato, soybean, grape, maize, Selaginella moellendorffii, Physcomitrella patens, and Chlamydomonas reinhardtii. We found that this gene family expanded during evolution. Excluding the PCs of Arabidopsis and rice, which have been described in previous studies, a structural analysis of PCs in the other eight plants indicated that 292 PCs contained N-terminal secretion signals and 217 PCs were predicted to have glycosylphosphatidylinositol-anchor signals. Moreover, 281 PCs had putative arabinogalactan glycomodules and might be arabinogalactan proteins (AGPs). Chromosomal distribution and duplication patterns indicated that tandem and segmental duplication played dominant roles in the expansion of PC genes. In addition, gene organization and motif composition are highly conserved within each clade. Furthermore, expression profiles of maize PC genes revealed diverse patterns across developmental stages. All nine maize PC genes examined (ZmUC10, ZmUC16, ZmUC19, ZmSC2, ZmUC21, ZmENODL10, ZmUC22, ZmENODL13, and ZmENODL15) were down-regulated under salt treatment, and five PCs (ZmUC19, ZmSC2, ZmENODL10, ZmUC22, and ZmENODL13) were down-regulated under drought treatment. ZmUC16 was strongly expressed after drought treatment. This study provides a basis for the future characterization of this family.

Introduction

Blue copper proteins are ancient, type-I copper-containing proteins that function as electron transporters in bacteria and plants (Giri et al., 2004). Blue copper proteins in plants are defined as phytocyanins (PCs), which include plastocyanins and some phytocyanin-related proteins (De Rienzo et al., 2000). Structurally, PCs contain two conserved Cys residues that form a disulfide bridge, four copper ligands, and an eight-stranded β-sandwich fold (Hart et al., 1996). According to the glycosylation state, copper ligand residues, domain organization, and spectroscopic properties of the proteins, PCs can be divided into four groups: plantacyanins (PLCs), uclacyanins (UCs), stellacyanins (SCs), and early nodulin-like proteins (ENODLs; Nersissian et al., 1998; Ma et al., 2011; Li et al., 2013). PLCs contain a copper-binding site consisting of one Met, one Cys, and two His ligands (Guss et al., 1996), and their N-terminal leader sequences usually contain endoplasmic reticulum targeting signal peptides (Nersissian et al., 1998). Although UCs have the same four residues in their copper-binding sites, they contain an additional domain resembling cell-wall structural proteins (glycoproteins; Nersissian et al., 1998). SCs use a Gln residue as a copper ligand, whereas PLCs and UCs have a Met residue at this position (Nersissian et al., 1998). Like UCs, SCs consist of a copper-binding domain and a glycoprotein-like domain. The structure of ENODLs is similar to that of UCs and SCs, but ENODLs cannot bind copper and might therefore be involved in processes that do not require copper binding (Greene et al., 1998; Mashiguchi et al., 2009; Ma et al., 2011).

Previous studies have indicated that PCs are involved in various plant activities, including cell differentiation and reorganization (Fedorova et al., 2002; Kato et al., 2002), pollen tube germination and anther pollination (Kim et al., 2003; Dong et al., 2005), determination of reproductive potential (Khan et al., 2007), organ development in apical buds (Mashiguchi et al., 2009), and somatic embryogenesis (Poon et al., 2012). In addition, PCs may function in stress responses, including enhancing osmotic tolerance (Wu et al., 2011), and inhibiting aluminum absorption and protecting cells from aluminum toxicity (Ezaki et al., 2001, 2005). Several studies have indicated that salt and drought stresses can induce the expression of some PC genes, suggesting a potential role in responses to abiotic stresses (Ozturk et al., 2002; Ma et al., 2011).

To date, comprehensive bioinformatics analyses have identified only 38, 62, and 84 PC genes in Arabidopsis, rice and Brassica rapa, respectively (Mashiguchi et al., 2009; Showalter et al., 2010; Ma et al., 2011; Li et al., 2013). In the present study, we identified the PC gene family in 10 plant species, including Arabidopsis and rice; each species contains 1–89 PC genes. Given the important roles of PC genes in development and stress responses, and the large variation in their number among plant species, we set out to examine how the PC genes have evolved in Plantae, and how and why different plant species have acquired such different sets of PC genes. Our results indicate that the PC gene family has expanded during plant evolution, and that tandem and segmental duplications and retrotransposition played dominant roles in this expansion. Our study also reveals diverse expression patterns of the PC genes in maize.

Materials and Methods

Identification of the PC Genes in Plants and Bioinformatics Analysis

We first used Arabidopsis, rice and B. rapa PC sequences (Mashiguchi et al., 2009; Showalter et al., 2010; Ma et al., 2011; Li et al., 2013) as queries in basic local alignment search tool (BLAST) searches against Phytozome (Goodstein et al., 2012) with an expect (E) threshold of -1 to identify potential members of the PC gene family in plants. The sequences were then confirmed as encoding PCs by Pfam (Punta et al., 2012) searches for the presence of a plastocyanin-like domain (PCLD) signature. Subsequently, the SignalP 4.1 Server (Petersen et al., 2011) was used to check the signal peptide (SP) of all proteins, and the Big-PI Plant Predictor (Eisenhaber et al., 2003) was used to predict the glycosylphosphatidylinositol (GPI)-anchor signal. In addition, we used the NetNGlyc 1.0 Server to predict the N-glycosylation sites in PC proteins. Putative arabinogalactan (AG) glycomodules were predicted mainly following previously described criteria (Schultz et al., 2002; Showalter et al., 2010; Ma et al., 2011). The structural characteristics of the PCs are shown in Supplementary Table S1.

Phylogenetic Analyses of the PC Gene Family in Plants

We used MUSCLE 3.52 (Edgar, 2004) to perform multiple sequence alignments of full-length protein sequences. The neighbor-joining (NJ) method in MEGA v5 (Tamura et al., 2011) was then used to carry out phylogenetic analyses of the PC proteins with the Dayhoff model and default assumptions. Bootstrap analyses with 1,000 replicates were used to test support.

Estimation of the Maximum Number of Gained and Lost PCs

Next, we divided the phylogeny into different clades to determine the expansion extent of PC gene family in different plant lineages. Nodes among lineages denoted the most recent common ancestor (MRCA) and were labeled as V: Viridiplantae; E: Embryophyte; T: Tracheophyte; A: Angiosperm; G: Grass; Eu: eudicots; R: Rosid. Notung v2.6 (Chen et al., 2000) was used to infer gene loss and duplication events.

Conserved Motifs Analyses

The MEME program (Bailey et al., 2006) was used to identify motifs in the plant PC proteins. The program was run with the following parameters: maximum number of motifs = 8, number of repetitions = any, and optimum motif widths between 6 and 50 residues.
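The MEME settings listed above can also be expressed as a command line for the standalone MEME Suite tool. The sketch below only builds the argument list; it assumes a local `meme` installation, and the file names `pc_proteins.fasta` and `meme_out` are hypothetical. It is an illustration of how the stated parameters map onto command-line flags, not a record of how the analysis was actually run.

```python
import shlex


def build_meme_command(fasta_path, out_dir, n_motifs=8, min_width=6, max_width=50):
    """Return a MEME argv list mirroring the parameters stated in the text.

    fasta_path and out_dir are placeholders chosen for this sketch.
    """
    return [
        "meme", fasta_path,
        "-protein",                 # input sequences are proteins
        "-mod", "anr",              # "any number of repetitions" per sequence
        "-nmotifs", str(n_motifs),  # maximum number of motifs to report
        "-minw", str(min_width),    # minimum motif width (residues)
        "-maxw", str(max_width),    # maximum motif width (residues)
        "-oc", out_dir,             # output directory, overwritten if present
    ]


cmd = build_meme_command("pc_proteins.fasta", "meme_out")
print(shlex.join(cmd))
```

The list can be passed to `subprocess.run(cmd, check=True)` once MEME is installed and the input file exists.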

Chromosomal Location and Exon–Intron Structure Analysis

We used the annotation information of the PC genes in Phytozome (Goodstein et al., 2012) to determine their chromosomal locations. The segmental duplication (or syntenic) regions of the different chromosomes in the maize and Arabidopsis genomes were calculated with the Synteny Mapping and Analysis Program (SyMap; Soderlund et al., 2011). The Genomicus online tool (Louis et al., 2013) was used to explore PC gene organization within and between genomes. The exon–intron structure of the PC genes was also collected from the genome annotations.

Estimating the Age of Duplicated Paralog Gene Pairs

We first determined paralogous gene pairs from the protein phylogeny and used them as references for a multiple alignment of DNA coding sequences with the embedded ClustalW (codons) software in MEGA v5 (Tamura et al., 2011). We then used the K-Estimator 6.0 program (Comeron, 1999) to estimate the Ka and Ks values of the paralogous genes. The approximate date of the duplication event for each gene pair was calculated using the formula T = Ks/2λ, assuming a clock-like rate (λ) of 1.5 × 10⁻⁸ and 6.5 × 10⁻⁹ synonymous substitutions per site per year for Arabidopsis (Koch et al., 2000) and maize (Gaut et al., 1996), respectively.
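The dating formula above is simple enough to sketch directly. The function below applies T = Ks/2λ and converts the result to millions of years; the function and dictionary names are illustrative, and the example uses the Ks value reported for the AtENODL17/AtENODL19 pair in Table 2 together with the Arabidopsis rate quoted in the text.

```python
# Synonymous substitution rates (per site per year) quoted in the text.
RATES = {"arabidopsis": 1.5e-8, "maize": 6.5e-9}


def duplication_age_mya(ks, rate):
    """Estimate duplication age in millions of years as T = Ks / (2 * lambda)."""
    if rate <= 0:
        raise ValueError("substitution rate must be positive")
    return ks / (2.0 * rate) / 1e6


# Ks for AtENODL17/AtENODL19 as listed in Table 2.
age = duplication_age_mya(0.70102, RATES["arabidopsis"])
print(f"{age:.2f} million years")  # 23.37 million years
```

The factor of two reflects that substitutions accumulate independently along both copies after the duplication.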

Microarray-Based Expression Analysis

We used the Plant Expression Database (PLEXdb; Dash et al., 2012) for expression analyses of maize PC genes. One experiment (ZM37), contributed by the Kaeppler group (Sekhon et al., 2011), was selected for this study. Expression data from 34 selected tissues were normalized gene-wise in the Genesis (v 1.7.6) program (Sturn et al., 2002).
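The text does not say which gene-wise normalization was applied. As a minimal sketch, assuming it means centering each gene's expression values on zero and scaling them to unit standard deviation across tissues (a common choice for expression heat maps, but an assumption here), the operation looks like this. The input values are made up for illustration.

```python
from statistics import mean, pstdev


def normalize_gene(values):
    """Z-score one gene's expression values across tissues.

    Assumption: population standard deviation is used; the actual
    software setting is not stated in the text.
    """
    mu = mean(values)
    sd = pstdev(values)
    if sd == 0:
        # A gene with identical values everywhere carries no pattern.
        return [0.0 for _ in values]
    return [(v - mu) / sd for v in values]


# Hypothetical log-scale signals for one gene in three tissues.
normalized = normalize_gene([1.0, 2.0, 3.0])
print(normalized)
```

After this step every gene has the same scale, so tissue-to-tissue patterns can be compared across genes with very different absolute signal.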

Plant Materials and Treatment

We used 1-week-old maize (Zea mays L. inbred line B73) seedlings to examine the expression patterns of PC genes under salt and drought stresses. Plants were grown in a plant growth chamber at 23 ± 1°C with a 14 h light/10 h dark photoperiod. Control (CK) seedlings were grown with normal irrigation. For salt treatment, the maize seedlings were kept in 150 mM NaCl for 24 h. For drought treatment, the seedlings were dried between folds of tissue paper at 23 ± 1°C for 3 h. Each treatment was performed with three replicates.

RNA Isolation and Quantitative Real-Time PCR (QRT-PCR) Analysis

A Trizol total RNA extraction kit (Sangon, Shanghai, China) was used to extract total RNA. Next, Moloney murine leukemia virus (M-MLV) reverse transcriptase (TaKaRa, Dalian, China) was used to perform reverse transcription. Triplicate quantitative assays were performed using SYBR Green Master Mix (TaKaRa) with an ABI 7500 sequence detection system. Nine maize PC genes were randomly selected for real-time quantitative reverse transcription polymerase chain reaction (qRT-PCR) analysis. The gene-specific primers (Table 3) were synthesized by Sangon. The expression level of the Actin 1 (GRMZM2G126010) gene was used as a reference. The 2^(−ΔΔCT) method (Livak and Schmittgen, 2001) was used to calculate the relative expression levels of the PC genes.
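For readers unfamiliar with the 2^(−ΔΔCT) calculation, the sketch below shows the arithmetic for one target gene normalized to a reference gene. The function name and the CT values are hypothetical and serve only to illustrate the formula.

```python
def relative_expression(ct_target_treated, ct_ref_treated,
                        ct_target_control, ct_ref_control):
    """Relative expression by the 2^(-ddCT) method.

    dCT  = CT(target) - CT(reference), computed per condition
    ddCT = dCT(treated) - dCT(control)
    """
    delta_treated = ct_target_treated - ct_ref_treated
    delta_control = ct_target_control - ct_ref_control
    delta_delta = delta_treated - delta_control
    return 2.0 ** (-delta_delta)


# Made-up CT values: the target needs two more cycles after treatment
# while the reference gene is unchanged.
fold = relative_expression(26.0, 18.0, 24.0, 18.0)
print(fold)  # 0.25
```

A value below 1 corresponds to down-regulation relative to the control, and a value above 1 to up-regulation.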

Results and Discussion

Identification of PC Multigene Family in Plants

Phytocyanins are ancient, plant-specific blue copper proteins that function as electron transporters. Although several studies (Mashiguchi et al., 2009; Showalter et al., 2010; Ma et al., 2011; Li et al., 2013) have characterized plant PCs during the past decade, work on this gene family remains scarce. To identify PC multigene families in other plant species, we used Arabidopsis, rice and B. rapa PC proteins as queries in a genome-wide search of eight genomes in Viridiplantae. The returned sequences were confirmed as encoding PCs by Pfam (Punta et al., 2012) searches for the plastocyanin-like domain (PCLD) signature conserved in other PC proteins. Because the Arabidopsis and rice PCs have already been studied systematically with bioinformatic methods (Ma et al., 2011; Li et al., 2013), the previously published data were also used for deeper analysis. In total, 465 PC genes were identified from 10 plants in the Phytozome database (Table 1). The number of PC genes ranged from 1 to 89 across species (Table 1): the soybean genome contains the most (89), whereas Chlamydomonas has only one. We identified 60 and 77 putative PC genes in maize and poplar, respectively. Poplar has about twice as many PC genes as Arabidopsis, whereas rice and maize have numbers similar to that of poplar. By searching the Genome database of NCBI, we found that the poplar, Arabidopsis and maize genomes contain 42,577, 33,583, and 39,454 genes, respectively, which is 39.4, 9.9, and 29.2% more than rice (30,534). This implies that the number of PCs is not proportional to the size of the genome as measured by gene number. Other forces must therefore drive the changes in the size of this gene family among plant species.

Table 1

| Lineage | Organism | Genome size (Mb) | No. of predicted genes | No. of PC genes |
|---|---|---|---|---|
| Algae | Chlamydomonas reinhardtii | 120.41 | 14488 | 1 |
| Moss | Physcomitrella patens | 477.95 | 35936 | 28 |
| Lycophytes | Selaginella moellendorffii | 212.5 | 34782 | 20 |
| Dicots | Arabidopsis thaliana | 119.67 | 33583 | 38 |
| | Populus trichocarpa | 485.67 | 42577 | 77 |
| | Vitis vinifera | 486.26 | 28268 | 41 |
| | Solanum lycopersicum | 781.51 | 27466 | 49 |
| | Glycine max | 973.49 | 50202 | 89 |
| Monocots | Oryza sativa | 382.78 | 30534 | 62 |
| | Zea mays | 2065.7 | 39454 | 60 |
| Total | | | | 465 |

PC genes identified in 10 sequenced plants.

The data come from www.ncbi.nih.gov/genome/.
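The comparison of gene counts and PC family sizes can be recomputed from the Table 1 values. The sketch below derives how much larger each gene set is than that of rice and how many PC genes occur per 1,000 predicted genes; the function names are illustrative, and the per-1,000 measure is introduced here only to make the lack of proportionality easy to see.

```python
# (predicted genes, PC genes) taken from Table 1.
TABLE1 = {
    "Populus trichocarpa": (42577, 77),
    "Arabidopsis thaliana": (33583, 38),
    "Zea mays": (39454, 60),
    "Oryza sativa": (30534, 62),
}

RICE_GENES = TABLE1["Oryza sativa"][0]


def percent_more_than_rice(genes):
    """Percentage by which a gene count exceeds the rice gene count."""
    return (genes / RICE_GENES - 1.0) * 100.0


def pcs_per_thousand_genes(genes, pcs):
    """PC genes per 1,000 predicted genes (illustrative density measure)."""
    return 1000.0 * pcs / genes


for species, (genes, pcs) in TABLE1.items():
    print(species,
          f"{percent_more_than_rice(genes):.2f}% vs rice,",
          f"{pcs_per_thousand_genes(genes, pcs):.2f} PCs per 1,000 genes")
```

If PC number scaled with gene number, the per-1,000 values would be roughly equal; instead rice, with the smallest gene set of the four, has the highest density.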

Structural Analysis of the Putative PC Proteins

To further investigate the structural characteristics of PC proteins, we used the bioinformatics tools described in the Materials and Methods section to predict the AG glycomodules, SPs, GPI-anchor signals (GASs), and N-glycosylation sites of PCs. Our results (Supplementary Table S1) indicated that 292 PCs were predicted to contain an N-terminal SP required for targeting to the endoplasmic reticulum. In addition, 217 PCs were predicted to have GASs responsible for plasma membrane localization. The subcellular localization of plant PCs has been found to correlate with their specific functions. For example, AtSC3/AtBCB, an Arabidopsis blue copper binding protein, was strongly localized in the plasma membrane and induced by aluminum stress and oxidative stress, suggesting that plant PCs may participate in some abiotic stress responses (Ezaki et al., 2001, 2005). Additionally, PC proteins that accumulate in the sieve element plasma membrane may be involved in determining reproductive potential (Khan et al., 2007). Moreover, 281 PCs had putative AG glycomodules in the (Pro, Ala, Ser, Thr)-rich region. Because they carry both AG glycomodules and SPs, these 281 PCs might be AGPs. According to the distribution of the SP, PCLD, AGP-like region (ALR) and GAS, the PCs were separated into ten types (Figure 1). Type I PCs had the typical properties: an N-terminal SP, a PCLD, an ALR, and a C-terminal GAS. Type II PCs lacked a GAS but were otherwise similar to type I. Both the GAS and the ALR were absent from type VI, VIII, and IX PCs. Interestingly, type III, IV, VIII, and X PCs possessed two PCLDs, and type V had three PCLDs.

Domain repeats are usually thought to evolve through recombination events and intragenic duplication (Björklund et al., 2006). The creation of new multi-domain architectures is an important mechanism that allows an organism to expand its repertoire of cellular functions, such as transcriptional regulation, protein transport and assembly (Andrade et al., 2001; D’Andrea and Regan, 2003; Weiner et al., 2006). Furthermore, protein domain repeats may constitute a source of variability. In the human genome, duplications are more common in genes containing repeated domains than in genes without them (Björklund et al., 2010). Domain repetition is important in evolution because it provides a path by which proteins can evolve through removing or adding functionally similar or distinct blocks (Light et al., 2012). In this study, we identified multiple PCLDs in some PCs. The presence of PCLD repeats contributes to the complexity of this gene family. Their effect on the function of PC proteins remains to be examined, but our findings suggest that PCLD repeats may play an important role in PC protein evolution.

FIGURE 1

Origin and Contrasting Changes in the Numbers of Plant PC Genes

It has been suggested that Chlorophycean algae represent the primitive lineage in Viridiplantae from which all land plants evolved (Misumi et al., 2008). The earliest PCs possibly originated about 1 billion years ago in algae (Merchant et al., 2007; Misumi et al., 2008). Our search for PCs in Chlamydomonas reinhardtii found only one member. Therefore, the origin of the plant PC genes can be traced to ancient algae. The PC gene family appears to have expanded through duplication events. For example, Physcomitrella patens has 28 PC genes, whereas soybean has 89 paralogous sequences, about 19% of the 465 identified PCs, which might result from at least three whole-genome duplications (Schmutz et al., 2010). Expansion and conservation of a gene family during evolution imply important roles in the adaptation of organisms to their environment (Cao et al., 2011; Cao and Shi, 2012). We next estimated the number of PC genes in each MRCA to better understand how this gene family has evolved in Viridiplantae. Reconciliation of the species phylogeny with the gene trees suggested that one ancestral PC gene existed in the MRCA of Viridiplantae. Furthermore, we identified 32 orthologous genes in the Embryophyte MRCA and 44 in the MRCA of Tracheophyte (Figure 2). The number of PCs continued to increase from the early land plants (P. patens) to the angiosperms. Eudicot ancestral PCs expanded significantly once more after the separation from monocot species about 145 million years ago (Xu et al., 2009): we identified about 109 ancestral PC genes in the MRCA of eudicots. After that, many PC genes were lost in the eudicots, and the PC family was reduced in all analyzed eudicot species relative to the eudicot MRCA. For example, the number of PCs decreased by approximately 65.1 and 55% in Arabidopsis and tomato, respectively.

In contrast, compared with the number of ancestral PC genes, the family has expanded in all extant species, and this expansion was uneven among species. For instance, there are 77, 60, 41, and 28 genes in poplar, maize, grape, and moss, respectively, whereas the estimated number of genes in the MRCA of Viridiplantae is seven. Therefore, poplar, maize, grape and moss have gained 70, 53, 34 and 21 genes, respectively, since their splits. The number of genes gained in the soybean lineage is much greater than in the other lineages.
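The loss and gain figures in this section follow from simple arithmetic on the counts given in the text, which the sketch below reproduces. The constant and function names are illustrative; the ancestral counts (109 for the eudicot MRCA, seven for the baseline used in the gain calculation) are taken from the text as stated.

```python
EUDICOT_MRCA_PCS = 109   # ancestral PC genes in the eudicot MRCA (from the text)
ANCESTRAL_BASELINE = 7   # ancestral count used in the text's gain calculation

EXTANT_PCS = {"poplar": 77, "maize": 60, "grape": 41, "moss": 28,
              "arabidopsis": 38, "tomato": 49}


def percent_lost(ancestral, extant):
    """Percentage decrease from the ancestral to the extant gene count."""
    return 100.0 * (ancestral - extant) / ancestral


def genes_gained(extant, ancestral):
    """Net number of genes gained relative to the ancestral count."""
    return extant - ancestral


print(f"{percent_lost(EUDICOT_MRCA_PCS, EXTANT_PCS['arabidopsis']):.1f}")  # 65.1
print(f"{percent_lost(EUDICOT_MRCA_PCS, EXTANT_PCS['tomato']):.1f}")       # 55.0

gains = {sp: genes_gained(EXTANT_PCS[sp], ANCESTRAL_BASELINE)
         for sp in ("poplar", "maize", "grape", "moss")}
print(gains)  # {'poplar': 70, 'maize': 53, 'grape': 34, 'moss': 21}
```

These are net changes between two counts; the reconciliation analysis itself infers individual duplication and loss events on the tree, which this arithmetic does not attempt.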

FIGURE 2

Chromosomal Distribution and Duplication Patterns of PC Genes in Plants

Gene duplication, which usually occurs via segmental duplication, tandem duplication and retrotransposition, plays important roles in organismal evolution (Chen et al., 2014; Cao and Li, 2015). To identify the duplication mechanisms of PC genes, we examined their genomic distribution in Arabidopsis and maize as examples. PC genes are dispersed throughout the Arabidopsis and maize genomes (Figure 3). About 79.5 and 96.7% of PC genes are located on duplicated chromosomal segments in Arabidopsis and maize, respectively. Among the identified duplication events, 5 of 11 pairs (AtENODL14/AtENODL15, AtENODL5/AtENODL6, AtUC4/AtUC5, AtENODL1/AtENODL2, and AtENODL11/AtENODL12) in Arabidopsis and 7 of 20 pairs (ZmUC6/ZmUC10, ZmUC13/ZmUC14, ZmSC1/ZmSC2, ZmUC22/ZmPLC2, ZmENODL4/ZmENODL19, ZmENODL12/ZmENODL21, ZmENODL16/ZmENODL24) in maize have been retained (Figure 3). The evolutionary dates of these duplicated PC genes were also estimated (Table 2). The results indicated that the duplication events for six Arabidopsis pairs and seven maize pairs occurred within the past 19.73–28.58 million years and 11.72–21.16 million years, respectively (Table 2). These periods coincide with the secondary large-scale genome duplications in Arabidopsis and maize (Gaut et al., 1996; Koch et al., 2000). We also observed some earlier segmental duplication events, dated to about 41.63–58.33 MYA, in the PCs of Arabidopsis (AtENODL1/AtENODL2 and AtENODL11/AtENODL12) and maize (ZmUC3/ZmUC23 and ZmUC22/ZmPLC2), close to or following the origin of the grasses (Kellogg, 2001). Interestingly, about 31.67% of PC genes were tandemly clustered in maize, and one tandem cluster (AtUC7-AtUC3) was identified in Arabidopsis (Figure 3), suggesting that tandem duplication is another factor in generating members of this family. In summary, segmental duplication and tandem duplication both contributed to the expansion of the PC gene family.

Table 2

| Paralogous pairs | Ka | Ks | Ka/Ks | Duplication types | Date (million years ago) |
|---|---|---|---|---|---|
| AtENODL17/AtENODL19 | 0.18395 | 0.70102 | 0.26243 | Retrotransposition | 23.37 |
| AtENODL3/AtENODL4 | 0.13741 | 0.31929 | 0.43036 | Retrotransposition | 10.64 |
| AtENODL14/AtENODL15 | 0.18437 | 0.70152 | 0.26282 | Segmental duplication | 23.38 |
| AtENODL5/AtENODL6 | 0.22558 | 0.77783 | 0.29001 | Segmental duplication | 25.93 |
| AtUC4/AtUC5 | 0.31272 | 0.85669 | 0.36503 | Segmental duplication | 28.56 |
| AtENODL1/AtENODL2 | 0.46645 | 1.45391 | 0.32083 | Segmental duplication | 48.46 |
| AtENODL11/AtENODL12 | 0.38216 | 1.36091 | 0.28081 | Segmental duplication | 45.36 |
| AtSC1/AtSC2 | 0.19307 | 0.34871 | 0.55367 | Retrotransposition | 11.63 |
| At1g45063/At3g53330 | 0.37509 | 0.71432 | 0.5251 | Retrotransposition | 23.81 |
| AtENODL22/AtPC1 | 0.72483 | 3.00941 | 0.24085 | Retrotransposition | 100.31 |
| AtUC3/AtUC7 | 0.22650 | 0.59184 | 0.3827 | Tandem duplication | 19.73 |
| ZmUC6/ZmUC10 | 0.05401 | 0.18988 | 0.28444 | Segmental duplication | 17.26 |
| ZmUC11/ZmUC24 | 0.32393 | 0.50672 | 0.63927 | Tandem duplication | 46.07 |
| ZmUC13/ZmUC14 | 0.10750 | 0.18573 | 0.57879 | Segmental duplication | 16.88 |
| ZmUC16/ZmPLC3 | 0.67658 | 0.88356 | 0.76574 | Retrotransposition | 80.32 |
| ZmUC8/ZmUC19 | 0.49874 | 0.74853 | 0.66629 | Retrotransposition | 68.05 |
| ZmSC4/ZmSC5 | 0.02184 | 0.01847 | 1.18246 | Retrotransposition | 1.68 |
| ZmSC1/ZmSC2 | 0.11703 | 0.23277 | 0.50277 | Segmental duplication | 21.16 |
| ZmUC1/ZmUC12 | 0.09926 | 0.15513 | 0.63985 | Tandem duplication | 14.10 |
| ZmUC3/ZmUC23 | 0.40586 | 0.54119 | 0.74994 | Retrotransposition | 41.63 |
| ZmUC18/ZmUC26 | 0.80249 | 0.91956 | 0.87269 | Retrotransposition | 70.74 |
| ZmENODL20/ZmENODL22 | 0.64824 | 0.94528 | 0.68577 | Retrotransposition | 85.93 |
| ZmUC22/ZmPLC2 | 0.43753 | 0.64158 | 0.68196 | Retrotransposition | 58.33 |
| ZmPLC1/ZmUC9 | 0.05469 | 0.14448 | 0.37853 | Retrotransposition | 13.13 |
| ZmUC4/ZmUC5 | 0.06773 | 0.12898 | 0.52512 | Tandem duplication | 11.72 |
| ZmENODL2/ZmENODL25 | 0.56183 | 0.92832 | 0.60521 | Retrotransposition | 84.39 |
| ZmENODL4/ZmENODL19 | 0.17464 | 0.33073 | 0.52804 | Segmental duplication | 30.07 |
| ZmENODL7/ZmENODL13 | 0.0335 | 0.07531 | 0.44483 | Retrotransposition | 6.85 |
| ZmENODL3/ZmENODL6 | 0.58564 | 0.95152 | 0.61548 | Segmental duplication | 86.50 |
| ZmENODL12/ZmENODL21 | 0.07988 | 0.15477 | 0.51612 | Segmental duplication | 14.07 |
| ZmENODL16/ZmENODL24 | 0.11364 | 0.31901 | 0.35623 | Segmental duplication | 29.00 |

Inference of duplication time of PC paralogous pairs in Arabidopsis and maize.

FIGURE 3

Similar expansion patterns were also found in Oryza sativa and B. rapa PC genes (Ma et al., 2011; Li et al., 2013). In the rice genome, 20 of 62 OsPC genes arose from segmental duplications, and in B. rapa, 63 of 84 BrPC genes were attributed to segmental duplications. This indicates that segmental duplication contributes to the expansion of the PC genes in these plants. Tandem duplication is an important mechanism that rapidly generates new copies in clusters through unequal recombination or replication slippage (Anderson and Roth, 1977; Blanc and Wolfe, 2004; Cannon et al., 2004; Thomas, 2005). Initially, tandemly duplicated genes have similar sequences and functions; during subsequent evolution, however, they tend to diverge in structure and expression pattern through changes in cis- and trans-acting effects, DNA sequences, regulatory networks, and chromatin modifications (Charon et al., 2012). Several previous studies have investigated such divergence between duplicate genes (Makova and Li, 2003; Li et al., 2005; Ganko et al., 2007). During evolution, some duplicated genes maintained similar functions, whereas others gained new functions (neofunctionalization), subdivided their functions (subfunctionalization), or lost them (pseudogenization; Pinyopich et al., 2003; Franzke et al., 2010; Wang and Paterson, 2011).

Plants cannot escape a changing environment. Genes associated with stress defense may therefore need to expand for plants to withstand environmental stimuli. Previous studies have indicated that tandemly duplicated genes are often involved in responses to environmental stimuli or stress in plants (Leister, 2004; Maere et al., 2005; Fang et al., 2012). Our results also indicated that about 31.67 and 29.03% of PC genes were tandemly clustered in the monocots maize and rice, respectively, and stress responses have often been associated with PC proteins (Ezaki et al., 2001, 2005; Ozturk et al., 2002; Ma et al., 2011; Wu et al., 2011). Amplification of the PC genes by tandem duplication in maize and rice may therefore serve as a mechanism for protecting plants from harmful stresses, which may be crucial for adaptation to different environments. In contrast, only one tandem cluster (AtUC7-AtUC3) was identified in Arabidopsis, and no tandemly duplicated PC genes were identified in another eudicot, B. rapa (Li et al., 2013), implying that this gene family expanded in different ways in the monocots maize and rice and in the eudicots Arabidopsis and B. rapa.

We also found that the Ka/Ks values differed considerably among PC pairs (Table 2). Except for the ZmSC4/ZmSC5 gene pair, all estimated Ka/Ks values were less than 1, implying that most of the duplicated PC sequences are under purifying selection. The Ka/Ks value of the ZmSC4/ZmSC5 pair is 1.18246, indicating that positive selection might have acted on this gene pair after its duplication about 1.68 Mya. Gene or protein evolution is an outcome of the interplay between mutation and selection. During evolution, some functional regions reach an optimal state, so most mutations that alter their function are removed by purifying selection. When the environment changes, the resulting selective pressure drives such regions to change in ways that improve the fitness of the organism in the new environment. Detecting positive selection is therefore especially useful, because it can indicate a selective advantage in changing the gene or protein sequence. Such selective advantages are essential for understanding the functional regions of a gene or protein and shifts in function (Morgan et al., 2010). In this study, one duplicated gene pair (ZmSC4/ZmSC5) was identified as having undergone positive selection after duplication, implying that positive selection might have accelerated the functional divergence of duplicated genes during evolution. This might in turn facilitate the adaptation of the organism to different environments.
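The interpretation of Ka/Ks used above can be written as a small classifier. The sketch below uses the conventional thresholds (a ratio below 1 suggests purifying selection, above 1 positive selection, and exactly 1 neutral evolution); the function name is illustrative, and the example values come from Table 2.

```python
def selection_mode(ka, ks):
    """Return (Ka/Ks, label) using the conventional threshold of 1."""
    if ks <= 0:
        raise ValueError("Ks must be positive to form a ratio")
    ratio = ka / ks
    if ratio > 1.0:
        label = "positive"
    elif ratio < 1.0:
        label = "purifying"
    else:
        label = "neutral"
    return ratio, label


# Ka and Ks values for two maize pairs listed in Table 2.
pairs = {
    "ZmSC4/ZmSC5": (0.02184, 0.01847),
    "ZmUC6/ZmUC10": (0.05401, 0.18988),
}
for name, (ka, ks) in pairs.items():
    ratio, label = selection_mode(ka, ks)
    print(f"{name}: Ka/Ks = {ratio:.5f} ({label})")
```

A single pairwise ratio averages over the whole coding sequence, so a value above 1 is a pointer toward positive selection rather than proof of it.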

Motif Distribution and Intron Loss in Some Clades

We used Pfam (Punta et al., 2012) to identify the major domains of PC proteins in plants. All PC proteins possessed the PCLD signature that is essential for electron transport activity. To recognize smaller individual motifs, we used MEME (Bailey et al., 2006) to study the diversification of PC proteins in plants and identified eight distinct motifs (Supplementary Figure S1). Most members of each clade have similar motif compositions, suggesting functional conservation of the PC proteins within a clade (Supplementary Figure S1). The motif compositions of the PC proteins in each clade may therefore provide additional support for the phylogenetic analyses (Cao, 2012).

Exon–intron structure has been used to explain evolutionary relationships (Cao et al., 2010; Koralewski and Krutovsky, 2011; Chen and Cao, 2014). We therefore compared the exon–intron organization of the PCs in the 10 plants. Supplementary Figure S1 provides a detailed illustration of the intron positions in each PCLD. Our results indicated a conserved phase-1 intron insertion in the PCLD of most PC paralogs. Interestingly, this intron has been lost from the PCLD of some poplar genes (Supplementary Figure S1). Moreover, the genes lacking this intron tended to form species-specific clusters on poplar chromosomes 2, 6, and 15 (Supplementary Figure S1). The loss of the intron in these PCs was therefore likely associated with recent evolutionary expansion through retroposition and tandem duplication. To test this hypothesis, we identified the candidate donor gene using two criteria. First, a retrogene has a sequence identical to that of the donor gene immediately after retroposition, so the two cluster together in a phylogenetic tree (Kong et al., 2007). Second, because a retrogene arises by retroposition, it usually lacks specific introns present in the donor gene, so the donor gene can be identified from the presence or absence of the specific intron (Kong et al., 2007). Figure 4 shows an example of intron loss caused by gene expansion. Genes with the conserved intron (such as PtENODL13) usually occupy basal positions in the phylogenetic tree, whereas genes without the intron (such as the other 10 PCs in the clade shown in Figure 4) often form terminal clades. PtENODL13, which contains the conserved intron, is therefore likely their ancestor (the donor gene), from which the intronless retrogenes were generated by retroposition and tandem duplication.

FIGURE 4

Expression Profiles of the PC Genes in Maize

We first used publicly available microarray data to detect the spatiotemporal expression patterns of the maize PC genes. Expression profiles of the PC genes were mined at 34 different tissues. Only 54 probes were detected standing for the 54 ZmPC transcripts. The remaining six transcripts with no detectable expression signal are GRMZM2G463441, GRMZM2G136879, GRMZM2G148624, GRMZM2G047208, GRMZM2G085504, and AC209987.4_FGT010. The results indicated that these genes are expressed variously in different tissues, implying that they may be involved in many growth and developmental processes (Figure 5). Such as, most ZmPC genes of clade A showed high expression levels in the root, leaf and internodes, but low expression levels in the endosperm and embryo. In contrast, ZmPC genes in clade B presented the oppositive results compared with clade A. That is, most members of clade B displayed high expression levels in the embryo and endosperm, but showed low level expression in the leaf, root and internodes. This suggested that ZmPC genes in different clades may be involved in various biological processes. Some ZmPCs were also found to be highly expressed in some specific organs, such as, ZmSC5 in anthers, ZmUC3 and ZmUC23 in embryo, suggesting that they might be involved in the growth and development of these organs in maize. Similar results have also been observed in their homologs in Arabidopsis (AtENODL1/5/6/7/11/12/16, AtAGP6/11, and FLA3; Yu et al., 2005; Levitin et al., 2008; Li et al., 2010; Ma et al., 2011), rice (OsENODL9/14/16/17; Ma et al., 2011), and B. rapa (BrENODL22/27 and BrSCL8/9; Li et al., 2013), which were highly expressed in reproductive organs. The functions of some PC genes have been investigated in several studies. 
For example, a sieve element-specific expressed gene (AtENODL9) may be involved in determining reproductive potential in Arabidopsis (Khan et al., 2007); AtAGP6 and AtAGP11 are involved in pollen tube growth (Levitin et al., 2008); Over expression of the FLA3 led to short siliques with low seed set due to the reduced stamen filament, suggesting that the FLA3 gene is involved in microspore development and pollen intine formation (Li et al., 2010). Next, we also investigated the expression patterns of nine ZmPCs detected in maize seedlings subjected to salt and drought treatments by qRT-PCR. The primers were listed in Table 3. The analysis revealed that these genes are differently expressed under salt and drought conditions (Figure 6). Among the nine detected ZmPC genes, all members were down-regulated under salt treatment. And five members (ZmUC19, ZmSC2, ZmENODL10, ZmUC22, and ZmENODL13) were down-regulated under drought treatment. Some rice PC genes (OsENODL19, OsENODL12, OsUCL17, OsUCL20, OsUCL7, OsUCL8, and OsUCL18) have been investigated to be down-regulated by drought and/or salt stresses (Ma et al., 2011). Interestingly, we also found that ZmUC16/21 were significantly up-regulated after drought treatment, suggesting that these ZmPCs are more likely to play key roles in maize drought response. An increasing number of evidence has suggested that PCs may also function in stress responses. Previous studies reported that some PCs, such as, OsUCL23/26/27 (Ma et al., 2011), BrUCL6/16 (Li et al., 2013), were up-regulated under drought or salt stresses. Moreover, over-expression of AtBCB/AtSC3 could confer aluminum resistance in Arabidopsis (Ezaki et al., 2001, 2005). And BcBCP1 can enhance tolerance to osmotic stress when over-expressed in tobacco (Wu et al., 2011). The differential expression profiles of different PC family genes may imply diverse roles of plant response to stress. 
On the other hand, PC genes that are up-regulated under several abiotic stresses may be required for enhancing stress resistance. Taken together, these observations suggest that PCs may function in both developmental processes and stress responses.
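The up- and down-regulation calls discussed above come from qRT-PCR. As a minimal sketch of how such a fold change could be computed, the example below assumes the 2^(-ΔΔCT) method (Livak and Schmittgen, 2001, which appears in the reference list) with Actin1 as the reference gene (Table 3). The Ct values, the function names, and the twofold threshold are hypothetical illustrations, not data or settings from this study.

```python
def relative_expression(ct_target_treated, ct_ref_treated,
                        ct_target_control, ct_ref_control):
    """Fold change of a target gene (treated vs. control) by 2^(-ddCt)."""
    # Normalize the target gene to the reference gene in each condition.
    delta_treated = ct_target_treated - ct_ref_treated
    delta_control = ct_target_control - ct_ref_control
    # Compare the normalized values between conditions.
    delta_delta = delta_treated - delta_control
    return 2.0 ** (-delta_delta)


def classify(fold_change, threshold=2.0):
    """Label a fold change; the twofold cutoff is an assumed convention."""
    if fold_change >= threshold:
        return "up-regulated"
    if fold_change <= 1.0 / threshold:
        return "down-regulated"
    return "unchanged"


if __name__ == "__main__":
    # Hypothetical Ct values for one gene under stress vs. control.
    fold = relative_expression(
        ct_target_treated=26.0, ct_ref_treated=18.0,
        ct_target_control=24.0, ct_ref_control=18.0,
    )
    print(f"fold change = {fold:.2f} ({classify(fold)})")
```

A higher Ct in the treated sample, at an unchanged reference Ct, means less transcript, so the fold change falls below 1.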

Table 3

Primer name     Primer sequence (5′–3′)
ZmUC10-F        GACCACCACAACACCGTACA
ZmUC10-R        GCTAGCTGGACGATGACACA
ZmUC16-F        TGAAGATGCAGGTGCAAGTC
ZmUC16-R        AACGGAAAGTCTGCTTCGAC
ZmUC19-F        AACAACATCTCCGCCTTCC
ZmUC19-R        GTGCAGCAGAAGCAGCAGTA
ZmSC2-F         AAGAACTTCCGTGTCGGAGA
ZmSC2-R         GAGTTGGTGCAGCTGTCGTA
ZmUC21-F        GTTCGTGTACCCCAAGGAGA
ZmUC21-R        GCTTGTTGCAGATGAACCAC
ZmENODL10-F     CGACGACCCCTACAACAACT
ZmENODL10-R     CTTGTTGGATCGTGACATGG
ZmUC22-F        GACGTGCTCGTGTTCAGCTA
ZmUC22-R        GAAGTAGTGCGTGCCTCTGC
ZmENODL13-F     GCGTCGTCTTCTTCCTTGTC
ZmENODL13-R     GGTCGAGAACGAACTTGGTG
ZmENODL15-F     GAAGACCAGCTTCCAGATCG
ZmENODL15-R     GCTTGTCGTAGGAGGAGGTG
Actin1-F        GCTGAGCGGGAGATTGTCA
Actin1-R        CTTCCTGATATCAACATCA

Primers used in this study.
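As a quick sanity check on a primer list like Table 3, the length and GC content of each oligonucleotide can be tabulated. The sketch below is illustrative only: the helper name `gc_content` is hypothetical, only four primers are copied from the table, and the check is not a step described in this study.

```python
def gc_content(seq):
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = seq.upper()
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# A few primer sequences copied from Table 3 (5'-3').
primers = {
    "ZmUC10-F": "GACCACCACAACACCGTACA",
    "ZmUC10-R": "GCTAGCTGGACGATGACACA",
    "Actin1-F": "GCTGAGCGGGAGATTGTCA",
    "Actin1-R": "CTTCCTGATATCAACATCA",
}

if __name__ == "__main__":
    for name, seq in primers.items():
        print(f"{name}\t{len(seq)} nt\t{gc_content(seq):.1f}% GC")
```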

FIGURE 5

FIGURE 6

Summary

This study provides a comparative genomic analysis of the PC gene family in plants. The gene family expanded over the course of plant evolution. A structural analysis of PCs indicated that 292 PCs contained N-terminal secretion signals and 217 PCs were predicted to have GPI-anchor signals. Moreover, 281 PCs had putative arabinogalactan glycomodules and might be AGPs. Gene organization and motif composition are highly conserved within each clade, indicative of functional conservation. Most PC genes may have originated from tandem and segmental duplications. In addition, expression profiles of the maize PC genes provided insight into possible functional divergence. These results provide a basis for further functional and evolutionary studies of the PC gene family in plants.

Statements

Author contributions

JC designed, supervised, and carried out parts of the experiments and wrote the manuscript. XL, YV, and LD performed the experiments. XL, YV, and LD provided material, and helped in data analysis and writing. All authors read and approved the manuscript.

Acknowledgments

This project is supported by grants from the National Science Foundation of China (No. 31100923, 31200209), the National Science Foundation of Jiangsu Province (BK2011467), the Priority Academic Program Development of Jiangsu Higher Education Institutions (PAPD), and Jiangsu University “Youth Backbone Teacher Training Project” from 2012 to 2016.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Supplementary material

The Supplementary Material for this article can be found online at: http://journal.frontiersin.org/article/10.3389/fpls.2015.00515

Figure S1

Motif composition of PC proteins and exon–intron organization of PCLD in plants. The distribution of conserved motifs in the PC proteins is displayed. Positions of phase 0, 1, and 2 introns are shown with blue, bright green, and red vertical lines, respectively.

References

  • 1

    Anderson, R. P., and Roth, J. R. (1977). Tandem genetic duplications in phage and bacteria. Annu. Rev. Microbiol. 31, 473–505. doi: 10.1146/annurev.mi.31.100177.002353

  • 2

    Andrade, M. A., Perez-Iratxeta, C., and Ponting, C. P. (2001). Protein repeats: structures, functions, and evolution. J. Struct. Biol. 134, 117–131. doi: 10.1006/jsbi.2001.4392

  • 3

    Bailey, T. L., Williams, N., Misleh, C., and Li, W. W. (2006). MEME: discovering and analyzing DNA and protein sequence motifs. Nucleic Acids Res. 34, W369–W373. doi: 10.1093/nar/gkl198

  • 4

    Björklund, A. K., Ekman, D., and Elofsson, A. (2006). Expansion of protein domain repeats. PLoS Comput. Biol. 2:e114. doi: 10.1371/journal.pcbi.0020114

  • 5

    Björklund, A. K., Light, S., Sagit, R., and Elofsson, A. (2010). Nebulin: a study of protein repeat evolution. J. Mol. Biol. 402, 38–51. doi: 10.1016/j.jmb.2010.07.011

  • 6

    Blanc, G., and Wolfe, K. H. (2004). Functional divergence of duplicated genes formed by polyploidy during Arabidopsis evolution. Plant Cell 16, 1679–1691. doi: 10.1105/tpc.021410

  • 7

    Cannon, S. B., Mitra, A., Baumgarten, A., Young, N. D., and May, G. (2004). The roles of segmental and tandem gene duplication in the evolution of large gene families in Arabidopsis thaliana. BMC Plant Biol. 4:10. doi: 10.1186/1471-2229-4-10

  • 8

    Cao, J. (2012). The pectin lyases in Arabidopsis thaliana: evolution, selection and expression profiles. PLoS ONE 7:e46944. doi: 10.1371/journal.pone.0046944

  • 9

    Cao, J., Huang, J., Yang, Y., and Hu, X. (2011). Analyses of the oligopeptide transporter gene family in poplar and grape. BMC Genomics 12:465. doi: 10.1186/1471-2164-12-465

  • 10

    Cao, J., and Li, X. (2015). Identification and phylogenetic analysis of late embryogenesis abundant proteins family in tomato (Solanum lycopersicum). Planta 241, 757–772. doi: 10.1007/s00425-014-2215-y

  • 11

    Cao, J., and Shi, F. (2012). Evolution of the RALF gene family in plants: gene duplication and selection patterns. Evol. Bioinform. Online 8, 271–292. doi: 10.4137/EBO.S9652

  • 12

    Cao, J., Shi, F., Liu, X., Huang, G., and Zhou, M. (2010). Phylogenetic analysis and evolution of aromatic amino acid hydroxylase. FEBS Lett. 584, 4775–4782. doi: 10.1016/j.febslet.2010.11.005

  • 13

    Charon, C., Bruggeman, Q., Thareau, V., and Henry, Y. (2012). Gene duplication within the Green Lineage: the case of TEL genes. J. Exp. Bot. 63, 5061–5077. doi: 10.1093/jxb/ers181

  • 14

    Chen, K., Durand, D., and Farach-Colton, M. (2000). NOTUNG: a program for dating gene duplications and optimizing gene family trees. J. Comput. Biol. 7, 429–447. doi: 10.1089/106652700750050871

  • 15

    Chen, Y., and Cao, J. (2014). Comparative genomic analysis of the Sm gene family in rice and maize. Gene 539, 238–249. doi: 10.1016/j.gene.2014.02.006

  • 16

    Chen, Y., Hao, X., and Cao, J. (2014). Small auxin upregulated RNA (SAUR) gene family in maize: identification, evolution, and its phylogenetic comparison with Arabidopsis, rice, and sorghum. J. Integr. Plant Biol. 56, 133–150. doi: 10.1111/jipb.12127

  • 17

    Comeron, J. M. (1999). K-Estimator: calculation of the number of nucleotide substitutions per site and the confidence intervals. Bioinformatics 15, 763–764. doi: 10.1093/bioinformatics/15.9.763

  • 18

    D’Andrea, L. D., and Regan, L. (2003). TPR proteins: the versatile helix. Trends Biochem. Sci. 28, 655–662. doi: 10.1016/j.tibs.2003.10.007

  • 19

    Dash, S., Van Hemert, J., Hong, L., Wise, R. P., and Dickerson, J. A. (2012). PLEXdb: gene expression resources for plants and plant pathogens. Nucleic Acids Res. 40, D1194–D1201. doi: 10.1093/nar/gkr938

  • 20

    De Rienzo, F., Gabdoulline, R. R., Menziani, M. C., and Wade, R. C. (2000). Blue copper proteins: a comparative analysis of their molecular interaction properties. Protein Sci. 9, 1439–1454. doi: 10.1110/ps.9.8.1439

  • 21

    Dong, J., Kim, S. T., and Lord, E. M. (2005). Plantacyanin plays a role in reproduction in Arabidopsis. Plant Physiol. 138, 778–789. doi: 10.1104/pp.105.063388

  • 22

    Edgar, R. C. (2004). MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 32, 1792–1797. doi: 10.1093/nar/gkh340

  • 23

    Eisenhaber, B., Wildpaner, M., Schultz, C. J., Borner, G. H., Dupree, P., and Eisenhaber, F. (2003). Glycosylphosphatidylinositol lipid anchoring of plant proteins. Sensitive prediction from sequence- and genome-wide studies for Arabidopsis and rice. Plant Physiol. 133, 1691–1701. doi: 10.1104/pp.103.023580

  • 24

    Ezaki, B., Katsuhara, M., Kawamura, M., and Matsumoto, H. (2001). Different mechanisms of four aluminum (Al)-resistant transgenes for Al toxicity in Arabidopsis. Plant Physiol. 127, 918–927. doi: 10.1104/pp.010399

  • 25

    Ezaki, B., Sasaki, K., Matsumoto, H., and Nakashima, S. (2005). Functions of two genes in aluminium (Al) stress resistance: repression of oxidative damage by the AtBCB gene and promotion of efflux of Al ions by the NtGDI1 gene. J. Exp. Bot. 56, 2661–2671. doi: 10.1093/jxb/eri259

  • 26

    Fang, L., Cheng, F., Wu, J., and Wang, X. (2012). The impact of genome triplication on tandem gene evolution in Brassica rapa. Front. Plant Sci. 3:261. doi: 10.3389/fpls.2012.00261

  • 27

    Fedorova, M., van de Mortel, J., Matsumoto, P. A., Cho, J., Town, C. D., VandenBosch, K. A., et al. (2002). Genome-wide identification of nodule-specific transcripts in the model legume Medicago truncatula. Plant Physiol. 130, 519–537. doi: 10.1104/pp/006833

  • 28

    Franzke, A., Lysak, M. A., Al-Shehbaz, I. A., Koch, M. A., and Mummenhoff, K. (2010). Cabbage family affairs: the evolutionary history of Brassicaceae. Trends Plant Sci. 16, 108–116. doi: 10.1016/j.tplants.2010.11.005

  • 29

    Ganko, E. W., Meyers, B. C., and Vision, T. J. (2007). Divergence in expression between duplicated genes in Arabidopsis. Mol. Biol. Evol. 24, 2298–2309. doi: 10.1093/molbev/msm158

  • 30

    Gaut, B. S., Morton, B. R., McCaig, B. C., and Clegg, M. T. (1996). Substitution rate comparisons between grasses and palms: synonymous rate differences at the nuclear gene Adh parallel rate differences at the plastid gene rbcL. Proc. Natl. Acad. Sci. U.S.A. 93, 10274–10279. doi: 10.1073/pnas.93.19.10274

  • 31

    Giri, A. V., Anishetty, S., and Gautam, P. (2004). Functionally specified protein signatures distinctive for each of the different blue copper proteins. BMC Bioinformatics 5:127. doi: 10.1186/1471-2105-5-127

  • 32

    Goodstein, D. M., Shu, S., Howson, R., Neupane, R., Hayes, R. D., Fazo, J., et al. (2012). Phytozome: a comparative platform for green plant genomics. Nucleic Acids Res. 40, D1178–D1186. doi: 10.1093/nar/gkr944

  • 33

    Greene, E. A., Erard, M., Dedieu, A., and Barke, D. G. B. (1998). MtENOD16 and 20 are members of a family of phytocyanin-related early nodulins. Plant Mol. Biol. 36, 775–783. doi: 10.1023/A:1005916821224

  • 34

    Guss, J. M., Merritt, E. A., Phizackerley, R. P., and Freeman, H. C. (1996). The structure of a phytocyanin, the basic blue protein from cucumber, refined at 1.8 Å resolution. J. Mol. Biol. 262, 686–705. doi: 10.1006/jmbi.1996.0545

  • 35

    Hart, P. J., Nersissian, A. M., Herrmann, R. G., Nalbandyan, R. M., Valentine, J. S., and Eisenberg, D. (1996). A missing link in cupredoxins: crystal structure of cucumber stellacyanin at 1.6 Å resolution. Protein Sci. 5, 2175–2183. doi: 10.1002/pro.5560051104

  • 36

    Kato, T., Kawashima, K., Miwa, M., Mimura, Y., Tamaoki, M., Kouchi, H., et al. (2002). Expression of genes encoding late nodulins characterized by a putative signal peptide and conserved cysteine residues is reduced in ineffective pea nodules. Mol. Plant Microbe Interact. 15, 129–137. doi: 10.1094/MPMI.2002.15.2.129

  • 37

    Kellogg, E. A. (2001). Evolutionary history of the grasses. Plant Physiol. 125, 1198–1205. doi: 10.1104/pp.125.3.1198

  • 38

    Khan, J. A., Wang, Q., Sjölund, R. D., Schulz, A., and Thompson, G. A. (2007). An early nodulin-like protein accumulates in the sieve element plasma membrane of Arabidopsis. Plant Physiol. 143, 1576–1589. doi: 10.1104/pp.106.092296

  • 39

    Kim, S., Mollet, J. C., Dong, J., Zhang, K. L., Park, S. Y., and Lord, E. M. (2003). Chemocyanin, a small basic protein from the lily stigma, induces pollen tube chemotropism. Proc. Natl. Acad. Sci. U.S.A. 100, 16125–16130. doi: 10.1073/pnas.2533800100

  • 40

    Koch, M. A., Haubold, B., and Mitchell-Olds, T. (2000). Comparative evolutionary analysis of chalcone synthase and alcohol dehydrogenase loci in Arabidopsis, Arabis, and related genera (Brassicaceae). Mol. Biol. Evol. 17, 1483–1498. doi: 10.1093/oxfordjournals.molbev.a026248

  • 41

    Kong, H., Landherr, L. L., Frohlich, M. W., Leebens-Mack, J., Ma, H., and dePamphilis, C. W. (2007). Patterns of gene duplication in the plant SKP1 gene family in angiosperms: evidence for multiple mechanisms of rapid gene birth. Plant J. 50, 873–885. doi: 10.1111/j.1365-313X.2007.03097.x

  • 42

    Koralewski, T. E., and Krutovsky, K. V. (2011). Evolution of exon-intron structure and alternative splicing. PLoS ONE 6:e18055. doi: 10.1371/journal.pone.0018055

  • 43

    Leister, D. (2004). Tandem and segmental gene duplication and recombination in the evolution of plant disease resistance gene. Trends Genet. 20, 116–122. doi: 10.1016/j.tig.2004.01.007

  • 44

    Levitin, B., Richter, D., Markovich, I., and Zik, M. (2008). Arabinogalactan proteins 6 and 11 are required for stamen and pollen function in Arabidopsis. Plant J. 56, 351–363. doi: 10.1111/j.1365-313X.2008.03607.x

  • 45

    Li, J., Gao, G., Zhang, T., and Wu, X. (2013). The putative phytocyanin genes in Chinese cabbage (Brassica rapa L.): genome-wide identification, classification and expression analysis. Mol. Genet. Genomics 288, 1–20. doi: 10.1007/s00438-012-0726-4

  • 46

    Li, J., Yu, M., Geng, L. L., and Zhao, J. (2010). The fasciclin-like arabinogalactan protein gene, FLA3, is involved in microspore development of Arabidopsis. Plant J. 64, 482–497. doi: 10.1111/j.1365-313X.2010.04344.x

  • 47

    Li, W. H., Yang, J., and Gu, X. (2005). Expression divergence between duplicate genes. Trends Genet. 21, 602–607. doi: 10.1016/j.tig.2005.08.006

  • 48

    Light, S., Sagit, R., Ithychanda, S. S., Qin, J., and Elofsson, A. (2012). The evolution of filamin-a protein domain repeat perspective. J. Struct. Biol. 179, 289–298. doi: 10.1016/j.jsb.2012.02.010

  • 49

    Livak, K. J., and Schmittgen, T. D. (2001). Analysis of relative gene expression data using real-time quantitative PCR and the 2(-Delta Delta C(T)) method. Methods 25, 402–408. doi: 10.1006/meth.2001.1262

  • 50

    Louis, A., Muffato, M., and Roest Crollius, H. (2013). Genomicus: five genome browsers for comparative genomics in eukaryota. Nucleic Acids Res. 41(Database issue), D700–D705. doi: 10.1093/nar/gks1156

  • 51

    Ma, H., Zhao, H., Liu, Z., and Zhao, J. (2011). The phytocyanin gene family in rice (Oryza sativa L.): genome-wide identification, classification and transcriptional analysis. PLoS ONE 6:e25184. doi: 10.1371/journal.pone.0025184

  • 52

    Maere, S., De Bodt, S., Raes, J., Casneuf, T., Van Montagu, M., Kuiper, M., et al. (2005). Modeling gene and genome duplications in eukaryotes. Proc. Natl. Acad. Sci. U.S.A. 102, 5454–5459. doi: 10.1073/pnas.0501102102

  • 53

    Makova, K. D., and Li, W. H. (2003). Divergence in the spatial pattern of gene expression between human duplicate genes. Genome Res. 13, 1638–1645. doi: 10.1101/gr.1133803

  • 54

    Mashiguchi, K., Asami, T., and Suzuki, Y. (2009). Genome-wide identification, structure and expression studies, and mutant collection of 22 early nodulin-like protein genes in Arabidopsis. Biosci. Biotechnol. Biochem. 73, 2452–2459. doi: 10.1271/bbb.90407

  • 55

    Merchant, S. S., Prochnik, S. E., Vallon, O., Harris, E. H., Karpowicz, S. J., Witman, G. B., et al. (2007). The Chlamydomonas genome reveals the evolution of key animal and plant functions. Science 318, 245–250. doi: 10.1126/science.1143609

  • 56

    Misumi, O., Yoshida, Y., Nishida, K., Fujiwara, T., Sakajiri, T., Hirooka, S., et al. (2008). Genome analysis and its significance in four unicellular algae, Cyanidioschyzon merolae, Ostreococcus tauri, Chlamydomonas reinhardtii, and Thalassiosira pseudonana. J. Plant Res. 121, 3–17. doi: 10.1007/s10265-007-0133-9

  • 57

    Morgan, C. C., Loughran, N. B., Walsh, T. A., Harrison, A. J., and O’Connell, M. J. (2010). Positive selection neighboring functionally essential sites and disease-implicated regions of mammalian reproductive proteins. BMC Evol. Biol. 10:39. doi: 10.1186/1471-2148-10-39

  • 58

    Nersissian, A. M., Immoos, C., Hill, M. G., Hart, P. J., Williams, G., Herrmann, R. G., et al. (1998). Uclacyanins, stellacyanins, and plantacyanins are distinct subfamilies of phytocyanins: plant-specific mononuclear blue copper proteins. Protein Sci. 7, 1915–1929. doi: 10.1002/pro.5560070907

  • 59

    Ozturk, Z. N., Talamé, V., Deyholos, M., Michalowski, C. B., Galbraith, D. W., Gozukirmizi, N., et al. (2002). Monitoring large-scale changes in transcript abundance in drought- and salt-stressed barley. Plant Mol. Biol. 48, 551–573. doi: 10.1023/A:1014875215580

  • 60

    Petersen, T. N., Brunak, S., von Heijne, G., and Nielsen, H. (2011). SignalP 4.0: discriminating signal peptides from transmembrane regions. Nat. Methods 8, 785–786. doi: 10.1038/nmeth.1701

  • 61

    Pinyopich, A., Ditta, G. S., Savidge, B., Liljegren, S. J., Baumann, E., Wisman, E., et al. (2003). Assessing the redundancy of MADS-box genes during carpel and ovule development. Nature 424, 85–88. doi: 10.1038/nature01741

  • 62

    Poon, S., Heath, R. L., and Clarke, A. E. (2012). A chimeric arabinogalactan-protein promotes somatic embryogenesis in cotton cell culture. Plant Physiol. 160, 684–695. doi: 10.1104/pp.112.203075

  • 63

    Punta, M., Coggill, P. C., Eberhardt, R. Y., Mistry, J., Tate, J., Boursnell, C., et al. (2012). The Pfam protein families database. Nucleic Acids Res. 40, D290–D301. doi: 10.1093/nar/gkr1065

  • 64

    Schmutz, J., Cannon, S. B., Schlueter, J., Ma, J., Mitros, T., Nelson, W., et al. (2010). Genome sequence of the palaeopolyploid soybean. Nature 463, 178–183. doi: 10.1038/nature08670

  • 65

    Schultz, C. J., Rumsewicz, M. P., Johnson, K. L., Jones, B. J., Gaspar, Y. M., and Bacic, A. (2002). Using genomic resources to guide research directions. The arabinogalactan protein gene family as a test case. Plant Physiol. 129, 1448–1463. doi: 10.1104/pp.003459

  • 66

    Sekhon, R. S., Lin, H., Childs, K. L., Hansey, C. N., Buell, C. R., de Leon, N., et al. (2011). Genome-wide atlas of transcription during maize development. Plant J. 66, 553–563. doi: 10.1111/j.1365-313X.2011.04527.x

  • 67

    Showalter, A. M., Keppler, B., Lichtenberg, J., Gu, D. Z., and Welch, L. R. (2010). A bioinformatics approach to the identification, classification, and analysis of hydroxyproline-rich glycoproteins. Plant Physiol. 153, 485–513. doi: 10.1104/pp.110.156554

  • 68

    Soderlund, C., Bomhoff, M., and Nelson, W. M. (2011). SyMAP v3.4: a turnkey synteny system with application to plant genomes. Nucleic Acids Res. 39:e68. doi: 10.1093/nar/gkr123

  • 69

    Sturn, A., Quackenbush, J., and Trajanoski, Z. (2002). Genesis: cluster analysis of microarray data. Bioinformatics 18, 207–208. doi: 10.1093/bioinformatics/18.1.207

  • 70

    Tamura, K., Peterson, D., Peterson, N., Stecher, G., Nei, M., and Kumar, S. (2011). MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol. Biol. Evol. 28, 2731–2739. doi: 10.1093/molbev/msr121

  • 71

    Thomas, E. E. (2005). Short, local duplications in eukaryotic genomes. Curr. Opin. Genet. Dev. 15, 640–644. doi: 10.1016/j.gde.2005.09.008

  • 72

    Wang, X. Y., and Paterson, A. H. (2011). Genes conversion in angiosperm genomes with an emphasis on genes duplicated by polyploidization. Genes 2, 1–20. doi: 10.3390/genes2010001

  • 73

    Weiner, J. III, Beaussart, F., and Bornberg-Bauer, E. (2006). Domain deletions and substitutions in the modular protein evolution. FEBS J. 273, 2037–2047. doi: 10.1111/j.1742-4658.2006.05220.x

  • 74

    Wu, H. Y., Shen, Y., Hu, Y. L., Tan, S. J., and Lin, Z. P. (2011). A phytocyanin-related early nodulin-like gene, BcBCP1, cloned from Boea crassifolia enhances osmotic tolerance in transgenic tobacco. J. Plant Physiol. 168, 935–943. doi: 10.1016/j.jplph.2010.09.019

  • 75

    Xu, G., Ma, H., Nei, M., and Kong, H. (2009). Evolution of F-box genes in plants: different modes of sequence divergence and their relationships with functional diversification. Proc. Natl. Acad. Sci. U.S.A. 106, 835–840. doi: 10.1073/pnas.0812043106

  • 76

    Yu, H. J., Hogan, P., and Sundaresan, V. (2005). Analysis of the female gametophyte transcriptome of Arabidopsis by comparative expression profiling. Plant Physiol. 139, 1853–1869. doi: 10.1104/pp.105.067314

Keywords

phytocyanins, expansion, evolution, expression profile, maize

Citation

Cao J, Li X, Lv Y and Ding L (2015) Comparative analysis of the phytocyanin gene family in 10 plant species: a focus on Zea mays. Front. Plant Sci. 6:515. doi: 10.3389/fpls.2015.00515

Received

31 March 2015

Accepted

26 June 2015

Published

13 July 2015

Volume

6 - 2015

Edited by

Jun Yu, Beijing Institute of Genomics, China

Reviewed by

Sambasivam Periyannan, CSIRO, Australia; Tao Sun, Stanford University, USA

Copyright

*Correspondence: Jun Cao, Institute of Life Sciences, Jiangsu University, Xuefu Road 301, Jiangsu, Zhenjiang 212013, China

This article was submitted to Plant Genetics and Genomics, a section of the journal Frontiers in Plant Science
