ORIGINAL RESEARCH article
Genome-Wide Identification and Comparative Analysis of Cytosine-5 DNA Methyltransferase and Demethylase Families in Wild and Cultivated Peanut
- Biotechnology Research Center, Shandong Academy of Agricultural Sciences, Shandong Provincial Key Laboratory of Crop Genetic Improvement, Ecology and Physiology, Jinan, China
DNA methylation plays important roles in genome protection, regulation of gene expression and is associated with plants development. Plant DNA methylation pattern was mediated by cytosine-5 DNA methyltransferase and demethylase. Although the genomes of AA and BB wild peanuts have been fully sequenced, these two gene families have not been studied. In this study we report the identification and analysis of putative cytosine-5 DNA methyltransferases (C5-MTases) and demethylases in AA and BB wild peanuts. Cytosine-5 DNA methyltransferases in AA and BB wild peanuts could be classified in MET, CMT, and DRM2 groups based on their domain organization. This result was supported by the gene and protein structural characteristics and phylogenetic analysis. We found that some wild peanut DRM2 members didn't contain UBA domain which was different from other plants such as Arabidopsis, maize and soybean. Five DNA demethylase encoding genes were found in AA genome and five in BB genome. The selective pressure analysis showed that wild peanut C5-MTase genes mainly underwent purifying selection but many positive selection sites can be detected. Conversely, DNA demethylase genes mainly underwent positive selection during evolution. Additionally, the expression dynamic of cytosine-5 DNA methyltransferase and demethylase genes in different cultivated peanut tissues were analyzed. Expression result showed that cold, heat or PEG stress could influence the expression level of C5-MTase and DNA demethylase genes in cultivated peanut. These results are useful for better understanding the complexity of these two gene families, and will facilitate epigenetic studies in peanut in the future.
DNA methylation is a process that DNA methyltransferase transfer the methyl group from the methyl donor S-adenosylmethionine (SAMe) to cytosine residues and generate S-adenosylhomocysteine (SAH) (Lu and Mato, 2012). DNA methylation is a conserved epigenetic gene regulation mechanism in plants and animals (Zhong et al., 2014). In a narrow sense, DNA methylation is DNA cytosine methylation which is catalyzed by cytosine-5 DNA methyltransferase through transferring the methyl group from S-adenosyl methionine into the 5′ position of the pyrimidine (Bender, 2004). DNA methylation is associated with gene silencing, X chromosome inactivation in females, and maintenance of genomic integrity in eukaryotes (Heard and Disteche, 2006; Klose and Bird, 2006; Feinberg, 2007). In animal, DNA methylation mainly occurred in CG sequence context, but in plants, it also occurred in other two sequence contexts: CHG and CHH (where H is A, C, or T) (Xiao et al., 2006; Rai et al., 2008; Hsieh et al., 2009).
Small RNAs could direct methylation through recruitment of methyltransferase. It is called RNA-directed DNA methylation (RdDM) pathway (Matzke and Mosher, 2014). At first, the small interfering RNAs (siRNA) were found to be a group of small RNAs that could direct the occurrence of DNA methylation. For example, the silence of many transposable elements (TEs) was due to siRNA-directed DNA methylation (Slotkin et al., 2009). Now, it is confirmed that microRNAs (miRNA) and piwi-interacting RNAs (piRNA) could also direct RdDM (Wu et al., 2010).
In eukaryotes, DNA methylation dynamic is controlled by three DNA methylation mechanisms including, de novo DNA methylation, DNA methylation maintenance, and DNA de-methylation (Law and Jacobsen, 2010). In plants, the key enzyme for de novo DNA methylation is DOMAINS REARRANGED METHYLTRANSFERASE 2 (DRM2). De novo DNA methylation in all sequence contexts depends on RdDM pathway. In animal, the homolog of DRM2 is DNA methyltransferase 3 (Dnmt 3; Law and Jacobsen, 2010; Zhong et al., 2014). Two key enzymes for DNA methylation maintenance are methyltransferase (MET) and chromomethylase (CMT; Du et al., 2012; Garg et al., 2014). The DNA methylation of CG sequence context is mainly maintained by MET (Henderson et al., 2010). In Arabidopsis, the met1 mutation and MET1 antisense gene transgenic lines, displayed genome wide hypomethylation to compare with the wild type (Kim et al., 2008). CMT mainly maintains the methylation of CHH and CHG sequence contexts (Noy-Malka et al., 2014). The CMT was plant-specific methyltransferase. In animal, the homology of MET is DNA methyltransferase 1 (Dnmt1). In Arabidopsis, three CMT genes were found, namely AtCMT1, AtCMT2, and AtCMT3 (Garg et al., 2014). However, in moss Physcomitrella patens, only PpCMT, a homolog of AtCMT3, was identified (Noy-Malka et al., 2014).
All C5-MTases contain methyltransferase domain, but proteins contained methyltransferase domain are not necessary C5-MTases. DNMT2 type proteins which represent the transfer tRNA MTase (Garg et al., 2014) which were not included in our study.
Generally, level of DNA methylation is determined mainly by the common action of both C5-MTases and demethylases (Cao et al., 2014). In Arabidopsis, four demethylases family members were found, including repressor of silencing 1(ROS1), demeter (DME), demeter-like2 (DML2), and demeter-like3 (DML3). Mutation of these genes resulted in increased level of DNA methylation in all sequence contexts at specific genomic loci comparing with the wild type lines (Penterman et al., 2007). These four demethylases were also DNA glycosylase/lyase. ROS1, DME, DML2, and DML3 can excise 5-meC from all sequence contexts (Penterman et al., 2007; Zhu, 2009). Studies showed that ROS1 gene, a 5-meC DNA glycosylase/demethylase, active DNA de-methylation in extensive tissues and cells. But DME gene preferentially expressed in central cell and was more associated with allelic-specific DNA de-methylation (Zhu, 2009).
Peanut is an important crop plant for human nutrition and grown world widely. Peanut is allotetraploid (AABB, 4n = 4x = 40) originated from a single hybridization event between AA and BB wild species, and subsequently underwent spontaneous genome duplication (Kochert et al., 1996; Freitas et al., 2007; Moretzsohn et al., 2013). Recently, whole genome sequencing of the two progenitor wild peanut is complete, the assembled AA and BB genome sequences are available (http://peanutbase.org/). In the present study, we have identified and analyzed C5-MTases and demethylases from AA genome and BB genome.
The phylogenetic relationship among C5-MTases and demethylases in wild peanut has been analyzed. In addition, three-dimensional (3D) structure modeling of selected members was carried out to understand their function. The allotetraploid genome sequencing data and RNA-seq data showed that these C5-MTases and demethylases found in wild peanut could also be found in cultivated peanut. To understand the function of these genes in cultivated peanut, we carried out the gene expression analyses of cultivated C5-MTase and demethylase encoding genes in various tissues/developmental stages and stress conditions. Our analyses provide the framework for future functional studies of these important gene families in peanut.
Materials and Methods
Data Collection and Identification of C5-MTases and DNA Demethylases
Amino acid sequences of all Arabidopsis thaliana C5-MTases and DNA demethylases were collected from the National Center for Biotechnology Information (NCBI) database (http://www.ncbi.nlm.nih.gov/). All protein sequences of A. thaliana C5-MTases and DNA demethylases were used to search the hidden Markov model (HMM) of the conserved key domain of C5-MTases and DNA demethylases using the Pfam online software (version 27.0) (http://pfam.xfam.org/). Gene ID of all A. thaliana C5-MTases and DNA demethylases were showed in Table S1.
The conserved key domain of C5-MTase is the DNA methylase domain and the HMM ID of this domain is PF00145 in pfam database. The conserved key domain of DNA demethylases is the HhH-GPD domain and the HMM ID of this domain is PF00730 in pfam database. Additionally, majority of DNA demethylases contained RRM_DME domain and the HMM ID of this domain is PF15628 in pfam database. We employed the amino acid sequences of two HMMs as queries to identify all possible C5-MTase and DNA demethylase protein sequences in the AA and BB genome database (http://peanutbase.org/) genomes using the BLASTP program (E < 0.001). SMART online software (http://smart.embl-heidelberg.de/) and domain search tool of Pfam were used to confirm and classify each putative peanut C5-MTase or demethylase. Wild Peanut C5-MTases or DNA demethylases were named based on the best hit proteins by NCBI-blastp.
The basic information of C5-MTase and DNA demethylase genes in wild peanut were generated from peanut genome database (http://peanutbase.org/). Protein isoelectric point (pI) and molecular weight (Mw) of C5-MTases and DNA demethylases in peanut were analyzed using online Expasy software (http://web.expasy.org/compute_pi/).
The latest version of genomes, proteins, and cDNA sequences of other species were collected from plant ensemble database (http://plants.ensembl.org/index.html). The respective genome database are as follows: Arabidopsis thaliana genome database (http://www.plantgdb.org/AtGDB/), Glycine max genome database (http://www.plantgdb.org/GmGDB/), Zea may genome database (http://www.plantgdb.org/ZmGDB/), Solanum lycopersicum genome database (http://www.plantgdb.org/SlGDB/), Oryza sativa genome database (http://www.plantgdb.org/OsGDB/), Lotus japonicus genome database (http://www.plantgdb.org/LjGDB/), Medicago truncatula genome database (http://www.plantgdb.org/MtGDB/), Cajanus cajan genome database (http://gigadb.org/dataset/100028), Cicer arietinum genome database (http://nipgr.res.in/CGAP/home.php) and Physcomitrella patens genome database (http://www.cosmoss.org/).
Chromosomal Location and Gene Structure of Wild Peanut C5-MTase and DNA Demethylase Genes
The chromosomal location of wild peanut C5-MTases and DNA demethylase genes was obtained from peanut genome database. The chromosomal location map was plotted using Circos software (Krzywinski et al., 2009). The gene structures of the wild peanut DNA C5-MTase and demethylase genes were plotted using online software Gene Structure Display Server 2.0 (http://gsds.cbi.pku.edu.cn/) (Guo et al., 2007).
Multiple Sequence Alignments and Phylogenetic Analysis
Multiple sequence alignments using ClustalW2 online software (http://www.ebi.ac.uk/Tools/msa/clustalw2/) were performed using the full-length protein sequences of wild peanut C5-MTase and DNA demethylase. Neighbor-Joining (NJ) and Maximum likelihood (ML) trees were constructed using MEGA 4.0 with the aligned protein sequences (Tamura et al., 2007). To support the calculated relationship, 1000 bootstrap samples were generated.
In order to analyze the classes and subgroups of the peanut C5-MTases and DNA demethylase families, P. patens MET (UniProtKB ID: A9RVR7), and CMT (UniProtKB ID: A9ST18; Malik et al., 2012), 10 A. thaliana C5-MTases, 4 A. thaliana DNA demethylases, 7 S. lycopersicum C5-MTases, and 3 S. lycopersicum DNA demethylases (Cao et al., 2014), 7 Z. may C5-MTases (Candaele et al., 2014), 8 G. max C5-MTases, 8 L. japonicus C5-MTases, 6 O. sativa C5-MTases, 5 C. cajan C5-MTases, 6 C. arietinum C5-MTases, and 7 M. truncatula C5-MTases (Garg et al., 2014) (gene ID of all C5-MTases and DNA demethylases were showed in Table S1) were included in the phylogenetic analysis by generating a NJ tree (Poisson correction, pairwise deletion, and bootstrap = 1000 replicates).
Conserved Motif Analysis of C5-MTases and DNA Demethylases in Wild Peanut
Full length amino acid sequences of peanut C5-MTases and DNA demethylases were used by the MEME tool (Bailey et al., 2006) (http://meme.nbcr.net/meme/) to identify conserved motifs (Parameter setting: output motifs: 20; minimum motif width: 6; maximum motif width: 200). The conserved motif sequence information of wild peanut C5-MTase and DNA demethylase was listed in Table S2.
The templates for homology modeling were selected using the best hit in BLAST searches against the PDB database. Homology model of the protein was generated using SWISS-MODLE online software (http://www.swissmodel.expasy.org/interactive). At least 186 models for each protein were generated using “building model” engine and the best model was selected based on the best global model quality estimation (GMQE). QMEAN4 scoring was used to evaluate and check for the possible errors in homology model of the proteins.
Molecular Dynamic Simulations of Protein Domain
Molecular dynamics simulations were performed for 9 ns using the GROMACS package v. 4.5.3 (Van Der Spoel et al., 2005) and trajectories for protein domain in aqueous solution were analyzed to obtain structural and dynamic properties using the GROMACS analysis tools package. The interaction potential energy between the monomers of the dimers, root mean square deviation (RMSD), solvent accessible surface area (SASA), and collective motions (essential dynamics using g_covar and g_anaeig analysis in the GROMACS package) of the protein domain backbone during the simulation time.
Plant Material, Stress Treatments, and RNA Isolation
Peanut (Arachis hypogaea L.) cv. Luhua-14 was used in this experiment. Plant tissues including roots, stems, leaves, flowers, seeds, and 6 stages of gynophores [gynophore tip containing embryo; Stage 1 gynophores (S1): aerial grown with green or purple color; Stage 2 gynophores (S2): three day after soil penetration; Stage 3 gynophores (S3): nine days after soil penetration (Xia et al., 2013); Stage 4 gynophores (S4): 12 days after soil penetration; Stage 5 gynophores/pods (S5): 16 days after soil penetration; Stage 6 gynophores/pods (S6): 20 days after soil penetration] were used for gene expression analysis. All samples were cleaned quickly and immediately frozen in liquid nitrogen before stored at –80°C for RNA extraction and gene expression analysis. Three biological replicates of all tissues were prepared.
Twelve-day-old peanut seedlings were subjected to various abiotic stresses including 20% PEG 6000, cold (8°C) and heat treatment (42°C) treatments according to previously described methods (Chen et al., 2006). Leaf samples were collected at 0, 3, 6, 12, 24, and 48 h after treatment and immediately frozen in liquid nitrogen. Three independent biological replicates were used in this experiment.
Samples were incubated with 500 μl CTAB at 65°C for 15 min. Two hundred and fifty microliters of chloroform and water saturated phenol was added and mixed thoroughly. After centrifugation at 12,000 g for 10 min at 4°C, the supernatant was collected and equal volume of chloroform was added and mixed by vortex. After centrifugation at 12,000 g for 10 min at 4°C, the aqueous phase which containing the RNA was transferred to a clean tube and was precipitated with equal volume of isopropyl alcohol and freeze for 8 h at −20°C. RNA was precipitated by centrifugation at 12,000 g for 20 min at 4°C. RNA was washed with 75% ethanol, air dried, and re-suspended with RNAase-free H2O. For reverse transcription, the first-strand cDNA was synthesized with an oligo (dT) primer using a PrimeScript™ first-strand cDNA synthesis kit (TaKaRa).
Gene Expression Analysis
qRT-PCR was carried out using FastStart Universal SYBR Green Master (ROX) with a 7500 real-time PCR machine (ABI). The following PCR program was used: 95°C for 30 s, 40 cycles (95°C for 5 s, 60°C for 30 s). Data were quantified using the 2−ΔΔCt method based on Ct values of peanut C5-MTases and DNA demethylase genes and β-actin. The primers for qRT-PCR were provided in Table S3. Heat map of was generated based on the RPKM values using the Multiexperiment View software. PRKM values were based on transcriptome data (data not shown).
Promoter Cis-Acting Regulatory Element Analysis
Promoters of wild peanut DNA C5-MTase and demethylase encoding genes were analyzed with the online software Plantcare (http://bioinformatics.psb.ugent.be/webtools/plantcare/html/) to find the cis-acting regulatory element.
NLS Predicted and Sub-Cellular Localization
The nuclear localization signal in the proteins was analyzed using the online software cNLS Mapper (http://nls-mapper.iab.keio.ac.jp/cgi-bin/NLS_Mapper_form.cgi). The full-length coding sequences (CDs) of C5-MTase and DNA demethylase genes were amplified by reverse transcription PCR using gene-specific primers and cloned in pMD18-T plasmid (Takara). The CDs sequence of DRM2, CMT2, CMT3, and ROS1 were fused in-frame, upstream of GFP in pBI221-GFP vector. The primers for PCR were provided in Table S4.
The recombinant vectors and the pBI221-GFP control vector were transformed into onion epidermal cells, respectively, using the particle delivery system (Bio-Rad Biolistic PDS-1000/He). The transformed onion epidermal cells were cultured on Murashige and Skoog (MS) plates at 25°C for 24 h in dark. Subsequently, nuclei were stained with 1 μg/mL of 4′, 6-diamidino-2-phenylindole (DAPI) in phosphate-buffered saline for 10 min at room temperature (Sharma et al., 2013). The cultured transformed onion epidermal cells were visualized using a fluorescence microscope (Olympus Microsystems).
Syntenic and Selective Pressure Analysis
The analysis of synteny between AA and BB genomes was based on comparisons of the blocks of 100 kb chromosome containing C5-MTase or DNA demethylase genes. All wild peanut C5-MTase and DNA demethylase genes were set as the anchor points according to their chromosome locations. A syntenic block is defined as a region in which three or more conserved homology genes were presented. The homology gene pairs from blocks were identified by local all-vs.-all BLASTN (E < 10−20) (Sato et al., 2008; Lin et al., 2014).
The Ka, Ks and ω rates (Ka/Ks ratios) between gene pairs or gene families were calculated using the codeml program under PAML (phylogenetic analysis maximum likelihood) V. 4.7 software (Yang, 2007). This program contained six site models: M0, M1a (nearly neutral), M2a (positive selection), M3 (discrete), M7 (beta), and M8 (beta and ω). M0 and M3 comparison can be used to test ω values vary among sites. M0-M3, M1a-M2a, and M7-M8 comparisons can be used to find positive selection sites (Yang et al., 2000). The Bayes Empirical Bayes (BEB) was used to calculate the Bayesian posterior probability (BPP) of the codon sites under a positive selection (Yang et al., 2005).
Classification of C5-MTase and DNA Demethylase Genes in Wild Peanut
In our study, 14 C5-MTase coding genes were identified in wild peanut. The information of these peanut C5-MTase family members was showed in Table 1. Seven C5-MTase coding genes were discovered from AA genome and seven from BB genome. All domains and their positions in these protein sequences were list in Table S5. In wild peanut, C5-MTase gene length varied from 1740 to 4668 bp and protein length varied from 580 to 1556 aa.
From moss (P. patens) to leguminous (C. cajan, C. arietinum, G. max, M. truncatula, and L. japonicus) or other seed plants (A. thaliana, S. lycopersicum, O. sativa, and Z. may), C5-MTases can be divided into three groups (MET, CMT, and DRM2) based on domains located in the N-terminal region of DNA methylase domain in these C5-MTases. Wild peanut C5-MTase names were assigned based on names of best hit proteins by NCBI-blastp. MET group members contained one replication foci domain (RFD) and two bromo adjacent homology (BAH) domains. CMT group members contained one chromo domain (Chr) which located in DNA methylase domain, and one bromo adjacent homology (BAH) domain. DRM2 group members contained ubiquitin associated (UBA) domain in some species (Malik et al., 2012; Candaele et al., 2014; Cao et al., 2014; Garg et al., 2014). Wild peanut C5-MTases also can be divided into three groups based on the domains located in the N-terminal region of DNA methylase domain. AA genome wild peanut MET and CMT group included the MET-A, CMT1-A, CMT2-A, and CMT3-A. BB genome wild peanut MET and CMT group included the MET-B CMT1-B, CMT2-B, and CMT3-B.
AA genome wild peanut DRM2 group included DRM2-A, DRM2X1-A, and DRM2X2-A. BB genome wild peanut DRM2 group included DRM2-B, DRM2X1-B, and DRM2X2-B. Interestingly, not all wild peanut DRM2 members contained UBA domain (Table S5). Both DMR2-A and DRM2-B contained an extra incomplete DNA methylase domain. MET-A, CMT1-A, CMT2-A, CMT3-A, DRM2-A, DRM2X1-A, and DRM2X2-A in AA genome, respectively, were orthologous of MET-B, CMT1-B, CMT2-B, CMT3-B, DRM2-B, DRM2X1-B, and DRM2X2-B in BB genome. These C5-MTase gene sequences were extremely similar to their orthologous (Table S6). For example, MET-A protein and coding gene displayed 99% identity with MET-B protein and coding gene. This result reflected the high level of similarity between peanut AA and BB genomes. DRM2-A protein and coding gene displayed 99% identity with DRM2-B protein and coding gene. Other C5-Tases between these two diploid progenitors also showed high identity with their orthologous.
Ten DNA demethylases were identified in peanut, five from AA genome and five from BB genome. The information of these peanut DNA demethylase family members was showed in Table 1. All domains in these DNA demethylase sequences were listed in Table S5. All these DNA demethylase contained conserved HhH-GPD domain (Figure S1). ROS1-like-A and ROS1-like-B didn't contain RRM_DME domain. Some of these proteins contained Perm-CXXC domain (Table S5). The length of wild peanut DNA demethylase genes varied from 912 to 6180 bp, while the protein length varied from 301 to 2060 aa. Based on names of the best-hit proteins by NCBI-blastp, these 10 demethylases were named ROS1-A, ROS1-like-A, ROS1X2-A, DME-A, DME-like-A, ROS1-B, ROS1-like-B, ROS1X2-B, DME-B, and DME-like-B. ROS1-A, ROS1-like-A, ROS1X2-A, DME-A, and DME-like-A were identified in AA genome, while the orthologus ROS1-B, ROS1-like-B, ROS1X2-B, DME-B, and DME-like-B were identified in BB genome. These DNA demethylase genes and proteins were highly conserved, for example, the gene and protein sequences of ROS1-A displayed 96% identity with ROS1-B (Table S6).
Five C5-MTase genes from AA genome are distributed on A05 chromosome and others are distributed on A01 chromosome. Similarly five C5-MTase genes from BB genome are distributed on B05 chromosome, one is on B10 chromsome and others are on B01 chromosome. Most of the orthologous genes we identified are located on similar chromosome loci, for example CMT1-A (A05:3796815-3808030) and CMT1-B (B05:3729600-3739701) (Figure 1 and Table 1).
Figure 1. Genome location of peanut C5-MTase and demethylase genes. Blue represents AA genome chromosomes and green represents BB genome chromosomes.
Sequence Analysis of Wild Peanut C5-MTase and DNA Demethylase
We analyzed the protein and gene sequences of wild peanut C5-MTase and DNA demethylase. We predicted and verified motifs in C5-MTases and DNA demethylases of wild peanut and some other species including moss, monocots, dicots, and legumes. Totally, 20 conserved motifs (E ≤ 2.3) were found in C5-MTase family proteins. All C5-MTase members contained motif 5. Both MET group and CMT group members contained motif 1, 2, and 15. Only DRM2 group contained motif 4, 6, and 13. Based on motif type, proteins from same group were conserved, except PpCMT which contained only motif 5 and 11 (Figure S2A).
In DNA demethylase family, 20 conserved motifs (E ≤ 6.1e−23) were found. All DNA demethylases contained motif 5 and 14. Most DNA demethylases contained motif 1, 9, 16, and 17. Only DME-A and DME-B contained motif 8, 11, 13, and 18. ROS1-like-A and ROS1-like-B contained fewer conserved motifs that were different from other members. Each DNA demethylase was similar to its orthologous base on motif type and distribution (Figure S2B).
The methyltransferase domain of C5-MTases contained several conserved motifs (I, IV, VI, VIII, IX, and X) that play important roles for transferring methyl group from SAMe onto cytosine (Malone et al., 1995). In other legumes, the conserved sequences of these motifs have been identified (Garg et al., 2014). In wild peanut, we found that all C5-MTase family members contained these six motifs. The distribution order of motifs in wild peanut MET and DRM2 group members was different. But in peanut CMT group, the distribution order of motifs was the same as in MET group proteins. The motif distribution order in wild peanut C5-MTases was similar to other legumes (Figure S3).
Full-length protein sequences of C5-MTase and DNA demethylase from wild peanut and other 10 species (C. cajan, C. arietinum, P. patens, O. sativa, Z. may, A. thaliana, S. lycopersicum, G. max, Lotus japonicas, and M. truncatula) were used for construction of phylogenetic tree. The phylogenetic tree showed that C5-MTase family could be divided into MET, CMT, DRM2 groups. The evolution relationship between MET and CMT group was more close based on phylogenetic analysis. All members of MET and CMT group contained BAH domain. MET group could be divided into four subgroups, the legumes, dicots, monocots and moss subgroups. But in CMT or DRM2 group, the evolution relationship between these group members was complex (Figure 2).
Figure 2. Neighbor-joining phylogeny of C5-MTases. Different species were distinguished by different colors.
Wild peanut DNA demethylase could be divided into two groups. ROS1-like-A and ROS1-like-B were clustered into a single group showing that these two members were quite different from other members. The ROS1-A, ROS1-B, ROS1X2-A, and ROS1X2-B were clustered together (Figure S4). The exon-intron structures of C5-MTases and DNA demethylases supported their phylogenetic analysis result. In C5-MTases, exon-intron structure of members in one group or subgroup was similar to members in other group or subgroup. In DNA demethylases, the exon-intron structures of ROS1-like-A and ROS1-like-B were different from other DNA demethylase family members (Figure S5).
Promoters of all wild peanut C5-MTase and DNA demethylase genes (2000 bp upstream of CDs) were analyzed. Table S7 showed the cis-elements of these promoters associated with abiotic stresses. Most promoters of orthologous genes contained similar cis-elements, and only a small number of orthologous gene promoters contained different cis-elements. The promoter sequences of orthologous genes between these two diploid progenitors were not as conserved as the CDs sequences. For example, MET-B promoter contained a heat inducing CCAAT-box that was not detected in MET-A promoter. MET-B promoter contained a G-box but not in MET-A promoter. ROS1-A promoter didn't contain W-box that could be detected in ROS1-B promoter. Additionally, we found that most C5-MTase gene promoters contained cold inducing element but not in DNA demethylase gene promoters.
Synteny Analyses of C5-MTase and DNA Demethylase Genes in AA and BB Genomes
A syntenic block is defined as the region in which three or more conserved homologs (BLASTP E < 10−20) were located within a 100 kb region between genomes (Sato et al., 2008). To further investigate the potential evolutionary mechanisms of C5-MTases and DNA demethylase genes, we performed all-vs.-all local BLASTP to identify synteny blocks in peanut genome. Blocks of 100 kb chromosome containing C5-MTases and DNA demethylase genes were used for synteny analysis.
Blocks from AA and from BB genomes were inter-genomic synteny blocks. Both in AA and BB wild peanut, no intra-genomic synteny blocks were found for C5-MTase or DNA demethylase genes. Twelve pairs of inter-genomic synteny blocks contained C5-MTase or DNA demethylase genes were found in peanut genome. Figure 3 showed that high level of synteny was maintained between AA and BB genomes.
Figure 3. Syntenic chromosomal segments between flanking genes from AA genome and their orthologous genes from BB genome. Red arrows represent C5-MTase or demethylase gene, black arrows represent flanking genes.
Selective Pressure Analysis of Wild Peanut C5-MTase and DNA Demethylase Genes
To detect whether different groups of wild peanut C5-MTases were under different selective pressure, the hypothesis testing on CMT and DRM2 groups of C5-MTase was performed using site models. Because MET group only contained two genes that belonged to one orthologous pair, they could not be analyzed using site models. M0 showed that wild peanut C5-MTase family underwent purifying selection (ω = 0.82896). The discrete model M3 fit better than one-ratio model M0, suggesting that ω ratios varied among sites (Table S8; LRT, P < 0.05). Although M1a vs. M2a comparison didn't find positive selection site with a P < 0.05, M7 vs. M8 comparison detected many positive selection sites with a 0.5 < P < 0.9. These positive selection sites in C5-MTase genes indicated functional diversity and structural variation between different members.
In wild peanut CMT group, M2a (positive selection model) identified only nine positive selection sites. However, M1a vs. M2a comparison showed that the result of M2a model was not reliable (P > 0.9). The M7 vs. M8 comparison identified 48 positive selection sites (P < 0.1). The M0 suggested that both CMT genes (ω = 0.65814) and DRM2 genes (ω = 0.44354) underwent purifying selection. Only one positively selected site was detected by M2a in wild peanut DRM2 group. These results suggested that wild peanut DRM2 genes were highly conserved and the majority of sites were under purifying selection (Table S8).
M0 result suggested that DNA demethylase genes underwent positive selection (ω = 1.12428). Both M1a vs. M2a comparison and M7 vs. M8 comparison detected more than 200 positive selection sites (P < 0.05). M3 didn't found positive selection site. However, M0 vs. M3 comparison showed that result of M3 model was not reliable (LRT, P > 0.9) (Table S8).
3D Structural Features and Activity of BAH Domain of Peanut C5-MTases
3D structure analysis of MET-A, MET-B, CMT2-A, and CMT2-B showed that peanut MET and CMT members having similar structure to members from other legume species (Garg et al., 2014). The methyltransferase domain (in red) of CMT2-B which could combine SAH and the BAH domain (in blue) of CMT2-B could combine H3K9 methylated-lysine (Figure 4). In Figure 4B, the methyltransferase domain (in red) of MET-B, the BAH1 domain (in green) of MET-B and the BAH2 domain (in yellow) were indicated. In other legume MET members, the BAH1 domains contained seven conserved amino acids which formed an aromatic cage, while BAH domain of CMT contained three conserved amino acids forming an aromatic cage. The aromatic cage could recognize and capture methylated-lysine (Garg et al., 2014). All BAH1 domains of peanut MET group members contained aromatic cages. For example, aromatic cage of MET-A contained seven amino acids: 775(D), 783(I), 784(Y), 785(F), 786(V), 788(Y), and 789(M). All BAH2 domains of peanut MET group members didn't contain aromatic cages. All BAH domains of peanut CMT group members contained 3 conserved amino acids. For example, these 3 amino acids are 449(Y), 454 (W), and 456 (Y) in CMT2-A. In order to investigate the difference between BAH2 and BAH1 or CMT-BAH, we analyzed the stability of these proteins within GROMACS filed. The RMSD of MET-B BAH2 was higher than MET-B BAH1 and other peanut CMT-BAH when they were in solution for 9 ns (Figure 5). Tracking the cavity gap motion by essential dynamic analysis showed collective motion of MET-A and MET-B BAH2 were aberrant comparing with MET-A BAH1, MET-B BAH1, and CMT-BAH. This result suggested that MET-B BAH2 was less stable than BAH1. Therefore, BAH1 could combine methylated-lysine better (Maia et al., 2012).
Figure 4. 3D structure of MET-B and CMT2-B. (A) represents the 3D structure of MET-B and (B) represents the 3D structure of CMT2-B.
Figure 5. RMSD of peanut B domain from MET and CMT group members. “Y” axis represents RSDM (Ångstrom, A) and “X” axis represents time (picosecond, ps).
Expression of C5-MTase and DNA Demethylase Genes in Cultivated Peanut
C5-MTase and DNA demethylase play important roles in plant development through regulating the level of DNA methylation (Ecker, 2013; Eichten et al., 2013; Candaele et al., 2014; Zhong et al., 2014). For example, the extensive de-methylation occurring in endosperm genome can active the expression of many genes (Gehring et al., 2009; Hsieh et al., 2009; Zemach et al., 2010; Wang et al., 2015).
From the genome sequences we identified MET, CMT1-3, DRM2, DRM2X1, DRM2X2, ROS1, ROS1-like, ROS1X2, DME, and DME-like gene in cultivated peanut. These gene sequences were similar to their orthologous genes from AA and BB genomes (data not shown). To gain insights into the putative function of these genes in cultivated peanut, the expression patterns of these genes were investigated by qRT-PCR analysis.
The expression of MET was predominant detected in shoot and gynophore. The expression level of MET was the highest in Stage 1 gynophores (S1). The expression of CMT1 was also predominantly detected in shoot and gynophore. The expression of CMT2 was high in shoot and seed. CMT3 was expressed highly in shoot, root, and gynophore. The expression level of CMT3 was declined from S3 to S6 in the gynophore apex. The expression of DRM2 was ubiquitously in all tissues analyzed. Both DRM2X1 and DRM2X2 were strongly expressed in flowers (Figure 6). DME was predominantly expressed in shoot and root. The expression level of DME-like was high in root, shoot, leaf, flower, S2, and S3. ROS1 was highly expressed in flower and seed. ROS1-like and ROS1X2 showed a similar expression patterns in different tissues (Figure 6). Interestingly, the expression level of ROS1, ROS1-like, and ROS1X2 increased from S1 to S3. From S1 to S3, gynophores experienced the transition from light to dark condition and the quiescent pre-embryo started to develop (Xia et al., 2013). The expression levels of these genes during S1 to S3 were consistent with RNA-seq results. In addition, RNA-seq result showed that DME-like and DRM2X2 were expressed at a very low level during S1 and S2 (Figure S6). Abiotic stress like light, high, and low temperature, PEG treatment and cold all could affect DNA methylation dynamic in plant genome. In this study, we investigated the expression patterns of cultivated peanut C5-MTases and DNA demethylase genes under PEG treatment, high, and low temperature conditions by qRT-PCR (Figures S7–S9).
Figure 6. Relative expression levels of peanut C5-MTase and demethylase genes in different tissue. (A–L) Respectively, presents the relative expression levels of MET, CMT1, CMT2, CMT3, DRM2, DRM2X1, DRM2X2, DME, DME-like, ROS1, ROS2-like, and ROS1X2 in different tissues. “Y” axis represents relative expression level and “X” axis represents tissue names.
Subcellular Localization of Wild and Cultivated Peanut C5-MTases and DNA Demethylases
The presence of nuclear localization signal (NLS) was predicted in wild peanut C5-MTases and DNA demethylases (Table S9). Monopartite NLS was predicted in DRM2X2-A, DRM2X2-B, CMT2-A, CMT2-B, DME-like-A, DME-like-B, ROS1X2-A, ROS1X2-B, and ROS1-like-A and bipartite NLS was predicted in other proteins. The amino acid sequences of the NLSs between orthologous were conserved except ROS1-like-A and ROS1-like-B. NLSs of DRM2X1-A, DRM2X1-B, DRM2X2-A, DRM2X2-B, CMT3-A, and DME-like-B were relatively weak (score < 5). In C. arietinum, O. sativa, and S. lycopersicum, CMT and DRM proteins have been confirmed localized in nucleus (Wada et al., 2003; Dangwal et al., 2013; Cao et al., 2014; Garg et al., 2014). We studied the subcellular localization of peanut C5-MTases and DNA demethylases. Cultivated peanut CMT2, CMT3, DRM2, and ROS1 were cloned in pBI221 vector with C-terminal GFP fusion. These fusion proteins were transiently expressed in onion epidermal cells. The results showed that these GFP fusion proteins were all localized in nucleus (Figure 7).
Figure 7. Subcellular localization of peanut C5-MTases and demethylases. Visualization of GFP fused peanut DRM2, CMT2, CMT3, and ROS1 as compared to pBI221-GFP control vector in onion epidermal cells. FL represents GFP fluorescence; DAPI represents DAPI staining.
In wild peanut genomes, 14 C5-MTases and 10 DNA demethylases were found, including 12 pairs of orthologous genes. The sequences of these C5-MTase and DNA demethylase proteins or genes were similar to their orthologous. These genes and their orthologous were located in the same chromosome. Synteny analyses showed that the segments contained C5-MTase or DNA demethylase genes were very similar with the segments contained their orthologous genes. Promoter analysis showed that the promoters of most C5-MTase and DNA demethylase genes contained light-, heat-, cold-, and salt-induce cis-regulated elements.
Both AA and BB wild peanut genome contained three groups of C5-MTase genes: MET, CMT, and DRM2 group. Unlike Arabidopsis, peanut AA genome and BB genome only contained one MET1 gene, MET-A and MET-B (Hsieh et al., 2011). Like other plant, peanut CMT group contained CMT1, CMT2, and CMT3 type genes (Garg et al., 2014). CMT1-A, CMT2-A, and CMT3-A genes were located in AA genome, while CMT1-B, CMT2-B, and CMT3-B genes were in BB genome. Previous study showed that DRM2 requires intact UBA domain during RdDM in Arabidopsis (Henderson et al., 2010). However, some wild peanut DRM2 group members didn't contained UBA domain such as DRM2-A, DRM2-B, DRM2X1-A, and DRM2X1-B. In seed plant and mammalian, formation of DRM2 homodimer or heterodimer was required to direct DNA de novo methylation (Zhong et al., 2014). It is possible that the DRM2 group member without UBA domain could direct DNA de novo methylation upon forming heterodimer with the DRM2 group member with UBA domain. N-terminus of MET harbor two BAH domains, which might act as a site for protein-protein interaction (Garg et al., 2014). We predicted the 3D structures of wild peanut C5-MTases. We found the structure variation of different BAH domains. We identified that BAH1 of peanut MET group members are similar to BAH domain of CMTs. Both BAH1 and CMT-BAH domains contained aromatic cages which might be involved in interaction of MET with methylated histones tails. BAH2 domains didn't contain aromatic cages. GROMOCS software analysis showed that the RMSD of BAH2 was higher than BAH1 and other CMT-BAH, and collective motion of BAH2 were aberrant comparing with BAH1 and CMT-BAH. The results suggested that BAH2 was less stable than BAH1.
The selective pressure analysis showed that wild peanut C5-MTase genes mainly underwent purifying selection in evolution but many positive selection sites could be detected. DNA demethylase genes mainly underwent positive selection. These results suggested that evolution of these genes was tightly associated with the environment change during peanut evolution history.
Genes encoding enzymes for DNA methylation maintenance, for example CMT, CMT2, and CMT3 didn't express in flower. While genes encoding enzymes for de novo DNA methylation, for example, DRM2X1 (not containing UBA domain) and DRM2X2 (containing UBA domain) expressed primarily in flower. Previous studies have been proven that DRM containing UBA domain exhibited de novo methylation activity. Additionally, DNA methylation was reprogrammed in germ tissues or cells (Seisenberger et al., 2012). So, the silence of DNA methylation maintenance genes and activation of de novo methylation genes may be associated with DNA methylation reprogramming in peanut flower.
Peanut flowering and pollinated above ground. The fertilized ovary elongated and form a gynophore which also was called “peg” like structure which growth downward to the soil to complete pod development. We designate the above ground downward growing gynophore as stage 1 (S1). Gynophore that buried into soil for about 3 d is designated as stage 2 (S2) which is white in color and the pod enlargement is not observed. Peg that buried in soil for 9 d is white in color with enlarged pod is designated as stage 3 (S3). The expression level of DNA demethylase genes (ROS1, ROS1-like, and ROS1X2) increased from S1 to S3. From S1 to S3, the growth condition of gynophores changed from light to dark condition. Recently, epigenetic modifications have been shown to play crucial roles in response to environmental stimuli (Chinnusamy and Zhu, 2009; Gutzat and Mittelsten Scheid, 2012). The drastic change of the environment could be the cause of changed expression of these genes. The changed expression of these genes could affect the methylation pattern of peanut gynophore. Our results of methylation analysis showed that the methylation patterns were significantly different among S1, S2, and S3 (data not shown). In maize embryo and endosperm, the expression change of methylation enzymes affected the methylation patterns of these two tissues and led to distinct developmental fates (Wang et al., 2015). There is the possibility that the development of peanut pod and embryo could be regulated by methylation. DEG data showed that another DNA demethylase gene DME-like didn't express in S1 and S2. In Arabidopsis, different DNA demethylases directed DNA de-methylation in different tissues or cells, for example, AtDME directed DNA de-methylation only in central cell and synergids (Zhu, 2009).
Heat, PEG treatment and cold stress could affect DNA methylation dynamic in plants (Gong et al., 2002; Naydenov et al., 2015). We analyzed the transcript abundance of MTase genes under abiotic stress conditions in cultivated peanut. The results showed that cold, heat or PEG stress could influence the expression level of cultivated peanut C5-MTase and DNA demethylase genes. Higher transcript levels of genes during abiotic stress suggested their involvement in stress induced DNA methylation changes. For example, the expressions of most cultivated peanut C5-MTase genes were increased after 3 or 6 h of cold treatment. DME-like gene showed an increased expression after 48 h of cold treatment. The expression of most C5-MTase genes increased after PEG treatment. These results suggested that these genes might be involved in abiotic stress responses in cultivated peanut.
We studied the subcellular localization of wild and cultivated peanut C5-MTases and DNA demethylase. The results showed that peanut C5-MTases were located in nucleus which was consisted with other species (Cao et al., 2014; Garg et al., 2014). We also found that cultivated peanut DNA demethylase ROS1 located in nucleus.
We have identified members of C5-MTases and DNA demethylases in wild and cultivated peanut. We found differential transcript abundance of C5-MTase and DNA demethylase genes in different tissues and different developmental stages. Our study provided evidence for the roles of C5-MTases and DNA demethylases in response to environmental stresses in cultivated peanut. These results are useful for better understanding the complexity of these two gene families, and will facilitate epigenetic studies in peanut.
XW and SW designed the study, wrote the manuscript and finalized the figures and tables. PW and CG carried out most of the experiment, data analysis, and wrote the method section of the manuscript. XB, SZ, CZ, HX, HS, LH, performed experiments and took care of the plants.
Conflict of Interest Statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
This study was financially supported by the National grants (2011BAD35B04; 31471526; 2013AA102602; 31500217), Shandong provincial grants (BS2013SW006; BS2014SW017), Young Talents Training Program of Shandong Academy of Agricultural Sciences and Shandong Province Germplasm Innovation and Utilization Project.
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/article/10.3389/fpls.2016.00007
Figure S1. HhH-GPD domain in demethylase.
Figure S2. Motifs identified in C5-MTases and demethylases. (A) Motifs of C5-MTases and (B) Motifs of demethylases.
Figure S3. Six conserved motifs in C5-MTases. (A) Six conserved motifs of MET group, (B) Six conserved motifs of CMT group, and (C) Six conserved motifs of DRM2 groups.
Figure S4. Neighbor-joining phylogeny of demethylases.
Figure S5. Gene structure of peanut C5-MTases and demethylases. (A) Gene structure of C5-MTases and (B) Gene structure of demethylases.
Figure S6. Heat map (using RPKM value) of C5-MTase and demethylase genes in different cultivated peanut tissue.
Figure S7. Relative expression levels of cultivated peanut C5-MTase and demethylase genes under cold stress. (A–L) respectively represent the relative expression levels of MET, CMT1, CMT2, CMT3, DRM2, DRM2X1, DRM2X2, DME, DME-LIKE, ROS1, ROS2-like, and ROS1X2 under cold stress from 0 to 48 h.“Y” axis represents relative expression level and “X” axis represents treated time (hour, h).
Figure S8. Relative expression levels of cultivated peanut C5-MTase and demethylase genes under PEG stress. (A–L) respectively represent the relative expression levels of MET, CMT1, CMT2, CMT3, DRM2, DRM2X1, DRM2X2, DME, DME-like, ROS1, ROS2-like, and ROS1X2 under PEG treatment from 0 to 48 h. “Y” axis represents relative expression level and “X” axis represents treated time (hour, h).
Figure S9. Relative expression levels of cultivated peanut C5-MTase and demethylase genes under heat stress. (A–L) respectively represent the relative expression levels of MET, CMT1, CMT2, CMT3, DRM2, DRM2X1, DRM2X2, DME, DME-like, ROS1, ROS2-like, and ROS1X2 under heat treatment from 0 to 48 h. “Y” axis represents relative expression level and “X” axis represents treated time (hour, h).
Table S1. C5-MTase and demethylase genes identified in other species.
Table S2. Conserved motif sequences of C5-MTases and demethylases.
Table S3. Primers of cultivated peanut C5-MTase and demethylase genes for qRT-PCR.
Table S4. Primers of cultivated peanut C5-MTase and demethylase genes for gene cloning.
Table S5. Domains and their positions in wild peanut C5-MTases and demethylase proteins.
Table S6. Identity of wild peanut C5-MTase and DNA demethylase orthologous pairs.
Table S7. Cis-acting regulatory elements of wild peanut C5-MTase and demethylase gene promoters.
Table S8. Likelihood values and parameter estimates for wild peanut C5-MTase and DNA demethylase family genes.
Table S9. Nuclear localization signal in wild peanut C5-MTase and DNA demethylase genes.
Candaele, J., Demuynck, K., Mosoti, D., Beemster, G. T., Inzé, D., and Nelissen, H. (2014). Differential methylation during maize leaf growth targets developmentally regulated genes. Plant Physiol. 164, 1350–1364. doi: 10.1104/pp.113.233312
Cao, D., Ju, Z., Gao, C., Mei, X., Fu, D., Zhu, H., et al. (2014). Genome-wide identification of cytosine-5 DNA methyltransferases and demethylases in Solanum lycopersicum. Gene 550, 230–237. doi: 10.1016/j.gene.2014.08.034
Dangwal, M., Malik, G., Kapoor, S., and Kapoor, M. (2013). De novo methyltransferase, OsDRM2, interacts with the ATP-dependent RNA helicase, OseIF4A, in rice. J. Mol. Biol. 425, 2853–2866. doi: 10.1016/j.jmb.2013.05.021
Du, J., Zhong, X., Bernatavichute, Y. V., Stroud, H., Feng, S., Caro, E., et al. (2012). Dual binding of chromomethylase domains to H3K9me2-containing nucleosomes directs DNA methylation in plants. Cell 151, 167–180. doi: 10.1016/j.cell.2012.07.034
Eichten, S. R., Briskine, R., Song, J., Li, Q., Swanson-Wagner, R., Hermanson, P. J., et al. (2013). Epigenetic and genetic influences on DNA methylation variation in maize populations. Plant Cell 25, 2783–2797. doi: 10.1105/tpc.113.114793
Garg, R., Kumari, R., Tiwari, S., and Goyal, S. (2014). Genomic survey, gene expression analysis and structural modeling suggest diverse roles of DNA methyltransferases in legumes. PLoS ONE. 9:e88947. doi: 10.1371/journal.pone.0088947
Gehring, M., Bubb, K. L., and Henikoff, S. (2009). Extensive demethylation of repetitive elements during seed development underlies gene imprinting. Science 324, 1447–1451. doi: 10.1126/science.1171609
Gong, Z., Morales-Ruiz, T., Ariza, R. R., Roldán-Arjona, T., David, L., and Zhu, J. K. (2002). ROS1, a repressor of transcriptional gene silencing in Arabidopsis, encodes a DNA glycosylase/lyase. Cell 111, 803–814. doi: 10.1016/s0092-8674(02)01133-9
Henderson, I. R., Deleris, A., Wong, W., Zhong, X., Chin, H. G., Horwitz, G. A., et al. (2010). The de novo cytosine methyltransferase DRM2 requires intact UBA domains and a catalytically mutated paralog DRM3 during RNA-directed DNA methylation in Arabidopsis thaliana. PLoS.Genet. 6:e1001182. doi: 10.1371/journal.pgen.1001182
Hsieh, T. F., Ibarra, C. A., Silva, P., Zemach, A., Eshed-Williams, L., Fischer, R. L., et al. (2009). Genome-wide demethylation of Arabidopsis endosperm. Science 324, 1451–1454. doi: 10.1126/science.1172417
Hsieh, T. F., Shin, J., Uzawa, R., Silva, P., Cohen, S., Bauer, M. J., et al. (2011). Regulation of imprinted gene expression in Arabidopsis endosperm. Proc. Natl. Acad. Sci. U.S.A. 108, 1755–1762. doi: 10.1073/pnas.1019273108
Kim, M., Ohr, H., Lee, J. W., Hyun, Y., Fischer, R. L., and Choi, Y. (2008). Temporal and spatial downregulation of Arabidopsis MET1 activity results in global DNA hypomethylation and developmental defects. Mol. Cells. 26, 611–615.
Kochert, G., Stalker, H. T., Gimenes, M., Galgaro, L., Romero-Lopes, C., and Moore, K. (1996). RFLP and cytogenetic evidence on the origin and evolution of allotetraploid domesticated peanut, Arachis hypogaea (Leguminosae). Am. J. Bot. 83, 1282–1291
Krzywinski, M., Schein, J., Birol, I., Connors, J., Gascoyne, R., Horsman, D., et al. (2009). Circos: an information aesthetic for comparative genomics. Genome. Res. 19, 1639–1645. doi: 10.1101/gr.092759.109
Lin, Y., Cheng, Y., Jin, J., Jin, X., Jiang, H., Yan, H., et al. (2014). Genome duplication and gene loss affect the evolution of heat shock transcription factor genes in legumes. PLoS ONE. 9:e102825. doi: 10.1371/journal.pone.0102825
Maia, A. M., da Silva, J. H., Mencalha, A. L., Caffarena, E. R., and Abdelhay, E. (2012). Computational modeling of the bHLH domain of the transcription factor TWIST1 and R118C, S144R and K145E mutants. BMC Bioinformatics 13:184. doi: 10.1186/1471-2105-13-184
Malik, G., Dangwal, M., Kapoor, S., and Kapoor, M. (2012). Role of DNA methylation in growth and differentiation in Physcomitrella patens and characterization of cytosine DNA methyltransferases. FEBS. J. 279, 4081–4094. doi: 10.1111/febs.12002
Malone, T., Blumenthal, R. M., and Cheng, X. (1995). Structure-guided analysis reveals nine sequence motifs conserved among DNA aminomethyltransferases, and suggests a catalytic mechanism for these enzymes. J. Mol. Biol. 253, 618–632.
Moretzsohn, M. C., Gouvea, E. G., Inglis, P. W., Leal-Bertioli, S. C. M., Valls, J. F. M., and Bertioli, D. J. (2013). A study of the relationships of cultivated peanut (Arachis hypogaea) and its most closely related wild species using intron sequences and microsatellite markers. Ann. Bot. 111, 113–126. doi: 10.1093/aob/mcs237
Naydenov, M., Baev, V., Apostolova, E., Gospodinova, N., Sablok, G., Gozmanova, M., et al. (2015). High-temperature effect on genes engaged in DNA methylation and affected by DNA methylation in Arabidopsis. Plant. Physiol. Biochem. 87, 102–108. doi: 10.1016/j.plaphy.2014.12.022
Noy-Malka, C., Yaari, R., Itzhaki, R., Mosquna, A., Auerbach Gershovitz, N., Katz, A., et al. (2014). A single CMT methyltransferase homolog is involved in CHG DNA methylation and development of Physcomitrella patens. Plant. Mol. Biol. 84, 719–735. doi: 10.1007/s11103-013-0165-6
Penterman, J., Zilberman, D., Huh, J. H., Ballinger, T., Henikoff, S., and Fischer, R. L. (2007). DNA demethylation in the Arabidopsis genome. Proc. Natl. Acad. Sci. U.S.A. 104, 6752–6757. doi: 10.1073/pnas.0701861104
Rai, K., Huggins, I. J., James, S. R., Karpf, A. R., Jones, D. A., and Cairns, B. R. (2008). DNA demethylation in zebrafish involves the coupling of a deaminase, a glycosylase, and gadd45. Cell 135, 1201–1212. doi: 10.1016/j.cell.2008.11.042
Seisenberger, S., Andrews, S., Krueger, F., Arand, J., Walter, J., Santos, F., et al. (2012). The dynamics of genome-wide DNA methylation reprogramming in mouse primordial germ cells. Mol. Cell. 48, 849–862. doi: 10.1016/j.molcel.2012.11.001
Sharma, R., Priya, P., and Jain, M. (2013). Modified expression of an auxin-responsive rice CC-type glutaredoxin gene affects multiple abiotic stress responses. Planta 238, 871–884. doi: 10.1007/s00425-013-1940-y
Slotkin, R. K., Vaughn, M., Borges, F., Tanurdžić, M., Becker, J. D., Feijó, J. A., et al. (2009). Epigenetic reprogramming and small RNA silencing of transposable elements in pollen. Cell 136, 461–472. doi: 10.1016/j.cell.2008.12.038
Wada, Y., Ohya, H., Yamaguchi, Y., Koizumi, N., and Sano, H. (2003). Preferential de novo methylation of cytosine residues in non-CpG sequences by a domains rearranged DNA methyltransferase from tobacco plants. J. Biol. Chem. 278:42386–42393. doi: 10.1074/jbc.M303892200
Wang, P., Xia, H., Zhang, Y., Zhao, S., Zhao, C., Hou, L., et al. (2015). Genome-wide high-resolution mapping of DNA methylation identifies epigenetic variation across embryo and endosperm in Maize (Zea may). BMC Genomics 16:21. doi: 10.1186/s12864-014-1204-7
Xia, H., Zhao, C., Hou, L., Li, A., Zhao, S., Bi, Y., et al. (2013). Transcriptome profiling of peanut gynophores revealed global reprogramming of gene expression during early pod development in darkness. BMC Genomics 14:517 doi: 10.1186/1471-2164-14-517
Xiao, W., Custard, K. D., Brown, R. C., Lemmon, B. E., Harada, J. J., Goldberg, R. B., et al. (2006). DNA methylation is critical for Arabidopsis embryogenesis and seed viability. Plant Cell 18, 805–814. doi: 10.1105/tpc.105.038836
Zemach, A., Kim, M. Y., Silva, P., Rodrigues, J. A., Dotson, B., Brooks, M. D., et al. (2010). Local DNA hypomethylation activates genes in rice endosperm. Proc. Natl. Acad. Sci. U.S.A. 107, 18729–18734. doi: 10.1073/pnas.1009695107
Zhong, X., Du, J., Hale, C. J., Gallego-Bartolome, J., Feng, S., Vashisht, A. A., et al. (2014). Molecular mechanism of action of plant DRM de novo DNA methyltransferases. Cell 157, 1050–1060. doi: 10.1016/j.cell.2014.03.056
Keywords: DNA mehtylation, C5-MTase, DNA demethylase, peanut, BAH domain, abiotic stress
Citation: Wang P, Gao C, Bian X, Zhao S, Zhao C, Xia H, Song H, Hou L, Wan S and Wang X (2016) Genome-Wide Identification and Comparative Analysis of Cytosine-5 DNA Methyltransferase and Demethylase Families in Wild and Cultivated Peanut. Front. Plant Sci. 7:7. doi: 10.3389/fpls.2016.00007
Received: 26 October 2015; Accepted: 06 January 2016;
Published: 03 February 2016.
Edited by:Rajeev K. Varshney, International Crops Research Institute for the Semi-Arid Tropics, India
Reviewed by:Manish Kumar Pandey, International Crops Research Institute for the Semi-Arid Tropics, India
Vikas Kumar Singh, International Crops Research Institute for the Semi-Arid Tropic, India
Copyright © 2016 Wang, Gao, Bian, Zhao, Zhao, Xia, Song, Hou, Wan and Wang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Xingjun Wang, email@example.com
†These authors have contributed equally to this work.