ORIGINAL RESEARCH article
Comparative Analysis of WRKY Genes Potentially Involved in Salt Stress Responses in Triticum turgidum L. ssp. durum
- 1Laboratory of Plant Molecular Physiology, Center of Biotechnology of Borj Cedria, Borj Cedria Science and Technology Park, Hammam-lif, Tunisia
- 2Centre National de Ressources Genomiques Vegetales, French Plant Genomic Center, INRA–CNRGV, Castanet-Tolosan, France
- 3INRA, UMR990 Genomique et Biotechnologie des Fruits, Castanet-Tolosan, France
- 4INPT, Laboratoire de Genomique et Biotechnologie des Fruits, University of Toulouse, Castanet-Tolosan, France
WRKY transcription factors are involved in multiple aspects of plant growth, development and responses to biotic stresses. Although they have been found to play roles in regulating plant responses to environmental stresses, these roles still need to be explored, especially those pertaining to crops. Durum wheat is the second most widely produced cereal in the world. Complex, large and unsequenced genomes, in addition to a lack of genomic resources, hinder the molecular characterization of tolerance mechanisms. This paper describes the isolation and characterization of five TdWRKY genes from durum wheat (Triticum turgidum L. ssp. durum). A PCR-based screening of a T. turgidum BAC genomic library using primers within the conserved region of WRKY genes resulted in the isolation of five BAC clones. Following sequencing fully the five BACs, fine annotation through Triannot pipeline revealed 74.6% of the entire sequences as transposable elements and a 3.2% gene content with genes organized as islands within oceans of TEs. Each BAC clone harbored a TdWRKY gene. The study showed a very extensive conservation of genomic structure between TdWRKYs and their orthologs from Brachypodium, barley, and T. aestivum. The structural features of TdWRKY proteins suggested that they are novel members of the WRKY family in durum wheat. TdWRKY1/2/4, TdWRKY3, and TdWRKY5 belong to the group Ia, IIa, and IIc, respectively. Enrichment of cis-regulatory elements related to stress responses in the promoters of some TdWRKY genes indicated their potential roles in mediating plant responses to a wide variety of environmental stresses. TdWRKY genes displayed different expression patterns in response to salt stress that distinguishes two durum wheat genotypes with contrasting salt stress tolerance phenotypes. TdWRKY genes tended to react earlier with a down-regulation in sensitive genotype leaves and with an up-regulation in tolerant genotype leaves. The TdWRKY transcripts levels in roots increased in tolerant genotype compared to sensitive genotype. The present results indicate that these genes might play some functional role in the salt tolerance in durum wheat.
Durum wheat (Triticum turgidum ssp. durum) is a monocot of the Poaceae family, of the Triticeae tribe and the Triticum genus. Tetraploid Triticum durum (2n = 4× = 28, AABB), originated when two diploid wild grasses were crossed. The A genome originated from T. urartu (Dvorak, 1988). The origin of the B genome is still under discussion, but so far, Ae. Speltoides has been put forward as the closest to the donor of this genome (Fernandez-Calvin and Orellana, 1994). Durum wheat has one of the largest and most complex genomes; its size is estimated at 13,000 Mb.
The unavailability of T. durum genome sequences complicates and hinders the identification of the genetic factors and mechanisms behind responses to abiotic stresses such as drought and salinity. These are just some of the major constraints affecting cereal crops. In terms of commercial production and human food, this species is the second largest of its kind after bread wheat (Triticum aestivum L.). Durum wheat represents 8% of total wheat production, but 80% grow under Mediterranean climates (Monneveux et al., 2000). To date, durum wheat is the only tetraploid wheat species of commercial importance that is widely cultivated in these regions, where drought, heat and salinity limit yield considerably. Durum wheat is also more salt sensitive than bread wheat (Gorham et al., 1990) and saline soil has a negative effect on production (Maas and Grieve, 1990). Consequently, special efforts must be made to increase its tolerance.
Plants adapt to adverse environmental conditions through the induction of stress-responsive and stress-tolerant genes, a process that occurs mainly through transcription factors (TFs). TFs are known to mediate stress signal transduction pathways regulating downstream target gene expression and lead to stress tolerance (Shinozaki and Dennis, 2003; Chen and Zhu, 2004; Yamaguchi-Shinozaki and Shinozaki, 2005; Budak et al., 2013).
The WRKY transcription factor belongs to a very large family of transcription factors potentially involved in drought/salt stress response (Budak et al., 2013). This family originated in early eukaryotes and greatly expanded in plants (Zhang and Wang, 2005). It counts over 70 members in the Arabidopsis genus (Arabidopsis thaliana; Eulgem et al., 2000; Dong et al., 2003), 55 members among cucumber plants (Cucumis sativus; Wei et al., 2012), 119 members among maize plants (Zea Mays; Ling et al., 2011), 94 members among barley plants (Hordeum vulgare; Liu et al., 2014), 182 encoding genes in soybean (Glycine max; Bencke-Malato et al., 2014), 62 members among diploid woodland strawberry plants (Fragaria vesca; Wei et al., 2016) and 100 members among rice grasses (Oryza sativa; Xie et al., 2005).
This transcription factors family is characterized by a 60 amino acids domain containing the WRKY amino acid sequence at its amino-terminal end and a putative zinc finger motif at its carboxy-terminal end. It binds specifically to the (T)(T)TGAC(C/T) sequence motif, known as the W box, which requires both the invariable WRKY amino-acid signature and the cysteine and histidine residues of the WRKY domain, which tetrahedrally coordinate a zinc atom (Rushton et al., 2010).
The existence of either one or two highly conserved WRKY domains is the most vital structural characteristic of the TdWRKY gene (Bi et al., 2016). Furthermore, the global structures of WRKY proteins are highly divergent and can be classified into different groups, which might reflect their distinct roles. WRKY proteins are classified into 3 main groups (I, II, III) based on the number of WRKY domains and the structure of the zinc finger-like-motif.
Group I proteins contain two WRKY domains followed by a C2H2 zinc finger motif. The other WRKY proteins from group II and III contain one WRKY domain followed by a C2H2 or C2HC accordingly. Group II can be divided into five subgroups (IIa, IIb, IIc, IId, and IIe) based on additional amino acid motifs (Yamasaki et al., 2005).
WRKY genes are known to participate in various developmental and physiological metabolisms, including disease resistance (Bhattarai et al., 2010), senescence (Besseau et al., 2012), growth and developmental processes (Guillaumie et al., 2010), as well as biotic and abiotic stress responses (Mingyu et al., 2012). Recently, transgenic Arabidopsis plants overexpressing TaWRKY2 (EU665425) or TaWRKY19 (EU665430) have shown improved tolerance to salt, drought and/or freezing stresses when compared with the wild-type plants (Niu et al., 2012). Marè et al. (2004) described HvWRKY38 (AY541586), a barley gene coding for a WRKY protein, whose expression is involved in cold and drought stress response.
In durum wheat, very few WRKY EST sequences have been identified. Only partial cDNAs were found (Budak et al., 2013; Cifarelli et al., 2013). Literature does not include any report describing the characterization of a TdWRKY gene from tetraploid wheat species.
Available plant genome sequences from monocot plants are key resources that enable a better understanding of their gene content, structure and function. They are also indispensable for understanding transposable elements, intergenic space organization and composition. Genomic comparisons between the genome A diploid wheat donor (Triticum urartu), hexaploid wheat (T. aestivum, The International Wheat Genome Sequencing Consortium (IWGSC), 2014), barley (H. vulgare, The International Barley Sequencing Consortium, 2014) and Brachypodium (Brachypodium distachyon, The International Brachypodium Initiative, 2010), have not only confirmed the broad synteny between Poaceae gene content but have also helped to deduce their function within a phylogenetic context (Dubcovsky et al., 2001).
In this paper, we obtained genomic sequences of the region harboring WRKY genes from T. turgidum ssp. durum from Langdom#65 BAC clone library screening. First, we presented the WRKY gene characterization, including phylogenetic analysis and orthologous gene comparison. Then, we analyzed the genomic environment of WRKY genes, including intergenic space composition, and cis-acting elements that helped to associate a putative function with TdWRKYs. We finally performed TdWRKYs differential gene expression. TdWRKY1, 3, 4, and 5 were induced by high-salt treatment in two durum wheat varieties, Grecale (GR) and Om Rabiaa (OR), shown to be salt-tolerant and -sensitive, respectively, suggesting that TdWRKYs may be involved in salt-stress responses. These data provide new leads toward improving durum wheat tolerance to abiotic stresses.
Materials and Methods
Three genotypes of tetraploid T. turgidum L. ssp. durum (2n = 4× = 28) were used in this study; Langdon LDN#65 for PCR bacterial artificial chromosome (BAC) screening, and OR and GR for functional analyses. The latter two were a local Tunisian variety and an Italian variety introduced in Tunisia, respectively.
BAC Library Screening
The BAC library constructed from the tetraploid T. turgidum L. ssp. durum (2n = 4× = 28) Langdon LDN#65 genotype was used in this study for PCR Bacterial Artificial Chromosome (BAC) screenings. The BAC library is represented by a total of 516,096 clones organized in 1,344,384-well plates. The average insert size of the BAC clones was estimated at around 131 kb resulting in 5.1 genome coverage (Cenci et al., 2003). The LDN#65 BAC library and related tools are available at the French Plant Genomic Resource Center upon request (http://cnrgv.toulouse.inra.fr/en/library/genomic_resource/Ttu-B-LDN65). The library was organized into a two-dimensional pool and BAC library screening was performed as described by Cenci et al. (2003, 2004). The pooling strategy used required 56 PCRs for the superpools (9216 BAC clones each) and 50 additional PCRs for each positive superpool. The screenings have been done as described in Makhloufi et al. (2014).
PCR Primer Design and PCR Amplification for BAC Screening
Two primer design strategies have been used. O. sativa, T. aestivum and H. vulgare sequences harboring conserved WRKY domains were used as queries (NCBI: http://www.ncbi.nlm.nih.gov/; GrainGenes: http://wheat.pw.usda.gov/GG2/index.shtml; Gramene: http://www.gramene.org/; TIGR: http://www.jcvi.org/; HarvEST: http://harvest.ucr.edu/; Phytozome:http://www.phytozome.net/; PTDB: http://plntfdb.bio.uni-potsdam.de/v3.0; and plant GDB: http://www.plantgdb.org/), in a BLAST search to find homologous T. turgidum ESTs (Budak et al., 2013; Cifarelli et al., 2013). Multiple alignments of the DNA sequences were performed by ClustalW software (Larkin et al., 2007). In order to avoid an amplification of the exon–intron junction, prediction of the exon boundaries within Triticeae expressed sequence tags (ESTs) were performed based on rice and Arabidopsis genomic sequences.
The second has been deduced due to the very strong homology between different species of Triticeae. We tried to identify non-specific T. turgidum primers from genes whose function was studied. For this, some WRKY genes that had been specifically studied in the context of abiotic stress, were chosen for primer design: HvWRKY38 (AY541586) (group IIa) (Marè et al., 2004) and TaWRKY2 (EU665425) (group I); TaWRKY19 (EU665430) (group I) (Niu et al., 2012).
PCR primers were then designed to cover exons of the entire selected sequence using the Perl primer tool v.220.127.116.11 (Marshall, 2004) (Table S1). Primers were tested on genomic DNA of LDN#65 before BAC library screening. Total DNA was extracted from wheat Langdon 65 variety using the Plant DNAzol® reagent. PCR conditions used were as follows: initial denaturation at 95°C for 5 min, followed by 45 cycles of 20 s at 95°C, 16 s at 60°C, and 20 s at 72°C, performing a melting curve with an increment of 0.5°C per cycle. PCR products for the selected BACs were separated by electrophoresis (2% Agarose).
BAC Sequencing, Assembly, and Annotation
BAC DNA was extracted using a NucleoSpin® 96 Flash kit (Macherey-Nagel) and the insert size was estimated with NotI digestion (Fast Digest NotI; fermentas). Positive BAC clones were outsourced for 454 Life Sciences pyrosequencing technology using the GS Junior Roche system (Kit 454 Titanium; Roche). Sequence data assembly was performed with Newbler 2.8 software sold by 454 Life Sciences/Roche for 454 data (Veras et al., 2013). The assembly was performed on data previously processed by the software Pyrocleaner after clearing reads from contamination by the host E. coli. Sequenced BAC DNA was analyzed using TriAnnot Pipeline v.3.8 improved for wheat species (http://wheat-urgi.versailles.inra.fr/Tools/TriAnnot-Pipeline) enabling annotation, masking of transposable elements, and gene structure organization (Text S1) (Leroy et al., 2012).
Alignment, Phylogenetic Tree, and Sequence Analysis of TdWRKY Genes
The TdWRKY protein sequences were submitted to the CDD (Conserved Domains Database) from NCBI, and to Motif Scan detection in MyHits (http://myhits.isb-sib.ch/) with Prosite databases selected. Homologous proteins from whole genome sequenced monocot plants (Wheat, Barley, Brachypodium, Maize, Sorghum and Rice) and Arabidopsis were selected for sequence alignments using DNAMAN software (http://www.lynnon.com/).
A neighbor joining phylogenetic tree was derived from a MUSCLE alignment 3.8 (Edgar, 2004) of TdWRKY proteins, their homologous WRKY proteins from cereals and their closest members from A. thaliana. The tree was then produced by MEGA 6 software (Tamura et al., 2013) using the Neighbor-Joining method with 1000 bootstrap replicates.
A 1.5 kb DNA fragment on the 5′-regulatory region upstream of the transcription start of the TdWRKY gene was subjected to in silico analysis using the PLACE signal scan and NSITE-PL (Recognition of PLANT Regulatory motifs with statistics) from the Softberry tool (http://www.softberry.com/) to search for putative cis-regulatory elements in the promoter region potentially involved in controlling TdWRKY gene expression and also in the promoter region of their orthologs in T. aestivum and H. vulgare to see the average frequencies of cis-acting elements related to abiotic stress in these promoters.
Sterile seeds of two independent genotypes from T. turgidum subsp. durum, OR and GR, were first stored for 48 h at 4°C for initialization of germination. Seedlings were sown in recipient Magenta™ vessels containing 50 ml of 50% MS-based medium (Murashige and Skoog, 1962) and were left for 10 d in an in vitro growth chamber maintained at a controlled photoperiod of 14 h during the day at 25°C with 80% humidity and an intense luminosity of 250 μmol m−2 s−1, and for 10 h during the night at 20°C. They were then subjected to abiotic and hormonal stress treatments. For salinity treatment, seedlings were transferred into 50% MS medium containing 200 mM NaCl for 6 or 24 h. Leaves and roots were then harvested separately, dropped immediately into liquid nitrogen, and stored at −80°C for RNA extraction.
Gene Expression Analysis
Total RNA from at least 30 salt-treated and untreated leaves and roots from OR and GR genotypes were extracted using a Pure Link Plant RNA Reagent kit (Invitrogen). Total RNA was DNase treated (Promega), and first-strand cDNA was reverse transcribed from 2 μg of total RNA using an M-MLV Reverse Transcriptase kit (Promega) according to the manufacturer's instructions. First-strand cDNA generated from total RNA including salt-treated and untreated samples from either the OR or GR genotype was subjected to real time quantitative expression analysis. The latter was performed in a fluometric thermal cycler (DNA Engine Opticon 2; MJ Research, Walthan, MA, USA) using SYBR Green fluorescent dye following the manufacturer's instructions. Results were shown using SDS2.2 software on an Applied Biosystem 7900 HT Fast Real-Time PCR System. Comparisons of repeated samples were assessed using CT values among the three replications. Linear data were normalized to the mean CT of 26S rRNA as an internal reference gene and the relative expression ratio was calculated using the formula 2−ΔΔCT. Log2-transformed signal ratio was carried out. The gene specific primers used for PCR are listed in Table S2.
Identification, Sequencing and General Features of the Sequence Composition of T. durum BAC Clones
516,000 clones from the Durum wheat bacterial artificial chromosome (BAC) library, were screened for individual clones harboring WRKY genes. Six T. durum BAC clones were selected and fully sequenced by 454 technologies.
BAC clone annotation revealed 12 non-TE genic features that were classified into two categories: 9 protein coding genes, one hypothetical and 2 gene fragments (Table S4). The gene assignments were all supported by at least one full-length cDNA, an EST and/or homolog in another monocot plant such as T. aestivum, O. sativa, H. vulgare, Z. mays, and B. distachyon. One gene per insert was predicted for TD14H23 (GenBank accession no. KY091673), TD473J01(GenBank accession no. KY091677), TD493B21 (GenBank accession no. KY091678) and TD315C07 (GenBank accession no. KY091676). The remaining two BAC clones TD16L16 (GenBank accession no. KY091674) and TD789O23 (GenBank accession no. KY091679), contain four genes (Table 1). Two genes (of known or unknown function), three genes with putative function, four genes with domain-containing proteins, one hypothetical gene and one truncated (fragmented gene), were identified (Table 1, Table S4).
Table 1. TdWRKY gene models and comparaisons with homologous sequences in grass species using Blastp and Blastn.
An average GC content of 47.5% was found for all BACs. In contrast with the constant GC content, gene and TE composition was highly variable between the different BAC sequences. The proportion of TEs ranged from 64 to 91%, while gene content ranged from 1 to 4 genes per BAC. The coding fraction of the 960,769 kb total samples represents 3.2% of the sequences, while the TE content is 74.6%, distributed as follows: 63.6% for class I, 7.9% for class II and 3.1% for unclassified TEs (Table S3). While class I retrotransposons constitute the highest TE proportion of the 6 sequenced regions, BAC clone TD473J01 shows the highest proportion of CACTA class II (17.0%). Class I TE DNA sequences were distributed as follows: 27.2% Gypsy- (150 TEs) and 15.9% Copia- (71 TEs) like “long terminal repeats (LTR)”-retrotransposons. New class I transposable elements were identified for the first time in this study. They account for 17.9% of length and were identified as de novo LTR-retrotransposons (Table S5). Class II TEs (DNA transposons) represent 7.9% of the cumulative sequence length. The CACTA TEs represent the majority (32.4%) of class II DNA sequences. Novel class II TE families were identified, sharing weak homologies with known CACTA, Mutator and Mariner. The 3.1% novel unclassified elements, share a stretch of weak homology with other Triticeae unclassified transposable elements (Table S3, Text S2).
TdWRKY and Monocot Gene Structure
A comparison of durum wheat, T. aestivum, H. vulgare, and Brachypodium distachyon WRKY gene sequences helped to establish their structure.
Gene length varies from 1.2 to 4.2 Kb. It is conserved amongst the orthologous monocots (TdWRKY1 and TdWRKY5 length was highly conserved among orthologous members). The number of introns varies from 2 to 4. The intron size is also variable; it ranges from 102 to 925 bp (Figure 1, Table 1). All exon-intron junction sites obey the GT/AG rule as identified in other eukaryotic genes. To date, the relative organization of the exons and introns is the same for the other WRKY genes characterized in cereal, i.e., the number of exons and introns remains the same and individual introns occur at relatively the same sites for barley, Brachypodium and wheat genes.
Figure 1. Isolation of 5 TdWRKYs by screening of a durum wheat BAC library and Comparative Structure of TdWRKYs and orthologous from B.distachyon, T. aestivum and H. vulagre. Genomic structure of the durum wheat TdWRKYs gene; white portion are UTR regions, dark gray portions are orthologous exons from different sizes, and light gray boxes are exons from same size between orthologous. The line represents the intron. The red motives are special WRKY family conserved intron. TdWRKYs untranslated regions, introns and exons boundaries are highlighted on the bottom of TdWRKYs.
The size of coding sequences varies from 882 to 1716 bp (Figure 1). The size of exonic regions between the orthologous genes was similar although the overall region structure was slightly different between TdWRKYs and their orthologs (exon 2 from TdWRKY1 and TdWRKY2 were almost the same size, but with 1 to 3 more intron phases in corresponding exons. The same was observed with exon 3 from TdWRKY5). Small stretches of exonic region sequences or more exons (BdiWRKY has 4 more exons, in the 5′ moiety, than TdWRKY4) do not contradict the general pattern of overall high WRKY gene conservation between the TdWRKYs and their homologous genes (Text S3).
Comparison of the TdWRKYs cDNA with sequences of other species showed an identity ranging from 99% for T. aestivum; 86% for Brachypodium and 92% for barley (Table 1, Text S3).
Interestingly, we also identified a cluster of non syntenic collinear genes, probably originating from genomic rearrangements of gene blocks such as gene 1 and gene 4 from TD789O23 (the two gene fragments from Table 1). They were annotated as being similar to the WRKY domain protein. Genes 1 and 4 (order on the BAC clone), share 93 and 94% of similarity, respectively, with TdWRKY5. An alignment of TdWRKY5 from TD315C07 and the two fragments (gene 1 and gene 4) from TD789O23, revealed that gene 4 might be a truncated second allelic form of TdWRKY5 (Figure 1). In fact, the gene 4 genomic sequence is 989 bp shorter (from position 1 to 990, Figure S2) than TdWRKY5. It results in the formation of a new coding protein. The predicted protein from gene 4 belongs to group II as well as TdWRKY5. The WRKY domain on exon2, the PR intron special feature, and the zinc finger-like motif on exon3, are perfectly conserved. The alignment of TdWRKY5, gene 4 and gene 1, showed that gene 1 and gene 4 happened to be fragments of TdWRKY5 (Figure S2). This disruption was the result of a deletion and an insertion. These are two of the most important genomic rearrangement events. They play significant roles in genome evolution. This reshuffling created two new, non-syntenic genes with orthologous species (Table 1). This might, or might not, be a loss-of-function mutation of the second TdWRKY5 allelic form. Unfortunately, we were only able to find approximately 450 bp from the 898 bp of newly inserted fragment (gene1) since it is located at the extreme 5′ start of TD789O23 DNA insert. There are many ways in which exon shuffling may occur. Shuffling involves transposable elements such as LINE-1 retroelements and Helitron transposons, as well as CACTA elements and LTR retroelements, a crossover during sexual recombination or alternative splicing. These hypothesized mechanisms should be thoroughly explored.
TdWRKYs Classification and Phylogeny
WRKY proteins are classified into 3 groups. We used phylogeny to assign a group to our WRKY sequences (Figure 2). WRKY1, 2, and 4 belong to group I. They all have 2 WRKY domains with a C-X (4,5)-C-X (22,23) H-X-H zinc-finger-like type motif on each domain. TdWRKY2 predicted protein contains two WRKY domains. The N-terminal domain has an altered WRKY motif WRKYGKK. An alignment of TdWRKY2, with its orthologous proteins from Aegilops, Brachypodium and T.urartu on the UniProtKB database (http://www.uniprot.org/align/), indicates that the protein is biased at its C-terminus WRKY conserved domain. 29 aa are missing from the zinc finger motif (Figure S1).
Figure 2. Phylogenetic analysis of the TdWRKYs proteins with orthologous members from cereals (T. aestivum, T. urartu, Ae. taushii, H. vulgare, Z. mays, S. bicolor, and O. sativa) and A. thaliana. The Arabidopsis WRKY sequences were downloaded from GenBank (we used gene names for the alignment). TdWRKYs are spotted with black motives. Alignment of the 59 amino acids sequences was performed by MUSCLE 3.8 (http://www.ebi.ac.uk/Tools/msa/muscle/). The evolutionary history was inferred using the Neighbor-Joining method. The bootstrap consensus tree inferred from 1000 replicates is taken to represent the evolutionary history of the taxa analyzed. The evolutionary distances were computed using p-distance method and are in the units of the number of amino acid differences per site. Partial deletion parameters were used, all positions with less than 50% site coverage were eliminated, meaning that, fewer than 50% alignment gaps, missing data, and ambiguous bases were allowed at any position. There were a total of 459 positions in the final dataset. Evolutionary analyses were conducted in MEGA6. All the branches were supported with bootstrap values ≥ 50%.
This might be the cause of alteration in Wbox recognition. The role of conserved residues in this domain has been studied by Maeo et al. (2001) using mutation experiments. Any mutation occurring either in the WRKYGQK or the zinc finger motif of WRKY domains, affecting cysteine or histidine, cancels DNA binding activity (Knoth et al., 2007). The WRKY domain from the C-terminal region is responsible for binding to DNA, whereas the role of the N-terminal is to promote protein-protein interactions (Maeo et al., 2001).
TdWRKY3 and 5 were assigned to groups IIa and IIc respectively, due to the presence of only one WRKY domain with a specific zinc finger motif, C-X (4,5)-C-X (22,25)-H-X-H.
We investigated the Prosite and Pfam databases using the TdWRKYs sequences as queries to identify conserved domains, motifs and active phosphorylation sites (Figure 3, Text S4). The analysis predicted 11 specific sequence features for all TdWRKYs identified, including the WRKY domain, a Plant Zinc Clust domain, and a Gly_Rich region and a His_Rich region (Figure 3). Seven active sites were detected, including 32 CK2_Phospho sites, 22 PKC_Phospho sites, 12 Asn_Glycosylation sites, 29 Myristyle sites, 2 TYR_pospho sites, one amidition_site and 3 CAMP_phospho_sites. These features are related to subcellular localization, signal transduction, transcriptional regulation and protein interaction, and build a basis for TdWRKY function. TdWRKY1, 4, and 5 might act as activators. TdWRKY2 and 3 contain the active repressor motif (LXLXLX) (Xie et al., 2005) (Figure 3).
Figure 3. Sequence features of the TdWRKYs protein. Domains or motifs: WRKY (WRKY domain), RWD, Pla_Zn (Plant Zinc Clust domain), PRO_R (Pro_Rich region), MET_R (Met_Rich region), NLS (Nuclear Localization Signal), LXLXLX (repressor motif), and Fila (Filamin domain; Active sites: 1, Asn_Glycosylation; 2, CK2_Phospho_Site; 3, Microbodies_Cter site; 4, Myristyle site; 5, PKC_Phospho_Site; 6, Amidation site; 7, CAMP phosphosite).
For additional phylogeny support, the detection of the position and the phase of the intron in each region encoding a WRKY domain is essential. Four of the eight WRKY domains, found in TdWRKY genes, contain an intron in a conserved position (Figure 1). This phase 2 intron is localized at the 11th codon downstream of the WRKY motif, interrupting the codon encoding Arg (on 100% of genes). In Group Ia (TdWRKY1, 2, and 4) (the C-terminal WRKY domain), Group IIc (TdWRKY5), IId, IIe, and III genes, this intron comes after the codons for the invariant amino acid sequence PR and separates the WRKY sequence from the zinc finger motif. In Group IIa (TdWRKY3) and IIb genes the intron occurs at a nucleotide position that corresponds to five amino acids after the C-X5-C and separates this from the rest of the zinc finger structure (Tripathi et al., 2012). On TdWRKY3, the WRKY domain was separated from the zinc finger motif by the C-X5-C core but there was not an intron at this position (Figure 1).
Hypothetical Function Deduced By Protein Domain and Orthologous Gene Function Sequence
The phylogenetic analyses revealed distinct clusters of TdWRKYs. Within the WRKY clade, distinct clusters corresponding to WRKY groups and subgroups comprising of further sub-clusters emerged. TdWRKYs with WRKYs from wheat, barley, T. urartu and Aegilops formed one sub-cluster whereas rice, maize and sorghum were grouped as a separate sub-cluster, and Arabidopsis were clustered together. Within sub-clusters, durum wheat, and T. aestivum, WRKYs showed less divergence (as indicated by a shorter branch length) than Brachypodium and barley WRKYs. Similarly, maize and sorghum WRKYs showed less divergence compared to rice. Monocot WRKYs showed less divergence than the closest Arabidopsis WRKYs (Figure 2).
TdWRKY1 was clustered to TaWRKY19, involved in salt stress response. TdWRKY2 was clustered to BdWRKY2 and AtWRKY34, involved in cold stress response (Zou et al., 2010). TdWRKY4 belongs to the TaWRKY53, TaWRKY2 and AtWRKY33 cluster, involved in salt stress (Jiang and Deyholos, 2009). TdWRKY3 is clustered to TaWRKY80 and HvWRKY38, involved in dehydration response and to AtWRKY18, 60, and 40 (Liu et al., 2012). They were described as regulating defense response. TdWRKY5 and its orthologous TaWRKY71, BdWRKY71, and AtWRKY8, involved in salt stress response, were grouped together (Figure 2).
Multiple sequence alignment of the conserved sequence from group Ia TdWRKYs (the two WRKY domains were aligned separately) and group IIa and IIc TdWRKY members with their closest Arabidopsis and monocot WRKY proteins (Figure 4) revealed the very conserved structure of the WRKY motifs and the amino acid residues potentially interacting with zinc ligands. TdWRKYs consist of a four-stranded ß-sheet (1, 2, 3, and 4), with a zinc binding pocket formed by the conserved Cys/His residues located at one end of the ß-sheet, and the WRKYGQK cores, corresponding to the most N-terminal ß-strand (strand ß-1), kinked in the middle of the sequence by the Glyresidue (Yamasaki et al., 2005).
Figure 4. Multiple alignment of the WRKY conserved domain of TdWRKYs with closely related WRKY proteins using DNAMAN software (http://en.bio-soft.net/format/DNAMAN.html), Blue, clear blue and pink represent 100, 75, and 50% similarity, respectively. Four β-sheets indicated over the corresponding sequences.
Putative cis-Acting Elements Identified in the TdWRKY Promoter Region
BAC sequences generated more data about gene environment. It affords greater reliable and precise information for further functional studies. A 1.5 kb 5′ regulatory region upstream of the transcription start of the TdWRKY genes was subjected to in silico analysis using a plant cis-acting regulatory DNA elements (PLACE) signal scan to search for putative cis-regulatory elements potentially involved in the control of TdWRKY gene expression. The data indicated the presence of a large number of conserved cis-regulatory elements that are putative targets for TFs reported to mediate responses to environmental stresses or to stress-related hormones (Table 2). To regulate gene expression, WRKY factors show a binding preference to a DNA sequence called W box: (C/T) TGAC (C/T) (Ciolkowski et al., 2008). This DNA core was over-represented on TdWRKY3, 4 and 5. 6 and 5 boxes were found on TdWRKY1 and 2, respectively.
It has also been shown that some WRKY factor types can bind to other types of cis-elements. ABRE-like motifs (ACGTG) and ABRE-related motifs (ACGTGKC and TACGTGTC) were also found in the promoter region of TdWRKY1 3, 4, and 5. The MYB-core element (TAACTG) and a number of MYB-related motifs (YAACKG, CNGTTR, and GGATG), as well as a MYC (CANNTG) motif and the MYC-related motifs (CATGTG and CACATG), were present on the entire promoters. MYC and MYB had the biggest number among others. DRE (TACCGACAT), CRT (RCCGAC), and low-temperature responsive elements (LTREs) (CCGAC), all containing the CCGAC motif that forms the core of the DRE sequence, were well represented in the promoter of TdWRKY1 and TdWRKY3. DRE-like elements such as CBFHv (RYCGAC) were also identified mostly on TdWRKY1 and TdWRKY3. Two GCC-box motifs (AGCCGCC), target sequences for ERF proteins, were found in the TdWRKY1 and TdWRKY4 promoters. RAV1-A motifs (CAACA), to which RAV1 proteins can bind through their AP2 and B3-like domains, were also present in a large number on TdWRKY promoters (Table 2). The average cis-elements frequencies on the WRKY promoter gene were deduced from TdWRKYs and from orthologous TaWRKYs and HvWRKYs scanned also for regulatory elements. The results showed that the promoters of TdWRKY genes are the richest in terms of putative regulatory elements compared to HvWRKYs and TaWRKYs. Moreover, the frequencies of GCC boxes, MYB and Wboxes in A. thaliana promoter regions are significantly lower than in hexaploid wheat, durum wheat and barley, which might be related to the fact that the average frequency was calculated within the 74 WRKY members (Dong et al., 2003). Promoter analysis in Populus revealed that various cis-acting elements (LTRE, ABRE, ABA, and Wboxes) involved in abiotic stress and phytohormone responses were highly present in the promoter region of PtrWRKY genes (Jiang et al., 2014). The data presented by Makhloufi et al. (2014) on durum wheat TdERF indicated the presence of a large number of conserved cis-regulatory elements that are putative targets for TFs reported to mediate responses to environmental stresses or to stress-related hormones.
The Expression Patterns of TdWRKYS under Salt-Stress Conditions in Two Contrasting Genotypes
The expression pattern of TdWRKYs in response to short term salt stress, in both leaves and roots, was analyzed in the OR and GR genotypes of durum wheat shown to be sensitive and tolerant to high salt, respectively (Makhloufi et al., 2014). Specific primers (Table S2) were designed and used in a quantitative real-time PCR. Upon high-salt treatment (200 mM NaCl), the expression levels of TdWRKY1, 3, 4, and 5 were altered (up to 19-fold in TdWRKY4) in the leaves of sensitive genotype after 6 h under salt stress (Figure 5). Meanwhile, the transcripts level at the same time in tolerant genotype remain almost unchanged for TdWRKY3, 4, and 5 and increased almost 5 times than control (Figure 5A). Thereafter, the expression of the TdWRKY genes in leaf tissues displayed a dramatic increase at 24 h in both genotypes, even though the upregulation was substantially higher in GR (27-fold, 17-fold, 11-fold, and 15-fold for TdWRKY1, 3, 4, and 5, respectively) than in OR (1.5-fold altered, 13-fold, 3-fold, and 12-fold for TdWRKY1, 3, 4, and 5, respectively) (Figure 5A). In treated OR roots, transcripts levels of the four TdWRKYs remain unchanged at 6h in both sensitive and tolerant genotypes. Application of salt induced a decrease in transcript levels of TdWRKY3 (12-fold) and TdWRKY4 (14-fold) after 24 h in sensitive genotype. The expression levels of all TdWRKYs after 24 h of stress treatment, in tolerant genotype, remain constitutive (Figure 5B).
Figure 5. Expression pattern of TdWRKY1, TdWRKY3, TdWRKY4, and TdWRKY5 in response to salt stress. Expression profile of TdWRKY1, 3, 4, and 5 in leaves (A) and roots (B) from GR-salt tolerant and OR-salt sensitive durum wheat genotypes following 200 mM NaCl treatment. The levels of TdWRKY1, 3, 4, and 5 transcripts were assessed by real-time quantitative PCR. mRNA accumulation was monitored in 10-d-old roots and leaves, after 6 and 24 h of NaCl treatment (200 mM NaCl). For each sample, relative fold changes were determined by normalizing the CT value of the TdWRKY1, 3, 4, and 5 genes in different tissues in both sensitive and tolerant varieties to the CT value of Td26S (internal control) and calculating the ratios of transcripts abundance compared to samples from untreated tissues, using the formula 2−ΔΔCT. ΔΔCT refers to fold differences in TdWRKY1, 3, 4, and 5 expression relative to untreated tissues. Log2-transformed signal ratio was carried out. Values are means ±SD (n ≥ 30 plants were pooled for RNA extraction) of three replicates. Asterisks * are used to represent P values: *0.01 < P < 0.05 (Statistically significant); *0.01 < P < 0.05; **P < 0.01 (Statistically highly significant) (Student's t-test).
TdWRKY2 carries a deletion within the fourth and last exon, just after the position encoding the second Cys of the zinc finger motif (Figure S1). By qRT-PCR analysis, we verified disruption of the gene. 2 primer pairs positioned 3′ around the deletion site did not amplify any product (Table S2). As a disruption of the zinc finger motif has been shown to completely abolish the W-box–specific DNA binding activity of WRKY transcription factors (Maeo et al., 2001), it is very likely that TdWRKY2 is a true loss-of-function gene, as was suggested for TdWRKY2 protein.
TE Expansion Responsible for Durum Wheat Genome Organization
Despite the accumulation of complete plant genome sequences, the most comprehensive studies on the organization of gene space throughout the sequence were carried from individual BAC clones or broader regions composed of BAC straddling, also called contigs (Brenner et al., 1993; Vitte and Bennetzen, 2006; Liu et al., 2007). We obtained, from the representative 6 BACs, a cumulative sequence length of almost 1 Mb. The TE proliferation was pronounced (representing 74.6%). LTR (Long Terminal Repeats, class I) appears to be the most represented in durum wheat sequences (63.6%) and among all grasses (Devos, 2010). Class II DNA transposons are generally less invasive than the retrotransposons in plant genomes. Durum wheat class II representation is slightly lower than bread wheat (16%) with 7.9%. Within class II members, MITE elements represent only 0.5% on durum wheat sequences; the same percentage is observed in bread wheat (Choulet et al., 2010). The composition of durum wheat is closely comparable to bread wheat composition and consequently to maize genome one (Schnable et al., 2009).
Although they are representative of the abundant wheat TEs available in the TREP database (Wicker et al., 2002, 2007; http://botserv2.uzh.ch/kelldata/trep-db/TEClassification.html), the class I and class II TEs observed in the genomic sequences of the wheat genomes may not cover all wheat TEs. It is expected that more wheat TEs will be identified, as more wheat genomic sequences become available and as more de novo TE annotation tools are developed (Choulet et al., 2010; Flutre et al., 2011). This is particularly supported by the identification in this study of different novel TE families, most of which are retrotransposons (17.9% of cumulative sequence).
Durum Wheat Genes Clustered into Small Islands
Our data show that durum wheat genes are clustered mainly into several very small islands (from one to four genes) per BAC, separated by large blocks of repetitive elements. Overall durum wheat gene density was estimated at one gene every 80 Kb.
Gene islands reflect a proliferation of the genome. They are common features of large and repetitive genomes, such as wheat and maize genomes and are not found in small genomes such as rice, Arabidopsis thaliana (The Arabidopsis Genome Initiative, 2000), and Brachypodium (Huo et al., 2009). In fact, two BAC contigs (961 and 594 kb), in maize, were analyzed and gene blocks varied between one and four genes per block (Kronmiller and Wise, 2008). This suggests that within large plant genomes, gene islands may originate from a specific selection against the separation of genes by TE insertions that would be deleterious for gene expression or regulation and second a homogeneous expansion combined with preferential deletions in gene-rich regions (Choulet et al., 2010).
Genes from islands would share common functional characteristics (Hurst et al., 2004). Genes are maintained close to one another because this configuration would provide a selection advantage and functional significance (Batada et al., 2007; Janga et al., 2008). This is confirmed by the identification of co-expressed genes or relatives sharing the same functions that are conserved between the Arabidopsis, rice and poplar genomes (Liu and Han, 2009). TdWRKY2 and gene2 coding for a chloride channel protein (CLC) (Table 1) (from TD16L16), involved in vacuolar compartmentation during salt stress (Hechenberger et al., 1996), might share the same function. This assumption is more defensible, because of their very close genomic distance; they are separated only by 500 bp. The same gene structure is conserved on B. distachyon orthologous region. Gene ontology and expression profiles along the chromosome in T. aestivum revealed that these islands are enriched in genes sharing the same function or expression profiles suggesting the existence of long-distance regulation mechanisms in wheat (Choulet et al., 2010).
Correlation between Sequence Similarity and Functional Similarity
Protein sequences contain important information for protein function. We found that TdWRKYs have sequence features including domains/motifs: WRKY domains, which bind WRKY proteins to the W-box motif in the promoter of target genes for transcription regulation; Plant Zinc finger motifs which function in association with WRKY domains; NLS (nuclear Localization Signal) peptides, responsible for leading proteins to the cell nucleus. And active sites such as protein kinases, which play important roles in signal transduction pathways. Special features such as Myristyl sites could function during TdWRKY5-mediated gene responses to stresses, as myristoylation sites play a vital role in membrane targeting and signal transduction in plant responses to environmental stresses (Podell and Gribskov, 2004).
Homologous genes with similar sequences are likely to have equivalent functions and to play the same functional role in equivalent biological processes. It is thus very important to identify homologous genes, especially those which are supported by experimental data. TdWRKYs from different groups (Ia, IIa, and IIc) have a close relationship with orthologous genes from barley, Brachypodium, bread wheat, maize, sorghum, aegilops, T. urartu, rice and Arabidopsis. Genes from T. aestivum, Ae. tauschii, T. urartu, and H. vulgare shared the closest similarity, while rice, sorghum and maize were out-grouped from other monocot plants.
As summarized in Table 2, the promoter regions of TdWRKYs were highly rich in cis-acting elements, and most of them were related to stress-induced gene expression, suggesting the putative role of essentially TdWRKY1, 3, 4, and 5 genes in wheat responses to a variety of environmental stresses. A TBLASTN on NCBI nucleotide collection database of the TdWRKYs 1.5 Kb cis-elements showed that TdWRKY1-UTR region shares 92% similarity from base 1 to 419 bp with T. aestivum 3B genome scaffold HG670306.1 from 28534545576 to 285345992. This 419 bp contains 4 W Box, 4 MYB, 5MYC, 4RAV, and 1 ABRE elements. 1.5 Kb TdWRKY2-UTR (632-949) region also shares 84% similarity with HG670306.1 (74734822-74734532). Elements which might be common to the orthologous regions are 3 Wbox, 4MYB, and 4MYC. 100% of homology was found between the 1.5 Kb from TdWRKY5 putative cis-elements and its homolog HG670306 from coordinate 436276014 to coordinate 436275955. They share all the cis-elements on Table 2. Finally, 99% was shared between all UTR regions from KC174859.1 (TaWRKY53) and 1.5 Kb from 736 to 1501. This region contains 4 Wbox, 6 MYB, 1 MYC, 4 GCC cores, 2 RAV, 2 LTREs, and 3 ABREs. Two hypotheses might be formulated. Firstly, genes that share the same acting elements on their promoters may function the same way and, secondly, homologous cis-acting elements of TdWRKY1, TdWRKY2, and TdWRKY4 are assigned to the 3B chromosome in bread wheat, suggesting that TdWRKY1, 2, and 4 might be also accommodated within chromosome 3B or 3A from T. turgidum.
Promoter region analyses, gene structure within orthologous members, and phylogeny studies all indicated that TdWRKYs are novel members of the WRKY family in durum wheat and, given their high sequence homology with orthologous monocots and Arabidopsis' known function, it can be postulated that they play similar roles in mediating responses to biotic and abiotic stresses.
Differential Expression of TdWRKYS
In a context where no functional characterization has yet been carried out for WRKYs in durum wheat, our data showed that the TdWRKY genes were inducible by high-salt treatment. Moreover, TdWRKY expression in response to salt stress displayed distinctive patterns in two durum wheat genotypes with contrasting behavior regarding tolerance to abiotic stresses. In the tolerant GR variety, TdWRKYs were strongly induced by salt stress within a few hours (6 h), while it was downregulated in the sensitive OR variety. Notably, differences between tolerant and sensitive genotypes were detected, mainly in the expression levels in tolerant genotype leaves at 24 h stress treatment. Peng et al. (2014) showed that WRKY members' unigenes were mostly up-regulated under salt stress in cotton (Gossypium hirsutum L.). They noted that some WRKY genes were expressed in the salt-tolerant genotype Earlistaple 7, but were repressed, weakly induced, or not induced at all in salt-sensitive Nan Dan Ba Di Da Hua, within 24 h. HvvWRKY2 was induced by salt stress in TR1 (Tolerant variety) but not in TS1 (Sensitive variety) (Li et al., 2014). Similarly, expression of wheat TaWRKY2 and TaWRKY19 was induced by salt, and both TaWRKY2 and TaWRKY19 enhanced salt tolerance in transgenic Arabidopsis plants compared with wild type (Niu et al., 2012). The induction pattern showed that the highest gene expression occurs at 3–6 h after salt stress initiation, for TaWRKY2 and TaWRKY19 and at 6 and 24 h for HvvWRKY2, which is likely to be the same for TdWRKY members.
The durum wheat complexity and large genome size (13,000 Mb) have largely prevented the development of genomic resources. Meanwhile, efforts have focused on sequencing of target regions selected as covering one or more genes of interest, called locus. Several BAC clones covering the corresponding region in one or more wheat genomes were sequenced as well.
In this study, we targeted durum wheat BAC clones harboring TdWRKY genes potentially involved in response to salt stress. We validated six BAC clones relative to the genomic library TtuLDN65. The size of these clone inserts after sequencing varies between 120 and 190 Kb. The added value of such an approach is that we obtain the coding sequence with introns and promoter regions that are essential for expression or functional study. Furthermore, it enabled us to access the entire genomic environment of the coding sequence that might provide new information about the structure, conservation, position, order and genomic dynamics. A structural study of the environment of a gene encoding a resistance or tolerance protein can bring a multitude of information that can enrich functional study.
In this study, we identified 5 WRKY genes potentially involved in the salt stress tolerance. This article reports the identification of 5 novel TdWRKY genes. Sequence comparison with orthologs from barley, wheat and Brachypodium provide valuable information for determining gene structure. TdWRKY1, TdWRKY3, TdWRKY4, and TdWRKY5 gene sequences were highly conserved as well as exon-intron boundaries, even with Arabidopsis. These important structural similarities, between orthologous WRKY genes, are indicative of a potential functional conservation.
HB, MB, AG initiated the project. FY, EM, WM performed experiments. FY performed analysis and interpretation of data for the work. FY wrote the paper. HB, WM revised the paper critically for important intellectual content.
Conflict of Interest Statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
This research was realized at the French Plant Genomic Resource Center (CNRGV) (INRA, Toulouse) and Laboratoire de Genomique et Biotechnologie des Fruits (GBF) (INRA, Toulouse). The authors would like to thank Philippe Leroy from (INRA, Clermont-Ferrand) for the Triannot pipelines training.
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/article/10.3389/fpls.2016.02034/full#supplementary-material
ABRE, ABA-responsive element; BAC, bacterial artificial chromosome; CRT, C-repeat element; DRE, dehydration-responsive element; ERF, ethylene response factor; EST, expressed sequence tag; LTRE, low-temperature-responsive element; NLS, nuclear localization signal; ORF, open reading frame; TE, transposable element; TF, transcription factor; UTR, untranslated region.
Abe, H., Yamaguchi-Shinozaki, K., Urao, T., Iwasaki, T., Hosokawa, D., and Shinozaki, K. (1997). Role of Arabidopsis MYC and MYB homologs in drought- and abscisic acid-regulated gene expression. Plant Cell 9, 1859–1868.
Baker, S. S., Wilhelm, K. S., and Thomashow, M. F. (1994). The 5′-region of Arabidopsis thaliana cor15a has cis-acting elements that confer cold-, drought- and ABA-regulated gene expression. Plant Mol. Biol. 24, 701–713. doi: 10.1007/BF00029852
Bencke-Malato, M., Cabreira, C., Wiebke-Strohm, B., Bücker-Neto, L., Mancini, E., Osorio, M. B., et al. (2014). Genome-wide annotation of the soybean WRKY family and functional characterization of genes involved in response to Phakopsora pachyrhizi infection. BMC Plant Biol. 14:236. doi: 10.1186/s12870-014-0236-0
Bhattarai, K. K., Atamian, H. S., Kaloshian, I., and Eulgem, T. (2010). WRKY72-type transcription factors contribute to basal immunity in tomato and Arabidopsis as well as gene-for-gene resistance mediated by the tomato R gene Mi-1. Plant J. 63, 229–240. doi: 10.1111/j.1365-313X.2010.04232.x
Brenner, S., Elgar, G., Sandford, R., Macrae, A., Venkatesh, B., and Aparicio, S. (1993). Characterization of the pufferfish (Fugu) genome as a compact model vertebrate genome. Nature. 366, 265–268. doi: 10.1038/366265a0
Cenci, A., Chantret, N., Kong, X., Gu, Y., Anderson, O. D., Fahima, T., et al. (2003). Construction and characterization of a half million clone BAC library of durum wheat (Triticum turgidum ssp. durum). Theor. Appl. Genet. 107, 931–939. doi: 10.1007/s00122-003-1331-z
Cenci, A., Somma, S., Chantret, N., Dubcovsky, J., and Blanco, A. (2004). PCR identification of durum wheat BAC clones containing genes coding for carotenoid biosynthesis enzymes and their chromosome localization. Genome 47, 911–917. doi: 10.1139/g04-033
Choulet, F., Wicker, T., Rustenholz, C., Paux, E., Salse, J., Leroy, P., et al. (2010). Megabase level sequencing reveals contrasted organization and evolution patterns of the wheat gene and transposable element spaces. Plant Cell 22, 1686–1701. doi: 10.1105/tpc.110.074187
Cifarelli, R. A., D'Onofrio, O., Grillo, R., Mango, T., Cellini, F., Piarulli, L., et al. (2013). Development of a new wheat microarray from a durum wheat totipotent cDNA library used for a powdery mildew resistance study. Cell. Mol. Biol. Lett. 18, 231–248. doi: 10.2478/s11658-013-0086-z
Ciolkowski, I., Wanke, D., Birkenbihl, R., and Somssich, I. E. (2008). Studies on DNA binding selectivity of WRKY transcription factors lend structural clues into WRKY domain function. Plant Mol. Biol. 68, 81–92. doi: 10.1007/s11103-008-9353-1
Dubcovsky, J., Ramakrishna, W., SanMiguel, P. J., Busso, C. S., Yan, L., Shiloff, B. A., et al. (2001). Comparative sequence analysis of colinear barley and rice bacterial artificial chromosomes1. Plant Physiol. 125, 1342–1353. doi: 10.1104/pp.125.3.1342
Dubouzet, J. G., Sakuma, Y., Ito, Y., Kasuga, M., Dubouzet, E. G., Miura, S., et al. (2003). OsDREB genes in rice, Oryza sativa L., encode transcription activators that function in drought-, high-salt- and cold-responsive gene expression. Plant J. 33, 751–763. doi: 10.1046/j.1365-313X.2003.01661.x
Dvorak, J. (1988). “Cytogenetical and molecular inferences about the evolution of wheat,” in 7th International Wheat Genetics Symposium, eds T. E. Miller and R. M. D. Koebner (Cambridge: Institute of Plant Science Research), 187–192.
Eulgem, T., Rushton, P. J., Schmelzer, E., Hahlbrock, K., and Somssich, I. E. (1999). Early nuclear events in plant defence signalling: rapid gene activation by WRKY transcription factors. EMBO J. 18, 4689–4699. doi: 10.1093/emboj/18.17.4689
Fernandez-Calvin, B., and Orellana, J. (1994). Metaphase I-bound arms frequency and genome analysis in wheat-Aegilops hybrids. Similar relationships between the B genome of wheat and S and Sl genomes of Ae. speltoides, Ae. longissima and Ae. sharonensis. Theor. Appl. Genet. 88, 1043–1049. doi: 10.1007/BF00220814
Flutre, T., Duprat, E., Feuillet, C., and Quesneville, H. (2011). Considering transposable element diversification in de novo annotation approaches. PLoS ONE 6:e16526. doi: 10.1371/journal.pone.0016526
Guillaumie, S., Mzid, R., Méchin, V., Léon, C., Hichri, I., Destrac-Irvine, A., et al. (2010). The grapevine transcription factor WRKY2 influences the lignin pathway and xylem development in tobacco. Plant Mol. Biol. 72, 215–234. doi: 10.1007/s11103-009-9563-1
Hechenberger, M., Schwappach, B., Fischer, W. N., Frommer, W. B., Jentsch, T. J., and Steinmeyer, K. (1996). A family of putative chloride channels from Arabidopsis and functional complementation of a yeast strain with a CLC gene disruption. J. Biol. Chem. 271, 33632–33638. doi: 10.1074/jbc.271.52.33632
Huo, N., Vogel, J. P., Lazo, G. R., You, F. M., Ma, Y., McMahon, S., et al. (2009). Structural characterization of Brachypodium genome and its syntenic relationship with rice and wheat. Plant Mol. Biol. 70, 47–61. doi: 10.1007/s11103-009-9456-3
Jiang, Y., and Deyholos, M. K. (2009). Functional characterization of Arabidopsis NaCl-inducible WRKY25 and WRKY33 transcription factors in abiotic stresses. Plant Mol. Biol. 69, 91–105. doi: 10.1007/s11103-008-9408-3
Jiang, Y., Duan, H., Yin, J., Ye, S., Zhu, J., and Zhang, F. (2014). Genome-wide identification and characterization of the Populus WRKY transcription factor family and analysis of their expression in response to biotic and abiotic stresses. J. Exp. Bot.65, 6629–6644. doi: 10.1093/jxb/eru381
Kagaya, Y., Ohmiya, K., and Hattori, T. (1999). RAV1, a novel DNA-binding protein, binds to bipartite recognition sequence through two distinct DNA binding domains uniquely found in higher plants. Nucleic Acids Res. 27, 470–478. doi: 10.1093/nar/27.2.470
Knoth, C., Ringler, J., Dangl, J. L., and Eulgem, T. (2007). Arabidopsis WRKY70 is required for full RPP4-mediated disease resistance and basal defense against Hyaloperonosporaparasitica. Mol. Plant Microbe Interact. 20, 120–128. doi: 10.1094/MPMI-20-2-0120
Larkin, M. A., Blackshields, G., Brown, N. P., Chenna, R., Mc Gettigan, P. A., McWilliam, H., et al. (2007). Clustal W and Clustal X version 2.0. Bioinformatics 23, 2947–2948. doi: 10.1093/bioinformatics/btm404
Leroy, P., Guilhot, N., Sakai, H., Bernard, A., Choulet, F., Theil, S., et al. (2012). TriAnnot: a versatile and high performance pipeline for the automated annotation of plant genomes. Front. Plant Sci. 3:5. doi: 10.3389/fpls.2012.00005
Li, H., Guo, Q., Lan, X., Zhou, Q., and Wei, N. (2014). Comparative expression analysis of five WRKY genes from Tibetan hulless barley under various abiotic stresses between drought-resistant and sensitive genotype. Acta Physiol. Plant. 36, 963–973. doi: 10.1007/s11738-013-1475-7
Liu, D., Leib, K., Zhao, P., Kogel, K. H., and Langen, G. (2014). Phylogenetic analysis of barley WRKY proteins and characterization of HvWRKY1 and -2 as repressors of the pathogen-inducible gene HvGER4c. Mol. Genet. Genomics 289, 1331–1345. doi: 10.1007/s00438-014-0893-6
Liu, Z. Q., Lu, Y., Wu, Z., Mei, C., Lu, K., and Yu, T. O. (2012). Cooperation of three WRKY-domain transcription factors WRKY18, WRKY40, and WRKY60 in repressing two ABA-responsive genes ABI4 and ABI5 in Arabidopsis. J. Exp. Bot. 63, 6371–6392. doi: 10.1093/jxb/ers293
Maeo, K., Hayashi, S., Kojima-Suzuki, H., Morikami, A., and Nakamura, D. (2001). Role of conserved residues of the WRKY domain in the DNA-binding of tobacco WRKY family proteins. Biosci. Biotechnol. Biochem. 65, 2428–2436. doi: 10.1271/bbb.65.2428
Makhloufi, E., Yousfi, F. E., Marande, W., Mila, I., Hanana, M., Bergès, H., et al. (2014). Isolation and molecular characterization of ERF1, an ethylene response factor gene from durum wheat (Triticum turgidum L. subsp. durum), potentially involved in salt-stress responses. J. Exp. Bot. 65, 6359–6371. doi: 10.1093/jxb/eru352
Marè, C., Mazzucotelli, E., Crosatti, C., Francia, E., Stanca, A. M., and Cattivelli, L. (2004). Hv-WRKY38: a new transcription factor involved in cold- and drought-response in barley. Plant Mol. Biol. 55, 399–416. doi: 10.1007/s11103-004-0906-7
Mingyu, Z., Zhengbin, Z., Shouyi, C., Jinsong, Z., and Hongbo, S. (2012). WRKY transcription factor superfamily: structure, origin and functions. Afr. J. Biotechnol. 11, 8051–8059. doi: 10.5897/AJB11.549
Monneveux, P., Zaharieva, M., and Rekika, D. (2000). “The utilisation of Triticum and Aegilops species for the improvement of durum wheat,” in Presented at Durum Wheat Improvement in the Mediterranean Region: New Challenges Zaragoza (2000-04-12 – 2000-04-14). Available online at: http://prodinra.inra.fr/record/158629
Nieva, C., Busk, P. K., Domínguez-Puigjaner, V., Lumbreras, P. S., Testillano, M. C., and Risueño, M. (2005). Isolation and functional characterisation of two new bZIP maize regulators of the ABA responsive gene rab28. Plant Mol. Biol. 58, 899–914. doi: 10.1007/s11103-005-8407-x
Niu, C. F., Wei, W., Zhou, Q. Y., Tian, A. G., Hao, Y. J., Zhang, W. K., et al. (2012). Wheat WRKY genes TaWRKY2 and TaWRKY19 regulate abiotic stress tolerance in transgenic Arabidopsis plants. Plant Cell. Environ. 35, 1156–1170. doi: 10.1111/j.1365-3040.2012.02480.x
Peng, Z., He, S., Gong, W., Sun, J., Pan, Z., and Xu, F. (2014). Comprehensive analysis of differentially expressed genes and transcriptional regulation induced by salt stress in two contrasting cotton genotypes. BMC Genomics 15:760. doi: 10.1186/1471-2164-15-760
Quesneville, H., Bergman, C. M., Andrieu, O., Autard, D., Nouaud, D., Ashburner, M., et al. (2005). Combined evidence annotation of transposable elements in genome sequences. PLoS Comput. Biol. 1:e22. doi: 10.1371/journal.pcbi.0010022
Schnable, P. S., Ware, D., Fulton, R. S., Stein, J. C., Wei, F., Pasternak, S., et al. (2009). The B73 maize genome: complexity, diversity, and dynamics. Science 326, 1112–1115. doi: 10.1126/science.1178534
Shinozaki, K., and Dennis, E. S. (2003). Cell signalling and gene regulation: global analyses of signal transduction and gene expression profiles. Curr. Opin. Plant Biol. 6, 405–409. doi: 10.1016/S1369-5266(03)00093-1
Skinner, J. S., vonZitzewitz, J., Szucs, P., Marquez-Cedillo, L., Filichkin, T., Amundsen, K., et al. (2005). Structural, functional, and phylogenetic characterization of a large CBF gene family in barley. Plant Mol. Biol. 59, 533–551. doi: 10.1007/s11103-005-2498-2
Svensson, J. T., Crosatti, C., Campoli, C., Bassi, R., Stanca, A. M., Close, T. J., et al. (2006). Transcriptome analysis of cold acclimation in barley albina and xantha mutants. Plant Physiol. 141, 257–270. doi: 10.1104/pp.105.072645
The International Wheat Genome Sequencing Consortium (IWGSC) (2014). A chromosome-based draft sequence of the hexaploid bread wheat (Triticum aestivum) genome. Science 345:1251788. doi: 10.1126/science.1251788
Towler, D. A., Gordon, J. I., Adams, S. P., and et Glaser, L. (1988). The biology and enzymology of eukaryotic protein acylation. Annu. Rev. Biochem. 57, 69–99. doi: 10.1146/annurev.bi.57.070188.000441
Tripathi, P., Rabara, R. C., Langum, T. J., Boken, A. K., Rushton, D. L., Boomsma, D. D., et al. (2012). The WRKY transcription factor family in Brachypodium distachyon. BMC Genomics 13:270. doi: 10.1186/1471-2164-13-270
Ulm, R., Ichimura, K., Mizoguchi, T., Peck, S. C., Zhu, T., Wang, X., et al. (2002). Distinct regulation of salinity and genotoxic stress responses by Arabidopsis MAP kinase phosphatase 1. EMBO J. 21, 6483–6493. doi: 10.1093/emboj/cdf646
Veras, A. A., de Sá, P. H., Azevedo, V., Silva, A., and Ramos, R. T. J. (2013). AutoAssemblyD: a graphical user interface system for several genome assemblers. Bioinformation 9, 840–841. doi: 10.6026/97320630009840
Vitte, C., and Bennetzen, J. L. (2006). Analysis of retrotransposon structural diversity uncovers properties and propensities in angiosperm genome evolution. Proc. Natl. Acad. Sci. 103, 17638–17643. doi: 10.1073/pnas.0605618103
Wei, K., Chen, J., Chen, Y. F., Wu, L. J., and Xie, D. O. (2012). Molecular phylogenetic and expression analysis of the complete WRKY transcription factor family in maize. DNA Res. 19, 153–164. doi: 10.1093/dnares/dsr048
Wei, W., Hu, Y., Han, Y. T., Zhang, K., Zhao, F. L., and Feng, J. Y. (2016). The WRKY transcription factors in the diploid woodland strawberry Fragaria vesca: identification and expression analysis under biotic and abiotic stresses. Plant Physiol. Biochem. 105, 129–144. doi: 10.1016/j.plaphy.2016.04.014
Wicker, T., Sabot, F., Hua-Van, A., Bennetzen, J. L., Capy, P., Chalhoub, B., et al. (2007). A unified classification system for eukaryotic transposable elements. Nat. Rev. Genet. 8, 973–982. doi: 10.1038/nrg2165
Wicker, T., Zimmermann, W., Perovic, D., Paterson, A. H., Ganal, M., Graner, A., et al. (2005). A detailed look at 7 million years of genome evolution in a 439 kb contiguous sequence at the barley Hv-eIF4E locus: recombination, rearrangements and repeats. Plant J. 41, 184–194. doi: 10.1111/j.1365-313X.2004.02285.x
Wu, L., Chen, X., Ren, H., Zhang, Z., Zhang, H., Wang, J., et al. (2007). ERF protein JERF1 that transcriptionally modulates the expression of abscisic acid biosynthesis-related gene enhances the tolerance under salinity and cold in tobacco. Planta 226, 815–825. doi: 10.1007/s00425-007-0528-9
Xie, Z., Zhang, Z. L., Zou, X., Huang, J., Ruas, P., Thompson, D., et al. (2005). Annotations and functional analyses of the rice WRKY gene superfamily reveal positive and negative regulators of abscisic acid signaling in aleurone cells. Plant Physiol. 137, 176–189. doi: 10.1104/pp.104.054312
Yamaguchi-Shinozaki, K., and Shinozaki, K. (2005). Organization of cis-acting regulatory elements in osmotic- and cold-stress-responsive promoters. Trends Plant Sci. 10, 88–94. doi: 10.1016/j.tplants.2004.12.012
Yamamoto, S., Nakano, T., Suzuki, K., and etShinshi, H. (2004). Elicitor-induced activation of transcription via W box-related cis-acting elements from a basic chitinase gene by WRKY transcription factors in tobacco. Biochim. Biophys. Acta 1679, 279–287. doi: 10.1016/j.bbaexp.2004.07.005
Yamasaki, K., Kigawa, T., Inoue, M., Tateno, M., Yamasaki, T., Yabuki, T., et al. (2005). Solution structure of an Arabidopsis WRKY DNA binding domain. Plant Cell 17, 944–956. doi: 10.1105/tpc.104.026435
Yang, P., Chen, C., Wang, Z., Fan, B., and et Chen, Z. (1999). Pathogen- and salicylic acid induced WRKY DNA-binding activity recognizes the elicitor response element of the tobacco class I chitinase gene promoter. Plant J. 18, 141–149. doi: 10.1046/j.1365-313X.1999.00437.x
Keywords: BAC sequencing, cis-regulatory elements, WRKY, transposable elements, salt stress, Triticum durum, wheat
Citation: Yousfi F-E, Makhloufi E, Marande W, Ghorbel AW, Bouzayen M and Bergès H (2017) Comparative Analysis of WRKY Genes Potentially Involved in Salt Stress Responses in Triticum turgidum L. ssp. durum. Front. Plant Sci. 7:2034. doi: 10.3389/fpls.2016.02034
Received: 29 June 2016; Accepted: 20 December 2016;
Published: 31 January 2017.
Edited by:Teresa Donze, West Chester University, USA
Reviewed by:Hao Wang, University of Georgia, USA
Dong-Ha Oh, Louisiana State University, USA
Zoe Joly-Lopez, New York University, USA
Copyright © 2017 Yousfi, Makhloufi, Marande, Ghorbel, Bouzayen and Bergès. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Hélène Bergès, firstname.lastname@example.org