ORIGINAL RESEARCH article

Front. Plant Sci., 22 October 2019

Sec. Plant Pathogen Interactions

Volume 10 - 2019 | https://doi.org/10.3389/fpls.2019.01332

Further Elucidation of the Argonaute and Dicer Protein Families in the Model Grass Species Brachypodium distachyon

  • Institute of Phytopathology, Centre for BioSystems, Land Use and Nutrition, Justus Liebig University, Giessen, Germany

Abstract

RNA interference (RNAi) is a biological process in which small RNAs regulate gene silencing at the transcriptional or posttranscriptional level. The trigger for gene silencing is double-stranded RNA generated from an endogenous genomic locus or a foreign source, such as a transgene or virus. In addition to regulating endogenous gene expression, RNAi provides the mechanistic basis for small RNA-mediated communication between plant hosts and interacting pathogenic microbes, known as cross-kingdom RNAi. Two core protein components, Argonaute (AGO) and Dicer (DCL), are central to the RNAi machinery of eukaryotes. Plants encode for several copies of AGO and DCL genes; in Arabidopsis thaliana, the AGO protein family contains 10 members, and the DCL family contains four. Little is known about the conservation and specific roles of these proteins in monocotyledonous plants, which account for the most important food staples. Here, we utilized in silico tools to investigate the structure and related functions of AGO and DCL proteins from the model grass Brachypodium distachyon. Based on the presence of characteristic domains, 16 BdAGO- and 6 BdDCL-predicted proteins were identified. Phylogenetic analysis showed that both protein families were expanded in Brachypodium as compared with Arabidopsis. For BdDCL proteins, both plant species contain a single copy of DCL1 and DCL4; however, Brachypodium contains two copies each of DCL2 and DCL3. Members of the BdAGO family were placed in all three functional clades of AGO proteins previously described in Arabidopsis. The greatest expansion occurred in the AtAGO1/5/10 clade, which contains nine BdAGOs (BdAGO5/6/7/9/10/11/12/15/16). The catalytic tetrad of the AGO P-element-induced wimpy testis domain (PIWI), which is required for endonuclease activity, is conserved in most BdAGOs, with the exception of BdAGO1, which lacks the last D/H residue. Three-dimensional modeling of BdAGO proteins using tertiary structure prediction software supported the phylogenetic classification. We also predicted a provisional interactome network for BdAGOs, their localization within the cell, and organ/tissue-specific expression. Exploring the specifics of RNAi machinery proteins in a model grass species can serve as a proxy for agronomically important cereals such as barley and wheat, where the development of RNAi-based plant protection strategies is of great interest.

Introduction

RNA interference (RNAi) is a regulatory mechanism utilized by most eukaryotes for endogenous gene silencing and protection against mobile repetitive sequences, transposons, and viruses (Fire et al., 1998; Wilson and Doudna, 2013). In contrast to transcriptional gene silencing (TGS), which results in the methylation of DNA and/or histones, posttranscriptional gene silencing (PTGS) operates by transcript degradation or translation inhibition. Selection of the target for silencing is governed by sequence complementarity between a single-stranded small RNA (sRNA) and the target RNA. Beyond its native role, the RNAi machinery has been exploited for developing novel plant protection strategies based on double-stranded (ds)RNA applications. Delivery of artificial dsRNA through transgene expression [host-induced gene silencing (HIGS)] or exogenous application [spray-induced gene silencing (SIGS)] was proven effective against fungal pathogens (Nowara et al., 2010; Koch et al., 2013; Koch et al., 2016; Wang et al., 2016), nematodes (Dutta et al., 2015), insects (Coleman et al., 2014; Abdellatef et al., 2015; Head et al., 2017), and parasitic plants (Tomilov et al., 2008; for review, see Andrade and Hunter, 2016; Cai et al., 2018). A recent discovery revealed that RNAi also is involved in natural cross-kingdom RNA communication (ckRNAi), where sRNA molecules function as mediators that are exchanged bidirectionally between a host plant and a microbial pathogen to silence their target transcripts and impact the outcome of the plant–pathogen interaction (Weiberg et al., 2013; Zhang et al., 2016; Wang et al., 2017a; Wang et al., 2017b).

Regardless of which RNAi-based process or application is involved, evolutionarily conserved protein components, including Dicer [termed Dicer-like (DCL) in plants] and Argonaute (AGO), play key roles in dsRNA processing. Dicers and DCLs are RNase III endonucleases that process exogenously supplied or endogenously generated ds- or hairpin (hp)-containing RNA precursors into various species of dsRNAs, commonly 21–24 nucleotides (nt) in length. These sRNAs are then loaded onto specific AGO proteins, which are components of the RNA-induced silencing complex (RISC). The loaded sRNA is processed into a single-stranded sRNA molecule, which then guides the RISC to complementary targets in the cytoplasm or nucleus. Depending on the biological context, target recognition leads to PTGS via RNA degradation, which may be mediated by the AGO protein’s slicer activity, or inhibition of translation, or to TGS via genomic DNA and/or histone methylation (Carthew and Sontheimer, 2009; Poulsen et al., 2013; Borges and Martienssen, 2015; Fang and Qi, 2016).

Phylogenetic analysis of genes belonging to the Dicer family suggests that they arose early in the evolution of eukaryotes and that their duplication and diversification correlated with the development of multicellularity and the need for complex gene regulation (Mukherjee et al., 2013). In plants, the structure and function of DCL proteins have been investigated most intensively in Arabidopsis, which expresses four DCLs (Schauer et al., 2002). The domain architecture of these proteins, like that of other eukaryotic Dicers, generally consists of an amino-terminal DEXDc and helicase-C (HELICc) domain, which are thought to mediate processive movement along a dsRNA, a dicer-dimer (heterodimerization) domain that facilitates binding with protein partners (Qin et al., 2010), a P-element-induced wimpy testis (PIWI)–Argonaute–Zwille (PAZ) domain, which binds the 3’ end of the dsRNA, two RIBOc (ribonuclease III family) domains and at least one dsRNA-binding motif (DSRM) domain at the C terminus (Schauer et al., 2002; Mukherjee et al., 2013; Bologna and Voinnet, 2014; Song and Rossi, 2017).

Analyses of Arabidopsis mutants revealed that the four DCLs generate different types of sRNAs, although some functional redundancy was observed (Gasciolli et al., 2005; Bologna and Voinnet, 2014; Borges and Martienssen, 2015). AtDCL1 produces microRNAs (miRNAs), a class of sRNAs that regulates endogenous gene expression via PTGS (Bartel, 2004). The remaining AtDCLs generate various subclasses of small interfering RNAs (siRNAs), including i) natural-antisense-transcript (nat)-siRNAs generated by AtDCL2 (Borsani et al., 2005), ii) trans-activating (ta)-siRNAs produced by AtDCL4 (Dunoyer et al., 2005), and iii) TGS-related 24-nt siRNAs generated by AtDCL3, which are responsible for silencing transposons and other repeated DNA sequences (Xie et al., 2004).

Despite the diversity of sRNAs, their association with AGO proteins and the RISC complex is a common feature. AGO proteins were named after the tube-shaped leaves of Arabidopsis ago1 mutants, which resemble the tentacles of the pelagic octopus, Argonauta argo (Bohmert et al., 1998). AGO proteins are highly conserved in nature, although the size of this family varies substantially between species (Höck and Meister, 2008; Zhang et al., 2015; Fang and Qi, 2016; You et al., 2017).

AGOs have a high level of structure and domain conservation between the prokaryotic and eukaryotic variants, even when the biological function clearly differs (Willkomm et al., 2015). Several prokaryotic (Wang et al., 2009; Liu et al., 2018) and eukaryotic (Lingel et al., 2003; Lingel et al., 2004; Boland et al., 2011) complete AGO structures or individual domains have been crystallographically resolved. Several human AGO proteins have been crystallized, namely, AGO2 in complex with a miRNA (Elkayam et al., 2012), AGO1 (Faehnle et al., 2013), and AGO3 (Park et al., 2017), showing that the AGO activity is dependent on conservation of active site residues and their interaction with other protein regions. The structures of AtAGO proteins have been partly resolved, especially the middle (MID) domain of AtAGO1, AtAGO2, and AtAGO5 (Frank et al., 2012; Zha et al., 2012). Functional domains characteristic of all AGO proteins, including the Arabidopsis AGOs, are the PAZ, MID, and PIWI domains governing the binding of sRNA ends and the slicer activity (Höck and Meister, 2008; Frank et al., 2012).

In Arabidopsis, 10 different AGOs have been identified. Phylogenetic analyses have divided them into three clades, comprising AGO1/5/10, AGO2/3/7, and AGO4/6/8/9 (Vaucheret, 2008). The different clades contain AGOs that mediate PTGS or TGS after they load specific types of sRNAs, which are selected based on length and identity of the 5’ nt (Bologna and Voinnet, 2014; Zhang et al., 2015; Fang and Qi, 2016). For example, AtAGO1 is involved in endogenous developmental regulation by miRNAs (Vaucheret et al., 2004), antiviral defense (alongside AtAGO2, Harvey et al., 2011), as well as bidirectional ckRNAi (Weiberg et al., 2013). AtAGO4 and AtAGO6 are involved in DNA and histone methylation (Zilberman et al., 2003; Zheng et al., 2007). AtAGO9 is known to be involved in female gametogenesis (Olmedo-Monfil et al., 2010), while AtAGO10 competes with AtAGO1 for sRNA loading in regulation of shoot apical meristem development (Zhu et al., 2011). AtAGO7 plays a role in defense against viruses (Qu et al., 2008).

In comparison to Arabidopsis, little is known about the RNAi machinery components in monocots. The number of AGO proteins is expanded in cereals, as there are 17 AGOs in maize (Zea mays) and 19 in rice (Oryza sativa; Fang and Qi, 2016; Mirzaei et al., 2014; Patel et al., 2018). The copy number of DCL2 and/or DCL3 genes also differs between monocot species and Arabidopsis. Six predicted DCLs were identified in rice, while five were identified in maize, wheat, and barley (Margis et al., 2006). Furthermore, the DCL3b gene has diverged significantly from its DCL3a paralog (Margis et al., 2006) and, thus, is considered a distinct, monocot-specific class of Dicer, termed DCL5 (Fukudome and Fukuhara, 2017; Borges and Martienssen, 2015).

Cereals are major staple crops worldwide; however, a plethora of pathogens and pests threaten their production (Savary et al., 2012). Recent efforts to develop environmentally friendly plant protection strategies have demonstrated that HIGS and SIGS can be used in major cereal crops, such as barley and wheat, to control necrotrophic fungi (Koch et al., 2013; Koch et al., 2016; Koch et al., 2018) and aphid pests (Abdellatef et al., 2015). Developing a better understanding of the cereal AGO and DCL protein family members and their specific functions is a prerequisite for clarifying the mechanisms undergirding RNAi-mediated plant protection. Here, we use the model species for temperate grass plants, Brachypodium distachyon (Brachypodium), to investigate cereal AGO and DCL proteins. Brachypodium is self-fertile, has a small genome (∼272 Mb), a short life cycle, and established transformation protocols (Vogel et al., 2006). The commonly used diploid inbred line Bd21 is fully sequenced (The International Brachypodium Initiative, 2010). In addition, literature data reveal strong responsiveness of Brachypodium sRNA pools to abiotic stress, suggesting that the RNAi machinery is sensitive to environmental changes (Wang et al., 2015).

Based on genomic database searches and in silico analysis, we identified six BdDCL proteins, as well as 16 previously reported AGO protein sequences in Brachypodium (Mirzaei et al., 2014). Since the structure of proteins closely relates to function and thus can serve as an indication of interaction patterns and redundancy in large protein families, we especially looked into domain structure conservation in Brachypodium relative to Arabidopsis. Similar to the protein structure and interactome analysis applied in Secic et al. (2015), we subjected Bd AGO and DCL proteins to a series of in silico analysis steps. The focus of this study is the structures and related functions of the AGO-like and DCL proteins of B. distachyon, with special regard to analysis of the phylogeny and three-dimensional (3D) structure modeling of the AGO family, as compared with the more familiar Arabidopsis thaliana AGO protein family. Given that the At AGO1/5/10 clade contains proteins involved in ckRNAi, we were especially interested to define the BdAGO proteins that are structurally most related to this clade and thus potentially have a key function in plant immunity and RNAi-based plant protection.

Materials and Methods

Acquisition of Sequences and Database Search

AGO and DCL protein sequences corresponding to the primary transcripts of specific genes were acquired by searching the Plant Comparative Genomics portal Phytozome 12 (Goodstein et al., 2012) B. distachyon v3.1 database (The International Brachypodium Initiative, 2010). Proteins whose domain architecture resembled those of Arabidopsis AGO and DCL proteins were considered. The Arabidopsis AGO and DCL protein sequences were taken from The Arabidopsis Information Resource database (Rhee et al., 2003; Berardini et al., 2015). Information on resolved protein structures was acquired from the Research Collaboratory for Structural Bioinformatics Protein Data Bank (Berman et al., 2000; Burley et al., 2018). The Brachypodium eFP Browser (Sibout et al., 2017; Winter et al., 2007) was used to assess the expression of transcripts corresponding to proteins involved in this study, based on the expression atlas detailing different organs and developmental stages.

Phylogenetic Analysis, Interactome Analysis, and Localization

The phylogenetic analysis and tree rendering were done by the Phylogeny.fr web server (Dereeper et al., 2008; Dereeper et al., 2010). The operational sequence is composed of MUSCLE 3.8.31 (Edgar, 2004) for alignment with default settings, Gblocks 0.91b for removal of ambiguous regions (Castresana, 2000), PhyML 3.1/3.0 aLRT for phylogeny (Guindon and Gascuel, 2003; Anisimova and Gascuel, 2006), based on maximum likelihood, and TreeDyn 198.3 for graphical representation (Chevenet et al., 2006). Multiple sequence alignment (MSA) was done using Clustal Omega at European Molecular Biology Laboratory—European Bioinformatics Institute (Sievers et al., 2011; Goujon et al., 2010) and the conserved residues and domains visualized by the Mview multiple alignment viewer (Brown et al., 1998). Pairwise sequence alignments were done using EMBOSS Needle (Rice et al., 2000), utilizing the Needleman–Wunsch algorithm for global alignment. Domain search was conducted using Simple Modular Architecture Research Tool (SMART) in normal SMART mode (Schultz et al., 1998; Letunic and Bork, 2017) and visualized with the Illustrator for Biological Sequences (IBS) online illustrator (Liu et al., 2015). Prediction of protein location was done using the plant subcellular localization integrative predictor (PSI), which shows an integrative result based on the output of an 11-member predictor community (Liu et al., 2013). Prediction of the interactome was done using the STRING database of protein–protein associations, while searching by protein sequence (Szklarczyk et al., 2019). Resulting associations/possible interactions that originate from text mining have been excluded, and the results show only associations supported by co-expression and/or experimental data.

Three-Dimensional Structure Modeling and Validation

SWISS-MODEL, a homology-based modeling software available at the ExPASy web server (Waterhouse et al., 2018), and CPHmodels 3.2 protein homology modeling server (Nielsen et al., 2010) were both used for 3D structure prediction from the sequence data. BLAST (Camacho et al., 2009) and HHBlits (Remmert et al., 2011) template search through the SWISS-MODEL template library was done and the models built using ProMod3 (Waterhouse et al., 2018) and the target-template alignment.

QMEAN, used for validation of the predicted 3D structures, is a scoring function that considers single residues and the global model, delivering an estimation of absolute quality of the prediction (Benkert et al., 2011). In order to check the stereochemical quality of predicted structures, we used the PROCHECK program (Morris et al., 1992; Laskowski et al., 1993). One of the stereochemical parameters considered is the fitness of the model in a Ramachandran plot, which maps the allowed backbone dihedral angles of amino acids (aa) in a protein structure (Ramachandran et al., 1963). Further on, we used the WHATCHECK software (Hooft et al., 1996) to calculate the Ramachandran Z-score, which compares the quality of the query structure to structures with high confidence (Hooft et al., 1997). Lastly, we used the dDFIRE/DFIRE2 energy calculation (Yang and Zhou, 2008) to calculate free energy scores for our structure predictions.

PyMOL (The Py-MOL Molecular Graphics System) was used for visualization of the predicted structures (Schrödinger, 2010, Open-Source PyMOL 1.3).

Results

Argonaute and Dicer Protein Families Are Expanded in Brachypodium Relative to Arabidopsis

To identify AGO and DCL proteins, the B. distachyon v3.1 database (The International Brachypodium Initiative, 2010) was searched for transcripts whose encoded proteins contain the characteristic domain architecture of each protein family. The accession numbers of the acquired sequences, the names assigned to the corresponding BdAGO proteins, the location of the encoding genes, and a description of the primary transcripts are shown in Table 1. The naming convention is similar to that used by Mirzaei et al. (2014) for 16 AGO proteins identified by primary transcripts in the B. distachyon Bd21 v3.1 annotation (The International Brachypodium Initiative, 2010). Our search for BdDCL candidates within the Bd21 v3.1 database revealed nine sequences. Clear lack of functional domains or insufficient length of the deduced aa sequence reduced the number of putative DCL genes to six (Bradi1g15440, Bradi1g77087, Bradi2g23187, Bradi5g15337, Bradi1g21030, and Bradi3g29287). Accession numbers and assigned names for the encoded BdDCLs are shown in Table 2. The putative AtAGO and AtDCL protein sequences were downloaded from The Arabidopsis Information Resource database (Table S1) and included in the MSA and phylogenetic analysis.

Table 1

Assigned name of proteinPrimary transcript ID (Phytozome)LocationDescription (Phytozome)
BdAGO1Bradi2g10360.2Bd2:8611187.8615652 reversePTHR22891//PTHR22891:SF44 – Eukaryotic translation initiation factor 2C
BdAGO2Bradi2g14147.1Bd2:12806099.12812784 reversePTHR22891:SF20 – Protein AGO 4-related
BdAGO3Bradi2g10370.1Bd2:8620394.8628745 reverseAGO family, subfamily AGO4
BdAGO4Bradi4g08587.1Bd4:7715921.7724879 reversePTHR22891:SF35 – Protein AGO 6
BdAGO5Bradi1g12431.2Bd1:9307067.9313002 forwardPTHR22891//PTHR22891:SF49 – Eukaryotic translation initiation factor 2C
BdAGO6Bradi1g05162.2Bd1:3447373.3455769 forwardPTHR22891//PTHR22891:SF24 – Eukaryotic translation initiation factor 2C
BdAGO7Bradi1g28260.3Bd1:23482384.23489131 reverseAGO family, subfamily monocot-AGO1
BdAGO8Bradi1g16060.3Bd1:12986117.12991032 reverseAGO family, subfamily AGO7
BdAGO9Bradi1g36907.2Bd1:32760045.32772130 reversePTHR22891//PTHR22891:SF25 – Eukaryotic translation initiation factor 2C
BdAGO10Bradi1g54977.1Bd1:53536162.53543236 forwardPTHR22891//PTHR22891:SF36 – Eukaryotic translation initiation factor 2C
BdAGO11Bradi1g29577.1Bd1:25162908.25171156 reversePTHR22891//PTHR22891:SF57 – Eukaryotic translation initiation factor 2C
BdAGO12Bradi5g18540.1Bd5:21720455.21728815 reverseAGO family, subfamily AGO1
BdAGO13Bradi5g21810.1Bd5:24487261.24492250 forwardAGO family, subfamily AGO2/3
BdAGO14Bradi5g21800.1Bd5:24479944.24484383 forwardAGO family, subfamily AGO2/3
BdAGO15Bradi3g51077.3Bd3:51944662.51956527 forwardPTHR22891//PTHR22891:SF34 – Eukaryotic translation initiation factor 2C
BdAGO16Bradi3g60697.5Bd3:59325332.59333596 reversePTHR22891//PTHR22891:SF34 – Eukaryotic translation initiation factor 2C

Assigned names and accession numbers of BdAGO proteins as well as genomic location and description of the primary transcript (as acquired from Phytozome Bd21 v3.1 database).

Table 2

Assigned name of proteinPrimary transcript ID (Phytozome)LocationDescription (Phytozome)
BdDCL1Bradi1g77087.1Bd1:73701094.73713218 forwardPTHR14950:SF3 – ENDORIBONUCLEASE DICER HOMOLOG 1
BdDCL2aBradi1g15440.1Bd1:12353426.12376799 forwardDCL family, subfamily DCL2
BdDCL2bBradi1g21030.3Bd1:16934990.16948923 reversePTHR14950:SF19 - ENDORIBONUCLEASE DICER HOMOLOG 2
BdDCL3aBradi3g29287.1Bd3:31008845.31020951 forwardPTHR14950//PTHR14950:SF31 - HELICASE-RELATED//SUBFAMILY NOT NAMED
BdDCL3bBradi2g23187.3Bd2:20726122.20733365 reversePF00636//PF02170//PF03368 – Ribonuclease III domain (Ribonuclease_3)//PAZ domain (PAZ)//Dicer dimerization domain (Dicer_dimer)
BdDCL4Bradi5g15337.3Bd5:18845215.18867304 reversePTHR14950:SF15 – DCL 4

Assigned names and accession numbers of BdDCL proteins as well as genomic location and description of the primary transcript (as acquired from Phytozome Bd21 v3.1 database).

A phylogenetic analysis of the inferred BdAGO protein sequences relative to those of Arabidopsis AGOs is shown in Figure 1. BdAGO proteins were placed in all three AtAGO clades. Some were grouped with a specific AtAGO member within a clade (e.g., BdAGO8 was grouped with AtAGO7, and BdAGO5/6/7/10 were grouped with AtAGO5), whereas other BdAGOs were distributed throughout an entire clade (e.g., BdAGO1/2/3/4 within the AtAGO4/6/8/9 clade). In the AtAGO1/5/10 clade, BdAGO9/11/12/15/16 were interspersed with AtAGO1 and AtAGO10. These findings suggest that the structural and functional differences of AtAGO proteins are translated to the expanded Brachypodium family. Phylogenetic analysis of the inferred BdDCL proteins showed that they strongly aligned with individual members of the Arabidopsis DCL family (Figure 2). Like Arabidopsis, Brachypodium contains a single ortholog of DCL1 and DCL4; however, expansion of the BdDCL family has led to the presence of two copies of both DCL2 and DCL3, as compared with the single ortholog present in Arabidopsis. Sequence comparisons revealed that DCL2a and DCL2b share 82.5% similarity at the aa level, while BdDCL3a and BdDCL3b share 44.2% similarity (aa, global alignment). Together, the phylogenetic trees show distinct branches interspersing Arabidopsis and Brachypodium homologues in functional clades; to our knowledge, this is the first indication of how the expansion of AGO and DCL protein families in Brachypodium relates to the specific clades and/or functional diversity of the corresponding Arabidopsis proteins.

Figure 1

Figure 2

Predicted Domains of BdAGO and BdDCL Proteins Indicate Structure Conservation

Next, we executed a domain search using SMART to elucidate the structures and functions of the 16 BdAGO proteins and six BdDCLs. The domain structure visualization of BdAGO (Figure 3) and BdDCL (Figure 4) proteins highlights the differences between members of each protein family with respect to the positions and presence/absence of the typically conserved domains. Detailed domain prediction data, as acquired by SMART/Pfam search, and the corresponding confidence values are shown for BdAGO (Table S2) and BdDCL proteins (Table S3). Consistent with other eukaryotic AGO proteins, many members of the BdAGO family are predicted to have four characteristic functional domains, including the N-terminal domain, PAZ, MID, and PIWI domain (Zhang et al., 2014). However, while the domain prediction results identified a variable N-t domain in most BdAGOs that consisted of both an N-domain and a DUF1785 domain (Poulsen et al., 2013), BdAGO1 and BdAGO13 contained only the DUF1785 domain. In addition, BdAGO1, BdAGO2, BdAGO3, BdAGO4, BdAGO13, and BdAGO14 were not predicted to contain a MID domain, in comparison to a previous report (Mirzaei et al., 2014). MSA performed by Clustal Omega on the PIWI domain of BdAGO proteins (Figure 5) showed a typical pattern of conservation for the DEDD/H catalytic tetrad required for slicer activity and a conserved QF-V motif in all aligned sequences except BdAGO1, which has the shortest protein sequence of all the BdAGO proteins and lacks the D/H residue of the catalytic tetrad.

Figure 3

Figure 4

Figure 5

Analysis of the predicted domains in BdDCL proteins revealed that the characteristic DEXDc, HELICc, Dicer-dimer (DUF283, Qin et al., 2010), PAZ, RIBOc, and DSRM domains are present in most family members. However, BdDCL2b and BdDCL3b lack the dimerization domain; BdDCL3b additionally lacks both the DEXDc and HELICc domains and instead contains an additional DSRM domain at the N terminus (position: 131-218, E-value: 8.6e-17; Figure 4 and Table S3). By contrast, BdDCL2a and BdDCL2b contain only one DSRM domain (Figure 4, Table S3).

Three-Dimensional Modeling Supports Phylogenetic Data Showing a Strong Expansion in the BdAGOs in the AGO1/5/10-Related Clade

In order to obtain an optimal homology-based 3D model of the studied proteins, we used SWISS-MODEL and CPHmodels 3.2. When choosing between models generated by alternative software programs or based on different templates, validation of the predicted structures is crucial for generating a consensus on the optimal model and further comparison. In case of BdAGOs, validation of the predicted structures was done using four different measurements, the results of which are shown in Table S4. While a 0-1 QMEAN value gives an absolute scoring of the predicted model, the Z-score shown in Table S4 serves as a comparison of the quality of the prediction of the query model relative to expected from a high-resolution X-ray crystallography structure. Typically, the more negative the Z-score is, the lower the quality of the predicted structure. Using PROCHECK, we report on the percentage of residues that fall into the most favored regions of the Ramachandran plot. The free energy score of the conformation of the predicted protein calculated by dDFIRE usually indicates lower values for a better model. Based on validation of the 3D models by the software, SWISS-MODEL was chosen as the preferred modeling tool for BdAGO proteins (Table S4). The corresponding AtAGO 3D structures, predicted and validated in the same fashion (Table S5), were subsequently used alongside the visualization of the BdAGO proteins by PyMOL. Figures 6, 7, and S1 display the models, in which the PAZ, MID, and PIWI domains (where predicted) and residues comprising the DEDD/H catalytic tetrad are indicated for all BdAGOs and a corresponding AtAGO representing the appropriate branch of the phylogenetic tree depicted in Figure 1. Overall, the predicted structures of the BdAGOs mirror the corresponding AtAGO structures, suggesting a functional conservation. The PIWI domain and the catalytic tetrad especially show similarity between the clade members shown together in Figures 6 and 7. The BdDCL proteins did not have successfully modeled structures predicted by either software and thus are not shown.

Figure 6

Figure 7

Expression Analysis and Putative Interactors of BdAGO Proteins

We addressed the question of tissue-specific expression of BdAGO and BdDCL genes by utilizing the B. distachyon eFP Browser (Sibout et al., 2017; Winter et al., 2007) in Table 3. Stronger expression of BdAGO and BdDCL genes was observed in seed and stem tissue compared with roots or leaves. The plant subcellular localization integrative predictor used for protein localization predicted that all BdAGOs reside in the cytosol, except for BdAGO3, BdAGO14 (predicted to localize in the nucleus), and BdAGO7 (predicted to localize in plastids), with varying scores of confidence (Table S7).

Table 3

Gene IDAssigned name1Highest expression signal2Peduncle, spikelet and stem nodes2Root2Leaf2Seed
Bradi2g10360BdAGO1Whole_grain_11_DAF
Bradi2g14147BdAGO2First_node_27_DAG
Bradi2g10370BdAGO3First_node_10_DAG
Bradi4g08587BdAGO4Whole_grain_2_years
Bradi1g12431BdAGO5Not found in browser
Bradi1g05162BdAGO6Endosperm_31_DAF
Bradi1g28260BdAGO7Endosperm_11_DAF
Bradi1g16060BdAGO8First_node_10_DAG
Bradi1g36907BdAGO9First_node_10_DAG
Bradi1g54977BdAGO10Whole_grain_11_DAF
Bradi1g29577BdAGO11Upper_part_of_inclined_node_42_DAG
Bradi5g18540BdAGO12Last_internode_35_DAG
Bradi5g21810BdAGO13Last_node_35_DAG
Bradi5g21800BdAGO14Last_internode_35_DAG
Bradi3g51077BdAGO15Roots_10_DAG
Bradi3g60697BdAGO16Roots_10_DAG
Bradi1g77087BdDCL1Whole_grain_2_years
Bradi1g15440BdDCL2aFirst_internode_27_DAG
Bradi1g21030BdDCL2bWhole_grain_2_years
Bradi3g29287BdDCL3aLower_part_of_inclined_node_42_DAG
Bradi2g23187BdDCL3bFirst_node_10_DAG
Bradi5g15337BdDCL4Whole_grain_2_years

Gene expression data as displayed on the B. distachyon eFP Browser (Winter et al., 2007; Sibout et al., 2017).

1The tissue with the highest absolute expression level per gene ID is indicated (DAF, day after fertilization; DAG, day after germination).

2Summary of relative expression level per gene ID displayed for tissues; color indicates relative expression levels (log2, the control is calculated from all the samples displayed on the particular eFP browser view); dark blue (high for the transcript relative to control) to light blue (low for the transcript relative to control).

Finally, prediction of proteins that interact with BdAGOs was carried out using STRING (Table S6). All predicted BdAGOs were found to be either co-expressed or experimentally shown to interact with three proteins: Bradi1g36340.1, Bradi2g30160.1, and Bradi4g45065.1. BLASTP search of these protein sequences identified them as a 110-kDa U5 small nuclear ribonucleoprotein component CLO (Bradi1g36340.1), a putative GTP-binding/transcription factor (Bradi2g30160.1), and DNA-directed RNA polymerase V subunit 1 or DNA-directed RNA polymerase V subunit 1 (Bradi4g45065.1). In addition to the aforementioned proteins, BdAGO9 (classified in the AtAGO1/5/10 clade) was predicted to interact with seven other proteins, identified as three homeobox proteins knotted-1-like (Bradi1g12677.1, Bradi1g12690.1, Bradi1g57607.1), two GATA transcription factors (Bradi2g14890.1, Bradi2g45750.1), and two putative uncharacterized proteins (Figure S2).

Discussion

In the present work, we investigated the phylogenetic relationships, domain, structure conservation, and predicted redundancy of AGO and DCL proteins in the model grass plant B. distachyon. Our findings imply that BdAGOs and BdDCLs have more copies and possibly greater diversification relative to Arabidopsis. One known example of such diversification in monocotyledonous plants is the rice AGO18, which confers antiviral immunity by sequestration of an miRNA (Wu et al., 2015). Since the presence of domains typical for AGO and DCL protein families serves as a selection criterion for proteins within this uninvestigated grass model species, we discuss phylogenetic relationships and predicted domain occurrence in detail.

Our analyses show that Brachypodium, like other grasses, contains one protein (BdDCL1) whose sequence groups with AtDCL1, one with AtDCL4 (BdDCL4), and two proteins each that group with AtDCL2 and AtDCL3 (Margis et al., 2006). Analysis of their predicted domain structures showed that BdDCL2b and BdDCL3b lack the dicer-dimer (DUF283) domain, known to mediate heterodimerization of AtDCL4 with its protein partners (Qin et al., 2010), but it is partially missing in two other DCLs (AtDCL3 and OsDCL2b, Margis et al., 2006). The second DSRM domain also was not predicted in either of the BdDCL2s. This finding is consistent with the previous discovery that AtDCL2 in Arabidopsis and OsDCL2a and OsDCL2b in rice also contain only one DSRM (Margis et al., 2006). This second DSRM domain has only a weak affinity for dsRNA, but it specifically binds to proteins of the HYPONASTIC LEAVES 1/dsRNA-binding protein family (Hiraguri et al., 2005; Margis et al., 2006). Since the DSRM domains mediate the transfer of the newly generated sRNA to the appropriate AGO protein (Parker et al., 2008), variations in the C-terminal architecture may influence which downstream partners and RNAi pathways are utilized by specific DCLs. The high level of divergence between DCL3a and DCL3b in several monocot species has led to the classification of DCL3b as a distinct type of DCL, termed DCL5. This monocot-specific class of DCLs has been retained for over 60 million years (Margis et al., 2006). It is is responsible for generating 24-nt-phased sRNAs in the male reproductive organs (Song et al., 2012). Interestingly, the predicted domain structure of BdDCL3b differs substantially from that of BdDCL3a, as it lacks both the DEXDc and HELICc domains (alongside the dicer-dimer domain) but contains an additional N-terminal DSRM (Figure 4, Table S3). Since the helicase domains are thought to mediate unwinding of the dsRNA (Zhang et al., 2004), the functionality of BdDCL3b is unclear. Mutations in AtDCL1 that impair helicase activity were previously shown to suppress miRNA accumulation (Liu et al., 2012). However, comparable levels of transcripts for two splice variants of AtDCL2, one of which contains an altered helicase region, were detected throughout the Arabidopsis life cycle (Margis et al., 2006). Additional structural and biochemical analyses are therefore required to assess the role of BdDCL3b.

Phylogenetic analysis of the BdAGO protein family placed members in all three clades defined by Arabidopsis AGOs. Thus, the structural and functional differences of AtAGO proteins appear to be translated to the expanded Brachypodium family. For two of the three clades, the number of BdAGO and AtAGO family members was equivalent. By contrast, the AtAGO1/5/10 clade was highly expanded in Brachypodium, with four members (BdAGO9/11/12/15/16) grouped with AtAGO1. This member of the Arabidopsis family is associated with a range of functions, including processing of dsRNA from transgenes and exogenous sources, and RNAs involved in ckRNAi (Vaucheret et al., 2004; Weiberg et al., 2013). If the corresponding BdAGO members of this clade are found to have similar functions in PTGS-mediated transgene silencing, this information would be highly useful for developing RNAi-based protection strategies for cereal crops. AtAGO10 groups with the same BdAGOs, as expected considering the clade association with AtAGO1. AtAGO5, which is the third member of the AtAGO1/5/10 clade, groups with BdAGO5/6/7/10. By contrast, BdAGO1/2/3/4 were interspersed within the AtAGO4/6/8/9 clade, raising the possibility that these Brachypodium proteins are involved in TGS.

As displayed in our domain visualization (Figure 3), all 16 BdAGOs have a predicted PAZ domain. In AGOs, this domain recognizes the 3’ end of the guide sRNA molecule, made accessible to the hydrophobic pocket of this nucleotide-binding domain by the typical 2’-O-methyl modification of the final sugar (Lingel et al., 2003; Cenik and Zamore, 2011). The MID domain recognizes the 5’ nucleotide of the sRNA, thus giving preference of an AGO protein into which the sRNA will be loaded (Frank et al., 2012). In Arabidopsis, sRNA with a 5’ U are sorted into AtAGO1, while AtAGO2 and AtAGO4 load sRNAs with a 5’ A and AtAGO5 loads sRNAs with a 5’ C (Mi et al., 2008). Our SMART/Pfam domain architecture search failed to identify a MID domain in any of the BdAGOs grouped in the AtAGO4/6/8/9 clade (BdAGO1/2/3/4) and with the AtAGO2/3 (BdAGO13/14) (Table S2), although this domain was reported in these proteins in a different study (Mirzaei et al., 2014). The specificity of sRNA sorting into particular AGOs can be further determined by the recognition of the sRNA secondary structure/base pairing by a QF-V motif present in the PIWI domain (Zhang et al., 2014). All Arabidopsis AGOs have the conserved QF-V motif, as do all 16 BdAGOs (Figure 5). The DEDD/H catalytic tetrad in the PIWI domain is also present in all but one of the BdAGOs. These active-site residues are critical for the RNase H-like endonuclease (slicer) activity exhibited by certain AGOs, which mediates sequence-specific cleavage of the target transcript. AtAGO1, AtAGO2, AtAGO4, AtAGO7, and AtAGO10 have been shown to have endonucleolytic activity toward target RNAs (Fang and Qi, 2016). Originally identified as a catalytic triad consisting of the residues DDH in most AtAGOs, but DDD in AtAGO2 and AtAGO3 (Höck and Meister, 2008), studies of yeast AGO revealed the importance of an invariant glutamate (E) residue, creating a catalytic tetrad (Nakanishi et al., 2012). This E residue is conserved in all Arabidopsis AGOs (Zhang et al., 2014). Consistent with these findings, MSA visualization of the BdAGO PIWI domains indicated that the majority of BdAGO proteins have the DEDH tetrad, except BdAGO13 and BdAGO14, which like their closest homologues AtAGO2 and AtAGO3, contain the DEDD tetrad (Figure 5). The only exception is BdAGO1, which is a short protein that terminates after 624 residues and lacks the last catalytic residue of the tetrad. Without the conserved catalytic residues, a specific AGO protein might induce gene silencing through means other than cutting, but Höck and Meister (2008) also discuss that the presence of a conserved catalytic triad does not mean the protein indeed has endonuclease activity. If an AGO does not display endonuclease activity, it may mediate PTGS via translation inhibition of the target RNA (Carthew and Sontheimer, 2009). Interestingly, the L1 and L2 linkers are predicted in all 16 BdAGOs as well (Table S2).

3D structure visualizations of all BdAGOs (Figures 6, 7 and S1) reinforce the conservation of the PAZ, PIWI, and MID domains (when predicted by SMART) and the catalytic tetrad residues in proximity within the PIWI domain (magenta spheres). The differences in the folding and looping linker regions within a certain group, relative to Arabidopsis AGOs, are shown in the model visualizations. Furthermore, the similarity between the 3D structures of BdAGOs that were predicted either to contain or lack a MID domain by the SMART/Pfam domain architecture search (e.g., Figure 6) reinforces the importance of comparing entire 3D models in order to gain insight into structure/function conservation. These structures are based on templates with better-known functional specificity and thus hint at the functions of the orthologs in Bd. As shown in Table S4, the templates used for modeling are based on either Argonaute 1 or Argonaute 2, with varying coverage and confidence values.

To assess the expression levels and locations of BdAGO and BdDCL family members, we analyzed the microarray-based expression data in the B. distachyon eFP browser (Table 3). Expression of BdAGO genes was observed in all four tissues assayed, although to varying extents. The expression patterns across the gene families indicate potential for functional redundancy. Notably, all members of the AtAGO1 clade (BdAGO9/11/12/15/16) show high and intermediate levels of expression in stem nodes and root tissue, respectively, while the BdAGO1/2/3/4 proteins generally display high expression in stem nodes and seeds. Analysis of BdDCL gene expression revealed that most members of this family are highly expressed in stem nodes and/or seeds. In vivo experimental approaches are necessary to decipher whether the apparent co-expression of these genes indicates specific compartmentalization or complete/partial redundancy in the various RNAi processes, including environmental RNAi and ckRNAi pathways.

Finally, we used STRING to predict the interactome for members of the BdAGO family. This analysis indicates that all BdAGOs interact with three proteins (Table S6), as was expected because of the domain conservation within the family. In addition, several potential interactors were identified for BdAGO9, based on co-expression or experimental data (Figure S2, Table S6). Of these, DNA-directed RNA polymerase V subunit 1 was previously shown to co-localize with, and possibly directly bind to, AtAGO4 via a so-called “Ago hook” (GW-rich domain), in order to facilitate the recruitment of AGO4 to chromatin to mediate TGS (El-Shami et al., 2007; Fang and Qi, 2016). Poulsen et al. (2013) have discussed that the binding of GW interactors to AGO make the loop with the E residue of the catalytic tetrad unavailable to the otherwise rigid DDD/H triad within the PIWI domain, thus offering an explanation of how the slicing activity is prevented in cases of silencing by translational inhibition. Moreover, GW containing proteins Needed for RDR2-independent DNA methylation and Silencing Defective 3 have been indicated in pathways bringing DNA/chromatin silencing together with RNAi proteins (Garcia et al., 2012; Pontier et al., 2012). Protein co-expression and interaction studies in vivo are necessary to confirm the identity and locations of these putative BdAGO interacting proteins. Due to the stringency of the prediction (excluding text mining data) and the lack of knowledge about the Brachypodium RNAi machinery, we were unable to predict additional interactions or to detect RNAi-related proteins that are known to interact with members of the AGO family in Arabidopsis. These include DCLs, HEN1 (involved in the methylation of sRNA 3’ ends to prevent degradation), RDRs (RNA-dependent RNA polymerases that synthesize dsRNAs from single-stranded RNAs), and HSP90, the heat shock protein that binds to AtAGO1 and AtAGO4 to aid the loading of the sRNAs and RISC assembly (Fang and Qi, 2016; Nakanishi, 2016). Moreover, the predicted localization of the BdAGOs places the majority of them in the cytosol, except for BdAGO3 and BdAGO14, which are predicted to localize in the nucleus (Table S7). From what is known about Arabidopsis AGOs, AtAGO1 is proposed to have a localization in the nucleus and the cytoplasm (Vaucheret, 2008), while AtAGO4 localizes to the nuclear Cajal bodies (Höck and Meister, 2008).

In sum, based on in silico prediction, our data provide the first detailed functional insight into the AGO and DCL protein families in Brachypodium. In the context of plant–microbe interactions and ckRNAi, the Brachypodium orthologs of AtAGO1 are of special interest because microbial sRNAs are shown to be loaded onto AtAGO1 (Weiberg et al., 2013). Our predictions indicate a clade of BdAGOs structurally similar to AtAGO1, consisting of BdAGO9/11/12/15/16 (Figure 7). Elaborating on such similarities with the well-established clades of Arabidopsis AGOs and DCLs is a valuable basis for testing the hypothesis that BdAGO9/11/12/15/16 proteins are required for exogenous and endogenous dsRNA processing in HIGS, SIGS, and bidirectional ckRNAi in the grass model. Beyond what is predictable by in silico analysis, more data on expression patterns and interacting proteins are needed to further understand the role of these pillar proteins of RNAi pathways in cereals.

Funding

This work was funded by the Deutsche Forschungsgemeinschaft in the program GRK2355 to KHK and by of the European Union in the Marie Skłodowska-Curie Actions CEREALPATH to KHK and SZ.

Statements

Data availability statement

All datasets for this study are included in the article/Supplementary Files.

Author contributions

EŠ, SZ, and KHK wrote the text. EŠ and SZ performed analysis and designed the figures.

Acknowledgments

We thank Sebastien Santini—CNRS/AMU IGS UMR7256, for administration of Phylogeny.fr, used for phylogenetic analysis within this study. We are very thankful to D’Maris Dempsey for her assistance in text editing. This manuscript summarizes EŠ’s and KHK’s contribution during the Organisation for Economic Cooperation and Development Conference on RNAi-Based Pesticides, which was sponsored by the Organisation for Economic Cooperation and Development Co-operative Research Programme: Biological Resource Management for Sustainable Agricultural Systems whose financial support made it possible for the author KHK to participate in the conference.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpls.2019.01332/full#supplementary-material

References

  • 1

    AbdellatefE.WillT.KochA.ImaniJ.VilcinskasA.KogelK.-H. (2015). Silencing the expression of the salivary sheath protein causes transgenerational feeding suppression in the aphid Sitobion avenae. Plant Biotechnol. J.13 (6), 849857. doi: 10.1111/pbi.12322

  • 2

    AndradeE. C.HunterW. B. (2016). “RNA interference–natural gene-based technology for highly specific pest control (HiSPeC)”. RNA interference (Ch.19). Ed. AbdurakhmonovI. Y. (IntechOpen), 391409. doi: 10.5772/61612

  • 3

    AnisimovaM.GascuelO. (2006). Approximate likelihood-ratio test for branches: a fast, accurate, and powerful alternative. Syst. Biol.55 (4), 539552. doi: 10.1080/10635150600755453

  • 4

    BartelD. P. (2004). MicroRNAs: genomics, biogenesis, mechanism, and function. Cell116 (2), 281297. doi: 10.1016/S0092-8674(04)00045-5

  • 5

    BenkertP.BiasiniM.SchwedeT. (2011). Toward the estimation of the absolute quality of individual protein structure models. Bioinformatics27 (3), 343350. doi: 10.1093/bioinformatics/btq662

  • 6

    BerardiniT. Z.ReiserL.LiD.MezheritskyY.MullerR.StraitE.et al. (2015). The aArabidopsis information resource: Making and mining the “gold standard” annotated reference plant genome. Genesis53 (8), 474485. doi: 10.1002/dvg.22877

  • 7

    BermanH.GillilandG. M.WeissigH.ShindyalovI. N.WestbrookJ.BourneP. E.et al. (2000). The protein data bank. Nucleic Acids Res.28 (1), 235242. doi: 10.1093/nar/28.1.235

  • 8

    BohmertK.CamusI.BelliniC.BouchezD.CabocheM.BenningC. (1998). AGO1 defines a novel locus of Arabidopsis controlling leaf development. EMBO J.17 (1), 170180. doi: 10.1093/emboj/17.1.170

  • 9

    BolandA.HuntzingerE.SchmidtS.IzaurraldeE.WeichenriederO. (2011). Crystal structure of the MID-PIWI lobe of a eukaryotic Argonaute protein. Proc. Natl. Acad. Sci.108 (26), 10466 LP–10471. doi: 10.1073/pnas.1103946108

  • 10

    BolognaN. G.VoinnetO. (2014). The diversity, biogenesis, and activities of endogenous silencing small RNAs in Arabidopsis. Annu. Rev. Plant Biol.65, 473503. doi: 10.1146/annurev-arplant-050213-035728

  • 11

    BorgesF.MartienssenR. A. (2015). The expanding world of small RNAs in plants. Nat. Rev. Mol. Cell Biol.16 (12), 727741. doi: 10.1038/nrm4085

  • 12

    BorsaniO.ZhuJ.VersluesP. E.SunkarR.ZhuJ.-K. (2005). Endogenous siRNAs derived from a pair of natural cis-Antisense transcripts regulate salt tolerance in arabidopsis. Cell123 (7), 12791291. doi: 10.1016/j.cell.2005.11.035

  • 13

    BrownN. P.LeroyC.SanderC. (1998). MView: a web-compatible database search or multiple alignment viewer. Bioinformatics14 (4), 380381. doi: 10.1093/bioinformatics/14.4.380

  • 14

    BurleyS. K.YangH.TanL.SalaR.HudsonB. P.BhikadiyaC.et al. (2018). RCSB Protein Data Bank: biological macromolecular structures enabling research and education in fundamental biology, biomedicine, biotechnology and energy. Nucleic Acids Res.47 (D1), D464D474. doi: 10.1093/nar/gky1004

  • 15

    CaiQ.HeB.KogelK.-H.JinH. (2018). Cross-kingdom RNA trafficking and environmental RNAi — nature’s blueprint for modern crop protection strategies. Curr. Opin. Microbiol.46, 5864. doi: 10.1016/j.mib.2018.02.003

  • 16

    CamachoC.CoulourisG.AvagyanV.MaN.PapadopoulosJ.BealerK.et al. (2009). BLAST+: architecture and applications. BMC Bioinformatics10 (1), 421. doi: 10.1186/1471-2105-10-421

  • 17

    CarthewR. W.SontheimerE. J. (2009). Origins and mechanisms of miRNAs and siRNAs. Cell136 (4), 642655. doi: 10.1016/j.cell.2009.01.035

  • 18

    CastresanaJ. (2000). Selection of conserved blocks from multiple alignments for their use in phylogenetic analysis. Mol. Biol. Evol.17 (4), 540552. doi: 10.1093/oxfordjournals.molbev.a026334

  • 19

    CenikE. S.ZamoreP. D. (2011). Argonaute proteins. Curr. Biol.21 (12), R446R449. doi: 10.1016/j.cub.2011.05.020

  • 20

    ChevenetF.BrunC.BañulsA.-L.JacqB.ChristenR. (2006). TreeDyn: towards dynamic graphics and annotations for analyses of trees. BMC Bioinformatics7, 439. doi: 10.1186/1471-2105-7-439

  • 21

    ColemanA. D.WoutersR. H. M.MugfordS. T.HogenhoutS. A. (2014). Persistence and transgenerational effect of plant-mediated RNAi in aphids. J. Exp. Bot.66 (2), 541548. doi: 10.1093/jxb/eru450

  • 22

    DereeperA.AudicS.ClaverieJ.-M.BlancG. (2010). BLAST-EXPLORER helps you building datasets for phylogenetic analysis. BMC Evol. Biol.10, 8. doi: 10.1186/1471-2148-10-8

  • 23

    DereeperA.GuignonV.BlancG.AudicS.BuffetS.ChevenetF.et al. (2008). Phylogeny.fr: robust phylogenetic analysis for the non-specialist. Nucleic Acids Res.36 (Web Server issue), W465W469. doi: 10.1093/nar/gkn180

  • 24

    DunoyerP.HimberC.VoinnetO. (2005). DICER-LIKE 4 is required for RNA interference and produces the 21-nucleotide small interfering RNA component of the plant cell-to-cell silencing signal. Nat. Genet.37, 1356. doi: 10.1038/ng1675

  • 25

    DuttaT. K.PapoluP. K.BanakarP.ChoudharyD.SirohiA.RaoU. (2015). Tomato transgenic plants expressing hairpin construct of a nematode protease gene conferred enhanced resistance to root-knot nematodes. Front. Microbiol.6, 260. doi: 10.3389/fmicb.2015.00260

  • 26

    EdgarR. C. (2004). MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res.32 (5), 17921797. doi: 10.1093/nar/gkh340

  • 27

    ElkayamE.KuhnC.-D.TociljA.HaaseA. D.GreeneE. M.HannonG. J.et al. (2012). The structure of human argonaute-2 in Complex with miR-20a. Cell150 (1), 100110. doi: 10.1016/j.cell.2012.05.017

  • 28

    El-ShamiM.PontierD.LahmyS.BraunL.PicartC.VegaD.et al. (2007). Reiterated WG/GW motifs form functionally and evolutionarily conserved ARGONAUTE-binding platforms in RNAi-related components. Genes Dev.21 (20), 25392544. doi: 10.1101/gad.451207

  • 29

    FaehnleC. R.ElkayamE.HaaseA. D.HannonG. J.Joshua-TorL. (2013). The making of a slicer: activation of human Argonaute-1. Cell Rep.3 (6), 19011909. doi: 10.1016/j.celrep.2013.05.033

  • 30

    FangX.QiY. (2016). RNAi in Plants: an argonaute-centered view. Plant Cell.28 (2), 272 LP–272285. doi: 10.1105/tpc.15.00920

  • 31

    FireA.XuS.MontgomeryM. K.KostasS. A.DriverS. E.MelloC. C. (1998). Potent and specific genetic interference by double-stranded RNA in Caenorhabditis elegans. Nature391 (6669), 806811. doi: 10.1038/35888

  • 32

    FrankF.HauverJ.SonenbergN.NagarB. (2012). Arabidopsis Argonaute MID domains use their nucleotide specificity loop to sort small RNAs. EMBO J.31 (17), 3588 LP–3595. doi: 10.1038/emboj.2012.204

  • 33

    FukudomeA.FukuharaT. (2017). Plant dicer-like proteins: double-stranded RNA-cleaving enzymes for small RNA biogenesis. J. Plant Res.130 (1), 3344. doi: 10.1007/s10265-016-0877-1

  • 34

    GarciaD.GarciaS.PontierD.MarchaisA.RenouJ. P.LagrangeT.et al. (2012). Ago hook and RNA helicase motifs underpin dual roles for SDE3 in antiviral defense and silencing of nonconserved intergenic regions. Mol. Cell48 (1), 109120. doi: 10.1016/j.molcel.2012.07.028

  • 35

    GasciolliV.MalloryA. C.BartelD. P.VaucheretH. (2005). Partially redundant functions of Arabidopsis DICER-like enzymes and a role for DCL4 in producing trans-acting siRNAs. Curr. Biol.15 (16), 14941500. doi: 10.1016/j.cub.2005.07.024

  • 36

    GoodsteinD. M.ShuS.HowsonR.NeupaneR.HayesR. D.FazoJ.et al. (2012). Phytozome: a comparative platform for green plant genomics. Nucleic Acids Res.40 (Database issue), D1178D1186. doi: 10.1093/nar/gkr944

  • 37

    GoujonM.McWilliamH.LiW.ValentinF.SquizzatoS.PaernJ.et al. (2010). A new bioinformatics analysis tools framework at EMBL–EBI. Nucleic Acids Res.38 (suppl_2), W695W699. doi: 10.1093/nar/gkq313

  • 38

    GuindonS.GascuelO. (2003). A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst. Biol.52, 696704. doi: 10.1080/10635150390235520

  • 39

    HarveyJ. J. W.LewseyM. G.PatelK.WestwoodJ.HeimstädtS.CarrJ. P.et al. (2011). An antiviral defense role of AGO2 in plants. PLoS One6 (1), e14639. doi: 10.1371/journal.pone.0014639

  • 40

    HeadG. P.CarrollM. W.EvansS. P.RuleD. M.WillseA. R.ClarkT. L.et al. (2017). Evaluation of SmartStax and SmartStax PRO maize against western corn rootworm and northern corn rootworm: efficacy and resistance management. Pest Manag. Sci.73 (9), 18831899. doi: 10.1002/ps.4554

  • 41

    HiraguriA.ItohR.KondoN.NomuraY.AizawaD.MuraiY.et al. (2005). Specific interactions between Dicer-like proteins and HYL1/DRB-family dsRNA-binding proteins in Arabidopsis thaliana. Plant Mol. Biol.57 (2), 173188. doi: 10.1007/s11103-004-6853-5

  • 42

    HöckJ.MeisterG. (2008). The Argonaute protein family. Genome Biol.9 (2), 210. doi: 10.1186/gb-2008-9-2-210

  • 43

    HooftR. W. W.SanderC.VriendG. (1997). Objectively judging the quality of a protein structure from a Ramachandran plot. Bioinformatics13 (4), 425430. doi: 10.1093/bioinformatics/13.4.425

  • 44

    HooftR. W. W.VriendG.SanderC.AbolaE. E. (1996). Errors in protein structures. Nature381 (6580), 272. doi: 10.1038/381272a0

  • 45

    KochA.BiedenkopfD.FurchA.WeberL.RossbachO.AbdellatefE.et al. (2016). An RNAi-based control of fusarium graminearum infections through spraying of long dsRNAs Involves a plant passage and is controlled by the fungal silencing machinery. PLOS Pathog.12 (10), e1005901. doi: 10.1371/journal.ppat.1005901

  • 46

    KochA.KumarN.WeberL.KellerH.ImaniJ.KogelK.-H. (2013). Host-induced gene silencing of cytochrome P450 lanosterol C14α-demethylase–encoding genes confers strong resistance to Fusarium species. Proc. Natl. Acad. Sci.110 (4), 19324 LP–19329. doi: 10.1073/pnas.1306373110

  • 47

    KochA.SteinE.KogelK.-H. (2018). RNA-based disease control as a complementary measure to fight Fusarium fungi through silencing of the azole target Cytochrome P450 Lanosterol C-14 α-Demethylase. Eur. J. Plant Pathol.152 (4), 10031010. doi: 10.1007/s10658-018-1518-4

  • 48

    LaskowskiR.MacarthurM. W.MossD. S.ThorntonJ. (1993). PROCHECK:A program to check the stereochemical quality of protein structures. J. Appl. Crystallogr.26, 283291. doi: 10.1107/S0021889892009944

  • 49

    LetunicI.BorkP. (2017). 20 years of the SMART protein domain annotation resource. Nucleic Acids Res.46 (D1), D493D496. doi: 10.1093/nar/gkx922

  • 50

    LingelA.SimonB.IzaurraldeE.SattlerM. (2003). Structure and nucleic-acid binding of the Drosophila Argonaute 2 PAZ domain. Nature426 (6965), 465469. doi: 10.1038/nature02123

  • 51

    LingelA.SimonB.IzaurraldeE.SattlerM. (2004). Nucleic acid 3ʹ-end recognition by the Argonaute2 PAZ domain. Nat. Struct. Mol. Biol.11, 576. doi: 10.1038/nsmb777

  • 52

    LiuC.AxtellM. J.FedoroffN. V. (2012). The helicase and RNaseIIIa domains of Arabidopsis Dicer-Like1 modulate catalytic parameters during microRNA biogenesis. Plant Physiol.159 (2), 748758. doi: 10.1104/pp.112.193508

  • 53

    LiuL.ZhangZ.MeiQ.ChenM. (2013). PSI: a comprehensive and integrative approach for accurate plant subcellular localization prediction. PLOS ONE8 (10), e75826. doi: 10.1371/journal.pone.0075826

  • 54

    LiuW.XieY.MaJ.LuoX.NieP.ZuoZ.et al. (2015). IBS: an illustrator for the presentation and visualization of biological sequences. Bioinformatics.31 (20), 33593361. doi: 10.1093/bioinformatics/btv362

  • 55

    LiuY.EsyuninaD.OlovnikovI.TeplovaM.KulbachinskiyA.AravinA. A.et al. (2018). Accommodation of helical imperfections in rhodobacter sphaeroides argonaute ternary complexes with guide RNA and target DNA. Cell Rep.24 (2), 453462. doi: 10.1016/j.celrep.2018.06.021

  • 56

    MargisR.FusaroA. F.SmithN. A.CurtinS. J.WatsonJ. M.FinneganE. J.et al. (2006). The evolution and diversification of Dicers in plants. FEBS Lett.580 (10), 24422450. doi: 10.1016/j.febslet.2006.03.072

  • 57

    MiS.CaiT.HuY.ChenY.HodgesE.NiF.et al. (2008). Sorting of Small RNAs into arabidopsis argonaute complexes is directed by the 5’ terminal nucleotide. Cell133 (1), 116127. doi: 10.1016/j.cell.2008.02.034

  • 58

    MirzaeiK.BahramnejadB.ShamsifardM. H.ZamaniW. (2014). In silico identification, phylogenetic and bioinformatic analysis of argonaute genes in plants. Int. J. Genomics.2014, 967461. doi: 10.1155/2014/967461

  • 59

    MorrisA. L.MacArthurM. W.HutchinsonE. G.ThorntonJ. M. (1992). Stereochemical quality of protein structure coordinates. Proteins Struct. Funct. Bioinf.12 (4), 345364. doi: 10.1002/prot.340120407

  • 60

    MukherjeeK.CamposH.KolaczkowskiB. (2013). Evolution of animal and plant dicers: early parallel duplications and recurrent adaptation of antiviral RNA binding in plants. Mol. Biol. Evol.30 (3), 627641. doi: 10.1093/molbev/mss263

  • 61

    NakanishiK. (2016). Anatomy of RISC: how do small RNAs and chaperones activate Argonaute proteins? Wiley Interdiscip. Rev. RNA.7 (5), 637660. doi: 10.1002/wrna.1356

  • 62

    NakanishiK.WeinbergD. E.BartelD. P.PatelD. J. (2012). Structure of yeast Argonaute with guide RNA. Nature486, 368. doi: 10.1038/nature11211

  • 63

    NielsenM.LundegaardC.LundO.PetersenT. N. (2010). CPHmodels-3.0–remote homology modeling using structure-guided sequence profiles. Nucleic Acids Res.38 (Web Server issue), W576W581. doi: 10.1093/nar/gkq535

  • 64

    NowaraD.GayA.LacommeC.ShawJ.RidoutC.DouchkovD.et al. (2010). HIGS: Host-Induced Gene Silencing in the obligate biotrophic fungal pathogen blumeria graminis. Plant Cell.22 (9), 3130 LP–3141. doi: 10.1105/tpc.110.077040

  • 65

    Olmedo-MonfilV.Durán-FigueroaN.Arteaga-VázquezM.Demesa-ArévaloE.AutranD.GrimanelliD.et al. (2010). Control of female gamete formation by a small RNA pathway in Arabidopsis. Nature464, 628. doi: 10.1038/nature08828

  • 66

    ParkM. S.PhanH.-D.BuschF.HinckleyS. H.BrackbillJ. A.WysockiV. H.et al. (2017). Human Argonaute3 has slicer activity. Nucleic Acids Res.45 (20), 1186711877. doi: 10.1093/nar/gkx916

  • 67

    ParkerG. S.MaityT. S.BassB. L. (2008). dsRNA binding properties of RDE-4 and TRBP reflect their distinct roles in RNAi. J. Mol. Biol.384 (4), 967979. doi: 10.1016/j.jmb.2008.10.002

  • 68

    PatelP.MathioniS.KakranaA.ShatkayH.MeyersB. (2018). Reproductive phasiRNAs in grasses are compositionally distinct from other classes of small RNAs. New Phytol.220, 851864. doi: 10.1111/nph.15349

  • 69

    PontierD.PicartC.RoudierF.GarciaD.LahmyS.AzevedoJ.et al. (2012). NERD, a Plant-Specific GW Protein, Defines an Additional RNAi-Dependent Chromatin-Based Pathway in Arabidopsis. Mol. Cell48 (1), 121132. doi: 10.1016/j.molcel.2012.07.027

  • 70

    PoulsenC.VaucheretH.BrodersenP. (2013). Lessons on RNA silencing mechanisms in plants from eukaryotic argonaute structures. Plant Cell.25 (1), 2237. doi: 10.1105/tpc.112.105643

  • 71

    QinH.ChenF.HuanX.MachidaS.SongJ.YuanY. A. (2010). Structure of the Arabidopsis thaliana DCL4 DUF283 domain reveals a noncanonical double-stranded RNA-binding fold for protein–protein interaction. RNA16 (3), 474481. doi: 10.1261/rna.1965310

  • 72

    QuF.YeX.MorrisT. J. (2008). Arabidopsis DRB4, AGO1, AGO7, and RDR6 participate in a DCL4-initiated antiviral RNA silencing pathway negatively regulated by DCL1. Proc. Natl. Acad. Sci. U. S. A.105 (38), 1473214737. doi: 10.1073/pnas.0805760105

  • 73

    RamachandranG. N.RamakrishnanC.SasisekharanV. (1963). Stereochemistry of polypeptide chain configurations. J. Mol. Biol.7 (1), 9599. doi: 10.1016/S0022-2836(63)80023-6

  • 74

    RemmertM.BiegertA.HauserA.SödingJ. (2011). HHblits: lightning-fast iterative protein sequence searching by HMM-HMM alignment. Nat. Methods9, 173. doi: 10.1038/nmeth.1818

  • 75

    RheeS. Y.BeavisW.BerardiniT. Z.ChenG.DixonD.DoyleA.et al. (2003). The Arabidopsis Information Resource (TAIR): a model organism database providing a centralized, curated gateway to Arabidopsis biology, research materials and community. Nucleic Acids Res.31 (1), 224228. doi: 10.1093/nar/gkg076

  • 76

    RiceP.LongdenI.BleasbyA. (2000). EMBOSS: the european molecular biology open software suite. Trends Genet.16 (6), 276277. doi: 10.1016/S0168-9525(00)02024-2

  • 77

    SavaryS.FickeA.AubertotJ.-N.HollierC. (2012). Crop losses due to diseases and their implications for global food production losses and food security. Food Secur.4, 519537. doi: 10.1007/s12571-012-0200-5

  • 78

    SchauerS. E.JacobsenS. E.MeinkeD. W.RayA. (2002). DICER-LIKE1: blind men and elephants in Arabidopsis development. Trends Plant Sci.7 (11), 487491. doi: 10.1016/S1360-1385(02)02355-5

  • 79

    SchrödingerL. L. C. (2010). The PyMOL Molecular Graphics System, Version∼1.3.

  • 80

    SchultzJ.MilpetzF.BorkP.PontingC. P. (1998). SMART, a simple modular architecture research tool: identification of signaling domains. Proc. Natl. Acad. Sci.95 (11), 5857 LP–5864. doi: 10.1073/pnas.95.11.5857

  • 81

    SecicE.SutkovicJ.Abdel GawwadM. (2015). Interactome analysis and docking sites prediction of radiation sensitive 23 (RAD 23) proteins in Arabidopsis Thalianathaliana. Curr. Proteomics12 (1), 2844. doi: 10.2174/1570164612666150225234240

  • 82

    SiboutR.ProostS.HansenB. O.VaidN.GiorgiF. M.Yue-KuangS.et al. (2017). Expression atlas and comparative coexpression network analyses reveal important genes involved in the formation of lignified cell wall in Brachypodium distachyon. New Phytol215 (3), 10091025. doi: 10.1111/nph.14635

  • 83

    SieversF.WilmA.DineenD.GibsonT. J.KarplusK.LiW.et al. (2011). Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega. Mol. Syst. Biol.7 (1), 539. doi: 10.1038/msb.2011.75

  • 84

    SongM. S.RossiJ. J. (2017). Molecular mechanisms of Dicer: endonuclease and enzymatic activity. Biochem. J.474, 10) 16031618. doi: 10.1042/BCJ20160759

  • 85

    SongX.LiP.ZhaiJ.ZhouM.MaL.LiuB.et al. (2012). Roles of DCL4 and DCL3b in rice phased small RNA biogenesis. Plant J.69 (3), 462474. doi: 10.1111/j.1365-313X.2011.04805.x

  • 86

    SzklarczykD.GableA. L.LyonD.JungeA.WyderS.Huerta-CepasJ.et al. (2019). STRING v11: protein-protein association networks with increased coverage, supporting functional discovery in genome-wide experimental datasets. Nucleic Acids Res.47 (D1), D607D613. doi: 10.1093/nar/gky1131

  • 87

    The International Brachypodium Initiative (2010). Genome sequencing and analysis of the model grass Brachypodium distachyon. Nature463, 763768. doi: 10.1038/nature08747

  • 88

    TomilovA. A.TomilovaN. B.WroblewskiT.MichelmoreR.YoderJ. I. (2008). Trans-specific gene silencing between host and parasitic plants. Plant J.56 (3), 389397. doi: 10.1111/j.1365-313X.2008.03613.x

  • 89

    VaucheretH. (2008). Plant ARGONAUTES. Trends Plant Sci.13 (7), 350358. doi: 10.1016/j.tplants.2008.04.007

  • 90

    VaucheretH.VazquezF.CrétéP.BartelD. P. (2004). The action of ARGONAUTE1 in the miRNA pathway and its regulation by the miRNA pathway are crucial for plant development. Genes Dev.18 (10), 11871197. doi: 10.1101/gad.1201404

  • 91

    VogelJ. P.GarvinD. F.LeongM. O.HaydenD. M. (2006).Agrobacterium-mediated transformation and inbred line development in the model grass Brachypodium distachyon. Plant Cell Tissue Organ Culture84, 199211. doi: 10.1007/s11240-005-9023-9

  • 92

    WangB.SunY.SongN.ZhaoM.LiuR.FengH.et al. (2017b). Puccinia striiformis f. sp. tritici microRNA-like RNA 1 (Pst-milR1), an important pathogenicity factor of Pst, impairs wheat resistance to Pst by suppressing the wheat pathogenesis-related 2 gene. New Phytol.215 (1), 338350. doi: 10.1111/nph.14577

  • 93

    WangH.-L. V.DinwiddieB. L.LeeH.ChekanovaJ. A. (2015). Stress-induced endogenous siRNAs targeting regulatory intron sequences in Brachypodium. RNA (New York, N.Y.)21 (2), 145163. doi: 10.1261/rna.047662.114

  • 94

    WangM.WeibergA.DellotaE.YamaneD.JinH. (2017a). Botrytis small RNA Bc-siR37 suppresses plant defense genes by cross-kingdom RNAi. RNA Biol.14 (4), 421428. doi: 10.1080/15476286.2017.1291112

  • 95

    WangM.WeibergA.LinF.-M.ThommaB. P. H. J.HuangH.-D.JinH. (2016). Bidirectional cross-kingdom RNAi and fungal uptake of external RNAs confer plant protection. Nat. Plants2, 16151. doi: 10.1038/nplants.2016.151

  • 96

    WangY.JuranekS.LiH.ShengG.WardleG. S.TuschlT.et al. (2009). Nucleation, propagation and cleavage of target RNAs in Ago silencing complexes. Nature461, 754. doi: 10.1038/nature08434

  • 97

    WaterhouseA.BertoniM.BienertS.StuderG.TaurielloG.GumiennyR.et al. (2018). SWISS-MODEL: homology modelling of protein structures and complexes. Nucleic Acids Res.46 (W1), W296W303. doi: 10.1093/nar/gky427

  • 98

    WeibergA.WangM.LinF.-M.ZhaoH.ZhangZ.KaloshianI.et al. (2013). Fungal small RNAs suppress plant immunity by hijacking host RNA interference pathways. Science342 (6154), 118 LP–118123. doi: 10.1126/science.1239705

  • 99

    WillkommS.ZanderA.GustA.GrohmannD. (2015). A prokaryotic twist on argonaute function. Life (Basel, Switzerland)5 (1), 538553. doi: 10.3390/life5010538

  • 100

    WilsonR. C.DoudnaJ. A. (2013). Molecular mechanisms of RNA interference. Annu. Rev. Biophys.42, 217239. doi: 10.1146/annurev-biophys-083012-130404

  • 101

    WinterD.VinegarB.NahalH.AmmarR.WilsonG. V.ProvartN. J. (2007). An “Electronic Fluorescent Pictograph” browser for exploring and analyzing large-scale biological data sets. PLoS One2 (8), e718. doi: 10.1371/journal.pone.0000718

  • 102

    WuJ.YangZ.WangY.ZhengL.YeR.JiY.et al. (2015). Viral-inducible Argonaute18 confers broad-spectrum virus resistance in rice by sequestering a host microRNA. Elife4, e05733. doi: 10.7554/eLife.05733

  • 103

    XieZ.JohansenL. K.GustafsonA. M.KasschauK. D.LellisA. D.ZilbermanD.et al. (2004). Genetic and functional diversification of small RNA pathways in plants. PLOS Biol.2 (5), e104. doi: 10.1371/journal.pbio.0020104

  • 104

    YangY.ZhouY. (2008). Specific interactions for ab initio folding of protein terminal regions with secondary structures. Proteins Struct. Funct. Bioinf.72 (2), 793803. doi: 10.1002/prot.21968

  • 105

    YouC.CuiJ.WangH.QiX.KuoL.-Y.MaH.et al. (2017). Conservation and divergence of small RNA pathways and microRNAs in land plants. Genome Biol.18, 158. doi: 10.1186/s13059-017-1291-2

  • 106

    ZhaX.XiaQ.Adam YuanY. (2012). Structural insights into small RNA sorting and mRNA target binding by Arabidopsis argonaute mid domains. FEBS Lett.586 (19), 32003207. doi: 10.1016/j.febslet.2012.06.038

  • 107

    ZhangH.KolbF. A.JaskiewiczL.WesthofE.FilipowiczW. (2004). Single processing center models for human dicer and bacterial RNase III. Cell118 (1), 5768. doi: 10.1016/j.cell.2004.06.017

  • 108

    ZhangH.XiaR.MeyersB. C.WalbotV. (2015). Evolution, functions, and mysteries of plant ARGONAUTE proteins. Curr. Opi. Plant Biol.27, 8490. doi: 10.1016/j.pbi.2015.06.011

  • 109

    ZhangT.ZhaoY.-L.ZhaoJ.-H.WangS.JinY.ChenZ.-Q.et al. (2016). Cotton plants export microRNAs to inhibit virulence gene expression in a fungal pathogen. Nat. Plants2, 16153. doi: 10.1038/nplants.2016.153

  • 110

    ZhangX.NiuD.CarbonellA.WangA.LeeA.TunV.et al. (2014). ARGONAUTE PIWI domain and microRNA duplex structure regulate small RNA sorting in Arabidopsis. Nat. Commun.5, 5468. doi: 10.1038/ncomms6468

  • 111

    ZhengX.ZhuJ.KapoorA.ZhuJ. (2007). Role of Arabidopsis AGO6 in siRNA accumulation, DNA methylation and transcriptional gene silencing. EMBO J.26 (6), 1691 LP–1701. doi: 10.1038/sj.emboj.7601603

  • 112

    ZhuH.HuF.WangR.ZhouX.SzeS.-H.LiouL. W.et al. (2011). Arabidopsis Argonaute10 specifically sequesters miR166/165 to regulate shoot apical meristem development. Cell145 (2), 242256. doi: 10.1016/j.cell.2011.03.024

  • 113

    ZilbermanD.CaoX.JacobsenS. E. (2003). ARGONAUTE4 control of locus-specific siRNA accumulation and DNA and histone methylation. Science2995607, 716 LP–716719. doi: 10.1126/science.1079695

Summary

Keywords

protein, structure, prediction, RNAi, Argonaute, Dicer, Brachypodium

Citation

Šečić E, Zanini S and Kogel K-H (2019) Further Elucidation of the Argonaute and Dicer Protein Families in the Model Grass Species Brachypodium distachyon. Front. Plant Sci. 10:1332. doi: 10.3389/fpls.2019.01332

Received

13 April 2019

Accepted

25 September 2019

Published

22 October 2019

Volume

10 - 2019

Edited by

Hailing Jin, University of California, United States

Reviewed by

Zhaoqing Chu, Shanghai Chenshan Plant Science Research Center (CAS), China; Ling Li, Mississippi State University, United States

Updates

Copyright

*Correspondence: Karl-Heinz Kogel,

This article was submitted to Plant Microbe Interactions, a section of the journal Frontiers in Plant Science

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics