Abstract
The knowledge about plant miRNAs has increased exponentially, with thousands of miRNAs been reported in different plant taxa using high throughput sequencing technologies and bioinformatic tools. Nevertheless, several groups of plants remain unexplored, and the gap of knowledge about conifer miRNAs is considerable. There is no sequence or functional information available on miRNAs in Araucariaceae. This group is represented in Brazil by only one species, Araucaria angustifolia, an endangered species known as Brazilian pine. In the present study, Brazilian pine has its transcriptome explored with respect to small RNAs, representing the first description in a member of the Araucariaceae family. The screening for conserved miRNAs in Brazilian pine revealed 115 sequences of 30 miRNA families. A total of 106 precursors sequences were predicted. Forty one comprised conserved miRNAs from 16 families, whereas 65 were annotated as novel miRNAs. The comparison of Brazilian pine precursors with sRNA libraries of other five conifer species indicates that 9 out 65 novel miRNAs are conserved among gymnosperms, while 56 seems to be specific for Brazilian pine or restricted to Araucariaceae family. Analysis comparing novel Brazilian pine miRNAs precursors and Araucaria cunninghamii RNA-seq data identified seven orthologs between both species. Mature miRNA identified by bioinformatics predictions were validated using stem-loop RT-qPCR assays. The expression pattern of conserved and novel miRNAs was analyzed in five different tissues of 3-month-old Araucaria seedlings. The present study provides insights about the nature and composition of miRNAs in an Araucariaceae species, with valuable information on miRNAs diversity and conservation in this taxon.
Introduction
MicroRNAs (miRNAs) represent an important class of gene expression regulators (He and Hannon, ). These small nucleic acids correspond to 20–25 nt endogenous noncoding RNA sequences with considerable impact on virtually all biological processes (Budak and Akpinar, ). miRNA genes are transcribed as long precursor transcripts, called pri-miRNAs, which have the capacity to form a fold-back hairpin structure (Chorostecki et al., ). By dicer-like-1 enzyme (DL1) processing, precursors are cleaved and mature 5p/3p miRNA duplexes are produced (Zhang Y. et al., ). Usually, one of these mature miRNAs, is incorporated into RNA-induced silencing complexes (RISCs) (Paroo et al., ), and by the action of Argonaute 1 proteins (AGO1), these complexes act over mRNA targets, directing their sequestration or degradation (Pratt and MacRae, ).
The first record of a miRNA was performed in 1993 in a study with C. elegans (Lee et al., ). Since then, a series of studies were carried out and thousands of miRNAs were described in several animal and plant taxa (Cui et al., ). Among land plants, the miRNA characterization approach was extensively propagated, mainly, in Angiosperm model species like Oryza sativa (Wang, ), Zea mays (Mica, ), Panicum virgatum (Xie et al., ), Glycine max (Severin et al., ) among others. In miRBase (Kozomara and Griffiths-Jones, ) it is possible note that Angiosperm species miRNAs are remarkably predominant among Viridiplantae data. Even with predictions of novel miRNAs in some conifer species like Picea abies (Yakovlev et al., ), Pinus taeda (Lu et al., ), Pinus densata (Wan et al., ), among others, the knowledge about the complexity of Gymnosperm miRNAs is very limited. Besides, in the field of small RNAs biology, a series of non-studied conifer species have great commercial and ecological importance, which means that a plethora of valuable genetic resources remains hidden in several taxa.
Araucaria angustifolia (Bertol.) Kuntze is the only endemic species of Gymnosperm with economic importance in Brazil (Steiner et al., ). This species, commonly named Brazilian pine, was the most important wood species from south Brazil in the past century (Santos et al., ). Representing valuable source of seeds, wood, fiber and resin, Brazilian pine was the target of extensive exploitation over decades, suffering massive population decrease (Steiner et al., ). Brazilian pine seeds are recalcitrant, maintaining a high metabolism status during the storage (Steiner et al., ). Consequently, under normal conditions, the seeds have a short conservation period, with substantial decrease in water potential and viability reduction at 4 months after harvest (Araldi et al., ). This recalcitrant feature compromises the conservation of Brazilian pine seeds and, consequently, hampers recovery efforts for degraded populations (Longhi et al., ). Currently, this species is classified as critically endangered, according to the International Union of Conservation of Nature Red List of Threatened Species (Thomas, ).
Brazilian pine has been targeted by some genetic studies mainly with a focus on somatic embryogenesis (Santos et al., , ; Steiner et al., ). Recently, an RNA-seq data were used to perform a transcriptome comparative profile analysis of early development stages (Elbl et al., ). However, there is no information about miRNAs in this species. In the present study, the Illumina technology was used for sequencing a Brazilian pine small RNA library. Using a bioinformatic approach, a series of conserved and putative novel miRNAs, including their stem-loop structure, sequences, and some potential targets were reported. Also, predicted miRNAs were compared with sRNA sequences from six different conifer species and with RNA-seq data from Araucaria cunninghamii to investigate the presence of these miRNAs in other conifer taxa. Finally, stem-loop RT-qPCR was applied to validate bioinformatics outputs and analyze differential expression patterns of 12 conserved and 30 novel predicted miRNAs in five different tissues of 3-month Araucaria plants. The present data provide valuable information about Brazilian pine micro-RNA biology and will be very useful for future studies in this species as well as in Araucariaceae family.
Materials and Methods
Plant Material
For small RNA library preparation and sequencing, fresh leaves were collected from an adult Araucaria tree situated at coordinate 29°51′52.3″S 50°53′51.9″ in Rio Grande do Sul in Brazil.
For stem-loop RT-qPCR analysis, Araucaria seeds, obtained from the seasonal production, were used for germination and seedling production. Brazilian pine plants were grown under standard greenhouse conditions (Moreira-souza et al., ) until reaching 90 days (3-months). Then, samples of each replication were collected and frozen in an ultra-freezer at −80°C for subsequent molecular analysis.
Small RNA Isolation and Illumina Sequencing
Total RNA was extracted from Brazilian pine fresh leaves with Trizol reagent (Invitrogen, CA, USA), following the standard protocol. The quantification of isolated RNA was determined using Nanodrop (Nanodrop Technologies, Wilmington, DE, USA). RNA sample was sent to Fasteris SA (Plan-les-Ouates, Switzerland) for sequencing. Using the Illumina HiSeq2000 platform, one sRNA library was constructed and sequenced, comprising 28,376,092 single-end reads with a length of 50 bases (NCBI accession number SRR8599283). The sRNA library building follows a series of standard steps, briefly described as follows: gel purification of the RNA fragments ranging from 20 to 30 nt, ligation of the 3p and 5p adapters and followed by gel purification, cDNA synthesis and cDNA gel purification, and, finally, PCR amplification to generate a cDNA colony template library for deep sequencing.
Bioinformatic Analysis of sRNA Library
The Illumina small RNA library was processed. First, poor-quality bases, with a Fastq value below 30, were removed and adapter sequences were trimmed using Sickle-Quality-Base-Trimming (https://github.com/najoshi/sickle) and Cutadapt (https://cutadapt.readthedocs.io/en/stable/), respectively. Second, reads with unknown nucleotides (containing one or more “N” bases) were removed with Prin-Seq script (Schmieder and Edwards, ). Third, sequences shorter than 18 and longer than 25 nucleotides were also excluded. Finally, Plant small RNAs derived from rRNAs, tRNAs, snRNAs, snoRNAs deposited at the tRNAdb (Jühling et al., ), SILVA rRNA (Jühling et al., ), and NONCODE v3.0 (Jühling et al., ) databanks as well as from Gymnosperm mtRNA and cpRNA deposited at NCBI GenBank database (https://www.ncbi.nlm.nih.gov/) were used as references to align the reads by Bowtie (Langmead et al., ).
A set of 24 A. angustifolia mRNA-seq libraries were downloaded from NCBI Sequence Read Achieve (SRA) under bioproject PRJNA240554 (Elbl et al., ). The libraries were processed in order to cut poor-quality bases off the start and end of the reads, considering a fastq quality threshold of 30, and remove adapter sequences using Trim galore (https://www.bioinformatics.babraham.ac.uk/projects/trim_galore/). Next, the complete transcriptome was assembled with Trinity (Haas et al., ) using default parameters, and the predicted contigs were used as reference sequences for pre-miRNA prediction and identification of potential miRNA targets.
Identification of Conserved Mature miRNAs
To identify conserved mature miRNAs, all mature miRNA sequences of Viridiplantae from miRBase (version 22) (Kozomara and Griffiths-Jones, ) were downloaded and mapped against A. angustifolia clean small reads with bowtie I (Langmead et al., ), allowing no mismatches.
Prediction of Conserved and Novel miRNA Precursors
miR-PREFeR (Lei and Sun, ) was used for prediction of miRNA precursor sequences. This pipeline uses SAM files obtained from the mapping between small RNAs and the complete transcriptome assembled with Bowtie (Langmead et al., ). The candidate precursors obtained were manually revised with Tablet software (Milne et al., ), and confirmed according to the anchoring patterns: correctly stem-loop secondary structures should harbor the mature miRNA sequence at one arm of the stem and the antisense miRNA sequence (miRNA*), when detected, at the opposite arm.
The predicted precursor miRNAs were compared with miRBase stem-loop and mature sequences by BLASTn allowing no mismatches and classified into two categories, conserved precursors or putative novel precursors. The precursor stem-loop structures, as well as their minimal folding free energy (MFE), were analyzed using the annotation algorithm from the UEA sRNA toolkit (Moxon et al., ).
Comparison Between Small RNA-Seq Data of Brazilian Pine and Other Conifer Species
Small RNA-seq libraries of another six-conifer species were downloaded from NCBI: P. abies (SRR824149; SRR824150) (Källman et al., ), Ginkgo biloba (SRR1658896, SRR1658901), Cunninghamia lanceolata (SRR066638) (Wan et al., ), Taxus mairei (SRR797042) (Hao et al., ), and Taxus wallichiana (SRR1343578). The parameters for data cleaning and preprocessing were applied as described in the section Bioinformatic Analysis of sRNA Library. Next, all libraries were collapsed into a unique library, redundancy was removed, and reads were tagged with species code and read counts number. Using bowtie (Langmead et al., ), all libraries were mapped against Brazilian pine miRNA precursors, separately, allowing no mismatches. Using Tablet software (Milne et al., ), the anchoring patterns were visualized.
To investigate the presence of novel miRNAs predicted in A. angustifolia in another species of Araucaria genus, RNA-seq libraries of A. cunninghamii were downloaded from GenBank (accession PRJNA277081) and this data was analyzed as follows. The libraries were processed, low-quality reads and adaptors were trimmed. The complete transcriptome was de novo assembled with Trinity. Sequences of novel miRNA precursors from A. angustifolia were blasted against A. cumnigamia unigenes. Matched sequences comprising a total extension or at least the region flanked by mature and antisense miRNAs were folded using UEA sRNA workbench (Stocks et al., ).
Prediction of miRNA Targets
The prediction of target genes of the mature miRNAs from the conserved and novel pre-miRNAs was performed using psRNAtarget (Dai and Zhao, ) using A. angustifolia assembled unigenes. Default parameters and the expectation value of 3.0 were considered in this analysis. Blast2Go software (Conesa and Götz, ) was used to understand the functions of the putative target genes.
Biological Confirmation of Predicted miRNAs by Stem-Loop RT-qPCR
To validate and analyze patterns of expression of Brazilian pine predicted miRNAs, Stem-loop RT-qPCR method was performed. Seeds were germinated, and seedlings were grown until reaching an age of 3-months. Then, RNA samples were isolated using the Trizol reagent (Invitrogen, CA, USA). Five different tissues: young leaf, old leaf, stem, main root, and secondary root, were analyzed using four biological replicates. RNA quality was evaluated using 1% agarose gel electrophoresis and Nanodrop. cDNA was obtained for 42 miRNAs based on the stem-loop method (Chen, ). Primers sequences for stem-loop cDNA synthesis and mature miRNA expression are in Table S1. RT-qPCR reactions were performed in a CFX 384 RealTime PCR System (Bio-Rad). PCR mixes were carried out in a final volume of 10 μL, containing 5 μL of diluted cDNA (1:100) and 5 μL of reagents mix: 1X SYBR Green, 0.025 mM dNTP, 1X PCR buffer, 3 mM MgCl2, 0.25 U Platinum Taq DNA Polymerase (Invitrogen) and 200 nM of each reverse and forward primer. The RT-qPCR conditions were configured in this way: 94°C for 5 min, 40 cycles of 94°C for 15 s, 60°C for 10 and 25 s at 72°C. Melting curves were analyzed at the end of RT-qPCR runs to confirm the quality of amplified products. Samples were evaluated in four technical replicates. Using geNorm (https://genorm.cmgg.be/), normalizations for miRNA were performed with the Aang-miR171, Aang-nmiR009, and Aang-nmiR046, as the best combination of normalizers, following well-stablished criteria for miRNA RT-qPCR analysis (Kulcheski et al., ). To calculate the relative expression of miRNAs 2−ΔΔCt method was used (Livak and Schmittgen, ). To carry out statistical analysis, ANOVA was applied using SAS software Version 9.4 (SAS Institute, Cary, NC, USA) and Duncan's multiple range test was performed to compare pairwise differences in expression, considering p < 0.05.
Results
Diversity of Small RNA in Araucaria angustifolia
A total of 26,102,142 reads were obtained from the A. angustifolia small RNA library. This library was processed. Adapters, low-quality reads, base redundant reads, as well as reads longer than 25 and shorter than 18 nucleotides were removed. The clean library comprised 19,505,320 (74.73%) reads (Table 1), which were used for further analysis (Figure 1). The small RNA length distribution in Brazilian pine shown an interesting pattern. The length distribution and diversity of sRNAs are shown in the Figure 2A and Table S2. The highest abundance was observed in sequences with 21 and 24 nt, with 21 nt small RNA comprising more than 10 million sequences. The library composition analysis showed that 14.45% of reads matched miRNAs, 19.20% matched rRNA, 1.52% matched tRNAs, 0.06% matched snRNAs, 0.69% matched snoRNAs, 7.35% matched mtRNA, 6.64% matched cpRNA, 13.16% matched transposons (TEs), and 36.93% matched other RNAs (Table 1).
Table 1
| Type of small RNA | Number of reads | Percentage |
|---|---|---|
| Total* | 26,102,142 | 100.00% |
| <18 nt | 3,847,477 | 14.74% |
| >25 nt | 2,749,345 | 10.53% |
| 18–25 nt | 19,505,320 | 74.73% |
| miRNA | 2,819,381 | 10.80% |
| rRNA | 3,744,202 | 14.34% |
| tRNA | 297,176 | 1.14% |
| snRNA | 12,395 | 0.05% |
| snoRNA | 135,169 | 0.52% |
| mtRNA | 1,433,049 | 5.49% |
| cpRNA | 1,295,819 | 4.96% |
| retrotransposon | 2,567,205 | 9.84% |
| Other sRNA | 7,200,924 | 27.59% |
Summary of data from A. angustifolia small RNA library sequencing.
Reads with length up to 44 nt.
Figure 1
Figure 2
Identification of Conserved miRNAs in Araucaria angustifolia
All the Viridiplantae mature miRNAs deposited in miRBase were downloaded and mapped against A. angustifolia sRNA data with bowtie (Langmead et al., ). In this analysis, mismatches were not considered. As shown in Figure 2B and Table S3, 115 sequences matched miRNAs from 30 conserved families (miR156, miR159, miR160, miR164, miR165, miR166, miR167, miR168, miR169, miR171, miR319, miR390, miR394, miR395, miR396, miR397, miR398, miR399, miR403, miR408, miR529, miR535, miR858, miR894, miR947, miR1314, miR3711, miR4995, miR5139, miR5145, miR6300, and miR6478). The number of unique sequences per family as well as their read counts were highly variable, suggesting complex expression patterns of conserved miRNAs in A. angustifolia (Table S3).
Identification of Pre-miRNAs Hairpin Sequences in Araucaria angustifolia
Brazilian pine has no nuclear genome sequenced yet. Instead, mRNA-seq data are available in GenBank. Then, 24 libraries were downloaded and the complete transcriptome was de novo assembled with Trinity. The assembly features are shown in Table 2. The assembled Araucaria transcriptome comprised 360,259 transcripts with an average length of 673 nt. To identify A. angustifolia miRNA precursors, the reference transcriptome, as well as the sRNA library, were loaded into miR-PREFeR (Lei and Sun, ). This tool follows the criteria for plant miRNA annotation, using expression patterns of miRNAs to predict plant miRNAs from small RNA-Seq data (Lei and Sun, ). In this way, 106 miRNA precursors, were predicted (Datas S1, S2). Using BLAST search, sequences of mature miRNA and antisense miRNA (miRNA*) of each precursor was compared with mature and stem-loop Viridiplantae data from the miRBase platform, and mismatches were not considered. Following this stringent condition, 41 precursor sequences (pre-miRNAs) of 16 conserved miRNA families were reported (Table 3, Table S4, and Data S1).
Table 2
| Total reads | 326,525,998 |
| Number of transcripts | 360,259 |
| Median transcript length (nt) | 256 |
| Mean transcript length (nt) | 673 |
| Max transcript length (nt) | 23,129 |
| Number of transcripts >1 Kbases | 77,861 |
| N50 | 1,606 |
Statistics on Araucaria angustifolia libraries and transcriptome de novo assembly.
Table 3
| Loci | Pre-miRNA | miRNA 5P | miRNA 3P | |||||||
|---|---|---|---|---|---|---|---|---|---|---|
| miRNA family | miRNA locus | Length | Read counts | MFE | Sequence | Read count | Length | Sequence | Read count | Length |
| miR156 | 156 | 87 | 15,116 | −48.5 | TGACAGAAGAGAGTGAGCAC | 13,367 | 20 | GCTCACCATCTCTTTCTGTCAGC | 1,583 | 23 |
| miR159 | 159a | 198 | 91,542 | −71.8 | CTTGGATTGAAGGGAGCTCC | 77,010 | 20 | AAGCTTCCTTCAGTCCAATCG | 3 | 21 |
| 159b | 168 | 91,690 | −91.2 | AGCTCCCTTCGGTCCAATT | 244 | 19 | CTTGGATTGAAGGGAGCTCC | 77,010 | 20 | |
| miR160 | 160 | 87 | 80 | −47.1 | TGCCTGGCTCCCTGTATGCCA | 44 | 20 | GTTGGCATAGAGGGAATCAAG | 3 | 21 |
| miR166 | 166a | 71 | 1,008,365 | −46.2 | AAGGGGATTGCGGTCTGGCT | 243 | 20 | TCGGACCAGGCTTCATTCCCC | 997,155 | 21 |
| 166b | 94 | 92,632 | −42.6 | GGACTGTTGTCTGGCTCGAAG | 33 | 21 | CCGGACCAGGCTTCATTCCCC | 90,796 | 21 | |
| 166c | 110 | 1,008,012 | −54.5 | GGAATGTTGTCTGGCTCGAGG | 22 | 21 | TCGGACCAGGCTTCATTCCCC | 997,155 | 21 | |
| 166d | 78 | 1,008,989 | −47.7 | GGAATGTTGTCTGGCTCGACT | 780 | 21 | TCGGACCAGGCTTCATTCCCC | 997,155 | 21 | |
| 166e | 96 | 1,063,754 | −47.0 | Non-detected | – | – | TCGGACCAGGCTTCATTCCCC | 997,155 | 21 | |
| miR167 | 167a | 120 | 13,848 | −66.8 | TGAAGCTGCCAGCATGATCTGA | 11,024 | 22 | AGATCATCTGGTAGCTTCAGC | 580 | 21 |
| 167b | 117 | 4,785 | −58.7 | TGAAGCTGCCAGCATGATCTGG | 2,611 | 22 | TACCAGATCATGGTGGTGGCC | 2 | 21 | |
| 167c* | 91 | 4,788 | −53.2 | TGAAGCTGCCAGCATGATCTGG | 2,611 | 22 | AGGTCATCTGGCAGTTTCACC | 6 | 21 | |
| 167d | 109 | 4,783 | −57.3 | TGAAGCTGCCAGCATGATCTGG | 2,611 | 22 | TACCAGATCATGGTGGTGGCC | 2 | 21 | |
| miR168 | 168 | 184 | 6,801 | −74.0 | TCGCTTGGTGCAGGTCGGGAA | 4,436 | 21 | CCCTGCTTGCATCAACTGAAT | 332 | 21 |
| miR169 | 169a | 136 | 1,137 | −99.7 | AACAACTTGCCGGCTATTCTA | 1 | 21 | GGCAAGTTGTTCTCGGCTATG | 918 | 21 |
| 169b | 131 | 195 | −42.8 | AAGCCAAGGATGAATTGCCGC | 13 | 21 | GGCAAGTTGTTCTTGGCTACG | 72 | 21 | |
| 169c | 155 | 1,802 | −72.7 | AAGCCAAGGATGATTTGCCGG | 938 | 21 | GGCAAGTTGTTCTTGGCTACG | 72 | 21 | |
| 169d | 158 | 2,886 | −71.5 | AAGCCAAGGATGATTTGCCGG | 938 | 21 | GGCAAGTTGTTCTCGGCTATG | 918 | 21 | |
| miR171 | 171a | 94 | 491 | −50.4 | GGATATTGGAGCGGTTCAACC | 2 | 21 | TTGAGCCGTGCCAATATCGCA | 378 | 21 |
| 171b | 114 | 46 | −63.1 | GTGATGTTGGCTGGGCTCAAT | 4 | 21 | TGAGCCGTGCCAATATCACAA | 16 | 21 | |
| 171c | 84 | 36 | −47.1 | GTGATGTTGGCTGGGCTCAAT | 4 | 21 | TGAGCCGTGCCAATATCACAA | 16 | 21 | |
| miR390 | 390a | 83 | 1,998 | −45.0 | AAGCTCAGGAGGGATAGCGCC | 1,952 | 21 | CGCTATCTATCCTGAGCTTTT | 13 | 23 |
| miR394 | 394 | 80 | 611 | −43.7 | CTGGCATTCTGTCCACCTCC | 312 | 21 | AGGCGGACGGTATGCCAAGT | 18 | 20 |
| miR395 | 395a | 112 | 2,712 | −52.1 | GTTCCCTCAACTACTTCAGAA | 157 | 21 | CTGAAGAGTTTGGGGGAACTC | 2,157 | 21 |
| 395b | 95 | 2,514 | −36.5 | Non-detected | – | 21 | CTGAAGAGTTTGGGGGAACTC | 2,157 | 21 | |
| miR396 | 396 | 120 | 5,246 | −56.8 | TTCCACAGCTTTCTTGAACTT | 4,606 | 21 | TTCAAGATTGCTGTGGGAAA | 1 | 20 |
| miR399 | 399a | 114 | 136 | −66.3 | GGGGAGCTCTCCTTTGGCGGG | 6 | 21 | TGCCAAAGGAGAGTTGCCCTG | 120 | 21 |
| 399b | 102 | 129 | −64.0 | GGGGGGCTCTCCTTTGGTGGG | 2 | 21 | TGCCAAAGGAGAGTTGCCCTG | 120 | 21 | |
| 399c | 111 | 134 | −63.0 | GGGGAGCTCTCCTTTGGCAGG | 2 | 21 | TGCCAAAGGAGAGTTGCCCTG | 120 | 21 | |
| 399d | 96 | 135 | −53.2 | GGGGAGCTCTCCTTTGGCGGG | 6 | 21 | TGCCAAAGGAGAGTTGCCCTG | 120 | 21 | |
| 399e | 112 | 134 | −67.1 | GGGGAGCTCTCCTTTGGCAGG | 2 | 21 | TGCCAAAGGAGAGTTGCCCTG | 120 | 21 | |
| miR408 | 408 | 93 | 93 | −55.6 | GCCGGGAAGAGATAGCGCAT | 1 | 20 | TGCACTGCCTCTTCCCTGGCTG | 79 | 22 |
| miR529 | 529a | 115 | 542 | −51.9 | AGAAGAGAGAGAGCACAGCCT | 357 | 21 | AGGCTGTGCTCTCTCTCTTC | 1 | 21 |
| 529b | 89 | 544 | −51.4 | AGAAGAGAGAGAGCACAGCCT | 357 | 21 | GCTGTGCTCTCTCTCTTCTTC | 2 | 21 | |
| 529c | 119 | 540 | −49.1 | AGAAGAGAGAGAGCACAGCCT | 357 | 21 | Non-detected | – | – | |
| 529d | 104 | 540 | −44.6 | AGAAGAGAGAGAGCACAGCCT | 357 | 21 | Non-detected | – | – | |
| 529e | 103 | 463 | −51.5 | AGAAGAGAGAGAGTACAGCCC | 35 | 21 | GTTGTGCTCTCTCTCTTCTTC | 383 | 21 | |
| miR1314 | 1314a* | 71 | 6,878 | −34.1 | CTCCTACATTTAGGGTCGCCG | 1,188 | 21 | TCGGCCTTGAATGTTAGGAGAG | 4,523 | 22 |
| 1314b* | 107 | 6,904 | −53.5 | CTCCTACATTTAGGGTCGCCG | 1,188 | 21 | TCGGCCTTGAATGTTAGGAGAG | 4,523 | 22 | |
| 1314c* | 135 | 5,853 | −60.2 | CTTCTAAATTTAAGGTCGCCG | 560 | 21 | TCGGCCTTGAATGTTAGGAGAG | 4,523 | 22 | |
| 1314d* | 124 | 5,921 | −51.5 | CTTCTAAATTTAAGGTCGCCG | 560 | 21 | TCGGCCTTGAATGTTAGGAGAG | 4,523 | 22 | |
pre-miRNAs and mature miRNAs identified in A. angustifolia matching miRNA families in other plant species.
Precursor miRNAs also identified in Araucaria cunninghamii.
The most represented miRNA families were miR166, miR399, and miR529 with five members, followed by miR167 and miR 1314, with four members (Figures 2C,D). In contrast, with only one member, miR156, miR160, miR168, miR390, miR394, miR396, and miR408 were the less represented miRNA families. The length of conserved pre-miRNAs ranged from 71 to 198 nt and the minimal folding free energy (MFE) ranged from −34.15 to −99.70 kcal/mol (Table 3 and Table S4). The anchoring patterns, as well as stem-loop structures of the 41 known precursors, are shown in the Data S1. The other 65 precursors were considered putative novel pre-miRNAs (Figure 3 and Data S2). The length of these sequences ranged from 61 to 422 nt and the MFE ranged from −335.72 to −14.9 kcal/mol with an average negative folding value of −55, 9 kcal/mol (Table 4 and Table S5). As occurred with conserved pre-miRNAs, all novel pre-miRNAs showed regular hairpin structures. Also, in 58 out of 65 (89.23%) novel pre-miRNAs were possible to detect the antisense miRNA (miRNA*), which strongly support their prediction (Table 4, Table S5, and Data S2), indicating that these pre-miRNAs integrate the A. angustifolia miRNAome.
Figure 3
Table 4
| Loci | Pre-nmiRNA | nmiRNA 5P | nmiRNA 3P | ||||||
|---|---|---|---|---|---|---|---|---|---|
| nmiRNA | Length | Read count | MFE | Sequence | Read count | Length | Sequence | Read count | Length |
| Aang-nmiR001 | 93 | 303,638 | −41.7 | ACTGTGGGATGATGTCAAAAA | 945 | 21 | TTTGACATCACACCCGCGGTGA | 294,457 | 22 |
| Aang-nmiR002 | 78 | 299,835 | −30.1 | TATTCATCTATCACTGTGGAAA | 4,092 | 22 | TCCCCGGTGATTGATGAAGACA | 231,531 | 22 |
| Aang-nmiR003* | 90 | 132,298 | −40.4 | CGTGGGCGACCGGGGAAAATT | 51 | 21 | TTTTCCCTGATCCGCCCATGCC | 102,252 | 22 |
| Aang-nmiR004 | 126 | 83,829 | −94.3 | TAGTAGACCAGACTCGCCATC | 82,793 | 21 | CGAGTCGGATCTACTACAACCT | 243 | 22 |
| Aang-nmiR005* | 85 | 82,708 | −38.0 | TGACAGCCCGAAATCAGCGAGT | 75,705 | 22 | TGGCTGATTCGGACTATCAAG | 311 | 21 |
| Aang-nmiR006 | 90 | 76,565 | −36.8 | GTGGTCGGCGAGAAGAATCC | 37,665 | 20 | Non-detected | – | – |
| Aang-nmiR007 | 142 | 73,017 | −63.1 | TGGGCTTACATGTCTGTCGATG | 141 | 22 | TCAGCAGACATGTAGGCCAACC | 71,657 | 22 |
| Aang-nmiR008 | 117 | 62,299 | −55.8 | TCCCAAACATCGTCCAGAAATA | 41,135 | 22 | GTTTGGACGATGTTTGGAATG | 548 | 21 |
| Aang-nmiR009 | 132 | 47,232 | −65.3 | TCTCGAACATCCTGCAGCCATT | 44,752 | 22 | TGGCTGCACGACTTCGAGATA | 546 | 21 |
| Aang-nmiR010 | 243 | 37,279 | −93.6 | CTGGTAAACAGATGGGGCACT | 181 | 21 | CGCCCCATCTGATTACCGGTC | 36,806 | 21 |
| Aang-nmiR011 | 94 | 21,441 | −30.8 | TCACCGTGGACCGATGTAAAA | 354 | 21 | TTACGTCAGGTCCTCTGTGATT | 17,626 | 22 |
| Aang-nmiR012 | 90 | 18,386 | −35.7 | CCATCCGGCACTTGATGTCAAA | 39 | 22 | TGACGTCAGGTCCTCGATGGTT | 16,623 | 22 |
| Aang-nmiR013 | 94 | 13,462 | −34.0 | CCATTGAGCGCTTGGTGTCAAA | 7,534 | 22 | TAACATCAGGCCCTCGATGATT | 310 | 22 |
| Aang-nmiR014 | 69 | 13,373 | −14.9 | TCTTGGATTTATGGAAGACGAACC | 8,615 | 24 | AGGATGTTTTCATTAATCAAGAAC | 19 | 24 |
| Aang-nmiR015* | 64 | 13,041 | −24.9 | GGTCGTCACGGTCGGTCCGCC | 3,356 | 21 | Non-detected | – | – |
| Aang-nmiR016 | 67 | 12,997 | −38.5 | CGAGGAAATAATGTGAAGAAC | 1,048 | 21 | TCTTCACATCCTTTCCTCGGA | 7,684 | 21 |
| Aang-nmiR017 | 81 | 11,945 | −40.7 | CGTGGGGGCGTTGGACAAAACC | 353 | 22 | TTTTGCCAATACCTCCCATGCC | 11,321 | 22 |
| Aang-nmiR018 | 114 | 11,727 | −40.7 | CCGTATTCATTAACCATAGAG | 328 | 21 | CCTGTGGTTAATGAATACATCG | 7,218 | 22 |
| Aang-nmiR019 | 76 | 11,414 | −35.3 | CCATCGAGGCTTGACGTCAAAA | 119 | 22 | TTACGTCAGGTCCTCTATGGTT | 8,998 | 22 |
| Aang-nmiR020 | 100 | 6,535 | −45.4 | AGGGCTGTCCGTGATTGGGCA | 15 | 21 | TCATACCCAATCACCGACAGC | 3,981 | 21 |
| Aang-nmiR021* | 149 | 6,158 | −61.0 | Non-detected | – | – | TTTTTCCAATTCCGCCCATGCC | 5,945 | 21 |
| Aang-nmiR022 | 140 | 5,009 | −44.1 | CCGTATTCATTAACCATAGAG | 328 | 21 | CCTATGATTAATGAATACATCG | 4,168 | 22 |
| Aang-nmiR023 | 99 | 4,579 | −59.2 | TGACTGTCGTGGATGTATATC | 4,464 | 21 | TGTACATGCACGACGGTCACG | 6 | 21 |
| Aang-nmiR024 | 109 | 3,727 | −55.0 | CAGCCAAGAATGATTTGCCCGCC | 565 | 23 | GGCAGGTCATTCTTGGTGCT | 1,089 | 20 |
| Aang-nmiR025 | 177 | 3,525 | −64.0 | CCGCATCAGGTCTCCAAGGTG | 3,447 | 21 | Non-detected | – | – |
| Aang-nmiR026 | 88 | 3,344 | −40.2 | TGGATAGGAGGAGGATTCATG | 1 | 21 | TAAATCCTTCTGCTGTCCATA | 3,062 | 21 |
| Aang-nmiR027 | 75 | 2,601 | −37.9 | CCACCGTGGACCTGGTGTGAA | 10 | 21 | TCACGTCAGGACCTCGGTGGTT | 1,810 | 22 |
| Aang-nmiR028 | 102 | 2,574 | −49.3 | TCCGGAGACGTCGGCGGGGGC | 1,114 | 21 | Non-detected | – | – |
| Aang-nmiR029 | 126 | 2,184 | −54.9 | AAAACCATTGACTATCAAAAGA | 47 | 22 | TTTTGATAGCCAGTGGCAATC | 1,364 | 21 |
| Aang-nmiR030 | 127 | 1,798 | −72.4 | TGCGCCCTCGCGGCGGGCC | 74 | 19 | GCGCTGGCCGGCGGGCTTTC | 454 | 20 |
| Aang-nmiR031 | 123 | 1,751 | −65.2 | ACCTCGCCAACAATCTCAGC | 110 | 22 | TGAGATTGTTGGAGAGGTTCG | 847 | 21 |
| Aang-nmiR032 | 104 | 1,283 | −43.3 | AGAAGAGAGAAAGCACATCCC | 814 | 21 | GTTGTGCTCTCTCTCTTCTTC | 383 | 21 |
| Aang-nmiR033* | 176 | 1,238 | −68.2 | TTACCACGCCCGCCCATGCCTA | 379 | 22 | GGCGTTGCCGGTCTGGTAAAA | 571 | 21 |
| Aang-nmiR034 | 61 | 1,232 | −18.3 | GTCCTATTCCGTTGGCCT | 563 | 18 | GAATAACGTGATAGGAGTCTG | 12 | 21 |
| Aang-nmiR035 | 127 | 1,203 | −68.1 | ATGCTTGTTATCTCTGTGCGGC | 548 | 22 | CCGCGCAGAAATAAAAGCATG | 19 | 21 |
| Aang-nmiR036* | 112 | 1,122 | −46.0 | CCTTGTTCCTATTTACTGGCA | 932 | 21 | TCAATAAATAGGAACACAGGTT | 133 | 22 |
| Aang-nmiR037 | 122 | 855 | −42.6 | AGTCAACTCAAGTCTTTGAAA | 14 | 21 | TTAAAGATTTGAGTTGTCCAA | 666 | 21 |
| Aang-nmiR038 | 65 | 830 | −37.9 | AGTGGGAGGAACGGGCAAAAACT | 96 | 23 | TTTTCCCGGCTCCTCCCATTCC | 662 | 22 |
| Aang-nmiR039 | 112 | 777 | −48.5 | ATTGGACAACTCAATCTTTGA | 260 | 21 | CTCAAGGACTTGAGCTGTCCAA | 113 | 22 |
| Aang-nmiR040 | 135 | 713 | −55.4 | CAGCAAGTGGAAAACTAGAAT | 11 | 21 | TATTTCAGTTCTTCACTTGCT | 506 | 21 |
| Aang-nmiR041 | 114 | 668 | −45.3 | TGGGCTTACATGTCTGTCGATG | 141 | 22 | TCAGCAGATATGTCAGCCAACC | 379 | 22 |
| Aang-nmiR042 | 141 | 664 | −69.0 | CACATTTTTAGTCTGAAACTG | 317 | 21 | TTCAGACTAAAGATGTGTATT | 235 | 21 |
| Aang-nmiR043 | 81 | 590 | −47.3 | TCCGAATTCCGCGACGCTCCA | 255 | 21 | GGACCGTCGCTGAATTCGGAG | 143 | 21 |
| Aang-nmiR044* | 104 | 559 | −22.9 | TTGCTGTCCATCAAAGAAGGC | 377 | 21 | Non-detected | – | – |
| Aang-nmiR045 | 143 | 465 | −71.7 | TTTCTGTGAACAAAATTTCAA | 112 | 21 | TTTGAAATTTTGGTCATAGAG | 38 | 21 |
| Aang-nmiR046 | 86 | 442 | −42.8 | AGTGGGATGCGAGGATAAGACT | 259 | 22 | TCTTTCCTACGCCTCCCATTCC | 172 | 22 |
| Aang-nmiR047 | 131 | 392 | −45.2 | AGAATTGAAAAACTTGCCTAT | 126 | 21 | AGGCATGTTTTTCAATTCTGA | 40 | 21 |
| Aang-nmiR048 | 69 | 370 | −27.0 | CCATTGAGCACTTGTTGTCAA | 9 | 21 | TTTTTTTGACATCAGGCCCTC | 206 | 21 |
| Aang-nmiR049 | 89 | 369 | −45.6 | TGATAAGGCCCTAATGACACAA | 269 | 21 | GTGTTATTTGGGCTTGTCATT | 27 | 21 |
| Aang-nmiR050 | 82 | 343 | −44.6 | TTATTGAATACTGGTGAAAGG | 12 | 21 | TTACCAGTCCTCAATGAGATC | 279 | 21 |
| Aang-nmiR051 | 198 | 330 | −149.8 | GTCTGCAGAGTGTATGGCCTG | 272 | 21 | CCTGCAGTCCAACATATACG | 1 | 20 |
| Aang-nmiR052 | 188 | 275 | −67.6 | TTCCAAAGCAGATAGATTGCCA | 86 | 22 | GCATCTGTCTGCTTCGGAATA | 41 | 21 |
| Aang-nmiR053 | 94 | 231 | −64.6 | GCAATGAATCGGCTGAATCGC | 141 | 21 | Non-detected | – | – |
| Aang-nmiR054 | 99 | 223 | −61.4 | TGACCGTCGTGGATGTATATC | 175 | 21 | TGCACGACGGTCACGACTGCC | 14 | 21 |
| Aang-nmiR055 | 67 | 218 | −20.2 | GTGGCCTATCGATCCTTTAG | 33 | 20 | CTAGAGGTGTCAGAAAAGTTAC | 105 | 22 |
| Aang-nmiR056 | 100 | 180 | −34.4 | CTTGATGATGATAACCGTTGACG | 15 | 23 | CACGGTTTGTCTGAAAGAT | 43 | 19 |
| Aang-nmiR057 | 135 | 176 | −82.1 | TGCTGAAATCGGTCGTACTGA | 78 | 21 | GGTACGATCGATTTCGGTATA | 70 | 21 |
| Aang-nmiR058 | 265 | 173 | −102.0 | TGGCATGACTTGCAAATTATG | 9 | 21 | CAATTTGTAAGGCCATGCTAAT | 145 | 22 |
| Aang-nmiR059 | 422 | 147 | −335.7 | AGTGTCCAGCATTTCTCGTCT | 4 | 21 | TGAGAAATGCTGGACACTTCT | 127 | 21 |
| Aang-nmiR060 | 98 | 147 | −47.9 | TGTTTCTACTGAGTTGGTTTCC | 68 | 22 | AACCTACTTAGTGAAAACATG | 2 | 21 |
| Aang-nmiR061 | 83 | 123 | −36.0 | CGTGGGCGTCTTGGACAAAGC | 21 | 22 | TTTTTCCAATGCCGCCCATGCC | 91 | 22 |
| Aang-nmiR062 | 85 | 101 | −49.6 | CCCGTATTGAAGATCAACCCA | 11 | 21 | GGTTGATCTTCAATATGGCGC | 59 | 21 |
| Aang-nmiR063 | 90 | 86 | −54.8 | TTTTGATTTTCAGTACGAATA | 13 | 21 | TTCGTACTGAAAATCAAAATC | 8 | 21 |
| Aang-nmiR064 | 158 | 72 | −87.6 | GTTTTAACTCATGGATATGCA | 42 | 21 | CATATCCATGAGTTAAAACCC | 18 | 21 |
| Aang-nmiR065 | 113 | 46 | −46.9 | TTTATTGATTTGATGCTAATGA | 3 | 22 | CTTAACACCAGACTAATGAACA | 12 | 22 |
Characteristics of novel pre-miRNAs and mature miRNAs identified in A. angustifolia.
Correspond to pre-miRNAs also identified in Araucaria cunninghamii.
Comparison Between Araucaria angustifolia Predicted Precursors and Small RNA Data From Different Gymnosperms
As a set of mature miRNA sequences of 65 precursors did not match miRBase data, the main online repository for all miRNA sequences and annotation (Kozomara and Griffiths-Jones, ), they could be classified as potential species-specific miRNAs. However, the miRBase platform has a restricted miRNA annotation in gymnosperms. There are only four conifer species of three genera with miRNA sequences annotated in this platform, C. lanceolata, P. abies, P. densata, and P. taeda. In addition, a series of studies with plant miRNAs, including conifer miRNAs (Chen et al., ; Zhang et al., ; Li et al., ), was published and several novel plant miRNAs were proposed, but these data were not present in the currently miRBase version. Thus, it is possible that novel miRNAs proposed in different species and classified as species-specific miRNAs could be also present in other taxa.
To avoid overestimation of novel miRNAs, and to provide a comprehensive comparison between conifer miRNAs, sRNA-seq libraries of five conifer species, P. abies, G. biloba, C. lanceolata, T. mairei, and T. wallichiana, were downloaded from GenBank (the accession codes were shown in Materials and Methods section). A comprehensive Phylogenetic relationship among this species is shown in Lu et al. (). The libraries were processed and only reads with length ranging from 18 to 25 nt remained. Then, using Bowtie (Langmead et al., ), the conifer sRNA libraries were mapped against the Brazilian pine miRNA precursors in two different ways. First, the small conifer libraries were mapped separately allowing no mismatches. Second, all libraries, including Brazilian pine sRNA, were collapsed, organized into unique reads, and mapped allowing no mismatches, and the mapping was visualized with Tablet (Milne et al., ).
All precursor sequences of miR156, miR159, miR160, miR166, miR167, miR168, miR390, and miR396 families matched with reads of all libraries, with high redundancy (Table 5). For the other conserved pre-miRNAs, the mapping pattern was not the same. For example, the four precursor members of conifer conserved family miR1314 (Berruezo et al., ) showed correspondent reads in four of seven libraries, with the highest redundancy in G. biloba and the minimal coverage in T. wallichiana library (Table 5).
Table 5
| miRNA precursor | Cunninghamia lanceolata | Ginkgo biloba | Picea abies | Taxus mairei | Taxus wallichiana | Araucaria angustifolia | |
|---|---|---|---|---|---|---|---|
| Conserved miRNAs | Aang-miR156 | 283,795 | 4,702 | 230 | 21,967 | 1,574 | 15,118 |
| Aang-miR159ab | 1 | 52 | 244 | 28 | 1 | 91,693 | |
| Aang-miR160 | 38 | 269 | 270 | 48 | 2 | 80 | |
| Aang-miR166abcde | 4,983 | 719,138 | 433,596 | 21,426 | 4,574 | 1,063,754 | |
| Aang-miR167abcd | 33,472 | 5,437 | 955 | 779 | 45 | 13,848 | |
| Aang-miR168 | 5,639 | 37,003 | 1,184 | 27,150 | 59,574 | 6,801 | |
| Aang-miR169abcd | 0 | 28 | 0 | 0 | 0 | 2,887 | |
| Aang-miR171abc | 17 | 10,214 | 89 | 3 | 4 | 491 | |
| Aang-miR390a | 395 | 8,922 | 1,974 | 164 | 52,965 | 1,999 | |
| Aang-miR394 | 0 | 11,081 | 312 | 4 | 0 | 611 | |
| Aang-miR395ab | 0 | 2 | 0 | 2 | 0 | 2,722 | |
| Aang-miR396 | 102 | 4,147 | 254 | 794 | 1,166 | 5,246 | |
| Aang-miR399abcde | 2 | 20 | 3 | 28 | 0 | 136 | |
| Aang-miR408 | 325 | 47 | 0 | 190 | 22,972 | 94 | |
| Aang-miR529abcde | 0 | 77 | 58 | 174 | 11 | 1,086 | |
| Aang-miR1314abcd | 0 | 64,219 | 20 | 0 | 1 | 6,904 | |
| Novel miRNAs | Aang-nmiR006 | 0 | 116 | 0 | 24 | 1,151 | 110,647 |
| Aang-nmiR014 | 1,301 | 295 | 24 | 426 | 1,085 | 32,638 | |
| Aang-nmiR025 | 355 | 3,118 | 242 | 2,441 | 787 | 5,040 | |
| Aang-nmiR028 | 696 | 564 | 3 | 68 | 11,953 | 3,084 | |
| Aang-nmiR031 | 0 | 126,984 | 42,987 | 1,402 | 163 | 1,751 | |
| Aang-nmiR034 | 0 | 4 | 0 | 1,470 | 2,148 | 1,869 | |
| Aang-nmiR046 | 0 | 31 | 0 | 41 | 742 | 442 | |
| Aang-nmiR055 | 345 | 409 | 128 | 46 | 1,402 | 618 | |
| Aang-nmiR063 | 43 | 252 | 57 | 60 | 357 | 454 |
Mapping patterns of sRNAs from different gymnosperms against conserved and novel miRNA precursors predicted in A. angustifolia.
Interestingly, 9 out of 65 novel pre-miRNAs (Aang-nmiR006, Aang-nmiR014, Aang-nmiR025, Aang-nmiR028, Aang-nmiR031, Aang-nmiR034, Aang-nmiR046, Aang-nmiR055, and Aang-nmiR063) also matched sRNA sequences of other conifer species (Table 5), and the mapping patterns suggest that these miRNAs could represent potential conserved miRNAs among gymnosperms. By visualizing the mapping patterns, it is possible to note that reads of libraries of different species aligned to precursors in the same way as miRNA and miRNA* sequences from A. angustifolia, as illustrated in Figure 4A. An interesting characteristic of miRNA genes is that most loci are conserved across organisms (Mutum et al., ). Therefore, it is necessary to be careful to nominate novel miRNAs as species-specific (Taylor et al., ). These results suggest that, although the putative novel miRNAs did not match miRBase sequences, they not necessarily represent species-specific miRNAs. Instead, is possible that nine putative novel miRNAs predicted in A. angustifolia may have evolved early in Gymnosperm lineages. On the other hand, 56 out of 65 novel pre-miRNAs seem to be novel non-conserved miRNAs in Brazilian pine or specific at some lower taxonomic level, like the Araucariaceae family.
Figure 4
Identification of Brazilian Pine Pre-miRNA Orthologs in Araucaria cunninghamii
To obtain insights about diversity and evolution of novel and conserved miRNA precursors predicted in A. angustifolia, these precursors were blast-searched against A. cunninghamii complete transcriptome assembled. In this way, five pre-miRNAs of two conserved miRNA families (miR167 and miR1314) and six putative novels pre-miRNAs (Aang-nmiR003, Aang-nmiR005, Aang-nmiR015, Aang-nmiR021, Aang-nmiR033, and Aang-miR036) predicted in A. angustifolia were identified in A. cunninghamii RNA-seq data (Datas S3, S4). Among the conserved miRNAs, all members of the family miR1314, only present in conifers, were found in A. cunninghamii (Data S3). In the pre-miRNA sequence alignments it was possible to note that base identity rates varied to 86% between Aang-miR1314a and putative Acun-miR1314a to 95% between Aang-miR1314d and putative Acun-miR1314d (Data S3). Among novel miRNAs base identity rates were also high, reaching 96% in the Aang-nmiR003/putative Acun-nmiR003 (Figure 4B) and Aang-nmiR033/putative Acun-nmiR033 pairs. In all cases mismatches appeared in loop regions, antisense miRNA (miRNA*) region and five to seven bases up or downstream mature miRNAs (Datas S3, S4). These results reinforce the validation of predicted miRNAs in this study and indicate that some novel miRNAs predicted in A. angustifolia are conserved in Araucaria genus or Araucariaceae family.
Identification of Targets for Conserved and Novel Brazilian Pine Pre-miRNAs
To add information about the biological function of miRNAs in Brazilian pine, miRNA-targets were computationally predicted using the psRNATarget platform. In this analysis, conserved and putative novel predicted mature sequences were aligned to a set of Brazilian pine assembled unigenes. A cut-off threshold of 3 was applied for expectation value. Following this criterion, 54 potential targets were found (for 32 miRNA families) of which, 11 were targets of conserved miRNAs (10 miRNA families) and 43 were targets of novel miRNAs (22 miRNA families). Detailed annotation outputs are shown in Table 6. Among the conserved miRNAs with predicted targets, Aang-miR156 was the only one that exhibited two targets, SBP (Squamosa promoter binding) and a gene encoding an exocyst complex component (EXO70A1-like). The other conserved miRNAs exhibited only one predicted potential target. For instance, Aang-miR159/transcription factor GAMYB, Aang-miR395/ATP sulfurylase 2 and Aang-miR1314/transcription factor ice 1. Among the novel miRNAs only eight exhibited only one predicted target (Aang-nmiR012, Aang-nmiR019, Aang-nmiR021, Aang-nmiR022, Aang-nmiR029, Aang-nmiR031, Aang-nmiR038, and Aang-nmiR052), the others showed two or more targets, for instance, Aang-nmiR006 exhibited four potential targets. The potential target genes regulated by the conserved miRNAs seem to display physiological functions, such as ATP-sulfurylase, regulation of gene expression (transcription factors GAMYB and ice1), RNA metabolism (U5 small ribonucleoprotein) and signaling cascades (ethylene receptor) (Table 6). Interestingly, a series of novel miRNAs (17 out of 22) are predicted to target genes related to disease resistance, like nucleotide binding site leucine-rich repeats (NBS-LRR). These results suggest that the conserved miRNAs are involved in a broad range of relatively conserved physiological functions, whereas most of the novel miRNAs, possibly Araucariaceae-specific miRNAs, seem to be involved in disease resistance.
Table 6
| miRNA | Target acc. | Expectation | Inhibition | Annotation |
|---|---|---|---|---|
| Aang-miR156 | 1931415 | 2.5 | Cleavage | squamosa promoter-binding 20 |
| 796092 | 3 | Cleavage | exocyst complex component EXO70A1-like | |
| Aang-miR159 | 977162 | 2.5 | Cleavage | transcription factor GAMYB |
| Aang-miR166 | 1704393 | 2 | Cleavage | homeodomain-leucine zipper transcription factor HB-3 |
| Aang-miR167 | 1066318 | 2.5 | Cleavage | zinc finger ZAT4-like |
| Aang-miR171 | 2163525 | 1.5 | Cleavage | scarecrow 6 |
| Aang-miR390 | 153786 | 2.5 | Cleavage | ethylene receptor 2-like |
| Aang-miR395 | 968805 | 3 | Cleavage | ATP sulfurylase 2 |
| Aang-miR399 | 1917714 | 2.5 | Cleavage | probable apyrase 6 |
| Aang-miR529 | 1928068 | 2 | Cleavage | U5 small nuclear ribonucleoprotein 40 kDa isoform X1 |
| Aang-miR1314 | 1949852 | 2.5 | Translation | transcription factor ice1 |
| Aang-nmiR003 | 799146 | 2.5 | Translation | diterpene synthase |
| 2042481 | 2.5 | Cleavage | disease resistance RPP13 4 | |
| Aang-nmiR005 | 1950381 | 2.5 | Translation | probable receptor kinase At1g49730 |
| 1063530 | 2.5 | Cleavage | G-type lectin S-receptor-like serine threonine- kinase At2g19130 | |
| Aang-nmiR006 | 1926825 | 0 | Cleavage | disease resistance RGA3 |
| 2436549 | 0 | Cleavage | LRR and NB-ARC domain disease resistance | |
| 1580639 | 1 | Cleavage | NBS-LRR disease resistance | |
| 2026350 | 1 | Cleavage | disease resistance RPM1-like | |
| Aang-nmiR009 | 225523 | 3 | Cleavage | disease resistance RGA3 |
| 967108 | 3 | Cleavage | disease resistance RGA2-like isoform X1 | |
| 1681946 | 1.5 | Cleavage | disease resistance RPM1-like | |
| Aang-nmiR012 | 1601456 | 2 | Cleavage | TMV resistance N-like |
| Aang-nmiR017 | 2007770 | 2.5 | Cleavage | NBS-LRR |
| 2105514 | 2.5 | Cleavage | disease resistance TAO1-like | |
| 2105518 | 2.5 | Cleavage | TMV resistance N-like | |
| Aang-nmiR018 | 811798 | 2.5 | Translation | disease resistance (TIR-NBS-LRR class) |
| 811797 | 2.5 | Translation | TMV resistance N-like | |
| 160015 | 2.5 | Translation | disease resistance (TIR-NBS-LRR class) | |
| Aang-nmiR019 | 1997048 | 2.5 | Translation | TMV resistance N-like |
| Aang-nmiR021 | 2023865 | 1.5 | Cleavage | disease resistance RPM1-like |
| Aang-nmiR022 | 2023865 | 1.5 | Cleavage | disease resistance RPM1-like |
| Aang-nmiR025 | 869457 | 1.5 | Cleavage | senescence-associated |
| 1863722 | 0 | Cleavage | cytochrome P450 like TBP | |
| Aang-nmiR027 | 169430 | 2.5 | Cleavage | cyclin-T1-3-like isoform X1 |
| 1942123 | 2.5 | Cleavage | cyclin-T1-5 | |
| Aang-nmiR029 | 1580639 | 2.5 | Translation | NBS-LRR disease resistance |
| Aang-nmiR031 | 2026350 | 2.5 | Translation | disease resistance RPM1-like |
| Aang-nmiR038 | 1946613 | 1.5 | Translation | disease resistance RPP13 4 |
| Aang-nmiR039 | 1799491 | 2.5 | Cleavage | disease resistance RGA1 |
| 2509763 | 2.5 | Cleavage | resistance family | |
| Aang-nmiR051 | 2060073 | 0 | Cleavage | probable xyloglucan endotransglucosylase hydrolase 10 |
| 706255 | 1.5 | Cleavage | probable xyloglucan endotransglucosylase hydrolase 32 | |
| Aang-nmiR052 | 984263 | 2.5 | Translation | L-ascorbate oxidase homolog |
| Aang-nmiR054 | 1236307 | 3 | Cleavage | L-ascorbate oxidase |
| 2324731 | 2.5 | Cleavage | cell division cycle 123 homolog | |
| Aang-nmiR059 | 2131961 | 2.5 | Cleavage | target of AVRB operation1 |
| 2174518 | 0 | Cleavage | SUPPRESSOR OF npr1- CONSTITUTIVE 1-like | |
| 2151817 | 1.5 | Cleavage | TMV resistance N-like isoform X2 | |
| Aang-nmiR062 | 818026 | 0 | Cleavage | TMV resistance N-like |
| 2126093 | 0 | Cleavage | disease resistance TAO1-like | |
| 1820645 | 0 | Cleavage | Disease resistance (TIR-NBS-LRR class) family | |
| Aang-nmiR064 | 1749835 | 2.5 | Cleavage | disease resistance (TIR-NBS-LRR class) |
| 2031676 | 2.5 | Cleavage | TMV resistance N-like |
Predicted putative targets of novel and conserved miRNAs in A. angustifolia.
Expression Profiles of Conserved and Novel miRNAs From Araucaria angustifolia
The stem-loop RT-qPCR method was used to validate and measure the expression of 12 conserved miRNAs (Aang-miR156, Aang-miR159, Aang-miR166, Aang-miR167, Aang-miR168, Aang-miR169, Aang-miR171, Aang-miR390, Aang-miR395, Aang-miR399, Aang-miR529, Aang-miR1314) and 30 novel miRNAs (Aang-nmiR001, Aang-nmiR002, Aang-nmiR003, Aang-nmiR004, Aang-nmiR005, Aang-nmiR007, Aang-nmiR008, Aang-nmiR009, Aang-nmiR011, Aang-nmiR012, Aang-nmiR016, Aang-nmiR017, Aang-nmiR018, Aang-nmiR019, Aang-nmiR021, Aang-nmiR023, Aang-nmiR025, Aang-nmiR026, Aang-nmiR027, Aang-nmiR029, Aang-nmiR038, Aang-nmiR040, Aang-nmiR044, Aang-nmiR046, Aang-nmiR049, Aang-nmiR051, Aang-nmiR054, Aang-nmiR057, Aang-nmiR059, Aang-nmiR061) in five tissues (young leaves, old leaves, stem, main root and secondary root) of 3-month-old plants (Figure 5A). The expression patterns of these miRNAs are illustrated in Figures 5B,C, Datas S5–S7.
Figure 5
Four conserved miRNAs showed high expression patterns in just one tissue compared to the others, Aang-miR166, and Aang-miR168 in the stem, and Aang-miR156 and Aang-miR159 in secondary root. The other conserved miRNAs showed different expression patterns (Figure 5B). For example, Aang-miR395 showed high levels in stem, followed by young and old leaves and low levels in main and secondary roots, Aang-miR167 showed highest expression levels in young leaves and secondary roots and Aang-miR399 showed highest levels in old leaves and secondary roots (Figure 5B).
The relative expression data of the novel miRNAs suggested highly complex expression patterns over the plant body (Figure 5C, Datas S6, S7). Among thirty novel miRNAs, twenty-seven exhibited some degree of variation in expression between the tissues (Figure 5C, Datas S6, S7). For example, Aang-nmiR038 was abundantly expressed in young and old leaves, moderately expressed in stem and main root and weakly in secondary roots (Datas S6, S7). The expression level of Aang-nmiR008 was approximately 3-fold higher in old leaves than in young leaves, 2-fold higher than secondary roots and slightly higher than stem and main root (Datas S6, S7). Three novel miRNAs, Aang-nmiR003, Aang-nmiR021, and Aang-nmiR061, had higher expression in the stem than in other tissues (Datas S6, S7). Aang-nmiR021 was barely detected in old leaves, moderately expressed in young leaves, main and secondary roots and strongly expressed in the stem with a predominant expression pattern in this tissue (Datas S6, S7). Aang-nmiR023, Aang-nmiR044, and Aang-nmiR051 showed ubiquitous expression levels in the tissues with a slight increase in main root (Datas S6, S7). Fourteen novel miRNAs had higher expression in secondary roots than in other tissues: Aang-nmiR001, Aang-nmiR002, Aang-nmiR004, Aang-nmiR005, Aang-nmiR007, Aang-nmiR011, Aang-nmiR012, Aang-nmiR016, Aang-nmiR017, Aang-nmiR018, Aang-nmiR019, Aang-nmiR025, Aang-nmiR026, and Aang-nmiR049 (Figure 5, Datas S6, S7). In some cases, the expression levels were more than 10-fold higher in secondary roots than in other tissues. For example, Aang-nmiR016 was more than 50-fold higher in secondary roots than in young leaves and Aang-nmiR025 was 20-fold higher in secondary roots than in stem (Data S6). In contrast, some novel miRNAs were homogenously expressed among the tissues, as illustrated by Ang-nmiR40 and Aang-nmiR044 (Figure 5, Datas S6, S7).
Discussion
High-throughput sequencing technologies represent a breakthrough in the molecular biology scientific world. Thousands of genome sequences, RNA-seq, and sRNA-seq projects have been released and a plethora of biological process have been comprehensively analyzed. In the plant small RNA biology field, a series of miRNAs have been identified in a series of groups, but there is no genetic data available about miRNAs in any species (members) of Araucariaceae family. Araucaria angustifolia is the most important endemic conifer species in Brazil and has been used in a series of genetic studies (Auler et al., ; Souza et al., ; Elbl et al., ), but there is no available information about miRNAs in this species. In the present study, Illumina technology was used for deep sequencing of small RNA library to identify miRNAs in A. angustifolia.
The small RNA length distribution in A. angustifolia shows high abundance in sequences with 21 and 24 nt, (Figure 2B). Plant sRNAs are commonly reported in two principal size classes, 21 nucleotides and 24 nucleotides (Chávez Montes et al., ). This distribution pattern is well-documented in angiosperms (Guzman et al., , ; Källman et al., ). However, non-angiosperm species comprise alternative ones (Chávez Montes et al., ). For example, in conifers, P. abies and Pinus contorta fail to produce significant numbers of 24-nt long small RNAs (Dolgosheina et al., ), whereas Chinese fir (Wan et al., ) libraries were predominantly represented by this length class. The length enrichment toward 21 nt in the Brazilian pine sRNAome was also reported in other conifer-species (Dolgosheina et al., ; Chávez Montes et al., ; Zhang et al., ). The high abundance of 24 nt sRNAs, as well as the presence of 2.5 million sRNA, reads related to transposable elements (TEs) may reflect a myriad of sRNA types and functions, including events of transposition regulation. Liu and El-Kassaby pinpointed that a significant portion of 24 nt sRNAs may be related to TE silencing in Picea glauca during early developmental stages, and this expression decreases throughout the progression of phases (Liu and El-Kassaby, ). Also, a small RNA class called hc-siRNAs, mostly 24 nts in length, are substantially numerous in sRNA libraries of land plants (Axtell and Meyers, ). Usually, these sRNAs are derived from intergenic, repetitive and transposon-related genomic regions (Axtell and Meyers, ). Therefore, the high diversity of 24 nt sequences, as well as the high number of sequences related to TEs suggests a role of these elements in Brazilian pine biology.
By comparing small RNA data from A. angustifolia with miRBase, 115 sequences representing 30 evolutionary plant miRNA families were identified (Figure 2C). The number of family members, as well as abundancy, varied among families and sequences, respectively. Among conserved mature miRNA families identified in A. angustifolia, miR156, miR159, miR166, and miR167 have been documented as high abundant in several plant groups (Taylor et al., ). Other evolutionary conserved miRNA families were also identified in Brazilian pine. Among them, seven are conserved from Embryophytes (Berruezo et al., ), miR160, miR171, miR319, mir395, miR396, miR408, miR535, three are conserved from Tracheophytes (Berruezo et al., ), miR162, miR168, and miR403, and three conserved from Gymnosperms (Chávez Montes et al., ), miR947, miR1314, and miR3711 (Figure 6).
Figure 6
The miR403 family was reported in conifers and other vascular plant groups (Jagtap and Shivaprasad,
By using a well-established approach in the prediction of pre-miRNAs (Lei and Sun,
To exceed the miRBase blast-search in the novel and conserved miRNA identification and obtain more insights about the presence of pre-miRNAs predicted in A. angustifolia in different conifer species, including one of the same genera, some alternative approaches were applied. First, sRNA-seq of conifer species from different taxa, including the basal conifer G. biloba were mapped against pre-miRNAs predicted in A. angustifolia. Since mismatches were not considered, and the mapping patterns could be visualized, the output data showed that all predicted conserved pre-miRNAs were matched with small RNAs of different conifers, mainly in their sequences of 5P and 3P mature miRNAs. Interestingly, the same pattern was observed in 9 of the 65 novel pre-miRNAs predicted in A. angustifolia (Table 5). It is important to mention that pre-miRNAs were obtained from RNAseq data of embryonic tissues, not representing neither the juvenile phase used in the RT-qPCR analysis, nor the mature leaf tissue used to obtain the small RNA seq data corresponding to mature miRNAs. So far, the amount pre-miRNAs detected in this study is certainly underestimated. These findings are particularly important since these miRNAs were considered novel because their mature and antisense miRNAs do not have potential ortholog sequences in the current version of the miRBase platform, even with data from four conifer species. In addition, the probable presence of these nine miRNAs in G. biloba suggests that they may evolved in early divergence times of gymnosperms. Extending this idea, another unusual analysis was applied with a similar purpose, but this time toward a related taxon. There is no data from miRNAs in the Araucariaceae family. This family has three genera, Agathis, Araucaria, and Wollemia with a primarily Southern Hemisphere distribution, with the clear majority of species endemic to Australia, New Zealand, New Guinea, and New Caledonia and just two species, Araucaria araucana and A. angustifolia endemic to South America (Escapa and Catalano,
Several genes were predicted as potential targets for novel and conserved A. angustifolia miRNAs (Table 6). Among the conserved miRNA targets, functions related to energetic metabolism, signal transduction and gene expression control were found. The targets related to conserved miRNAs were reported in several taxa, miR171-scarecrow 6 (Zhu et al., 2015), miRNA395-ATP sulfurylase (Zhang et al.,
Interestingly, a series of novel miRNAs are predicted to target NBS-LRR disease resistance genes. A strong association between the diversity NBS-LRRs and miRNAs was reported in a genome-wide study with 70 land plants (Zhang et al., 2016). There are evidence that NBS-LRR genes keep giving birth to new miRNAs targeting themselves in various plant lineages (Cui et al.,
Stem-loop RT-qPCR was applied to analyze expression patterns of conserved and novel miRNAs predicted in A. angustifolia (Figure 5, Datas S5–S7). Among the conserved miRNAs analyzed, Aang-nmiR166 showed highest expression levels on stem compared to the other tissues. This miRNA family has HD-ZIPIII transcription factors as potential targets, as reported in a series of studies and in the present work. Experimental analysis showed that the knock-down of miR166 promoted a substantial increase in expression levels of HD-ZIPIII genes OsHB3 and OsHB4 in stem, compared to other tissues in rice plants (Zhang J. et al.,
The expression of Aang-miR156 in secondary roots was 200-fold higher than in leaves, 10-fold higher than in stem and more than 5-fold higher than in main root (Figure 5B). These results corroborate other findings of the requirement of high levels of miR156 expression for adventitious root formation in Arabidopsis and Malus xiaojinensis (Xu et al.,
Aang-miR171 was expressed homogeneously in all tissues, which suggests that this miRNA family plays important roles in different organs in A. angustifolia. A series of studies reported several phenotypes in miR171-overexpressing lineages in different species (Hai et al.,
Thirty novel miRNAs predicted in A. angustifolia were also biologically validated via stem-loop RT-qPCR (Figure 5, Datas S6, S7). These miRNAs exhibited extremely diverse expression patterns among young leaves, old leaves, stem, main root and secondary roots (Figure 5, Datas S6, S7). The association between RT-qPCR data and target prediction rises important clues about the function of these miRNAs in A. angustifolia. For instance, among the novel miRNAs targeting disease-resistant genes, Aang-nmiR038 (targeting RPP13) was high expressed in young and old leaves, Aang-nmiR003 (targeting RPP13) and Aang-nmiR021 (targeting RPM1-like) were high expressed in stem, and other three novel miRNAs were higher expressed in secondary roots, Aang-nmiR017 (targeting NBS-LRR, TAO and TMV), Aang-nmiR018 (targeting NBS-LRR class), Aang-nmiR019 (targeting TMV). These patterns and associations suggest that the novel pre-miRNAs predicted in A. angustifolia integrate a series pathways in this species.
Conclusion
In the present study, a small-RNA library was constructed by high-throughput sequencing of A. angustifolia leaves with the aim of identifying miRNA precursors, mature miRNAs and miRNA targets in this species. Also, a series of conserved and novel miRNAs predicted in A. angustifolia was identified in RNA-seq data from different conifers, including the Australian native congeneric species A. cunninghamii. This study provides the first report on the transcriptome-wide identification of miRNAs as well as the first view of the diversity, abundance and expression patterns of these small RNAs in Araucariaceae. Bioinformatics analysis suggests that Brazilian pine conserved and novel miRNAs might contribute to several physiological processes by targeting multiple targets and affecting different pathways. The novel lineage-specific miRNAs seem to be more involved with response to pathogens by targeting NBS-LRR resistance genes. Experimental analysis indicates that these miRNAs are expressed in different patterns through the plant body. This miRNA-target interaction remains to be further explored in order to achieve novel biological and evolutionary aspects in Brazilian pine and related species. It is possible that a series of miRNAs annotated in the present study can integrate the genetic pool of several non-studied conifers, including species from Araucariaceae and genus Araucaria. Therefore, these data represent valuable information for future genetic studies of miRNAs in Gymnosperms, by providing insights about biology, diversity, expression and evolution of these small RNAs. The upload of these data in miRBase will also be important for comparative analysis with other plant groups.
Statements
Author contributions
RM, JG, and FG conceived and designed the study. FG conducted in silico analysis. JG and ME conducted the RT-qPCR experiments. JG and RM analyzed the data. JG and RM drafted the manuscript. All authors have read and approved the manuscript.
Funding
RM is the recipient of a research fellowship 309030/2015-3 from Conselho Nacional de Pesquisa e Desenvolvimento Científico e Tecnológico, CNPq. JG and ME are the recipients of a Ph.D. fellowships from Conselho Nacional de Pesquisa e Desenvolvimento Científico e Tecnológico, CNPq. FG is the recipient of Post-Doctoral fellowship from Coordenação de Aperfeiçoamento de Pessoal de Nível Superior CAPES. The present study was also partially supported through a grant from INCT-MCTIC.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fgene.2019.00222/full#supplementary-material
References
1
AraldiC. G.MariaC.CoelhoM. (2015). Establishment of post-harvest early-developmental categories for viability maintenance of Araucaria angustifolia seeds. Acta Bot. Bras.. 29, 524–531. 10.1590/0102-33062015abb0061
2
AraziT. (2012). MicroRNAs in the moss Physcomitrella patens. Plant Mol. Biol.80, 55–65. 10.1007/s11103-011-9761-5
3
AulerN. M. F.dos ReisM. S.GuerraM. P.NodariR. O. (2002). The genetics and conservation of Araucaria angustifolia: I. Genetic structure and diversity of natural populations by means of non-adaptive variation in the state of Santa Catarina, Brazil. Genet. Mol. Biol.25, 329–338. 10.1590/S1415-47572002000300014
4
AxtellM. J.MeyersB. C. (2018). Revisiting criteria for plant MicroRNA annotation in the era of big data. Plant Cell30, 272–284. 10.1105/tpc.17.00851
5
BerruezoF.de SouzaF. S. J.PiccaP. I.NemirovskyS. I.Martínez TosarL.RiveroM.et al. (2017). Sequencing of small RNAs of the fern Pleopeltis minima (Polypodiaceae) offers insight into the evolution of the microrna repertoire in land plants. PLoS ONE12:e0177573. 10.1371/journal.pone.0177573
6
BiswasS.HazraS.ChattopadhyayS. (2016). Identification of conserved miRNAs and their putative target genes in Podophyllum hexandrum (Himalayan Mayapple). Plant Gene6, 82–89. 10.1016/j.plgene.2016.04.002
7
BudakH.AkpinarB. A. (2015). Plant miRNAs: biogenesis, organization and origins. Funct. Integr. Genomics15, 523–531. 10.1007/s10142-015-0451-2
8
ChaoY.-T.SuC.-L.JeanW.-H.ChenW.-C.ChangY.-C. A.ShihM.-C. (2014). Identification and characterization of the microRNA transcriptome of a moth orchid Phalaenopsis aphrodite. Plant Mol. Biol.84, 529–548. 10.1007/s11103-013-0150-0
9
Chávez MontesR. A.de Rosas-CárdenasF. F.De PaoliE.AccerbiM.RymarquisL. A.MahalingamG.et al. (2014). Sample sequencing of vascular plants demonstrates widespread conservation and divergence of microRNAs. Nat. Commun.5:3722. 10.1038/ncomms4722
10
ChenC. (2005). Real-time quantification of microRNAs by stem-loop RT-PCR. Nucleic Acids Res.33:e179. 10.1093/nar/gni178
11
ChenC.-J.LiuQ.ZhangY.-C.QuL.-H.ChenY.-Q.GautheretD. (2011). Genome-wide discovery and analysis of microRNAs and other small RNAs from rice embryogenic callus. RNA Biol.8, 538–547. 10.4161/rna.8.3.15199
12
ChenY. T.ShenC. H.LinW. D.ChuH. A.HuangB. L.KuoC. I.et al. (2013). Small RNAs of Sequoia sempervirens during rejuvenation and phase change. Plant Biol.15, 27–36. 10.1111/j.1438-8677.2012.00622.x
13
ChorosteckiU.MoroB.RojasA. L. M.DebernardiJ. M.SchapireA. L.NotredameC.et al. (2017). Evolutionary footprints reveal insights into plant MicroRNA biogenesis. Plant Cell29, 1248–1261. 10.1105/tpc.17.00272
14
ConesaA.GötzS. (2008). Blast2GO: a comprehensive suite for functional analysis in plant genomics. Int. J. Plant Genomics2008:619832. 10.1155/2008/619832
15
CuiJ.YouC.ChenX. (2017). The evolution of microRNAs in plants. Curr. Opin. Plant Biol.35, 61–67. 10.1016/j.pbi.2016.11.006
16
CuperusJ. T.FahlgrenN.CarringtonJ. C. (2011). Evolution and functional diversification of MIRNA genes. Plant23, 431–442. 10.1105/tpc.110.082784
17
DaiX.ZhaoP. X. (2011). psRNATarget: a plant small RNA target analysis server. Nucleic Acids Res.39, W155–W159. 10.1093/nar/gkr319
18
DolgosheinaE. V.MorinR. D.AksayG.SahinalpS. C.MagriniV.MardisE. R.et al. (2008). Conifers have a unique small RNA silencing signature. RNA14, 1508–1515. 10.1261/rna.1052008
19
ElblP.LiraB. S.AndradeS. C. S.JoL.dos SantosA. L. W.CoutinhoL. L.et al. (2015). Comparative transcriptome analysis of early somatic embryo formation and seed development in Brazilian pine, Araucaria angustifolia (Bertol.) Kuntze. Plant Cell Tissue Organ Cult.120, 903–915. 10.1007/s11240-014-0523-3
20
EscapaI. H.CatalanoS. A. (2013). Phylogenetic analysis of Araucariaceae: integrating molecules, morphology, and fossils. Int. J. Plant Sci.174, 1153–1170. 10.1086/672369
21
GaspinC.RuéO.ZytnickiM. (2016). Ingredients for in silico miRNA identification and annotation. JSM Biotechnol Bioeng.3:1071.
22
GuzmanF.AlmerãoM. P.KorbesA. P.ChristoffA. P.ZanellaC. M.BeredF.et al. (2013). Identification of potential miRNAs and their targets in Vriesea carinata (Poales, Bromeliaceae). Plant Sci.210, 214–223. 10.1016/j.plantsci.2013.05.013
23
GuzmanF.AlmerãoM. P.KörbesA. P.Loss-MoraisG.MargisR. (2012). Identification of MicroRNAs from Eugenia uniflora by high-throughput sequencing and bioinformatics analysis. PLoS ONE7:e49811. 10.1371/journal.pone.0049811
24
HaasB. J.PapanicolaouA.YassourM.GrabherrM.BloodP. D.BowdenJ.et al. (2013). De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis. Nat. Protoc.8, 1494–1512. 10.1038/nprot.2013.084
25
HaiB. Z.QiuZ. B.HeY. Y.YuanM. M.LiY. F. (2018). Characterization and primary functional analysis of Pinus densata miR171. Biol. Plant.62, 318–324. 10.1007/s10535-018-0774-7
26
HaoD.-C.YangL.XiaoP.-G.LiuM. (2012). Identification of Taxus microRNAs and their targets with high-throughput sequencing and degradome analysis. Physiol. Plant.146, 388–403. 10.1111/j.1399-3054.2012.01668.x
27
HeL.HannonG. J. (2004). Correction: MicroRNAs: small RNAs with a big role in gene regulation. Nat. Rev. Genet.5, 631–631. 10.1038/nrg1415
28
JagadeeswaranG.NimmakayalaP.ZhengY.GowduK.ReddyU. K.SunkarR. (2012). Characterization of the small RNA component of leaves and fruits from four different cucurbit species. BMC Genomics13:329. 10.1186/1471-2164-13-329
29
JagtapS.ShivaprasadP. V. (2014). Diversity, expression and mRNA targeting abilities of Argonaute-targeting miRNAs among selected vascular plants. BMC Genomics15:1049. 10.1186/1471-2164-15-1049
30
JühlingF.MörlM.HartmannR. K.SprinzlM.StadlerP. F.PützJ. (2009). tRNAdb 2009: compilation of tRNA sequences and tRNA genes. Nucleic Acids Res.37, 159–162. 10.1093/nar/gkn772
31
KällmanT.ChenJ.GyllenstrandN.LagercrantzU. (2013). A significant fraction of 21-nucleotide small RNA originates from phased degradation of resistance genes in several perennial species. Plant Physiol.162, 741–754. 10.1104/pp.113.214643
32
KarlovaR.Van HaarstJ. C.MaliepaardC.Van De GeestH.BovyA. G.LammersM.et al. (2013). Identification of microRNA targets in tomato fruit development using high-throughput sequencing and degradome analysis. J. Exp. Bot.64, 1863–1878. 10.1093/jxb/ert049
33
KozomaraA.Griffiths-JonesS. (2014). miRBase: annotating high confidence microRNAs using deep sequencing data. Nucleic Acids Res.42, D68–D73. 10.1093/nar/gkt1181
34
KulcheskiF. R.Marcelino-GuimaraesF. C.NepomucenoA. L.AbdelnoorR. V.MargisR. (2010). The use of microRNAs as reference genes for quantitative polymerase chain reaction in soybean. Anal. Biochem.406, 185–192. 10.1016/j.ab.2010.07.020
35
LangmeadB.TrapnellC.PopM.SalzbergS. L. (2009). Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biol.10:R25. 10.1186/gb-2009-10-3-r25
36
LeeR. C.FeinbaumR. L.AmbrosV. (1993). The C. elegans heterochronic gene lin-4 encodes small RNAs with antisense complementarity to lin-14. Cell75, 843–854. 10.1016/0092-8674(93)90529-Y
37
LeiJ.SunY. (2014). miR-PREFeR: an accurate, fast and easy-to-use plant miRNA prediction tool using small RNA-Seq data. Bioinformatics30, 2837–2839. 10.1093/bioinformatics/btu380
38
LiQ.DengC.XiaY.KongL.ZhangH.ZhangS.et al. (2017a). Identification of novel miRNAs and miRNA expression profiling in embryogenic tissues of Picea balfouriana treated by 6-benzylaminopurine. PLoS ONE12:e0176112. 10.1371/journal.pone.0176112
39
LiX.XieX.LiJ.CuiY.HouY.ZhaiL.et al. (2017b). Conservation and diversification of the miR166 family in soybean and potential roles of newly identified miR166s. BMC Plant Biol.17:32. 10.1186/s12870-017-0983-9
40
LiuY.El-KassabyY. A. (2017). Landscape of fluid sets of hairpin-derived 21-/24-nt-long small RNAs at seed set uncovers special epigenetic features in Picea glauca. Genome Biol. Evol.9, 82–92. 10.1093/gbe/evw283
41
LivakK. J.SchmittgenT. D. (2001). Analysis of relative gene expression data using real-time quantitative PCR and the 2–CRCT method. Methods25, 402–408. 10.1006/meth.2001.1262
42
LonghiS. J.BrenaD. A.RibeiroS. B.GracioliC. R.LonghiR. V.MastellaT. (2009). Fatores ecológicos determinantes na ocorrência de Araucaria angustifolia e Podocarpus lambertii, na Floresta Ombrófila Mista da FLONA de São Francisco de Paula, RS, Brasil. Ciência Rural40, 57–63. 10.1590/S0103-84782009005000220
43
LuS.SunY. H.AmersonH.ChiangV. L. (2007). MicroRNAs in loblolly pine (Pinus taeda L.) and their association with fusiform rust gall development. Plant J.51, 1077–1098. 10.1111/j.1365-313X.2007.03208.x
44
LuY.RanJ.-H.GuoD.-M.YangZ.-Y.WangX.-Q. (2014). Phylogeny and divergence times of gymnosperms inferred from single-copy nuclear genes. PLoS ONE9:e107679. 10.1371/journal.pone.0107679
45
MicaE. (2006). Characterization of five microRNA families in maize. J. Exp. Bot.57, 2601–2612. 10.1093/jxb/erl013
46
MilneI.BayerM.CardleL.ShawP.StephenG.WrightF.et al. (2009). Tablet-next generation sequence assembly visualization. Bioinformatics26, 401–402. 10.1093/bioinformatics/btp666
47
MiskiewiczJ.TomczykK.MickiewiczA.SarzynskaJ.SzachniukM. (2017). Bioinformatics study of structural patterns in plant MicroRNA precursors. Biomed Res. Int.2017, 1–8. 10.1155/2017/6783010
48
Moreira-souzaM.JurandyE.NogueiraB. (1994). Practical Method For germination of Araucaria angustifolia (Bert.) O. Ktze. seeds. Sci. Agric. 60, 389–391.
49
MoxonS.SchwachF.DalmayT.MacLeanD.StudholmeD. J.MoultonV. (2008). A toolkit for analysing large-scale plant small RNA datasets. Bioinformatics24, 2252–2253. 10.1093/bioinformatics/btn428
50
MutumR. D.KumarS.BalyanS.KansalS.MathurS.RaghuvanshiS. (2016). Identification of novel miRNAs from drought tolerant rice variety Nagina 22. Sci. Rep.6:30786. 10.1038/srep30786
51
OmidvarV.MohorianuI.DalmayT.FellnerM. (2015). Identification of miRNAs with potential roles in regulation of anther development and male-sterility in 7B-1 male-sterile tomato mutant. BMC Genomics16:878. 10.1186/s12864-015-2077-0
52
ParooZ.LiuQ.WangX. (2007). Biochemical mechanisms of the RNA-induced silencing complex. Cell Res.17, 187–194. 10.1038/sj.cr.7310148
53
PrattA. J.MacRaeI. J. (2009). The RNA-induced silencing complex: a versatile gene-silencing machine. J. Biol. Chem.284, 17897–17901. 10.1074/jbc.R900012200
54
SamadA. F. A.SajadM.NazaruddinN.FauziI. A.MuradA. M. A.ZainalZ.et al. (2017). MicroRNA and transcription factor: key players in plant regulatory network. Front. Plant Sci.8:565. 10.3389/fpls.2017.00565
55
SantosA. L. W.dos SilveiraV.SteinerN.MaraschinM.GuerraM. P. (2010). Biochemical and morphological changes during the growth kinetics of Araucaria angustifolia suspension cultures. Braz. Arch. Biol. Technol.53, 497–504. 10.1590/S1516-89132010000300001
56
SantosA. L. W.dos SilveiraV.SteinerN.VidorM.GuerraM. P. (2002). Somatic embryogenesis in parana pine (Araucaria angustifolia (Bert.) O. Kuntze). Braz. Arch. Biol. Technol.45, 97–106. 10.1590/S1516-89132002000100015
57
SchmiederR.EdwardsR. (2011). Quality control and preprocessing of metagenomic datasets. Bioinformatics27, 863–864. 10.1093/bioinformatics/btr026
58
SeverinA. J.WoodyJ. L.BolonY.-T.JosephB.DiersB. W.FarmerA. D.et al. (2010). RNA-Seq atlas of Glycine max: a guide to the soybean transcriptome. BMC Plant Biol.10:160. 10.1186/1471-2229-10-160
59
SouzaM. I. F.de SalgueiroF.Carnavale-BottinoM.FélixD. B.Alves-FerreiraM.BittencourtJ. V. M.et al. (2009). Patterns of genetic diversity in southern and southeastern Araucaria angustifolia (Bert.) O. Kuntze relict populations. Genet. Mol. Biol.32, 546–556. 10.1590/S1415-47572009005000052
60
SteinerN.CatarinaC. S.BalbuenaT. S.GuerraM. P. (2008). Araucaria angustifolia biotechnology. Funct. Plant Sci. Biotechnol.2, 20–28.
61
StocksM. B.MoxonS.MaplesonD.WoolfendenH. C.MohorianuI.FolkesL.et al. (2012). The UEA sRNA workbench : a suite of tools for analysing and visualizing next generation sequencing microRNA and small RNA datasets. Bioinformatics28, 2059–2061. 10.1093/bioinformatics/bts311
62
TaylorR. S.TarverJ. E.HiscockS. J.DonoghueP. C. J. (2014). Evolutionary history of plant microRNAs. Trends Plant Sci.19, 175–182. 10.1016/j.tplants.2013.11.008
63
ThomasP. (2013). Araucaria angustifolia. The IUCN Red List of Threatened Species 2013:e.T32975A2829141. 10.2305/IUCN.UK.2013-1.RLTS.T32975A2829141.en
64
TzarfatiR.SelaI.GoldschmidtE. E. (2013). Graft-induced changes in MicroRNA expression patterns in citrus leaf petioles. Open Plant Sci. J.7, 17–23. 10.2174/1874294701307010017
65
Velayudha Vimala KumarK.SrikakulamN.PadbhanabhanP.PandiG. (2017). Deciphering microRNAs and their associated hairpin precursors in a non-model plant, Abelmoschus esculentus. Noncoding RNA3:19. 10.3390/ncrna3020019
66
WanL.-C.WangF.GuoX.LuS.QiuZ.ZhaoY.et al. (2012a). Identification and characterization of small non-coding RNAs from Chinese fir by high throughput sequencing. BMC Plant Biol.12:146. 10.1186/1471-2229-12-146
67
WanL. C.ZhangH.LuS.ZhangL.QiuZ.ZhaoY.et al. (2012b). Transcriptome-wide identification and characterization of miRNAs from Pinus densata. BMC Genomics13:132. 10.1186/1471-2164-13-132
68
WangJ.-F. (2004). Identification of 20 microRNAs from Oryza sativa. Nucleic Acids Res.32, 1688–1695. 10.1093/nar/gkh332
69
XieF.FrazierT. P.ZhangB. (2010). Identification and characterization of microRNAs and their targets in the bioenergy plant switchgrass (Panicum virgatum). Planta232, 417–434. 10.1007/s00425-010-1182-1
70
XuM.HuT.ZhaoJ.ParkM. Y.EarleyK. W.WuG.et al. (2016). Developmental functions of miR156-regulated SQUAMOSA PROMOTER BINDING PROTEIN-LIKE (SPL) genes in Arabidopsis thaliana. PLoS Genet.12:e1006263. 10.1371/journal.pgen.1006263
71
XuX.LiX.HuX.WuT.WangY.XuX.et al. (2017). High miR156 expression is required for auxin-induced adventitious root formation via MxSPL26 independent of PINs and ARFs in Malus xiaojinensis. Front. Plant Sci.8:1059. 10.3389/fpls.2017.01059
72
XueT.LiuZ.DaiX.XiangF. (2017). Primary root growth in Arabidopsis thaliana is inhibited by the miR159 mediated repression of MYB33, MYB65 and MYB101. Plant Sci.262, 182–189. 10.1016/j.plantsci.2017.06.008
73
YakovlevI. A.FossdalC. G.JohnsenØ. (2010). MicroRNAs, the epigenetic memory and climatic adaptation in Norway spruce. New Phytol.187, 1154–1169. 10.1111/j.1469-8137.2010.03341.x
74
ZhangJ.WuT.LiL.HanS.LiX.ZhangS.et al. (2013). Dynamic expression of small RNA populations in larch (Larix leptolepis). Planta237, 89–101. 10.1007/s00425-012-1753-4
75
ZhangJ.ZhangH.SrivastavaA. K.PanY.BaiJ.FangJ.et al. (2018). Knockdown of rice MicroRNA166 confers drought resistance by causing leaf rolling and altering stem xylem development. Plant Physiol.176, 2082–2094. 10.1104/pp.17.01432
76
ZhangL.ZhengY.JagadeeswaranG.LiY.GowduK.SunkarR. (2011). Identification and temporal expression analysis of conserved and novel microRNAs in Sorghum. Genomics98, 460–468. 10.1016/j.ygeno.2011.08.005
77
ZhangQ.LiJ.SangY.XingS.WuQ.LiuX. (2015). Identification and characterization of MicroRNAs in Ginkgo biloba var. epiphylla Mak. PLoS ONE10:e0127184. 10.1371/journal.pone.0127184
78
ZhangY.WangY.GaoX.LiuC.GaiS. (2018). Identification and characterization of microRNAs in tree peony during chilling induced dormancy release by high-throughput sequencing. Sci. Rep.8:4537. 10.1038/s41598-018-22415-5
79
ZhangY.XiaR.KuangH.MeyersB. C. (2016). The diversification of plant NBS-LRR defense genes directs the evolution of MicroRNAs that target them. Mol. Biol. Evol.33, 2692–2705. 10.1093/molbev/msw154
80
ZhuX.LengX.SunX.MuQ.WangB.LiX.et al. (2015). Discovery of conservation and diversification of genes by phylogenetic analysis based on global genomes. Plant Genome8, 1–11. 10.3835/plantgenome2014.10.0076
Summary
Keywords
Araucaria angustifolia, Araucariaceae, microRNAs, non-coding RNAs, transcriptome
Citation
Galdino JH, Eguiluz M, Guzman F and Margis R (2019) Novel and Conserved miRNAs Among Brazilian Pine and Other Gymnosperms. Front. Genet. 10:222. doi: 10.3389/fgene.2019.00222
Received
12 December 2018
Accepted
28 February 2019
Published
22 March 2019
Volume
10 - 2019
Edited by
Rosane Garcia Collevatti, Universidade Federal de Goiás, Brazil
Reviewed by
André Luis Wendt Dos Santos, University of São Paulo, Brazil; Evandro Novaes, Universidade Federal de Lavras, Brazil
Updates

Check for updates
Copyright
© 2019 Galdino, Eguiluz, Guzman and Margis.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Rogerio Margis rogerio.margis@ufrgs.br
This article was submitted to RNA, a section of the journal Frontiers in Genetics
Disclaimer
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.