Original Research ARTICLE
Analyses of Genomic tRNA Reveal Presence of Novel tRNAs in Oryza sativa
- Department of Biotechnology, Yeungnam University, Gyeongsan, South Korea
Transfer rRNAs are important molecules responsible for the translation event during protein synthesis. tRNAs are widespread found in unicellular to multi-cellular organisms. Analysis of tRNA gene family members in Oryza sativa revealed the presence of 750 tRNA genes distributed unevenly in different chromosomes. The length of O. sativa tRNAs genes were ranged from 66 to 91 nucleotides encoding 52 isoacceptor in total. tRNASer found in chromosome 8 of O. sativa encoded only 66 nucleotides which is the smallest tRNA of O. sativa and to our knowledge, this is the smallest gene of eukaryotic lineage reported so far. Analyses revealed the presence of several novel/pseudo tRNA genes in O. sativa which are reported for the first time. Multiple sequence alignment of tRNAs revealed the presence of family specific conserved consensus sequences. Functional study of these novel tRNA and family specific conserved consensus sequences will be crucial to decipher their importance in biological events. The rate of transition of O. sativa tRNA was found to be higher than the rate of transversion. Evolutionary study revealed, O. sativa tRNAs were evolved from the lineages of multiple common ancestors. Duplication and loss study of tRNAs genes revealed, majority of the O. sativa tRNA were duplicated and 17 of them were found to be undergone loss during the evolution. Orthology and paralogy study showed, the majority of O. sativa tRNA were paralogous and only a few of tRNASer were found to contain orthologous tRNAs.
Transfer RNAs (tRNA) are the most common ancestral short non-coding RNA present in the world and considered one of the single largest gene family in the genome of any species (Mallick et al., 2005; Michaud et al., 2011; Zuo et al., 2013). These molecules can be traced back to the RNA world before the separation of the three major domains of life (Eigen et al., 1989) demonstrating that, all tRNA genes are homologs and derived from an ancestral proto tRNA (Eigen and Winkler-Oswatitsch, 1981). The abundance of tRNA constitutes about 4–10% of all the cellular RNA (Kirchner and Ignatova, 2014). The role of tRNA in life is very important as it forms the interface between the genetic code and amino cids and acts as an adaptor molecule in the protein translation machinery. Additionally, the tRNA is a multifunctional molecule present in the cytoplasm, chloroplasts, and mitochondria (Mallick et al., 2005; Michaud et al., 2011). It has been reported that the tRNA molecules are ~70–100 nucleotides long and form a “clover leaf” like secondary structure and an L-shaped tertiary structure (Torres et al., 2014). The clover like secondary structure is characterized by the presence of an acceptor stem, D-stem, D-loop, anti-codon stem, anti-codon loop, variable loop, TΨC-stem, and TΨC-loop. The important functional part of tRNA is the anti-codon triplet and 3′-CCA nucleotides. The anti-codon triplet reads the messenger RNA (mRNA) codons and an amino acid cognate to tRNA attached to the 3′-CCA nucleotides with the help of aminoacyl tRNA synthetase during the translation events (Goodenbour and Pan, 2006). The aminoacyl tRNA synthetase discriminate its cognate tRNA by the presence of the anti-codon loop and a discriminator base at position 73 (N73) located before the 3′-CCA tail (Kirchner and Ignatova, 2014). tRNA are also involved in the tRNA derived RNA silencing pathway and a major source of short interspersed nuclear elements (SINE) (Bermudez-Santana et al., 2010; Phizicky and Hopper, 2010). During the evolution of tRNA, full and functional tRNA genes were formed as a result of several mutation, duplication, and re-organization events. Although, several basic studies been conducted to investigate the tRNA of lower and higher eukaryotes, only a very little information is available regarding in detailed study of plant tRNAs. The basic details regarding the tRNA biology of plant is very scanty. Therefore, in this study we have investigated the complete genomic details of Oryza sativa tRNA and reported here.
Results and Discussion
Oryza sativa tRNAs Contain 52 Anti-codons
The genome of any given species encodes genes for three major types of RNA, which includes mRNA (messenger RNA), rRNA (ribosomal RNA), and tRNA (transfer RNA). tRNA plays crucial role during the protein synthesis process. Overall, there are 20 different types of essential amino acids present in the genome that forms the backbones of the proteins; hence there are 20 different types of tRNA genes present in the genome to ensure efficient protein synthesis. Genome-wide analysis showed the presence of 750 tRNA in O. sativa that encoded 20 standard amino acids (Table 1). In addition to the presence of standard tRNA, the genome of O. sativa also encoded 26 pseudo tRNA genes, but no suppressor tRNA was found. The nuclear genome contains multiple tRNA genes encoding an isodecoder tRNA to allow sufficient supply of the tRNA. Earlier, Michaud et al. (2011) reported that, O. sativa genome encodes 723 tRNA in total including true tRNA, organellar tRNA and pseudo tRNA (Michaud et al., 2011). The tRNA gene population is very diverse in plants like O. sativa relative to the archaea and bacteria. Analyses showed O. sativa tRNA contain 52 different anti-codons/isoacceptors in 750 tRNA, with tRNASer having the highest abundance of isoacceptors (GCT, TGA, AGA, CGA, and GGA; Table 1). Overall, in total there were 59 tRNAArg and 60 tRNAMet genes were found in O. sativa. The tRNAMet contains only a single isoacceptor CAT, while tRNAArg contains five different isoacceptors (ACG, TCT, CCT, CCG, and TCG). The abundance of tRNACys and tRNATrp genes in O. sativa were found to be the lowest, accounting for only 18 tRNA genes in each. The tRNATrp contains only one isoacceptor (CCA), while tRNACys encodes two isoacceptors (GCA and ACA). The universal genetic code is degenerate where 20 amino acids are encoded by 61 triplet codons (Agris et al., 2007). Therefore, 61 different triplet codes are usually required for 61 different anti-codons/isoacceptors. However, O. sativa encodes only 52 isoacceptors and codon degeneracy, codon usage and wobble base pairing might playing significant role in harboring all the codons in the genome of O. sativa. O. sativa also lacked of a suppressor (CTA and TTA) and selenocysteine (TCA) tRNA. The amino acid selenocysteine is not present universally in all the organisms because of its toxic nature, which might be the reason for its absence in the genome of O. sativa. The suppressor tRNAs are mutant form of tRNAs which allow to insert a suitable amino acid at a mutant site in the protein encoding genes and suppresses the phenotypic effect of coding mutation. The suppressor tRNA can affect the production functional cellular protein. Therefore, they are found very rarely in the cells to allow predominance of normal translational event in cell. This might be the reason, why O. sativa genome do not have suppressor tRNA. tRNA isoacceptor ACC (glycine), GGG (proline), AAA (phenylalanine), ATT(asparagine), ATG (histidine), ACT (serine), GCG (leucine), GAG (arginine), and GAT (isoleucine were found to be absent from the genome O. sativa.
Table 1. Detail of O. sativa tRNA and anti-codon nucleotide sequences. O. sativa encodes maximum of 62 tRNASer in its genome with five different anti-codons.
tRNAs Are Distributed Unevenly in Different Chromosomes
It is important to understand how tRNAs genes are distributed among different chromosome and whether similar iso-acceptors are clustered in the same region of a chromosome. The results of the present study revealed that tRNAs were unevenly distributed in the chromosomes and that a few tRNAs with similar anti-codon were found in a single cluster. Overall, eight tRNAGln with TTG anti-codon were found in a cluster in the chromosome 2. Additionally, tRNAAsp with GTC anti-codon was found to be in a cluster in the chromosome 5. Other tRNA molecules found in clusters were tRNAVal (AAC), tRNALys (CTT), tRNAPro (AGG), and tRNAThr (TGT) in chromosome 3, 8, 11, and 12, respectively. The distribution of tRNAs in different chromosome showed that chromosome 1 has encoded maximum number of tRNAs (Table 2) with an average abundance of 4.8 genes for each tRNA. The second highest number of tRNA was encoded by chromosome 4, with an average abundance of 4.76 genes for each tRNA. Similarly, the lowest number of tRNA genes was encoded by chromosome 11, with an average richness of 1.61 genes for each tRNA. The richness of tRNAArg, tRNAMet, and tRNASer was higher relative to other tRNAs (Table 1). The result showed, the maximum numbers of tRNAArg genes (11) were encoded in chromosome 3 (Table 2). Similarly, chromosome 12 encoded higher numbers of tRNAMet (11) while chromosome 2 encoded higher numbers of tRNASer (11). Although, different tRNAs were encoded by different chromosomes, chromosome 6 did not encode tRNAAsn, tRNACys, tRNATyr, tRNAIle, and tRNAArg (Table 2). Similarly, chromosome 9 did not encode tRNAGln, tRNAMet, and tRNATrp tRNA whereas chromosome 11 did not encode tRNACys, tRNAGln, tRNAThr, tRNATyr, tRNAPhe, and tRNATrp. However, the exact reasons for the absence of a specific tRNA gene from a particular chromosome are difficult to delineate. Particularly, it is not clear why certain chromosomes did not encoded all of the required tRNA and more specifically, why chromosome 11 lacked genes six tRNA genes family. The genes of common evolutionary origin and similar function often clustered together. This might be the reason why tRNAs were found in cluster in chromosome 2, 3, 5, 8, 11, and 12.
The length of O. sativa tRNAs Varied from 66 to 91 Nucleotides
The 20 different types of tRNAs encode for 61 unique codons in the genome. Therefore, it can be speculated that, the length and composition of different tRNAs will also vary significantly. Different researchers have reported varied findings regarding the length of tRNAs. Kirchner and Ignatova (Kirchner and Ignatova, 2014) reported that tRNAs were of 73–90 nucleotides length, while Goodenbour and Pan (2006) reported that tRNAs were of 75–95 nucleotides length. Furthermore, Schimmel and Tamura (2003) reported that, tRNAs were of 76 nucleotides length (Schimmel and Tamura, 2003), similar to Mallick et al. (2005). However, majority of these studies were based on tRNA of animal/bacterial species, while little information is available regarding the plant tRNA. In this study, we found that the O. sativa tRNAs varied from 66 to 91 nucleotides (Table 3). One gene of tRNASer (chr8.tRNA1) encoded only 66 nucleotides whereas tRNAMet (chr3.tRNA35) encoded 91 nucleotides (Table 3). Moreover, three tRNAMet genes encoded 68 nucleotides and one tRNASer gene encoded 69 nucleotides. To the best of our knowledge, tRNASer of chromosome 8 having 66 nucleotides is the smallest gene in eukaryotic kingdom reported so far. With the exception of a few tRNA, the majority tRNA of O. sativa reside within 72–74 nucleotides length (Figure 1, Table 3). Additionally, all of the tRNAGln were found to be 72 nucleotides, whereas all the tRNAIle were found to be 74 nucleotides. Unlike tRNAGln and tRNAIle, all of the tRNAAla and tRNAPhe were found to be 73 nucleotides. Moreover, all the tRNATyr and tRNALeu were found to be 80 nucleotides or more. None of a single tRNATyr and tRNAIle was found to be below 80 nucleotides in length. The presences of varied number of nucleotides found in tRNA are most probably necessary for their interaction with the cognate aminoacyl-tRNA synthetase for proper function. This suggests that amino-acyl tRNA synthetases of the respective tRNA are structurally different and hence it requires different nucleotide composition for each of them for their proper interactions.
Figure 1. Correlation-regression analysis of number of nucleotides vs. total number of tRNAs in O. sativa. Analysis shows, majority of tRNA contains 70–75 nucleotides.
O. sativa tRNA Contain Introns
Introns are well-known entities of the genome that interrupts the continuity of many eukaryotic genes. Introns are reportedly present in the tRNA of all three kingdoms of life (Abelson et al., 1998; Yoshihisa, 2014). More specifically, the eukaryotic and the archaeal genomes are reported to be good sources of tRNA introns. We also found that, O. sativa tRNAs contain introns. In total, 34 (4.53%) tRNAs of O. sativa were found to contain introns (Table 4). Michaud et al. (2011) reported that the O. sativa genome possessed only 27 introns containing tRNA (Michaud et al., 2011). However, in our study we found that 34 tRNA genes of O. sativa contained introns. From a total of 60 tRNAMet, 15 (25%) tRNA genes contained introns, which was the highest number of intron possessed by a tRNA family in O. sativa. Unlike tRNAMet, 14 (63.63%) of the 22 tRNATyr genes were found to contain introns. In addition to tRNAMet and tRNATyr, two tRNACys, one tRNASer, and one tRNAThr were found to contain intron. Early study reported that only two tRNA family members tRNAMet and tRNATyr contained introns in higher plants (Michaud et al., 2011). However, in our study we found that RNAAla, tRNACys, tRNASer, and tRNAThr also contain intron. It was previously believed that most of the intron containing tRNAs in archaea and eukaryotes possessed intron at a canonical position, one nucleotide 3′ to the anti-codon loop. Our study also shows the presence of introns at a canonical position 3′ to the anti-codon loop. However, a few intron containing tRNA genes were shown to have very unusual structures (Figure 2). Several of the intron containing tRNAs showed hairpin like structures in either the acceptor arm, D-arm, anti-codon arm, or T-arm. However, intron was present at the anti-codon loop in the majority of cases (Figure 2). Whiles a few tRNA contained one hairpin loop, other contained more than one hairpin loop. Tocchini-Valentini et al. (2009) reported that ~75% of all tRNA introns were found at the canonical position (37/38 position) and about 25% of tRNA introns were located at noncannonical positions including the anti-codon loop, anti-codon stem, variable loop, D-and T-arms, and acceptor stem (Tocchini-Valentini et al., 2009). The presence of unusual and hairpin like bulged nucleotide sequences in different parts of tRNA represents the characteristics structure of split tRNA (Sugahara et al., 2006). The presence of several bulge like helix-loop tRNA structure reflect the presence of split tRNA in O. sativa. The tRNAAla (chr1.tRNA34) contained only one intron; however, it was the largest intron containing tRNA. Moreover, it possessed 125 nucleotides, making it larger than the tRNA itself, while tRNASer (chr12.tRNA22 and chr9.tRNA4) was the smallest intron containing tRNA that contained only eight nucleotides. The intron of tRNAMet was around 17 nucleotides length and contained a conserved G-C-T-A and G-A-G-T nucleotide consensus sequences (Figure 3). The G-C-T-A conserved consensus sequence was present at the 5′ end of the intron and the G-A-G-T consensus sequence was found to present one nucleotide up-stream to the 3′ end of the intron (Figure 3). The G-A-G-T nucleotides are responsible for base pairing that needed to maintain the proper structure of pre-tRNAMet which is pre-requisite for intron splicing (Akama et al., 2000). The presence of intron 3′ to the cannonical anti-codon position is usually found in plastidal tRNA which is self-splicing group I introns (Tocchini-Valentini et al., 2009). These introns are very long and sometimes longer than the sequence of tRNA itself (Sugahara et al., 2006; Tocchini-Valentini et al., 2009). The intron of tRNATyr was around 15 nucleotides length and contained a conserved 5′-x-T-x3-T-G-x2-G-3′ nucleotide sequence (Figure 3). The tRNASer was found to contain conserved 5′-A-A-x-C-x3-A-3′ sequences in its intron. These conserved structural features of introns are important to in-vitro intron splicing (Stange et al., 1992; Akama et al., 2000). Mutation in the cytosine residue at either the 5′ or 3′ end of the intron border leads to deleterious effects on maturation of tRNA and involvement of the D-stem and variable loop (Stange et al., 1992; Akama et al., 1997, 2000). The intron sequence of pre-tRNAMet was conserved more than the intron sequence of pre-tRNATyr. The introns of pre-tRNASer genes were found to contain similar sequences. In tRNATyr, only T was found to be conserved at the first or second position, T-G at the fifth and sixth position and G nucleotide at 0–4 nucleotides upstream of the 3′ end of the intron (Figure 3). The T residue of the intron at the first or second position is most important for formation of pseudouridine at the second position of the anti-codon loop (Akama et al., 1997). The intron in tRNAAla was found in between position 39 and 40 while the position of the intron in tRNACys, tRNAMet, and tRNATyr was present in between nucleotides 38 and 39. The intron of tRNAThr was found between nucleotides 36 and 37, while the intron of tRNASer was found in between 37 and 38 nucleotides. It was previously believed that only tRNAMet and tRNATyr contained introns (Michaud et al., 2011). However, the results of the present study demonstrated that tRNAThr, tRNAAla, tRNACys, and tRNASer also contain introns (Table 4). This is a quite novel finding and further studies of other plant species may lead to identification of additional intron containing tRNA in the plant kingdom.
Figure 2. General structures of intron containing tRNAs. Earlier it was reported that only tRNAMet and tRNATyr contains introns in higher plants. However, we found that (H) tRNATyr; (G) tRNASer, (A) tRNACys, (F) tRNAThr, and (B) tRNAAla contain introns in additional to (C–E) tRNAMet. The introns in tRNA are usually found in the anti-codon loop. Adenine nucleotide is the predominant nucleotide before the start site of introns in all tRNA.
Figure 3. Conserved consensus sequence of intron of O. sativa tRNA. tRNAMet contains G-C-T-A and G-A-G-T, tRNATyr contains x-T-x3-T-G-x2-G and tRNASer contains A-A-x-C-x3-A conserved consensus sequence.
O. sativa tRNA Are Varied in Their Structure
It was recently reported that, the nuclear encoded eukaryotic tRNA and bacterial tRNA that play important roles in translation have a length of 73–90 nucleotides (Kirchner and Ignatova, 2014). The acceptor arm is seven bp long, the D-stem ranges from 3 to 4 bp, and the anti-codon stem is of 5 bp (Kirchner and Ignatova, 2014). The D-loop occasionally varied from 4 to 12 nucleotides. The anti-codon loop is always numbered from 34 to 36, whereas the variable region always starts at residue 44 and is 4–23 nucleotides long (Kirchner and Ignatova, 2014). The Ψ-loop possesses seven nucleotides and the Ψ-stem possesses five nucleotides. The CCA tail is always numbered 74–76 (Kirchner and Ignatova, 2014). However, in our study we found several variations in the tRNA of O. sativa. The majority of the acceptor arm was found to contain seven nucleotides whereas; a few were found to contain less than seven nucleotides. tRNAMet22 (Chr10), tRNAMet57 (Chr10), and tRNAMet96 (Chr4) were found to contain three nucleotides whereas tRNAArg92 (Chr1), and tRNASer20 (Chr2) were found to contain four nucleotides and tRNAMet21 (Chr12), tRNAAsn43 (Chr8), and tRNASer27 (Chr3) were found to contain six nucleotides in the acceptor arm (Supplementary Table 1). The D-stem usually contains 3–4 nucleotides. In our study we found that, a total of 501 tRNA genes were contained four nucleotides (66.97%) and 234 contained three nucleotides (31.28%) in the D-stem. All of the tRNALys, tRNAAsp, and tRNATrp were found to contain four nucleotides in the D-stem whereas all other tRNAs were found to contain either three or four nucleotides. Only one tRNALeu gene was found to contain four nucleotides in the D-stem. Additionally, one gene from each of the tRNAAla, tRNAAsn, tRNAGlu, tRNAIle, tRNAPhe, and tRNAThr were found to contain three nucleotides in the D-stem, whereas all of the tRNALeu were found to contain three nucleotides in the D-stem. Few tRNAs were found to contain less than three nucleotides in the D-stem. tRNALeu34 (Chr9), tRNALeu21 (Chr11), tRNALeu38 (Chr9), tRNAPro71 (Chr12), tRNATyr49 (Chr8), tRNASer27 (Chr3), tRNASer1 (Chr8), tRNATyr34 (Chr1), and tRNAArg31 (Chr5) were found to contain only two nucleotides in the D-stem. The D-stem was found to be absent in tRNASer27 (Chr3) and tRNASer1 (Chr8) (Supplementary Table 1). The D-stem helps to stabilize the tertiary structure of the tRNA and the presence of more nucleotides make the tRNA more stable compared to those which contained less nucleotide. Therefore, the majorities of tRNAs were found to possess four nucleotides in the D-stem.
Previously it was reported that the D-loop contain 4–12 nucleotides (Kirchner and Ignatova, 2014); however, in the present study the D-loop was found to contain 0–15 nucleotides (Supplementary Table 1). tRNAGlu11 (Chr7) was found to contain15 nucleotides, whereas tRNALys7 (chr12), tRNATyr34 (chr1), and tRNATyr49 (chr8) were found to contain 13 nucleotides each in the D-loop (Supplementary Table 1). The majority of tRNA were found to contain seven (113), eight (227), or nine (212) nucleotides in the D-loop, whereas only tRNASer43 (chr7) was found to contain four nucleotides. The D-loop was found to be absent from tRNASer27 (chr3) and tRNASer1 (chr8). When the tRNA specific nucleotide composition in the D-loop was considered, tRNALeu was found to possess 10 (14), 11 (18), or 12 (23) nucleotides in the D-loop. Several tRNAHis were found to contain either 8 or 11 nucleotides; tRNAMet contained 7, 8, 9, 10, or 11 nucleotides; tRNAPro contained 7, 9, 10, or 11 nucleotides; tRNAVal contained 7, 10, or 11 nucleotides; tRNASer contained 8, 9, 10, or 11; tRNAArg contained 8, 9, or 10 nucleotides, tRNAGlu contained 8, 9, or 10 nucleotides, tRNAAsn contained 7 or 9 nucleotides, tRNAAsp contained 8 or 9 nucleotides, tRNACys contained 7 or 9 nucleotides, tRNAGln contained 8 or 9 nucleotides, tRNAGly contained 7, 8, or 9 nucleotides, tRNAThr contained 7, 8, or 9 nucleotides, tRNATrp contained 7 or 9 nucleotides, tRNATyr contained 8 or 9 nucleotides, tRNAAla and tRNALys contained 7 or 8 nucleotides, tRNAIle contained 9 nucleotides and tRNAPhe contained nucleotides in the D-loop. The absence of the D-arm and the D-loop tRNA reveal that, it is a novel form of tRNA and laboratory based experimental study of these tRNA will reveal their functional significance.
It is well-known that the anti-codon arm possesses five nucleotides; however we found a few variations in this region. Specifically, tRNAAla43 (chr1) and tRNATyr49 (chr8) were found to contain only three nucleotides in their anti-codon arm (Supplementary Table 2), while at least 22 genes of tRNAMet were found to contain four nucleotides in this region. Other tRNA those were found to contain four nucleotides in the anti-codon arm were tRNAPhe83 (chr1), tRNAGlu11 (chr7), tRNASer27 (chr3), tRNASer68 (chr1), and tRNASer1 (chr4). The anti-codon loop possess seven nucleotides as do the majority of O. sativa tRNA. However, tRNAArg17 (chr7) was found to contains eight nucleotides, tRNAGln30 (chr6) contained nine, tRNATyr49 (chr8) contained 12, and tRNASer68 (chr1) contained 13 nucleotides. These variations in the nucleotide composition in the anti-codon loop might be responsible for the coding of novel protein translation machinery that has yet to be elucidated.
The variable region of tRNA is very dynamic and possess 4–23 nucleotides (Kirchner and Ignatova, 2014). The variable region of the tRNA of O. sativa was found to contain 2–20 nucleotides. The majority of tRNA were found to contain five (457; 60.93%) nucleotides in the variable region. Moreover, approximately 138 tRNA were found to contain four nucleotides in the variable region that constitutes 18.44% of the total tRNA. Only 10 tRNA were found to contain six or seven nucleotides in the variable region (Supplementary Table 2). All tRNALeu were found to contains either 11 (33), 13 (13), or 15 (12) nucleotides whereas tRNASer was found to contain either 11 (2), 14 (33), 17 (1), 18 (6), 19 (14), or 20 (1) nucleotides in the variable region. The presence of variable number of nucleotides in the variable loop region of tRNALeu and tRNASer reflect, these tRNA are much diverse then the other tRNAs and addition of nucleotides might happened in the variable loop region. This also suggests that tRNALeu and tRNASer are also highly prone to the genetic mutation to incorporate the nucleotides in the variable region. Gaus et al. (1979) reported that type I tRNA contain four or five nucleotides in the variable loop, whereas type II tRNA contain more than five nucleotides (Gaus et al., 1979; Tocchini-Valentini et al., 2000). These results suggest that all tRNALeu and tRNASer of O. sativa are belonged to the type II tRNA. Only type II tRNA forms the stem-loop structure and these gross structural differences of tRNAs might playing a significant role in recognition of tRNA by minoacyl-tRNA synthateses. tRNA is characterized by its clover leaf like structure that contains a small stem-loop structure in the variable region. However, the majority of O. sativa tRNA do not contain the loop in the variable region. Additionally, the variable loop was found to be absent from all tRNAs those were found to contain <11 nucleotides. In our study we found that, only tRNALeu and tRNASer contained a loop in the variable region. There was only a few tRNATyr (seven) and tRNAVal (one) those were found to contain loop in the variable region (Supplementary Table 2).
It is well known fact that the Ψ-arm possess only five nucleotides; however, O. sativa tRNA were found to contain three to five nucleotides in the Ψ-arm. Although, the majority of tRNA were found to contain five nucleotides in the Ψ-arm, a few tRNA were found to contain either three or four nucleotides. Specifically, tRNAGly (7), tRNACys (2), tRNALeu (1), tRNALys (1), tRNAMet (1), tRNAAla (1), tRNAThr (1), and tRNAVal (1) were found to contain only four nucleotides in the Ψ-arm, whereas a tRNAMet was found to encode only three nucleotides. The Ψ-loop of tRNA reportedly contains seven nucleotides. Although, the Ψ-loop of the majority of O. sativa tRNA was found to possess seven nucleotides (Supplementary Table 2), a few of them were found to contain more than seven nucleotides. Specifically, tRNAGly36 (chr12), tRNALeu7 (chr2), and tRNAVal59 (chr5) were found to contain nine nucleotides, whereas tRNAVal45 (chr5) contained 11 nucleotides. The presence of such novel form of tRNA might have some novel function as well, and laboratory based experimental study can reveal the exact function of the presence of additional number of nucleotides in the Ψ-loop.
O. sativa tRNAs Are Conserved
The nucleotide residue at a specific position in the tRNA accounted for the specific tertiary folds and aminoacylation. Beside this, they also dictate to transit different sites in the ribosome. Protein synthesis cannot occur until and unless tRNA gets properly engaged with the ribosome. Therefore, understanding the specific nucleotide position in tRNA is very important and hence we analyzed the tRNA sequences to find their conserved regions. Analyses revealed that the acceptor arm (AM), D-arm (DA), D-loop (DL), anti-codon arm, anti-codon loop, variable region (VR), Ψ-arm, and Ψ-loop were found to contain conserved nucleotides at the specific position (Table 5). Specifically, tRNAArg, tRNAAsp, tRNACys, tRNAGly, tRNAHis, tRNAIle, tRNALeu, tRNALys, tRNAPhe, tRNASer, tRNAThr, tRNATrp, and tRNAAla were found to contain conserved 5′ G nucleotide at the first position in the acceptor arm (Supplementary Figure 1, Table 5). The tRNAs those were found to contain conserved U nucleotide at the first position of the acceptor arm were tRNAAsn, tRNAGlu, and tRNAGln. A few tRNAMet, tRNAPro, and tRNAVal were found to contain A nucleotide at the first position. The D-arm was found to contain a conserved 5′ G nucleotide at the first position, except for tRNATyr and tRNAPro. Few of the tRNAs those were found to contain conserved G-C nucleotides at the first and second position were tRNAAla, tRNAIle, tRNACys, tRNAAsn, tRNALys, and tRNAGlu, whereas tRNALeu, tRNAMet, and tRNAThr did not contain any conserved residues in the D-arm. The D-loop was found to be highly conserved and was found to contain an A-G/x-U/x consensus sequence. In the D-loop, nucleotide A was found to be conserved at the first position and it can be called as the identity element of the D-loop (Supplementary Figure 1, Table 5); however, tRNAPro and tRNAVal were found to contain conserved U instead of A nucleotide at the first position. tRNAMet and tRNASer were devoid of any conserved nucleotide in the D-loop. In the majority of tRNAs, the anti-codon arm was found to contain a G nucleotide at the first position. However, tRNAVal was found to contain a U nucleotide at the same position whereas tRNAAla, tRNAAsp, and tRNALeu were found to contain C nucleotide. In the anti-codon loop, a U nucleotide was found to be conserved at the first position in the majority of tRNAs. However, tRNALys, tRNACys, tRNATyr, and tRNAGlu were found to contain a C nucleotide in the first position in the anti-codon loop. The C-A-U-A nucleotides of tRNAMet were found to be conserved from 3rd to 6th position in the anti-codon loop. No conserved site was found for tRNASer in the anti-codon loop. The variable loop is considered very dynamic because of its non conserved and variable nucleotide composition. However, the variable loop was also found to be conserved in different tRNA when tRNA was studied as individual gene family (Supplementary Figure 1, Table 5). All tRNA were found to contain conserved nucleotide sequences at different position in the variable region except for tRNAGly, tRNAPhe, tRNAGln, and tRNAAsp (Supplementary Figure 1, Table 5). In the majority of cases, the G nucleotide was found to be conserved at different positions in different tRNA (Supplementary Figure 1, Table 5). No conserved nucleotides were found in the Ψ-arm of tRNAPro. The Ψ-loop was found to contain conserved U-U-C-x-A nucleotides in most of the tRNA. These nucleotides were conserved at the first, second, third and fifth position of the Ψ-loop. However, tRNAAla, tRNAMet, and tRNAHis were found to contain a conserved U-C-x-A consensus sequence and the pseudouridine residue was found to be conserved at the second position of the Ψ-loop (Table 5). Except for tRNASer, the opposite side of the Ψ-arm that present immediately after the Ψ-loop was found to contain a conserved C nucleotide at the first position. The nucleotide sequences of the acceptor arm were conserved at different positions. These conserved residues were resulted when the tRNA sequences were grouped according to their individual gene family. When all tRNA sequences were aligned, we did not find significant conserved residues, which was concordant with previously reported results. This might have been due to the engagement of different tRNA with different cognate enzymes during the event of protein synthesis. It was also a common concept that all tRNA contain conserved G nucleotides at the first position in the acceptor arm. However, the majority of tRNAs, but not all, were found to contain a G nucleotide at the first position (Supplementary Figure 1). The tRNAs those lacked conserved G nucleotide in the 5′ position were tRNAAsn, tRNAGln, tRNAGlu, tRNAMet, tRNAPro, tRNATyr, and tRNAVal (Supplementary Figure 1). The 3′CCA sequence from 74 to 76 nucleotides is added to all tRNA during the processing of tRNA. The 3′CCA sequence plays an important role in amino acid attachment sites for tRNA to participate to the protein synthesis. It is also play important role in transport of tRNA from the nucleus to the cytoplasm and acts as a checkpoint for tRNA maturity. The 3′CCA sequence is synthesized by a CCA-adding enzyme popularly known as tRNA nucleotidyl transferase (Shi et al., 1998; Vortler and Morl, 2010; Betat and Mörl, 2015). The attachment of CCA does not occur in few prokaryotes because they already contain the CCA tail in their genetic architecture and a few of them lack CCA tail adding enzyme. However, for species that does not have CCA tail, it adds the tail by CCA tail adding enzyme (Hou, 2010). A few species of archaea contain class I whereas bacteria and eukaryote have class II nucleotidyl transferase CCA tail adding enzyme (Yue et al., 1996). In Bacillus subtilis, the CCA tail adding enzyme adds the tail under certain kinetic conditions similar to in-vivo. However, this CCA tail adding happens under certain stress and DNA damage conditions and cannot imply direct implementation of the concepts to cells (Hou, 2010). In O. sativa tRNAAla and tRNAIle already contain the 3′CCA sequence in their tRNA gene (Supplementary Figure 1). When we compared a tRNAAla and tRNAIle of O. sativa with prokaryotic tRNAAla and tRNAIle in the tRNA scan server, we found a similar structure in both the cases (Figure 4). In prokaryotes, the enzyme that adds the 3′ CCA tail to the tRNA is absent; hence, they directly encode the 3′CCA sequence in the tRNA gene. However, unlike prokaryotes, plants encode enzymes that add a 3′CCA tail and still contain a CCA sequence in tRNA genes. These findings indicate that, although tRNA undergoes processing to add the 3′CCA tail, a few tRNA already possess it. However, most of the tRNA does not have nucleotides for 3′CCA tail. tRNAAla and tRNAIle are classic examples of this novel phenomenon and contain embedded 3′CCA nucleotide sequence. It is also possible that, O. sativa tRNAAla and tRNAIle are the primitive forms of tRNA that evolved from their common ancestor of prokaryotic lineages and later diversified.
Figure 4. Similarity in the 3′CCA tail of O. sativa (A) tRNAAla and (B) tRNAIle with prokaryotic (Calothrix desertica) (C) tRNAAla and (D) tRNAIle. The tRNA gene sequence in both the organisms contains the 3′CCA tail sequence. Prokaryotic organisms usually lack of nucleotidyl transferase enzyme, which adds the 3′CCA into the tRNA, while eukaryotic organisms contain this enzyme. However, the presence of 3′CCA sequence in the O. sativa tRNAAla and tRNAIle gene shows that these tRNA might not require undergoing processing for addition of the 3′CCA tail. The presence of a 3′CCA sequence in O. sativa tRNAAla and tRNAIle indicates that O. sativa tRNAs have evolved from common prokaryotic ancestors. CdtRNA; Calothrix desertica tRNA, OstRNA; Oryza sativa tRNA.
O. sativa Encodes Novel/Pseudo tRNA
Modification of nucleotide bases in tRNA are seen as an integral part of its novelty. We found several modifications in the tRNA of O. sativa and named them accordingly. Type I tRNA was found to contain seven nucleotides in the acceptor arm (Figure 5A). Additionally, the D-arm and the D-loop were found to be absent from type I tRNA (tRNASer1, chr.8) and was found to contain 12 nucleotides in between the acceptor arm and the anti-codon arm. The anti-codon arm and the anti-codon loop were found to contain five and seven nucleotides, respectively while the variable region was found to contain five nucleotides. The Ψ-arm and the Ψ-loop were found to contain five and seven nucleotides, respectively. Type II tRNA was found to possess six nucleotides in the acceptor arm (Figure 5B). The D-arm and the D-loop were found to be absent from type II tRNA, whereas it was found to contain 17 nucleotides in between the acceptor arm and the anti-codon arm (tRNASer27, chr.3) (Figure 5B). The anti-codon loop of type II tRNA was found to contain a modified inosine base at the second position. In type III tRNA, the acceptor arm was almost absent and instead it was found to contain a hairpin loop-like bulge structure (tRNASer20, chr.2) of having four nucleotides (Figure 5C). There was presence of only four nucleotides from the acceptor arm to the D-arm. The D-arm was found to contain a loop-like bulge having only three nucleotides. The D-loop was found to contain 10 nucleotides with the presence of pseudouridine bases at 8th and 9th position. The anti-codon arm, the anti-codon loop, the Ψ-arm, and the Ψ-loop of type III tRNA were found to be similar to that of the canonical tRNA. Type IV tRNA (tRNAGlu11, chr.7) was found to contain the canonical tRNA form, except for the D-loop (Figure 5D). The D-loop of type IV tRNA was found to contain 15 nucleotides with most probably modified bases. The type V tRNA (tRNAUndet16, chr.2) was found to contain a hairpin loop-like bulge in the acceptor arm, and the D-arm where the D-loop was found to contain 11 nucleotides (Figure 5E). The anti-codon arm was found to contain only four nucleotides whereas the anti-codon loop was found to contain only six nucleotides which was one nucleotide lesser than the usual canonical tRNA. The type VI (tRNATyr49, chr.8) tRNA was found to contain a hairpin loop-like bulge structure in the acceptor arm and was found to contain only two nucleotides in the D-arm (Figure 5F). The D-loop was found to contain 13 nucleotides whereas the anti-codon arm was found to contain only three nucleotides. The anti-codon loop was found to contain 12 nucleotides which was much higher than the canonical tRNA. Type VII (tRNAVal30, chr.9) tRNA was found to contain a hairpin loop-like bulge in the acceptor arm and three nucleotides in the D-arm (Figure 5G). The D-loop of type VII tRNA was found to contain 11 nucleotides, whereas the anti-codon arm and the anti-codon loop were found to contain five and seven nucleotides, respectively. The Ψ-arm was found to contain two hairpin loop-like structures attached to the Ψ-loop. Type VIII (tRNALys18, chr.10) tRNA showed similarities with the usual canonical tRNA, except for the Ψ-arm (Figure 5H). Here, one side of the Ψ-arm was found to contain five nucleotides, whereas the other side contained eight nucleotides. Type IX (tRNAMet22, chr.10) tRNA was found to contain a shorter acceptor arm having three nucleotides on one side and six on the other side (Figure 5I). With the exception of the acceptor arm, the other three arms of tRNA were found to contain a hairpin loop-like bulge (Figure 5I). The shorter acceptor arm contained only three nucleotides on one side and six nucleotides on the other side. The D-arm was found to contain only three nucleotides and forms a hairpin loop-like structure. Similarly, the anti-codon arm and Ψ-arm was found to contain a hairpin loop-like structure. Type X (tRNAMet21, chr.12) tRNA was found to contain a hairpin loop-like bulge in the acceptor arm and the Ψ-arm, whereas other parts of the tRNA was found to be similar to the canonical tRNA (Figure 5J). Type XI (tRNAUndet46, chr.10) tRNA was found to contain a hairpin loop-like bulge in the acceptor arm and D-arm (Figure 5K). The anti-codon arm contained only four nucleotides and the anti-codon loop contained only six nucleotides. The Ψ-arm was found to contain only three nucleotides instead of five nucleotides. Type XII (tRNAMet55, chr.1) tRNA was found to contain a hairpin loop-like bulge in the acceptor arm and only four nucleotides in the anti-codon arm (Figure 5L). The anti-codon loop contains introns and the Ψ- arm contained a hairpin loop-like bulged structure.
Figure 5. Novel tRNAs of O. sativa. Analysis through tRNA scan revealed the presence of 14 novel forms of tRNA (denoted type A to L). (A) D arm and D-loop was found to be absent, (B) D arm was found to be absent and it contained 17 nucleotides in between the acceptor arm and the anti-codon arm, (C) presence of loop like acceptor arm, (D) D-loop contained 15 nucleotides with modified bases, (E) bulged acceptor and D-arm, (F) bulged acceptor arm and the D-arm containd only two nucleotides (G) bulged acceptor arm and only three nucleotides in the D-arm, (H) one side of Ψ-arm contained five nucleotides whereas other side contianed eight, (I) shorter acceptor arm having three nucleotides in one side whereas six in other side, (J) bulged acceptor and Ψ-arm (K) bulged acceptor arm and D-arm, and (L) bulged acceptor arm and only four nucleotide in the anti-codon arm. These novel tRNA are might be responsible for novel functional roles in O. sativa.
Different organisms have evolved different method for processing and maturation of tRNA at every step; therefore, some are might be utilizing novel mechanisms. This study revealed the presence of multiple novel forms of tRNA in O. sativa. These novel types of tRNA might have some novel function and processing pathways which are yet to be elucidated. Indeed, pseudo tRNA was found to mediate antibiotic resistance in Bacillus cereus (Rogers et al., 2012); therefore, the role of annotated pseudo tRNA cannot be underestimated and additional study is required to elucidate their exact function and mechanisms.
O. sativa tRNA were Evolved from Their Multiple Common Ancestors
A phylogenetic tree was constructed using the maximum likelihood method to infer the evolutionary relationship of O. sativa tRNA (Figure 6). The tRNAs of the prokaryotic organism Anabaena cylindrica was used as a probable common ancestor of O. sativa tRNA. The phylogenetic tree showed the presence of at least 44 different groups from 20 different types of tRNA (Figure 6). Some tRNA were found to form one group, while a few were found to form more than one group in the phylogenetic tree at different phylogenetic distances. The tRNAs those were found to form only one group in the phylogenetic tree were tRNAPhe, tRNAAla, and tRNASer whereas those clustered into two groups were tRNAGlu, tRNAAsp, tRNAGly, tRNAHis, tRNAPro, tRNAThr, tRNAAsn, tRNATyr, tRNALys, tRNATrp, tRNACys, tRNAIle, tRNAVal, and tRNAGln (Figure 6). Moreover, tRNALeu and tRNAMet were clustered into four groups whereas tRNAArg was clustered into five groups. Multiple grouping of different tRNA in the phylogenetic tree reflected their evolution from multiple common ancestors. At least 16 tRNA families of O. sativa evolved were from their common ancestors of two or more lineages. tRNAPhe, tRNAAla, and tRNASer of O. sativa were most probably descended from a single lineage, whereas tRNAMet might have descended from at least four and tRNAArgmight have descended from five lineages. To support the evolution of O. sativa tRNA from multi lineages, we studied the orthology and paralogy relationship. We found that all of the O. sativa tRNA are paralogous to each other. O. sativa was found to contain only twelve orthologous tRNA which were restricted to tRNASer only. These orthologous tRNAs were tRNASer15 (chr2), tRNASer26 (chr2), tRNASer31 (chr2), tRNASer6 (chr4), tRNASer5 (chr4), tRNASer83 (chr4), tRNASer89 (chr4), tRNASer10 (chr10), tRNASer28 (chr10), tRNASer36 (chr10), tRNASer62 (chr10), and tRNASer25 (ch11). The absence of orthologous genes further reflected their evolution from multiple common ancestors. Similarly, an orthology study with tRNA of A. cylindrica showed orthologous evolution of 15 O. sativa tRNA which included tRNAPhe, tRNAAsn, tRNATrp, tRNALys, tRNAArg, tRNAIle, tRNAAsp, tRNAPro, tRNAMet, tRNAGlu, tRNAGln, tRNATyr, tRNASer, tRNAHis, and tRNAGly. tRNA tRNAAla, tRNAThr, tRNAVal, tRNAArg, and tRNALeu of O. sativa had no orthologous ancestor tRNA in common with A. cylindrica. However, a few modified orthologous tRNA of A. cylindrica were present in the O. sativa. The orthologous tRNAPhe, tRNAThr, tRNAArgÃĆÂý tRNAAsp, tRNAThr, tRNATrp, tRNATyr, and tRNAMet of A. cylindrica were found to be modified and present as substitute where they were substituted as follows: tRNAPhe > tRNAThr, tRNAThr > tRNAMet, tRNAArg > tRNAMet, tRNAAsp > tRNAVal, tRNATyr > tRNALeu, tRNAMet> tRNAGly. These finding indicated that several ancestral orthologous tRNA of one form evolved and have modified to another form, which might have occurred in response to several instances of evolutionary pressure. We find these novel modification of O. sativa tRNA and this might be due to fact that, as tRNAs are conserved in different regions, but they are significantly varied in the anti-codon region. This modification makes them as distinct tRNA. However, modification of one form of orthologous tRNA (e.g., modification of A. cylindrica tRNAPhe to tRNAThr or tRNAThr to tRNAMet, etc.) of one species to another form in different species is of particular interest. Therefore, we studied gene duplication and loss event of O. sativa and A. cylindrica tRNA. After reconciliation of O. sativa and A. cylindrica species trees we found that 778 genes were duplicated (Figure 7) whereas only 50 genes were found to be lost (Figure 8). There was no conditional duplication in O. sativa tRNA. Only a few tRNAs of O.sativa were found to be lost during the evolution process, suggesting the duplication event has favored the evolution of O. sativa tRNA. Overall, we found that at least 17 O. sativa and 34 A. cylindrica tRNA were lost during the evolution.
Figure 6. Phylogenetic tree of O. sativa tRNAs was constructed using the tRNA genes of O. sativa. tRNA of A. cylindrica was used as probable prokaryotic common ancestor. Phylogenetic tree was constructed used using MEGA6 software.
Figure 7. Duplication of O. sativa tRNAs study revealed the presence of 778 duplicated nodes. Duplication study was conducted using Notung 2.6 software.
Figure 8. Loss of O. sativa tRNAs study revealed 17 O. sativa tRNA were lost the during evolution. tRNA loss was conducted using Notung 2.6 software.
Transition/Transversion of O. sativa tRNA
In 1962 Zuckerkandl and Pauling proposed molecular clock hypothesis which indicated the rate of molecular evolution is approximately constant over time for all proteins in all lineages (Zuckerkandl and Pauling, 1962). Based on this theory, any time of divergence between proteins, genes or lineages can be dated by measuring the number of changes between the sequences. Later, Holmquist, Jukes, and Cantor proposed a stochastic model for substitution of DNA in which the nucleotide substitution occurs at an equal rate, and whenever a nucleotide is substituted, any one of the other nucleotides replaces the substitution (Holmquist et al., 1972). Substitutions that change a purine nucleotide to another purine (A to G or G to A) or pyrimidine to another pyrimidine (C to U/T or U/T to C) are called transitions whereas those changes a purine nucleotide to a pyrimidine (A or G to U/T or C) or vice versa (U/T or C to A or G) are called transversions. The rate of mutation/substitution varies among and within the genome and affected by several factors including nearest neighbor bases, chromosomal position, and efficiency of repair system between the leading and lagging DNA strands. During the repair process, the replacement of a base is facilitated by a similar one or a derivative of a similar one. As a result, transition occurs more persistently (sometimes twice as frequently) than transversion. The differences in substitution rates tend to decrease TA and CG dimers and produce excess CT and TG dimers. Substitution analysis of O. sativa tRNA revealed, the rate of transition from A to G was found to be highest for tRNAAsp (62.18) and lowest for tRNAAsn (0.31) (Table 6). Similarly, the transition from G to A was found to be highest in tRNAAsp (27.52) and lowest in tRNAAsn (0.18). The rate of transition from U to C was found to be highest in tRNAGln (36.44) and lowest in tRNAThr (6.25). Similarly, the transition from C to U was found to be highest for tRNAGln (40.1) and lowest for tRNAThr (7.2). Based on these findings, it was clear that the substitution rate of A to G was highest and that of G to A was lowest (Table 6). Unlike transition, the transversion rate from A to C/U or vice versa was found to be maximum for tRNATyr (6.02) and minimum for tRNAAsp (0.62). The transversion rate from U to A/G or vice versa was maximum for tRNATyr (7.24) and minimum for tRNAAsp (0.93). The transversion rate from C to A/G or vice versa was maximum for tRNAGly and minimum for tRNAAsp. The transversion from G to U/C or vice versa was found to be maximum for tRNAIle (8.35) and minimum for tRNAAsp (1.39). Overall, the result clearly indicated that the highest or lowest transition/transversion rate was concentrated to only few tRNA. The maximum and minimum transition/transversion rate in either of the nucleotide was mostly concentrated in tRNAAsp. The tRNA those have the highest and lowest transition/transversion rates were tRNAAsp, tRNAThr, tRNAGly, tRNAHis, tRNAAsn, tRNAIle, and tRNAGln. All of them belonged to two evolutionary groups in the phylogenetic tree. These finding indicated that tRNA are more prone to substitution for evolution of novel tRNA, which might explain why several orthologous tRNA of A. cylindrica have modified orthologous tRNA in O. sativa. When we studied the transition/transversion bias, we found it was highest in tRNAAsp (R = 9.473) and lowest in tRNATyr (R = 0.684) (Table 7). When R = 0, there is no bias toward either transition/transversion where bias R = [A*G*k1 + T*C*k2]/[(A+G)*(T+C)] and k1 and k2 indicating the purine and pyrimidine, respectively. The transition/transversion bias is highly correlated with the highest and lowest substitution of tRNAAsp. As all pairwise distances in a distance matrix are correlated because of phylogenetic relationships among the sequences, the sum of their log-likelihoods is a composite likelihood. The likelihood ratio observed on the present study suggests that the transition/transversion ratio might be variable among evolutionary lineages; however, an earlier report suggests that the rate of transition in DNA is higher than the rate of transversion. Our study in O. sativa tRNA also showed that the rate of transition is higher than the rate of transversion.
Materials and Methods
O. sativa tRNA were downloaded from the genomic tRNA database GtRNAdb (http://gtrnadb.ucsc.edu/). This species was selected for study because it is one of the most studied model crop plant and has a well annotated genome and tRNA. Downloaded tRNA sequences were scanned in stable version tRNAscan-SE 1.23 and 1.3 to check for the presence of a clover leaf structure (Lowe and Eddy, 1997). Default parameters were used to run the sequences in tRNAscan-SE 1.23 and 1.3.
Multiple Sequence Alignment
We conducted multiple sequence alignment of all tRNA family individually and collectively as well to identify the shared and conserved sequence homology. Multalin server was used to conduct the multiple sequence alignment (http://multalin.toulouse.inra.fr/multalin/). The statistical parameters used to run the multiple sequence alignments were described previously (Mohanta et al., 2014, 2015b; Mohanta and Bae, 2015), with minor modification. In short they are as follows: sequence input format, fasta; nucleotide weight matrix. Blosum62-12-2; gap penalty at opening, default; gap penalty at extension, default; gap penalties at extremities, none; one iteration only, no; high consensus value, 90% and low consensus value, 50%.
Construction of Phylogenetic Tree
A phylogenetic tree infers the evolutionary relationship between different biological species or sequences. Therefore, we constructed a phylogenetic tree of O. sativa tRNA by considering the tRNA of A. cylindrica as a probable common ancestor. Therefore, the tRNA of O. sativa and A. cylindrica was subjected to clustal omega server to generate clustal files for all individual tRNA, as well separately to understand their evolution independently. The generated clustal files were later converted to MEGA file format using the MEGA6 software (Tamura et al., 2013). The phylogenetic tree was constructed with minor modification as described previously (Mohanta et al., 2015a, 2017; Mohanta T. et al., 2015; Mohanta and Bae, 2017) with the following statistical parameters: analysis, phylogenetic reconstruction; statistical method, maximum likelihood; test of phylogeny, bootstrap method; number of bootstrap replications, 1,000; substitutions type, nucleotides; model/method, Jukes-Cantor model; rates among sites, uniform rates; gaps/missing data treatment, partial deletion; site coverage cutoff (%), 95; ML heuristic methods, nearest-neighbor-interchange (NNI); initial tree for ML, default-NJ/BioNJ; and branch swap filter, very strong. The phylogenetic tree that contained all of the tRNA of O. sativa and A. cylindrica was saved as the “gene tree.”
Gene duplication or loss, which is one of the most important events that occurs in a genome leads to evolution of the genome and new genomic content. Duplication/loss events of tRNA were studied using the Notung 2.6 software. A species tree of O. sativa and A. cylindrica was generated using the NCBI taxonomy browser (http://www.ncbi.nlm.nih.gov/Taxonomy/CommonTree/wwwcmt.cgi). The generated “gene tree” and species tree was analyzed by Notung 2.6 software. During the analysis, the gene tree was reconciled with the species tree to get the duplication/loss events of O. sativa and A. cylindrica tRNA.
Analysis of Transition/Transversion
Transition/transversion explains the substitution rates of nucleotides; therefore, we studied the transition/transversion of all 20 tRNA. The generated MEGA files of all of the tRNA genes were analyzed with the MEGA6 software to find the transition/transversion rate. The statistical parameters used to analyze the transition/transversion rate were as follows: analysis, substitution pattern estimation (MCL); scope, all selected taxa; statistical method, maximum likelihood; substitution type, nucleotides; model/method, Tamura-Nei (automatic selection); gaps/missing data treatment, partial deletion, and site coverage cut off (%), 95.
Analysis of Transition/Transversion Bias
Some mutations occur more frequently than others; accordingly it is important to understand these biases to interpret the differences that have occurred between different tRNA of the same species. Because the tRNAs are very important molecules involved in protein translation machinery, understanding their transition/transversion is of particular importance. Therefore, we conducted analysis of transition/transversion bias with all tRNA using the MEGA6 software. The generated MEGA file of the tRNA was analyzed for transition/transversion bias using the MEGA6 software based on the following statistical parameters: analysis, estimated transition/transversion bias (MCL); scope, all selected taxa; statistical method, maximum composite likelihood (MCL); substitution type, nucleotides; model/method, Tamura-Nei model; gaps/missing data treatment, partial deletion, and site coverage cutoff, 95%.
TM: Conceived the idea, performed the experiments and statistical analysis, drafted and revised the manuscript; HB: revised the manuscript.
Conflict of Interest Statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
This work was carried out with the support of the Next-Generation Biogreen 21 Program (PJ011113), Rural Development Administration, Republic of Korea.
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/article/10.3389/fgene.2017.00090/full#supplementary-material
Akama, K., Naß, A., Junker, V., and Beier, H. (1997). Characterization of nuclear tRNATyr introns: their evolution from red algae to higher plants. FEBS Lett. 417, 213–218. doi: 10.1016/S0014-5793(97)01288-X
Bermudez-Santana, C., Attolini, C. S.-O., Kirsten, T., Engelhardt, J., Prohaska, S. J., Steigele, S., et al. (2010). Genomic organization of eukaryotic tRNAs. BMC Genomics 11:270. doi: 10.1186/1471-2164-11-270
Eigen, M., Lindemann, B. F., Tietze, M., Winkler-oswatitsch, R., Dress, A., and Haeseler, A. V. O. N. (1989). How old is the genetic code? Statistical geometry of tRNA provides an answer. Science 244, 673–679. doi: 10.1126/science.2497522
Gaus, D., Gruter, F., and Sprinzl, M. (1979). “Proposed numbering system of nucleotides in tRNAs based on yeast tRNA,” in Transfer RNA: Structure, Properties, and Recognition, eds P. R. Schimmel, D. Söll, and J. N. Abelson (New York, NY; Cold Spring Harbor, NY: Cold Spring Harbor Laboratory Press), 818–519.
Holmquist, R., Cantor, C., and Jukes, T. (1972). Improved procedures for comparing homologous sequences in molecules of proteins and nucleic acids. J. Mol. Biol. 64, 145–161. doi: 10.1016/0022-2836(72)90326-9
Mohanta, T., Malnoy, M., Mohanta, N., and Kanchiswamy, C. (2014). In-silico identification and phylogenetic analysis of auxin efflux carrier gene family in Setaria italica L. African J. Biotechnol. 13, 211–225. doi: 10.5897/AJB2014.13617
Mohanta, T., Mohanta, N., and Bae, H. (2015). Identification and expression analysis of PIN-Like (PILS) gene family of rice treated with auxin and cytokinin. Genes 6, 622–640. doi: 10.3390/genes6030622
Mohanta, T. K., Arora, P. K., Mohanta, N., Parida, P., and Bae, H. (2015a). Identification of new members of the MAPK gene family in plants shows diverse conserved domains and novel activation loop variants. BMC Genomics 16:58. doi: 10.1186/s12864-015-1244-7
Mohanta, T. K., and Bae, H. (2017). Cloning and characterization of auxin efflux carrier genes EcPIN1a and EcPIN1b from finger millet Eleusine coracana L. 3 Biotech 7, 51. doi: 10.1007/s13205-017-0689-6
Mohanta, T. K., Mohanta, N., Mohanta, Y. K., Parida, P., and Bae, H. (2015b). Genome-wide identification of Calcineurin B-Like (CBL) gene family of plants reveals novel conserved motifs and evolutionary aspects in calcium signaling events. BMC Plant Biol. 15:189. doi: 10.1186/s12870-015-0543-0
Mohanta, T. K., Pudake, R. N., and Bae, H. (2017). Genome-wide identification of major protein families of cyanobacteria and genomic insight into the circadian rhythm. Eur. J. Phycol. 52, 149–165. doi: 10.1080/09670262.2016.1251619
Rogers, T. E., Ataide, S. F., Dare, K., Katz, A., Seveau, S., Roy, H., et al. (2012). A pseudo-tRNA modulates antibiotic resistance in Bacillus cereus. PLoS ONE 7:e41248. doi: 10.1371/journal.pone.0041248
Stange, N., Beier, D., and Beier, H. (1992). Intron excision from tRNA precursors by plant splicing endonuclease requires unique features of the mature tRNA domain. Eur. J. Biochem. 210, 193–203. doi: 10.1111/j.1432-1033.1992.tb17408.x
Sugahara, J., Yachie, N., Sekine, Y., Soma, A., Matsui, M., Tomita, M., et al. (2006). SPLITS: a new program for predicting split and intron-containing tRNA genes at the genome level. In Silico Biol. 6, 411–418.
Tocchini-Valentini, G. D., Fruscoloni, P., and Tocchini-Valentini, G. P. (2009). Processing of multiple-intron-containing pretRNA. Proc. Natl. Acad. Sci. U.S.A. 106, 20246–20251. doi: 10.1073/pnas.0911658106
Yue, D., Maizels, N., and Weiner, A. M. (1996). CCA-adding enzymes and poly(A) polymerases are all members of the same nucleotidyltransferase superfamily: characterization of the CCA-adding enzyme from the archaeal hyperthermophile Sulfolobus shibatae. RNA 2, 895–908.
Keywords: tRNA, duplication, deletion, Oryza sativa, novel tRNA
Citation: Mohanta TK and Bae H (2017) Analyses of Genomic tRNA Reveal Presence of Novel tRNAs in Oryza sativa. Front. Genet. 8:90. doi: 10.3389/fgene.2017.00090
Received: 06 March 2017; Accepted: 09 June 2017;
Published: 30 June 2017.
Edited by:Ghanshyam Upadhyay, City College of New York (CUNY), United States
Reviewed by:Jian Lu, Johns Hopkins University, United States
Prabhat Kumar Sharma, Children's Hospital of Philadelphia, United States
Geetha Durairaj, University of California, Irvine, United States
Copyright © 2017 Mohanta and Bae. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.