Universal RNA Secondary Structure Insight Into Mosquito-Borne Flavivirus (MBFV) cis-Acting RNA Biology

Mosquito-borne flaviviruses (MBFVs) spread between vertebrate (mammals and birds) and invertebrate (mosquitoes) hosts. The cis-acting RNAs of MBFV share common evolutionary origins and contain frequent alterations, which control the balance of linear and circular genome conformations and allow effective replication. Importantly, multiple cis-acting RNAs interact with trans-acting regulatory RNA-binding proteins (RBPs) and affect the MBFV lifecycle process, including viral replicase binding, viral RNA translation-cyclisation-synthesis and nucleocapsid assembly. Considering that extensive structural probing analyses have been performed on MBFV cis-acting RNAs, herein the homologous RNA structures are online folded and consensus structures are constructed by sort. The specific traits and underlying biology of MBFV cis-acting RNA are illuminated accordingly in a review of RNA structure. These findings deepen our understanding of MBFV cis-acting RNA biology and serve as a resource for designing therapeutics in targeting protein-viral RNA interaction or viral RNA secondary structures.


INTRODUCTION
The members of the mosquito-borne flavivirus (MBFV) replicate in vertebrates and/or mosquitoes and include a large number of zoonotic pathogens that are divided into eight groups with genetic and serotype divergences: the Japanese encephalitis group (JEVG), Ntaya virus group (NTAVG), Kokobera virus group (KOKVG), Aroa virus group (AROVG), Lammi virus group (LAMVG), YFV group (YFVG), Spondweni virus group (SPOVG) and Dengue virus group (DENVG) (Benzarti et al., 2019). In terms of the naturally infected vertebrate hosts, the majority of MBFV groups (MBFVGs) demonstrate a strong preference for primates (Fernandez-Sanles et al., 2017), while birds act as the primary hosts for JEVG and NTAVG (bird-adaptable MBFVGs), and they serve as amplifying or bridging hosts that have the risk of transmitting the virus to humans (Maharaj et al., 2018).
The MBFV genome is flanked by terminal cis-acting RNAs that contain a number thermodynamically stable and shape-conserved structural elements. At the same time, sequence and structural polymorphisms of cis-acting RNAs are observed across MBFVGs.
The 5 -terminal cis-acting RNA contains two stem-loops (SL; 5 SLA and 5 SLB), and the adjacent hairpin in the capsidcoding region (5 cHP) (Gebhard et al., 2011) Figure S1a). The 3 -terminal cis-acting RNA can be divided into three independent domains Gould, 2006a,b, 2007) (Supplementary Figure S1a): Domain I features a variable region (3 VR) located downstream of the terminal codon of the coding region, where the divergence is primarily concentrated, consisting of long deletions, insertions, sequence repeats. Domain II displays two dumbbell-like secondary structures (3 DB1 and 3 DB2). Domain III contains the conserved sequence 3 CS1, a small hairpin (3 sHP) and the 3 SL. Local RNA structures throughout the genome have been computationally predicted by Proutski et al. (1997). Further extensive structural probing analyses have been experimentally performed on cis-acting RNAs of several MBFVs (Lodeiro et al., 2009;Wang et al., 2017;Huber et al., 2019). However, the sequence divergence and structure heterogeneity of MBFV cis-acting RNA are less-known. With that in mind, prediction of the possible folding of the homologous RNA structure of cis-acting RNA of each MBFV was done by the Mfold 1 . Remarkably, cis-acting RNAs can be a signal for antiviral or proviral cofactors in a structure dependent manner (Tables 1-3; Chahar et al., 2013;Li et al., 2013Phillips et al., 2016;Ward et al., 2016;Chavali et al., 2017;Chiu et al., 2018), which further indicated that RNAs share common evolutionary origins and contain frequent covariations to retain these RNA structures. In this sense, a comparative and thorough analysis on cis-acting RNA structures are needed to be conducted as means of identifying possible determinants of MBFV replication and pathogenicity.

(Supplementary
Beyond local RNA structure in the linear genome, the long-range 5 -to-3 circularized cis-acting RNA (5 -3 cirRNA) has been shown to position the flaviviral RNA-dependent RNA polymerase (RdRp) close to the transcription start site . When no viral proteins are present, the 3 SL in the linear genome may enhance viral translation at the initiation stage, possibly via an interaction with the cellular translation machinery ( Table 1; Holden and Harris, 2004). As the amount of viral protein accumulates to form the viral replicase, 5 SLB, 3 sHP, and the lower stem of 3 SL of the linear conformation need to unfold to permit the 5 -3 cirRNA formation via the reversed complementary sequences, such as 5 UAR/3 UAR (upstream AUG region), 5 DAR/3 DAR (downstream AUG region) and 5 CS/3 CS (conserved sequence) elements (Wang et al., 2017) (Supplementary Figures S1a,b). It's believed that the 5 UAR-flanking stem (5 UFS) functions as a riboswitch by dictating NS5 recruitment and vRNA cyclization. NS5 specifically recognizes and binds to 5 SLA and 5 UFS of the linear genome. Following that, the vRNA circularization would start and highly structured 5 UFS would unwind (Liu et al., 2016). Concurrently, it would favor NS5 to transfer the 3 SL, hence properly positioning the polymerase for initiating the negative-strand RNA synthesis, which is the first step of progeny propagation. In the replicase complex, the multifunctional NS3 protease/helicase is suggested to be involved in folding or 1 http://unafold.rna.albany.edu/?q=mfold unfolding of vRNA structures stimulated by NS5 (Xu et al., 2019). In addition to the NS5 replicase, viral as well as cellular proteins have been suggested to support flavivirus RNA replication. The circularization mechanism is prerequisite for vRNA synthesis, nevertheless, several complementary elements show low conservation rates amongst MBFVGs. Particularly, the synergy and respective functions of the complementary elements involved in 5 -3 cirRNA formation remain to be clarified.
Here optimal RNA secondary structures were predicted, and consensus secondary structures of the core cis-acting RNA were modeled. The homology and polymorphism of secondary structures of each MBFVG cis-acting RNA element were characterized by group. Considering that the current MBFV cis-acting RNA biology isn't a complete mechanism system, we also describe the cis-acting RNA biology and pinpoint possible functional determinants of the cis-acting RNA accordingly in a review of MBFV RNA structure (Tables 1-3). Antiviral and vaccine strategies that target protein-viral RNA interaction or viral RNA secondary structures are promising.

Similarity and Difference on MBFV 5 -3 cirRNA Complementary Sequences
Mosquito-borne flaviviruses have evolved into eight distinct groups with profound differences in their genome sequences ( Figure 1A). Although cis-acting RNAs typically show large variance both in length and sequence composition, rendering them inconsistently aligned and ill-suited for phylogeny reconstruction. This genome-based phylogenetic classification is in good agreement with 5 -3 cirRNA secondary structure clustering, mainly because all MBFVs share a common circularized architecture ( Figure 1B). All the 5 -3 cirRNAs are maintained by an intact 5 SLA, 5 cHP and incomplete 3 SL at the termini (Supplementary Figures S1a,b). Moreover, the representative 5 -3 cirRNAs of bird-adaptable MBFVG possess shorter single-stranded spacer neighboring the 5 SLA (Supplementary Figure S1b). At least three pairs of complementary sequences have been shown to participate in the 5 -3 cirRNA composition, i.e. the 5 CS/3 CS, 5 DAR/3 DAR and 5 UAR/3 UAR elements ( Figure 1B). More recently, a new pair of complementary sequences, 5 C1/3 -DB1, has been identified by disassembling the hairpin residing in the 5 cHp and 3 DB1 (De Borba et al., 2015). Notably, except the YFVG, SPOVG and DENVG, all the rest of MBFVGs share two putative DAR regions (DARI, DARII) ( Figure 1B and Supplementary Figure S1b). It's worth mentioning that the 5 G/3 U rich duplex element is preserved as diverse sizes across MBFVGs but lacking for the YFVG (Figure 1B and Supplementary Figure S1b), whereas the 5 G/3 U element in the 5 -3 cirRNA hasn't been previously proved and much concerned (Basu and Brinton, 2011).
Owing to the inconsistent neighboring bases close to the highly conserved 5 CS/3 CS, the first base-pairing of 5 CS/3 CS is undefined in MBFVGs (Figure 2).Both the 5 CS/3 CS and 5 DARII/3 DARII elements in bird-adaptable MBFVGs are highly conserved in sequence and structure (Figure 2A), while the YFVG 5 CS/3 CS is typically longer (Figure 2B,  Supplementary Figure S2). Structural polymorphisms of 5 DARI/3 DARI were observed among the bird-adaptable MBFVGs even though the 5 DARI and 3 DARI sequences are relatively conserved. For instance, the initial base-pairing of 5 DARI/3 DARI is also inconsistent resulting from the uncertain neighboring matchings (Figure 2A and Supplementary Figure S2). Interestingly, both the topology and sequence of 5 DAR/3 DAR in none-bird-adaptable MBFVGs closely  Cui et al., 1998 resembles 5 -3 DARII in the dual-DAR MBFVGs ( Figure 2B). Last but not least, the 5 UAR/3 UAR are strikingly divergent with different quantities and sizes of bulges even in the same MBFVG (Figure 2). Additionally, the internal unmatched base-pairings in 5 UAR/3 UAR are less conserved in 5 -3 cirRNA and tend to participate in the local secondary structure in the linear viral genome.

Efficient and Alternative 5 -3 cirRNA Dominates MBFV vRNA Synthesis
During the MBFV cyclization process, the dynamic 5 -3 cirRNA initiates from the 5 CS/3 CS duplex formation, then extends the initial interaction by the 5 DAR/3 DAR, and assists the interaction of the 5 UAR/3 UAR element which would result from unwinding the bottom of terminal 3 SL (Polacek et al., 2009a;Friebe et al., 2011). Moreover, a sequence present downstream of the 5 CS element in the capsid-coding sequence called "downstream CS"(dCS) impacts genome circularization by modulating the topology of 5 cis-acting RNA (Friebe et al., 2012). Earlier reports showed that 5 CS/3 CS element was demonstrated as the most stable interactions in both the DENV and ZIKV viruses (Huber et al., 2019). Under stringent cellular conditions, the isoform p45 of host protein AUF1 (also termed hnRNP D) destabilizes 5 SLB and the 3 SL, thereby exposing the 5 UAR element and promoting vRNA cyclization, and positively regulates the replication of WNV, DENV and ZIKV ( Table 1; Friedrich et al., 2014Friedrich et al., , 2018. It is conceivable that depletion of AUF45 inhibits virus replication and infection. In another report, the core protein was found to induce a dramatic acceleration in the 5 -3 cirRNA annealing in vitro (Ivanyi-Nagy and Darlix, 2012). Upon core protein chaperoning, it might stimulate intramolecular RNA rearrangements or intermolecular 5 -3 cirRNA interactions without ATP consumption. Indeed, binding of the cellular and viral protein to 5 -3 cirRNA of MBFVs seems to be a viable mechanism for regulating the 5 -3 cirRNA affinity which is critical for the initiation of negative-stranded RNA through NS5 recruitment to 3 cis-acting RNA (Table 1; Garcia-Montalvo et al., 2004;Vashist et al., 2009Vashist et al., , 2011Gomila et al., 2011;Bhullar et al., 2014).
Deletion of the complete 3 CS could partially restore 5 -3 cirRNA conformation and rescue the lethal WNV through compensatory mutations on the 5 UAR/3 UAR and 5 DAR/3 DAR elements (Zhang et al., 2010). Furthermore, the effect of the individual base-pairing on replicons varied with their position (Suzuki et al., 2008), mutations on the central position of 5 CS/3 CS sequence have negligible effect on replication, whereas base-pairings in the terminal position severely affect viral replication. In this case, a number of adjacent 5 CS/3 CS mismatching combinations were also rescued by a second site mutation that created additional base-pairings on the internal genomic side of the 5 CS/3 CS element (Basu and Brinton, 2011). Interestingly, mutations in the DENV single 5 DAR disrupts the 5 -3 cirRNA interaction, whereas mutant 3 DAR still retains alternative 5 -3 cirRNA conformation (Friebe and Harris, 2010). Likewise, mutations in the central three nucleotides of the WNV 5 -3 DARII decreased but not disrupted the affinity of 5 -3 cirRNA (Friebe et al., 2011). As such, the abrogated viral replication may directly result from the decreased affinity of the 5 -3 cirRNA, otherwise mediated by the corresponding 5 DARII nucleotides and/or 3 DARII structures in the linear viral genome. Consistently, the 5 DARII is single-stranded, while the 3 DARII is involved in the 3 sHP formation in the linear viral genome. Both the complementarity 5 DAR/3 DAR element and the formation of 3 sHP and 3 SL are required for vRNA replication of MBFVs. In most of the 5 UAR/3 UAR cases, a single mutation disrupting complementarity can greatly compromise vRNA synthesis, and compensatory mutations could potentially reestablish alternative 5 UAR/3 UAR element, then modulate vRNA synthesis but not viral translation (Alvarez et al., 2005b(Alvarez et al., , 2008. Given that the internal unmatched base-pairing in 5 -3 cirRNA is less conserved and tends to participate in the local secondary structure of the viral linear genome, it is likely that these nucleotides would regulate stability of the 5 -3 cirRNA.
Promoter and Enhancer of 5 cis-Acting RNA

RNA-Binding Proteins (RBPs) Direct 5 SLA Promoter-Dependent vRNA Synthesis
Mosquito-borne flaviviruse genomic RNA begins as a Y-shapedlike long stem structure (termed the 5 SLA) that comprises three stems (S1, S1, and S3) characterized by the U bulge, top loop (5 TL) and side structure domain (SSD). Although the size and shape of the 5 SLA are highly conserved, sequence conservation isn't restricted to the local stems and loop regions (Figure 3 and Supplementary Figure S3). For instance, three stem regions are preserved though a number of covariations, not least because S3 can tolerate large variations on the length and sequence (Lodeiro et al., 2009). That is, NTAVG and SLEV have insertions resulting in overall lengthening of the S3 region (Supplementary Figure  S3). Such substantial sequence divergences likely contribute to 5 SLA specific binding ability for RBPs. At the forefront of the 5 SLA promoter are the well-conserved dinucleotide "AG" in all MBFVs except the KEDV and DGV. However, nucleotides at the third site extensively undergo an 3rd A/U nucleotide substitution, and the 4th nucleotides seem to show sequence and structure divergences even in the same JEVG or DENVG ( Figure 3B and Supplementary Figure S3). The junction loop between S2 and S3 exhibits irregular in sequence and structure among MBFVGs. Nevertheless, the bird-adaptable group is characterized by a relatively regular junction loop "GAA/G" (Figure 3 and Supplementary Figure S3). The structurally conserved 5 TL is less variable in length than the loop region of SSD (SSL). The SSD that prominently sticks out from the stem backbone demonstrates extensive heterogeneity in size, sequence and secondary structure (Figure 3 and Supplementary Figure S3). Furthermore, a single-isolated "A" nucleotide is located aside SSD, which exclusively belongs to the bird-adaptable groups as well as DENVG and CHAOV.
The 5 m7GpppAmpN2 cap (5 cap) is absolutely conserved in MBFVGs, which enables translation through the canonical cap-dependent translation initiation (Zhao et al., 2015). The addition of 5 cap of vRNA is mediated by the methyltransferase activity of NS5 in combination with nucleotide triphosphatase activity of NS3 (Issur et al., 2009;Zhao et al., 2015). Under these conditions the forefront nucleotides are specifically recognized and regulated by the cap-binding eIF4E protein (Gingras et al., 1999;Lloyd, 2015;Harvey et al., 2018; Table 2). As flavivirus NS5 RdRp contains an ATP-specific priming site, which imposes a strong preference for the de novo synthesis using a dinucleotide "AG" primer (Nomaguchi et al., 2003;Zhao et al., 2015). The initially terminal 5 -AGAA-3 is the most optimal template for DENVG RdRp (Nomaguchi et al., 2003). DENV NS3 were previously shown to affect the translocation of vRNA through dynamic interactions with nucleotides at positions 4th U and 5th G of the 5 SLA (terminal sequence 5 -AGUUGUUAGUCU-3 ). Residuals D290 and R538 of DENV NS3 also have specific interactions with the 2nd G and 5th G, respectively, Swarbrick et al. (2017). The presence of 2nd G and 5th G presumably drives a molecular switch of NS3 and leads to significantly higher activation of ATPase activity of NS3. Functional analysis of S1 and S2 of 5 SLA demonstrated that disruption of the stems abolished vRNA replication. Genomelength viral RNAs with reconstitution of these stems replicated at a moderately lower rate and generated revertants or second-site mutations upon passaging (Filomatori et al., 2006;Li et al., 2010;Liu et al., 2017). WNV NS5 binds specifically to S1 region (Dong et al., 2008), and requires distinct RNA elements within the S1 for two successive cap methylations (Dong et al., 2007). Beyond that, both the 5 TL and SSL are involved in specific NS5 binding (Table 1). However, such binding determinants were considered to be necessary to promote NS5 polymerase activity but not the prior NS5 binding process . Despite the diversity on SSD, its stable secondary structure was found to be essential for infectivity (Dong et al., 2008;Li et al., 2010;Zhao et al., 2015). Notably, the structurally stable SSD is also sensitive to oligomer (PMO), which is a key antiviral molecular (Dong et al., 2007). Neither the deletion of U bulge nor GAA/G loop alter RdRp binding and activity , whereas it's proposed that they must interact with other proteins required for viral replication (Table 1).
Interestingly, the 5 cap of viral RNA functions to subvert innate host antiviral responses through escape of IFIT-mediated viral suppression (Daffis et al., 2010;Szretter et al., 2012). In addition, it has been hypothesized that the double-stranded stem of this 5 SLA, which is located several nucleotides away from the 5 triphosphates (5 -3P), may act a potent agonist and lead to interferon secretion in infected cells (Deo et al., 2014(Deo et al., , 2015Chazal et al., 2018). During replication, RIG-I must bind the 5 -3P of nascent vRNA before capping, which consists of releasing the terminal phosphate from the 5 -3P of the (+) strand by the viral NS3 protein (Cui et al., 1998;Klema et al., 2015). Such recognition by RIG-I of one of the most conserved elements within the vRNA would facilitate virus immune escape. FIGURE 3 | (A) Mean relative evolutionary rates are shown for each site of the 5 cis-acting RNA. The analysis involved 25 nucleotide sequences from all the MBFVGs, spanning the 5 SLA, 5 SLB, 5 cHP and 5 CS. Evolutionary analyses were conducted in MEGA6. These rates are scaled such that the average evolutionary rate across all sites is 1. This means that sites showing a rate <1 are evolving slower than average, and those with a rate >1 are evolving faster than average. (B) MBFV structured promoter 5 SLA features divergent nucleotides. The 3rd A/U nucleotide (*) undergoes a nucleotide substitution, and the 4th nucleotide (×) shows sequence and structure divergences even in the same JEVG or DENVG.

Group Structure-Specific 5 SLB Dictates NS5 Recruitment and vRNA Cyclization
The top region of 5 SLB contains the translation initiation codon and projects structure-specificity in each MBFVG (Figure 4). Therefore, we reasonably speculate that some selection pressure can contribute to the top part of 5 SLB. At the bottom of 5 SLB, the oligo(U) region of MBFVG forms a canonical duplex with complementary sequences in or near the 5 UAR sequence and vRNA translation start region, which is designated as the 5 UFS. Because the local folding pattern of 5 SLB and 5 -3 cirRNA in the YFVG are strikingly different from other MBFVGs, the corresponding 5 UFS in YFVG is recognized as the AUrich hairpin (Figure 4 and Supplementary Figure S1b). The structural 5 UFS unwinds in response to 5 -3 cirRNA, leading to the decreased NS5 affinity for the 5 cis-acting RNA and NS5 transferring to the 3 cis-acting RNA. However, the G-C base-pairings might increase the potential stability of 5 UFS, as in WNV and USUV (Figure 4). Further research also showed that stabilizing 5 UFS impaired both vRNA cyclization and replication (Liu et al., 2016). An unstructured 5 DARII decollates the 5 SLB and 5 cHP, specially preserved in the bird-adaptable group, KOKVG and AROAVG (Supplementary Figure S2).

cHP Facilitates vRNA Translation Initiation Selection as Well as vRNA Synthesis
For each MBFVG, comparison on stem region of the homologous 5 cHP element reveals varying degrees of nucleotide conservation, regardless of the extremely non-conservative loop sequence (Figures 3A, 5). That is, the position of the conserved nucleotide is MBFVG-specific, and besides that the corresponding conserved nucleotides of the two stems FIGURE 4 | The secondary structure of 5 SLB from eight MBFVGs is group-specific. The 5 UFS base-pairing is indicated at the bottom of 5 SLB in the black bracket. The nucleotides that participate in the MBFV 5 UAR and 5 DAR are labeled in respective colors as above.
(S1, S2) can precisely be base-paired. Particular for the YFVG, SPOVG and DENVG, these nucleotides varied with MBFVGs. Importantly, the conserved nucleotides determine the key amino acid as well as the stem structure, heralding an evolution restriction of both the RNA structures and key residuals. The 5 cHP's structure serves primarily to stall the ribosome over the translation initiation codon, with variable sequence on the stem and loop, but its stability and location with respect to the translation initiation codon apparently correlate with translation efficiency (Clyde et al., 2008). On the other hand, the 5 cHP likely stabilizes the overall 5 -3 cirRNA formation or participates in the recruiting cofactors associated with the replicase machinery during vRNA synthesis.

Duplicated 3 DBs Confer Contrasting Functions and Host Specificity
To achieve resilience in host adaptation, 3 DB duplication strategies have been proposed as an evolutionary trait for MBFV. It's worth mentioning that the single 3 DB is still present in AROAVG, YFVG and SPOVG (Figure 6A and Supplementary Figure S4). The single-stranded linker separates DB1 and DB2 and varies in length and sequence (Supplementary  Figures S5, S6). Considering that the top hairpin of 3 DBs should be extraordinary for its similarity in both structure and sequence, only the consensus structures of the bottom part of 3 DBs are constructed and sorted. Additionally, the group-conserved nucleotides are mapped on the homologous structures ( Figure 6B). Indeed, nucleotides in the bottom stem of DB1 do not preserve consistency among MBFVGs. The ensuing 5 -CCC-3 trinucleotide of DB1 allows the first "C" nucleotide to be freely unpaired and consequently constitutes an interior loop, which distinguishes the bird-adaptable groups from the majority of MBFVGs (Figure 6B and Supplementary Figure S4). Unlike DB1, the structure and the nucleotides in the interior loop of DB2 are instead invariant in most of the dual-DB MBFVGs (Figure 6B and Supplementary Figure S6). Such DB2 elements are more easily clustering than DB1, hinting to a lower evolution pressure. Duplicated 3 DBs are incorporating two repeated conserved sequences (CS2 and RCS2). The pseudoknot formation of TL1/PK2 and TL2/PK1 is promoted by the presence of respective loops TL1/TL2 and complementary pentanucleotide PK1/PK2 as well as RCS2 and CS2 (Supplementary Figure S1a).
There are two main explanations to the presence of dual-DB structures: (a) their putative interaction with dimeric proteins and (b) their requirement to sustain replication in a dual-host system of vertebrate hosts and invertebrate vectors. However, the latter hypothesis is proven. Deletion of DENV DB1 reduces viral replication both in human and mosquito cells. It's intriguingly that deletion of DENV DB2 was greatly advantageous for mosquito infection, with low impact in human cells. Analysis of viruses obtained from infected mosquitoes showed selective mutations mapped in the DB2 (De Borba et al., 2019). Importantly, DB1 and DB2 differentially modulate viral genome cyclization, the pseudoknot formed within DB2 competes with long-range 5 -3 cirRNA interactions (De Borba et al., 2019). Additionally, the respective contributions of TL1 and TL2 to translation appear unequal: TL1 mutation alone does not have any effect; TL2 mutation has only a modest effect in translation; but translation is reduced in the TL1/TL2 double mutant, indicating that TL1 exhibits a cooperative synergy with TL2 in translation (Manzano et al., 2011). A 30ntdeletion corresponding to TL2 of the DB2 structure attenuates all four DENV serotypes and is currently being tested as a vaccine candidate (Alvarez et al., 2005a;Kirkpatrick et al., 2016). Moreover, a 10nt deletion of the 3 DB of ZIKV has shown a target for viral attenuation (Shan et al., 2017). Although mutations abrogating TL/PK complementarity can imped viral translation and replication. Remarkably, restoration of pseudoknots can rescue the translation level but not replication defects. In contrast to TL1 and TL2, PK1 and PK2 are not absolutely necessary for translation, suggesting its alternative TL receptors within the vRNA. Despite the lack of a poly-A tail, PABP appears to specifically bind to the A-rich sequences flanking the 3 DB structures, where the  corresponding binding nucleotides aren't exactly identical. Such interaction mimics the role of mRNA poly-A tail and presumably stimulates translation initiation (Supplementary Figure S4 and Table 3).

Viral Replication Shows High Vulnerability to the Overlapping RNA Signals on 3 sHP
The 3 UAR and 3 DAR of 5 -3 cirRNA sequences overlap the 3 sHP. In most cases, the highly conserved sequences on the left stem are involved in 5 DARII/3 DARII formation, and the majority of the loop region overlaps with the 3 DARI. Accordingly, the right stems participate in the 3 UAR or 3 DARI in some MBFVs. However, several nucleotides of the loop and right stem only participated in the 3 sHP structure but not the cyclization formation (Figure 7). Hence, we have been wondering if enhancing these bases complementary to the 5 cis-acting RNA would affect the viral replication via a more stable 5 -3 cirRNA. The overlapping sequences within the 3 sHP regulate the equilibrium between the two alternative conformations of the genome (Wang et al., 2017).
Except the SLEV of JEVG, the bottom two "C-G" basepairings of 3 sHP separate JEVG, NTAVG and KOKVG from the other MBFVGs ( Figure 7A). In other words, the lacking of "C-G" base-pairings results in a four-base-pair shortage in the non-bird-adaptable MBFVGs (Figure 7B). Studies of the first base-pairing "C-G" of DENVG 3 sHP confirmed that the stability can alter interconversion between the linear and circular conformations of the vRNA during replication (Wang et al., 2017). Alternatively, it has been shown that disruption of the 3 sHP stem abolishes viral replication unless reversion occurs in order to rescue replication via the balance between the alternative conformations (Villordo et al., 2010;Davis et al., 2013). Interestingly, point mutations in the 3 sHP abrogated infection in mosquito cells without affecting replication in mammalian cells (Villordo and Gamarnik, 2013). Remarkably, the loop region of 3 sHP that resembles the typical GNNRA motif, is longer in most members of JEVG, and the sequence is not exactly identical even in the same MBFVG (Figure 7). It has been demonstrated that nucleotides within that loop could form a pseudoknot with corresponding nucleotides in the 3 SL, which has implications for virulence, attenuation and vaccine development (Shi et al., 1996).

SL Is Endowed With a Virus-Specific Required Sequence (VRS)
Three homologous stems are defined in the most well-studied 3 SL according to the loci of corresponding bulges (Figure 8). Based on that, the minimal VRS of 3 SL in individual MBFV is easily confirmed using a chimeric genome. The sequence and structure of stem 1 (S1) of the 3 SL are highly conserved across the JEVG, NTAVG and KOKVG ( Figure 8A). Notably, the homologous S1 elements are lengthened slightly by extra nucleotides in remaining MBFVGs. The extra basepair insertion of S1 region is shown in the red bracket or box (Figures 8B,C). Region S1 of MBFVs consists of two conserved base-pairings (U-A, G-C) that are flanked by below and above internal loops. Additionally, the relatively position-fixed C/C bulge in S1 is strictly conserved among JEVG, NTAVG and KOKVG (Figures 8A,B). Region S2 has undergone sequence and structure changes and harbors extra bulges (Figures 8A-C). Specifically, in JEVG, the base-pairings flanked by the terminal bulges of S2 region characterize strictly homologous sequences. Nevertheless, the nucleotides in the middle region of S2 vary in size and sequence ( Figure 8A). Totally, the S2 element of DENVG is relatively short and lacking for a bulge region ( Figure 8C). In addition, the NTAVG is characteristic of the U-A at the lowest edge of the S2 region ( Figure 8A). Despite divergent nucleotide variations, S3 region contains a number of covariations to maintain the stem structure. Except for the YFVG, a stable hairpin comprising three invariable base-pairing and partially conserved nucleotides are observed at the top region of 3 SL (3 TL).
Despite the topological similarity of 3 SL across MBFVs, the analogous 3 SL sequence could not substitute for the original Three homologous stems (S1, S2, S3) were defined in corresponding colors. The highly conserved homologous bulge and adjacent structure are indicated by the orange box in JEVG and NTAVG. The homologous bulge in the S2 region is indicated by a black box in each group. The extra base-pair insertion in the S1 region is shown in the red bracket or box. Special TMUV/CQW1 insertion marked by the red bracket is in S2 region.
3 SL to support vRNA replication (Yu and Markoff, 2005). In the DENV2 chimeric genome, containing the WNV 3 SL nucleotide sequences, the VRS is an 11bp segment comprising the majority nucleotides in the S2 region (Zeng et al., 1998). In that case, the two specific bulges of S2 region are critical for viable DENV. Further analysis on the replication phenotypes of WNV chimeric genome has revealed that a bulge within the top portion (S2 and S3) in the 3 SL is essential for WNV replication. In addition, the introduction of a second bulge into the lower part of the long stem of the WNV 3 SL can be an enhancer of replication in cultured mosquito cells but not monkey kidney cells (Yu and Markoff, 2005). Generally, the loci of these bulges are not well conserved among flavivirus species. Nonetheless, the integrity of specific bulges is required for vRNA replication. Most likely, bulges are critical sites for binding of viral and cellular proteins to form the flavivirus replication complex (Yu and Markoff, 2005;Paranjape and Harris, 2007). Besides that, bulge regions are related to the low temperature transition (Davis et al., 2013). Mutation of the bulges or adjacent nucleotides is detrimental to virus replication . In contrast, engineered substitutions within S3 for WNV in the DENV backbone don't result in loss of infectivity, which is associated with the significant variability of region S3 in MBFV (Zeng et al., 1998).

Functional Interplay Between MBFV and Additional 3 cis-Acting RNAs
Even though the deletion of the 3 VR decreases DENV2 RNA synthesis in BHK cells, and these recombinant viruses are viable and delayed for replication, it has no effect on RNA replication in C6/36 cells (Alvarez et al., 2005a). Conversely, another report showed that deletion of the 3 VR increased replication in mosquito cells . Anyway, RNA structure specialization and duplication in 3 VR are utilized for maintaining host-fitness. The 3 VR of MBFV contains an AU-rich region that is thought to have evolved due to the RdRp stuttering on the UAA stop codon (Gritsun and Gould, 2007) (Supplementary Figures S1a, S6). The length of the AUrich region varies among MBFVGs, with the WNV and MVEV having the longest AU-rich regions (Gritsun and Gould, 2006b). Importantly, the 3 VR contains a succession of hypervariable exonuclease-resistant structures (xRNAs) structures and plays critical roles in virus-host interactions ( Table 3; Pijlman et al., 2008;Roby et al., 2014;Gokhale and Horner, 2017;Chiu et al., 2018;Liao et al., 2018;Soto-Acosta et al., 2018).
Upon flavivirus infection, accumulation of viral subgenomic flaviviral RNAs (sfRNAs) is observed, which are associated with viral replication, pathogenesis and cytopathicity (Chapman et al., 2014). Mechanistically, sfRNAs are induced by incomplete degradation of viral gRNA by stalling 5 -3 exoribonuclease Xrn1 at xrRNAs, including different types of stem-loop (SL) and DB elements (Chapman et al., 2014;Chen et al., 2018;Macfadden et al., 2018). These structures efficiently stall Xrn1 from progressing through from the 5 -terminal direction, thus protecting the downstream RNA from degradation. In mammalian cells, sfRNAs have been shown to globally inhibit inflammatory gene expression and dampen the cellular type I IFN response (Moon et al., 2012;Schuessler et al., 2012). On the other hand, sfRNAs are described to impair the innate immune response by targeting of cytoplasmic pattern recognition receptors (PRRs). High levels of sfRNAs of particular DENV strain would directly bind the TRIM25 in a sequence specific fashion (Manokaran et al., 2015), thus impairs the ubiquitination of RIG-I and thereby its activation. Furthermore, it's previously proposed that sfRNAs may inhibit interferon-stimulated genes (ISGs) translation through binding of G3BP1, G3BP2, and CAPRIN1 (Bidet et al., 2014). Interestingly, sfRNAs are believed to represent the smallest fragments of DENV RNA that can be replicated during an infection and might influence DENV transmission (Li and Aaskov, 2014).

CONCLUSION
Mosquito-borne flaviviruse cis-acting RNA exemplifies a paradoxical twist in the homology and polymorphism of RNA sequence and structure, highlighting the potential implications of flavivirus evolution and diversification in successful infection. The four large flavivirus clusters, MBFVs, TBFVs, no-knownvector flaviviruses (NKVs) and insect-specific flaviviruses (ISFVs), possess differences in their host ranges. On one hand, structure organization and regulation mechanism of cis-acting RNA are more restricted to MBFVGs. On the other hand, the dynamics of host-virus interaction drive the heritable genetic diversity and structural polymorphisms on cis-acting RNA of MBFVGs.
We review the feature and biology of cis-acting RNAs of MBFVGs as follows: (I) MBFVG divergent 5 -3 cirRNAs regulate MBFV replication in an alternative conformation (II) MBFVG 5 cis-acting RNA compromises structurally homologous promoter and enhancer despite the low sequence identity (III) 3 cis-acting RNA has shown apparently group-specific elements exemplified by 3 DB and 3 sHP structure (IV) Structurally heterogenous 3 SL is endowed with a critical VRS. Additionally, sfRNA can regulate the host cell upon infection. Combining structural and sequence analysis, a large amount of structural heterogeneity between the bird-adaptable and non-bird-adaptable groups is observed. After that, further experiments are necessary to clarify the molecular biology of these structural differences across MBFVGs.

SUPPLEMENTARY MATERIAL
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmicb. 2020.00473/full#supplementary-material FIGURE S1 | (a) Linear models of MBFV cis-acting RNA and the RNA-acting binding proteins (RBP) for MBFV. Using JEVG as model, the proposed secondary structures of the core cis-acting RNA represent the typical MBFV genomic RNA. Meanwhile, the conformations of individual cis-acting RNA secondary structure are annotated. (b) Summary of the representative MBFV 5 -3 cirRNA models. The minimum-free-energy structure is adopted. Colored lines indicate the interacting RNA elements involved in genome cyclization. The same color is used for equivalent structures in different MBFVGs. Above the multiple alignments, the secondary structure of JEVG is shown using an arc plot.
FIGURE S3 | Sequence and secondary structure of MBFV 5 SLA. (a) Comparison of the nucleotide sequences of the MBFV 5 SLA. The multiple sequence-alignments are annotated according to the homologous structures. The S1/2 and S1/2 are complementary base pairings, and the ( * ) and (× ) markers represent the divergent nucleotides. (b) Annotated homologous RNA secondary structure on 5 SLA across MBFVGs. The red dots and arrow in NTAVG indicate the nucleotide variations and insertion compared to the CQW1 strain, respectively. The RNA secondary structure was predicted online by Mfold (http://unafold.rna.albany.edu/?q = mfold).