Fragment-Based Quantum Mechanical Calculation of Excited-State Properties of Fluorescent RNAs

Shen, Chenfei; Wang, Xianwei; He, Xiao

doi:10.3389/fchem.2021.801062

ORIGINAL RESEARCH article

Front. Chem., 22 December 2021

Sec. Theoretical and Computational Chemistry

Volume 9 - 2021 | https://doi.org/10.3389/fchem.2021.801062

This article is part of the Research TopicFragment-Based Electronic Structure Methods for SolidsView all 4 articles

Fragment-Based Quantum Mechanical Calculation of Excited-State Properties of Fluorescent RNAs

Chenfei Shen¹^†

Xianwei Wang²*^†

Xiao He^1,3*^†

¹Shanghai Engineering Research Center of Molecular Therapeutics and New Drug Development, School of Chemistry and Molecular Engineering, East China Normal University, Shanghai, China
²College of Science, Zhejiang University of Technology, Hangzhou, China
³New York University-East China Normal University Center for Computational Chemistry at New York University Shanghai, Shanghai, China

Fluorescent RNA aptamers have been successfully applied to track and tag RNA in a biological system. However, it is still challenging to predict the excited-state properties of the RNA aptamer–fluorophore complex with the traditional electronic structure methods due to expensive computational costs. In this study, an accurate and efficient fragmentation quantum mechanical (QM) approach of the electrostatically embedded generalized molecular fractionation with conjugate caps (EE-GMFCC) scheme was applied for calculations of excited-state properties of the RNA aptamer–fluorophore complex. In this method, the excited-state properties were first calculated with one-body fragment quantum mechanics/molecular mechanics (QM/MM) calculation (the excited-state properties of the fluorophore) and then corrected with a series of two-body fragment QM calculations for accounting for the QM effects from the RNA on the excited-state properties of the fluorophore. The performance of the EE-GMFCC on prediction of the absolute excitation energies, the corresponding transition electric dipole moment (TEDM), and atomic forces at both the TD-HF and TD-DFT levels was tested using the Mango-II RNA aptamer system as a model system. The results demonstrate that the calculated excited-state properties by EE-GMFCC are in excellent agreement with the traditional full-system time-dependent ab initio calculations. Moreover, the EE-GMFCC method is capable of providing an accurate prediction of the relative conformational excited-state energies for different configurations of the Mango-II RNA aptamer system extracted from the molecular dynamics (MD) simulations. The fragmentation method further provides a straightforward approach to decompose the excitation energy contribution per ribonucleotide around the fluorophore and then reveals the influence of the local chemical environment on the fluorophore. The applications of EE-GMFCC in calculations of excitation energies for other RNA aptamer–fluorophore complexes demonstrate that the EE-GMFCC method is a general approach for accurate and efficient calculations of excited-state properties of fluorescent RNAs.

Introduction

RNA directly regulates a large number of cellular processes, and effective methods are desirable to fluorescently label and track RNA in living cells (Autour et al., 2018). Because RNA lacks inherent fluorescence, it is difficult to track RNA molecules in real time. (Dolgosheina et al., 2014). Unrau and coworkers demonstrated that the high binding affinity of RNA Mango to its fluorophore provided a useful tool for single-molecule RNA visualization and for fluorescently monitoring RNA complexes while simultaneously using the fluorophore as a purification tag (Dolgosheina et al., 2014). Fluorescent aptamers have been successfully applied to track and tag RNA in a biological system, but the excited-state properties of the RNA aptamer–fluorophore complexes can hardly be predicted by the traditional quantum mechanical (QM) methods due to their large molecular size (Raghavachari and Saha, 2015; Li et al., 2016; Liu et al., 2017; Nakai and Yoshikawa, 2017).

Mango II is an RNA aptamer that can accurately image the subcellular localization of three small noncoding RNAs in fixed and live mammalian cells (Autour et al., 2018; Trachman et al., 2018). These aptamers normally contain a closing RNA stem as shown in Figure 1C, which isolates a small fluorophore-binding core from the external sequence, making them easy to insert into arbitrary biological RNA (Trachman et al., 2019). Usually, the fluorophore would bind to RNA with non-covalent interactions.

FIGURE 1

FIGURE 1. Graphical representations of fluorophore (EKJ) and Mango II RNA aptamer (PDB id: 6C63) and illustration of the two-body (2B) treatment of the excitation calculation in the EE-GMFCC fragmentation method for RNA systems. (A) An example of 2B calculation; the yellow stick represents the locally excited region of the fluorophore (m). A22 is a ribonucleotide in the RNA, whose spatial distance from the fluorophore is within a predefined threshold of λ(2B) (here 4 $Å$ was employed). (B) The EE-GMFCC scheme where the bond is cut for the ith ribonucleotide (in black dotted box), and the ith fragment was utilized in two-body QM calculation with the EKJ. (C) Mango II-EKJ complex. Mango II RNA consists of two parts: a stem and fluorophore-binding pocket.

Many QM methods were proposed for excited-state calculations, such as the approximate coupled cluster singles and double (CCSD) model (CC2) (Christiansen et al., 1995), symmetry-adapted cluster configuration interaction (SAC-CI) (Nakatsuji, 1979), complete active space second-order perturbation theory (CASPT2) (Finley et al., 1998), equation-of-motion CCSD (EOM-CCSD) (Stanton and Bartlett, 1993), time-dependent Hartree–Fock (TD-HF) (McLachlan and Ball, 1964), configuration interaction singles (CIS) (Foresman et al., 1992), and time-dependent density functional theory (TD-DFT) (Gross and Kohn, 1990; Adamo and Jacquemin, 2013; Kocherzhenko et al., 2017). Nevertheless, the applications of such methods to large molecules, such as proteins and RNAs, suffer from the limitations of expensive computational cost.

In order to reduce the computational cost of the excited-state calculations, a series of approaches were proposed for the systems with a localized electronic excitation (Jin et al., 2020). The simplest treatment is on the basis of the QM/MM method, which only apply the high-level QM method to the fluorophore, while the rest of the system was modeled with an empirical molecular mechanical (MM) method (Dahlke and Truhlar, 2007; Khait and Hoffmann, 2010; Kluner et al., 2011; Isborn et al., 2012; Daday et al., 2013; Milanese et al., 2017). More sophisticated approaches are using fragmentation QM techniques, which seek to reproduce full-system QM calculations by taking a proper combination of calculations of a series of individual fragments (Collins and Bettens, 2015; Scholz and Neugebauer, 2021).

Some fragmentation QM approaches have been proposed for calculations of the excited-state properties of large systems, including the generalized energy-based fragmentation (GEBF) approach developed by Li and coworkers (Li et al., 2016), the divide-and-conquer (D and C) method of Nakai and coworkers (Yoshikawa et al., 2013), the extension of the binary-interaction method (Hirata et al., 2005) of Hirata et al., and the fragment molecular orbital (FMO) method of Kitaura and coworkers (Chiba et al., 2007; Nakata et al., 2014). Recently, the electrostatically embedded generalized molecular fractionation with conjugate caps (EE-GMFCC) method was developed to calculate the excited-state properties of molecular crystals (Liu et al., 2019; Zhang et al., 2020) and fluorescent proteins (Jin et al., 2020) by our group. In this work, the EE-GMFCC method was further extended to predict the excited-state properties of fluorescent RNA–aptamer systems.

This paper is organized as follows. First, the convergence of the EE-GMFCC excitation energy as a function of the distance threshold for two-body QM interactions was tested for fluorescent RNA–aptamer systems. Second, the accuracy of the EE-GMFCC method on prediction of the excitation energy was investigated at both TD-HF and TD-DFT levels by comparison with the traditional full-system calculations. Third, the accuracy of calculated transition electric dipole moments (TEDM) and atomic forces by the EE-GMFCC method was demonstrated. Furthermore, a 100-ns molecular dynamics (MD) simulation of Mango II RNA in explicit water solvent was performed, and the relative excited-state energies of 10 different configurations extracted from the MD simulation were calculated by EE-GMFCC and compared with the results obtained from corresponding full-system QM calculations. Finally, the performance of the EE-GMFCC approach was extensively assessed on some other fluorescent RNA–aptamer systems by direct comparison with the full-system QM calculations.

Computational Approaches

Our previous work showed that the QM effects from the protein environment played a significant role in the calculations of the excited-state properties of green fluorescent protein (GFP) (Creemers et al., 1999; Creemers et al., 2000; Jin et al., 2020). Therefore, the local chemical environment was expected to be treated by the electronic structure method for accurately capturing the QM effects. Herein, the fragment-based QM method EE-GMFCC was proposed for incorporating the environmental effects of RNA on calculation of the excited-state properties of chromophore in the RNA aptamer–fluorophore complex. The EE-GMFCC method is an extension of the GMFCC/MM approach (He and Zhang, 2006). In the GMFCC/MM method, a system (proteins or RNA) would be divided into a series of unit-based fragments, and the properties, such as the total energy of the system, were obtained by taking a proper combination of the QM properties of subsystems. Generally, two-body interaction energy calculations were performed to capture the QM effects between the non-neighboring units that are spatially in close contact within a predefined distance threshold (Jin et al., 2017; Liu et al., 2017; Liu and He, 2017; Wang et al., 2018; Liu et al., 2019; Zhang et al., 2020). For further accounting for higher-order many-body electrostatic effects, an electrostatically embedding scheme was employed in the EE-GMFCC method by using embedding charges representing the remaining fragments in each fragment QM calculation (Wang et al., 2013; Jin et al., 2017). For clarity, before describing the application of the EE-GMFCC method in the calculation of the excited state of the RNA aptamer–fluorophore complex, we would give a brief description of the EE-GMFCC method for calculating the ground-state energies of RNAs.

The EE-GMFCC Method for Calculation of Ground-State Energy of RNA

The EE-GMFCC method was initially developed for calculations of the ground-state total energies of proteins (Wang et al., 2013; He et al., 2014; Jin et al., 2017; Liu and He, 2020). In the framework of the EE-GMFCC method, a protein with N residues is divided into N-2 fragments with each residue capped by its neighboring residues (conjugate caps) (Wang et al., 2013; Jin et al., 2017; Wang et al., 2018), and then the total energy of the given protein is obtained via taking a proper combination of the QM-calculated energies of the neighboring residues. For accurately capturing the QM effect between non-neighboring residues in close contact (within a predefined distance threshold λ), the corresponding two-body interactions are also treated at the QM level. Generally, higher-order interactions within the EE-GMFCC scheme are small and can be neglected due to the electrostatic embedding treatment.

Similar to the treatment of the protein, EE-GMFCC can be utilized in the calculation of the ground-state energy of a given RNA system. The ground-state energy of a RNA system is calculated using the EE-GMFCC method as follows (Jin et al., 2017),

\begin{array}{l} E_{EE - GMFCC}^{Ground state} = E_{MFCC} + E_{ribonucleotide - pair} - E_{double - counting} \\ = \sum_{i = 2}^{N - 1} \tilde{E} ({Cap}_{i - 1}^{*} A_{i} {Cap}_{i + 1}) - \sum_{i = 2}^{N - 2} \tilde{E} ({Cap}_{i}^{*} {Cap}_{i + 1}) \\ + \sum_{i = 1}^{N - 3} \sum_{\begin{matrix} j = i + 3 \\ | R_{i j} | \leq λ \end{matrix}}^{N} {({\tilde{E}}_{i j}^{r i b o} - {\tilde{E}}_{i}^{r i b o} - {\tilde{E}}_{j}^{r i b o})}_{Q M} - E_{double - counting} \end{array} (1)

where $\tilde{E}$ denotes the sum of the self-energy of a fragment and the interaction energy between the fragment and its background charges. ${Cap}_{i - 1}^{*} A_{i} {Cap}_{i + 1}$ is the ith ribonucleic acid unit (A_i) capped with a left-side cap of ${Cap}_{i - 1}^{*}$ and a right-side cap of ${Cap}_{i + 1}$ . The concap is fused molecular species by two neighboring ribonucleotides of ${Cap}_{i}^{*} {Cap}_{i + 1} (A_{i} A_{i + 1}$ ), and $\tilde{E} {Cap}_{i}^{*} {Cap}_{i + 1}$ is deducted in the EE-GMFCC method to avoid the double counting of overlapping fragments. The $E_{ribonucleotide - pair}$ represents the two-body QM interaction energy of two non-neighboring ribonucleotides within a distance threshold λ, and $R_{i j}$ is the distance between the two closest atoms of ith and jth nucleotides. $E_{double - counting}$ is calculated for all atom pairs whose interaction energies are doubly counted in the previous terms of Eq. 1. The $E_{double - counting}$ energy is approximated by pairwise charge–charge interactions. Usually, the atomic charges taken from the Amber force field are used in the embedding electrostatic treatment of the QM calculations for all fragments and the calculation of the $E_{double - counting}$ term. The more detailed description of the total ground-state energy calculation of RNA using the EE-GMFCC method can be found in our previous work (Jin et al., 2017).

EE-GMFCC for Excited-State Calculations of Fluorescent RNA–Aptamer Systems

Calculation of Excitation Energies

The EE-GMFCC method has been applied in the calculation of excitation energies of GFP in our previous work (Liu et al., 2019; Jin et al., 2020). The treatment of excited-state calculations with the EE-GMFCC method usually follows the condition of local excitation; that is, the dominant electronic reorganization that occurs in response to excitation of the system is only within a small region (Jin et al., 2020). Since the fluorophore in GFP is bonded with an amino acid as a specific residue, the excitation energy ω of the system could be obtained using Eqs. 2, 3, when the excitation center is in the mth residue.

\begin{array}{l} ω = \sum_{i = m - 1}^{m + 1} ω ({Cap}_{i - 1}^{*} A_{i} {Cap}_{i + 1}) - \sum_{i = m - 1}^{m} ω ({Cap}_{i - 1}^{*} {Cap}_{i + 1}) \\ + \sum_{\begin{matrix} j = 1 \\ | R_{m j} | \leq λ_{2} \\ j \notin [m - 2, m + 2] \end{matrix}}^{N} (ω_{m j} - ω_{m}) \end{array} (2)

where the first term is the sum of excitation energy of the (m-1)th, mth, and (m+1)th fragments, which can be obtained as follows:

\sum_{i = m - 1}^{m + 1} ω ({Cap}_{i - 1}^{*} A_{i} {Cap}_{i + 1}) = \sum_{i = m - 1}^{m + 1} {\tilde{E}}^{'} ({Cap}_{i - 1}^{*} A_{i} {Cap}_{i + 1}) - \sum_{i = m - 1}^{m + 1} \tilde{E} ({Cap}_{i - 1}^{*} A_{i} {Cap}_{i + 1}) (3)

where the excitation energy ω for a fragment of ${Cap}_{i - 1}^{*} A_{i} {Cap}_{i + 1}$ (or ${Cap}_{i}^{*} {Cap}_{i + 1})$ was calculated in the field of the remaining part of the protein represented by atomic charges taken from the MM force field. ${\tilde{E}}^{'} {Cap}_{i - 1}^{*} A_{i} {Cap}_{i + 1}$ is the total energy calculated at the excited state for the fragment of ${Cap}_{i - 1}^{*} A_{i} {Cap}_{i + 1},$ including the interactions between the excited-state wave function of ${Cap}_{i - 1}^{*} A_{i} {Cap}_{i + 1}$ and background charges of the remaining system, while $\tilde{E} {Cap}_{i - 1}^{*} A_{i} {Cap}_{i + 1}$ is the summation of the ground-state energy of ${Cap}_{i - 1}^{*} A_{i} {Cap}_{i + 1}$ and interaction between the fragment of ${Cap}_{i - 1}^{*} A_{i} {Cap}_{i + 1}$ and background charges of the remaining system. A more detailed description of the excitation energy calculation of protein using the EE-GMFCC method can be found in our previous work (Jin et al., 2020). Different from the fluorescent protein, the fluorophore binds to the Mango II RNA–aptamer through nonbonded interactions. Therefore, the excitation energy for such a system is calculated based on the EE-GMFCC method as follows,

ω_{EE - GMFCC} = ω_{fluorophore} + ω_{two - body} = ω_{m} + \sum_{\begin{matrix} j = 1 \\ | R_{m j} | \leq λ \end{matrix}}^{N} (ω_{m j} - ω_{m}) (4)

Here, $ω_{m}$ is the calculated excitation energy for the fluorophore using the QM/MM method with the remaining part of the system as background charges. $ω_{m j}$ represents the two-body (2B) QM excitation energy of the fluorophore m and jth ribonucleotide, which was obtained by QM calculation in the field of the rest of the system represented by the MM point charges taken from the ff99OL3 force field (Wang et al., 2000; Pérez et al., 2007; Zgarbová et al., 2011). The illustration of the 2B calculation is shown in Figure 1A.

The RNA was cut at the bond between C3 atom and O3 atom as shown in Supplementary Figure S1 of the Supporting Information. For instance, the Nth ribonucleotide will be separated from the (N-1)th ribonucleotide and the (N+1)th ribonucleotide at the bonds of C3(N-1)-O3(N-1) and C3(N)-O3(N), where C3(N) and O3(N) represent the C3 and O3 atoms of the Nth ribonucleotide in the RNA chain, respectively. The H atom would be utilized to saturate the dangling bond, as shown in Supplementary Figure S1. The bond length of H-O3 was set to 0.96 $Å$ , and the bond length of C3-H was set to 1.09 $Å$ . The QM calculations were performed using the Gaussian 16 package (Frisch et al., 2016).

Calculations of the TEDM and Atomic Forces

The TEDM between two states under the EE-GMFCC method could be obtained as follows:

μ_{i}^{EE - GMFCC} = μ_{i}^{m} + \sum_{\begin{matrix} j = 1 \\ | R_{m j} | \leq λ_{2 B} \end{matrix}}^{N} (μ_{i}^{m, j} - μ_{i}^{m}) (5)

where $μ_{i}^{EE - GMFCC}$ ( $i = x, y, or z$ ) represents the calculated TEDM between two states in the x, y, or z direction, and $μ_{i}^{m}$ is the TEDM of the excited state from the fragment QM calculation for the mth fragment (which is the fluorophore in this study) in the x, y, or z direction and it corresponds to the one-body (1B) term in the EE-GMFCC calculation. $μ_{i}^{m, j}$ represents the calculated TEDM of the two-body fragment consisting of the fluorophore and an adjacent ribonucleotide within the distance threshold $λ_{2 B}$ from the fluorophore. $μ_{i}^{m, j} - μ_{i}^{m}$ denotes the two-body QM correction for the TEDM from the interaction between the fluorophore and adjacent ribonucleotide in RNA. N is the total number of ribonucleotides in the RNA.

The total TEDM of the system between two states with the EE-GMFCC method could be utilized to calculate the oscillator strength as follows:

f^{EE - GMFCC} = \frac{2}{3} ω^{EE - GMFCC} ({| μ_{x}^{EE - GMFCC} |}^{2} + {| μ_{y}^{EE - GMFCC} |}^{2} + | {| μ_{z}^{EE - GMFCC} |}^{2}) (6)

where $ω^{EE - GMFCC}$ is the calculated excitation energy using the EE-GMFCC method, and $μ_{x}^{EE - GMFCC}$ , $μ_{y}^{EE - GMFCC}$ , and $μ_{z}^{EE - GMFCC}$ are calculated TEDMs in the x, y, and z directions, respectively.

The atomic forces of the kth atom in the fluorophore molecule at the excited state could also be calculated using the EE-GMFCC method by replacing the TEDM of $μ$ in Eq. 5 with the atomic force of F.

F_{k, i}^{EE - GMFCC} = F_{k, i}^{m} + \sum_{\begin{matrix} j = 1 \\ | R_{m j} | \leq λ_{2 B} \end{matrix}}^{N} (F_{k, i}^{m, j} - F_{k, i}^{m}) (7)

The superscripts and subscripts in Eq. 7 are similar to those in Eq. 5.

Structure Preparation

The initial structure of the Mango-II RNA aptamer system was taken from the X-ray crystal structure (PDB id: 6C63). The generalized Amber force field (GAFF) (Wang et al., 2004) and AM1-BCC (Jakalian et al., 2000) charges were utilized to simulate the fluorophore (EKJ37) in the classical MD simulation (Walker et al., 2008). The ff99OL3 (Wang et al., 2000; Pérez et al., 2007; Zgarbová et al., 2011) force field was employed for handling the RNA. The missing hydrogen atoms were added by the LEaP module of the Amber 18 package (Case et al., 2005; Salomon-Ferrer et al., 2013; Case et al., 2018).

The residue name of the fluorophore in this study is called EKJ (in PDB id: 6C63), which consists of a thiazole orange1 (TO1) and part of the polyethylene glycol linker. The EKJ binds to the Mango II RNA with a high affinity, and the high brightness of the EKJ in the RNA allows its application in live-cell imaging and also in conventional fixed cell methodologies (Autour et al., 2018; Trachman et al., 2018). The aptamer (Mango II RNA) contains a closing RNA stem and a fluorophore-binding pocket. The high binding affinity between the fluorophore and aptamer makes it possible to discern the signal coming from the free fluorophore or the fluorophore bound to RNA, imaging small non-coding RNAs in mammalian cells (Autour et al., 2018). The structures of fluorophore, RNA stem, and binding pocket of Mango II RNA are shown in Figure 1C.

Truncated Full-System QM Calculations

The computational cost of the full-system QM calculation of Mango II RNA containing 1,243 atoms was very expensive. Considering the localization of the QM effect on the excited-state property, we constructed several smaller model systems for the Mango II RNA, which contains the fluorophore and its neighboring ribonucleotides within a predefined distance threshold $λ_{F S}$ (a minimum distance between any two atoms on the ribonucleotide and fluorophore, respectively), while in the QM calculation for each model system, the rest of Mango II RNA was treated by the MM model represented by the atomic point charges. Therefore, in this study the truncated full-system calculation is the traditional QM/MM calculation with the constructed central model system partitioned into the QM subsystem and the rest of the Mango II RNA partitioned into the MM subsystem. A series of model systems were constructed with different distance thresholds of $λ_{F S}$ = 3, 4, 5, 6, and 7 $Å$ . The QM/MM calculations were performed on those model systems. The corresponding results of the calculated excited-state properties for different model systems were labeled as “fullsys ( $λ_{F S}$ )”. A detailed graphical illustration of the truncated full-system approach is shown in Figure 2.

FIGURE 2

FIGURE 2. Illustration of the model system for the truncated full-system QM calculation. The QM subsystem was shown in sticks, and the MM subsystem was shown in 80% transparency backbone. (A) The one-body QM calculation in the EE-GMFCC method for such a system is only for fluorophore EKJ (shown in yellow). (B) Fullsys (4) which was constructed with a predefined distance threshold of $λ_{F S}$ = 4 Å, (C) fullsys (5) ( $λ_{F S}$ = 5 Å), (D) fullsys (6) ( $λ_{F S}$ = 6 Å), and (E) fullsys (7) ( $λ_{F S}$ = 7 Å), respectively.

Results and Discussion

Accuracy of EE-GMFCC for Excitation Energy Calculations

The accuracy of the EE-GMFCC method for prediction of the excitation energies of the RNA aptamer–fluorophore complex was investigated by comparison with the truncated full-system QM/MM calculations. In the framework of the EE-GMFCC method, a two-body QM calculation was utilized to account for the QM effect from the adjacent ribonucleotides on the calculations of excited-state properties of the fluorescent aptamer. A predefined distance threshold ( $λ_{2 B}$ ) was used in the 2B calculations to achieve the balance between the attained accuracy and computational efficiency. The convergence of EE-GMFCC calculations as a function of $λ_{2 B}$ was first tested for both the TD-HF and TD-DFT methods by comparison with the truncated full-system calculations. $λ_{F S}$ was set to 3, 4, 5, 6, and 7 Å, respectively, to test the impact of the chemical environment on calculations of the excitation energies. The truncated full-system calculations demonstrated that the difference of the obtained excitation energies between fullsys (6) and fullsys (7) was merely 0.001 eV, as shown in Supplementary Figure S2 of the Supporting Information. Therefore, the system constructed with $λ_{F S}$ = 7 Å was used as the benchmark system for investigating the $λ_{2 B}$ dependence of EE-GMFCC calculations. Furthermore, the results in Supplementary Figure S2 also show that the calculated excitation energy of fullsys ( $λ_{F S}$ = 4 Å) was very close to that of fullsys ( $λ_{F S}$ = 7 Å) at both TD-HF/6-31G* and TD-ωB97X/6-31G* levels. This indicates that, to some extent, the constructed “full system” with $λ_{F S}$ of 4 Å can sometimes be used as a candidate system for some benchmark studies of the EE-GMFCC method.

The calculated excitation energy using truncated full-system calculation of the system would be set as the reference. The results of the excitation energies calculated by EE-GMFCC with different $λ_{2 B}$ are shown in Supplementary Table S1 of the Supporting Information. It can be seen that the results of EE-GMFCC calculations are close to convergence when $λ_{2 B}$ is set to 4 Å (see Figure 3), and the result is also close to that of the full-system calculation ( $λ_{F S}$ = 7 Å) with the absolute error within 0.008 eV. Therefore, we conclude that the distance threshold of 4 Å is sufficient for the two-body QM calculation of the excitation energies using the EE-GMFCC method.

FIGURE 3

FIGURE 3. Calculated excitation energies (red line) using EE-GMFCC as a function of the distance threshold of $λ_{2 B}$ at the TD-HF/6-31G* and TD-ωB97X/6-31G* levels, respectively. The calculated excitation energies from the model system of fullsys (7) constructed using $λ_{F S}$ = 7 Å are regarded as the reference (black line).

Calculation of TEDM and Atomic Forces Using EE-GMFCC

In addition to excitation energies, EE-GMFCC can also be utilized in the calculations of other excited-state properties, including TEDM (Tanabe et al., 1965; Verma et al., 2020) and atomic forces (Weisenhorn et al., 1989; Sarid et al., 1991). The calculated TEDMs for different model systems constructed with different $λ_{F S}$ of 4, 5, 6, and 7 $Å$ using the EE-GMFCC method are shown in Figure 4B, and the $λ_{2 B}$ for two-body QM calculations with the EE-GMFCC method was also set to the same value as $λ_{F S}$ . For comparison, the results of the TEDM calculated using truncated full-system calculation are also shown in Figure 4.

FIGURE 4

FIGURE 4. The calculated μ_x, μ_y, and μ_z of TEDM with (A) full-system calculations and (B) the EE-MFCC method for different model systems constructed using various $λ_{FS}$ at the TD-HF/6-31G* level. The red (μ_x), blue (μ_y), and green (μ_z) lines denote the corresponding values calculated on the truncated full system of fullsys (7).

One can see from Figure 4A that the calculated TEDM of the system of 1B ( $λ_{F S}$ = 0 $Å$ ) shows a relatively larger deviation from that of fullsys (3), especially for µ_x, which indicates that incorporating the local QM effects from the environment is crucial for accurate prediction of the TEDM. For comparison, the results of the calculated TEDM and the oscillator strength at the TD-HF/6-31G*¹³ level for different model systems using the truncated full-system and EE-GMFCC calculations are also shown in Table 1 (the corresponding results calculated at the TD-ωB97X/6-31G* (Chai and Head-Gordon, 2008) level are shown in Supplementary Table S2 of the Supporting Information). It can be seen that the results of the EE-GMFCC method are in good agreement with the full-system calculations for all excited-state properties of different model systems.

TABLE 1

TABLE 1. Transition electric dipole moment (TEDM) calculated by EE-GMFCC and corresponding full-system calculations at the TD-HF/6-31G* level for different model systems with different distance thresholds $λ_{F S}$ . μ_x, μ_y, and μ_z are the calculated TEDM values in x, y, and z directions, respectively. ω and f are the excitation energy and the corresponding oscillator strength, respectively. Here, the $λ_{2 B}$ employed in EE-GMFCC calculations for different model system was set to the same value as the $λ_{F S}$ .

The correlation of the calculated atomic forces for the fluorophore molecule between the EE-GMFCC and full-system calculations is shown in Figure 5A. The results demonstrate that the EE-GMFCC method could reproduce well the corresponding atomic forces from full-system calculations with a mean unsigned error lower than 0.001 hartree/bohr. The absolute errors of the calculated atomic forces in x, y, and z directions using EE-GMFCC-1B and EE-GMFCC-2B with reference to full-system calculations are shown in Figures 5B–D, respectively. The results show that EE-GMFCC-2B could provide more accurate results for atomic forces than the EE-GFMCC-1B treatment, indicating that accounting for the QM effects from the local chemical environment is essential for calculations of atomic forces.

FIGURE 5

FIGURE 5. (A) Correlation of the calculated atomic forces between the EE-GMFCC method and truncated full-system calculations. F_x, F_y, and F_z are the calculated atomic forces for all atoms of the fluorophore in x, y, and z directions, respectively. The calculations were performed at the TD-HF/6-31G* level. The dashed line is the strict correlation curve. The mean unsigned error (MUE) between the EE-GMFCC and the truncated full-system method is 0.00022 hartree/bohr. (B) The unsigned error for F_x between the EE-GMFCC method and truncated full-system calculations under the one-body (1B) and two-body (2B) treatment. (C) Similar to panel b but for F_y. (D) Similar to panel b but for F_z.

Computational Efficiency of EE-GMFCC

Figure 6 shows the comparison of the CPU time for excitation energy calculations of different Mango-II RNA aptamer systems on the Intel Xeon Gold 6,130 2.1-GHz processor with the full-system calculations and EE-GMFCC approach at the TD-HF/6-31G* and TD-ωB97X/6-31G* levels, respectively. As expected, the computational scale of the EE-GMFCC approach shows O(N) as a function of the number of atoms in the system, while the computational cost for the traditional full-system TD-HF and TD-DFT calculations exhibits O ( $N^{2.61}$ ) and O ( $N^{2.80}$ ), respectively. The obtained regression equation for the full-system calculations at the TD-HF/6-31G* level is $y = 0.00451 x^{2.61}$ . In contrast, it is y = 7.862x for EE-GMFCC. As mentioned above, the calculated excited-state properties could reach convergence when $λ_{FS}$ is set to 4 Å. For the model system constructed with the distance threshold of $λ_{FS}$ = 4 Å the full-system calculation took 22,729 min of CPU time at the TD-HF/6-31G* level. In contrast, the EE-GMFCC method took only 2,957 min. The scale of the EE-GMFCC method at TD-ωB97X/6-31G* has a similar trend as compared to that at the TD-HF/6-31G* level (see Figure 6B).

FIGURE 6

FIGURE 6. CPU time for conventional full-system and EE-GMFCC calculations as a function of the number of atoms of the truncated model systems. The calculations were performed (A) at the TD-HF/6-31G* level and (B) at the TD-ωB97X/6-31G* level, respectively.

Prediction of the Relative Excitation Energies Using EE-GMFCC

The performance of the EE-GMFCC method on prediction of the relative excitation energies of different configurations was also investigated. A 100-ns classical MD simulation was first carried out on the Mango II RNA in explicit water solvent, and then excitation energy calculations were performed with both the EE-GMFCC and truncated full-system approaches on 10 different configurations of the Mango II RNA system extracted from the MD simulation trajectory.

The calculated excitation energies at the TD-HF/6-31G* level are shown in Table 2 and Figure 7. It can be seen that the predicted relative excitation energies by EE-GMFCC-2B show good agreement with those from full-system calculations with the mean unsigned deviation (MUD) of 0.02 eV. The calculated relative excitation energies of the 10 different configurations with EE-GMFCC-1B are also shown in Table 2, which shows larger errors compared to EE-GMFCC-2B with reference to full-system calculations (MUD = 0.08 eV). The results demonstrate that QM treatment of the RNA local chemical environment is essential for accurate calculation of both absolute and relative excited-state properties. The excitation energies calculated at the TD-ωB97X/6-31G* level are shown in Supplementary Table S3 of the Supporting Information.

TABLE 2

TABLE 2. Predicted excitation energies for 10 different configurations generated from the 100-ns classical MD simulation for the fluorescent RNA–aptamer (PDB id: 6C63) system using the EE-GMFCC and truncated full-system calculations at the TD-HF/6-31G* level. The model systems were constructed with $λ_{FS}$ = 4 Å for the 10 different configurations. A distance threshold $λ_{2 B}$ = 4 Å was employed for two-body QM calculations in EE-GMFCC.

FIGURE 7

FIGURE 7. Comparison of the calculated excitation energies for 10 different configurations of the fluorescent RNA–aptamer (PDB id: 6C63) system generated from the 100-ns MD simulation between the EE-GMFCC-2B approach and the truncated full-system calculations at the TD-HF/6-31G* level. $λ_{2 B}$ was set to 4 Å for the two-body QM calculations in EE-GMFCC. $λ_{FS}$ was set to 4 Å for the truncated full-system calculations.

Ribonucleotide-Based Decomposition of Excitation Energies

Investigation of the ribonucleotide-based decomposition of the excitation energy around the fluorescent molecule (Shen et al., 2021) is essential for finding the so-called hotspots and attendant rational design of the fluorescent RNA–aptamer complex using the point mutation technology. Herein, the fragmentation QM method was utilized to decompose the contribution of each ribonucleotide to the excitation energy. Since the electrostatic embedding treatment in the EE-GMFCC method would incorporate many-body effects, which obscures the individual contribution, the GMFCC scheme (without the electrostatic embedding field) was thus employed. The fragmentation treatment of the GMFCC method for RNA (or protein) systems is the same as that of the EE-GMFCC approach. However, in QM calculation of each fragment with the GMFCC scheme, the background charges were not introduced as compared to the EE-GMFCC method. The influence of each ribonucleotide around the fluorophore molecule on the calculated excitation energy was predicted by GMFCC. The results of 10 different configurations extracted from the 100-ns MD simulation were utilized to approximately represent the ensemble-averaged value due to the expensive computational cost. The excitation energies were calculated at the TD-HF/6-31G* and TD-ωB97X/6-31G* levels, respectively.

The decomposition of the excitation energy of the fluorescent RNA–aptamer (PDB id: 6C63) is shown in Table 3 and Figure 8. One can see that the G13, A17, and G29 ribonucleotides contribute mostly to the calculated excitation energy. However, the G13 and A17 ribonucleotides give blue-shift contributions to the excitation spectrum, and the G29 ribonucleotide gives a red-shift contribution to the excitation spectrum. As shown in Figure 8, the spatial positions of those three ribonucleotides are close to the fluorophore molecule, and the G13 and G29 ribonucleotides locate at the right and left sides of the fluorophore molecule, respectively. The opposite effects of the two ribonucleotides (G13 and G29) on the excitation energy indicate the importance of relative spatial location.

TABLE 3

TABLE 3. Contributions of the ribonucleotides in close contact with the fluorophore to the calculated excitation energy predicted by the GMFCC approach, based on 10 snapshots extracted from the 100-ns MD simulation. The calculations were performed at the TD-HF/6-31G* and TD-ωB97X/6-31G* levels, respectively. “Ex” is the calculated excitation energy (in eV) for the fluorophore molecule of EKJ37 or the two-body (2B) molecular species consisting of one ribonucleotide in RNA and EKJ37 (shown in ribonucleotide name in the table). “ΔEx” represents the excitation energy difference between the two-body (2B) molecular fragment and EKJ37, and ΔWL represents the wavelength difference (in nm) converted from ΔEx.

FIGURE 8

FIGURE 8. Decomposed excitation energy contributions of some ribonucleotides close to the fluorophore molecule using the GMFCC method. The results are the average values calculated on 10 snapshots extracted from the 100-ns MD simulation every 10 ns. The contributions were converted into wavelengths and presented by the color between blue and red. Ribonucleotides with positive 2B QM corrections have blue shifts of the absorption spectrum and are colored in blue, while ribonucleotides with negative 2B QM corrections have red shifts and are colored in red. EKJ is the fluorophore in this RNA system. The wavelength contribution of each ribonucleotide with bold comes from TD-HF/6-31G* calculations, while those without bold come from TD-ωB97X/6-31G* results, respectively.

Previous theoretical and experimental studies (Park and Rhee, 2016; Hagras and Glover, 2018; Langeland et al., 2018; Jin et al., 2020; Romei et al., 2020) on the GFP have emphasized the significant influence of the electrostatic effect from the protein environment on the fluorescence of the chromophore. Here, the two-body fragments constructed using the GMFCC method were utilized to further investigate the possible physical origins of the influence of the environment. The excitation energies for a series of two-body fragments were calculated using full QM and QM/MM methods, respectively. The rest of the RNA was excluded in all of those calculations to avoid the ambiguity caused by multiple interactions from the complex environment.

The results of QM/MM calculations are usually affected by the MM parameters for mimicking the atomic point charges. For more accurately reproducing the classical electrostatic effect of the adjacent ribonucleotide on the excited-state properties of the fluorescent molecule in the full 2-body QM calculation, the excited-state calculations were first performed with the full-system QM method (the given two-body fragment including the corresponding adjacent ribonucleotide and the fluorescent molecule was treated by the QM method), and then the obtained ESP charges were utilized in the QM/MM calculations to serve as the background charges (the results were labeled as QM/ESP). For investigating the parameter dependence of the QM/MM calculations, the excited-state calculations were also performed with the adjacent ribonucleotide represented by the ff99OL3 force field.

As shown in Supplementary Table S5 of the Supporting Information, the results of the QM/ESP method are different from those of the QM/OL3 calculations, and the largest deviation between the two methods is up to 0.042 eV (the fluorophore-A15 fragment of the 6UP0 chain-C system), indicating the significant MM parameter dependence of the excited-state QM/MM calculations on the fluorescent RNA system. Since the ESP charges used in the excited-state QM/ESP calculations were taken from the full QM calculations, they could be taken as the good representation of the classical electrostatic interactions. While a significant difference can be found between the QM and QM/ESP methods, the deviations between the two methods are up to 0.094 eV for the fluorophore-G14 fragment, and 0.093 eV for the fluorophore-A15 fragment of the 6UP0 chain-C system, respectively. However, both treatments (ESP and ff99OL3 representations for mimicking the MM point charges) for the QM/MM calculations could give the correct direction of the change of the calculated excited-state energies with reference to the full QM calculations for most of the two-body fragments except the fluorophore-G10 fragment of the 6UP0 chain-C system and the fluorophore-A12 fragment of the 6C63 system with small two-body effects, indicating the important influence of the classical electrostatic interactions on the calculation of the excited-state energies.

Accordingly, the opposite effects (the blue and red shifts) of the same kind of ribonucleotides (G13 and G29, A12, and A17) on the excitation energy (see Figure 8) might be explained by the fact that the fluorescent molecule experiences the electric field in the opposite directions exerted by those adjacent ribonucleotides, which is consistent with the previous study by Park and Rhee (Park and Rhee, 2016). Overall, the results demonstrate that both the classical Coulomb interaction and the quantum exchange effects play significant roles in the calculations of the excited-state energies for the RNA–aptamer systems.

The Application of the EE-GMFCC Method for Other Fluorescent RNA–Aptamer Systems

In order to test the performance of the EE-GMFCC method on different fluorescent RNA–aptamer systems, the calculations of the excitation energies for seven other fluorescent RNA–aptamer systems taken from the PDB (shown in Figure 9) were performed using the EE-GMFCC and truncated full-system methods, respectively. The calculated excitation energies using the EE-GMFCC and full-system calculations are shown in Table 4 and Figure 10. One can see from Table 4 that EE-GMFCC-2B can give an accurate excitation energy prediction for all the fluorescent RNA–aptamer systems at the TD-HF/6-31G* level, as compared to the truncated full-system calculations, with the MUD of 0.024 eV, which demonstrates that the EE-GMFCC method is a general approach for an accurate prediction of the excited-state properties of the fluorescent RNA–aptamer systems. In contrast, the MUD of the EE-GMFCC-1B results is 0.145 eV with reference to the full-system calculations, and the absolute deviations between the EE-GMFCC-1B and full-system calculations could reach up to 0.228, 0.245, and 0.218 eV for the systems of 6UP0 chain-C, 6UP0 chain-D, and 6E8S, respectively, indicating the importance of the QM treatment of the local chemical environment in the calculation of the excited-state properties. Therefore, the EE-GMFCC-2B method is recommended to be employed in the study requiring a highly accurate prediction of excitation energies, while the EE-GMFCC-1B approach can be applied in qualitative or semiquantitative studies for efficiency. The calculated excitation energy results at the TD-ωB97X/6-31G* level are shown in Supplementary Table S4 of the Supporting Information.

FIGURE 9

FIGURE 9. Three-dimensional structures of seven different fluorescent RNA–aptamer systems. The fluorophore is shown in the yellow stick model.

TABLE 4

TABLE 4. Comparison of the calculated excitation energies for a series of fluorescent RNA–aptamer systems between the EE-GMFCC ( $λ_{2 B}$ = 4 Å) and truncated full-system calculations at the TD-HF/6-31G* level. The model systems constructed with $λ_{FS}$ = 4 Å for all fluorescent RNA–aptamer systems are utilized here.

FIGURE 10

FIGURE 10. Comparison of the excitation energies of eight different fluorescent RNA systems between the EE-GMFCC approach and the truncated full-system calculations at the TD-HF/6-31G* level. The distance threshold $λ_{2 B}$ was set to 4.0 Å for the two-body QM calculations in the EE-GMFCC method. $λ_{FS}$ was set to 4 Å for the truncated full-system approach.

For further illustrating the performance of the EE-GMFCC-2B method on predicting the relative excitation energies between different configurations and different systems of the RNA–aptamer complex, the correlations of calculated excitation energies between the EE-GMFCC (1B and 2B) method and truncated full-system calculations at the TD-HF/6-31G* level are plotted in Supplementary Figure S3 of the Supporting Information, for different configurations of the 6C63 system (Supplementary Figure S3a) and different fluorophore RNA–aptamer systems (Supplementary Figure S3b), respectively. The results show that the EE-GMFCC-2B method gives a better correlation with the truncated full-system calculations than the EE-GMFCC-1B method. The correlation coefficients (R² (Dolgosheina et al., 2014)) of the EE-GMFCC-2B method are 0.937 for different configurations of 6C63, and 0.998 for different fluorophore RNA–aptamer systems, respectively, while the correlation coefficients (R² (Dolgosheina et al., 2014)) of the EE-GMFCC-1B method are only 0.909 and 0.878, respectively. The results demonstrate that the EE-GMFCC-2B method is capable of providing a better description of the relative excitation energies for the RNA–aptamer system than the EE-GMFCC-1B method.

Conclusion

In this study, the electrostatically embedded generalized molecular fractionation with conjugate caps (EE-GMFCC) method was applied to calculations of the excited-state properties of the fluorescent RNA aptamer systems. The two-body fragment QM calculations were utilized to account for the QM effect from the local RNA chemical environment on the excited-state properties of the fluorescent molecule. The benchmark study on the Mango-II RNA aptamer system demonstrated that EE-GMFCC could give good agreement with traditional full-system QM calculations of the absolute and relative excitation energies, and the 4 Å distance threshold for the two-body QM calculations could strike a good balance between the attained accuracy and the computational expense incurred for the EE-GMFCC method. Furthermore, the EE-GMFCC method could provide an accurate prediction of other excited-state properties, namely, TEDM and atomic forces. This work demonstrated that incorporating the QM effects of a local RNA chemical environment was essential for an accurate prediction of the excited-state properties of the fluorescent molecule in RNAs, and hundreds of atoms were usually required to be treated with electronic structure theories. It is challenging for the traditional full-system QM calculation to handle such large systems due to the expensive computational cost. In contrast, the computational cost of the EE-GMFCC method is linear-scaling with a low prefactor, and thus the EE-GMFCC approach is computationally efficient, which could be applied for tackling the macromolecular systems. The applications of the EE-GMFCC method in calculations of the excitation energies for different fluorescent RNA aptamer systems demonstrate that the EE-GMFCC is a general approach for the excited-state property calculations of large complex molecular systems.

Supporting Information

Illustration of the EE-GMFCC fragmentation scheme; Calculated excitation energies as a function of the distance threshold; Calculated TEDM at the TD-ωB97X/6-31G* level for different model systems using the truncated full-system and EE-GMFCC method; The relative excitation energies for different configurations of the RNA system (pdb id: 6C63) predicted by the EE-GMFCC method and truncated full-system calculations at the TD-ωB97X/6-31G* level; The relative excitation energies for different RNA systems predicted by the EE-GMFCC method and truncated full-system calculations at the TD-ωB97X/6-31G* level.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Author Contributions

XH designed the research; CS, and XW performed the research; CS, XW, and XH analyzed the data; and CS, XW, and XH wrote the paper.

Funding

This work was supported by the National Key R and D Program of China (Grant Nos. 2019YFA0905200, and 2016YFA0501700), National Natural Science Foundation of China (Grant Nos. 21922301, 21761132022, and 21703206), and Fundamental Research Funds for the Central Universities.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

The reviewer TZ declared a shared affiliation, with the authors CS and XH to the handling editor at the time of the review

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Acknowledgments

We also thank the Supercomputer Center of East China Normal University (ECNU Multifunctional Platform for Innovation 001) for providing computer resources.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fchem.2021.801062/full#supplementary-material

References

Adamo, C., and Jacquemin, D. (2013). The Calculations of Excited-State Properties with Time-dependent Density Functional Theory. Chem. Soc. Rev. 42 (3), 845–856. doi:10.1039/c2cs35394f

PubMed Abstract | CrossRef Full Text | Google Scholar

Autour, A., C. Y. Jeng, S., D. Cawte, A., Abdolahzadeh, A., Galli, A., Panchapakesan, S. S. S., et al. (2018). Fluorogenic RNA Mango Aptamers for Imaging Small Non-coding RNAs in Mammalian Cells. Nat. Commun. 9 (1), 656. doi:10.1038/s41467-018-02993-8

PubMed Abstract | CrossRef Full Text | Google Scholar

Case, D. A., Ben-Shalom, I. Y., Brozell, S. R., Cerutti, D. S., Cheatham, T. E., Cruzeiro, V. W. D., et al. (2018). AMBER18. San Francisco, CA, USA: University of California.

Google Scholar

Case, D. A., Cheatham, T. E., Darden, T., Gohlke, H., Luo, R., Merz, K. M., et al. (2005). The Amber Biomolecular Simulation Programs. J. Comput. Chem. 26 (16), 1668–1688. doi:10.1002/jcc.20290

CrossRef Full Text | Google Scholar

Chai, J.-D., and Head-Gordon, M. (2008). Systematic Optimization of Long-Range Corrected Hybrid Density Functionals. J. Chem. Phys. 128 (8), 084106. doi:10.1063/1.2834918

CrossRef Full Text | Google Scholar

Chiba, M., Fedorov, D. G., and Kitaura, K. (2007). Time-dependent Density Functional Theory Based upon the Fragment Molecular Orbital Method. J. Chem. Phys. 127 (10), 104108. doi:10.1063/1.2772850

CrossRef Full Text | Google Scholar

Christiansen, O., Koch, H., and Jørgensen, P. (1995). The Second-Order Approximate Coupled Cluster Singles and Doubles Model CC2. Chem. Phys. Lett. 243, 409–418. doi:10.1016/0009-2614(95)00841-q

CrossRef Full Text | Google Scholar

Collins, M. A., and Bettens, R. P. A. (2015). Energy-Based Molecular Fragmentation Methods. Chem. Rev. 115 (12), 5607–5642. doi:10.1021/cr500455b

PubMed Abstract | CrossRef Full Text | Google Scholar

Creemers, T. M., Lock, A. J., Subramaniam, V., Jovin, T. M., and Völker, S. (1999). Three Photoconvertible Forms of green Fluorescent Protein Identified by Spectral Hole-Burning. Nat. Struct. Biol. 6, 706–560. doi:10.1038/10763

PubMed Abstract | CrossRef Full Text | Google Scholar

Creemers, T. M. H., Lock, A. J., Subramaniam, V., Jovin, T. M., and Volker, S. (2000). Photophysics and Optical Switching in green Fluorescent Protein Mutants. Proc. Natl. Acad. Sci. 97, 2974–2978. doi:10.1073/pnas.97.7.2974

PubMed Abstract | CrossRef Full Text | Google Scholar

Daday, C., König, C., Valsson, O., Neugebauer, J., and Filippi, C. (2013). State-Specific Embedding Potentials for Excitation-Energy Calculations. J. Chem. Theor. Comput. 9 (5), 2355–2367. doi:10.1021/ct400086a

PubMed Abstract | CrossRef Full Text | Google Scholar

Dahlke, E. E., and Truhlar, D. G. (2007). Electrostatically Embedded Many-Body Expansion for Large Systems, with Applications to Water Clusters. J. Chem. Theor. Comput. 3, 46–53. doi:10.1021/ct600253j

PubMed Abstract | CrossRef Full Text | Google Scholar

Dolgosheina, E. V., Jeng, S. C. Y., Panchapakesan, S. S. S., Cojocaru, R., Chen, P. S. K., Wilson, P. D., et al. (2014). RNA Mango Aptamer-Fluorophore: a Bright, High-Affinity Complex for RNA Labeling and Tracking. ACS Chem. Biol. 9 (10), 2412–2420. doi:10.1021/cb500499x

PubMed Abstract | CrossRef Full Text | Google Scholar

Finley, J., Malmqvist, P.-Å, Roos, B. O., and Serrano-Andrés, L. (1998). The Multi-State CASPT2 Method. Chem. Phys. Lett. 288, 299–306. doi:10.1016/s0009-2614(98)00252-8

CrossRef Full Text | Google Scholar

Foresman, J. B., Head-Gordon, M., Pople, J. A., and Frisch, M. J. (1992). Toward a Systematic Molecular Orbital Theory for Excited States. J. Phys. Chem. 96, 135–149. doi:10.1021/j100180a030

CrossRef Full Text | Google Scholar

Frisch, M. J., Trucks, G. W., Schlegel, H. B., Scuseria, G. E., Robb, M. A., Cheeseman, J. R., et al. (2016). Gaussian 16, Revision A.03. Wallingford, CT, USA: Gaussian, Inc.

Google Scholar

Gross, E. K. U., and Kohn, W. (1990). Time-dependent Density-Functional Theory. Advances in Quantum Chemistry 21. 255–291. doi:10.1016/s0065-3276(08)60600-0

CrossRef Full Text | Google Scholar

Hagras, M. A., and Glover, W. J. (2018). Polarizable Embedding for Excited-State Reactions: Dynamically Weighted Polarizable QM/MM. J. Chem. Theor. Comput. 14 (4), 2137–2144. doi:10.1021/acs.jctc.8b00064

PubMed Abstract | CrossRef Full Text | Google Scholar

He, X., and Zhang, J. Z. H. (2006). The Generalized Molecular Fractionation with Conjugate Caps/molecular Mechanics Method for Direct Calculation of Protein Energy. J. Chem. Phys. 124 (18), 184703. doi:10.1063/1.2194535

CrossRef Full Text | Google Scholar

He, X., Zhu, T., Wang, X., Liu, J., and Zhang, J. Z. H. (2014). Fragment Quantum Mechanical Calculation of Proteins and its Applications. Acc. Chem. Res. 47 (9), 2748–2757. doi:10.1021/ar500077t

PubMed Abstract | CrossRef Full Text | Google Scholar

Hirata, S., Valiev, M., Dupuis, M., Xantheas, S. S., Sugiki, S., and Sekino, H. (2005). Fast Electron Correlation Methods for Molecular Clusters in the Ground and Excited States. Mol. Phys. 103 (15-16), 2255–2265. doi:10.1080/00268970500083788

CrossRef Full Text | Google Scholar

Isborn, C. M., Götz, A. W., Clark, M. A., Walker, R. C., and Martínez, T. J. (2012). Electronic Absorption Spectra from MM and Ab Initio QM/MM Molecular Dynamics: Environmental Effects on the Absorption Spectrum of Photoactive Yellow Protein. J. Chem. Theor. Comput. 8 (12), 5092–5106. doi:10.1021/ct3006826

PubMed Abstract | CrossRef Full Text | Google Scholar

Jakalian, A., Bush, B. L., Jack, D. B., and Bayly, C. I. (2000). Fast, Efficient Generation of High-Quality Atomic Charges. AM1-BCC Model: I. Method. J. Comput. Chem. 21, 132–146. doi:10.1002/(sici)1096-987x(20000130)21:2<132:aid-jcc5>3.0.co;2-p

CrossRef Full Text | Google Scholar

Jin, X., Glover, W. J., and He, X. (2020). Fragment Quantum Mechanical Method for Excited States of Proteins: Development and Application to the Green Fluorescent Protein. J. Chem. Theor. Comput. 16 (8), 5174–5188. doi:10.1021/acs.jctc.9b00980

PubMed Abstract | CrossRef Full Text | Google Scholar

Jin, X., Zhang, J. Z. H., and He, X. (2017). Full QM Calculation of RNA Energy Using Electrostatically Embedded Generalized Molecular Fractionation with Conjugate Caps Method. J. Phys. Chem. A. 121 (12), 2503–2514. doi:10.1021/acs.jpca.7b00859

PubMed Abstract | CrossRef Full Text | Google Scholar

Khait, Y. G., and Hoffmann, M. R. (2010). Embedding Theory for Excited States. J. Chem. Phys. 133 (4), 044107. doi:10.1063/1.3460594

CrossRef Full Text | Google Scholar

Kluner, T., Govind, N., Wang, Y. A., and Carter, E. A. (2011). Quantum Mechanical Embedding Theory Based on a Unique Embedding Potential. J. Chem. Phys. 134 (15), 154110.

Google Scholar

Kocherzhenko, A. A., Sosa Vazquez, X. A., Milanese, J. M., and Isborn, C. M. (2017). Absorption Spectra for Disordered Aggregates of Chromophores Using the Exciton Model. J. Chem. Theor. Comput. 13 (8), 3787–3801. doi:10.1021/acs.jctc.7b00477

PubMed Abstract | CrossRef Full Text | Google Scholar

Langeland, J., Kjær, C., Andersen, L. H., and Brøndsted Nielsen, S. (2018). The Effect of an Electric Field on the Spectroscopic Properties of the Isolated Green Fluorescent Protein Chromophore Anion. ChemPhysChem 19 (14), 1686–1690. doi:10.1002/cphc.201800225

PubMed Abstract | CrossRef Full Text | Google Scholar

Li, W., Li, Y., Lin, R., and Li, S. (2016). Generalized Energy-Based Fragmentation Approach for Localized Excited States of Large Systems. J. Phys. Chem. A. 120 (48), 9667–9677. doi:10.1021/acs.jpca.6b11193

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, J., and He, X. (2017). Accurate Prediction of Energetic Properties of Ionic Liquid Clusters Using a Fragment-Based Quantum Mechanical Method. Phys. Chem. Chem. Phys. 19 (31), 20657–20666. doi:10.1039/c7cp03356g

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, J., and He, X. (2020). Fragment-based Quantum Mechanical Approach to Biomolecules, Molecular Clusters, Molecular Crystals and Liquids. Phys. Chem. Chem. Phys. 22 (22), 12341–12367. doi:10.1039/d0cp01095b

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, J., Qi, L.-W., Zhang, J. Z. H., and He, X. (2017). Fragment Quantum Mechanical Method for Large-Sized Ion-Water Clusters. J. Chem. Theor. Comput. 13 (5), 2021–2034. doi:10.1021/acs.jctc.7b00149

PubMed Abstract | CrossRef Full Text | Google Scholar

Liu, J., Sun, H., Glover, W. J., and He, X. (2019). Prediction of Excited-State Properties of Oligoacene Crystals Using Fragment-Based Quantum Mechanical Method. J. Phys. Chem. A. 123 (26), 5407–5417. doi:10.1021/acs.jpca.8b12552

PubMed Abstract | CrossRef Full Text | Google Scholar

McLachlan, A. D., and Ball, M. A. (1964). Time-Dependent Hartree-Fock Theory for Molecules. Rev. Mod. Phys. 36 (3), 844–855. doi:10.1103/revmodphys.36.844

CrossRef Full Text | Google Scholar

Milanese, J. M., Provorse, M. R., Alameda, E., and Isborn, C. M. (2017). Convergence of Computed Aqueous Absorption Spectra with Explicit Quantum Mechanical Solvent. J. Chem. Theor. Comput. 13 (5), 2159–2171. doi:10.1021/acs.jctc.7b00159

PubMed Abstract | CrossRef Full Text | Google Scholar

Nakai, H., and Yoshikawa, T. (2017). Development of an Excited-State Calculation Method for Large Systems Using Dynamical Polarizability: A divide-and-conquer Approach at the Time-dependent Density Functional Level. J. Chem. Phys. 146 (12), 124123. doi:10.1063/1.4978952

CrossRef Full Text | Google Scholar

Nakata, H., Fedorov, D. G., Yokojima, S., Kitaura, K., Sakurai, M., and Nakamura, S. (2014). Unrestricted Density Functional Theory Based on the Fragment Molecular Orbital Method for the Ground and Excited State Calculations of Large Systems. J. Chem. Phys. 140 (14), 144101. doi:10.1063/1.4870261

CrossRef Full Text | Google Scholar

Nakatsuji, H. (1979). Cluster Expansion of the Wavefunction. Electron Correlations in Ground and Excited States by SAC (Symmetry-adapted-cluster) and SAC CI Theories. Chem. Phys. Lett. 67 (2-3), 329–333. doi:10.1016/0009-2614(79)85172-6

CrossRef Full Text | Google Scholar

Park, J. W., and Rhee, Y. M. (2016). Electric Field Keeps Chromophore Planar and Produces High Yield Fluorescence in Green Fluorescent Protein. J. Am. Chem. Soc. 138 (41), 13619–13629. doi:10.1021/jacs.6b06833

PubMed Abstract | CrossRef Full Text | Google Scholar

Pérez, A., Marchán, I., Svozil, D., Sponer, J., Cheatham, T. E., Laughton, C. A., et al. (2007). Refinement of the AMBER Force Field for Nucleic Acids: Improving the Description of α/γ Conformers. Biophysical J. 92 (11), 3817–3829. doi:10.1529/biophysj.106.097782

CrossRef Full Text | Google Scholar

Raghavachari, K., and Saha, A. (2015). Accurate Composite and Fragment-Based Quantum Chemical Models for Large Molecules. Chem. Rev. 115 (12), 5643–5677. doi:10.1021/cr500606e

PubMed Abstract | CrossRef Full Text | Google Scholar

Romei, M. G., Lin, C.-Y., Mathews, I. I., and Boxer, S. G. (2020). Electrostatic Control of Photoisomerization Pathways in Proteins. Science 367, 76–79. doi:10.1126/science.aax1898

PubMed Abstract | CrossRef Full Text | Google Scholar

Salomon-Ferrer, R., Case, D. A., and Walker, R. C. (2013). An Overview of the Amber Biomolecular Simulation Package. Wires Comput. Mol. Sci. 3 (2), 198–210. doi:10.1002/wcms.1121

CrossRef Full Text | Google Scholar

Sarid, D., Coratger, R., Ajustron, F., and Beauvillain, J. (1991). Scanning Force Microscopy - with Applications to Electric, Magnetic and Atomic Forces. Microsc. Microanal. Microstruct. 2 (6), 649. doi:10.1051/mmm:0199100206064900

CrossRef Full Text | Google Scholar

Scholz, L., and Neugebauer, J. (2021). Protein Response Effects on Cofactor Excitation Energies from First Principles: Augmenting Subsystem Time-dependent Density-Functional Theory with Many-Body Expansion Techniques. J. Chem. Theor. Comput. 17 (10), 6105–6121. doi:10.1021/acs.jctc.1c00551

CrossRef Full Text | Google Scholar

Shen, C., Jin, X., Glover, W. J., and He, X. (2021). Accurate Prediction of Absorption Spectral Shifts of Proteorhodopsin Using a Fragment-Based Quantum Mechanical Method. Molecules 26 (15). doi:10.3390/molecules26154486

CrossRef Full Text | Google Scholar

Stanton, J. F., and Bartlett, R. J. (1993). The Equation of Motion Coupled‐cluster Method. A Systematic Biorthogonal Approach to Molecular Excitation Energies, Transition Probabilities, and Excited State Properties. J. Chem. Phys. 98 (9), 7029–7039. doi:10.1063/1.464746

CrossRef Full Text | Google Scholar

Tanabe, Y., Moriya, T., and Sugano, S. (1965). Magnon-induced Electric Dipole Transition Moment. Phys. Rev. Lett. 15 (26), 1023–1025. doi:10.1103/physrevlett.15.1023

CrossRef Full Text | Google Scholar

Trachman, R. J., Abdolahzadeh, A., Andreoni, A., Cojocaru, R., Knutson, J. R., Ryckelynck, M., et al. (2018). Crystal Structures of the Mango-II RNA Aptamer Reveal Heterogeneous Fluorophore Binding and Guide Engineering of Variants with Improved Selectivity and Brightness. Biochemistry 57 (26), 3544–3548. doi:10.1021/acs.biochem.8b00399

PubMed Abstract | CrossRef Full Text | Google Scholar

Trachman, R. J., Autour, A., Jeng, S. C. Y., Abdolahzadeh, A., Andreoni, A., Cojocaru, R., et al. (2019). Structure and Functional Reselection of the Mango-III Fluorogenic RNA Aptamer. Nat. Chem. Biol. 15 (5), 472–479. doi:10.1038/s41589-019-0267-9

PubMed Abstract | CrossRef Full Text | Google Scholar

Verma, M., Jayich, A. M., and Vutha, A. C. (2020). Electron Electric Dipole Moment Searches Using Clock Transitions in Ultracold Molecules. Phys. Rev. Lett. 125 (15), 153201. doi:10.1103/physrevlett.125.153201

PubMed Abstract | CrossRef Full Text | Google Scholar

Walker, R. C., Crowley, M. F., and Case, D. A. (2008). The Implementation of a Fast and Accurate QM/MM Potential Method in Amber. J. Comput. Chem. 29, 1019–1031. doi:10.1002/jcc.20857

CrossRef Full Text | Google Scholar

Wang, J., Cieplak, P., and Kollman, P. A. (2000). How Well Does a Restrained Electrostatic Potential (RESP) Model Perform in Calculating Conformational Energies of Organic and Biological Molecules? J. Comput. Chem. 21, 1049–1074. doi:10.1002/1096-987x(200009)21:12<1049:aid-jcc3>3.0.co;2-f

CrossRef Full Text | Google Scholar

Wang, J., Wolf, R. M., Caldwell, J. W., Kollman, P. A., and Case, D. A. (2004). Development and Testing of a General Amber Force Field. J. Comput. Chem. 25 (9), 1157–1174. doi:10.1002/jcc.20035

CrossRef Full Text | Google Scholar

Wang, X., Liu, J., Zhang, J. Z. H., and He, X. (2013). Electrostatically Embedded Generalized Molecular Fractionation with Conjugate Caps Method for Full Quantum Mechanical Calculation of Protein Energy. J. Phys. Chem. A. 117 (32), 7149–7161. doi:10.1021/jp400779t

CrossRef Full Text | Google Scholar

Wang, Y., Liu, J., Li, J., and He, X. (2018). Fragment-based Quantum Mechanical Calculation of Protein-Protein Binding Affinities. J. Comput. Chem. 39 (21), 1617–1628. doi:10.1002/jcc.25236

CrossRef Full Text | Google Scholar

Weisenhorn, A. L., Hansma, P. K., Albrecht, T. R., and Quate, C. F. (1989). Forces in Atomic Force Microscopy in Air and Water. Appl. Phys. Lett. 54 (26), 2651–2653. doi:10.1063/1.101024

CrossRef Full Text | Google Scholar

Yoshikawa, T., Kobayashi, M., Fujii, A., and Nakai, H. (2013). Novel Approach to Excited-State Calculations of Large Molecules Based on divide-and-conquer Method: Application to Photoactive Yellow Protein. J. Phys. Chem. B 117 (18), 5565–5573. doi:10.1021/jp401819d

CrossRef Full Text | Google Scholar

Zgarbová, M., Otyepka, M., Šponer, J., Mládek, A., Banáš, P., Cheatham, T. E., et al. (2011). Refinement of the Cornell et al. Nucleic Acids Force Field Based on Reference Quantum Chemical Calculations of Glycosidic Torsion Profiles. J. Chem. Theor. Comput. 7 (9), 2886–2902. doi:10.1021/ct200162x

CrossRef Full Text | Google Scholar

Zhang, W., Liu, J., Jin, X., Gu, X., Zeng, X. C., He, X., et al. (2020). Quantitative Prediction of Aggregation‐Induced Emission: A Full Quantum Mechanical Approach to the Optical Spectra. Angew. Chem. Int. Ed. 59 (28), 11550–11555. doi:10.1002/anie.202003326

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: fluorescent RNA, fragment-based quantum mechanical method, excited-state properties, molecular dynamics simulation, molecular modelling

Citation: Shen C, Wang X and He X (2021) Fragment-Based Quantum Mechanical Calculation of Excited-State Properties of Fluorescent RNAs. Front. Chem. 9:801062. doi: 10.3389/fchem.2021.801062

Received: 24 October 2021; Accepted: 24 November 2021;
Published: 22 December 2021.

Edited by:

Peter Strizhak, National Academy of Science of Ukraine, Ukraine

Reviewed by:

Tong Zhu, East China Normal University, China
Haibo Ma, Nanjing University, China

Copyright © 2021 Shen, Wang and He. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Xianwei Wang, eHd3YW5nQHpqdXQuZWR1LmNu; Xiao He, eGlhb2hlQHBoeS5lY251LmVkdS5jbg==

^†ORCID: Chenfei Shen, orcid.org/0000-0001-9662-355X; Xianwei Wang, orcid.org/0000-0003-4471-426X; Xiao He, orcid.org/0000-0002-4199-8175

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.