<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article article-type="research-article" dtd-version="2.3" xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Mol. Biosci.</journal-id>
<journal-title>Frontiers in Molecular Biosciences</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Mol. Biosci.</abbrev-journal-title>
<issn pub-type="epub">2296-889X</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">756219</article-id>
<article-id pub-id-type="doi">10.3389/fmolb.2021.756219</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Molecular Biosciences</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Towards Elucidating Structure&#x2013;Spectra Relationships in Rhamnogalacturonan II: Computational Protocols for Accurate <sup>13</sup>C and <sup>1</sup>H Shifts for Apiose and Its Borate Esters</article-title>
<alt-title alt-title-type="left-running-head">Bharadwaj et&#x20;al.</alt-title>
<alt-title alt-title-type="right-running-head">Computational Apio-Borate NMR Shift Predictions</alt-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Bharadwaj</surname>
<given-names>Vivek S.</given-names>
</name>
<xref ref-type="corresp" rid="c001">&#x2a;</xref>
<xref ref-type="fn" rid="fn1">
<sup>&#x2020;</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/747136/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Westawker</surname>
<given-names>Luke P.</given-names>
</name>
<xref ref-type="fn" rid="fn1">
<sup>&#x2020;</sup>
</xref>
<uri xlink:href="https://loop.frontiersin.org/people/1474424/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Crowley</surname>
<given-names>Michael F.</given-names>
</name>
<uri xlink:href="https://loop.frontiersin.org/people/251246/overview"/>
</contrib>
</contrib-group>
<aff>
<institution>Renewable Resources and Enabling Sciences Center</institution>, <institution>National Renewable Energy Laboratory</institution>, <addr-line>Golden</addr-line>, <addr-line>CO</addr-line>, <country>United&#x20;States</country>
</aff>
<author-notes>
<fn fn-type="edited-by">
<p>
<bold>Edited by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/761473/overview">Yong Wang</ext-link>, Zhejiang University, China</p>
</fn>
<fn fn-type="edited-by">
<p>
<bold>Reviewed by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/69414/overview">Ioannis P. Gerothanassis</ext-link>, University of Ioannina, Greece</p>
<p>
<ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/1300551/overview">Qinghua Liao</ext-link>, University of Barcelona, Spain</p>
</fn>
<corresp id="c001">&#x2a;Correspondence: Vivek S. Bharadwaj, <email>vivek.bharadwaj@nrel.gov</email>
</corresp>
<fn fn-type="equal" id="fn1">
<label>
<sup>&#x2020;</sup>
</label>
<p>These authors have contributed equally to this&#x20;work</p>
</fn>
<fn fn-type="other">
<p>This article was submitted to Biological Modeling and Simulation, a section of the journal Frontiers in Molecular Biosciences</p>
</fn>
</author-notes>
<pub-date pub-type="epub">
<day>24</day>
<month>01</month>
<year>2022</year>
</pub-date>
<pub-date pub-type="collection">
<year>2021</year>
</pub-date>
<volume>8</volume>
<elocation-id>756219</elocation-id>
<history>
<date date-type="received">
<day>10</day>
<month>08</month>
<year>2021</year>
</date>
<date date-type="accepted">
<day>24</day>
<month>11</month>
<year>2021</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#xa9; 2022 Bharadwaj, Westawker and Crowley.</copyright-statement>
<copyright-year>2022</copyright-year>
<copyright-holder>Bharadwaj, Westawker and Crowley</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these&#x20;terms.</p>
</license>
</permissions>
<abstract>
<p>Apiose is a naturally occurring, uncommon branched-chain pentose found in plant cell walls as part of the complex polysaccharide Rhamnogalacturonan II (RG-II). The structural elucidation of the three-dimensional structure of RG-II by nuclear magnetic resonance (NMR) spectroscopy is significantly complicated by the ability of apiose to cross-link <italic>via</italic> borate ester linkages to form RG-II dimers. Here, we developed a computational approach to gain insight into the structure&#x2013;spectra relationships of apio&#x2013;borate complexes in an effort to complement experimental assignments of NMR signals in RG-II. Our protocol involved structure optimizations using density functional theory (DFT) followed by isotropic magnetic shielding constant calculations using the gauge-invariant atomic orbital (GIAO) approach to predict chemical shifts. We evaluated the accuracy of 23 different functional&#x2013;basis set (FBS) combinations with and without implicit solvation for predicting the experimental <sup>1</sup>H and <sup>13</sup>C shifts of a methyl apioside and its three borate derivatives. The computed NMR predictions were evaluated on the basis of the overall shift accuracy, relative shift ordering, and the ability to distinguish between dimers and monomers. We demonstrate that the consideration of implicit solvation during geometry optimizations in addition to the magnetic shielding constant calculations greatly increases the accuracy of NMR chemical shift predictions and can correctly reproduce the ordering of the <sup>13</sup>C shifts and yield predictions that are, on average, within 1.50&#xa0;ppm for <sup>13</sup>C and 0.12&#xa0;ppm for <sup>1</sup>H shifts for apio&#x2013;borate compounds.</p>
</abstract>
<kwd-group>
<kwd>Quantum chemistry</kwd>
<kwd>Apiose</kwd>
<kwd>Borate</kwd>
<kwd>NMR chemical shifts</kwd>
<kwd>
<sup>1</sup>H and <sup>13</sup>C</kwd>
<kwd>Gauge invariance approach</kwd>
<kwd>DFT</kwd>
<kwd>Rhamnogalacturonan II</kwd>
</kwd-group>
<contract-sponsor id="cn001">Basic Energy Sciences<named-content content-type="fundref-id">10.13039/100006151</named-content>
</contract-sponsor>
</article-meta>
</front>
<body>
<sec id="s1">
<title>Introduction</title>
<p>Computational prediction of the spectroscopic chemical shifts of nuclear magnetic resonance (NMR) can significantly improve its capability as an essential technique for the identification and characterization of complex biomolecular structures in solution (<xref ref-type="bibr" rid="B16">Duus et&#x20;al., 2000</xref>; <xref ref-type="bibr" rid="B51">W&#xfc;thrich, 2003</xref>; <xref ref-type="bibr" rid="B10">Bifulco et&#x20;al., 2007</xref>; <xref ref-type="bibr" rid="B28">Lodewyk et&#x20;al., 2012</xref>; <xref ref-type="bibr" rid="B39">Tantillo, 2013</xref>; <xref ref-type="bibr" rid="B21">Iron, 2017</xref>; <xref ref-type="bibr" rid="B25">Krivdin, 2019</xref>). NMR techniques play a vital role in the structural elucidation of one of the least structurally characterized classes of biological molecules&#x2014;carbohydrates&#x2014;since they cannot otherwise be characterized <italic>via</italic> methods such as X-ray crystallography (<xref ref-type="bibr" rid="B48">Van Halbeek, 1994</xref>; <xref ref-type="bibr" rid="B16">Duus et&#x20;al., 2000</xref>; <xref ref-type="bibr" rid="B46">Toukach and Ananikov, 2013a</xref>). Carbohydrates have a diverse set of building block residues that can be tied together with a range of inter-residue linkages to form complex macromolecules of biological significance in plants and animals (<xref ref-type="bibr" rid="B17">Duus et&#x20;al., 1999</xref>). Plants harness the sugar building blocks available in nature to build large, complex, and heterogenous polysaccharide structures that constitute the plant cell&#x20;wall.</p>
<p>Rhamnogalacturonan-II (RG-II) is a pectin molecule that exemplifies this phenomenon as the most complex polysaccharide known in nature, consisting of 12 monosaccharide units interconnected by 21 glycosidic linkages (<xref ref-type="bibr" rid="B30">Ndeh et&#x20;al., 2017</xref>; <xref ref-type="bibr" rid="B33">O&#x2019;Neill et&#x20;al., 2020</xref>). Apiose is one of the 12 monosaccharide units in RG-II and has special significance as it is involved in dimerizing RG-II <italic>via</italic> a borate&#x2013;ester cross-link (<xref ref-type="bibr" rid="B32">O&#x27;Neill et&#x20;al., 1996</xref>; <xref ref-type="bibr" rid="B9">Bharadwaj et&#x20;al., 2020</xref>). The formation of this borate&#x2013;ester cross-link is required for normal plant growth, and most RG-II found in the plant cell wall remains cross-linked (<xref ref-type="bibr" rid="B31">O&#x27;Neill et&#x20;al., 2004</xref>). The chemical structure and topology of RG-II (<xref ref-type="fig" rid="F1">Figure&#x20;1</xref>) have only been recently determined by systematically deconstructing RG-II using a multitude of linkage-specific hydrolase enzymes (<xref ref-type="bibr" rid="B30">Ndeh et&#x20;al., 2017</xref>). However, the three-dimensional (3D) structure of RG-II in solution in its monomeric and dimerized states still remains to be elucidated.</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption>
<p>Topological structure of Rhamnogalacturonan II (RG-II) depicted as linked sugar molecules, with apiose as a <italic>blue pentagon</italic>.</p>
</caption>
<graphic xlink:href="fmolb-08-756219-g001.tif"/>
</fig>
<p>Characterizing the 3D structure of complex polysaccharides using NMR is challenging due to the myriad linkages between monosaccharides that often only differ from each other in stereochemistry (<xref ref-type="bibr" rid="B16">Duus et&#x20;al., 2000</xref>). Furthermore, the diverse palette of available monosaccharides allows for structurally complex molecules, all of which share very similar chemical groups (hydroxyl groups and glycosidic linkages) resulting in NMR chemical shifts located in a much narrower spectral region than for proteins and nucleic acids (<xref ref-type="bibr" rid="B46">Toukach and Ananikov, 2013a</xref>), leading to overlaps in spectral data that limit the differentiation between specific residues within a polysaccharide (<xref ref-type="bibr" rid="B12">Bubb, 2003</xref>).</p>
<p>Computational approaches play an important role in overcoming these challenges. The use of empirical methods that are based on experimental chemical shift databases for spectral assignments may offer quick predictions, but are frequently inaccurate for carbohydrates (<xref ref-type="bibr" rid="B35">Pierens, 2014</xref>). The wide diversity of substructures and the complex 3D arrangements of polysaccharides far exceed the parameterization space employed for the development of these databases (<xref ref-type="bibr" rid="B35">Pierens, 2014</xref>). As a result, these methods are often unable to distinguish between diastereomers or account for interspatial coupling (<xref ref-type="bibr" rid="B50">Willoughby et&#x20;al., 2014</xref>; <xref ref-type="bibr" rid="B8">Benassi, 2017</xref>), greatly hindering the elucidation of stereochemistry within a polysaccharide. On the other hand, quantum mechanical (QM) approaches to predicting NMR shifts have been more successful in aiding experimentalists to improve the reliability of NMR and correlate the spectra to the structure by predicting the chemical shift values of complex compounds (<xref ref-type="bibr" rid="B35">Pierens, 2014</xref>; <xref ref-type="bibr" rid="B41">Tarazona et&#x20;al., 2018</xref>; <xref ref-type="bibr" rid="B4">Ahmed et&#x20;al., 2020</xref>). This approach has even been used to correct previously misassigned spectra that relied on experimental databases and empirical-based predictions (<xref ref-type="bibr" rid="B35">Pierens, 2014</xref>).</p>
<p>QM-based approaches use the gauge-invariant atomic orbital (GIAO) method to calculate the isotropic magnetic shielding values from electronic structure calculations (<xref ref-type="bibr" rid="B15">Ditchfield, 1974</xref>; <xref ref-type="bibr" rid="B37">Schreckenbach and Ziegler, 1995</xref>). Early studies applied the GIAO approach to predict NMR shifts for small molecules and relied on Hartree&#x2013;Fock (HF)- and post-Hartree&#x2013;Fock (post-HF)-based methods (<xref ref-type="bibr" rid="B37">Schreckenbach and Ziegler, 1995</xref>; <xref ref-type="bibr" rid="B20">Helgaker et&#x20;al., 1999</xref>). However, the high computational costs of the HF and post-HF-based <italic>ab initio</italic> methods for larger molecules, such as carbohydrates, have prompted investigations into using density functional theory (DFT) as a more tractable method (<xref ref-type="bibr" rid="B42">Taubert et&#x20;al., 2005</xref>; <xref ref-type="bibr" rid="B5">Bagno et&#x20;al., 2007</xref>; <xref ref-type="bibr" rid="B46">Toukach and Ananikov, 2013a</xref>). However, protocols that utilize DFT have been shown to have limitations (<xref ref-type="bibr" rid="B43">Teale et&#x20;al., 2013</xref>; <xref ref-type="bibr" rid="B8">Benassi, 2017</xref>), with many previous studies having large discrepancies from experimental data, including being unable to correctly predict the ordering of chemical shifts (<xref ref-type="bibr" rid="B42">Taubert et&#x20;al., 2005</xref>; <xref ref-type="bibr" rid="B5">Bagno et&#x20;al., 2007</xref>), which is a vital piece of information for spectral analysis. Additionally, several studies have struggled to replicate either <sup>1</sup>H or <sup>13</sup>C data, often with proton data showing little variation despite changes in the levels of theory while carbon data were shown to vary greatly depending on the chosen level of theory (<xref ref-type="bibr" rid="B8">Benassi, 2017</xref>).</p>
<p>It has been shown that changing the specified density functional and basis set (FBS) combination can greatly affect the level of accuracy of a computational prediction (<xref ref-type="bibr" rid="B35">Pierens, 2014</xref>). Another important factor is accounting for the effect of solvent, which can be modeled using either implicit (<xref ref-type="bibr" rid="B14">Cramer and Truhlar, 1999</xref>) or explicit (<xref ref-type="bibr" rid="B42">Taubert et&#x20;al., 2005</xref>; <xref ref-type="bibr" rid="B5">Bagno et&#x20;al., 2007</xref>) solvation models in QM calculations, molecular dynamics (MD), and molecular mechanics (MM) simulations. Implicit solvation models approximate the presence of a solvent using a dielectric constant, thereby imparting a screening effect. While explicit solvation models may be considered to be intuitively more accurate, Bagno et&#x20;al. demonstrated that the presence of explicit water molecules did not result in improved <sup>13</sup>C and <sup>1</sup>H shift predictions for glucose (<xref ref-type="bibr" rid="B5">Bagno et&#x20;al., 2007</xref>). This may be attributed to the fact that, in explicit solvation models, the shielding constant calculations are affected by the conformational distribution of the hydroxyl and hydroxymethylene groups in glucose, which in turn are impacted by the presence of explicit solvent molecules, the number of solvation shells considered, and the inherent effects of the force fields considered for water and glucose. Hence, considering the additional computational costs and the lower accuracy of explicit solvation models for NMR shift calculations, implicit solvation models were considered in this&#x20;study.</p>
<p>Here, we applied the DFT approach and evaluated the accuracy of various FBS combinations and solvation models in predicting the chemical shifts of <sup>13</sup>C NMR for four apiosyl compounds (<xref ref-type="fig" rid="F2">Figure&#x20;2</xref>). The ability to accurately predict how the chemical shifts of apiose change upon dimerization with boric acid will aid experimentalists in using NMR to probe the dimerization process of RG-II. Therefore, this study is focused on apiose and its borate esters and is not a generalized study on the efficacy of these methods for all carbohydrates. Another reason for this focus is the fact that the experimental NMR shift data from Ishii and Ono (<xref ref-type="bibr" rid="B23">Ishii and Yanagisawa, 1998</xref>; <xref ref-type="bibr" rid="B22">Ishii and Ono, 1999</xref>) for apiose and its borate esters in D<sub>2</sub>O using methanol as the <sup>13</sup>C NMR reference compound are the only data available for benchmarking the computational predictions presented&#x20;here.</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption>
<p>Compounds <bold>1</bold>&#x2013;<bold>4</bold> of methyl apioside and apio&#x2013;borate esters (<xref ref-type="bibr" rid="B22">Ishii and Ono, 1999</xref>).</p>
</caption>
<graphic xlink:href="fmolb-08-756219-g002.tif"/>
</fig>
<p>DFT calculations for chemical shift predictions involve two steps, a geometry optimization followed by an NMR shielding constant calculation, each of which may be performed using a specific FBS combination and a solvation model. Recent literature on NMR shift predictions in carbohydrates often has not accounted for solvation during geometry optimizations and has occasionally done so only during NMR shielding constant calculations (<xref ref-type="bibr" rid="B5">Bagno et&#x20;al., 2007</xref>; <xref ref-type="bibr" rid="B47">Toukach and Ananikov, 2013b</xref>; <xref ref-type="bibr" rid="B8">Benassi, 2017</xref>). We evaluated 23 FBS combinations based on the levels of theory recommended in the literature for NMR chemical shift predictions and applied them using implicit solvation models and a reference correction to study apio&#x2013;borate compounds. We demonstrate that considering implicit solvation during both the geometry optimization and the shielding constant calculation results in lower mean absolute errors (MAEs) for computational predictions with respect to the experimental data for apio&#x2013;borate compounds. Additionally, we found that while considering solvation is important during both geometry optimizations and shielding constant calculations, there was a greater positive impact from using implicit solvation during geometry optimization for both charged and charge-neutral compounds. Across all 23 FBS combinations tested, we list the top-performing FBS combinations that had lower MAEs for <sup>13</sup>C data than those reported in the literature for NMR predictions of carbohydrates (<xref ref-type="bibr" rid="B42">Taubert et&#x20;al., 2005</xref>; <xref ref-type="bibr" rid="B47">Toukach and Ananikov, 2013b</xref>; <xref ref-type="bibr" rid="B8">Benassi, 2017</xref>), while also replicating the correct ordering of chemical shifts.</p>
</sec>
<sec id="s2">
<title>Computational Methods</title>
<sec id="s2-1">
<title>Consideration of &#x3b2;-<sc>D</sc>-Apiosyl Compounds</title>
<p>All four apiosyl compounds studied here were capped with methyl groups on C1 and C3&#x2032; to resemble the connectivity of apiose to other carbohydrates within RG-II (<xref ref-type="fig" rid="F1">Figure&#x20;1</xref>). The &#x3b2;-<sc>D</sc>-methyl apiofuranoside (compound <bold>1</bold>) and its borate diol monoester (compound <bold>2</bold>) and diesters (compounds <bold>3</bold> and <bold>4</bold>) are the same as those studied by Ishii and Ono (<xref ref-type="bibr" rid="B22">Ishii and Ono, 1999</xref>). During the dimerization process, compound <bold>1</bold> first formed a borate diol monoester (compound <bold>2</bold>), followed by the formation of stereo-isomeric borate diol diesters (compounds <bold>3</bold> and <bold>4</bold>), which were observed to be present in equimolar ratios (<xref ref-type="bibr" rid="B22">Ishii and Ono, 1999</xref>).</p>
<p>The observation of Ishii et&#x20;al. that compound <bold>1</bold> makes up the majority of methyl apiofuranosides is consistent with the fact that &#x3b2;-<sc>D</sc>-apiose is the most prevalent form of apiose in nature (<xref ref-type="bibr" rid="B23">Ishii and Yanagisawa, 1998</xref>). While the &#x3b2;-<sc>D</sc>-apiose structure of compound <bold>1</bold> promotes dimerization, the &#x237a;-<sc>L</sc>-apiose structure of its diastereomer is a less common form of a methyl apiofuranoside that rarely undergoes dimerization partially due to steric hindrances during borate esterification, arising from the <italic>trans</italic>-orientation of the hydroxyl groups (<xref ref-type="bibr" rid="B22">Ishii and Ono, 1999</xref>). Although this is the case, we did predict the chemical shifts for methyl &#x237a;-<sc>L</sc>-apiofuranoside, which are discussed in <xref ref-type="sec" rid="s11">
<italic>Section S1</italic>
</xref> of the Electronic Supplementary Material (ESM).</p>
</sec>
<sec id="s2-2">
<title>Geometry Optimization and NMR Shielding Constant Calculations</title>
<p>The geometries of compounds <bold>1</bold>&#x2013;<bold>4</bold> {methyl &#x3b2;-<sc>D</sc>-apiofuranoside; methyl 3-C-(hydroxymethyl)-&#x3b2;-<sc>D</sc>-threo-tetrofuranose 2,3-borate; bis[methyl 3-C-(hydroxymethyl)-&#x3b2;-<sc>D</sc>-threo-tetrofuranose]-(S)-2,3:2&#x2032;,3&#x2032;-borate; and bis[methyl 3-C-(hydroxymethyl)-&#x3b2;-<sc>D</sc>-threo-tetrofuranose]-(<italic>R</italic>)-2,3:2&#x2032;,3&#x2032;-borate} and the reference compounds (acetone and methanol) were optimized using Gaussian16 (<xref ref-type="bibr" rid="B18">Frisch et&#x20;al., 2016</xref>) at the B3LYP/6-31&#x2b;G(d,p) level of theory, unless otherwise noted. The convergence criteria and the thresholds for self-consistent field (SCF) calculations are listed in <xref ref-type="sec" rid="s11">Supplementary Table S2</xref>. Complete convergence for optimized structures was ensured by a vibrational analysis that revealed no imaginary eigenfrequencies. Geometry optimizations were conducted under gas-phase conditions and in the presence of an implicit solvent using a self-consistent reaction field (SCRF) (<xref ref-type="bibr" rid="B44">Tomasi et&#x20;al., 2005</xref>) with the optimized coordinates for all compounds listed in <xref ref-type="sec" rid="s11">
<italic>ESM Section S10</italic>
</xref> and depicted in <xref ref-type="sec" rid="s11">Supplementary Figure S5</xref>. This implicit solvation model used a polarizable continuum model with the integral equation formalism variant (IEFPCM) (<xref ref-type="bibr" rid="B45">Tomasi et&#x20;al., 1999</xref>), for which a dielectric constant of 78.06 was specified to represent the D<sub>2</sub>O solvent used in the experimental conditions.</p>
<p>NMR shielding constant calculations were then conducted on each optimized geometry to yield an isotropic magnetic shielding value for each carbon and hydrogen atom. Unlike the geometry optimization for FBS, which was held constant, the various NMR FBS combinations listed in <xref ref-type="table" rid="T1">Table&#x20;1</xref> were employed for the NMR shielding constant calculation, whose specific route lines are listed in <xref ref-type="sec" rid="s11">Supplementary Table S3</xref>. These specified NMR FBS combinations were combined with two other independent variables: the solvation model for the optimization and the solvation model for the NMR shielding constant calculation.</p>
<table-wrap id="T1" position="float">
<label>TABLE 1</label>
<caption>
<p>Nuclear magnetic resonance (NMR) functional&#x2013;basis sets (FBS) 1&#x2013;23 identified by their FBS combinations for NMR shielding constant calculations.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th align="left">NMR FBS no.</th>
<th align="center">Functional<xref ref-type="table-fn" rid="Tfn1">
<sup>a</sup>
</xref>
</th>
<th align="center">Basis set<xref ref-type="table-fn" rid="Tfn1">
<sup>a</sup>
</xref>
</th>
<th align="center">Literature reference for NMR predictions<xref ref-type="table-fn" rid="Tfn2">
<sup>b</sup>
</xref>
</th>
<th align="center">
<sup>13</sup>C MAEs for <xref ref-type="fig" rid="F4">Figures 4</xref> and <xref ref-type="fig" rid="F5">5</xref> (ppm)<xref ref-type="table-fn" rid="Tfn3">
<sup>c</sup>
</xref>
</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="left">1</td>
<td align="left">B3LYP (<xref ref-type="bibr" rid="B6">Becke, 1993</xref>)</td>
<td align="left">6-31G(d)</td>
<td align="left">
<xref ref-type="bibr" rid="B13">Che&#x142;mecka et&#x20;al. (2007)</xref>, <xref ref-type="bibr" rid="B35">Pierens (2014)</xref>, <xref ref-type="bibr" rid="B8">Benassi (2017)</xref>
</td>
<td align="char" char=".">1.50</td>
</tr>
<tr>
<td align="left">2</td>
<td align="left">B3LYP (<xref ref-type="bibr" rid="B6">Becke, 1993</xref>)</td>
<td align="left">6-311&#x2b;G(2d,p)</td>
<td align="left">
<xref ref-type="bibr" rid="B35">Pierens (2014</xref>), <xref ref-type="bibr" rid="B50">Willoughby et&#x20;al. (2014</xref>)</td>
<td align="char" char=".">2.28</td>
</tr>
<tr>
<td align="left">3</td>
<td align="left">B3LYP (<xref ref-type="bibr" rid="B6">Becke, 1993</xref>)</td>
<td align="left">cc-pVDZ</td>
<td align="left">
<xref ref-type="bibr" rid="B35">Pierens (2014)</xref>
</td>
<td align="char" char=".">2.37</td>
</tr>
<tr>
<td align="left">4</td>
<td align="left">B3LYP (<xref ref-type="bibr" rid="B6">Becke, 1993</xref>)</td>
<td align="left">aug-cc-pVDZ</td>
<td align="left">
<xref ref-type="bibr" rid="B35">Pierens (2014)</xref>
</td>
<td align="char" char=".">1.93</td>
</tr>
<tr>
<td align="left">5</td>
<td align="left">BMK (<xref ref-type="bibr" rid="B11">Boese and Martin, 2004</xref>)</td>
<td align="left">6-31G(d)</td>
<td align="left">
<xref ref-type="bibr" rid="B35">Pierens (2014)</xref>
</td>
<td align="char" char=".">2.58</td>
</tr>
<tr>
<td align="left">6</td>
<td align="left">BMK (<xref ref-type="bibr" rid="B11">Boese and Martin, 2004</xref>)</td>
<td align="left">6-311G(d)</td>
<td align="left">
<xref ref-type="bibr" rid="B35">Pierens (2014)</xref>
</td>
<td align="char" char=".">3.39</td>
</tr>
<tr>
<td align="left">7</td>
<td align="left">mPW1PW91 (<xref ref-type="bibr" rid="B1">Adamo and Barone, 1998</xref>)</td>
<td align="left">6-311&#x2b;G(2d,p)</td>
<td align="left">
<xref ref-type="bibr" rid="B35">Pierens (2014)</xref>, <xref ref-type="bibr" rid="B8">Benassi (2017)</xref>
</td>
<td align="char" char=".">1.85</td>
</tr>
<tr>
<td align="left">8</td>
<td align="left">PBE0 (<xref ref-type="bibr" rid="B2">Adamo and Barone, 1999</xref>)</td>
<td align="left">6-311&#x2b;G(2d,p)</td>
<td align="left">
<xref ref-type="bibr" rid="B35">Pierens (2014)</xref>, <xref ref-type="bibr" rid="B8">Benassi (2017)</xref>
</td>
<td align="char" char=".">1.87</td>
</tr>
<tr>
<td align="left">9</td>
<td align="left">WC04 (<xref ref-type="bibr" rid="B49">Wiitala et&#x20;al., 2006</xref>)</td>
<td align="left">6-31g(d)</td>
<td align="left">
<xref ref-type="bibr" rid="B35">Pierens (2014)</xref>
</td>
<td align="char" char=".">7.34</td>
</tr>
<tr>
<td align="left">10</td>
<td align="left">WP04 (<xref ref-type="bibr" rid="B49">Wiitala et&#x20;al., 2006</xref>)</td>
<td align="left">aug-cc-pvdz</td>
<td align="left">
<xref ref-type="bibr" rid="B35">Pierens (2014)</xref>, <xref ref-type="bibr" rid="B8">Benassi (2017)</xref>
</td>
<td align="char" char=".">1.88</td>
</tr>
<tr>
<td align="left">11</td>
<td align="left">CAM-B3LYP (<xref ref-type="bibr" rid="B52">Yanai et&#x20;al., 2004</xref>)</td>
<td align="left">6-311&#x2b;G(2d,p)</td>
<td align="left">
<xref ref-type="bibr" rid="B32">O&#x27;Neill et&#x20;al. (1996)</xref>
</td>
<td align="char" char=".">2.13</td>
</tr>
<tr>
<td align="left">12</td>
<td align="left">mPW1LYP</td>
<td align="left">6-311&#x2b;G(2d,p)</td>
<td align="left">
<xref ref-type="bibr" rid="B32">O&#x27;Neill et&#x20;al. (1996)</xref>
</td>
<td align="char" char=".">2.13</td>
</tr>
<tr>
<td align="left">13</td>
<td align="left">B3LYP (<xref ref-type="bibr" rid="B6">Becke, 1993</xref>)</td>
<td align="left">6- 311G(d, p)</td>
<td align="left">
<xref ref-type="bibr" rid="B32">O&#x27;Neill et&#x20;al. (1996)</xref>
</td>
<td align="char" char=".">2.36</td>
</tr>
<tr>
<td align="left">14</td>
<td align="left">CSGT-LC-TPSSTPSS (<xref ref-type="bibr" rid="B40">Tao et&#x20;al., 2003</xref>)</td>
<td align="left">cc-pVTZ</td>
<td align="left">
<xref ref-type="bibr" rid="B21">Iron (2017)</xref>
</td>
<td align="char" char=".">1.95</td>
</tr>
<tr>
<td align="left">15</td>
<td align="left">PBE (<xref ref-type="bibr" rid="B34">Perdew et&#x20;al., 1996</xref>)</td>
<td align="left">6-311G&#x2a;&#x2a;</td>
<td align="left">
<xref ref-type="bibr" rid="B24">Kemp (2016)</xref>
</td>
<td align="char" char=".">1.91</td>
</tr>
<tr>
<td align="left">16</td>
<td align="left">mPW1PW91 (<xref ref-type="bibr" rid="B1">Adamo and Barone, 1998</xref>)</td>
<td align="left">6-31G(d)</td>
<td align="left">
<xref ref-type="bibr" rid="B33">O&#x2019;Neill et&#x20;al. (2020)</xref>
</td>
<td align="char" char=".">1.90</td>
</tr>
<tr>
<td align="left">17</td>
<td align="left">B3LYP (<xref ref-type="bibr" rid="B6">Becke, 1993</xref>)</td>
<td align="left">TZVP</td>
<td align="left">
<xref ref-type="bibr" rid="B42">Taubert et&#x20;al. (2005)</xref>
</td>
<td align="char" char=".">2.52</td>
</tr>
<tr>
<td align="left">18</td>
<td align="left">BP86</td>
<td align="left">TZVP</td>
<td align="left">
<xref ref-type="bibr" rid="B42">Taubert et&#x20;al. (2005)</xref>
</td>
<td align="char" char=".">3.27</td>
</tr>
<tr>
<td align="left">19</td>
<td align="left">B3PW91 (<xref ref-type="bibr" rid="B6">Becke, 1993</xref>)</td>
<td align="left">6-31&#x2b;G(d)</td>
<td align="left">
<xref ref-type="bibr" rid="B26">Kupka et&#x20;al. (1999)</xref>, <xref ref-type="bibr" rid="B46">Toukach and Ananikov (2013a)</xref>
</td>
<td align="char" char=".">1.72</td>
</tr>
<tr>
<td align="left">20</td>
<td align="left">B3LYP (<xref ref-type="bibr" rid="B6">Becke, 1993</xref>)</td>
<td align="left">6-311G&#x2b;&#x2b;(2d,2p)</td>
<td align="left">
<xref ref-type="bibr" rid="B46">Toukach and Ananikov (2013a)</xref>
</td>
<td align="char" char=".">2.31</td>
</tr>
<tr>
<td align="left">21</td>
<td align="left">PBE (<xref ref-type="bibr" rid="B34">Perdew et&#x20;al., 1996</xref>)</td>
<td align="left">TZ2p</td>
<td align="left">
<xref ref-type="bibr" rid="B7">Belyakov and Ananikov (2011)</xref>, <xref ref-type="bibr" rid="B46">Toukach and Ananikov (2013a)</xref>
</td>
<td align="char" char=".">2.04</td>
</tr>
<tr>
<td align="left">22</td>
<td align="left">WC04 (<xref ref-type="bibr" rid="B49">Wiitala et&#x20;al., 2006</xref>)</td>
<td align="left">aug-cc-pVDZ</td>
<td align="left">&#x2014;</td>
<td align="char" char=".">5.91</td>
</tr>
<tr>
<td align="left">23</td>
<td align="left">WP04 (<xref ref-type="bibr" rid="B49">Wiitala et&#x20;al., 2006</xref>)</td>
<td align="left">6-31G(d)</td>
<td align="left">&#x2014;</td>
<td align="char" char=".">1.79</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>
<p>MAEs, mean absolute errors.</p>
</fn>
<fn id="Tfn1">
<label>a</label>
<p>Specific route lines are listed in <xref ref-type="sec" rid="s11">Supplementary Table S3</xref>.</p>
</fn>
<fn id="Tfn2">
<label>b</label>
<p>Use of FBS in the literature for predicting chemical shifts.</p>
</fn>
<fn id="Tfn3">
<label>c</label>
<p>MAEs for <sup>13</sup>C predictions using implicit solvation during both geometry optimizations and shielding constant calculations.</p>
</fn>
</table-wrap-foot>
</table-wrap>
</sec>
<sec id="s2-3">
<title>Chemical Shift Calculations</title>
<p>The computed isotropic magnetic shielding values from the shielding constant calculation resembled the unreferenced NMR data found in the experimental analysis. Consequently, reference compounds were needed to convert the computed isotropic magnetic shielding values into NMR chemical shift data (<xref ref-type="bibr" rid="B42">Taubert et&#x20;al., 2005</xref>), so as to be comparable to the experimental data reported in parts per million (<xref ref-type="bibr" rid="B22">Ishii and Ono, 1999</xref>). The choice of a reference did not change the relative ordering of shifts as the reference approach applied a constant correction value to all isotropic magnetic shielding values. Our initial results were consistent with the findings of other studies that tetramethylsilane (TMS) alone does not serve as a consistently good reference for NMR predictions (<xref ref-type="sec" rid="s11">
<italic>ESM Section S4</italic>
</xref>) (<xref ref-type="bibr" rid="B36">Sarotti and Pellegrinet, 2009</xref>). Therefore, all our computational shifts were referenced to the experimental reference compounds used by <xref ref-type="bibr" rid="B22">Ishii and Ono (1999</xref>) (i.e.,&#x20;methanol for <sup>13</sup>C and acetone for <sup>1</sup>H), which are also structurally representative of the compounds being studied (<xref ref-type="bibr" rid="B35">Pierens, 2014</xref>; <xref ref-type="bibr" rid="B42">Taubert et&#x20;al., 2005</xref>; <xref ref-type="bibr" rid="B36">Sarotti and Pellegrinet, 2009</xref>; <xref ref-type="bibr" rid="B26">Kupka et&#x20;al., 1999</xref>). The reference correction values are reported in <xref ref-type="sec" rid="s11">Supplementary Table S4</xref>. For every permutation of the solvation model and the NMR FBS combination that was run on a methyl apioside and its borate ester derivatives, the identical NMR FBS was also run on the appropriate reference compound (<xref ref-type="bibr" rid="B42">Taubert et&#x20;al., 2005</xref>).</p>
<p>
<xref ref-type="disp-formula" rid="e1">Equation 1</xref> (see <xref ref-type="sec" rid="s11">
<italic>ESM Section S2</italic>
</xref> for derivation) (<xref ref-type="bibr" rid="B35">Pierens, 2014</xref>; <xref ref-type="bibr" rid="B36">Sarotti and Pellegrinet, 2009</xref>) was used to calculate the computed chemical shift (<italic>&#x3b4;</italic>
<sub>comp</sub>) value for our compound. This was done by first comparing the computed isotropic magnetic shielding value of the second reference (<italic>&#x3c3;</italic>
<sub>ref</sub>) to its known experimental value (<italic>&#x3b4;</italic>
<sub>ref</sub>) in order to yield a reference correction (<xref ref-type="sec" rid="s11">Supplementary Table S4</xref>) (<xref ref-type="bibr" rid="B19">Fulmer et&#x20;al., 2010</xref>), from which we then subtracted the isotropic magnetic shielding value compound of interest (<italic>&#x3b4;</italic>
<sub>comp</sub>) to obtain its chemical shift value (<italic>&#x3c3;</italic>
<sub>comp</sub>).<disp-formula id="e1">
<mml:math id="m1">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="italic">&#x3b4;</mml:mi>
<mml:mrow>
<mml:mi>c</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>m</mml:mi>
<mml:mi>p</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x3d;</mml:mo>
<mml:msub>
<mml:mi mathvariant="italic">&#x3c3;</mml:mi>
<mml:mrow>
<mml:mi>r</mml:mi>
<mml:mi>e</mml:mi>
<mml:mi>f</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2b;</mml:mo>
<mml:msub>
<mml:mi mathvariant="italic">&#x3b4;</mml:mi>
<mml:mrow>
<mml:mi>r</mml:mi>
<mml:mi>e</mml:mi>
<mml:mi>f</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mo>&#x2212;</mml:mo>
<mml:msub>
<mml:mi mathvariant="italic">&#x3c3;</mml:mi>
<mml:mrow>
<mml:mi>c</mml:mi>
<mml:mi>o</mml:mi>
<mml:mi>m</mml:mi>
<mml:mi>p</mml:mi>
</mml:mrow>
</mml:msub>
</mml:mrow>
</mml:math>
<label>(1)</label>
</disp-formula>
</p>
<p>For compounds <bold>3</bold> and <bold>4</bold>, which are dimers, the computed isotropic magnetic shielding values reported were averaged over the two analogous atoms from each monomer before applying the reference correction. The <sup>1</sup>H data for methyl peaks were averaged in a similar way. All statistical analyses were performed on these averaged chemical shifts. The overall protocol of how the chemical shift values were calculated is summarized in <xref ref-type="fig" rid="F3">Figure&#x20;3</xref>.</p>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption>
<p>Protocol for quantum mechanical (QM)-based nuclear magnetic resonance (NMR) shift calculations. The <italic>dark blue</italic> and <italic>gray arrows</italic> indicate the analysis methodology for the compound of interest and the reference compound, respectively.</p>
</caption>
<graphic xlink:href="fmolb-08-756219-g003.tif"/>
</fig>
</sec>
<sec id="s2-4">
<title>Statistical Analysis of FBS and Solvation Model Performance</title>
<p>To assess the efficacy of each FBS combination and solvation model, we used a variety of statistical analyses for determining the precision and accuracy of a chemical shift prediction. The primary tool was the MAE reported in parts per million, which measured the difference between the computed chemical shift value and the known experimental value (<xref ref-type="bibr" rid="B22">Ishii and Ono, 1999</xref>). The MAE for a given NMR FBS averaged the absolute errors of every atom on all four compounds. The mean absolute percent errors (MAPEs) were then used to compare the accuracy of the heteronuclear NMR predictions, as the MAE values were incomparable between <sup>1</sup>H and <sup>13</sup>C predictions due to differences in the magnitude of the chemical shifts. The MAE metric was contrasted with the mean signed error (MSE), which averaged all errors using signed values so as to measure whether a NMR FBS consistently over- or underpredicted the shift values, as observed by <xref ref-type="bibr" rid="B42">Taubert et&#x20;al. (2005)</xref>. Standard deviations (SDs), reported in parts per million, were calculated using such signed errors. Lastly, <italic>r</italic>
<sup>2</sup> values were calculated by comparing the computed chemical shift values of each atom on compounds <bold>1</bold>&#x2013;<bold>4</bold> with the experimental values. A low SD indicates high precision, also indicated by a high <italic>r</italic>
<sup>2</sup> value. A smaller MSE indicates high accuracy. A low MAE or MAPE value tends to indicate both adequate precision and accuracy.</p>
</sec>
</sec>
<sec sec-type="results" id="s3">
<title>Results</title>
<sec id="s3-1">
<title>Impact of Implicit Solvation Only During NMR Shielding Constant Calculations</title>
<p>Apiose and its charged borate esters were experimentally characterized in the presence of the solvent D<sub>2</sub>O (<xref ref-type="bibr" rid="B22">Ishii and Ono, 1999</xref>). To quantify the impact of considering solvation during the NMR shielding calculations, we predicted <sup>13</sup>C shifts using NMR FBS 1&#x2013;10, with and without implicit solvation during this NMR calculation. <xref ref-type="fig" rid="F4">Figure&#x20;4</xref> depicts circular plots (donut plots) that reflect the errors with respect to the experimental <sup>13</sup>C NMR data for all six carbons in compounds <bold>1</bold>&#x2013;<bold>4</bold> for these 10 NMR FBS combinations with their respective solvation model. For this comparison, all geometry optimizations were run in the gas phase without implicit solvation. <xref ref-type="fig" rid="F4">Figure&#x20;4</xref> shows that the MAEs ranged from 1.96 to 6.70&#xa0;ppm, with the best performance by NMR FBS combinations 10, 4, 7, 8, and 1 with implicit solvation. The NMR computations that considered implicit solvation <italic>via</italic> IEFPCM (highlighted in blue) primarily resulted in MAEs lower than those of their counterparts run in purely gas-phase conditions without any solvation (white background).</p>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption>
<p>Mean absolute errors (MAEs) in parts per million for <sup>13</sup>C nuclear magnetic resonance (NMR) predictions for NMR functional&#x2013;basis sets (FBS) 1&#x2013;10 represented as donut plots for gas-phase geometry optimizations. The donut size reflects the averaged MAE for each NMR FBS. The four colors within each donut indicate errors for each compound, and the shades within each color indicate errors on a specific carbon atom. MAEs and r<sup>2</sup> values are shown in blue and red, respectively. Donuts on white and blue backgrounds indicate NMR values calculated in the gas phase and with implicit solvation, respectively.</p>
</caption>
<graphic xlink:href="fmolb-08-756219-g004.tif"/>
</fig>
<p>
<xref ref-type="fig" rid="F4">Figure&#x20;4</xref> shows that most NMR FBS combinations performed similarly across all four compounds, with errors relatively evenly split between compounds (each compound accounted for about a quarter of the total error). While the error was not evenly split between carbon atoms, there was no consistent trend about which atoms had the highest errors. In some cases, carbons C1 and OCH<sub>3</sub> had higher errors, but they did not affect the shift ordering as they were the farthest upfield and farthest downfield chemical shift values, respectively.</p>
</sec>
<sec id="s3-2">
<title>Impact of Implicit Solvation During Both Geometry Optimization and NMR Shielding Constant Calculations</title>
<p>While MAEs provide insights into the average accuracy of an FBS and solvation model, they do not show whether the shifts are in the correct order, which is an important aspect of NMR characterization. The experimental data in <xref ref-type="table" rid="T2">Table&#x20;2</xref> indicated the following order toward the downfield direction for both methyl apiose and its borate derivatives: OCH<sub>3</sub>, C3&#x2032;, C4, C2, C3, C1. The use of implicit solvation during the NMR shielding constant calculations improved the shift ordering and decreased the MAEs. However, all calculations that used gas-phase geometry optimizations, regardless of the FBS or solvation model used in the NMR shielding constant calculations, were unable to perfectly replicate the ordering of the <sup>13</sup>C shift data. <xref ref-type="table" rid="T2">Table&#x20;2</xref> shows that the orders of C2 and C3 for gas-phase geometry predictions were switched in the apio&#x2013;borate compounds <bold>2</bold>&#x2013;<bold>4</bold> compared with the experimental shift values. Incorrect ordering was observed for all other NMR FBS combinations for C2/C3 in at least one of the compounds.</p>
<table-wrap id="T2" position="float">
<label>TABLE 2</label>
<caption>
<p>
<sup>13</sup>C shifts for compounds <bold>1</bold>&#x2013;<bold>4</bold> from <xref ref-type="bibr" rid="B22">Ishii and Ono (1999)</xref> compared to the computationally predicted shifts using either gas phase or implicit solvation (IEFPCM).</p>
</caption>
<table>
<thead valign="top">
<tr>
<th rowspan="2" align="left">Compound</th>
<th colspan="2" align="center">Solvation model</th>
<th colspan="6" align="center">Chemical shifts (ppm) (downfield &#x2192; upfield)<xref ref-type="table-fn" rid="Tfn5">
<sup>b</sup>
</xref>
</th>
<th rowspan="2" align="center">MAE<xref ref-type="table-fn" rid="Tfn6">
<sup>c</sup>
</xref>
</th>
</tr>
<tr>
<th align="left">Optimization</th>
<th align="left">NMR (FBS 1)<xref ref-type="table-fn" rid="Tfn4">
<sup>a</sup>
</xref>
</th>
<th align="center">C1</th>
<th align="center">C3</th>
<th align="center">C2</th>
<th align="center">C4</th>
<th align="center">C3&#x2032;</th>
<th align="center">OCH<sub>3</sub>
</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td rowspan="3" align="left">
<bold>1</bold>
</td>
<td align="left">&#x2013;</td>
<td align="left">Ishii et&#x20;al.</td>
<td align="char" char=".">110.20</td>
<td align="char" char=".">80.08</td>
<td align="char" char=".">77.30</td>
<td align="char" char=".">74.32</td>
<td align="char" char=".">64.23</td>
<td align="char" char=".">56.78</td>
<td align="center">&#x2013;</td>
</tr>
<tr>
<td align="left">Gas phase</td>
<td align="left">IEFPCM</td>
<td align="char" char=".">108.53</td>
<td align="char" char=".">79.62</td>
<td align="char" char=".">78.76</td>
<td align="char" char=".">73.17</td>
<td align="char" char=".">64.47</td>
<td align="char" char=".">52.73</td>
<td align="char" char=".">1.50</td>
</tr>
<tr>
<td align="left">IEFPCM</td>
<td align="left">IEFPCM</td>
<td align="char" char=".">108.38</td>
<td align="char" char=".">79.25</td>
<td align="char" char=".">78.59</td>
<td align="char" char=".">73.04</td>
<td align="char" char=".">63.26</td>
<td align="char" char=".">53.35</td>
<td align="char" char=".">1.60</td>
</tr>
<tr>
<td rowspan="3" align="left">
<bold>2</bold>
</td>
<td align="left">&#x2013;</td>
<td align="left">Ishii et&#x20;al.</td>
<td align="char" char=".">110.70</td>
<td align="char" char=".">86.92</td>
<td align="char" char=".">83.42</td>
<td align="char" char=".">75.08</td>
<td align="char" char=".">65.38</td>
<td align="char" char=".">55.08</td>
<td align="center">&#x2013;</td>
</tr>
<tr>
<td align="left">Gas phase</td>
<td align="left">IEFPCM</td>
<td align="char" char=".">107.08</td>
<td align="char" char=".">84.03</td>
<td align="char" char=".">85.83</td>
<td align="char" char=".">77.21</td>
<td align="char" char=".">66.50</td>
<td align="char" char=".">50.30</td>
<td align="char" char=".">2.82</td>
</tr>
<tr>
<td align="left">IEFPCM</td>
<td align="left">IEFPCM</td>
<td align="char" char=".">110.41</td>
<td align="char" char=".">83.92</td>
<td align="char" char=".">83.30</td>
<td align="char" char=".">72.33</td>
<td align="char" char=".">64.20</td>
<td align="char" char=".">52.24</td>
<td align="char" char=".">1.70</td>
</tr>
<tr>
<td rowspan="3" align="left">
<bold>3</bold>
</td>
<td align="left">&#x2013;</td>
<td align="left">Ishii et&#x20;al.</td>
<td align="char" char=".">110.18</td>
<td align="char" char=".">87.09</td>
<td align="char" char=".">83.25</td>
<td align="char" char=".">74.86</td>
<td align="char" char=".">65.13</td>
<td align="char" char=".">55.06</td>
<td align="center">&#x2013;</td>
</tr>
<tr>
<td align="left">Gas phase</td>
<td align="left">IEFPCM</td>
<td align="char" char=".">108.45</td>
<td align="char" char=".">84.92</td>
<td align="char" char=".">85.85</td>
<td align="char" char=".">76.95</td>
<td align="char" char=".">66.43</td>
<td align="char" char=".">50.96</td>
<td align="char" char=".">2.33</td>
</tr>
<tr>
<td align="left">IEFPCM</td>
<td align="left">IEFPCM</td>
<td align="char" char=".">109.90</td>
<td align="char" char=".">85.71</td>
<td align="char" char=".">84.37</td>
<td align="char" char=".">73.92</td>
<td align="char" char=".">65.20</td>
<td align="char" char=".">52.18</td>
<td align="char" char=".">1.11</td>
</tr>
<tr>
<td rowspan="3" align="left">
<bold>4</bold>
</td>
<td align="left">&#x2013;</td>
<td align="left">Ishii et&#x20;al.</td>
<td align="char" char=".">110.28</td>
<td align="char" char=".">86.94</td>
<td align="char" char=".">83.42</td>
<td align="char" char=".">74.90</td>
<td align="char" char=".">65.13</td>
<td align="char" char=".">55.06</td>
<td align="center">&#x2013;</td>
</tr>
<tr>
<td align="left">Gas phase</td>
<td align="left">IEFPCM</td>
<td align="char" char=".">106.76</td>
<td align="char" char=".">84.25</td>
<td align="char" char=".">86.90</td>
<td align="char" char=".">76.68</td>
<td align="char" char=".">66.11</td>
<td align="char" char=".">50.49</td>
<td align="char" char=".">2.84</td>
</tr>
<tr>
<td align="left">IEFPCM</td>
<td align="left">IEFPCM</td>
<td align="char" char=".">110.48</td>
<td align="char" char=".">85.39</td>
<td align="char" char=".">84.24</td>
<td align="char" char=".">72.16</td>
<td align="char" char=".">63.64</td>
<td align="char" char=".">52.22</td>
<td align="char" char=".">1.61</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>
<p>IEFPCM, polarizable continuun model using the integral equation formalism variant; <italic>FBS</italic>, functional&#x2013;basis set; <italic>MAE</italic>, mean absolute&#x20;error.</p>
</fn>
<fn id="Tfn4">
<label>a</label>
<p>NMR shielding constant calculations using FBS 1 and implicit solvation (IEFPCM).</p>
</fn>
<fn id="Tfn5">
<label>b</label>
<p>Incorrect ordering of C3/C2 shifts as compared with the experimental data of Ishii et&#x20;al. is highlighted in&#x20;red.</p>
</fn>
<fn id="Tfn6">
<label>c</label>
<p>Accounting for implicit solvation during geometry optimizations decreases the MAEs.</p>
</fn>
</table-wrap-foot>
</table-wrap>
<p>To ensure the correct ordering of the computationally predicted <sup>13</sup>C shifts, we considered the impact of implicit solvation on geometry optimizations in addition to the NMR shielding constant calculations. The use of implicit solvation during both steps resulted in an ordering that was consistent with the experimental data for all compounds (<xref ref-type="table" rid="T2">Table&#x20;2</xref>). Additionally, we observed that using implicit solvation during both steps resulted in lower MAEs and increased <italic>r</italic>
<sup>2</sup> values for all NMR FBS combinations than those for geometries obtained in the gas phase. <xref ref-type="fig" rid="F5">Figure&#x20;5</xref> depicts the improved MAEs and <italic>r</italic>
<sup>2</sup> values for the four best-performing NMR FBS combinations out of the initial 10: 1, 7, 8, and&#x20;10.</p>
<fig id="F5" position="float">
<label>FIGURE 5</label>
<caption>
<p>Mean absolute errors (MAEs, <italic>blue</italic>) in parts per million and <italic>r</italic>
<sup>2</sup> values (<italic>red</italic>) for <sup>13</sup>C nuclear magnetic resonance (NMR) predictions. NMR functional&#x2013;basis sets (FBS) 11&#x2013;23 are compared to the top-performing methods from NMR FBS 1&#x2013;10, shown at the <italic>top</italic> (1, 7, 8, and 10). Donut plots for all NMR FBS 1-23 are shown in <xref ref-type="sec" rid="s11">Supplementary Figure S2</xref>.</p>
</caption>
<graphic xlink:href="fmolb-08-756219-g005.tif"/>
</fig>
</sec>
<sec id="s3-3">
<title>Determining the Best FBS Combinations for Accurate NMR Shift Predictions</title>
<p>Having established a protocol that consistently replicated the ordering of chemical shifts by using implicit solvation for both geometry optimizations and NMR shielding constant calculations, we then proceeded to evaluate other NMR FBS combinations by using our protocol to improve upon other FBS combinations used in the literature. We identified 13 other NMR FBS combinations that had been recommended in the literature for NMR predictions and that could be used with implicit solvation on all our compounds (<xref ref-type="bibr" rid="B42">Taubert et&#x20;al., 2005</xref>; <xref ref-type="bibr" rid="B46">Toukach and Ananikov, 2013a</xref>; <xref ref-type="bibr" rid="B50">Willoughby et&#x20;al., 2014</xref>; <xref ref-type="bibr" rid="B24">Kemp, 2016</xref>; <xref ref-type="bibr" rid="B8">Benassi, 2017</xref>; <xref ref-type="bibr" rid="B21">Iron, 2017</xref>). This additionally allowed us to probe how a change in functional or basis set affects the performance of an FBS for apio&#x2013;borate compounds.</p>
<p>We tested the same functional and various basis set combinations; for instance, switching the basis sets of NMR FBS 9&#x2013;10 yielded NMR FBS 22&#x2013;23. NMR FBS 11&#x2013;23 were compared in <xref ref-type="fig" rid="F5">Figure&#x20;5</xref> to our previous top 4 NMR FBS combinations: 1, 7, 8, and 10. <xref ref-type="table" rid="T3">Table&#x20;3</xref> shows the statistical analyses of the new top 6&#x20;best-performing NMR FBS combinations (NMR FBS 1, 19, 23, 7, 8, and 16) based on the lowest MAE values when paired with the implicit solvation models for both geometry optimization and shielding constant calculation.</p>
<table-wrap id="T3" position="float">
<label>TABLE 3</label>
<caption>
<p>Statistical analysis [mean absolute errors (MAEs), mean absolute percent errors (MAPEs), and <italic>r</italic>
<sup>2</sup> values] of the performance of the top 6 nuclear magnetic resonance (NMR) functional&#x2013;basis set (FBS) combinations for predicting <sup>13</sup>C and <sup>1</sup>H shifts using implicit solvation during both geometry optimizations and NMR shielding constant calculations.</p>
</caption>
<table>
<thead valign="top">
<tr>
<th rowspan="2" align="left">NMR FBS no.</th>
<th colspan="2" align="center">MAE (ppm)</th>
<th colspan="2" align="center">MAPE (%)</th>
<th colspan="2" align="center">
<italic>r</italic>
<sup>2</sup>
</th>
<th colspan="2" align="center">MSE (ppm)</th>
<th colspan="2" align="center">SD (ppm)</th>
</tr>
<tr>
<th align="center">
<sup>13</sup>C</th>
<th align="center">
<sup>1</sup>H</th>
<th align="center">
<sup>13</sup>C</th>
<th align="center">
<sup>1</sup>H</th>
<th align="center">
<sup>13</sup>C</th>
<th align="center">
<sup>1</sup>H</th>
<th align="center">
<sup>13</sup>C</th>
<th align="center">
<sup>1</sup>H</th>
<th align="center">
<sup>13</sup>C</th>
<th align="center">
<sup>1</sup>H</th>
</tr>
</thead>
<tbody valign="top">
<tr>
<td align="left">1</td>
<td align="char" char=".">1.50</td>
<td align="char" char=".">0.12</td>
<td align="char" char=".">2.16</td>
<td align="char" char=".">2.98</td>
<td align="char" char=".">0.995</td>
<td align="char" char=".">0.892</td>
<td align="char" char=".">&#x2212;1.21</td>
<td align="char" char=".">&#x2212;0.02</td>
<td align="char" char=".">1.39</td>
<td align="char" char=".">0.16</td>
</tr>
<tr>
<td align="left">16</td>
<td align="char" char=".">1.90</td>
<td align="char" char=".">0.11</td>
<td align="char" char=".">2.63</td>
<td align="char" char=".">2.69</td>
<td align="char" char=".">0.996</td>
<td align="char" char=".">0.913</td>
<td align="char" char=".">&#x2212;1.80</td>
<td align="char" char=".">&#x2212;0.05</td>
<td align="char" char=".">1.19</td>
<td align="char" char=".">0.14</td>
</tr>
<tr>
<td align="left">19</td>
<td align="char" char=".">1.72</td>
<td align="char" char=".">0.10</td>
<td align="char" char=".">2.51</td>
<td align="char" char=".">2.64</td>
<td align="char" char=".">0.996</td>
<td align="char" char=".">0.918</td>
<td align="char" char=".">&#x2212;1.52</td>
<td align="char" char=".">0.00</td>
<td align="char" char=".">1.43</td>
<td align="char" char=".">0.14</td>
</tr>
<tr>
<td align="left">23</td>
<td align="char" char=".">1.79</td>
<td align="char" char=".">0.13</td>
<td align="char" char=".">2.53</td>
<td align="char" char=".">3.44</td>
<td align="char" char=".">0.993</td>
<td align="char" char=".">0.927</td>
<td align="char" char=".">&#x2212;1.38</td>
<td align="char" char=".">&#x2212;0.11</td>
<td align="char" char=".">1.56</td>
<td align="char" char=".">0.13</td>
</tr>
<tr>
<td align="left">7</td>
<td align="char" char=".">1.85</td>
<td align="char" char=".">0.13</td>
<td align="char" char=".">2.61</td>
<td align="char" char=".">3.44</td>
<td align="char" char=".">0.996</td>
<td align="char" char=".">0.893</td>
<td align="char" char=".">&#x2212;0.49</td>
<td align="char" char=".">&#x2212;0.06</td>
<td align="char" char=".">2.19</td>
<td align="char" char=".">0.16</td>
</tr>
<tr>
<td align="left">8</td>
<td align="char" char=".">1.87</td>
<td align="char" char=".">0.14</td>
<td align="char" char=".">2.60</td>
<td align="char" char=".">3.48</td>
<td align="char" char=".">0.996</td>
<td align="char" char=".">0.888</td>
<td align="char" char=".">&#x2212;0.29</td>
<td align="char" char=".">&#x2212;0.06</td>
<td align="char" char=".">2.26</td>
<td align="char" char=".">0.16</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn>
<p>The combination of low MAE values and low standard deviation (SD) between individual errors suggests both high accuracy and precision for NMR FBS 1 and&#x20;23.</p>
</fn>
<fn>
<p>MSE, mean signed&#x20;error.</p>
</fn>
</table-wrap-foot>
</table-wrap>
<p>While we were able to improve our MAEs using different functionals for NMR shielding constant calculations, it was shown that our consistent use of the B3LYP functional for geometry optimizations (which was kept constant to limit the number of variables) yielded better results for NMR predictions than some suggested in the literature. Specifically, NMR FBS 15, 16, and 1 have been shown in the literature to work well with wB97XD, M062X, and wB97XD as geometry optimization functionals, respectively, when calculating chemical shifts for carbohydrates (<xref ref-type="bibr" rid="B24">Kemp, 2016</xref>; <xref ref-type="bibr" rid="B8">Benassi, 2017</xref>). The results in <xref ref-type="sec" rid="s11">Supplementary Figure S3</xref> show that these literature functionals for geometry optimizations performed worse than B3LYP.</p>
<p>The top 4 NMR FBS combinations for <sup>13</sup>C predictions (<xref ref-type="table" rid="T3">Table&#x20;3</xref>) also performed the best at predicting <sup>1</sup>H data (<xref ref-type="sec" rid="s11">
<italic>ESM Section S7</italic>
</xref>). Due to the different magnitudes of the shift values of <sup>13</sup>C and <sup>1</sup>H, we used MAPEs to compare their ability to predict both <sup>13</sup>C and <sup>1</sup>H shifts. As seen in <xref ref-type="table" rid="T3">Table&#x20;3</xref>, the MAPE values for <sup>1</sup>H were slightly larger than those for <sup>13</sup>C, but NMR FBS 19, the best at replicating <sup>1</sup>H data, had a MAPE of 2.65%, which was comparable to the corresponding 2.51% MAPE for <sup>13</sup>C. The experimental chemical shift values for the protons within the apiosyl compounds were similar enough that our predictions can only replicate general trends, and we were unable to exactly reproduce the shift ordering for <sup>1</sup>H. Even in the experiments by Ishii et&#x20;al., they were unable to distinguish H3&#x2032;a and H3&#x2032;b and noted the miniscule differences, such as the 0.04-ppm difference between H2 and H4a and the 0.003-ppm difference for H1 in compounds <bold>2</bold>&#x2013;<bold>4</bold> (<xref ref-type="bibr" rid="B22">Ishii and Ono, 1999</xref>).</p>
</sec>
<sec id="s3-4">
<title>Exploring the Impact of Two Different Solvation Models</title>
<p>The aforementioned calculations were performed using the IEFPCM implicit solvation model (<xref ref-type="bibr" rid="B45">Tomasi et&#x20;al., 1999</xref>). The SMD model (solvation model based on density) is an alternative implicit solvation model that considers more optimized radii for the bulk electrostatics contributions and has been demonstrated to result in more accurate solvation energies (<xref ref-type="bibr" rid="B29">Marenich et&#x20;al., 2009</xref>). In order to explore whether the SMD solvation model provides further accuracy improvements to the NMR predictions for apiose and its esters, we calculated the <sup>13</sup>C MAEs for the top 6 FBS combinations using an SMD solvation model. We observed that for all 6 FBS methods, SMD solvation resulted in improvements to the NMR predictions, with the most significant being for FBS 16, 1, 23, and 19 (MAE improvements of 0.36, 0.33, 0.33, and 0.29&#xa0;ppm, respectively), while FBS 7 and 8 demonstrated only marginal improvements (MAE improvements of 0.06 and 0.03&#xa0;ppm).</p>
</sec>
</sec>
<sec sec-type="discussion" id="s4">
<title>Discussion</title>
<p>RG-II has numerous exotic sugar moieties that have not been extensively characterized through experiments or computational models, and the protocols recommended in the literature for NMR chemical shift predictions for carbohydrates remain to be evaluated for these sugars. Here, we have shown that common techniques, such as gas-phase geometry optimizations and the use of scaling factors to convert chemical shifts, lacked accurate predictive capabilities for the NMR chemical shifts of the apiosyl compounds of RG-II. Modeling of this system is complicated by the diastereomers, charged states, dimerized configurations, solvent interactions, and the effects of&#x20;boron.</p>
<p>
<xref ref-type="fig" rid="F6">Figure&#x20;6</xref> illustrates the impact of the consideration of solvation for geometry optimizations and NMR shielding constant calculations (with each quadrant representing a different solvation combination) for the four compounds of interest (indicated by the four different colors). For each NMR FBS combination, the vertical and horizontal arrows are quantitative representations of MAE improvements upon solvation for the geometry optimizations and shielding constant calculations, respectively. The arrows point to the protocol that enables the highest improvements in MAE for computational predictions. The vertical arrows, which are much larger, consistently point to the lower quadrants, showing the large and consistent benefit of using implicit solvation during geometry optimizations. Modeling implicit solvation benefits all compounds, particularly charged compounds <bold>2</bold>&#x2013;<bold>4</bold>. The shorter horizontal arrows indicate a lower benefit of applying implicit solvation during shielding constant calculation.</p>
<fig id="F6" position="float">
<label>FIGURE 6</label>
<caption>
<p>Mean absolute error (MAE) improvements upon solvation for the top-performing nuclear magnetic resonance (NMR) functional&#x2013;basis sets (FBS): 1 <bold>(A)</bold>, 19 <bold>(B)</bold>, 23 <bold>(C)</bold>, 16 <bold>(D)</bold>, 7 <bold>(E)</bold>, and 8 <bold>(F)</bold>. Arrows are color-coded according to the compound number. Vertical arrows indicate MAE improvements due to the consideration of solvation during the geometry optimization, while horizontal arrows signify improvements during shielding constant calculations. The arrows point into the quadrant showing the most improvement. The listed MAE values (ppm) are the average of all compounds for that solvent combination. Their size indicates the size of MAE improvement upon a change in solvation model, where the full-length of a box is equal to an MAE improving by 1.23&#xa0;ppm.</p>
</caption>
<graphic xlink:href="fmolb-08-756219-g006.tif"/>
</fig>
<p>It may be noted that based on the MAE value alone, NMR FBS 23 and 16 appeared to perform best in the lower left quadrant with gas-phase NMR shielding constant calculations. However, a closer look at the gas-phase NMR predictions (<xref ref-type="sec" rid="s11">Supplementary Table S8</xref>) revealed an incorrect ordering of the shifts, with C2/C3 either being in the wrong order or being too similar to distinguish. When considering chemical shift ordering, the <italic>r</italic>
<sup>2</sup> values, and SDs in addition to the MAE values across all four quadrants (<xref ref-type="sec" rid="s11">Supplementary Table S9</xref>), the lower right quadrant (both implicit solvation) was the most accurate solvation combination for all NMR&#x20;FBS.</p>
<p>While there are examples of NMR predictions of carbohydrates in the literature, these primarily studied uncharged compounds without solvent interactions. Our protocol, when compared with other studies in the literature, more accurately reproduced shift values for both charged and uncharged carbohydrates (<xref ref-type="bibr" rid="B42">Taubert et&#x20;al., 2005</xref>; <xref ref-type="bibr" rid="B22">Ishii and Ono, 1999</xref>; <xref ref-type="bibr" rid="B47">Toukach and Ananikov, 2013b</xref>; <xref ref-type="bibr" rid="B13">Che&#x142;mecka et&#x20;al., 2007</xref>; <xref ref-type="bibr" rid="B11">Boese and Martin, 2004</xref>). Taubert et&#x20;al. utilized B3LYP/TZVP (NMR FBS 17) without solvation to predict the chemical shifts of a few saccharides (MAE&#xa0;&#x3d;&#xa0;2.28 ppm, <italic>r</italic>
<sup>2</sup>&#xa0;&#x3d;&#xa0;0.97) (<xref ref-type="bibr" rid="B42">Taubert et&#x20;al., 2005</xref>), but our top-performing NMR FBS coupled with implicit solvation for our compounds produced better MAE (1.50&#xa0;ppm) and <italic>r</italic>
<sup>2</sup> value (0.99) than those in their study. Additionally, Taubert&#x2019;s top-performing method (NMR FBS 17 in this study) was not adept at modeling compounds <bold>1</bold>&#x2013;<bold>4</bold>, as seen in <xref ref-type="fig" rid="F5">Figure&#x20;5</xref> (MAE&#xa0;&#x3d;&#xa0;2.89&#xa0;ppm) (<xref ref-type="bibr" rid="B42">Taubert et&#x20;al., 2005</xref>).</p>
<p>The accuracy of our top-performing NMR FBS 1 paired with implicit solvation can also be analyzed to report a root mean squared error (RMSD) of 1.82&#xa0;ppm, which is equivalent to the MAE value of 1.51&#xa0;ppm (<xref ref-type="sec" rid="s11">Supplementary Table S6</xref>). This is an improvement over the use of B3LYP/6-31G(d) (NMR FBS 1), as done by Chelmecka et&#x20;al., for studying glucopyranoside derivatives as their non-solvated gas-phase geometry optimizations and NMR shielding constant calculations yielded larger RMSEs (3&#x2013;6&#xa0;ppm) (<xref ref-type="bibr" rid="B47">Toukach and Ananikov, 2013b</xref>; <xref ref-type="bibr" rid="B13">Che&#x142;mecka et&#x20;al., 2007</xref>). On the other hand, Kupka et&#x20;al. reported a better RMSE value of 1.75&#xa0;ppm during their study of glucose derivatives; they did not consider solvent effects when they used B3PW91/6-31&#x2b;G&#x2a; [comparable to B3PW91/6-31&#x2b;G(d), NMR FBS 19] (<xref ref-type="bibr" rid="B47">Toukach and Ananikov, 2013b</xref>; <xref ref-type="bibr" rid="B26">Kupka et&#x20;al., 1999</xref>). Applying NMR FBS 19 to our protocol showed that implicit solvation was required to correctly predict shift order (<xref ref-type="sec" rid="s11">Supplementary Table S8</xref>). Belyakov et&#x20;al. studied saccharides, but their use of PBE/TZ2p (NMR FBS 21) in gas-phase conditions without any solvation considerations yielded a higher RMSE of 2.43 ppm, and our study of NMR FBS 21 showed that it was not adequate at modeling compounds <bold>1</bold>&#x2013;<bold>4</bold> (MAE&#xa0;&#x3d;&#xa0;2.04&#xa0;ppm) (<xref ref-type="bibr" rid="B7">Belyakov and Ananikov, 2011</xref>; <xref ref-type="bibr" rid="B47">Toukach and Ananikov, 2013b</xref>).</p>
<p>Some of the protocols recommended in the literature used a different type of correction&#x2014;the scaling factor correction&#x2014;to convert computed isotropic magnetic shielding values to chemical shifts (<xref ref-type="bibr" rid="B5">Bagno et&#x20;al., 2007</xref>; <xref ref-type="bibr" rid="B35">Pierens, 2014</xref>). This uses a computational method on a set of compounds, which are then compared to their experimental values so as to calculate scaling factors to convert computed isotropic magnetic shielding values to chemical shift data using a non-constant correction. Pierens et&#x20;al. used scaling factors to predict experimental values with MAEs of &#x3c;1.94 for <sup>13</sup>C and 0.154&#xa0;ppm for <sup>1</sup>H for a non-carbohydrate compound using the same geometry optimization FBS of B3LYP/6-31&#x2b;G(d,p) as in this study, but with non-solvated gas-phase optimizations, combined with NMR FBS 1&#x2013;10, using a mix of gas-phase and implicit solvation models. We found that Pierens&#x2019;s scaling factor values were unable to reliably predict the NMR data for compounds <bold>1</bold>&#x2013;<bold>4</bold>, yielding errors of 2.5&#x2013;8.8&#xa0;ppm (<xref ref-type="sec" rid="s11">Supplementary Table S10</xref>). Additionally, our protocol of modeling implicit solvation for geometry optimizations and shielding constant calculations followed by a constant reference correction method produced lower MAEs (30% lower for <sup>13</sup>C and 15% lower for <sup>1</sup>H) than those calculated using the scaling factor correction reported by Pierens.</p>
<p>We also quantified the effect of implicit solvation on the performance of the scaling factors, showing in <xref ref-type="sec" rid="s11">Supplementary Table S10</xref> that consideration of implicit solvation during both steps greatly reduced the MAEs and improved the relative shift ordering regardless of the correction method. Additionally, considering implicit solvation improved the reference correction method regardless of the reference compound. While using purely gas-phase conditions, TMS as a reference performed rather poorly compared to using methanol. However, as seen in <xref ref-type="sec" rid="s11">Supplementary Table S5</xref>, the accuracy of using any reference greatly improved when implicit solvation was used during both steps, to the extent that TMS could yield lower MAEs than methanol for certain methods. As discussed in <italic>Supplementary Information</italic>, we continued to use methanol due to other benefits besides MAE and to match the experimental reference, as recommended in the literature (<xref ref-type="bibr" rid="B26">Kupka et&#x20;al., 1999</xref>; <xref ref-type="bibr" rid="B42">Taubert et&#x20;al., 2005</xref>; <xref ref-type="bibr" rid="B36">Sarotti and Pellegrinet, 2009</xref>; <xref ref-type="bibr" rid="B35">Pierens, 2014</xref>). This study demonstrated that implicit solvation improved predictions regardless of the correction method or the reference&#x20;used.</p>
<p>Another type of correction uses an internal reference. Bagno et&#x20;al. referenced the predicted chemical shifts of all carbon nuclei in glucose to the known isotropic magnetic shielding value of anomeric carbon instead of using a reference compound (<xref ref-type="bibr" rid="B5">Bagno et&#x20;al., 2007</xref>). In addition to the limited applicability of internal references, the anomeric atoms in our study tended to have higher errors (<xref ref-type="sec" rid="s11">Supplementary Table S6</xref>), making this approach unreasonable. Additionally, Bagno et&#x20;al. only studied one compound, so their low MAE of 1.12&#xa0;ppm was for one FBS that was optimized for one neutral sugar complex, just as our MAE for NMR FBS 1 for compound <bold>3</bold> was only 1.11&#xa0;ppm. While Bagno et&#x20;al. showed promise in using MD to take a sampling of 50&#x2013;100 geometry conformations and averaging their chemical shift predictions, our NMR FBS 3&#x2013;4 (similar to Bagno&#x2019;s use of B3LYP/cc-pVTZ to reoptimize structures following MD simulations) did not perform well when applied to compounds <bold>1</bold>&#x2013;<bold>4</bold>.</p>
<p>The metrics by which we measured the efficacy of a method can also guide improvements to the protocol. On the one hand, a well-performing protocol (i.e.,&#x20;low MAE value) that exhibited high precision (i.e.,&#x20;<italic>r</italic>
<sup>2</sup> close to 1) but had low accuracy (i.e.,&#x20;a large MSE) was likely limited by constant systematic error that still was able to replicate the relative ordering of shifts. On the other hand, a well-performing protocol (i.e.,&#x20;low MAE value) that exhibited high accuracy (i.e.,&#x20;a small MSE) but had low precision (i.e.,&#x20;<italic>r</italic>
<sup>2</sup> far from 1) was likely limited by a non-constant error that significantly hindered the applicability of the protocol for chemical shift predictions.</p>
<p>NMR FBS 16 exhibited high precision, as evidenced by the best <italic>r</italic>
<sup>2</sup> value and the lowest SD, shown in <xref ref-type="table" rid="T3">Table&#x20;3</xref> and <xref ref-type="sec" rid="s11">Supplementary Table S9</xref>. However, out of all NMR FBS tested, 16 had the second largest MSE, which indicates very low accuracy. This can be observed in <xref ref-type="fig" rid="F7">Figure&#x20;7</xref> where the red linear best-fit line of NMR FBS 16 was significantly but consistently offset from the experiment&#x2019;s black line. NMR FBS 16 is an example where the protocol predicting the relative shift values and ordering to high accuracy, but is unable to perfectly replicate the magnitude of the chemical&#x20;shift.</p>
<fig id="F7" position="float">
<label>FIGURE 7</label>
<caption>
<p>Balancing accuracy with precision. Comparison of the top 6 performing nuclear magnetic resonance (NMR) functional&#x2013;basis sets (FBS): 1 <bold>(A)</bold>, 19 <bold>(B)</bold>, 23 <bold>(C)</bold>, 16 <bold>(D)</bold>, 7 <bold>(E)</bold>, and 8 <bold>(F)</bold>. Implicit solvation was modeled for both geometry optimizations and shielding constant calculations. Each of the six carbons is denoted by a different shape, while each of the four compounds is denoted by a different color. The black line represents the literature values for the experimental chemical shifts, whereas the red line represents the best-fit line for the computed chemical shifts used to extract the r<sup>2</sup> values.</p>
</caption>
<graphic xlink:href="fmolb-08-756219-g007.tif"/>
</fig>
<p>
<xref ref-type="fig" rid="F7">Figure&#x20;7</xref> illustrates that the computed values of the top 6 NMR FBS combinations were typically overpredicted compared to the expected experimental values, as seen by the red line (linear fit of the computed values) being above the black line. NMR FBS 8, on the other hand, had the best (smallest) MSE values while still maintaining one of the best <italic>r</italic>
<sup>2</sup> values, suggesting high accuracy that came at the price of having a poor MAE value. <xref ref-type="fig" rid="F7">Figure&#x20;7</xref> shows that, unlike the constant offset displayed by NMR FBS 16, both NMR FBS 7 and 8 had different slopes from those of the experimental values, suggesting that they could benefit from a non-constant correction, such as the scaling correction method employed by <xref ref-type="bibr" rid="B35">Pierens (2014)</xref>. This is consistent with <xref ref-type="sec" rid="s11">Supplementary Table S10</xref>, where our application of Pierens&#x2019;s non-constant scaling factors performed best for NMR FBS 7 and 8, with MAEs of just over 2.5&#xa0;ppm. Conversely, the non-constant correction performed worst for NMR FBS 1, whose isotropic magnetic shielding values already showed a linear relationship with the experimental values and greatly benefited from a constant reference correction.</p>
<p>While some studies have suggested that a change in the basis set does not significantly change the predicted shift values (<xref ref-type="bibr" rid="B8">Benassi, 2017</xref>), our findings indicated that this does not apply for methyl apioside and its borate esters. While we did see an example of this behavior when the recommended NMR FBS of B3LYP/6-31G&#x2a; (<xref ref-type="bibr" rid="B13">Che&#x142;mecka et&#x20;al., 2007</xref>) produced indistinguishable chemical shifts from NMR FBS 1 that used B3LYP/6-31G(d), the majority of our results pointed to the importance of specifying a FBS combination as opposed to merely recommending a functional for chemical shift predictions. While NMR FBS 23 [WP04/6-31G(d)] was our third top-performing method with an MAE of 1.79&#xa0;ppm, changing the basis set to aug-cc-pVDZ (NMR FBS 9) increased the MAE by 0.1&#xa0;ppm, as seen in <xref ref-type="fig" rid="F5">Figure&#x20;5</xref>. Conversely, changing the functional to B3LYP (NMR FBS 1) reduced the MAE by 0.29&#xa0;ppm. Similar trends were observed when comparing NMR FBS 1&#x2013;4, 13, 17, and 20, as they all shared the same functional (B3LYP) with different basis sets and thus yield various chemical shift predictions. Additionally, while the literature suggested that very large basis sets are needed to obtain accurate NMR predictions (<xref ref-type="bibr" rid="B42">Taubert et&#x20;al., 2005</xref>), we did not find this to be the case for our system as our top-performing NMR FBS combinations did not have particularly large basis sets. These findings, in agreement with similar observations by <xref ref-type="bibr" rid="B27">Kupka et&#x20;al. (2010)</xref> and <xref ref-type="bibr" rid="B4">Ahmed et&#x20;al. (2020)</xref>, highlighted the fact that generalized rules of using large basis sets do not inherently lead to accurate chemical shift predictions and thereby necessitate the evaluation of the efficacy of each combination.</p>
</sec>
<sec sec-type="conclusion" id="s5">
<title>Conclusion</title>
<p>In this study, we evaluated the efficacy of various functional and basis set combinations and implicit solvation protocols for the accurate prediction of <sup>13</sup>C chemical shifts for methyl apioside and its borate derivatives. In addition to the improved <sup>13</sup>C shift predictions compared to those in recent literature on carbohydrates, we have been able to replicate the correct ordering of shifts for <sup>13</sup>C data and distinguish between the apiofuranosides upon dimerization. We achieved this by considering implicit solvation during both the geometry optimization and the NMR shielding constant calculations. This protocol correctly predicted the same 6- to 7-ppm change in <sup>13</sup>C chemical shift for C2/C3 that experimentalists used to distinguish between compound <bold>1</bold> and its borate esters (<bold>2</bold>&#x2013;<bold>4</bold>) (<xref ref-type="bibr" rid="B22">Ishii and Ono, 1999</xref>).</p>
<p>Among the 23 FBS combinations that were assessed based on the SD values, <italic>r</italic>
<sup>2</sup> analyses, and correct shift ordering, we observed four combinations (FBS 1, 16, 19, and 23) that all reproduced the correct ordering and were well within 2&#xa0;ppm of the experimental values. We observed that our protocol was also able to demonstrate relatively accurate <sup>1</sup>H predictions in most cases, with similar MAPE values to those observed for <sup>13</sup>C. In some instances, the ordering of the <sup>1</sup>H shifts was incorrectly predicted and could be attributed to the proton shifts of apiose being highly similar, a fact that was also experienced by experimentalists who struggled to distinguish them without using two-dimensional NMR (<xref ref-type="bibr" rid="B22">Ishii and Ono, 1999</xref>). Specifically, our protocol did not replicate the very precise 0.003-ppm change in the <sup>1</sup>H chemical shift for H7 that experimentalists noted to distinguish between the borate esters (<bold>2</bold>&#x2013;<bold>4</bold>).</p>
<p>From the RG-II context, in the future, in addition to being able to predict the ordering of <sup>1</sup>H shifts to greater accuracy, we would also like to extend our protocol to predict chemical shifts in other sugar moieties such as galactose. We believe that this protocol can successfully be applied to study other complex RG-II mono- and oligosaccharides with NMR spectra that are difficult to assign and will play a key role in aiding our understanding of the role of specific sugars in the RG-II dimerization process.</p>
<p>In general, the use of NMR shifts to characterize non-covalent interactions and structural conformations is a crucial approach toward resolving the conformational ambiguities in complex organic molecules (<xref ref-type="bibr" rid="B3">Adams and Lerner, 1992</xref>; <xref ref-type="bibr" rid="B38">Siskos et&#x20;al., 2015</xref>). The potential of computational predictions in elucidating structure&#x2013;spectra relationships hinges on such detailed studies that are pivotal in enabling unique insights into the function and mechanisms of various chemical and biochemical systems.</p>
</sec>
</body>
<back>
<sec id="s6">
<title>Data Availability Statement</title>
<p>The original contributions presented in the study are included in the article/<xref ref-type="sec" rid="s11">Supplementary Material</xref>, further inquiries can be directed to the corresponding author.</p>
</sec>
<sec id="s7">
<title>Author Contributions</title>
<p>VB conceptualized the study and supervised and developed the methodology for the calculations. LW performed the calculations, curated the data, and led the formal analysis and visualization of the results along with VB. VB and LW wrote the manuscript. MC provided critical feedback on the manuscript.</p>
</sec>
<sec id="s8">
<title>Funding</title>
<p>This work was authored in part by the Alliance for Sustainable Energy, LLC, the manager and operator of the National Renewable Energy Laboratory for the U.S. Department of Energy (DOE) under contract no. DE-AC36-08GO28308. The authors acknowledge the Division of Chemical Sciences, Geosciences, and Biosciences, Office of Basic Energy Sciences of the United&#x20;States Department of Energy through grant DESC0008472 for funding structural studies of RG-II. We are also grateful for the access and use of NREL&#x2019;s Computational Sciences&#x2019; resources (Eagle) supported by the DOE Office of EERE under contract no. DE-AC36-08GO28308. This work was supported in part by the U.S. Department of Energy, Office of Science, Office of Workforce Development for Teachers and Scientists (WDTS) under the Science Undergraduate Laboratory Internship (SULI) program. This study received funding from US Department of Energy. The funder was not involved in the study design, collection, analysis, interpretation of data, the writing of this article or the decision to submit it for publication. All authors declare no other competing interests.</p>
</sec>
<sec sec-type="COI-statement" id="s9">
<title>Conflict of Interest</title>
<p>This study received funding from US Department of Energy. The funder was not involved in the study design, collection, analysis, interpretation of data, the writing of this article or the decision to submit it for publication. All authors declare no other competing interests.</p>
</sec>
<sec sec-type="disclaimer" id="s10">
<title>Publisher&#x2019;s Note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors, and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<sec id="s11">
<title>Supplementary Material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fmolb.2021.756219/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fmolb.2021.756219/full&#x23;supplementary-material</ext-link>
</p>
<supplementary-material xlink:href="DataSheet1.docx" id="SM1" mimetype="application/docx" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>REFERENCES</title>
<ref id="B1">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Adamo</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Barone</surname>
<given-names>V.</given-names>
</name>
</person-group> (<year>1998</year>). <article-title>Exchange Functionals with Improved Long-Range Behavior and Adiabatic Connection Methods without Adjustable Parameters: The mPW and mPW1PW Models</article-title>. <source>J.&#x20;Chem. Phys.</source> <volume>108</volume>, <fpage>664</fpage>&#x2013;<lpage>675</lpage>. <pub-id pub-id-type="doi">10.1063/1.475428</pub-id> </citation>
</ref>
<ref id="B2">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Adamo</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Barone</surname>
<given-names>V.</given-names>
</name>
</person-group> (<year>1999</year>). <article-title>Toward Reliable Density Functional Methods without Adjustable Parameters: The PBE0 Model</article-title>. <source>J.&#x20;Chem. Phys.</source> <volume>110</volume>, <fpage>6158</fpage>&#x2013;<lpage>6170</lpage>. <pub-id pub-id-type="doi">10.1063/1.478522</pub-id> </citation>
</ref>
<ref id="B3">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Adams</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Lerner</surname>
<given-names>L.</given-names>
</name>
</person-group> (<year>1992</year>). <article-title>Observation of Hydroxyl Protons of Sucrose in Aqueous Solution: No Evidence for Persistent Intramolecular Hydrogen Bonds</article-title>. <source>J.&#x20;Am. Chem. Soc.</source> <volume>114</volume>, <fpage>4827</fpage>&#x2013;<lpage>4829</lpage>. <pub-id pub-id-type="doi">10.1021/ja00038a055</pub-id> </citation>
</ref>
<ref id="B4">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ahmed</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Varras</surname>
<given-names>P. C.</given-names>
</name>
<name>
<surname>Siskos</surname>
<given-names>M. G.</given-names>
</name>
<name>
<surname>Siddiqui</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Choudhary</surname>
<given-names>M. I.</given-names>
</name>
<name>
<surname>Gerothanassis</surname>
<given-names>I. P.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>NMR and Computational Studies as Analytical and High-Resolution Structural Tool for Complex Hydroperoxides and Diastereomeric Endo-Hydroperoxides of Fatty Acids in Solution-Exemplified by Methyl Linolenate</article-title>. <source>Molecules</source> <volume>25</volume>, <fpage>4902</fpage>. <pub-id pub-id-type="doi">10.3390/molecules25214902</pub-id> </citation>
</ref>
<ref id="B5">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bagno</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Rastrelli</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Saielli</surname>
<given-names>G.</given-names>
</name>
</person-group> (<year>2007</year>). <article-title>Prediction of the 1H and 13C NMR Spectra of &#x3b1;-d-Glucose in Water by DFT Methods and MD Simulations</article-title>. <source>J.&#x20;Org. Chem.</source> <volume>72</volume>, <fpage>7373</fpage>&#x2013;<lpage>7381</lpage>. <pub-id pub-id-type="doi">10.1021/jo071129v</pub-id> </citation>
</ref>
<ref id="B6">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Becke</surname>
<given-names>A. D.</given-names>
</name>
</person-group> (<year>1993</year>). <article-title>Density&#x2010;functional Thermochemistry. III. The Role of Exact Exchange</article-title>. <source>J.&#x20;Chem. Phys.</source> <volume>98</volume>, <fpage>5648</fpage>&#x2013;<lpage>5652</lpage>. <pub-id pub-id-type="doi">10.1063/1.464913</pub-id> </citation>
</ref>
<ref id="B7">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Belyakov</surname>
<given-names>P. A.</given-names>
</name>
<name>
<surname>Ananikov</surname>
<given-names>V. P.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>Erratum to: "Modeling of NMR Spectra and Signal Assignment Using Real-Time DFT/GIAO Calculations"</article-title>. <source>Russ. Chem. Bull.</source> <volume>60</volume>, <fpage>2626</fpage>. <pub-id pub-id-type="doi">10.1007/s11172-011-0405-3</pub-id> </citation>
</ref>
<ref id="B8">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Benassi</surname>
<given-names>E.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Benchmarking of Density Functionals for a Soft but Accurate Prediction and Assignment of1H and13C NMR Chemical Shifts in Organic and Biological Molecules</article-title>. <source>J.&#x20;Comput. Chem.</source> <volume>38</volume>, <fpage>87</fpage>&#x2013;<lpage>92</lpage>. <pub-id pub-id-type="doi">10.1002/jcc.24521</pub-id> </citation>
</ref>
<ref id="B9">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bharadwaj</surname>
<given-names>V. S.</given-names>
</name>
<name>
<surname>Crowley</surname>
<given-names>M. F.</given-names>
</name>
<name>
<surname>Pe&#xf1;a</surname>
<given-names>M. J.</given-names>
</name>
<name>
<surname>Urbanowicz</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>O&#x2019;Neill</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Mechanism and Reaction Energy Landscape for Apiose Cross-Linking by Boric Acid in Rhamnogalacturonan II</article-title>. <source>J.&#x20;Phys. Chem. B</source> <volume>124</volume>, <fpage>10117</fpage>&#x2013;<lpage>10125</lpage>. <pub-id pub-id-type="doi">10.1021/acs.jpcb.0c06920</pub-id> </citation>
</ref>
<ref id="B10">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bifulco</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Dambruoso</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Gomez-Paloma</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Riccio</surname>
<given-names>R.</given-names>
</name>
</person-group> (<year>2007</year>). <article-title>Determination of Relative Configuration in Organic Compounds by NMR Spectroscopy and Computational Methods</article-title>. <source>Chem. Rev.</source> <volume>107</volume>, <fpage>3744</fpage>&#x2013;<lpage>3779</lpage>. <pub-id pub-id-type="doi">10.1021/cr030733c</pub-id> </citation>
</ref>
<ref id="B11">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Boese</surname>
<given-names>A. D.</given-names>
</name>
<name>
<surname>Martin</surname>
<given-names>J.&#x20;M. L.</given-names>
</name>
</person-group> (<year>2004</year>). <article-title>Development of Density Functionals for Thermochemical Kinetics</article-title>. <source>J.&#x20;Chem. Phys.</source> <volume>121</volume>, <fpage>3405</fpage>&#x2013;<lpage>3416</lpage>. <pub-id pub-id-type="doi">10.1063/1.1774975</pub-id> </citation>
</ref>
<ref id="B12">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bubb</surname>
<given-names>W. A.</given-names>
</name>
</person-group> (<year>2003</year>). <article-title>NMR Spectroscopy in the Study of Carbohydrates: Characterizing the Structural Complexity</article-title>. <source>Concepts Magn. Reson.</source> <volume>19A</volume>, <fpage>1</fpage>&#x2013;<lpage>19</lpage>. <pub-id pub-id-type="doi">10.1002/cmr.a.10080</pub-id> </citation>
</ref>
<ref id="B13">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Che&#x142;mecka</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Pasterny</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Gawlik</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Szeja</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Wrzalik</surname>
<given-names>R.</given-names>
</name>
</person-group> (<year>2007</year>). <article-title>Theoretical and Experimental Studies on Methyl Alpha-D-Glucopyranoside Derivatives</article-title>. <source>J.&#x20;Mol. Struct.</source> <volume>834-836</volume>, <fpage>498</fpage>&#x2013;<lpage>507</lpage>. </citation>
</ref>
<ref id="B14">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cramer</surname>
<given-names>C. J.</given-names>
</name>
<name>
<surname>Truhlar</surname>
<given-names>D. G.</given-names>
</name>
</person-group> (<year>1999</year>). <article-title>Implicit Solvation Models: Equilibria, Structure, Spectra, and Dynamics</article-title>. <source>Chem. Rev.</source> <volume>99</volume>, <fpage>2161</fpage>&#x2013;<lpage>2200</lpage>. <pub-id pub-id-type="doi">10.1021/cr960149m</pub-id> </citation>
</ref>
<ref id="B15">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ditchfield</surname>
<given-names>R.</given-names>
</name>
</person-group> (<year>1974</year>). <article-title>Self-consistent Perturbation Theory of Diamagnetism</article-title>. <source>Mol. Phys.</source> <volume>27</volume>, <fpage>789</fpage>&#x2013;<lpage>807</lpage>. <pub-id pub-id-type="doi">10.1080/00268977400100711</pub-id> </citation>
</ref>
<ref id="B16">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Duus</surname>
<given-names>J.&#x20;&#xd8;.</given-names>
</name>
<name>
<surname>Gotfredsen</surname>
<given-names>C. H.</given-names>
</name>
<name>
<surname>Bock</surname>
<given-names>K.</given-names>
</name>
</person-group> (<year>2000</year>). <article-title>Carbohydrate Structural Determination by NMR Spectroscopy: Modern Methods and Limitations</article-title>. <source>Chem. Rev.</source> <volume>100</volume>, <fpage>4589</fpage>&#x2013;<lpage>4614</lpage>. <pub-id pub-id-type="doi">10.1021/cr990302n</pub-id> </citation>
</ref>
<ref id="B17">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Duus</surname>
<given-names>J.&#x20;&#xd8;.</given-names>
</name>
<name>
<surname>Hilaire</surname>
<given-names>P. M. S.</given-names>
</name>
<name>
<surname>Meldal</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Bock</surname>
<given-names>K.</given-names>
</name>
</person-group> (<year>1999</year>). <article-title>Carbohydrate Chemistry: Synthetic and Structural Challenges towards the End of the 20th century</article-title>. <source>Pure Appl. Chem.</source> <volume>71</volume>, <fpage>755</fpage>&#x2013;<lpage>765</lpage>. <pub-id pub-id-type="doi">10.1351/pac199971050755</pub-id> </citation>
</ref>
<ref id="B18">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Frisch</surname>
<given-names>M. J.</given-names>
</name>
<name>
<surname>Trucks</surname>
<given-names>G. W.</given-names>
</name>
<name>
<surname>Schlegel</surname>
<given-names>H. B.</given-names>
</name>
<name>
<surname>Scuseria</surname>
<given-names>G. E.</given-names>
</name>
<name>
<surname>Robb</surname>
<given-names>M. A.</given-names>
</name>
<name>
<surname>Cheeseman</surname>
<given-names>J.&#x20;R.</given-names>
</name>
<etal/>
</person-group> (<year>2016</year>). <source>GaussView 5.0</source>. <publisher-loc>Wallingford</publisher-loc>: <publisher-name>E.U.A</publisher-name>, <fpage>1</fpage>. </citation>
</ref>
<ref id="B19">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Fulmer</surname>
<given-names>G. R.</given-names>
</name>
<name>
<surname>Miller</surname>
<given-names>A. J.&#x20;M.</given-names>
</name>
<name>
<surname>Sherden</surname>
<given-names>N. H.</given-names>
</name>
<name>
<surname>Gottlieb</surname>
<given-names>H. E.</given-names>
</name>
<name>
<surname>Nudelman</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Stoltz</surname>
<given-names>B. M.</given-names>
</name>
<etal/>
</person-group> (<year>2010</year>). <article-title>NMR Chemical Shifts of Trace Impurities: Common Laboratory Solvents, Organics, and Gases in Deuterated Solvents Relevant to the Organometallic Chemist</article-title>. <source>Organometallics</source> <volume>29</volume>, <fpage>2176</fpage>&#x2013;<lpage>2179</lpage>. <pub-id pub-id-type="doi">10.1021/om100106e</pub-id> </citation>
</ref>
<ref id="B20">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Helgaker</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Jaszu&#x144;ski</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Ruud</surname>
<given-names>K.</given-names>
</name>
</person-group> (<year>1999</year>). <article-title>Ab Initio Methods for the Calculation of NMR Shielding and Indirect Spin&#x2212;Spin Coupling Constants</article-title>. <source>Chem. Rev.</source> <volume>99</volume>, <fpage>293</fpage>&#x2013;<lpage>352</lpage>. <pub-id pub-id-type="doi">10.1021/cr960017t</pub-id> </citation>
</ref>
<ref id="B21">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Iron</surname>
<given-names>M. A.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Evaluation of the Factors Impacting the Accuracy of 13C NMR Chemical Shift Predictions Using Density Functional Theory-The Advantage of Long-Range Corrected Functionals</article-title>. <source>J.&#x20;Chem. Theor. Comput.</source> <volume>13</volume>, <fpage>5798</fpage>&#x2013;<lpage>5819</lpage>. <pub-id pub-id-type="doi">10.1021/acs.jctc.7b00772</pub-id> </citation>
</ref>
<ref id="B22">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ishii</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Ono</surname>
<given-names>H.</given-names>
</name>
</person-group> (<year>1999</year>). <article-title>Recovery and fine Structure Variability of RGII Sub-domains in Wine (Vitis vinifera Merlot)</article-title>. <source>Carbohydr. Res.</source> <volume>4</volume>, <fpage>1</fpage>. </citation>
</ref>
<ref id="B23">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ishii</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Yanagisawa</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>1998</year>). <article-title>Synthesis, Separation and NMR Spectral Analysis of Methyl Apiofuranosides</article-title>. <source>Carbohydr. Res.</source> <volume>313</volume>, <fpage>189</fpage>. <pub-id pub-id-type="doi">10.1016/S0008-6215(98)00262-6</pub-id> </citation>
</ref>
<ref id="B24">
<citation citation-type="book">
<person-group person-group-type="author">
<name>
<surname>Kemp</surname>
<given-names>M. T.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Optimizing Protocols for Carbohydrate NMR Chemical Shift Computations</article-title>. <source>Graduate Theses and Dissertations</source>. <comment>
<ext-link ext-link-type="uri" xlink:href="https://digitalcommons.usf.edu/etd/6273">https://digitalcommons.usf.edu/etd/6273</ext-link>
</comment>. </citation>
</ref>
<ref id="B25">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Krivdin</surname>
<given-names>L. B.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Computational Protocols for Calculating 13C NMR Chemical Shifts</article-title>. <source>Prog. Nucl. Magn. Reson. Spectrosc.</source> <volume>112-113</volume>, <fpage>103</fpage>&#x2013;<lpage>156</lpage>. <pub-id pub-id-type="doi">10.1016/j.pnmrs.2019.05.004</pub-id> </citation>
</ref>
<ref id="B26">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kupka</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Pasterna</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Lodowski</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Szeja</surname>
<given-names>W.</given-names>
</name>
</person-group> (<year>1999</year>). <article-title>GIAO-DFT Prediction of Accurate NMR Parameters in Selected Glucose Derivatives</article-title>. <source>Magn. Reson. Chem.</source> <volume>37</volume>, <fpage>421</fpage>&#x2013;<lpage>426</lpage>. <pub-id pub-id-type="doi">10.1002/(sici)1097-458x(199906)37:6&#x3c;421:aid-mrc479&#x3e;3.0.co;2-w</pub-id> </citation>
</ref>
<ref id="B27">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kupka</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Stach&#xf3;w</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Nieradka</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Kaminsky</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Pluta</surname>
<given-names>T.</given-names>
</name>
</person-group> (<year>2010</year>). <article-title>Convergence of Nuclear Magnetic Shieldings in the Kohn&#x2212;Sham Limit for Several Small Molecules</article-title>. <source>J.&#x20;Chem. Theor. Comput.</source> <volume>6</volume>, <fpage>1580</fpage>&#x2013;<lpage>1589</lpage>. <pub-id pub-id-type="doi">10.1021/ct100109j</pub-id> </citation>
</ref>
<ref id="B28">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lodewyk</surname>
<given-names>M. W.</given-names>
</name>
<name>
<surname>Siebert</surname>
<given-names>M. R.</given-names>
</name>
<name>
<surname>Tantillo</surname>
<given-names>D. J.</given-names>
</name>
</person-group> (<year>2012</year>). <article-title>Computational Prediction of 1H and 13C Chemical Shifts: A Useful Tool for Natural Product, Mechanistic, and Synthetic Organic Chemistry</article-title>. <source>Chem. Rev.</source> <volume>112</volume>, <fpage>1839</fpage>&#x2013;<lpage>1862</lpage>. <pub-id pub-id-type="doi">10.1021/cr200106v</pub-id> </citation>
</ref>
<ref id="B29">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Marenich</surname>
<given-names>A. V.</given-names>
</name>
<name>
<surname>Cramer</surname>
<given-names>C. J.</given-names>
</name>
<name>
<surname>Truhlar</surname>
<given-names>D. G.</given-names>
</name>
</person-group> (<year>2009</year>). <article-title>Universal Solvation Model Based on Solute Electron Density and on a Continuum Model of the Solvent Defined by the Bulk Dielectric Constant and Atomic Surface Tensions</article-title>. <source>J.&#x20;Phys. Chem. B</source> <volume>113</volume>, <fpage>6378</fpage>&#x2013;<lpage>6396</lpage>. <pub-id pub-id-type="doi">10.1021/jp810292n</pub-id> </citation>
</ref>
<ref id="B30">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Ndeh</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Rogowski</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Cartmell</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Luis</surname>
<given-names>A. S.</given-names>
</name>
<name>
<surname>Basl&#xe9;</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Gray</surname>
<given-names>J.</given-names>
</name>
<etal/>
</person-group> (<year>2017</year>). <article-title>Complex Pectin Metabolism by Gut Bacteria Reveals Novel Catalytic Functions</article-title>. <source>Nature</source> <volume>544</volume>, <fpage>65</fpage>&#x2013;<lpage>70</lpage>. <pub-id pub-id-type="doi">10.1038/nature21725</pub-id> </citation>
</ref>
<ref id="B31">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>O&#x27;Neill</surname>
<given-names>M. A.</given-names>
</name>
<name>
<surname>Ishii</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Albersheim</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Darvill</surname>
<given-names>A. G.</given-names>
</name>
</person-group> (<year>2004</year>). <article-title>RHAMNOGALACTURONAN II: Structure and Function of a Borate Cross-Linked Cell Wall Pectic Polysaccharide</article-title>. <source>Annu. Rev. Plant Biol.</source> <volume>55</volume>, <fpage>109</fpage>&#x2013;<lpage>139</lpage>. <pub-id pub-id-type="doi">10.1146/annurev.arplant.55.031903.141750</pub-id> </citation>
</ref>
<ref id="B32">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>O&#x27;Neill</surname>
<given-names>M. A.</given-names>
</name>
<name>
<surname>Warrenfeltz</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Kates</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Pellerin</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Doco</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Darvill</surname>
<given-names>A. G.</given-names>
</name>
<etal/>
</person-group> (<year>1996</year>). <article-title>Rhamnogalacturonan-II, a Pectic Polysaccharide in the Walls of Growing Plant Cell, Forms a Dimer that Is Covalently Cross-Linked by a Borate Ester</article-title>. <source>J.&#x20;Biol. Chem.</source> <volume>271</volume>, <fpage>22923</fpage>&#x2013;<lpage>22930</lpage>. <pub-id pub-id-type="doi">10.1074/jbc.271.37.22923</pub-id> </citation>
</ref>
<ref id="B33">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>O&#x2019;Neill</surname>
<given-names>M. A.</given-names>
</name>
<name>
<surname>Black</surname>
<given-names>I.</given-names>
</name>
<name>
<surname>Urbanowicz</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Bharadwaj</surname>
<given-names>V.</given-names>
</name>
<name>
<surname>Crowley</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Koj</surname>
<given-names>S.</given-names>
</name>
<etal/>
</person-group> (<year>2020</year>). <article-title>Locating Methyl-Etherified and Methyl-Esterified Uronic Acids in the Plant Cell Wall Pectic Polysaccharide Rhamnogalacturonan II</article-title>. <source>SLAS TECHNOLOGY: Translating Life Sci. Innovation</source> <volume>25</volume>, <fpage>329</fpage>&#x2013;<lpage>344</lpage>. </citation>
</ref>
<ref id="B34">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Perdew</surname>
<given-names>J.&#x20;P.</given-names>
</name>
<name>
<surname>Burke</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Ernzerhof</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>1996</year>). <article-title>Generalized Gradient Approximation Made Simple</article-title>. <source>Phys. Rev. Lett.</source> <volume>77</volume>, <fpage>3865</fpage>&#x2013;<lpage>3868</lpage>. <pub-id pub-id-type="doi">10.1103/physrevlett.77.3865</pub-id> </citation>
</ref>
<ref id="B35">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pierens</surname>
<given-names>G. K.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>1&#x20;H and 13 C NMR Scaling Factors for the Calculation of Chemical Shifts in Commonly Used Solvents Using Density Functional Theory</article-title>. <source>J.&#x20;Comput. Chem.</source> <volume>35</volume>, <fpage>1388</fpage>&#x2013;<lpage>1394</lpage>. <pub-id pub-id-type="doi">10.1002/jcc.23638</pub-id> </citation>
</ref>
<ref id="B36">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sarotti</surname>
<given-names>A. M.</given-names>
</name>
<name>
<surname>Pellegrinet</surname>
<given-names>S. C.</given-names>
</name>
</person-group> (<year>2009</year>). <article-title>A Multi-Standard Approach for GIAO 13C NMR Calculations</article-title>. <source>J.&#x20;Org. Chem.</source> <volume>74</volume>, <fpage>7254</fpage>&#x2013;<lpage>7260</lpage>. <pub-id pub-id-type="doi">10.1021/jo901234h</pub-id> </citation>
</ref>
<ref id="B37">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Schreckenbach</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Ziegler</surname>
<given-names>T.</given-names>
</name>
</person-group> (<year>1995</year>). <article-title>Calculation of NMR Shielding Tensors Using Gauge-Including Atomic Orbitals and Modern Density Functional Theory</article-title>. <source>J.&#x20;Phys. Chem.</source> <volume>99</volume>, <fpage>606</fpage>&#x2013;<lpage>611</lpage>. <pub-id pub-id-type="doi">10.1021/j100002a024</pub-id> </citation>
</ref>
<ref id="B38">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Siskos</surname>
<given-names>M. G.</given-names>
</name>
<name>
<surname>Tzakos</surname>
<given-names>A. G.</given-names>
</name>
<name>
<surname>Gerothanassis</surname>
<given-names>I. P.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>Accurate Ab Initio Calculations of O-H&#x22ef;O and O-H&#x22ef;&#x2212;O Proton Chemical Shifts: towards Elucidation of the Nature of the Hydrogen Bond and Prediction of Hydrogen Bond Distances</article-title>. <source>Org. Biomol. Chem.</source> <volume>13</volume>, <fpage>8852</fpage>&#x2013;<lpage>8868</lpage>. <pub-id pub-id-type="doi">10.1039/c5ob00920k</pub-id> </citation>
</ref>
<ref id="B39">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tantillo</surname>
<given-names>D. J.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>Walking in the Woods with Quantum Chemistry - Applications of Quantum Chemical Calculations in Natural Products Research</article-title>. <source>Nat. Prod. Rep.</source> <volume>30</volume>, <fpage>1079</fpage>&#x2013;<lpage>1086</lpage>. <pub-id pub-id-type="doi">10.1039/c3np70028c</pub-id> </citation>
</ref>
<ref id="B40">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tao</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Perdew</surname>
<given-names>J.&#x20;P.</given-names>
</name>
<name>
<surname>Staroverov</surname>
<given-names>V. N.</given-names>
</name>
<name>
<surname>Scuseria</surname>
<given-names>G. E.</given-names>
</name>
</person-group> (<year>2003</year>). <article-title>Climbing the Density Functional Ladder: Nonempirical Meta-Generalized Gradient Approximation Designed for Molecules and Solids</article-title>. <source>Phys. Rev. Lett.</source> <volume>91</volume>, <fpage>146401</fpage>. <pub-id pub-id-type="doi">10.1103/physrevlett.91.146401</pub-id> </citation>
</ref>
<ref id="B41">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tarazona</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Benedit</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Fern&#xe1;ndez</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>P&#xe9;rez</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Rodr&#xed;guez</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Jim&#xe9;nez</surname>
<given-names>C.</given-names>
</name>
<etal/>
</person-group> (<year>2018</year>). <article-title>Can Stereoclusters Separated by Two Methylene Groups Be Related by DFT Studies? the Case of the Cytotoxic Meroditerpenes Halioxepines</article-title>. <source>J.&#x20;Nat. Prod.</source> <volume>81</volume>, <fpage>343</fpage>&#x2013;<lpage>348</lpage>. <pub-id pub-id-type="doi">10.1021/acs.jnatprod.7b00807</pub-id> </citation>
</ref>
<ref id="B42">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Taubert</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Konschin</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Sundholm</surname>
<given-names>D.</given-names>
</name>
</person-group> (<year>2005</year>). <article-title>Computational Studies of 13C NMR Chemical Shifts of Saccharides</article-title>. <source>Phys. Chem. Chem. Phys.</source> <volume>7</volume>, <fpage>2561</fpage>&#x2013;<lpage>2569</lpage>. <pub-id pub-id-type="doi">10.1039/b505546f</pub-id> </citation>
</ref>
<ref id="B43">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Teale</surname>
<given-names>A. M.</given-names>
</name>
<name>
<surname>Lutn&#xe6;s</surname>
<given-names>O. B.</given-names>
</name>
<name>
<surname>Helgaker</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Tozer</surname>
<given-names>D. J.</given-names>
</name>
<name>
<surname>Gauss</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>Benchmarking Density-Functional Theory Calculations of NMR Shielding Constants and Spin-Rotation Constants Using Accurate Coupled-Cluster Calculations</article-title>. <source>J.&#x20;Chem. Phys.</source> <volume>138</volume>, <fpage>024111</fpage>. <pub-id pub-id-type="doi">10.1063/1.4773016</pub-id> </citation>
</ref>
<ref id="B44">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tomasi</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Mennucci</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Cammi</surname>
<given-names>R.</given-names>
</name>
</person-group> (<year>2005</year>). <article-title>Quantum Mechanical Continuum Solvation Models</article-title>. <source>Chem. Rev.</source> <volume>105</volume>, <fpage>2999</fpage>&#x2013;<lpage>3094</lpage>. <pub-id pub-id-type="doi">10.1021/cr9904009</pub-id> </citation>
</ref>
<ref id="B45">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Tomasi</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Mennucci</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Canc&#xe8;s</surname>
<given-names>E.</given-names>
</name>
</person-group> (<year>1999</year>). <article-title>The IEF Version of the PCM Solvation Method: an Overview of a New Method Addressed to Study Molecular Solutes at the QM Ab Initio Level</article-title>. <source>J.&#x20;Mol. Struct. THEOCHEM</source> <volume>464</volume>, <fpage>211</fpage>&#x2013;<lpage>226</lpage>. <pub-id pub-id-type="doi">10.1016/s0166-1280(98)00553-3</pub-id> </citation>
</ref>
<ref id="B46">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Toukach</surname>
<given-names>F. V.</given-names>
</name>
<name>
<surname>Ananikov</surname>
<given-names>V. P.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>Recent Advances in Computational Predictions of NMR Parameters for the Structure Elucidation of Carbohydrates: Methods and Limitations</article-title>. <source>Chem. Soc. Rev.</source> <volume>42</volume>, <fpage>8376</fpage>&#x2013;<lpage>8415</lpage>. <pub-id pub-id-type="doi">10.1039/c3cs60073d</pub-id> </citation>
</ref>
<ref id="B47">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Toukach</surname>
<given-names>F. V.</given-names>
</name>
<name>
<surname>Ananikov</surname>
<given-names>V. P.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>Recent Advances in Computational Predictions of NMR Parameters for the Structure Elucidation of Carbohydrates: Methods and Limitations</article-title>. <source>Chem. Soc. Rev.</source> <volume>40</volume>, <fpage>1</fpage>. </citation>
</ref>
<ref id="B48">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Van Halbeek</surname>
<given-names>H.</given-names>
</name>
</person-group> (<year>1994</year>). <article-title>NMR Developments in Structural Studies of Carbohydrates and Their Complexes</article-title>. <source>Curr. Opin. Struct. Biol.</source> <volume>4</volume>, <fpage>697</fpage>&#x2013;<lpage>709</lpage>. <pub-id pub-id-type="doi">10.1016/s0959-440x(94)90168-6</pub-id> </citation>
</ref>
<ref id="B49">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Wiitala</surname>
<given-names>K. W.</given-names>
</name>
<name>
<surname>Hoye</surname>
<given-names>T. R.</given-names>
</name>
<name>
<surname>Cramer</surname>
<given-names>C. J.</given-names>
</name>
</person-group> (<year>2006</year>). <article-title>Hybrid Density Functional Methods Empirically Optimized for the Computation of 13C and 1H Chemical Shifts in Chloroform Solution</article-title>. <source>J.&#x20;Chem. Theor. Comput.</source> <volume>2</volume>, <fpage>1085</fpage>&#x2013;<lpage>1092</lpage>. <pub-id pub-id-type="doi">10.1021/ct6001016</pub-id> </citation>
</ref>
<ref id="B50">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Willoughby</surname>
<given-names>P. H.</given-names>
</name>
<name>
<surname>Jansma</surname>
<given-names>M. J.</given-names>
</name>
<name>
<surname>Hoye</surname>
<given-names>T. R.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>A Guide to Small-Molecule Structure Assignment through Computation of (1H and 13C) NMR Chemical Shifts</article-title>. <source>Nat. Protoc.</source> <volume>9</volume>, <fpage>643</fpage>&#x2013;<lpage>660</lpage>. <pub-id pub-id-type="doi">10.1038/nprot.2014.042</pub-id> </citation>
</ref>
<ref id="B51">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>W&#xfc;thrich</surname>
<given-names>K.</given-names>
</name>
</person-group> (<year>2003</year>). <article-title>NMR Studies of Structure and Function of Biological Macromolecules (Nobel Lecture)</article-title>. <source>Angew. Chem. Int. Ed.</source> <volume>42</volume>, <fpage>3340</fpage>&#x2013;<lpage>3363</lpage>. <pub-id pub-id-type="doi">10.1002/anie.200300595</pub-id> </citation>
</ref>
<ref id="B52">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yanai</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Tew</surname>
<given-names>D. P.</given-names>
</name>
<name>
<surname>Handy</surname>
<given-names>N. C.</given-names>
</name>
</person-group> (<year>2004</year>). <article-title>A New Hybrid Exchange-Correlation Functional Using the Coulomb-Attenuating Method (CAM-B3lyp)</article-title>. <source>Chem. Phys. Lett.</source> <volume>393</volume>, <fpage>51</fpage>&#x2013;<lpage>57</lpage>. <pub-id pub-id-type="doi">10.1016/j.cplett.2004.06.011</pub-id> </citation>
</ref>
</ref-list>
</back>
</article>