<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article article-type="research-article" dtd-version="2.3" xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Chem.</journal-id>
<journal-title>Frontiers in Chemistry</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Chem.</abbrev-journal-title>
<issn pub-type="epub">2296-2646</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="publisher-id">737579</article-id>
<article-id pub-id-type="doi">10.3389/fchem.2021.737579</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Chemistry</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Thermodynamics-Based Model Construction for the Accurate Prediction of Molecular Properties From Partition Coefficients</article-title>
<alt-title alt-title-type="left-running-head">Chen et&#x20;al.</alt-title>
<alt-title alt-title-type="right-running-head">LFER for property prediction</alt-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name>
<surname>Chen</surname>
<given-names>Deliang</given-names>
</name>
<xref ref-type="corresp" rid="c001">&#x2a;</xref>
<uri xlink:href="https://loop.frontiersin.org/people/1399116/overview"/>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Huang</surname>
<given-names>Xiaoqing</given-names>
</name>
</contrib>
<contrib contrib-type="author">
<name>
<surname>Fan</surname>
<given-names>Yulan</given-names>
</name>
</contrib>
</contrib-group>
<aff>Jiangxi Key Laboratory of Organo-Pharmaceutical Chemistry, Chemistry and Chemical Engineering College, Gannan Normal University, <addr-line>Ganzhou</addr-line>, <country>China</country>
</aff>
<author-notes>
<fn fn-type="edited-by">
<p>
<bold>Edited by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/240838/overview">Sudip Pan</ext-link>, University of Marburg, Germany</p>
</fn>
<fn fn-type="edited-by">
<p>
<bold>Reviewed by:</bold> <ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/947121/overview">Santanab Giri</ext-link>, Haldia Institute of Technology, India</p>
<p>
<ext-link ext-link-type="uri" xlink:href="https://loop.frontiersin.org/people/1347568/overview">Gourhari Jana</ext-link>, Indian Institute of Technology Bombay, India</p>
</fn>
<corresp id="c001">&#x2a;Correspondence: Deliang Chen, <email>deliang2211@hotmail.com</email>
</corresp>
<fn fn-type="other">
<p>This article was submitted to Theoretical and Computational Chemistry, a section of the journal Frontiers in Chemistry</p>
</fn>
</author-notes>
<pub-date pub-type="epub">
<day>13</day>
<month>09</month>
<year>2021</year>
</pub-date>
<pub-date pub-type="collection">
<year>2021</year>
</pub-date>
<volume>9</volume>
<elocation-id>737579</elocation-id>
<history>
<date date-type="received">
<day>07</day>
<month>07</month>
<year>2021</year>
</date>
<date date-type="accepted">
<day>20</day>
<month>08</month>
<year>2021</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#xa9; 2021 Chen, Huang and Fan.</copyright-statement>
<copyright-year>2021</copyright-year>
<copyright-holder>Chen, Huang and Fan</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/">
<p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these&#x20;terms.</p>
</license>
</permissions>
<abstract>
<p>Developing models for predicting molecular properties of organic compounds is imperative for drug development and environmental safety; however, development of such models that have high predictive power and are independent of the compounds used is challenging. To overcome the challenges, we used a thermodynamics-based theoretical derivation to construct models for accurately predicting molecular properties. The free energy change that determines a property equals the sum of the free energy changes (&#x394;G<sub>F</sub>s) caused by the factors affecting the property. By developing or selecting molecular descriptors that are directly proportional to &#x394;G<sub>F</sub>s, we built a general linear free energy relationship (LFER) for predicting the property with the molecular descriptors as predictive variables. The LFER can be used to construct models for predicting various specific properties from partition coefficients. Validations show that the models constructed according to the LFER have high predictive power and their performance is independent of the compounds used, including the models for the properties having little correlation with partition coefficients. The findings in this study are highly useful for applications in drug development and environmental safety.</p>
</abstract>
<kwd-group>
<kwd>computational chemistry</kwd>
<kwd>linear free energy relationships</kwd>
<kwd>molecular properties</kwd>
<kwd>partition coefficient</kwd>
<kwd>noncovalent interactions</kwd>
<kwd>quantitative structure-property relationships</kwd>
</kwd-group>
<contract-num rid="cn001">21763002 21473041</contract-num>
<contract-num rid="cn002">20202ACBL203008</contract-num>
<contract-sponsor id="cn001">Foundation for Innovative Research Groups of the National Natural Science Foundation of China<named-content content-type="fundref-id">10.13039/501100012659</named-content>
</contract-sponsor>
<contract-sponsor id="cn002">Natural Science Foundation of Jiangxi Province<named-content content-type="fundref-id">10.13039/501100004479</named-content>
</contract-sponsor>
</article-meta>
</front>
<body>
<sec id="s1">
<title>Introduction</title>
<p>The rapid development of new organic compounds in various chemical-related laboratories and industries has increased the difficulty in measuring the physicochemical, and absorption, distribution, metabolism, excretion, and toxicity (ADME/Tox) properties of all possible compounds. Therefore, the development of techniques for predicting these properties via computational tools is imperative (<xref ref-type="bibr" rid="B31">Sarkar et&#x20;al., 2012</xref>; <xref ref-type="bibr" rid="B23">Li et&#x20;al., 2019</xref>; <xref ref-type="bibr" rid="B34">Sun et&#x20;al., 2019</xref>; <xref ref-type="bibr" rid="B33">Suay-Garcia et&#x20;al., 2020</xref>). Quantitative structure&#x2013;property relationships (QSPRs) with multiple predictive variables are widely used for predicting various properties of organic compounds. QSPR employs regression statistics using algorithms, such as artificial neural networks, (<xref ref-type="bibr" rid="B13">Deeb et&#x20;al., 2011</xref>; <xref ref-type="bibr" rid="B32">Song et&#x20;al., 2017</xref>), machine learning (<xref ref-type="bibr" rid="B6">Bushdid et&#x20;al., 2018</xref>; <xref ref-type="bibr" rid="B12">Cheng and Ng, 2019</xref>; <xref ref-type="bibr" rid="B41">Zheng et&#x20;al., 2019</xref>), and partial least square (<xref ref-type="bibr" rid="B13">Deeb et&#x20;al., 2011</xref>; <xref ref-type="bibr" rid="B35">T. Stanton, 2012</xref>), with predictive variables usually selected from a few thousand molecular descriptors based on mathematical and statistical tools (<xref ref-type="bibr" rid="B26">Mansouri et&#x20;al., 2018</xref>; <xref ref-type="bibr" rid="B21">Lee et&#x20;al., 2019</xref>; <xref ref-type="bibr" rid="B15">Fioressi et&#x20;al., 2020</xref>). A large number of articles related to QSPR were published per year and QSPR has gained importance in a wide range of fields, such as drug design, pesticide design, and environmental toxicology (<xref ref-type="bibr" rid="B29">Roy et&#x20;al., 2018</xref>; <xref ref-type="bibr" rid="B37">Yang et&#x20;al., 2018</xref>; <xref ref-type="bibr" rid="B42">Zhu et&#x20;al., 2018</xref>; <xref ref-type="bibr" rid="B17">He et&#x20;al., 2019</xref>; <xref ref-type="bibr" rid="B20">Khan et&#x20;al., 2019</xref>; <xref ref-type="bibr" rid="B43">Zhu et&#x20;al., 2020a</xref>). For example, predicting the ADME/Tox of drug candidates before synthesis can significantly reduce the cost and time of drug development and increase the success rate (<xref ref-type="bibr" rid="B11">Cheng et&#x20;al., 2013</xref>; <xref ref-type="bibr" rid="B14">Dickson et&#x20;al., 2017</xref>; <xref ref-type="bibr" rid="B42">Zhu et&#x20;al., 2018</xref>). Predicting soil/water partition coefficients and the toxicities of organic compounds is vital for environmental risk assessments (<xref ref-type="bibr" rid="B16">Freitas et&#x20;al., 2014</xref>; <xref ref-type="bibr" rid="B30">Sabour et&#x20;al., 2017</xref>; <xref ref-type="bibr" rid="B20">Khan et&#x20;al., 2019</xref>). Some properties can be predicted accurately with hydrophobicity (logP<sub>oct</sub>, the logarithm of the partition coefficient between n-octanol and water) and/or other commonly used molecular descriptors, e.g., electrophilicity index (<italic>&#x3c9;</italic>) (<xref ref-type="bibr" rid="B28">Raevsky, 2004</xref>; <xref ref-type="bibr" rid="B27">Pal et&#x20;al., 2019</xref>; <xref ref-type="bibr" rid="B18">Jana et&#x20;al., 2020</xref>). For example, logP<sub>oct</sub> has been used to predict the water solubility with high accuracy, (<xref ref-type="bibr" rid="B28">Raevsky, 2004</xref>), Robust multiple linear regression (MLR) models for toxicity prediction can be constructed by using the combinations of electronic factor (&#x3c9;, &#x3c9;<sup>2</sup>, or &#x3c9;<sup>3</sup>) and hydrophobicity factor [logP<sub>oct</sub>, (logP<sub>oct</sub>)<sup>2</sup>] as predictors (<xref ref-type="bibr" rid="B27">Pal et&#x20;al., 2019</xref>; <xref ref-type="bibr" rid="B18">Jana et&#x20;al., 2020</xref>). The robustness of the models were ascertained by neural networks. However, for many properties, constructing QSPR models with high predictive accuracy and reliability remains a challenge. The performance of QSPR models greatly depends on the compounds used for investigation, quality of the data, and modelling methodology employed (<xref ref-type="bibr" rid="B32">Song et&#x20;al., 2017</xref>; <xref ref-type="bibr" rid="B26">Mansouri et&#x20;al., 2018</xref>; <xref ref-type="bibr" rid="B40">Zhang et&#x20;al., 2020</xref>). For a given property, the predictive variables would be different if the data in the training set are different. In addition, QSAR models usually work well only for the compounds within their applicability domains and do not have good predictive accuracy for other compounds (<xref ref-type="bibr" rid="B19">Kaneko, 2017</xref>; <xref ref-type="bibr" rid="B25">Liu and Wallqvist, 2019</xref>). However, it is difficult to define the accurate applicability domains for QSPR models because there is no general agreement for quantifying compound similarity (<xref ref-type="bibr" rid="B7">Carri&#xf3; et&#x20;al., 2016</xref>). It is thus important to develop a new methodology for constructing models that have high predictive power and the performance of the models is independent of the compounds&#x20;used.</p>
<p>The quantitative formula and quantitative relationships that are developed via theoretical derivation in physical chemistry are absolutely correct and are independent of the compounds used. For example, the partition coefficient between water and an organic solvent (logP<sub>ow</sub>) for a solute is directly proportional to the standard free energy change for transferring the solute from water to the organic solvent (&#x394;G<sub>tr</sub>). The &#x394;G<sub>tr</sub> in turn depends on the standard enthalpy change (&#x394;H<sub>tr</sub>) and entropy change (&#x394;S<sub>tr</sub>) of the phase-transferring process. Thus, at a given temperature, the model logP<sub>ow</sub> &#x3d; b<sub>1</sub>&#x394;H<sub>tr</sub> &#x2b; b<sub>2</sub>&#x394;S<sub>tr</sub> &#x2b; c (b<sub>1</sub>, b<sub>2</sub>, and c are constants) is absolutely correct and has high predictive power for predicting logP<sub>ow</sub>. This example indicates that the models developed via thermodynamics-based theoretical derivations may overcome the shortages of the models developed by using mathematical and statistical tools. A large number of physicochemical properties, ADME/Tox qualities, and many other properties of organic compounds depend on the changes in free energy caused by the intermolecular noncovalent interactions of the compounds with their environments. The enormous catalytic power of many enzymes depends on the noncovalent interactions between substrates and enzymes (<xref ref-type="bibr" rid="B36">Warshel et&#x20;al., 2006</xref>; <xref ref-type="bibr" rid="B8">Chen et&#x20;al., 2019</xref>). It is thus expected that models with high predictive power for many properties can be developed by considering the free energy changes related to the properties. In this study, we used a thermodynamics-based theoretical derivation to develop a general linear free energy relationship (LFER) for predicting various properties of organic compounds. The LFER can be used to construct models for many specific properties. Validation shows that the models for specific properties have high predictive power and their performance is independent of the compounds&#x20;used.</p>
</sec>
<sec id="s2">
<title>Computational Methods</title>
<sec id="s2-1">
<title>Data set selection</title>
<p>In this study, all experimental data of logP<sub>oct</sub>, logP<sub>16</sub> (the logarithm of the partition coefficient between hexadecane and water), logP<sub>chl</sub> (the logarithm of the partition coefficient between chloroform and water), logP<sub>aln</sub> (the logarithm of the partition coefficient between aniline and water), logK<sub>brain</sub> (the logarithm of the partition coefficient from air to human brain) and logK<sub>p</sub> <bold>(</bold>logarithm of experimental human skin permeability) are collected from literatures (<xref ref-type="bibr" rid="B1">Abraham et&#x20;al., 1994</xref>; <xref ref-type="bibr" rid="B4">Abraham et&#x20;al., 1999</xref>; <xref ref-type="bibr" rid="B3">Abraham and Martins, 2004</xref>; <xref ref-type="bibr" rid="B2">Abraham et&#x20;al., 2006</xref>; <xref ref-type="bibr" rid="B5">Abraham et&#x20;al., 2015</xref>; <xref ref-type="bibr" rid="B39">Zhang et&#x20;al., 2017</xref>). Hydrogen bond acceptors (HBAs) include very weak H-bond acceptors. For example, the sp2 carbon atoms from carbon-carbon double bonds and aromatic rings are weak HBAs. Hydrogen bond donors (HBDs) include very weak H-bond donors. For example, the hydrogen atoms in CHCl<sub>3</sub> and CH<sub>3</sub>NO<sub>2</sub> are weak&#x20;HBDs.</p>
</sec>
<sec id="s2-2">
<title>Calculation of S<sub>m</sub>
</title>
<p>S<sub>m</sub> is a molecular descriptor developed in this study. The S<sub>m</sub> values of organic compounds were calculated based on the formula of the compounds. Assume the formula of a neutral organic compound is C<sub>c</sub>H<sub>h</sub>O<sub>o</sub>N<sub>n</sub>S<sub>s</sub>F<sub>f</sub>Cl<sub>cl</sub>Br<sub>br</sub>I<sub>i</sub>, the S<sub>m</sub> of this compound is<disp-formula id="e1">
<mml:math id="m1">
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="normal">S</mml:mi>
<mml:mi mathvariant="normal">m</mml:mi>
</mml:msub>
<mml:mi mathvariant="normal">&#x3d;c&#x2b;0</mml:mi>
<mml:mi mathvariant="normal">.3h&#x2b;o&#x2b;n&#x2b;2s&#x2b;0</mml:mi>
<mml:mi mathvariant="normal">.6&#x2a;f&#x2b;1</mml:mi>
<mml:mi mathvariant="normal">.8cl&#x2b;2</mml:mi>
<mml:mi mathvariant="normal">.2br&#x2b;2</mml:mi>
<mml:mi mathvariant="normal">.6i-0</mml:mi>
<mml:mi mathvariant="normal">.2</mml:mi>
<mml:msub>
<mml:mi mathvariant="normal">N</mml:mi>
<mml:mrow>
<mml:mi mathvariant="normal">c3</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="normal">-0</mml:mi>
<mml:mi mathvariant="normal">.6</mml:mi>
<mml:msub>
<mml:mi mathvariant="normal">N</mml:mi>
<mml:mrow>
<mml:mi mathvariant="normal">c4</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mtext>.</mml:mtext>
</mml:mrow>
</mml:math>
<label>(1)</label>
</disp-formula>
</p>
<p>In <xref ref-type="disp-formula" rid="e1">Eq. 1</xref>, c, h, o, n, s, f, cl, br and i are the numbers of carbon, hydrogen, oxygen, nitrogen, sulfur, fluoride, chloride, bromide and iodide atoms in the solute, N<sub>c3</sub> is the number of sp3 carbons connecting three heavy atoms (fluoride is not included), N<sub>c4</sub> is the number of sp3 carbons connecting four heavy atoms (fluoride is not included).</p>
</sec>
<sec id="s2-3">
<title>Calculation of H<sub>M_HBD</sub>
</title>
<p>H<sub>M_HBD</sub> values of solutes were calculated based on the approach reported in a previous study (<xref ref-type="bibr" rid="B10">Chen et&#x20;al., 2020</xref>).</p>
</sec>
<sec id="s2-4">
<title>Calculation of Flexibility</title>
<p>In this study, the flexibility of a solute is calculated by summarizing the flexibilities of the bonds of the solute. If a bond is not rotatable or if the rotation of a bond does not change the conformation of the solute, the flexibility of the bond is set to zero (note: hydrogen atoms are not included for determining conformations). The flexibility of the C&#x2014;C bond in R<sup>1</sup>CH<sub>2</sub>&#x2014;CH<sub>2</sub>R<sup>2</sup> is set to one. If the energy barrier for rotating a bond is obviously higher than that for rotating the R<sup>1</sup>CH2&#x2014;CH<sub>2</sub>R (<xref ref-type="bibr" rid="B34">Sun et&#x20;al., 2019</xref>) bond, the flex value is set to zero. For example, the C&#x2014;N bond in RCO&#x2014;NH and the C&#x2014;C bond in Ar&#x2014;CO are set to zero. If the energy barrier for rotating a bond is obviously lower than that for rotating the R<sup>1</sup>CH<sub>2</sub>&#x2014;CH<sub>2</sub>R<sup>2</sup> bond, the flex value is set to 1.5. For example, the energy barrier for rotating the R<sup>1</sup>O&#x2014;CH<sub>2</sub>R<sup>2</sup> bond is lower than that for R<sup>1</sup>CH<sub>2</sub>&#x2014;CH<sub>2</sub>R (<xref ref-type="bibr" rid="B34">Sun et&#x20;al., 2019</xref>) and thus the Flex value of the C&#x2014;O bond is set to 1.5. Also, the flexibility of C&#x2014;C in R<sup>1</sup>CH<sub>2</sub>&#x2014;C<sub>6</sub>H<sub>5</sub> is set to 0.5 because of the symmetry of phenyl&#x20;ring.</p>
</sec>
<sec id="s2-5">
<title>Calculation of the effects of HBAs on the logP<sub>oct</sub>/logP<sub>chl</sub>
</title>
<p>The free energy changes for transferring depolarized solutes from water to hexadecane (&#x394;G<sub>tr_depol</sub>) were calculated based on the method reported in previous study (<xref ref-type="bibr" rid="B10">Chen et&#x20;al., 2020</xref>). Based on the logP<sub>oct</sub> (or logP<sub>chl</sub>) and &#x394;G<sub>tr_depol</sub> values of nonpolar compounds, the model for the regression of logP<sub>oct</sub> (or logP<sub>chl</sub>) against &#x394;G<sub>tr_depol</sub> was developed. This model was then used to calculate the logP<sub>oct</sub> (or logP<sub>chl</sub>) values for depolarized solutes<italic>.</italic> For a solute containing HBAs but no HBDs, the difference between the calculated logP<sub>oct</sub> (or logP<sub>chl</sub>) for the depolarized solute and the experimental logP<sub>oct</sub> (or logP<sub>chl</sub>) of the solute is the effect of HBAs on the logP<sub>oct</sub> (or logP<sub>chl</sub>) of the solute.</p>
</sec>
<sec id="s2-6">
<title>Model development</title>
<p>All the models and the statistical reliabilities of the models were obtained by performing the multiple linear regressions implemented in Excel.</p>
</sec>
</sec>
<sec sec-type="results|discussion" id="s3">
<title>Results and Discussion</title>
<sec id="s3-1">
<title>Thermodynamics-Based Theoretical Derivation for Generating a Linear Free Energy Relationship</title>
<p>In the theoretical derivation, we used &#x201c;Y&#x201d; to represent a property and the symbol &#x201c;&#x394;G<sub>Y</sub>&#x201d; to represent the free energy change that determines Y. The &#x394;G<sub>Y</sub> values for many properties are not easy to be calculated directly. Thus, we decomposed &#x394;G<sub>Y</sub> into the free energy changes that are caused by the factors affecting Y. The free energy change caused by a factor is denoted by &#x201c;&#x394;G<sub>F</sub>&#x201d;. Thus, &#x394;G<sub>Y</sub> equals the summarization of the &#x394;G<sub>F</sub>s for all the factors affecting Y.<disp-formula id="e2">
<mml:math id="m2">
<mml:mrow>
<mml:mi mathvariant="normal">&#x394;</mml:mi>
<mml:msub>
<mml:mi mathvariant="normal">G</mml:mi>
<mml:mi mathvariant="normal">Y</mml:mi>
</mml:msub>
<mml:mi mathvariant="normal">&#x3d;</mml:mi>
<mml:mstyle displaystyle="true">
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:mi mathvariant="normal">&#x394;</mml:mi>
<mml:msub>
<mml:mi mathvariant="normal">G</mml:mi>
<mml:mi mathvariant="normal">F</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mstyle>
</mml:mrow>
</mml:math>
<label>(2)</label>
</disp-formula>
</p>
<p>For the properties depending on the noncovalent interactions of solutes, they are affected by the molecular sizes, hydrogen-bond acceptors (HBAs), and hydrogen-bond donors (HBDs) of the solutes, which was demonstrated in a previous study (<xref ref-type="bibr" rid="B10">Chen et&#x20;al., 2020</xref>). Many properties are also affected by the flexibilities of solutes. For example, the partition coefficients of organic compounds between a flexible environment (e.g., blood) and a much less flexible environment (e.g., muscle) are obviously affected by the flexibilities of the compounds. It is challenging to accurately quantify the &#x394;G<sub>F</sub>s for various properties. However, it is possible to develop molecular descriptors that are directly proportional to &#x394;G<sub>F</sub>s. We used D<sub>F</sub> to represent the molecular descriptor that is directly proportional to &#x394;G<sub>F</sub>. Then, &#x394;G<sub>Y</sub> can be expressed as:<disp-formula id="e3">
<mml:math id="m3">
<mml:mrow>
<mml:mi mathvariant="normal">&#x394;</mml:mi>
<mml:msub>
<mml:mi mathvariant="normal">G</mml:mi>
<mml:mi mathvariant="normal">Y</mml:mi>
</mml:msub>
<mml:mi mathvariant="normal">&#x3d;</mml:mi>
<mml:mstyle displaystyle="true">
<mml:mo>&#x2211;</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mi mathvariant="normal">k</mml:mi>
<mml:mi mathvariant="normal">F</mml:mi>
</mml:msub>
<mml:msub>
<mml:mi mathvariant="normal">D</mml:mi>
<mml:mi mathvariant="normal">F</mml:mi>
</mml:msub>
</mml:mrow>
</mml:mstyle>
</mml:mrow>
</mml:math>
<label>(3)</label>
</disp-formula>
</p>
<p>The k<sub>F</sub> values are constant for a given property. Theoretically, if the molecular descriptors apply to various properties, <xref ref-type="disp-formula" rid="e3">Eq. 3</xref> can be used to construct models with high predictive power for the properties. Many properties are mainly affected by the molecular sizes, HBAs, HBDs and flexibilities of solutes. Thus, in this study, we developed or selected molecular descriptors for quantifying the effects of molecular size, HBAs, HBDs and flexibility on the properties.</p>
<p>The molecular descriptor we developed for quantifying the effects of molecular size on properties is denoted by &#x201c;S<sub>m</sub>&#x201d;. The S<sub>m</sub> values of organic compounds represent the relative molecular sizes of the compounds and can be easily calculated from their molecular formulas (see Computational Methods). For example, the S<sub>m</sub> for catechol (formula: C<sub>6</sub>H<sub>6</sub>O<sub>2</sub>) is 9.8 (num. for C &#x2b; 0.3<inline-formula id="inf1">
<mml:math id="m4">
<mml:mrow>
<mml:mo>&#xd7;</mml:mo>
<mml:mo>&#xa0;</mml:mo>
</mml:mrow>
</mml:math>
</inline-formula>num. for H &#x2b; num. for O). To illustrate whether S<sub>m</sub> is an ideal molecular descriptor for molecular size, we first explored the linear associations between logP<sub>16</sub> and S<sub>m</sub> and between logP<sub>oct</sub> and S<sub>m</sub> for a series of alkane compounds (<xref ref-type="fig" rid="F1">Figure&#x20;1A</xref>). The logP<sub>16</sub> and logP<sub>oct</sub> values for alkane compounds are affected merely by the sizes of the compounds. The robust linear associations in <xref ref-type="fig" rid="F1">Figure&#x20;1A</xref> support that S<sub>m</sub> is directly proportional to the effects of molecular size on logP<sub>16</sub> and logP<sub>oct</sub>. We next explore whether S<sub>m</sub> is also an ideal molecular descriptor of molecular size for the properties that have little correlation with logP<sub>16</sub> or logP<sub>oct</sub>. As reported in a previous study, logK<sub>brain</sub> has little correlation with logP<sub>16</sub> and logP<sub>oct</sub> (<xref ref-type="bibr" rid="B10">Chen et&#x20;al., 2020</xref>). We thus explored the linear association between logK<sub>brain</sub> and S<sub>m</sub> for nonpolar solutes (<xref ref-type="fig" rid="F1">Figure&#x20;1B</xref>). The R<sup>2</sup> and SD values indicate that there is a strong linear association between logK<sub>brain</sub> and S<sub>m</sub>. In <xref ref-type="fig" rid="F1">Figure&#x20;1C</xref>, we plotted the free energy changes for transferring the depolarized compounds from water to hexadecane (&#x394;G<sub>tr_depol</sub>) against the S<sub>m</sub> values for the compounds from <xref ref-type="sec" rid="s9">Supplementary Table S1</xref> of a previous study (<xref ref-type="bibr" rid="B10">Chen et&#x20;al., 2020</xref>). The high statistical reliability for the regression of &#x394;G<sub>tr_depol</sub> against S<sub>m</sub> further supports that S<sub>m</sub> is an ideal molecular descriptor for quantifying the effect of molecular size on the properties depending on noncovalent interactions. Thus, S<sub>m</sub> is an ideal molecular descriptor for molecular size and applies to various properties.</p>
<fig id="F1" position="float">
<label>FIGURE 1</label>
<caption>
<p>Correlations between the molecular descriptor S<sub>m</sub> and the effects of molecular size on various properties. <bold>(A)</bold> Linear associations between logP<sub>16</sub>/logP<sub>oct</sub> and S<sub>m</sub> for alkanes. <bold>(B)</bold> Linear association between logK<sub>brain</sub> (log of the partition coefficient from air to human brain) and S<sub>m</sub> for nonpolar compounds. <bold>(C)</bold> Plot of the water to hexadecane phase-transferring free energy for depolarized solutes (&#x394;G<sub>tr_depol</sub>) against the S<sub>m</sub> values of the solutes.</p>
</caption>
<graphic xlink:href="fchem-09-737579-g001.tif"/>
</fig>
<p>In previous studies, we defined the water to hexadecane phase transferring free energy contributed by the electrostatic interactions of the HBAs of a solute as the overall H-bonding capability of the HBAs of the solute (<xref ref-type="bibr" rid="B8">Chen et&#x20;al., 2019</xref>; <xref ref-type="bibr" rid="B10">Chen et&#x20;al., 2020</xref>; <xref ref-type="bibr" rid="B9">Chen et&#x20;al., 2016</xref>) and this overall H-bonding capability is donated by &#x201c;H<sub>M_HBA</sub>.&#x201d; The definition indicates that H<sub>M_HBA</sub> is an ideal molecular descriptor for quantifying the effects of HBAs on logP<sub>16</sub>. We next explored whether H<sub>M_HBA</sub> is an ideal molecular descriptor for logP<sub>oct</sub> and logP<sub>chl</sub>. The strong linear associations between the effect of HBAs on logP<sub>oct</sub> and H<sub>M_HBA</sub> (<xref ref-type="fig" rid="F2">Figure&#x20;2A</xref>) and between the effect of HBAs on logP<sub>chl</sub> and H<sub>M_HBA</sub> (<xref ref-type="fig" rid="F2">Figure&#x20;2B</xref>) suggest that H<sub>M_HBA</sub> is an ideal molecular descriptor for quantifying the effects of HBAs on various properties. Similarly, we defined the water to hexadecane phase transferring free energy contributed by the electrostatic interactions of the HBDs of a solute as the overall H-bonding capability of the HBDs of the solute (H<sub>M_HBD</sub>) (<xref ref-type="bibr" rid="B9">Chen et&#x20;al., 2016</xref>; <xref ref-type="bibr" rid="B8">Chen et&#x20;al., 2019</xref>; <xref ref-type="bibr" rid="B10">Chen et&#x20;al., 2020</xref>). In a previous study, we revealed that the contribution of a protein-ligand H-bond to the protein-ligand binding free energy is directly proportional to the H-bonding capability of the HBA and the H-bonding capability of the HBD (<xref ref-type="bibr" rid="B9">Chen et&#x20;al., 2016</xref>). We also found that the effect of an enzyme-substrate H-bond interaction on the free energy barrier of the enzymatic reaction is directly proportional to the H-bonding capability of the atom from the enzyme (<xref ref-type="bibr" rid="B8">Chen et&#x20;al., 2019</xref>). Thus, we believe that the effects of HBAs and HBDs of solutes on the properties related to noncovalent interactions are directly proportional to the H<sub>M_HBA</sub> and H<sub>M_HBD</sub> values of the solutes. H<sub>M_HBA</sub> and H<sub>M_HBD</sub> are ideal molecular descriptors for quantifying the effect of HBAs and HBDs on the properties related to noncovalent interactions.</p>
<fig id="F2" position="float">
<label>FIGURE 2</label>
<caption>
<p>Strong linear associations between the effects of HBAs on properties and HM_HBA. H<sub>M_HBA</sub>: overall H-bonding capabilities of the HBAs of a solute. <bold>(A)</bold> For the property logP<sub>oct</sub> (log of the partition coefficient between n-octanol and water). <bold>(B)</bold> For the property logP<sub>chl</sub> (log of the partition coefficient between chloroform and water).</p>
</caption>
<graphic xlink:href="fchem-09-737579-g002.tif"/>
</fig>
<p>The molecular descriptor for quantifying the effect of molecular flexibility on properties is denoted by &#x201c;Flex.&#x201d; The effects of molecular flexibility on properties mainly result from rotatable bonds of the solutes because the rotatable bonds of the solutes can rotate more freely in some environments than in other environments. The flexibilities of solutes are calculated from the rotatable bonds of the solutes, especially the rotatable bonds that change the conformations of solutes (see Computational Methods). Thus, for many properties that are affected by molecular size, HBAs, HBDs and flexibility, they can be quantified with the following equation<disp-formula id="e4">
<mml:math id="m5">
<mml:mrow>
<mml:mi mathvariant="normal">Y&#x3d;</mml:mi>
<mml:msub>
<mml:mi mathvariant="normal">k</mml:mi>
<mml:mi mathvariant="normal">1</mml:mi>
</mml:msub>
<mml:msub>
<mml:mi mathvariant="normal">S</mml:mi>
<mml:mi mathvariant="normal">m</mml:mi>
</mml:msub>
<mml:mi mathvariant="normal">&#x2b;</mml:mi>
<mml:msub>
<mml:mi mathvariant="normal">k</mml:mi>
<mml:mi mathvariant="normal">2</mml:mi>
</mml:msub>
<mml:mi mathvariant="normal">&#xa0;</mml:mi>
<mml:msub>
<mml:mi mathvariant="normal">H</mml:mi>
<mml:mrow>
<mml:mi mathvariant="normal">M_HBA</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="normal">&#x2b;</mml:mi>
<mml:msub>
<mml:mi mathvariant="normal">k</mml:mi>
<mml:mi mathvariant="normal">3</mml:mi>
</mml:msub>
<mml:msub>
<mml:mi mathvariant="normal">H</mml:mi>
<mml:mrow>
<mml:mi mathvariant="normal">M_HBD</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="normal">&#x2b;</mml:mi>
<mml:msub>
<mml:mi mathvariant="normal">k</mml:mi>
<mml:mi mathvariant="normal">4</mml:mi>
</mml:msub>
<mml:mi mathvariant="normal">Flex&#x2b;c</mml:mi>
</mml:mrow>
</mml:math>
<label>(4)</label>
</disp-formula>where k<sub>1</sub>, k<sub>2</sub>, k<sub>3</sub>, k<sub>4</sub> and c are constants for a give property. Organic compounds usually contain multiple HBAs and the HBAs affect each other. The accurate calculation of H<sub>M_HBA</sub> for many organic compounds is not easy. <xref ref-type="disp-formula" rid="e4">Eq. 4</xref> would become simpler and easier to use if H<sub>M_HBA</sub> is replaced by logP<sub>ow</sub> because logP<sub>ow</sub> is a well-known molecular descriptor for predicting properties (<xref ref-type="bibr" rid="B24">Liu et&#x20;al., 2019</xref>; <xref ref-type="bibr" rid="B44">Zhu et&#x20;al., 2020b</xref>) and can be obtained accurately via experimental and/or computational approaches. Based on the fact that logP<sub>ow</sub> is a property and <xref ref-type="disp-formula" rid="e4">Eq. 4</xref> also applies to logP<sub>ow</sub>, we can convert <xref ref-type="disp-formula" rid="e4">Eqs. 4</xref> to <xref ref-type="disp-formula" rid="e5">5</xref> (see <xref ref-type="sec" rid="s9">Supplementary Text S1</xref> for the detail of the process of the conversion).<disp-formula id="e5">
<mml:math id="m6">
<mml:mrow>
<mml:mi mathvariant="normal">Y&#x3d;</mml:mi>
<mml:msub>
<mml:mi mathvariant="normal">b</mml:mi>
<mml:mi mathvariant="normal">1</mml:mi>
</mml:msub>
<mml:mi mathvariant="normal">log</mml:mi>
<mml:msub>
<mml:mi mathvariant="normal">P</mml:mi>
<mml:mrow>
<mml:mi mathvariant="normal">ow</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="normal">&#x2b;</mml:mi>
<mml:msub>
<mml:mi mathvariant="normal">b</mml:mi>
<mml:mi mathvariant="normal">2</mml:mi>
</mml:msub>
<mml:msub>
<mml:mi mathvariant="normal">S</mml:mi>
<mml:mi mathvariant="normal">m</mml:mi>
</mml:msub>
<mml:mi mathvariant="normal">&#x2b;</mml:mi>
<mml:msub>
<mml:mi mathvariant="normal">b</mml:mi>
<mml:mi mathvariant="normal">3</mml:mi>
</mml:msub>
<mml:msub>
<mml:mi mathvariant="normal">H</mml:mi>
<mml:mrow>
<mml:mi mathvariant="normal">M_HBD</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="normal">&#x2b;</mml:mi>
<mml:msub>
<mml:mi mathvariant="normal">b</mml:mi>
<mml:mi mathvariant="normal">4</mml:mi>
</mml:msub>
<mml:mi mathvariant="normal">Flex&#x2b;c</mml:mi>
</mml:mrow>
</mml:math>
<label>(5)</label>
</disp-formula>where b<sub>1</sub>, b<sub>2</sub>, b<sub>3</sub>, and b<sub>4</sub> are constants, logP<sub>ow</sub> is logP<sub>16</sub> or logP<sub>oct</sub>. <xref ref-type="disp-formula" rid="e5">Eq. 5</xref> is identical to <xref ref-type="disp-formula" rid="e4">Eq. 4</xref>. Both equations are correct for the properties that are determined by the noncovalent interactions of solutes with flexible environments. All the factors related to effects of noncovalent interactions on phase-transferring free energies, including electrostatic interaction, desovation, van der Waals interactions, entropy change, etc. are considered in <xref ref-type="disp-formula" rid="e5">Eq. 5</xref>. <xref ref-type="disp-formula" rid="e5">Eq. 5</xref> is the general LFER we developed for predicting the properties that depends on the noncovalent interactions of solutes with flexible environments. Although S<sub>m</sub> and H<sub>M_HBD</sub> may be strongly correlated with logP<sub>ow</sub> for some properties, none of the molecular descriptors can be omitted because <xref ref-type="disp-formula" rid="e4">Eq. 4</xref> is a general LFER for various different properties.</p>
</sec>
<sec id="s3-2">
<title>Validation of the General LFER: Model Construction for Specific Properties</title>
<sec id="s3-2-1">
<title>Prediction of Various Organic Solvent/Water Partition Coefficients</title>
<p>To prove that this general LFER can be used to construct models with high predictive power for various specific properties, we first demonstrated that it can be used to predict an organic solvent/water partition coefficient from another organic solvent/water partition coefficient with high accuracy. Eighty-nine compounds with experimental logP<sub>oct</sub>, logP<sub>16</sub>, and logP<sub>chl</sub> values (<xref ref-type="bibr" rid="B1">Abraham et&#x20;al., 1994</xref>; <xref ref-type="bibr" rid="B4">Abraham et&#x20;al., 1999</xref>) (<xref ref-type="sec" rid="s9">Supplementary Table S1</xref>) were used for this investigation. Among the compounds, 45 compounds contain HBAs but no HBDs and 41 compounds containing HBDs. The equations and statistical results of the simple regressions of logP<sub>16</sub> against logP<sub>oct</sub> and logP<sub>chl</sub> against logP<sub>oct</sub> for various types of compounds are shown in <xref ref-type="sec" rid="s9">Supplementary Text S2</xref>. The R<sup>2</sup> (squared correlation coefficient) values of the regressions range from 0.501 to 0.972 (gray columns, <xref ref-type="fig" rid="F3">Figures 3A,B</xref>) and the SD (standard deviation) values of the regressions range from 0.241 to 0.965, indicating that the strength of the linear associations between two partition coefficients largely depends on compounds used for investigation. Then the same data for constructing the simple regressions were used to construct models according to the general LFER and the results are also shown in <xref ref-type="sec" rid="s9">Supplementary Text S2</xref> (note: the model descriptor Flex is not used because Flex has little effect on logP<sub>ow</sub>). The R<sup>2</sup> values of the models range from 0.947 to 0.992 (black columns, <xref ref-type="fig" rid="F3">Figures 3A,B</xref>) and the SD values range from 0.183 to 0.248. The results indicate that the models constructed according to the LFER have a high statistical reliability and the performance of the models is independent of the compounds for investigation.</p>
<fig id="F3" position="float">
<label>FIGURE 3</label>
<caption>
<p>Prediction of organic solvent/water partition coefficients for validating the general LFER. <bold>(A, B)</bold> R<sup>2</sup> values of the simple regressions (gray columns) of logP<sub>16</sub>(A)/logP<sub>chl</sub>(B) against logP<sub>oct</sub> and of the corresponding models constructed according to the LFER (black columns). HBA: compounds containing HBAs but no HBDs; HBD: compounds containing HBDs; apolar: nonpolar compounds; <bold>(C)</bold> Plot of observed logP<sub>16</sub> against the logP<sub>16</sub> calculated from the model constructed according to the LFER; <bold>(D)</bold> Plot of observed logP<sub>16</sub> against the logP<sub>16</sub> calculated from the model with logP<sub>oct</sub> as predictive valuable.</p>
</caption>
<graphic xlink:href="fchem-09-737579-g003.tif"/>
</fig>
<p>To demonstrate whether the models have high predictive power, we compared the experimental logP<sub>16</sub> values of 200 organic compounds [from <xref ref-type="sec" rid="s9">Supplementary Table S1</xref> of a previous study (<xref ref-type="bibr" rid="B10">Chen et&#x20;al., 2020</xref>)] and the logP<sub>16</sub> values calculated from the model constructed according to the LFER by using an external validation approach (<xref ref-type="fig" rid="F3">Figure&#x20;3C</xref> and <xref ref-type="sec" rid="s9">Supplementary Text S3</xref>). The result shows that the model has high predictive power. For comparison, the predictive power of the corresponding simple regression was also investigated (<xref ref-type="fig" rid="F3">Figure&#x20;3D</xref>), which is much worse than that for the model constructed according to the LFER. Thus, the LFER is powerful for constructing models with high predictive&#x20;power.</p>
</sec>
<sec id="s3-2-2">
<title>Prediction of the Human Skin Permeability</title>
<p>We next used the LFER to construct a model for predicting the human skin permeability of neutral organic molecules. <xref ref-type="sec" rid="s9">Supplementary Table S2</xref> shows the logK<sub>p</sub> (<xref ref-type="bibr" rid="B3">Abraham and Martins, 2004</xref>; <xref ref-type="bibr" rid="B39">Zhang et&#x20;al., 2017</xref>) values of 51 organic compounds. Thirty-two of the compounds were used as training set to develop the model with logP<sub>oct</sub> as a predictive valuable and the other 19 compounds as a test set to validate the model. The model constructed according to the LFER is shown below.<disp-formula id="e6">
<mml:math id="m7">
<mml:mtable columnalign="left">
<mml:mtr>
<mml:mtd>
<mml:mi mathvariant="normal">log</mml:mi>
<mml:msub>
<mml:mi mathvariant="normal">K</mml:mi>
<mml:mi mathvariant="normal">p</mml:mi>
</mml:msub>
<mml:mi mathvariant="normal">&#x3d;0</mml:mi>
<mml:mi mathvariant="normal">.6157log</mml:mi>
<mml:msub>
<mml:mi mathvariant="normal">P</mml:mi>
<mml:mrow>
<mml:mi mathvariant="normal">oct</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="normal">&#x2b;0</mml:mi>
<mml:mi mathvariant="normal">.0156</mml:mi>
<mml:msub>
<mml:mi mathvariant="normal">S</mml:mi>
<mml:mi mathvariant="normal">m</mml:mi>
</mml:msub>
<mml:mi mathvariant="normal">-0</mml:mi>
<mml:mi mathvariant="normal">.0626</mml:mi>
<mml:msub>
<mml:mi mathvariant="normal">H</mml:mi>
<mml:mrow>
<mml:mi mathvariant="normal">M_HBD</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="normal">-0</mml:mi>
<mml:mi mathvariant="normal">.0988Flex-5</mml:mi>
<mml:mi mathvariant="normal">.646;</mml:mi>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mi mathvariant="normal">N&#x3d;32,</mml:mi>
<mml:msup>
<mml:mi mathvariant="normal">R</mml:mi>
<mml:mi mathvariant="normal">2</mml:mi>
</mml:msup>
<mml:mi mathvariant="normal">&#x3d;0</mml:mi>
<mml:mi mathvariant="normal">.953,</mml:mi>
<mml:msup>
<mml:mi mathvariant="normal">Q</mml:mi>
<mml:mi mathvariant="normal">2</mml:mi>
</mml:msup>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mi mathvariant="normal">ext</mml:mi>
</mml:mrow>
<mml:mo>)</mml:mo>
</mml:mrow>
<mml:mi mathvariant="normal">&#x3d;0</mml:mi>
<mml:mi mathvariant="normal">.966,SD&#x3d;0</mml:mi>
<mml:mi mathvariant="normal">.178;F&#x3d;136</mml:mi>
<mml:mi mathvariant="normal">.7</mml:mi>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:math>
<label>(6)</label>
</disp-formula>
</p>
<p>This model is characterized by high statistical reliability according to the R<sup>2</sup> and SD values. It is used to calculate the logK<sub>p</sub> values of the 19 solutes in the test set. The plot of the experimental logK<sub>p</sub> values versus the calculated logK<sub>p</sub> values is characterized by statistically robust linearity (<xref ref-type="fig" rid="F4">Figure&#x20;4A</xref>). The accurate prediction of logK<sub>p</sub> can provide a rapid and accurate prediction of human skin permeability of organic compounds, which is very useful for evaluating environmental risks due to contact with&#x20;skin.</p>
<fig id="F4" position="float">
<label>FIGURE 4</label>
<caption>
<p>Predictive power of models constructed according to the LFER. <bold>(A)</bold> External validation. Plots of observed logK<sub>p</sub> (log of human skin permeability) against the logK<sub>p</sub> calculated from the model constructed according to the LFER (logP<sub>oct</sub> is used). <bold>(B)</bold> LOO cross-validation. Plots of observed logK<sub>brain</sub> against the logK<sub>brain</sub> calculated from the model constructed according to the LFER.</p>
</caption>
<graphic xlink:href="fchem-09-737579-g004.tif"/>
</fig>
</sec>
<sec id="s3-2-3">
<title>Prediction of Air to Human Brain Partition Coefficient</title>
<p>To further illustrate the reliability and accuracy of the LFER, we used the LFER to construct models for the properties that have little correlation with logP<sub>ow</sub>. The strength of the linear association between logK<sub>brain</sub> and logP<sub>oct</sub> (or logP<sub>16</sub>) is weak (R<sup>2</sup> &#x3c; 0.1) (<xref ref-type="bibr" rid="B10">Chen et&#x20;al., 2020</xref>). <xref ref-type="sec" rid="s9">Supplementary Table S3</xref> lists the compounds that were used to demonstrate the weak linear association between logK<sub>brain</sub> and logP<sub>16</sub> (or logP<sub>oct</sub>) in a previous study (<xref ref-type="bibr" rid="B10">Chen et&#x20;al., 2020</xref>). Based on the experimental logK<sub>brain</sub>, logP<sub>16</sub> and logP<sub>oct</sub> data of the compounds, we constructed two models according to the LFER.<disp-formula id="e7">
<mml:math id="m8">
<mml:mtable columnalign="left">
<mml:mtr>
<mml:mtd>
<mml:mi mathvariant="normal">log</mml:mi>
<mml:msub>
<mml:mi mathvariant="normal">K</mml:mi>
<mml:mrow>
<mml:mi mathvariant="normal">brain</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="normal">&#x3d;&#xa0;-0</mml:mi>
<mml:mi mathvariant="normal">.5129log</mml:mi>
<mml:msub>
<mml:mi mathvariant="normal">P</mml:mi>
<mml:mrow>
<mml:mi mathvariant="normal">16</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="normal">&#x2b;0</mml:mi>
<mml:mi mathvariant="normal">.5006</mml:mi>
<mml:msub>
<mml:mi mathvariant="normal">S</mml:mi>
<mml:mi mathvariant="normal">m</mml:mi>
</mml:msub>
<mml:mi mathvariant="normal">&#x2b;0</mml:mi>
<mml:mi mathvariant="normal">.1009</mml:mi>
<mml:msub>
<mml:mi mathvariant="normal">H</mml:mi>
<mml:mrow>
<mml:mi mathvariant="normal">M_HBD</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="normal">-0</mml:mi>
<mml:mi mathvariant="normal">.1893Flex-1</mml:mi>
<mml:mi mathvariant="normal">.64;</mml:mi>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mi mathvariant="normal">N&#x3d;34,</mml:mi>
<mml:msup>
<mml:mi mathvariant="normal">R</mml:mi>
<mml:mi mathvariant="normal">2</mml:mi>
</mml:msup>
<mml:mi mathvariant="normal">&#x3d;0</mml:mi>
<mml:mi mathvariant="normal">.964,</mml:mi>
<mml:msubsup>
<mml:mi mathvariant="normal">Q</mml:mi>
<mml:mrow>
<mml:mi mathvariant="normal">LOO</mml:mi>
</mml:mrow>
<mml:mi mathvariant="normal">2</mml:mi>
</mml:msubsup>
<mml:mi mathvariant="normal">&#x3d;0</mml:mi>
<mml:mi mathvariant="normal">.947;SD&#x3d;0</mml:mi>
<mml:mi mathvariant="normal">.265;F&#x3d;195</mml:mi>
<mml:mi mathvariant="normal">.7</mml:mi>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:math>
<label>(7)</label>
</disp-formula>
<disp-formula id="e8">
<mml:math id="m9">
<mml:mtable columnalign="left">
<mml:mtr>
<mml:mtd>
<mml:mi mathvariant="normal">log</mml:mi>
<mml:msub>
<mml:mi mathvariant="normal">K</mml:mi>
<mml:mrow>
<mml:mi mathvariant="normal">brain</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="normal">&#x3d;-&#xa0;0</mml:mi>
<mml:mi mathvariant="normal">.7755log</mml:mi>
<mml:msub>
<mml:mi mathvariant="normal">P</mml:mi>
<mml:mrow>
<mml:mi mathvariant="normal">oct</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="normal">&#x2b;0</mml:mi>
<mml:mi mathvariant="normal">.5473</mml:mi>
<mml:msub>
<mml:mi mathvariant="normal">S</mml:mi>
<mml:mi mathvariant="normal">m</mml:mi>
</mml:msub>
<mml:mi mathvariant="normal">&#x2b;0</mml:mi>
<mml:mi mathvariant="normal">.1790</mml:mi>
<mml:msub>
<mml:mi mathvariant="normal">H</mml:mi>
<mml:mrow>
<mml:mi mathvariant="normal">M_HBD</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="normal">-0</mml:mi>
<mml:mi mathvariant="normal">.0986Flex-1</mml:mi>
<mml:mi mathvariant="normal">.386</mml:mi>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mi mathvariant="normal">N&#x3d;31,</mml:mi>
<mml:msup>
<mml:mi mathvariant="normal">R</mml:mi>
<mml:mi mathvariant="normal">2</mml:mi>
</mml:msup>
<mml:mi mathvariant="normal">&#x3d;0</mml:mi>
<mml:mi mathvariant="normal">.931,</mml:mi>
<mml:msubsup>
<mml:mi mathvariant="normal">Q</mml:mi>
<mml:mrow>
<mml:mi mathvariant="normal">LOO</mml:mi>
</mml:mrow>
<mml:mi mathvariant="normal">2</mml:mi>
</mml:msubsup>
<mml:mi mathvariant="normal">&#x3d;0</mml:mi>
<mml:mi mathvariant="normal">.914;SD&#x3d;0</mml:mi>
<mml:mi mathvariant="normal">.368;F&#x3d;87</mml:mi>
<mml:mi mathvariant="normal">.4</mml:mi>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:math>
<label>(8)</label>
</disp-formula>
</p>
<p>Results indicate that both models have high predictive power and the model with logP<sub>16</sub> as predictive variable is better than the model with logP<sub>oct</sub> as predictive variable. The predicted logK<sub>brain</sub> obtained from the LOO cross-validation (logP<sub>16</sub> is used) and the observed logK<sub>brain</sub> show a robust linear association (<xref ref-type="fig" rid="F4">Figure&#x20;4B</xref>). Thus, the general LFER works well for the properties that have little correlation with partition coefficients, supporting the reliability and efficacy of the general LFER in the accurate prediction of the properties related to noncovalent interactions. We believe that the thermodynamics-based theoretical derivation is a powerful methodology for developing robust models and will be useful in many fields, including drug design, environmental safety and human health.</p>
</sec>
<sec id="s3-2-4">
<title>Model Simplification</title>
<p>In some cases, not all the molecular descriptors in the LFER are required for specific models with high predictive power. Some models still have high predictive power without using the molecular descriptor H<sub>M_HBD</sub>. For example, if the HBAs and HBDs of an organic solvent are obviously weaker than the HBAs and HBDs of water, the partition coefficient between water and the organic solvent can be predicted accurately from the model with logP<sub>16</sub> and S<sub>m</sub> as predictive variables. <xref ref-type="disp-formula" rid="e9">Eq. 9</xref> is the model for predicting the aniline/water partition coefficient (logP<sub>aln</sub>) with logP<sub>16</sub> and S<sub>m</sub> as predictive variables (see <xref ref-type="sec" rid="s9">Supplementary Table S4</xref> for the data). Its statistical reliability is high and is obviously better than that for the simple regression (<xref ref-type="disp-formula" rid="e10">Eq. 10</xref>).<disp-formula id="e9">
<mml:math id="m10">
<mml:mrow>
<mml:mi mathvariant="normal">log</mml:mi>
<mml:msub>
<mml:mi mathvariant="normal">P</mml:mi>
<mml:mrow>
<mml:mi mathvariant="normal">aln</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="normal">&#x3d;0</mml:mi>
<mml:mi mathvariant="normal">.4695log</mml:mi>
<mml:msub>
<mml:mi mathvariant="normal">P</mml:mi>
<mml:mrow>
<mml:mi mathvariant="normal">16</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="normal">&#x2b;0</mml:mi>
<mml:mi mathvariant="normal">.1506</mml:mi>
<mml:msub>
<mml:mi mathvariant="normal">S</mml:mi>
<mml:mi mathvariant="normal">m</mml:mi>
</mml:msub>
<mml:mi mathvariant="normal">&#x2b;0</mml:mi>
<mml:mi mathvariant="normal">.010;N&#x3d;54,</mml:mi>
<mml:msup>
<mml:mi mathvariant="normal">R</mml:mi>
<mml:mi mathvariant="normal">2</mml:mi>
</mml:msup>
<mml:mi mathvariant="normal">&#x3d;0</mml:mi>
<mml:mi mathvariant="normal">.975,SD&#x3d;0</mml:mi>
<mml:mi mathvariant="normal">.208,F&#x3d;1008</mml:mi>
<mml:mi mathvariant="normal">.</mml:mi>
</mml:mrow>
</mml:math>
<label>(9)</label>
</disp-formula>
<disp-formula id="e10">
<mml:math id="m11">
<mml:mrow>
<mml:mi mathvariant="normal">log</mml:mi>
<mml:msub>
<mml:mi mathvariant="normal">P</mml:mi>
<mml:mrow>
<mml:mi mathvariant="normal">aln</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="normal">&#x3d;0</mml:mi>
<mml:mi mathvariant="normal">.6416log</mml:mi>
<mml:msub>
<mml:mi mathvariant="normal">P</mml:mi>
<mml:mrow>
<mml:mi mathvariant="normal">16</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="normal">&#x2b;0</mml:mi>
<mml:mi mathvariant="normal">.726;N&#x3d;54;</mml:mi>
<mml:msup>
<mml:mi mathvariant="normal">R</mml:mi>
<mml:mi mathvariant="normal">2</mml:mi>
</mml:msup>
<mml:mi mathvariant="normal">&#x3d;0</mml:mi>
<mml:mi mathvariant="normal">.910;SD&#x3d;0</mml:mi>
<mml:mi mathvariant="normal">.394,F&#x3d;524</mml:mi>
<mml:mi mathvariant="normal">.9</mml:mi>
<mml:mi mathvariant="normal">.</mml:mi>
</mml:mrow>
</mml:math>
<label>(10)</label>
</disp-formula>
</p>
<p>Without using H<sub>M_HBD</sub>, the model for predicting logK<sub>brain</sub> from logP<sub>16</sub>, S<sub>m</sub>, and Flex still has high predictive power.<disp-formula id="e11">
<mml:math id="m12">
<mml:mtable columnalign="left">
<mml:mtr>
<mml:mtd>
<mml:mi mathvariant="normal">log</mml:mi>
<mml:msub>
<mml:mi mathvariant="normal">K</mml:mi>
<mml:mrow>
<mml:mi mathvariant="normal">brain</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="normal">&#x3d;-0</mml:mi>
<mml:mi mathvariant="normal">.6194log</mml:mi>
<mml:msub>
<mml:mi mathvariant="normal">P</mml:mi>
<mml:mrow>
<mml:mi mathvariant="normal">16</mml:mi>
</mml:mrow>
</mml:msub>
<mml:mi mathvariant="normal">&#x2b;0</mml:mi>
<mml:mi mathvariant="normal">.5446</mml:mi>
<mml:msub>
<mml:mi mathvariant="normal">S</mml:mi>
<mml:mi mathvariant="normal">m</mml:mi>
</mml:msub>
<mml:mi mathvariant="normal">-0</mml:mi>
<mml:mi mathvariant="normal">.1928Flex-1</mml:mi>
<mml:mi mathvariant="normal">.637;</mml:mi>
</mml:mtd>
</mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mi mathvariant="normal">N&#xa0;&#x3d;&#xa0;34,</mml:mi>
<mml:msup>
<mml:mi mathvariant="normal">R</mml:mi>
<mml:mi mathvariant="normal">2</mml:mi>
</mml:msup>
<mml:mi mathvariant="normal">&#x3d;0</mml:mi>
<mml:mi mathvariant="normal">.954,</mml:mi>
<mml:msubsup>
<mml:mi mathvariant="normal">Q</mml:mi>
<mml:mrow>
<mml:mi mathvariant="normal">LOO</mml:mi>
</mml:mrow>
<mml:mi mathvariant="normal">2</mml:mi>
</mml:msubsup>
<mml:mi mathvariant="normal">&#x3d;0</mml:mi>
<mml:mi mathvariant="normal">.938;SD&#x3d;0</mml:mi>
<mml:mi mathvariant="normal">.295;F&#x3d;207</mml:mi>
<mml:mi mathvariant="normal">.7</mml:mi>
</mml:mtd>
</mml:mtr>
</mml:mtable>
</mml:math>
<label>(11)</label>
</disp-formula>
</p>
<p>Because the calculation of S<sub>m</sub> and Flex is easy, the accurate prediction of some properties from logP<sub>16</sub> or logP<sub>oct</sub> is easy for the researchers across various fields. For example, logK<sub>brain</sub> can be accurately predicted from logP<sub>16</sub>, without the need for additional experimental data or complicated calculations. Without using the LFER, the accurate prediction of logK<sub>brain</sub> from logP<sub>16</sub> or another organic solvent/water partition coefficient is difficult because there is little correlation between logK<sub>brain</sub> and organic solvent/water partition coefficients. For the models containing H<sub>M_HBD</sub>, the H<sub>M_HBD</sub> values of solutes are calculated with computer software. All the molecular descriptors in the LFER are easy to be understood and used by the researchers in various research fields. However, when constructing QSPR models by using mathematical and statistical tools, the predictive variables are usually selected from a few thousand molecular descriptors. The meanings of many predictive variables, e.g., the 3D-MoRSE descriptors, (<xref ref-type="bibr" rid="B38">Zapadka et&#x20;al., 2019</xref>), are not easy to be understood or used by many researchers in various research fields.</p>
</sec>
</sec>
<sec id="s3-3">
<title>Performance of Models With all Molecular Descriptors Calculated From Solute Structures</title>
<p>Because logP<sub>oct</sub> and logP<sub>16</sub> can be calculated accurately from the structures of solutes (<xref ref-type="bibr" rid="B10">Chen et&#x20;al., 2020</xref>), it is expected that this method still performs well when all of the molecular descriptors in the LFER are calculated from solute structures. For example, the R<sup>2</sup>, Q<sup>2</sup>
<sub>ext</sub> and SD of the model for predicting human skin permeability, in which all predictive variables are calculated from solute structures, are 0.940, 0.957, and 0.202 (see <xref ref-type="sec" rid="s9">Supplementary Text S4</xref>). Thus, the general LFER developed in this study has obvious advantages in predicting many properties related to noncovalent interactions.</p>
</sec>
<sec id="s3-4">
<title>Importance of Thermodynamics-Based Theoretical Derivation</title>
<p>Above examples indicate that the models constructed according to the LFER for many specific properties have high predictive power. Moreover, the performance of the models is independent of the compounds for investigation, suggesting that the models can provide guidance for improving properties of organic compounds and designing compounds with optimal properties. The merits of the LFER result from the theoretical derivation, which ensures that the quantitative relationships in the models constructed according to the LFER are correct in the aspect of thermodynamics. For the QSPR models developed using mathematical and statistical tools, the predictive variables are selected from a few thousand molecular descriptors based on the data of the compounds in training sets. The relationships between the properties and molecular descriptors in the QSPR models are statistical relationships for the compounds in training sets. The QSPR models usually work well only for the compounds in the training set and similar compounds, but may do not work well for other compounds. Thus, for the properties determined by the noncovalent interactions of solutes with flexible environments, the models developed according to the proposed LFER performs better than the QSPR models developed by using mathematical and statistical tools, including robust artificial neural networks. Developing models according to the proposed LFER is faster and computationally cheap than developing traditional QSPR models because the process of the variable selection is not required. Moreover, the proposed LFER is quite simple and can be easily used by the researchers across various fields, while expert knowledge is required for developing robust artificial neural networks, such as the knowledge in choosing the most appropriate approach. Thus, the method developed in study has obvious advantages over the traditional QSPR construction method. Thermodynamics-based theoretical derivation can be used to solve many problems that are hard to be solved by using mathematical and statistical tools. In addition, results in this study demonstrate that there are quantitative relationships between the properties related to thermodynamics, suggesting that many properties can be accurately predicted from other properties.</p>
</sec>
<sec id="s3-5">
<title>Future Works</title>
<p>The theoretical derivation in this study is based on the assumption that solutes have similar interactions with their environments, which requires that the environments for the&#x20;properties are flexible or the properties have little relationship with the conformation or orientations of solutes. Thus, the present LFER may not work well in predicting the binding affinities of ligands because the binding sites of proteins are not flexible. If the environment for a property is rigid (e.g., the binding sites of proteins), the model for predicting the property should consider H-bond interactions individually, rather than the overall H-bond interactions. In our further study, we will explore how to develop models for the properties related to rigid environments, which can be used to develop scoring functions for predicting protein-ligand binding affinities and develop QSAR models for screening databases of ligands. Furthermore, in this study, we demonstrated to effectiveness of the LFER for predicting the properties of neutral organic compounds. If a dataset contains ionizable compounds, it will be necessary to include molecular descriptors for the ionized forms. Although several approaches currently exist for considering the effects of ionization on various molecular properties (<xref ref-type="bibr" rid="B22">Li et&#x20;al., 2006</xref>; <xref ref-type="bibr" rid="B39">Zhang et&#x20;al., 2017</xref>), our future work with involve attempts to adapt the proposed LFER for use in these situations.</p>
</sec>
</sec>
<sec sec-type="conclusion" id="s4">
<title>Conclusion</title>
<p>In this study, we used a thermodynamics-based theoretical derivation to develop a general LFER for accurately predicting various properties from partition coefficients. The theoretical derivation ensures that many specific properties can be accurately quantified with the molecular descriptors in the LFER. It overcomes the shortages of constructing QSPR models by using mathematical and statistical tools. It is expected that the thermodynamics-based theoretical derivation can be used to solve many difficult problems, including the accurate prediction of protein-ligand binding affinities.</p>
</sec>
</body>
<back>
<sec id="s5">
<title>Data Availability Statement</title>
<p>The original contributions presented in the study are included in the article/<xref ref-type="sec" rid="s9">Supplementary Material</xref>, further inquiries can be directed to the corresponding author.</p>
</sec>
<sec id="s6">
<title>Author Contributions</title>
<p>DC and XH conceived and designed the study. YF and XH built their models, validated the models. DC and YF wrote the first draft of the manuscript. All authors read, revised and approved the submitted version.</p>
</sec>
<sec id="s7">
<title>Funding</title>
<p>This work is supported by grants from the National Natural Science Foundation of China (21763002, 21473041) and the Natural Science Foundation of Jiangxi Province (20202ACBL203008).</p>
</sec>
<sec sec-type="COI-statement" id="s8">
<title>Conflict of Interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest. At the time of publication a provisional patent was filed on the findings.</p>
</sec>
<sec id="s9" sec-type="disclaimer">
<title>Publisher&#x2019;s Note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
<sec id="s10">
<title>Supplementary Material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fchem.2021.737579/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fchem.2021.737579/full&#x23;supplementary-material</ext-link>
</p>
<supplementary-material xlink:href="DataSheet1.docx" id="SM1" mimetype="application/docx" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Abraham</surname>
<given-names>M. H.</given-names>
</name>
<name>
<surname>Chadha</surname>
<given-names>H. S.</given-names>
</name>
<name>
<surname>Whiting</surname>
<given-names>G. S.</given-names>
</name>
<name>
<surname>Mitchell</surname>
<given-names>R. C.</given-names>
</name>
</person-group> (<year>1994</year>). <article-title>Hydrogen Bonding. 32. An Analysis of Water-Octanol and Water-Alkane Partitioning and the &#x394;log P Parameter of Seiler</article-title>. <source>J.&#x20;Pharm. Sci.</source> <volume>83</volume>, <fpage>1085</fpage>&#x2013;<lpage>1100</lpage>. <pub-id pub-id-type="doi">10.1002/jps.2600830806</pub-id> </citation>
</ref>
<ref id="B2">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Abraham</surname>
<given-names>M. H.</given-names>
</name>
<name>
<surname>Ibrahim</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Acree</surname>
<given-names>W. E.</given-names>
<suffix>Jr.</suffix>
</name>
</person-group> (<year>2006</year>). <article-title>Air to Brain, Blood to Brain and Plasma to Brain Distribution of Volatile Organic Compounds: Linear Free Energy Analyses</article-title>. <source>Eur. J.&#x20;Med. Chem.</source> <volume>41</volume>, <fpage>494</fpage>&#x2013;<lpage>502</lpage>. <pub-id pub-id-type="doi">10.1016/j.ejmech.2006.01.004</pub-id> </citation>
</ref>
<ref id="B3">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Abraham</surname>
<given-names>M. H.</given-names>
</name>
<name>
<surname>Martins</surname>
<given-names>F.</given-names>
</name>
</person-group> (<year>2004</year>). <article-title>Human Skin Permeation and Partition: General Linear Free&#x2010;Energy Relationship Analyses</article-title>. <source>J.&#x20;Pharm. Sci.</source> <volume>93</volume>, <fpage>1508</fpage>&#x2013;<lpage>1523</lpage>. <pub-id pub-id-type="doi">10.1002/jps.20070</pub-id> </citation>
</ref>
<ref id="B4">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Abraham</surname>
<given-names>M. H.</given-names>
</name>
<name>
<surname>Platts</surname>
<given-names>J.&#x20;A.</given-names>
</name>
<name>
<surname>Hersey</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Leo</surname>
<given-names>A. J.</given-names>
</name>
<name>
<surname>Taft</surname>
<given-names>R. W.</given-names>
</name>
</person-group> (<year>1999</year>). <article-title>Correlation and Estimation of Gas-Chloroform and Water-Chloroform Partition Coefficients by a Linear Free Energy Relationship Method</article-title>. <source>J.&#x20;Pharm. Sci.</source> <volume>88</volume>, <fpage>670</fpage>&#x2013;<lpage>679</lpage>. <pub-id pub-id-type="doi">10.1021/js990008a</pub-id> </citation>
</ref>
<ref id="B5">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Abraham</surname>
<given-names>M. H.</given-names>
</name>
<name>
<surname>Zad</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Acree</surname>
<given-names>W. E.</given-names>
</name>
</person-group> (<year>2015</year>). <article-title>The Transfer of Neutral Molecules from Water and from the Gas Phase to Solvents Acetophenone and Aniline</article-title>. <source>J.&#x20;Mol. Liquids</source> <volume>212</volume>, <fpage>301</fpage>&#x2013;<lpage>306</lpage>. <pub-id pub-id-type="doi">10.1016/j.molliq.2015.09.033</pub-id> </citation>
</ref>
<ref id="B6">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Bushdid</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>de March</surname>
<given-names>C. A.</given-names>
</name>
<name>
<surname>Fiorucci</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Matsunami</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Golebiowski</surname>
<given-names>J.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Agonists of G-Protein-Coupled Odorant Receptors Are Predicted from Chemical Features</article-title>. <source>J.&#x20;Phys. Chem. Lett.</source> <volume>9</volume>, <fpage>2235</fpage>&#x2013;<lpage>2240</lpage>. <pub-id pub-id-type="doi">10.1021/acs.jpclett.8b00633</pub-id> </citation>
</ref>
<ref id="B7">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Carri&#xf3;</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Sanz</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Pastor</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Toward a Unifying Strategy for the Structure-Based Prediction of Toxicological Endpoints</article-title>. <source>Arch. Toxicol.</source> <volume>90</volume>, <fpage>2445</fpage>&#x2013;<lpage>2460</lpage>. <pub-id pub-id-type="doi">10.1007/s00204-015-1618-2</pub-id> </citation>
</ref>
<ref id="B8">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chen</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>X.</given-names>
</name>
<name>
<surname>Guo</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Savidge</surname>
<given-names>T.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>Quantitative Effects of Substrate-Environment Interactions on the Free Energy Barriers of Reactions</article-title>. <source>J.&#x20;Phys. Chem. C</source> <volume>123</volume>, <fpage>13586</fpage>&#x2013;<lpage>13592</lpage>. <pub-id pub-id-type="doi">10.1021/acs.jpcc.9b01094</pub-id> </citation>
</ref>
<ref id="B9">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chen</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Oezguen</surname>
<given-names>N.</given-names>
</name>
<name>
<surname>Urvil</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Ferguson</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Dann</surname>
<given-names>S. M.</given-names>
</name>
<name>
<surname>Savidge</surname>
<given-names>T. C.</given-names>
</name>
</person-group> (<year>2016</year>). <article-title>Regulation of Protein-Ligand Binding Affinity by Hydrogen Bond Pairing</article-title>. <source>Sci. Adv.</source> <volume>2</volume>, <fpage>e1501240</fpage>. <pub-id pub-id-type="doi">10.1126/sciadv.1501240</pub-id> </citation>
</ref>
<ref id="B10">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Chen</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>Q.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Zhou</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Fan</surname>
<given-names>Y.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>A General Linear Free Energy Relationship for Predicting Partition Coefficients of Neutral Organic Compounds</article-title>. <source>Chemosphere</source> <volume>247</volume>, <fpage>125869</fpage>. <pub-id pub-id-type="doi">10.1016/j.chemosphere.2020.125869</pub-id> </citation>
</ref>
<ref id="B11">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cheng</surname>
<given-names>F.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Tang</surname>
<given-names>Y.</given-names>
</name>
</person-group> (<year>2013</year>). <article-title>In Silico ADMET Prediction: Recent Advances, Current Challenges and Future Trends</article-title>. <source>Ctmc</source> <volume>13</volume>, <fpage>1273</fpage>&#x2013;<lpage>1289</lpage>. <pub-id pub-id-type="doi">10.2174/15680266113139990033</pub-id>In </citation>
</ref>
<ref id="B12">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Cheng</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Ng</surname>
<given-names>C. A.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Using Machine Learning to Classify Bioactivity for 3486 Per- and Polyfluoroalkyl Substances (PFASs) from the OECD List</article-title>. <source>Environ. Sci. Technol.</source> <volume>53</volume>, <fpage>13970</fpage>&#x2013;<lpage>13980</lpage>. <pub-id pub-id-type="doi">10.1021/acs.est.9b04833</pub-id> </citation>
</ref>
<ref id="B13">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Deeb</surname>
<given-names>O.</given-names>
</name>
<name>
<surname>Khadikar</surname>
<given-names>P. V.</given-names>
</name>
<name>
<surname>Goodarzi</surname>
<given-names>M.</given-names>
</name>
</person-group> (<year>2011</year>). <article-title>Prediction of Gas/Particle Partitioning Coefficients of Semi Volatile Organic Compounds via QSPR Methods: PC-ANN and PLS Analysis</article-title>. <source>Jics</source> <volume>8</volume>, <fpage>176</fpage>&#x2013;<lpage>192</lpage>. <pub-id pub-id-type="doi">10.1007/bf03246214</pub-id> </citation>
</ref>
<ref id="B14">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Dickson</surname>
<given-names>C. J.</given-names>
</name>
<name>
<surname>Hornak</surname>
<given-names>V.</given-names>
</name>
<name>
<surname>Pearlstein</surname>
<given-names>R. A.</given-names>
</name>
<name>
<surname>Duca</surname>
<given-names>J.&#x20;S.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Structure-Kinetic Relationships of Passive Membrane Permeation from Multiscale Modeling</article-title>. <source>J.&#x20;Am. Chem. Soc.</source> <volume>139</volume>, <fpage>442</fpage>&#x2013;<lpage>452</lpage>. <pub-id pub-id-type="doi">10.1021/jacs.6b11215</pub-id> </citation>
</ref>
<ref id="B15">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Fioressi</surname>
<given-names>S. E.</given-names>
</name>
<name>
<surname>Bacelo</surname>
<given-names>D. E.</given-names>
</name>
<name>
<surname>Aranda</surname>
<given-names>J.&#x20;F.</given-names>
</name>
<name>
<surname>Duchowicz</surname>
<given-names>P. R.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Prediction of the Aqueous Solubility of Diverse Compounds by 2D-QSPR</article-title>. <source>J.&#x20;Mol. Liquids</source> <volume>302</volume>, <fpage>112572</fpage>. <pub-id pub-id-type="doi">10.1016/j.molliq.2020.112572</pub-id> </citation>
</ref>
<ref id="B16">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Freitas</surname>
<given-names>M. R.</given-names>
</name>
<name>
<surname>Freitas</surname>
<given-names>M. P.</given-names>
</name>
<name>
<surname>Macedo</surname>
<given-names>R. L. G.</given-names>
</name>
</person-group> (<year>2014</year>). <article-title>Aug-MIA-QSPR Modeling of the Soil Sorption of Carboxylic Acid Herbicides</article-title>. <source>Bull. Environ. Contam. Toxicol.</source> <volume>93</volume>, <fpage>489</fpage>&#x2013;<lpage>492</lpage>. <pub-id pub-id-type="doi">10.1007/s00128-014-1356-9</pub-id> </citation>
</ref>
<ref id="B17">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>He</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Xiao</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Zhou</surname>
<given-names>C.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Yang</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>Z.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>Insights into Pesticide Toxicity against Aquatic Organism: QSTR Models on Daphnia Magna</article-title>. <source>Ecotoxicology Environ. Saf.</source> <volume>173</volume>, <fpage>285</fpage>&#x2013;<lpage>292</lpage>. <pub-id pub-id-type="doi">10.1016/j.ecoenv.2019.02.014</pub-id> </citation>
</ref>
<ref id="B18">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Jana</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Pal</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Sural</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Chattaraj</surname>
<given-names>P. K.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Quantitative Structure-Toxicity Relationship: An "In Silico Study" Using Electrophilicity and Hydrophobicity as Descriptors</article-title>. <source>Int. J.&#x20;Quan. Chem.</source> <volume>120</volume>, <fpage>e26097</fpage>. <pub-id pub-id-type="doi">10.1002/qua.26097</pub-id> </citation>
</ref>
<ref id="B19">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Kaneko</surname>
<given-names>H.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>A New Measure of Regression Model Accuracy that Considers Applicability Domains</article-title>. <source>Chemometrics Intell. Lab. Syst.</source> <volume>171</volume>, <fpage>1</fpage>&#x2013;<lpage>8</lpage>. <pub-id pub-id-type="doi">10.1016/j.chemolab.2017.09.018</pub-id> </citation>
</ref>
<ref id="B20">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Khan</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Benfenati</surname>
<given-names>E.</given-names>
</name>
<name>
<surname>Roy</surname>
<given-names>K.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Consensus QSAR Modeling of Toxicity of Pharmaceuticals to Different Aquatic Organisms: Ranking and Prioritization of the DrugBank Database Compounds</article-title>. <source>Ecotoxicology Environ. Saf.</source> <volume>168</volume>, <fpage>287</fpage>&#x2013;<lpage>297</lpage>. <pub-id pub-id-type="doi">10.1016/j.ecoenv.2018.10.060</pub-id> </citation>
</ref>
<ref id="B21">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Lee</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Kumar</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Lee</surname>
<given-names>S.-Y.</given-names>
</name>
<name>
<surname>Park</surname>
<given-names>S. J.</given-names>
</name>
<name>
<surname>Kim</surname>
<given-names>M.-h.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Development of Predictive Models for Identifying Potential S100A9 Inhibitors Based on Machine Learning Methods</article-title>. <source>Front. Chem.</source> <volume>7</volume>, <fpage>779</fpage>. <pub-id pub-id-type="doi">10.3389/fchem.2019.00779</pub-id> </citation>
</ref>
<ref id="B22">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Sun</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Cui</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>He</surname>
<given-names>Z.</given-names>
</name>
</person-group> (<year>2006</year>). <article-title>Quantitative Structure-Retention Relationship Studies Using Immobilized Artificial Membrane Chromatography I: Amended Linear Solvation Energy Relationships with the Introduction of a Molecular Electronic Factor</article-title>. <source>J.&#x20;Chromatogr. A</source> <volume>1132</volume>, <fpage>174</fpage>&#x2013;<lpage>182</lpage>. <pub-id pub-id-type="doi">10.1016/j.chroma.2006.07.073</pub-id> </citation>
</ref>
<ref id="B23">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Li</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Arnot</surname>
<given-names>J.&#x20;A.</given-names>
</name>
<name>
<surname>Wania</surname>
<given-names>F.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>How Are Humans Exposed to Organic Chemicals Released to Indoor Air?</article-title>. <source>Environ. Sci. Technol.</source> <volume>53</volume>, <fpage>11276</fpage>&#x2013;<lpage>11284</lpage>. <pub-id pub-id-type="doi">10.1021/acs.est.9b02036</pub-id> </citation>
</ref>
<ref id="B24">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liu</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Fu</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Zhu</surname>
<given-names>D.</given-names>
</name>
<name>
<surname>Qu</surname>
<given-names>X.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Prediction of Apolar Compound Sorption to Aquatic Natural Organic Matter Accounting for Natural Organic Matter Hydrophobicity Using Aqueous Two-phase Systems</article-title>. <source>Environ. Sci. Technol.</source> <volume>53</volume>, <fpage>8127</fpage>&#x2013;<lpage>8135</lpage>. <pub-id pub-id-type="doi">10.1021/acs.est.9b00529</pub-id> </citation>
</ref>
<ref id="B25">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Liu</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Wallqvist</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Molecular Similarity-Based Domain Applicability Metric Efficiently Identifies Out-Of-Domain Compounds</article-title>. <source>J.&#x20;Chem. Inf. Model.</source> <volume>59</volume>, <fpage>181</fpage>&#x2013;<lpage>189</lpage>. <pub-id pub-id-type="doi">10.1021/acs.jcim.8b00597</pub-id> </citation>
</ref>
<ref id="B26">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Mansouri</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Grulke</surname>
<given-names>C. M.</given-names>
</name>
<name>
<surname>Judson</surname>
<given-names>R. S.</given-names>
</name>
<name>
<surname>Williams</surname>
<given-names>A. J.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>OPERA Models for Predicting Physicochemical Properties and Environmental Fate Endpoints</article-title>. <source>J.&#x20;Cheminform</source> <volume>10</volume>. <pub-id pub-id-type="doi">10.1186/s13321-018-0263-1</pub-id> </citation>
</ref>
<ref id="B27">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Pal</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Jana</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Sural</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Chattaraj</surname>
<given-names>P. K.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Hydrophobicity versus Electrophilicity: A New Protocol toward Quantitative Structure-Toxicity Relationship</article-title>. <source>Chem. Biol. Drug Des.</source> <volume>93</volume>, <fpage>1083</fpage>&#x2013;<lpage>1095</lpage>. <pub-id pub-id-type="doi">10.1111/cbdd.13428</pub-id> </citation>
</ref>
<ref id="B28">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Raevsky</surname>
<given-names>O.</given-names>
</name>
</person-group> (<year>2004</year>). <article-title>Physicochemical Descriptors in Property-Based Drug Design</article-title>. <source>Mrmc</source> <volume>4</volume>, <fpage>1041</fpage>&#x2013;<lpage>1052</lpage>. <pub-id pub-id-type="doi">10.2174/1389557043402964</pub-id> </citation>
</ref>
<ref id="B29">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Roy</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Ambure</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Kar</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Ojha</surname>
<given-names>P. K.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>Is it Possible to Improve the Quality of Predictions from an "intelligent" Use of Multiple QSAR/QSPR/QSTR Models?</article-title>. <source>J.&#x20;Chemometr.</source> <volume>32</volume>, <fpage>2992</fpage>. <pub-id pub-id-type="doi">10.1002/cem.2992</pub-id> </citation>
</ref>
<ref id="B30">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sabour</surname>
<given-names>M. R.</given-names>
</name>
<name>
<surname>Moftakhari Anasori Movahed</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Movahed</surname>
<given-names>A.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Application of Radial Basis Function Neural Network to Predict Soil Sorption Partition Coefficient Using Topological Descriptors</article-title>. <source>Chemosphere</source> <volume>168</volume>, <fpage>877</fpage>&#x2013;<lpage>884</lpage>. <pub-id pub-id-type="doi">10.1016/j.chemosphere.2016.10.122</pub-id> </citation>
</ref>
<ref id="B31">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sarkar</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Anderson</surname>
<given-names>K. C.</given-names>
</name>
<name>
<surname>Kellogg</surname>
<given-names>G. E.</given-names>
</name>
</person-group> (<year>2012</year>). <article-title>Computational Analysis of Structure-Based Interactions and Ligand Properties Can Predict Efflux Effects on Antibiotics</article-title>. <source>Eur. J.&#x20;Med. Chem.</source> <volume>52</volume>, <fpage>98</fpage>&#x2013;<lpage>110</lpage>. <pub-id pub-id-type="doi">10.1016/j.ejmech.2012.03.008</pub-id> </citation>
</ref>
<ref id="B32">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Song</surname>
<given-names>R.</given-names>
</name>
<name>
<surname>Keller</surname>
<given-names>A. A.</given-names>
</name>
<name>
<surname>Suh</surname>
<given-names>S.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>Rapid Life-Cycle Impact Screening Using Artificial Neural Networks</article-title>. <source>Environ. Sci. Technol.</source> <volume>51</volume>, <fpage>10777</fpage>&#x2013;<lpage>10785</lpage>. <pub-id pub-id-type="doi">10.1021/acs.est.7b02862</pub-id> </citation>
</ref>
<ref id="B33">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Suay-Garcia</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Ignacio Bueso&#x2010;Bordils</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Falc&#xf3;</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>P&#xe9;rez&#x2010;Gracia</surname>
<given-names>M. T.</given-names>
</name>
<name>
<surname>Ant&#xf3;n&#x2010;Fos</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Alem&#xe1;n&#x2010;L&#xf3;pez</surname>
<given-names>P.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Quantitative Structure-Activity Relationship Methods in the Discovery and Development of Antibacterials</article-title>. <source>Wires Comput. Mol. Sci.</source> <volume>10</volume>, <fpage>e1472</fpage>. <pub-id pub-id-type="doi">10.1002/wcms.v10.6</pub-id> </citation>
</ref>
<ref id="B34">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Sun</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Shi</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>Q.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>Development of Quantitative Structure-Activity Relationship Models to Predict Potential Nephrotoxic Ingredients in Traditional Chinese Medicines</article-title>. <source>Food Chem. Toxicol.</source> <volume>128</volume>, <fpage>163</fpage>&#x2013;<lpage>170</lpage>. <pub-id pub-id-type="doi">10.1016/j.fct.2019.03.056</pub-id> </citation>
</ref>
<ref id="B35">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>T. Stanton</surname>
<given-names>D.</given-names>
</name>
</person-group> (<year>2012</year>). <article-title>QSAR and QSPR Model Interpretation Using Partial Least Squares (PLS) Analysis</article-title>. <source>Cad</source> <volume>8</volume>, <fpage>107</fpage>&#x2013;<lpage>127</lpage>. <pub-id pub-id-type="doi">10.2174/157340912800492357</pub-id> </citation>
</ref>
<ref id="B36">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Warshel</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Sharma</surname>
<given-names>P. K.</given-names>
</name>
<name>
<surname>Kato</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Xiang</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Olsson</surname>
<given-names>M. H. M.</given-names>
</name>
</person-group> (<year>2006</year>). <article-title>Electrostatic Basis for Enzyme Catalysis</article-title>. <source>Chem. Rev.</source> <volume>106</volume>, <fpage>3210</fpage>&#x2013;<lpage>3235</lpage>. <pub-id pub-id-type="doi">10.1021/cr0503106</pub-id> </citation>
</ref>
<ref id="B37">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Yang</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Sun</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Li</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>G.</given-names>
</name>
<name>
<surname>Tang</surname>
<given-names>Y.</given-names>
</name>
</person-group> (<year>2018</year>). <article-title>In Silico Prediction of Chemical Toxicity for Drug Design Using Machine Learning Methods and Structural Alerts</article-title>. <source>Front. Chem.</source> <volume>6</volume>, <fpage>30</fpage>. <pub-id pub-id-type="doi">10.3389/fchem.2018.00030</pub-id>In </citation>
</ref>
<ref id="B38">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zapadka</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Kaczmarek</surname>
<given-names>M.</given-names>
</name>
<name>
<surname>Kupcewicz</surname>
<given-names>B.</given-names>
</name>
<name>
<surname>Dekowski</surname>
<given-names>P.</given-names>
</name>
<name>
<surname>Walkowiak</surname>
<given-names>A.</given-names>
</name>
<name>
<surname>Kokotkiewicz</surname>
<given-names>A.</given-names>
</name>
<etal/>
</person-group> (<year>2019</year>). <article-title>An Application of QSRR Approach and Multiple Linear Regression Method for Lipophilicity Assessment of Flavonoids</article-title>. <source>J.&#x20;Pharm. Biomed. Anal.</source> <volume>164</volume>, <fpage>681</fpage>&#x2013;<lpage>689</lpage>. <pub-id pub-id-type="doi">10.1016/j.jpba.2018.11.024</pub-id> </citation>
</ref>
<ref id="B39">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Abraham</surname>
<given-names>M. H.</given-names>
</name>
<name>
<surname>Liu</surname>
<given-names>X.</given-names>
</name>
</person-group> (<year>2017</year>). <article-title>An Equation for the Prediction of Human Skin Permeability of Neutral Molecules, Ions and Ionic Species</article-title>. <source>Int. J.&#x20;Pharmaceutics</source> <volume>521</volume>, <fpage>259</fpage>&#x2013;<lpage>266</lpage>. <pub-id pub-id-type="doi">10.1016/j.ijpharm.2017.02.059</pub-id> </citation>
</ref>
<ref id="B40">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhang</surname>
<given-names>K.</given-names>
</name>
<name>
<surname>Zhong</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>H.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Predicting Aqueous Adsorption of Organic Compounds onto Biochars, Carbon Nanotubes, Granular Activated Carbons, and Resins with Machine Learning</article-title>. <source>Environ. Sci. Technol.</source> <volume>54</volume>, <fpage>7008</fpage>&#x2013;<lpage>7018</lpage>. <pub-id pub-id-type="doi">10.1021/acs.est.0c02526</pub-id> </citation>
</ref>
<ref id="B41">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zheng</surname>
<given-names>S.</given-names>
</name>
<name>
<surname>Chang</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Xu</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Xu</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Lin</surname>
<given-names>F.</given-names>
</name>
</person-group> (<year>2019</year>). <article-title>E-Sweet: A Machine-Learning Based Platform for the Prediction of Sweetener and its Relative Sweetness</article-title>. <source>Front. Chem.</source> <volume>7</volume>, <fpage>35</fpage>. <pub-id pub-id-type="doi">10.3389/fchem.2019.00035</pub-id> </citation>
</ref>
<ref id="B42">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhu</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Zhao</surname>
<given-names>J.</given-names>
</name>
<name>
<surname>Zhang</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Zhou</surname>
<given-names>W.</given-names>
</name>
<name>
<surname>Yin</surname>
<given-names>L.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>Y.</given-names>
</name>
<etal/>
</person-group> (<year>2018</year>). <article-title>ADME Properties Evaluation in Drug Discovery: In Silico Prediction of Blood-Brain Partitioning</article-title>. <source>Mol. Divers.</source> <volume>22</volume>, <fpage>979</fpage>&#x2013;<lpage>990</lpage>. <pub-id pub-id-type="doi">10.1007/s11030-018-9866-8</pub-id> </citation>
</ref>
<ref id="B43">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhu</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Jiang</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Cheng</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Singh</surname>
<given-names>R. P.</given-names>
</name>
<name>
<surname>Yan</surname>
<given-names>B.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>Development of Pp-LFER and QSPR Models for Predicting the Diffusion Coefficients of Hydrophobic Organic Compounds in LDPE</article-title>. <source>Ecotoxicology Environ. Saf.</source> <volume>190</volume>, <fpage>110179</fpage>. <pub-id pub-id-type="doi">10.1016/j.ecoenv.2020.110179</pub-id> </citation>
</ref>
<ref id="B44">
<citation citation-type="journal">
<person-group person-group-type="author">
<name>
<surname>Zhu</surname>
<given-names>T.</given-names>
</name>
<name>
<surname>Yan</surname>
<given-names>H.</given-names>
</name>
<name>
<surname>Singh</surname>
<given-names>R. P.</given-names>
</name>
<name>
<surname>Wang</surname>
<given-names>Y.</given-names>
</name>
<name>
<surname>Cheng</surname>
<given-names>H.</given-names>
</name>
</person-group> (<year>2020</year>). <article-title>QSPR Study on the Polyacrylate-Water Partition Coefficients of Hydrophobic Organic Compounds</article-title>. <source>Environ. Sci. Pollut. Res.</source> <volume>27</volume>, <fpage>17550</fpage>&#x2013;<lpage>17560</lpage>. <pub-id pub-id-type="doi">10.1007/s11356-019-06389-z</pub-id> </citation>
</ref>
</ref-list>
</back>
</article>