<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Mater.</journal-id>
<journal-title>Frontiers in Materials</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Mater.</abbrev-journal-title>
<issn pub-type="epub">2296-8016</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fmats.2019.00087</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Materials</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Hierarchical Machine Learning Model for Mechanical Property Predictions of Polyurethane Elastomers From Small Datasets</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Menon</surname> <given-names>Aditya</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/701835/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Thompson-Col&#x000F3;n</surname> <given-names>James A.</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/682920/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Washburn</surname> <given-names>Newell R.</given-names></name>
<xref ref-type="aff" rid="aff3"><sup>3</sup></xref>
<xref ref-type="corresp" rid="c001"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/682499/overview"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Department of Materials Science and Engineering, Carnegie Mellon University</institution>, <addr-line>Pittsburgh, PA</addr-line>, <country>United States</country></aff>
<aff id="aff2"><sup>2</sup><institution>Covestro LLC</institution>, <addr-line>Pittsburgh, PA</addr-line>, <country>United States</country></aff>
<aff id="aff3"><sup>3</sup><institution>Department of Chemistry and Department of Biomedical Engineering, Carnegie Mellon University</institution>, <addr-line>Pittsburgh, PA</addr-line>, <country>United States</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Christian Johannes Cyron, Hamburg University of Technology, Germany</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: David Cereceda, Villanova University, United States; Youyong Li, Soochow University, China</p></fn>
<corresp id="c001">&#x0002A;Correspondence: Newell R. Washburn <email>washburn&#x00040;andrew.cmu.edu</email></corresp>
<fn fn-type="other" id="fn001"><p>This article was submitted to Computational Materials Science, a section of the journal Frontiers in Materials</p></fn></author-notes>
<pub-date pub-type="epub">
<day>08</day>
<month>05</month>
<year>2019</year>
</pub-date>
<pub-date pub-type="collection">
<year>2019</year>
</pub-date>
<volume>6</volume>
<elocation-id>87</elocation-id>
<history>
<date date-type="received">
<day>05</day>
<month>02</month>
<year>2019</year>
</date>
<date date-type="accepted">
<day>08</day>
<month>04</month>
<year>2019</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2019 Menon, Thompson-Col&#x000F3;n and Washburn.</copyright-statement>
<copyright-year>2019</copyright-year>
<copyright-holder>Menon, Thompson-Col&#x000F3;n and Washburn</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract><p>Polyurethanes are a broad class of material that finds application in coatings, foams, and solid elastomers. The urethane chemistry allows a diversity of monomers to be used, and prediction of mechanical properties, which are determined by complex interplay between monomer chemistry and chain architecture, is an unresolved challenge. Urethanes are based on aromatic or cyclic isocyanates and linear or branched polyols, and polymerization results in linear chains for bifunctional monomers or branched chains for multifunctional monomers. Strong intermolecular interactions between aromatic groups result in the formation of hard-segment domains that generate physical crosslinks between disorganized rubbery domains and anchor the material microstructure, contributing to resistance to deformation. Here, a general hierarchical machine learning (HML) model for predicting the stress-at-break, strain-at-break, and Tan &#x003B4; for thermoplastic and thermoset polyurethanes is presented. The algorithm was trained on a library of 18 polymers with different diisocyanates, bifunctional or trifunctional polyols, and NCO:OH index. HML reduces data requirements through robust embedding of domain knowledge and surrogate data in a middle layer that bridges input variables (composition) and output responses (mechanical properties). In this work, the middle layer included information on overall polymer composition, predictions of chain architecture derived from Monte Carlo simulations of polymerization, information on interchain interactions from empirically derived molecular potentials and shifts in infrared (IR) spectroscopy absorbances. The HML predictions are shown to be more accurate than those from a random forest model directly relating composition and properties, suggesting that embedding domain knowledge provides significant advantages in predicting the properties of complex material systems based on small datasets.</p></abstract>
<kwd-group>
<kwd>polyurethane</kwd>
<kwd>machine learning</kwd>
<kwd>structure-property relationship</kwd>
<kwd>property prediction</kwd>
<kwd>tunability</kwd>
</kwd-group>
<counts>
<fig-count count="9"/>
<table-count count="2"/>
<equation-count count="2"/>
<ref-count count="33"/>
<page-count count="12"/>
<word-count count="5693"/>
</counts>
</article-meta>
</front>
<body>
<sec sec-type="intro" id="s1">
<title>Introduction</title>
<p>Polyurethanes are ubiquitous materials found in coatings, foams, and solid elastomers (Oertel, <xref ref-type="bibr" rid="B19">1994</xref>; Engels et al., <xref ref-type="bibr" rid="B6">2013</xref>). Prototypical polyurethanes are formed through step-growth polymerization of an aromatic diisocyanate and an aliphatic diol, resulting in the formation of a material having aggregated aromatic hard segments bridged by rubbery segments. This microstructure is the basis for the remarkable mechanical properties of polyurethanes characterized by primarily elastic behavior and large values of ultimate elongation. However, modern polyurethanes are based on a highly diverse family of monomers that provide control over the number of reactive isocyanate or alcohol groups, which allow the preparation of linear of branched materials, varying monomer chemistries, which tune the interactions between hard segments and soft segments, and control over the NCO:OH index, which controls the degree of polymerization. Developing a model to predict the mechanical properties of these materials based on all these compositional variables is an unresolved challenge.</p>
<p>There are many analytical approaches to predicting properties of polymers based on their linear or crosslinked structure. Viscoelastic and rheological properties of linear polymers have been predicted by tube models (Milner and McLeish, <xref ref-type="bibr" rid="B18">1998</xref>; Pattamaprom et al., <xref ref-type="bibr" rid="B20">2000</xref>; van Ruymbeke et al., <xref ref-type="bibr" rid="B32">2002</xref>, <xref ref-type="bibr" rid="B31">2005</xref>) where reptation theory has been extended with contour length fluctuations and constraint release mechanisms, leading to successful predictions especially with low molecular weight polymers. For crosslinked polymers, mechanical properties have been predicted using group interaction modeling (GIM) (Foreman et al., <xref ref-type="bibr" rid="B8">2008</xref>) where a mean-field potential is calculated from cohesive energy and other molar constants derived using a group-additivity approach based on each component in the repeating unit. Another model for thermo-mechanical behavior prediction uses a molecular-modeling approach (Shenogina et al., <xref ref-type="bibr" rid="B27">2012</xref>) whereas Eom et al show the effect of native topology on mechanical strength of crosslinked polymer chains (Eom et al., <xref ref-type="bibr" rid="B7">2003</xref>).</p>
<p>With the advent of machine learning in many traditional scientific disciplines and the Materials Genome Initiative, there have also been many data-driven, ML-based approaches for prediction of polymer properties (de Pablo et al., <xref ref-type="bibr" rid="B4">2014</xref>; Agrawal and Choudhary, <xref ref-type="bibr" rid="B1">2016</xref>). Viscoelastic properties have been modeled using a multi scale computational framework on inverse Boltzmann method (Li et al., <xref ref-type="bibr" rid="B13">2012</xref>) and specific properties (mechanical, thermal, optical, and electrical) have been trained on microscopic, mesoscopic, and macroscopic structures from polymer databases available online using artificial neural networks (Roy et al., <xref ref-type="bibr" rid="B25">2006</xref>). Recently, Kim et al. (<xref ref-type="bibr" rid="B10">2018</xref>) have developed a polymer informatics platform which trains machine learning models of a dataset of high throughput DFT calculations and experimental data from the polymer literature.</p>
<p>Since most of the ML based approaches rely on large datasets, Hierarchical Machine Learning (HML) was developed on small experimental datasets to predict properties of complex material systems utilizing an intermediate layer between the desired responses and system variables (Menon et al., <xref ref-type="bibr" rid="B15">2017</xref>). These intermediate variables are based on latent physicochemical factors from domain knowledge pertaining to the material system. This methodology was validated on a system of dispersant dosed concentrated MgO suspension, which acted as a non-setting model of cement. Building upon previous work, HML was successfully utilized to designed a superplasticizer tailored specifically for metakaolin-portland blend cement blends (Menon et al., <xref ref-type="bibr" rid="B14">2018</xref>).</p>
<p>In this work, HML was applied to a system of linear and crosslinked polyurethanes modeling mechanical responses: stress-at-break, strain-at-break, and Tan &#x003B4; with system variables which are polymer structure, molecular weights and densities of the reactants (diisocyanates, polyols), chain length of polyol and isocyanate: alcohol (NCO:OH) index. Intermediate variables utilized for predicting mechanical responses were chosen to simplistically represent intermolecular, interchain and crosslinking behaviors in the system. The input dataset of 18 synthesized polymers was split into a training set of 14 and test set of 4 data points. A model was developed on the training set and validated against the test set. Finally, a comparison of the HML algorithm was performed with a random forest (RF) model (Breiman, <xref ref-type="bibr" rid="B2">2001</xref>; Pedregosa et al., <xref ref-type="bibr" rid="B21">2011</xref>) that directly predicted mechanical properties based on composition using the same training set.</p>
</sec>
<sec id="s2">
<title>Training Dataset</title>
<sec>
<title>Materials</title>
<p>The oligomers and monomers used to build the training set, poly(tetramethylene ether) glycol(PTMEG) (M<sub>n</sub> &#x0003D; 1,000), polycaprolactone triol (PCL) (M<sub>n</sub> &#x0003D; 900), toluene-2,4-diisocyanate (TDI), hexamethylene diisocyanate (HDI), isophorone diisocyanate(IPDI), were purchased from Sigma-Aldrich, as well as the catalyst used for the polymerization reaction, dibutyltin dilaurate (DBDTL). Dichloromethane was acquired from EMD Millipore. All of the materials were used as received.</p>
</sec>
<sec>
<title>Polymer Synthesis</title>
<p>The training set consisted of 18 samples, all of which were prepared by reacting a bifunctional diisocyanate with either a bifunctional or trifunctional polyol at NCO:OH indices of 1.0, 1.2, or 1.5, as shown in <xref ref-type="supplementary-material" rid="SM2">Supplementary Table 1</xref>. The reactions were carried out at room temperature in 8 ml of dichloromethane as a solvent under the presence of DBTDL as a catalyst. Films were cast from the synthesized polymers and were left to dry at room temperature for 24 h and then again dried in a vacuum oven for 24 h at 60&#x000B0;C to remove any residual solvent.</p>
</sec>
<sec>
<title>Measurements</title>
<p>Stress-at-break and strain-at-break were measured for all polymers in a universal testing machine (Instron). Gage width, parallel section width and thickness for each sample tested were 12 mm, 4 mm, and 2mm, respectively. Tan &#x003B4; for all polymers was measured via a frequency sweep in a Discovery HR-2 rheometer (TA instruments) and the value at 1 Hz was used as the characteristic system response in this study. FT-IR analysis was performed on 2 mm thick film specimens in a Frontier Spectrometer (PerkinElmer) in the standard wavenumber range (4000&#x02013;700 cm<sup>&#x02212;1</sup>).</p>
</sec>
<sec>
<title>Monte Carlo Modeling</title>
<p>Monte Carlo simulations of step growth polymerization were performed in R (R Core Team, <xref ref-type="bibr" rid="B23">2018</xref>). We focused on parameters which would provide insight into the crosslinking tendency of polymer chains due to polyfunctionality in either the polyisocyanates or the polyol reactants. These parameters were determined through a Monte Carlo simulation of a spatially homogeneous chemical ensemble of monomers; for these simulations we used 200,000 monomers. The algorithm performs an event-based stochastic process analogous to the approach described by Mikes and Dusek (<xref ref-type="bibr" rid="B16">1982</xref>), then repeats the stochastic process until all of the limiting reactive group are consumed, which in our simulations was the OH group. For the recipes with NCO:OH index of 1, the simulation is at this point complete. For the recipes with index 1.2 and 1.5 we further simulate for moisture cure. Moisture cure refers to the reaction of some of the remaining unreacted&#x02014;NCO react with ambient moisture to form to&#x02014;NH<sub>2</sub> with further reactions with&#x02014;NCO to form urea bonds. In these cases, once all the -OH is consumed, the appropriate amount of water is added to the ensemble and the stochastic process is continued until no more bonds can be formed. The simulation provides parameters that describe the connectivity for all monomers in the post-gel thermoset. For our analysis, we identify and quantify three type of molecular configurations, shown in <xref ref-type="fig" rid="F1">Figure 1</xref>: Sol-which are the oligomers that are not part of the infinite network formed during polymerization, Elastic link&#x02014;which are the effective links formed between the reactive groups and form part of the crosslinked core gel component and Dangler&#x02014;which are the pendant groups that are attached to the infinite network and are also known as tethered plasticizer and ineffective links. The parameters which are relevant are the effective crosslinks per kg of polymer (nEff) calculated using Miller and Macosko&#x00027;s recursive method (Miller and Macosko, <xref ref-type="bibr" rid="B17">1976</xref>), the average molecular weight of the elastic links weighed as a percent of total polymer weight (elastic_link_mtw), the average molecular weight of the core gel component weighed as a percent of total polymer weight (Mtw), the percentage of sol present in the synthesized polymer (sol_pctWgt) and the percentage of core gel component in the synthesized polymer (Core_pctWgt).</p>
<fig id="F1" position="float">
<label>Figure 1</label>
<caption><p>Schematic of sol, danglers, elastic links, and core gel components modeled over 300 monomers using Monte Carlo simulations.</p></caption>
<graphic xlink:href="fmats-06-00087-g0001.tif"/>
</fig>
</sec>
</sec>
<sec id="s3">
<title>HML Modeling</title>
<p>HML modeling was performed in Python (Rossum, <xref ref-type="bibr" rid="B24">1995</xref>) using the Scikit learn library (Pedregosa et al., <xref ref-type="bibr" rid="B21">2011</xref>) for machine learning estimators. HML modeling has been used with multiple systems now; in all of these systems, the top layer will represent a complex system response that has to be either predicted or optimized with respect to a bottom layer which consists of simple experimentally tunable variables. However, there is an intermediate middle layer which consists of physical or chemical factors parameterized from the variables in bottom layer through surrogate physical measurements and existing physical/chemical relationships pertaining to the specific material system. The algorithm can be better understood with the scheme shown in <xref ref-type="fig" rid="F2">Figure 2</xref>.</p>
<fig id="F2" position="float">
<label>Figure 2</label>
<caption><p>Schematic depicting the three layers in the Hierarchical Machine Learning approach.</p></caption>
<graphic xlink:href="fmats-06-00087-g0002.tif"/>
</fig>
<p>The bottom layer of input or experimental variables in the model for PU consisted of the repeating unit in the synthesized polymer split into chemical structural units per kg of polymer, molecular weights and densities of the diisocyanates and polyols, the NCO:OH indices and the estimated chain length of polyols, assuming a PDI of 1. The chemical structures of the diisocyanates and polyols used in the training set are depicted in <xref ref-type="fig" rid="F3">Figure 3</xref>. Each of the 18 polymers synthesized from these reactants have been categorized into a vector of structural units per kg of polymer (c, ch, ch2, ch3, c6h6, co, nh, o, nh2) calculated from the groups present in the reactants&#x02014;the diisocyanates and polyols, the NCO:OH index, chain length of polyols and the total weight of the polymer synthesized.</p>
<fig id="F3" position="float">
<label>Figure 3</label>
<caption><p>Chemical structures of reactants used in the training set <bold>(A)</bold> hexamethylene diisocyanate <bold>(B)</bold> isophorone diisocyanate <bold>(C)</bold> toluene diisocyanate <bold>(D)</bold> PTMEG 1000 and <bold>(E)</bold> polycaprolactone 900.</p></caption>
<graphic xlink:href="fmats-06-00087-g0003.tif"/>
</fig>
<p>The middle layer or the intermediate physical/chemical variables have been grouped into three categories: the first will probe into the intermolecular interactions and absorption characteristics of the synthesized polymer molecule through FT-IR spectroscopy (Siesler, <xref ref-type="bibr" rid="B28">1980</xref>). The presence of spectral features in specific regions of the spectrum is indicative of certain functional groups which have vibrational modes with large displacements and are minimally affected by the presence of other functional groups or atoms (Griffiths and de Haseth, <xref ref-type="bibr" rid="B9">2007</xref>). For our study on polyurethanes, the wavenumbers exhibited by CO and NH groups as well as the ratio of absorbance values of NH to CO groups for each sample, are of particular importance and will provide better correlation to mechanical responses. It has been observed before that the CO stretching vibration and NH stretching vibration show different wavenumbers depending on the degree of H-bonding as well as crosslinking occurring due to trifunctional hydroxyl group polyol which significantly impact mechanical properties of such polyurethanes (Tsai et al., <xref ref-type="bibr" rid="B29">1998</xref>). It is expected that a particular amount or degree of these factors, h-bonding and crosslinking, which results in the optimal mechanical response suited to a particular end-user application. The degree of H-bonding is also influenced through the symmetry of chemical structure of the reactants and presence of even/odd number of atoms (Caracciolo et al., <xref ref-type="bibr" rid="B3">2009</xref>), making these variables highly significant for model prediction. These variables have been parameterized with respect to all the bottom-layer variables using a Gaussian regression-based framework using the scikit learn&#x02014;ML library in Python (Williams and Rasmussen, <xref ref-type="bibr" rid="B33">1996</xref>; Pedregosa et al., <xref ref-type="bibr" rid="B21">2011</xref>). The FT-IR variables were then recalculated using the predict function from the Gaussian process models to be used with the top-layer variables.</p>
<p>The second set of middle layer variables consists of intermolecular chain interactions and properties pertaining to polyurethane polymer system. Hard segment (HS%) and soft segment (SS%) were easily calculated by mass of diisocyanate and polyols with respect to the total mass of the polymer. Similarly, % aromatic and % cyclic (non-aromatic) nature was calculated based on mass of respective structural units with respect to total mass of polymer. The solubility parameter and cohesive energy density (CED) was calculated using molar attraction constant, molar volume and cohesive energy of the polymer repeating unit using a group additivity-based approach on the structural units present in the bottom layer. The values for group contributions to the molar attraction constant, molar volume and cohesive energy are easily available in literature and have been extensively used before, for other polymer systems.</p>
<p>The third set of middle layer variables were the predictions of chain architecture from Monte Carlo simulations as described in the section above. Based on existing literature, these three categories of variables sufficiently model the main forces and interactions that govern the mechanical behavior of polyurethanes&#x02014;the microstructure consisting of soft and hard domains which control permanent deformation, high modulus and tensile strength, hydrogen bonding between neighboring polymer chains control the elasticity as well as strain deformation behavior whereas simulation of chemical crosslinking addresses the mechanical behavior due to network formation.</p>
<p>Finally, the top layer, which consists of system responses (stress-at-break, strain-at-break, and Tan &#x003B4;) have been modeled with the middle-layer variables using a random forest regression-based model from the scikit-ML library in Python. Random forest regression is an ensemble learning technique based on multiple decision trees learned from the provided variables. One of the advantages of a random forest model is its use of bagging or bootstrap aggregation where each decision tree is modeled on a subset of the input set but by drawing samples with replacement the subset has the same size as the original input set. Then, averaging is performed on all the decision trees to improve the prediction accuracy and to control overfitting. The number of decision trees used in our training set is equal to 100 and the max depth of trees was unrestricted since modeling was performed on a sparse dataset and are not concerned about memory consumption or computational efficiency, thus leading to better predictive power for the model. The estimator used from scikit -ML library is a &#x0201C;RandomForestRegressor&#x0201D; with the following attributes: bootstrap &#x0003D; True, criterion &#x0003D; &#x0201C;mse,&#x0201D; max_depth &#x0003D; None, max_features &#x0003D; &#x0201C;auto,&#x0201D; max_leaf_nodes &#x0003D; None, min_impurity_decrease &#x0003D; 0.0, min_impurity_split &#x0003D; None, min_samples_leaf &#x0003D; 1, min_samples_split &#x0003D; 2, min_weight_fraction_leaf &#x0003D; 0.0, n_estimators &#x0003D; 100, n_jobs &#x0003D; 4, oob_score &#x0003D; False, random_state &#x0003D; 0, verbose &#x0003D; 0, warm_start &#x0003D; False. Each of these attributes are well described at the scikit learn website for random forest regression:</p>
<p><ext-link ext-link-type="uri" xlink:href="https://scikit-learn.org/stable/modules/generated/sklearn.ensemble.RandomForestRegressor.html">https://scikit-learn.org/stable/modules/generated/sklearn.ensemble.RandomForestRegressor.html</ext-link></p>
<p>and the code is available with <xref ref-type="supplementary-material" rid="SM3">Supplementary Files</xref>.</p>
</sec>
<sec id="s4">
<title>Results and Discussion</title>
<p><xref ref-type="fig" rid="F4">Figure 4</xref> shows the mechanical responses for the training set. In general, Tan &#x003B4; and strain-at-break are expected to decrease and stress-at-break is expected to increase with increasing NCO:OH index associated with the transition from rubbery toward a glassy state (Petrovi&#x00107; et al., <xref ref-type="bibr" rid="B22">2002</xref>; Levine et al., <xref ref-type="bibr" rid="B12">2012</xref>) due to the presence of increasing urea linkages in the polymer network from moisture cure of excess NCO content. Furthermore, similar trends are expected with a general replacement of a bifunctional polyol with a higher functionality polyol, owing to crosslinking and network formation (Du&#x00161;ek and Du&#x00161;kov&#x000E1;-Smr&#x0010D;kov&#x000E1;, <xref ref-type="bibr" rid="B5">2000</xref>). Crosslinking also occurs either physically through hydrogen bonding between hard urethane segments or chemically through allophanate linkages due to excess NCO content during the polymerization reaction (Kontou et al., <xref ref-type="bibr" rid="B11">1990</xref>). We see that even though some of our samples show expected behavior, others behave differently in either different response metrics or in all of them. In <xref ref-type="fig" rid="F5">Figure 5</xref>, it can be observed that there is poor correlation between the various responses under a wide range of attained measurements. The diagonal grid represents the one-dimensional spread of values for single responses whereas the top right section represents the scatter plot correlation between response pairs and the bottom grid represents the two-dimensional spread as well as density of values for the response pairs. This suggests that there are multiple factors controlling these responses that may be competing.</p>
<fig id="F4" position="float">
<label>Figure 4</label>
<caption><p>Mechanical responses measured for the training set of 18 polymers.</p></caption>
<graphic xlink:href="fmats-06-00087-g0004.tif"/>
</fig>
<fig id="F5" position="float">
<label>Figure 5</label>
<caption><p>Lattice plot depicting the relation between various mechanical properties.</p></caption>
<graphic xlink:href="fmats-06-00087-g0005.tif"/>
</fig>
<p>In order to deconvolute the relationship between these mechanical responses and the variables in the bottom layer of the model, the middle layer variables of our algorithm were parameterized in terms of the variables in bottom layer. Here, CO wavenumber, NH wavenumber and the ratio of NH absorbance per CO absorbance were modeled with a Gaussian Process regression using a train/test split to ensure accuracy and predictive capability. The train and test scores for the three variables are shown in <xref ref-type="table" rid="T1">Table 1</xref>. The IR values were regenerated from the learned GP model to be used further in the next training step.</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p>Training and test scores for GP model between FTIR variables and bottom layer variables.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th valign="top" align="left"><bold>ML framework</bold></th>
<th valign="top" align="center" colspan="2" style="border-bottom: thin solid #000000;"><bold>CO_W</bold></th>
<th valign="top" align="center" colspan="2" style="border-bottom: thin solid #000000;"><bold>NH_W</bold></th>
<th valign="top" align="center" colspan="2" style="border-bottom: thin solid #000000;"><bold>NH_A per CO_A</bold></th>
</tr>
<tr>
<th/>
<th valign="top" align="center"><bold>Train score</bold></th>
<th valign="top" align="center"><bold>Test score</bold></th>
<th valign="top" align="center"><bold>Train score</bold></th>
<th valign="top" align="center"><bold>Test score</bold></th>
<th valign="top" align="center"><bold>Train score</bold></th>
<th valign="top" align="center"><bold>Test score</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Gaussian processes regression</td>
<td valign="top" align="center">0.89</td>
<td valign="top" align="center">0.63</td>
<td valign="top" align="center">0.99</td>
<td valign="top" align="center">0.92</td>
<td valign="top" align="center">0.97</td>
<td valign="top" align="center">0.96</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>The solubility parameter and the cohesive energy density for the polymers were calculated using group contribution methods (Van Krevelen and Te Nijenhuis, <xref ref-type="bibr" rid="B30">2009</xref>) using the equation (1) for solubility parameter</p>
<disp-formula id="E1"><label>(1)</label><mml:math id="M1"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:mo>&#x003B4;</mml:mo><mml:mo>=</mml:mo><mml:mo>&#x02211;</mml:mo><mml:msub><mml:mrow><mml:mi>F</mml:mi></mml:mrow><mml:mrow><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>/</mml:mo><mml:mo>&#x02211;</mml:mo><mml:msub><mml:mrow><mml:mi>V</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi><mml:mo>,</mml:mo><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where <italic>F</italic><sub><italic>i</italic></sub> is the molar attraction contribution and <italic>V</italic><sub><italic>m,i</italic></sub> is the molar volume contribution for the ith structural unit in the bottom layer and the equation (2) for cohesive energy density is</p>
<disp-formula id="E2"><label>(2)</label><mml:math id="M2"><mml:mtable class="eqnarray" columnalign="left"><mml:mtr><mml:mtd><mml:msub><mml:mrow><mml:mi>e</mml:mi></mml:mrow><mml:mrow><mml:mi>c</mml:mi><mml:mi>o</mml:mi><mml:mi>h</mml:mi></mml:mrow></mml:msub><mml:mo>=</mml:mo><mml:mo>&#x02211;</mml:mo><mml:msub><mml:mrow><mml:mi>E</mml:mi></mml:mrow><mml:mrow><mml:mi>c</mml:mi><mml:mi>o</mml:mi><mml:mi>h</mml:mi><mml:mo>,</mml:mo><mml:mi>i</mml:mi></mml:mrow></mml:msub><mml:mo>/</mml:mo><mml:mo>&#x02211;</mml:mo><mml:msub><mml:mrow><mml:mi>V</mml:mi></mml:mrow><mml:mrow><mml:mi>m</mml:mi><mml:mo>,</mml:mo><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:mtd></mml:mtr></mml:mtable></mml:math></disp-formula>
<p>where <italic>E</italic><sub><italic>coh, i</italic></sub> is the cohesive energy contribution for the ith structural unit in the bottom layer. HS%, SS%, nEff, elastic_link_mtw, Mtw, sol_pctWgt, and core_pctWgt were calculated and simulated as mentioned in the previous section.</p>
<p>After generating the set of middle layer variables, a random forest regression model was fitted between the latter and the mechanical responses (stress-at-break, strain-at-break, and Tan &#x003B4;), and the train/test scores for all responses are shown in <xref ref-type="table" rid="T2">Table 2</xref>. The feature importance values from the RF model are shown in <xref ref-type="fig" rid="F6">Figure 6</xref>, the predicted vs. test values are shown in <xref ref-type="fig" rid="F7">Figure 7</xref> and the ensemble averaged trees are shown in <xref ref-type="fig" rid="F8">Figure 8</xref>.</p>
<table-wrap position="float" id="T2">
<label>Table 2</label>
<caption><p>Training and test scores for HML and Random Forest modeling of mechanical properties as a function of composition.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th/>
<th valign="top" align="center" colspan="2" style="border-bottom: thin solid #000000;"><bold>Strain at break</bold></th>
<th valign="top" align="center" colspan="2" style="border-bottom: thin solid #000000;"><bold>Stress at break</bold></th>
<th valign="top" align="center" colspan="2" style="border-bottom: thin solid #000000;"><bold>Tan</bold> &#x003B4;</th>
</tr>
<tr>
<th/>
<th valign="top" align="center"><bold>Train score</bold></th>
<th valign="top" align="center"><bold>Test score</bold></th>
<th valign="top" align="center"><bold>Train score</bold></th>
<th valign="top" align="center"><bold>Test score</bold></th>
<th valign="top" align="center"><bold>Train score</bold></th>
<th valign="top" align="center"><bold>Test score</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">HML</td>
<td valign="top" align="center">0.92</td>
<td valign="top" align="center">0.94</td>
<td valign="top" align="center">0.93</td>
<td valign="top" align="center">0.92</td>
<td valign="top" align="center">0.85</td>
<td valign="top" align="center">0.85</td>
</tr>
<tr>
<td valign="top" align="left">Random forest</td>
<td valign="top" align="center">0.87</td>
<td valign="top" align="center">0.50</td>
<td valign="top" align="center">0.85</td>
<td valign="top" align="center">0.58</td>
<td valign="top" align="center">0.80</td>
<td valign="top" align="center">0.24</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><italic>HML provides significantly greater accuracy through embedding domain knowledge in the algorithm, allowing it to build predictive models from small datasets</italic>.</p>
</table-wrap-foot>
</table-wrap>
<fig id="F6" position="float">
<label>Figure 6</label>
<caption><p>Feature importance plots from the trained Random Forest model for each of the mechanical responses measured.</p></caption>
<graphic xlink:href="fmats-06-00087-g0006.tif"/>
</fig>
<fig id="F7" position="float">
<label>Figure 7</label>
<caption><p>Predicted vs. actual values for all mechanical responses from the HML model.</p></caption>
<graphic xlink:href="fmats-06-00087-g0007.tif"/>
</fig>
<fig id="F8" position="float">
<label>Figure 8</label>
<caption><p>Ensemble averaged decision tree from Random Forest model trained to predict <bold>(A)</bold> strain-at-break <bold>(B)</bold> stress-at-break, and <bold>(C)</bold> Tan &#x003B4;.</p></caption>
<graphic xlink:href="fmats-06-00087-g0008.tif"/>
</fig>
<p>Interestingly, the most important features for prediction of strain-at-break from the trained model were CO wavenumber, NH absorbance per CO absorbance, cohesive energy density, and NH wavenumber. This could be due to the vibrational shift in the CO and NH bands, which relate to the chemical environment obtained with different reactant combinations (polyols and diisocyanates) whereas the ratio of carbonyl peak absorbance to amide peak absorbance may indicate the effect of NCO:OH index. The shift in frequencies is an indication of hydrogen bonding strength between polymer chains, primarily due to interactions between CO and NH groups. Strong hydrogen bonding between the groups will make the bond within the carbonyl and NH groups weaker, for e.g., HDI and PTMEG 1000 exhibit strong hydrogen bonding, thus higher strain at break as the CO wavenumber is &#x0007E;1,683 cm<sup>&#x02212;1</sup> and NH wavenumber is &#x0007E;3,318 cm<sup>&#x02212;1</sup>. Similarly, between HDI and PCL 900, weaker hydrogen bonding reduces strain at break where the CO wavenumber is &#x0007E;1,732 cm<sup>&#x02212;1</sup> and NH wavenumber is &#x0007E;3,380 cm<sup>&#x02212;1</sup>. Cohesive energy density is a proxy for intermolecular forces within polymer chains, and as such strongly links to forces required for mechanical deformation of a polymeric material. CED is also notable as a property prediction tool for calculating relative strain at failure for similarly networked chains i.e., under presence of moderate chemical crosslinking, CED can improve toughness of chains during large strain deformation (Safranski and Gall, <xref ref-type="bibr" rid="B26">2008</xref>). Other parameters in our model such as % cyclic and Mtw (average molar weight of core gel component as a percentage of total polymer weight) also have smaller yet influencing behavior on strain.</p>
<p>Apart from the FTIR derived variables, the model for stress shows a reliance on hard segment %, core_pctWgt (percentage of core gel component) and % aromatic behavior. This makes sense as the hard segment in a polyurethane is the load bearing component under mechanical deformation and as a result, induces most of the elastic response in the system. At higher HS%, urethane&#x02014;urethane hydrogen bonding in particular is also increased. The core gel component represents the crosslinked structure in a polyurethane which again corresponds to mechanical strength and load bearing nature of the polymer. It is interesting that the model identified % aromatic behavior as an important feature: it impacts mechanical strength due to much more efficient hydrogen bonding and pi-stacking between aromatic groups in neighboring chains. Aromatic groups in isocyanates account for stiffer chains and result in a higher melting point polyurethane as well.</p>
<p>Tan &#x003B4; represents the damping behavior in the mechanical performance of a viscoelastic polymer, i.e., the ratio of plastic behavior to elastic behavior. Thus, it was not surprising to observe both hard segment % and soft segment % as important features, however % cyclic behavior has an interestingly significant impact on Tan &#x003B4;. Even though cyclic groups correspond to stiffness and rigidity, they contribute less than aromatic groups due to the possibility of configurational isomerism as well as non-planar structures. This might explain the ability of these groups to absorb more energy while mechanical stress is applied and provide a good balance between elastic and plastic performance. Other crucial features which were identified in the model were the CO wavenumber, cohesive energy density, and mer solubility parameter.</p>
<p>Each random forest model shown in <xref ref-type="fig" rid="F8">Figure 8</xref> represents an averaged decision tree from an ensemble of decision trees, for each mechanical response. The random forest classifier from Python scikit learn uses bootstrap aggregating in which multiple decision trees are modeled on subsets of training data, chosen randomly with replacement. Each predictor or feature is learned and is split for values based on a mean squared error reduction scheme, which is continued until all the data is split till the last node. Bootstrap aggregation is an excellent stochastic method of avoiding overfitting in the trained model and reduces variance in result without increasing the bias of the model. In random forests, this allows for an out-of-bag (OOB) error estimate to measure prediction error of a trained decision tree on the subset of data not used in that tree, thus negating the need of an independent validation dataset.</p>
<p>In order to evaluate the prediction efficiency on the sparseness of the training set, a standard big data approach was taken where the same random forest framework was applied on a training set containing the mechanical responses and our tunable formulation variables namely, polyol choice, diisocyanate choice, and NCO:OH index. As expected, the model converged with significantly lower test scores as shown in <xref ref-type="table" rid="T2">Table 2</xref> and the predicted responses vs. test data can be seen in <xref ref-type="fig" rid="F9">Figure 9</xref>.</p>
<fig id="F9" position="float">
<label>Figure 9</label>
<caption><p>Predicted vs. actual values for all mechanical responses from the Random Forest model, using a black-box approach.</p></caption>
<graphic xlink:href="fmats-06-00087-g0009.tif"/>
</fig>
<p>If compared with other approaches, this methodology mainly benefits from the planned surrogate physical and chemical measurements, and existing scientific literature to embed domain knowledge with statistical learning to strongly improve predictive capability. In traditional material industries, high throughput data for a single product family with specific end user application is hard to collect; it requires huge investment in time, effort, and cost. Thus, computational techniques relying on big data will not be beneficial for shortening the research and development cycle in such industries as shown earlier. Analytical approaches are better in some respects as they have physical laws and chemistry as underpinnings for property prediction however most of the approaches are highly complex to practically apply in an industry setting and highly sensitive to lack of required data/measurements. Design of experiments offers an accepted method for predicting structure-property relationships for small datasets, yet it does not provide insight into how the underlying forces interact with each other to achieve a specific system response since it is purely statistical in nature and may not accurately predict synergies between variables. HML aims to learn from the shortcomings as well as advantages of the previous mentioned approaches by utilizing and building upon the existing scientific domain expertise with much lesser measurements, providing a tool for not only property prediction but also to elucidate upon the nature of physical and chemical interactions that shape a system response.</p>
</sec>
<sec sec-type="conclusions" id="s5">
<title>Conclusion</title>
<p>Using HML algorithm, mechanical responses of a training set of polyurethanes were as a function of monomer chemistry, index, and chain architecture. The accuracy was compared against a random forest model and it was found that HML produced significantly better predictions of the test data. This was attributed to integration of an intermediate layer of variables comprising domain knowledge based physicochemical factors which significantly improved the model relating experimental formulation variables and mechanical responses of the cured elastomers. Some of the advantages of this approach are (a) the possibility of modeling categorical and qualitative responses of polyurethane products to formulation and processing variables and (b) predicting the properties of novel monomers, such as bio based materials. In future work, we intend to model such responses and also test our model by substituting polyols and diisocyanates to further investigate the predictive nature and capability of the HML algorithm on polymer systems.</p>
</sec>
<sec id="s6">
<title>Author Contributions</title>
<p>AM (CMU) conducted experiments, performed machine-learning modeling, and wrote parts of the manuscript. JT-C (Covestro, LLC) performed Monte Carlo simulations and wrote parts of the manuscript. NW (CMU) oversaw all aspects of the research and wrote parts of the manuscript.</p>
<sec>
<title>Conflict of Interest Statement</title>
<p>JT-C is employed by Covestro, LLC and NW has started a company to explore commercial applications of the algorithm described here and declares a potential conflict of interest. NW owns Ansatz AI, LLC. AM declares no competing financial interest.</p>
</sec>
</sec>
</body>
<back>
<sec sec-type="supplementary-material" id="s7">
<title>Supplementary Material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fmats.2019.00087/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fmats.2019.00087/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Table_2.XLSX" id="SM1" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table_3.DOCX" id="SM2" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Data_Sheet_1.zip" id="SM3" mimetype="application/zip" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Agrawal</surname> <given-names>A.</given-names></name> <name><surname>Choudhary</surname> <given-names>A.</given-names></name></person-group> (<year>2016</year>). <article-title>Perspective: materials informatics and big data: realization of the &#x0201C;fourth paradigm&#x0201D; of science in materials science</article-title>. <source>APL Mater.</source> <volume>4</volume>:<fpage>053208</fpage>. <pub-id pub-id-type="doi">10.1063/1.4946894</pub-id></citation></ref>
<ref id="B2">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Breiman</surname> <given-names>L.</given-names></name></person-group> (<year>2001</year>). <article-title>Random forests</article-title>. <source>Mach. Learn.</source> <volume>45</volume>, <fpage>5</fpage>&#x02013;<lpage>32</lpage>. <pub-id pub-id-type="doi">10.1023/A:1010933404324</pub-id></citation></ref>
<ref id="B3">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Caracciolo</surname> <given-names>P. C.</given-names></name> <name><surname>Buffa</surname> <given-names>F.</given-names></name> <name><surname>Abraham</surname> <given-names>G. A.</given-names></name></person-group> (<year>2009</year>). <article-title>Effect of the hard segment chemistry and structure on the thermal and mechanical properties of novel biomedical segmented poly(esterurethanes)</article-title>. <source>J. Mater. Sci. Mater. Med.</source> <volume>20</volume>, <fpage>145</fpage>&#x02013;<lpage>155</lpage>. <pub-id pub-id-type="doi">10.1007/s10856-008-3561-8</pub-id><pub-id pub-id-type="pmid">18704646</pub-id></citation></ref>
<ref id="B4">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>de Pablo</surname> <given-names>J. J.</given-names></name> <name><surname>Jones</surname> <given-names>B.</given-names></name> <name><surname>Kovacs</surname> <given-names>C. L.</given-names></name> <name><surname>Ozolins</surname> <given-names>V.</given-names></name> <name><surname>Ramirez</surname> <given-names>A. P.</given-names></name></person-group> (<year>2014</year>). <article-title>The materials genome initiative, the interplay of experiment, theory and computation</article-title>. <source>Curr. Opin. Solid State Mater. Sci.</source> <volume>18</volume>, <fpage>99</fpage>&#x02013;<lpage>117</lpage>. <pub-id pub-id-type="doi">10.1016/j.cossms.2014.02.003</pub-id></citation></ref>
<ref id="B5">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Du&#x00161;ek</surname> <given-names>K.</given-names></name> <name><surname>Du&#x00161;kov&#x000E1;-Smr&#x0010D;kov&#x000E1;</surname> <given-names>M.</given-names></name></person-group> (<year>2000</year>). <article-title>Network structure formation during crosslinking of organic coating systems</article-title>. <source>Prog. Polym. Sci.</source> <volume>25</volume>, <fpage>1215</fpage>&#x02013;<lpage>1260</lpage>. <pub-id pub-id-type="doi">10.1016/S0079-6700(00)00028-9</pub-id></citation></ref>
<ref id="B6">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Engels</surname> <given-names>H.-W.</given-names></name> <name><surname>Pirkl</surname> <given-names>H.-G.</given-names></name> <name><surname>Albers</surname> <given-names>R.</given-names></name> <name><surname>Albach</surname> <given-names>R. W.</given-names></name> <name><surname>Krause</surname> <given-names>J.</given-names></name> <name><surname>Hoffmann</surname> <given-names>A.</given-names></name> <etal/></person-group>. (<year>2013</year>). <article-title>Polyurethanes: versatile materials and sustainable problem solvers for today&#x00027;s challenges</article-title>. <source>Angew. Chemie Int.</source> <italic>Ed</italic>. <volume>52</volume>, <fpage>9422</fpage>&#x02013;<lpage>9441</lpage>. <pub-id pub-id-type="doi">10.1002/anie.201302766</pub-id><pub-id pub-id-type="pmid">23893938</pub-id></citation></ref>
<ref id="B7">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Eom</surname> <given-names>K.</given-names></name> <name><surname>Li</surname> <given-names>P.-C.</given-names></name> <name><surname>Makarov</surname> <given-names>D. E.</given-names></name> <name><surname>Rodin</surname> <given-names>G. J.</given-names></name></person-group> (<year>2003</year>). <article-title>Relationship between the mechanical properties and topology of cross-linked polymer molecules: parallel strands maximize the strength of model polymers and protein domains</article-title>. <source>J. Phys. Chem.</source> B <volume>107</volume>, <fpage>8730</fpage>&#x02013;<lpage>8733</lpage>. <pub-id pub-id-type="doi">10.1021/jp035178x</pub-id></citation></ref>
<ref id="B8">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Foreman</surname> <given-names>J. P.</given-names></name> <name><surname>Porter</surname> <given-names>D.</given-names></name> <name><surname>Behzadi</surname> <given-names>S.</given-names></name> <name><surname>Jones</surname> <given-names>F. R.</given-names></name></person-group> (<year>2008</year>). <article-title>A model for the prediction of structure-property relations in cross-linked polymers</article-title>. <source>Polymer</source> <volume>49</volume>, <fpage>5588</fpage>&#x02013;<lpage>5595</lpage>. <pub-id pub-id-type="doi">10.1016/j.polymer.2008.09.034</pub-id></citation></ref>
<ref id="B9">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Griffiths</surname> <given-names>P. R.</given-names></name> <name><surname>de Haseth</surname> <given-names>J. A.</given-names></name></person-group> (<year>2007</year>). <source>Fourier Transform Infrared Spectrometry</source>. <publisher-loc>Hoboken, NJ</publisher-loc>: <publisher-name>John Wiley &#x00026; Sons, Inc</publisher-name>.</citation></ref>
<ref id="B10">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kim</surname> <given-names>C.</given-names></name> <name><surname>Chandrasekaran</surname> <given-names>A.</given-names></name> <name><surname>Huan</surname> <given-names>T. D.</given-names></name> <name><surname>Das</surname> <given-names>D.</given-names></name> <name><surname>Ramprasad</surname> <given-names>R.</given-names></name></person-group> (<year>2018</year>). <article-title>Polymer genome: a data-powered polymer informatics platform for property predictions</article-title>. <source>J. Phys. Chem</source>. C <volume>122</volume>, <fpage>17575</fpage>&#x02013;<lpage>17585</lpage>. <pub-id pub-id-type="doi">10.1021/acs.jpcc.8b02913</pub-id></citation></ref>
<ref id="B11">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kontou</surname> <given-names>E.</given-names></name> <name><surname>Spathis</surname> <given-names>G.</given-names></name> <name><surname>Niaounakis</surname> <given-names>M.</given-names></name> <name><surname>Kefalas</surname> <given-names>V.</given-names></name></person-group> (<year>1990</year>). <article-title>Physical and chemical cross-linking effects in polyurethane elastomers</article-title>. <source>Colloid Polym. Sci.</source> <volume>268</volume>, <fpage>636</fpage>&#x02013;<lpage>644</lpage>. <pub-id pub-id-type="doi">10.1007/BF01410405</pub-id></citation></ref>
<ref id="B12">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Levine</surname> <given-names>F.</given-names></name> <name><surname>Escarsega</surname> <given-names>J.</given-names></name> <name><surname>La Scala</surname> <given-names>J.</given-names></name></person-group> (<year>2012</year>). <article-title>Effect of isocyanate to hydroxyl index on the properties of clear polyurethane films</article-title>. <source>Prog. Org. Coatings</source> <volume>74</volume>, <fpage>572</fpage>&#x02013;<lpage>581</lpage>. <pub-id pub-id-type="doi">10.1016/j.porgcoat.2012.02.004</pub-id></citation></ref>
<ref id="B13">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Li</surname> <given-names>Y.</given-names></name> <name><surname>Tang</surname> <given-names>S.</given-names></name> <name><surname>Abberton</surname> <given-names>B. C.</given-names></name> <name><surname>Kr&#x000F6;ger</surname> <given-names>M.</given-names></name> <name><surname>Burkhart</surname> <given-names>C.</given-names></name> <name><surname>Jiang</surname> <given-names>B.</given-names></name> <etal/></person-group>. (<year>2012</year>). <article-title>A predictive multiscale computational framework for viscoelastic properties of linear polymers</article-title>. <source>Polymer</source> <volume>53</volume>, <fpage>5935</fpage>&#x02013;<lpage>5952</lpage>. <pub-id pub-id-type="doi">10.1016/j.polymer.2012.09.055</pub-id></citation></ref>
<ref id="B14">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Menon</surname> <given-names>A.</given-names></name> <name><surname>Childs</surname> <given-names>C. M.</given-names></name> <name><surname>Pocz&#x000F3;s</surname> <given-names>B.</given-names></name> <name><surname>Washburn</surname> <given-names>N. R.</given-names></name> <name><surname>Kurtis</surname> <given-names>K. E.</given-names></name></person-group> (<year>2018</year>). <article-title>Molecular engineering of superplasticizers for metakaolin-portland cement blends with hierarchical machine learning</article-title>. <source>Adv. Theory Simulations</source> <volume>2</volume>:<fpage>1800164</fpage>. <pub-id pub-id-type="doi">10.1002/adts.201800164</pub-id></citation></ref>
<ref id="B15">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Menon</surname> <given-names>A.</given-names></name> <name><surname>Gupta</surname> <given-names>C.</given-names></name> <name><surname>Perkins</surname> <given-names>K. M.</given-names></name> <name><surname>DeCost</surname> <given-names>B. L.</given-names></name> <name><surname>Budwal</surname> <given-names>N.</given-names></name> <name><surname>Rios</surname> <given-names>R. T.</given-names></name> <etal/></person-group>. (<year>2017</year>). <article-title>Elucidating multi-physics interactions in suspensions for the design of polymeric dispersants: a hierarchical machine learning approach</article-title>. <source>Mol. Syst. Des. Eng.</source> <volume>2</volume>, <fpage>263</fpage>&#x02013;<lpage>273</lpage>. <pub-id pub-id-type="doi">10.1039/C7ME00027H</pub-id></citation></ref>
<ref id="B16">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mikes</surname> <given-names>J.</given-names></name> <name><surname>Dusek</surname> <given-names>K.</given-names></name></person-group> (<year>1982</year>). <article-title>Simulation of polymer network formation by the Monte Carlo method</article-title>. <source>Macromolecules</source> <volume>15</volume>, <fpage>93</fpage>&#x02013;<lpage>99</lpage>. <pub-id pub-id-type="doi">10.1021/ma00229a018</pub-id></citation></ref>
<ref id="B17">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Miller</surname> <given-names>D. R.</given-names></name> <name><surname>Macosko</surname> <given-names>C. W.</given-names></name></person-group> (<year>1976</year>). <article-title>A New Derivation of Post Gel Properties of Network Polymers</article-title>. <source>Macromolecules</source> <volume>9</volume>, <fpage>206</fpage>&#x02013;<lpage>211</lpage>. <pub-id pub-id-type="doi">10.1021/ma60050a004</pub-id></citation></ref>
<ref id="B18">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Milner</surname> <given-names>S. T.</given-names></name> <name><surname>McLeish</surname> <given-names>T. C. B.</given-names></name></person-group> (<year>1998</year>). <article-title>Reptation and contour-length fluctuations in melts of linear polymers</article-title>. <source>Phys. Rev. Lett.</source> <volume>81</volume>, <fpage>725</fpage>&#x02013;<lpage>728</lpage>. <pub-id pub-id-type="doi">10.1103/PhysRevLett.81.725</pub-id></citation></ref>
<ref id="B19">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Oertel</surname> <given-names>G.</given-names></name></person-group> (<year>1994</year>). <source>Polyurethane Handbook: Chemistry- Raw Materials-Processing-Application-Properties. Second Edition</source>. <publisher-loc>New York, NY</publisher-loc>: <publisher-name>Hanser Publishers</publisher-name>.</citation></ref>
<ref id="B20">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pattamaprom</surname> <given-names>C.</given-names></name> <name><surname>Larson</surname> <given-names>R. G.</given-names></name> <name><surname>Van Dyke</surname> <given-names>T. J.</given-names></name></person-group> (<year>2000</year>). <article-title>Quantitative predictions of linear viscoelastic rheological properties of entangled polymers</article-title>. <source>Rheol. Acta</source> <volume>39</volume>, <fpage>517</fpage>&#x02013;<lpage>531</lpage>. <pub-id pub-id-type="doi">10.1007/s003970000104</pub-id></citation></ref>
<ref id="B21">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pedregosa</surname> <given-names>F.</given-names></name> <name><surname>Varoquaux</surname> <given-names>G.</given-names></name> <name><surname>Gramfort</surname> <given-names>A.</given-names></name> <name><surname>Michel</surname> <given-names>V.</given-names></name> <name><surname>Thirion</surname> <given-names>B.</given-names></name> <name><surname>Grisel</surname> <given-names>O.</given-names></name> <etal/></person-group>. (<year>2011</year>). <article-title>Scikit-learn: machine learning in {p}ython</article-title>. <source>J. Mach. Learn. Res.</source> <volume>12</volume>, <fpage>2825</fpage>&#x02013;<lpage>2830</lpage>.</citation></ref>
<ref id="B22">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Petrovi&#x00107;</surname> <given-names>Z. S.</given-names></name> <name><surname>Zhang</surname> <given-names>W.</given-names></name> <name><surname>Zlatani&#x00107;</surname> <given-names>A.</given-names></name> <name><surname>Lava</surname> <given-names>C. C.</given-names></name> <name><surname>Ilavsk&#x000FD;</surname> <given-names>M.</given-names></name></person-group> (<year>2002</year>). <article-title>Effect of OH/NCO molar ratio on properties of soy-based polyurethane networks</article-title>. <source>J. Polym. Environ.</source> <volume>10</volume>, <fpage>5</fpage>&#x02013;<lpage>12</lpage>. <pub-id pub-id-type="doi">10.1023/A:1021009821007</pub-id></citation></ref>
<ref id="B23">
<citation citation-type="web"><person-group person-group-type="author"><collab>R Core Team</collab></person-group> (<year>2018</year>). <source>R: A Language and Environment for Statistical Computing</source>. Available online at: <ext-link ext-link-type="uri" xlink:href="http://www.r-project.org/">http://www.r-project.org/</ext-link> (accessed February 1, 2019).<pub-id pub-id-type="pmid">30628467</pub-id></citation></ref>
<ref id="B24">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Rossum</surname> <given-names>G.</given-names></name></person-group> (<year>1995</year>). <source>Python Reference Manual</source>. <publisher-loc>Amsterdam</publisher-loc>: <publisher-name>CWI (Centre for Mathematics and Computer Science)</publisher-name>.</citation></ref>
<ref id="B25">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Roy</surname> <given-names>N. K.</given-names></name> <name><surname>Potter</surname> <given-names>W. D.</given-names></name> <name><surname>Landau</surname> <given-names>D. P.</given-names></name></person-group> (<year>2006</year>). <article-title>Polymer property prediction and optimization using neural networks</article-title>. <source>IEEE Trans. Neural Netw.</source> <volume>17</volume>, <fpage>1001</fpage>&#x02013;<lpage>1014</lpage>. <pub-id pub-id-type="doi">10.1109/TNN.2006.875981</pub-id><pub-id pub-id-type="pmid">16856662</pub-id></citation></ref>
<ref id="B26">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Safranski</surname> <given-names>D. L.</given-names></name> <name><surname>Gall</surname> <given-names>K.</given-names></name></person-group> (<year>2008</year>). <article-title>Effect of chemical structure and crosslinking density on the thermo-mechanical properties and toughness of (meth)acrylate shape memory polymer networks</article-title>. <source>Polymer</source> <volume>49</volume>, <fpage>4446</fpage>&#x02013;<lpage>4455</lpage>. <pub-id pub-id-type="doi">10.1016/j.polymer.2008.07.060</pub-id></citation></ref>
<ref id="B27">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shenogina</surname> <given-names>N. B.</given-names></name> <name><surname>Tsige</surname> <given-names>M.</given-names></name> <name><surname>Patnaik</surname> <given-names>S. S.</given-names></name> <name><surname>Mukhopadhyay</surname> <given-names>S. M.</given-names></name></person-group> (<year>2012</year>). <article-title>Molecular modeling approach to prediction of thermo-mechanical behavior of thermoset polymer networks</article-title>. <source>Macromolecules</source> <volume>45</volume>, <fpage>5307</fpage>&#x02013;<lpage>5315</lpage>. <pub-id pub-id-type="doi">10.1021/ma3007587</pub-id></citation></ref>
<ref id="B28">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Siesler</surname> <given-names>H. W.</given-names></name></person-group> (<year>1980</year>). <article-title>Fourier transform infrared (ftir) spectroscopy in polymer research</article-title>. <source>J. Mol. Struct.</source> <volume>59</volume>, <fpage>15</fpage>&#x02013;<lpage>37</lpage>. <pub-id pub-id-type="doi">10.1016/0022-2860(80)85063-0</pub-id></citation></ref>
<ref id="B29">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tsai</surname> <given-names>Y.-M.</given-names></name> <name><surname>Yu</surname> <given-names>T.-L.</given-names></name> <name><surname>Tseng</surname> <given-names>Y.-H.</given-names></name></person-group> (<year>1998</year>). <article-title>Physical properties of crosslinked polyurethane</article-title>. <source>Polym. Int.</source> <volume>47</volume>, <fpage>445</fpage>&#x02013;<lpage>450</lpage>. <pub-id pub-id-type="doi">10.1002/(SICI)1097-0126(199812)47:4&#x0003C;445::AID-PI82&#x0003E;3.0.CO;2-B</pub-id></citation></ref>
<ref id="B30">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Van Krevelen</surname> <given-names>D. W.</given-names></name> <name><surname>Te Nijenhuis</surname> <given-names>K.</given-names></name></person-group> (<year>2009</year>). <source>Properties of Polymers</source>. <publisher-loc>Amsterdam</publisher-loc>: <publisher-name>Elsevier</publisher-name>. <pub-id pub-id-type="doi">10.1016/B978-0-08-054819-7.X0001-5</pub-id></citation></ref>
<ref id="B31">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>van Ruymbeke</surname> <given-names>E.</given-names></name> <name><surname>Keunings</surname> <given-names>R.</given-names></name> <name><surname>Bailly</surname> <given-names>C.</given-names></name></person-group> (<year>2005</year>). <article-title>Prediction of linear viscoelastic properties for polydisperse mixtures of entangled star and linear polymers: modified tube-based model and comparison with experimental results</article-title>. <source>J. Nonnewton. Fluid Mech.</source> <volume>128</volume>, <fpage>7</fpage>&#x02013;<lpage>22</lpage>. <pub-id pub-id-type="doi">10.1016/j.jnnfm.2005.01.006</pub-id></citation></ref>
<ref id="B32">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>van Ruymbeke</surname> <given-names>E.</given-names></name> <name><surname>Keunings</surname> <given-names>R.</given-names></name> <name><surname>St&#x000E9;phenne</surname> <given-names>V.</given-names></name> <name><surname>Hagenaars</surname> <given-names>A.</given-names></name> <name><surname>Bailly</surname> <given-names>C.</given-names></name></person-group> (<year>2002</year>). <article-title>Evaluation of reptation models for predicting the linear viscoelastic properties of entangled linear polymers</article-title>. <source>Macromolecules</source> <volume>35</volume>, <fpage>2689</fpage>&#x02013;<lpage>2699</lpage>. <pub-id pub-id-type="doi">10.1021/ma011271c</pub-id></citation></ref>
<ref id="B33">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Williams</surname> <given-names>C. K. I.</given-names></name> <name><surname>Rasmussen</surname> <given-names>C. E.</given-names></name></person-group> (<year>1996</year>). <article-title>Gaussian processes for regression</article-title>. <source>Adv. Neural Inf. Process. Syst.</source> <volume>8</volume>, <fpage>515</fpage>&#x02013;<lpage>520</lpage>.</citation></ref>
</ref-list>
</back>
</article>